ERIC Educational Resources Information Center
Green, Samuel B.; Yang, Yanyun
2015-01-01
In the lead article, Davenport, Davison, Liou, & Love demonstrate the relationship among homogeneity, internal consistency, and coefficient alpha, and also distinguish among them. These distinctions are important because too often coefficient alpha--a reliability coefficient--is interpreted as an index of homogeneity or internal consistency.…
Ponterotto, Joseph G; Ruckdeschel, Daniel E
2007-12-01
The present article addresses issues in reliability assessment that are often neglected in psychological research such as acceptable levels of internal consistency for research purposes, factors affecting the magnitude of coefficient alpha (alpha), and considerations for interpreting alpha within the research context. A new reliability matrix anchored in classical test theory is introduced to help researchers judge adequacy of internal consistency coefficients with research measures. Guidelines and cautions in applying the matrix are provided.
Delimiting Coefficient a from Internal Consistency and Unidimensionality
ERIC Educational Resources Information Center
Sijtsma, Klaas
2015-01-01
I discuss the contribution by Davenport, Davison, Liou, & Love (2015) in which they relate reliability represented by coefficient a to formal definitions of internal consistency and unidimensionality, both proposed by Cronbach (1951). I argue that coefficient a is a lower bound to reliability and that concepts of internal consistency and…
Lee, Chin-Pang; Chiu, Yu-Wen; Chu, Chun-Lin; Chen, Yu; Jiang, Kun-Hao; Chen, Jiun-Liang; Chen, Ching-Yen
2016-12-01
The aging males' symptoms (AMS) scale is an instrument used to determine the health-related quality of life in adult and elderly men. The purpose of this study was to synthesize internal consistency (Cronbach's alpha) and test-retest reliability for the AMS scale and its three subscales. Of the 123 studies reviewed, 12 provided alpha coefficients which were then used in the meta-analyses of internal consistency. Seven of the 12 included studies provided test-retest coefficients, and these were used in the meta-analyses of test-retest reliability. The AMS scale had excellent internal consistency [α = 0.89 (95% CI 0.88-0.90)]; the mean alpha estimates across the AMS subscales ranged from 0.79 to 0.82. The AMS scale also had good test-retest reliability [r = 0.85 (95% CI 0.82-0.88]; the test-retest reliability coefficients of the AMS subscales ranged from 0.76 to 0.83. There was significant heterogeneity among the included studies. The AMS scale and the three subscales had fairly good internal consistency and test-retest reliability. Future psychometric studies of the AMS scale should report important characteristics of the participants, details of item scores, and test-retest reliability.
Assessment of the Maximal Split-Half Coefficient to Estimate Reliability
ERIC Educational Resources Information Center
Thompson, Barry L.; Green, Samuel B.; Yang, Yanyun
2010-01-01
The maximal split-half coefficient is computed by calculating all possible split-half reliability estimates for a scale and then choosing the maximal value as the reliability estimate. Osburn compared the maximal split-half coefficient with 10 other internal consistency estimates of reliability and concluded that it yielded the most consistently…
Coefficient Alpha and Reliability of Scale Scores
ERIC Educational Resources Information Center
Almehrizi, Rashid S.
2013-01-01
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (a; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Use of Internal Consistency Coefficients for Estimating Reliability of Experimental Tasks Scores
Green, Samuel B.; Yang, Yanyun; Alt, Mary; Brinkley, Shara; Gray, Shelley; Hogan, Tiffany; Cowan, Nelson
2017-01-01
Reliabilities of scores for experimental tasks are likely to differ from one study to another to the extent that the task stimuli change, the number of trials varies, the type of individuals taking the task changes, the administration conditions are altered, or the focal task variable differs. Given reliabilities vary as a function of the design of these tasks and the characteristics of the individuals taking them, making inferences about the reliability of scores in an ongoing study based on reliability estimates from prior studies is precarious. Thus, it would be advantageous to estimate reliability based on data from the ongoing study. We argue that internal consistency estimates of reliability are underutilized for experimental task data and in many applications could provide this information using a single administration of a task. We discuss different methods for computing internal consistency estimates with a generalized coefficient alpha and the conditions under which these estimates are accurate. We illustrate use of these coefficients using data for three different tasks. PMID:26546100
[The reliability of a questionnaire regarding Colombian children's physical activity].
Herazo-Beltrán, Aliz Y; Domínguez-Anaya, Regina
2012-10-01
Reporting the Physical Activity Questionnaire for school children's (PAQ-C) test-retest reliability and internal consistency. This was a descriptive study of 100 school-aged children aged 9 to 11 years old attending a school in Cartagena, Colombia. The sample was randomly selected. The PAQ-C was given twice, one week apart, after the informed consent forms had been signing by the children's parents and school officials. Cronbach's alpha coefficient of reliability was used for assessing internal consistency and an intra-class correlation coefficient for test-retest reliability SPSS (version 17.0) was used for statistical analysis. The questionnaire scored 0.73 internal consistencies during the first measurement and 0.78 on the second; intra-class correlation coefficient was 0.60. There were differences between boys and girls regarding both measurements. The PAQ-C had acceptable internal consistency and test-retest reliability, thereby making it useful for measuring children's self-reported physical activity and a valuable tool for population studies in Colombia.
Xiao, Yuan-mei; Wang, Zhi-ming; Wang, Mian-zhen; Lan, Ya-jia
2005-06-01
To test the reliability and validity of two mental workload assessment scales, i.e. subjective workload assessment technique (SWAT) and NASA task load index (NASA-TLX). One thousand two hundred and sixty-eight mental workers were sampled from various kinds of occupations, such as scientific research, education, administration and medicine, etc, with randomized cluster sampling. The re-test reliability, split-half reliability, Cronbach's alpha coefficient and correlation coefficients between item score and total score were adopted to test the reliability. The test of validity included structure validity. The re-test reliability coefficients of these two scales and their items were ranged from 0.516 to 0.753 (P < 0.01), indicating the two scales had good re-test reliability; the split-half reliability of SWAT was 0.645, and its Cronbach's alpha coefficient was more than 0.80, all the correlation coefficients between its items score and total score were more than 0.70; as for NASA-TLX, both the split-half reliability and Cronbach's alpha coefficient were more than 0.80, the correlation coefficients between its items score and total score were all more than 0.60 (P < 0.01) except the item of performance. Both scales had good inner consistency. The Pearson correlation coefficient between the two scales was 0.492 (P < 0.01), implying the results of the two scales had good consistency. Factor analysis showed that the two scales had good structure validity. Both SWAT and NASA-TLX have good reliability and validity and may be used as a valid tool to assess mental workload in China after being revised properly.
Psychometric Inferences from a Meta-Analysis of Reliability and Internal Consistency Coefficients
ERIC Educational Resources Information Center
Botella, Juan; Suero, Manuel; Gambara, Hilda
2010-01-01
A meta-analysis of the reliability of the scores from a specific test, also called reliability generalization, allows the quantitative synthesis of its properties from a set of studies. It is usually assumed that part of the variation in the reliability coefficients is due to some unknown and implicit mechanism that restricts and biases the…
Reliability of a structured interview for admission to an emergency medicine residency program.
Blouin, Danielle
2010-10-01
Interviews are most important in resident selection. Structured interviews are more reliable than unstructured ones. We sought to measure the interrater reliability of a newly designed structured interview during the selection process to an Emergency Medicine residency program. The critical incident technique was used to extract the desired dimensions of performance. The interview tool consisted of 7 clinical scenarios and 1 global rating. Three trained interviewers marked each candidate on all scenarios without discussing candidates' responses. Interitem consistency and estimates of variance were computed. Twenty-eight candidates were interviewed. The generalizability coefficient was 0.67. Removing the central tendency ratings increased the coefficient to 0.74. Coefficients of interitem consistency ranged from 0.64 to 0.74. The structured interview tool provided good although suboptimal interrater reliability. Increasing the number of scenarios improves reliability as does applying differential weights to the rating scale anchors. The latter would also facilitate the identification of those candidates with extreme ratings.
Reliability of the ecSatter Inventory as a tool to measure eating competence.
Stotts, Jodi L; Lohse, Barbara
2007-01-01
To examine the reliability of the ecSatter Inventory (ecSI), a measure of eating competence. Self-report questionnaires were administered in person or by mail. Retesting occurred 2 to 6 weeks after completion of the first questionnaire. Both administrations of the questionnaire were completed by 259 participants who were mostly food secure, white females with some college education; mean age was 26.9 +/- 10.4 years. Test-retest reliability and internal consistency. Spearman's rank correlation coefficients to estimate test-retest reliability and Cronbach alpha coefficients to estimate internal consistency. Spearman's rank correlation coefficient for ecSI total score was 0.68; subscale coefficients were 0.70 for eating attitudes, 0.70 for contextual skills, 0.65 for food acceptance, and 0.52 for internal regulation. Cronbach alpha coefficient for ecSI total score was 0.77. Subscale alphas coefficients were 0.80 for eating attitudes, 0.69 for contextual skills, 0.68 for food acceptance, and 0.66 for internal regulation. This study provides psychometric evidence about the reliability of ecSI as a measure of eating competence in this sample. Although some ecSI items may require revision, results suggest that the instrument may be used to evaluate nutrition education designed to improve eating competence.
Yalin Sapmaz, Şermin; Ergin, Dilek; Özek Erkuran, Handan; Şen Celasin, Nesrin; Öztürk, Masum; Karaarslan, Duygu; Köroğlu, Ertuğrul; Aydemir, Ömer
2017-09-01
This study assessed the validity and reliability of the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form for use among the Turkish population. The study group consisted of 30 patients that had been treated in a child psychiatry unit and diagnosed with posttraumatic stress disorder and 83 healthy volunteers that were attending middle or high school during the study period. For reliability analyses, the internal consistency coefficient and the test-retest correlation coefficient were measured. For validity analyses, the exploratory factor analysis and correlation analysis with the Child Posttraumatic Stress Reaction Index for concurrent validity were measured. The Cronbach's alpha (the internal consistency coefficient) of the scale was 0.909, and the test-retest correlation coefficient was 0.663. One factor that could explain 58.5% of the variance was obtained and was congruent with the original construct of the scale. As for concurrent validity, the scale showed high correlation with the Child Posttraumatic Stress Reaction Index. It was concluded that the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form can be used as a valid and reliable tool.
Charter, Richard A
2005-12-01
Confidence intervals are provided for the validity coefficients calculated by Veazey, et al. for the M-FAST. Two coefficients alpha are also presented along with suggestions for different approaches to calculating the M-FAST internal consistency reliability.
ERIC Educational Resources Information Center
Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U.
2015-01-01
This article uses definitions provided by Cronbach in his seminal paper for coefficient a to show the concepts of reliability, dimensionality, and internal consistency are distinct but interrelated. The article begins with a critique of the definition of reliability and then explores mathematical properties of Cronbach's a. Internal consistency…
The Bahasa Melayu version of the Nursing Stress Scale among nurses: a reliability study in Malaysia.
Rosnawati, Muhamad Robat; Moe, Htay; Masilamani, Retneswari; Darus, A
2010-10-01
The Nursing Stress Scale (NSS) has been shown to be a valid and reliable instrument to assess occupational stressors among nurses. The NSS, which was previously used in the English version, was translated and back-translated into Bahasa Melayu. This study was conducted to assess the reliability of the Bahasa Melayu version of the NSS among nurses for future studies in this country. The reliability of the NSS was assessed after its readministration to 30 nurses with a 2-week interval. The Spearman coefficient was calculated to assess its stability. The internal consistency was measured through 4 measures: Cronbach's α, Spearman-Brown, Guttman split-half, and standardized item α coefficients. The total response rate was 70%. Test-retest reliability showed remarkable stability (Spearman's ρ exceeded .70). All 4 measures of internal consistency among items indicated a satisfactory level (coefficients in the range of .68 to .87). In conclusion, the Bahasa Melayu version of the NSS is a reliable and useful instrument for measuring the possible stressors at the workplace among nurses.
[Estimators of internal consistency in health research: the use of the alpha coefficient].
da Silva, Franciele Cascaes; Gonçalves, Elizandra; Arancibia, Beatriz Angélica Valdivia; Bento, Gisele Graziele; Castro, Thiago Luis da Silva; Hernandez, Salma Stephany Soleman; da Silva, Rudney
2015-01-01
Academic production has increased in the area of health, increasingly demanding high quality in publications of great impact. One of the ways to consider quality is through methods that increase the consistency of data analysis, such as reliability which, depending on the type of data, can be evaluated by different coefficients, especially the alpha coefficient. Based on this, the present review systematically gathers scientific articles produced in the last five years, which in a methodological manner gave the α coefficient psychometric use as an estimator of internal consistency and reliability in the processes of construction, adaptation and validation of instruments. The identification of the studies was conducted systematically in the databases BioMed Central Journals, Web of Science, Wiley Online Library, Medline, SciELO, Scopus, Journals@Ovid, BMJ and Springer, using inclusion and exclusion criteria. Data analyses were performed by means of triangulation, content analysis and descriptive analysis. It was found that most studies were conducted in Iran (f=3), Spain (f=2) and Brazil (f=2). These studies aimed to test the psychometric properties of instruments, with eight studies using the α coefficient to assess reliability and nine for assessing internal consistency. All studies were classified as methodological research when their objectives were analyzed. In addition, four studies were also classified as correlational and one as descriptive-correlational. It can be concluded that though the α coefficient is widely used as one of the main parameters for assessing internal consistency of questionnaires in health sciences, its use as an estimator of trust of the methodology used and internal consistency has some critiques that should be considered.
Reliability and validity of the Chinese pediatric voice handicap index.
Liu, Kena; Liu, Shaofeng; Zhou, Zhou; Ren, Qinyi; Zhong, Jie; Luo, Renzhong; Qin, Huabiao; Zhang, Siyi; Ge, Pingjiang
2018-02-01
To evaluate the reliability and validity of the Chinese version of pediatric voice handicap index (pVHI). The original English version-pVHI was translated into Chinese. Parents of 52 children with voice dysphonia and 43 children with no history or symptoms of voice problems were asked to fill the Chinese pVHI questionnaires twice with an interval of 2 weeks. GRB (Grade, Roughness, Breathiness) scale was used for perceptual assessment by two otolaryngologists and one speech pathologist for each child's voice. The internal consistency was assessed using Cronbach's alpha coefficient. Pearson's correlation coefficient was used to evaluate the test-retest reliability. The Kendall's coefficient of concordance W was used to assess the consistency of GRB scores of 3 voice specialists. The nonparametric Mann-Whitney test was used to assess the differences between the dysphonia group and controls. The correlation between pVHI and GRB scores were assessed using Pearson's correlation coefficient. The internal consistency of total score and three subscales scores of Chinese pVHI were 0.788-0.944. The test-retest reliability was 0.631-0.887(P < .001). The pVHI scores of control group significantly were lower than the pathological group (P = .000). The GRB scores of 3 voice specialists have an excellent consistency (W = 0.694-0.807, P = .000). The pVHI scores positively correlated with GRB assessment (P < .01). The Chinese version of pVHI had a good reliability and validity. It can be applicable and useful supplementary tool for evaluating parents' perception of their children's dysphonia. Copyright © 2017. Published by Elsevier B.V.
Study samples are too small to produce sufficiently precise reliability coefficients.
Charter, Richard A
2003-04-01
In a survey of journal articles, test manuals, and test critique books, the author found that a mean sample size (N) of 260 participants had been used for reliability studies on 742 tests. The distribution was skewed because the median sample size for the total sample was only 90. The median sample sizes for the internal consistency, retest, and interjudge reliabilities were 182, 64, and 36, respectively. The author presented sample size statistics for the various internal consistency methods and types of tests. In general, the author found that the sample sizes that were used in the internal consistency studies were too small to produce sufficiently precise reliability coefficients, which in turn could cause imprecise estimates of examinee true-score confidence intervals. The results also suggest that larger sample sizes have been used in the last decade compared with those that were used in earlier decades.
Rosa-Rizzotto, M; Visonà Dalla Pozza, L; Corlatti, A; Luparia, A; Marchi, A; Molteni, F; Facchin, P; Pagliano, E; Fedrizzi, E
2014-10-01
In hemiplegic children, the recognition of the activity limitation pattern and the possibility of grading its severity are relevant for clinicians while planning interventions, monitoring results, predicting outcomes. Aim of the study is to examine the reliability and validity of Besta Scale, an instrument used to measure in hemiplegic children from 18 months to 12 years of age both grasp on request (capacity) and spontaneous use of upper limb (performance) in bimanual play activities and in ADL. Psychometric analysis of reliability and of validity of the Besta scale was performed. Outpatient study sample Reliability study: A sample of 39 patients was enrolled. The administration of Besta scale was video-recorded in a standardized manner. All videos were scored by 20 independent raters on subsequent viewing. 3 raters randomly selected from the 20-raters group rescored the same video two years later for intra-rater reliability. Intra and inter-rater reliability were calculated using Intraclass Correlation Coefficient (ICC) and Kendall's coefficient (K), respectively. Internal consistency reliability was assessed using Alpha's Chronbach coefficient. Validity study: a sample of 105 children was assessed 5 times (at t0 and 2, 3, 6 and 12 months later) by 20 independent raters. Each patient underwent at the same time to QUEST and Besta scale administration and assessment. Criterion validity was calculated using rho-Pearson coefficient. Reliability study: The inter-rater reliability calculated with Kendall's coefficient resulted moderate K=0.47. The intra-rater (or test-retest) reliability for 3 raters was excellent (ICC=0.927). The Cronbach's alpha for internal consistency was 0.972. Validity study: Besta scale showed a good criterion validity compared to QUEST increasing by age and severity of impairment. Rho Pearson's correlation coefficient r was 0.81 (P<0.0001). Limitations. Besta scales in infants finds hard to distinguish between mild to moderately impaired hand function. Besta scale scoring system is a valid and reliable tool, utilizable in a clinical setting to monitor evolution of unimanual and bimanual manipulation and to distinguish hand's capacity from performance.
Reliability Generalization of Scores on the Spielberger State-Trait Anxiety Inventory.
ERIC Educational Resources Information Center
Barnes, Laura L. B.; Harp, Diane; Jung, Woo Sik
2002-01-01
Conducted a reliability generalization study for the State-Trait Anxiety Inventory (C. Spielberger, 1983) by reviewing and classifying 816 research articles. Average reliability coefficients were acceptable for both internal consistency and test-retest reliability, but variation was present among the estimates. Other differences are discussed.…
Moore, Amy Lawson; Miller, Terissa M
2018-01-01
The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
ERIC Educational Resources Information Center
Green, Carlton E.; Chen, Cynthia E.; Helms, Janet E.; Henze, Kevin T.
2011-01-01
Helms, Henze, Sass, and Mifsud (2006) defined good practices for internal consistency reporting, interpretation, and analysis consistent with an alpha-as-data perspective. Their viewpoint (a) expands on previous arguments that reliability coefficients are group-level summary statistics of samples' responses rather than stable properties of scales…
2011-01-01
Background Although measures of knowledge translation and exchange (KTE) effectiveness based on the theory of planned behavior (TPB) have been used among patients and providers, no measure has been developed for use among health system policymakers and stakeholders. A tool that measures the intention to use research evidence in policymaking could assist researchers in evaluating the effectiveness of KTE strategies that aim to support evidence-informed health system decision-making. Therefore, we developed a 15-item tool to measure four TPB constructs (intention, attitude, subjective norm and perceived control) and assessed its face validity through key informant interviews. Methods We carried out a reliability study to assess the tool's internal consistency and test-retest reliability. Our study sample consisted of 62 policymakers and stakeholders that participated in deliberative dialogues. We assessed internal consistency using Cronbach's alpha and generalizability (G) coefficients, and we assessed test-retest reliability by calculating Pearson correlation coefficients (r) and G coefficients for each construct and the tool overall. Results The internal consistency of items within each construct was good with alpha ranging from 0.68 to alpha = 0.89. G-coefficients were lower for a single administration (G = 0.34 to G = 0.73) than for the average of two administrations (G = 0.79 to G = 0.89). Test-retest reliability coefficients for the constructs ranged from r = 0.26 to r = 0.77 and from G = 0.31 to G = 0.62 for a single administration, and from G = 0.47 to G = 0.86 for the average of two administrations. Test-retest reliability of the tool using G theory was moderate (G = 0.5) when we generalized across a single observation, but became strong (G = 0.9) when we averaged across both administrations. Conclusion This study provides preliminary evidence for the reliability of a tool that can be used to measure TPB constructs in relation to research use in policymaking. Our findings suggest that the tool should be administered on more than one occasion when the intervention promotes an initial 'spike' in enthusiasm for using research evidence (as it seemed to do in this case with deliberative dialogues). The findings from this study will be used to modify the tool and inform further psychometric testing following different KTE interventions. PMID:21702956
Influences on and Limitations of Classical Test Theory Reliability Estimates.
ERIC Educational Resources Information Center
Arnold, Margery E.
It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…
Adaptation of the Arabic Version of the Tinnitus Handicap Inventory.
Barake, Rana; Rizk, Samer Abou; Ziade, Georges; Zaytoun, George; Bassim, Marc
2016-03-01
To translate the Tinnitus Handicap Inventory (THI) into literary Arabic to come up with a unified Arabic version and to determine its validity and reliability in assessing the quality of life of Arabic-speaking patients with tinnitus. Clinical measurement study. Tertiary care center. The original English THI was translated into literary Arabic by a forward- and back-translation process according to the published guidelines for cross-cultural adaptation of health-related quality-of-life measures and applied to 100 patients with chronic tinnitus. Internal consistency reliability was then assessed by calculating Cronbach's alpha coefficient. Pearson correlation coefficients were also calculated for the different scales and the different baseline characteristics. Results showed high internal consistency and reliability coefficients (total THI: 0.93, functional subscale: 0.86, emotional subscale: 0.86, catastrophic subscale: 0.66) comparable to those of the original English THI. The Arabic version of the THI is a valid and reliable tool for the assessment of the impact of tinnitus on the quality of life of Arabic-speaking patients with the complaint of chronic tinnitus. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2015.
Reliability and validity of television food advertising questionnaire in Malaysia.
Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa
2015-09-01
Interventions to counter the influence of television food advertising amongst children are important. Thus, reliable and valid instrument to assess its effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice on 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. Test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). Kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42) with the Scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid to assess the effect of media literacy education on television food advertising on schoolchildren. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Miki, Emi; Yamane, Shingo; Yamaoka, Mai; Fujii, Hiroe; Ueno, Hiroka; Kawahara, Toshie; Tanaka, Keiko; Tamashiro, Hiroaki; Inoue, Eiji; Okamoto, Takatsugu; Kuriyama, Masaru
2016-09-01
The study aim was to investigate the validity and reliability of the Functional Independence Measure and Functional Assessment Measure (FIM + FAM), which is unfamiliar in Japan, by using its Japanese version (FIM + FAM-j) in patients with cerebrovascular accident (CVA). Forty-two CVA patients participated. Criterion validity was examined by correlating the full scale and subscales of FIM + FAM-j with several well-established measurements using Spearman's correlation coefficient. Reliability was evaluated by internal consistency (tested by Cronbach's alpha coefficient) and intra-rater reliability (tested by Kendall's tau correlation coefficient). Good-to-excellent criterion validity was found between the full scale and motor subscales of the FIM + FAM-j and the Barthel Index, National Institutes of Health Stroke Scale, modified Rankin Scale, and lower extremity Brunnstrom Recovery Stage. High internal consistency was observed within the full-scale FIM + FAM-j and the motor and cognitive subscales (Cronbach's alphas were 0.968, 0.954, and 0.948, respectively). Additionally, good intra-rater reliability was observed within the full scale and motor subscales, and excellent reliability for the cognitive subscales (taus were 0.83, 0.80, and 0.98, respectively). This study showed that the FIM + FAM-j demonstrated acceptable levels of validity and reliability when used for CVA as a measure of disability.
Salyers, M P; McHugo, G J; Cook, J A; Razzano, L A; Drake, R E; Mueser, K T
2001-09-01
Reliability of well-known instruments was examined in 202 people with severe mental illness participating in a multisite vocational study. We examined interrater reliability of the Positive and Negative Syndrome Scale (PANSS) and the internal consistency and test-retest reliability of the PANSS, the Rosenberg Self-Esteem Scale, the Medical Outcomes Study Short Form-36 (SF-36), and the Quality of Life Interview. Most scales had good levels of reliability, with intraclass correlation coefficients (ICCs) and coefficient alphas above .70. However, the SF-36 scales were generally less stable over time, particularly Social Functioning (ICC = .55). Test-retest reliability was lower among less educated respondents and among ethnic minorities. We recommend close monitoring of psychometric issues in future multisite studies.
[Developing Perceived Competence Scale (PCS) for Adolescents].
Özer, Arif; Gençtanirim Kurt, Dilek; Kizildağ, Seval; Demırtaş Zorbaz, Selen; Arici Şahın, Fatma; Acar, Tülin; Ergene, Tuncay
2016-01-01
In this study, Perceived Competence Scale was developed to measure high school students' perceived competence. Scale development process was verified on three different samples. Participants of the research are some high school students in 2011-2012 academic terms from Ankara. Participants' numbers are incorporated in exploratory factor analysis, confirmatory factor analysis and test-retest reliability respectively, as follows: 372, 668 and 75. Internal consistency coefficients (Cronbach's and stratified α) are calculated separately for each group. For data analysis Factor 8.02 and LISREL 8.70 package programs were used. According to results of the analyses, internal consistency coefficients (α) are .90 - .93 for academic competence, .82 - .86 for social competence in the samples that exploratory and confirmatory factor analysis performed. For the whole scale internal consistency coefficient (stratified α) is calculated as .91. As a result of test-retest reliability, adjusted correlation coefficients (r) are .94 for social competence and .90 for academic competence. In addition, to fit indexes and regression weights obtained from factor analysis, findings related convergent and discriminant validity, indicating that competence can be addressed in two dimensions which are academic (16 items) and social (14 items).
Charalambous, A; Molassiotis, A
2017-01-01
The Short Form Chronic Respiratory Questionnaire (SF-CRQ) is frequently used in patients with obstructive pulmonary disease and it has demonstrated excellent psychometric properties. Since there is no psychometric information for its use with lung cancer patients, this study explored its validity and reliability in this population. Forty-six patients were assessed at two time points (with a 4-week interval) using the SF-CRQ, the modified Borg Scale, five numerical rating scales related to Perceived Severity of Breathlessness, and the Hospital Anxiety and Depression Scale. Internal consistency reliability was investigated by Cronbach's alpha reliability coefficient, test-retest reliability by Spearman-Brown reliability coefficient (P), content validity as well as convergent validity by Pearson's correlation coefficient between the SF-CRQ, and the conceptual similar scales mentioned above were explored. A principal component factor analysis was performed. The internal consistency was high [α = 0.88 (baseline) and 0.91 (after 1 month)]. The SF-CRQ had good stability with test-retest reliability ranging from r = 0.64 to 0.78, P < 0.001. Factor analysis suggests a single construct in this population. The preliminary data analyses supported the convergent, content, and construct validity of the SF-CRQ providing promising evidence that this can be a valid and reliable instrument for the assessment of quality of life related to breathlessness in lung cancer patients. © 2015 John Wiley & Sons Ltd.
ERIC Educational Resources Information Center
Helms, LuAnn Sherbeck
This paper discusses the fact that reliability is about scores and not tests and how reliability limits effect sizes. The paper also explores the classical reliability coefficients of stability, equivalence, and internal consistency. Stability is concerned with how stable test scores will be over time, while equivalence addresses the relationship…
Reliability generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R).
Herrington, Hayley M; Smith, Timothy B; Feinauer, Erika; Griner, Derek
2016-10-01
[Correction Notice: An Erratum for this article was reported in Vol 63(5) of Journal of Counseling Psychology (see record 2016-33161-001). The name of author Erika Feinauer was misspelled as Erika Feinhauer. All versions of this article have been corrected.] Individuals' strength of ethnic identity has been linked with multiple positive indicators, including academic achievement and overall psychological well-being. The measure researchers use most often to assess ethnic identity, the Multigroup Ethnic Identity Measure (MEIM), underwent substantial revision in 2007. To inform scholars investigating ethnic identity, we performed a reliability generalization analysis on data from the revised version (MEIM-R) and compared it with data from the original MEIM. Random-effects weighted models evaluated internal consistency coefficients (Cronbach's alpha). Reliability coefficients for the MEIM-R averaged α = .88 across 37 samples, a statistically significant increase over the average of α = .84 for the MEIM across 75 studies. Reliability coefficients for the MEIM-R did not differ across study and participant characteristics such as sample gender and ethnic composition. However, consistently lower reliability coefficients averaging α = .81 were found among participants with low levels of education, suggesting that greater attention to data reliability is warranted when evaluating the ethnic identity of individuals such as middle-school students. Future research will be needed to ascertain whether data with other measures of aspects of personal identity (e.g., racial identity, gender identity) also differ as a function of participant level of education and associated cognitive or maturation processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
2016-10-01
Reports an error in "Reliability Generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R)" by Hayley M. Herrington, Timothy B. Smith, Erika Feinauer and Derek Griner ( Journal of Counseling Psychology , Advanced Online Publication, Mar 17, 2016, np). The name of author Erika Feinauer was misspelled as Erika Feinhauer. All versions of this article have been corrected. (The following abstract of the original article appeared in record 2016-13160-001.) Individuals' strength of ethnic identity has been linked with multiple positive indicators, including academic achievement and overall psychological well-being. The measure researchers use most often to assess ethnic identity, the Multigroup Ethnic Identity Measure (MEIM), underwent substantial revision in 2007. To inform scholars investigating ethnic identity, we performed a reliability generalization analysis on data from the revised version (MEIM-R) and compared it with data from the original MEIM. Random-effects weighted models evaluated internal consistency coefficients (Cronbach's alpha). Reliability coefficients for the MEIM-R averaged α = .88 across 37 samples, a statistically significant increase over the average of α = .84 for the MEIM across 75 studies. Reliability coefficients for the MEIM-R did not differ across study and participant characteristics such as sample gender and ethnic composition. However, consistently lower reliability coefficients averaging α = .81 were found among participants with low levels of education, suggesting that greater attention to data reliability is warranted when evaluating the ethnic identity of individuals such as middle-school students. Future research will be needed to ascertain whether data with other measures of aspects of personal identity (e.g., racial identity, gender identity) also differ as a function of participant level of education and associated cognitive or maturation processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Developing a Danish version of the "Impact on Participation and Autonomy Questionnaire".
Ghaziani, Emma; Krogh, Anne Grethe; Lund, Hans
2013-05-01
To translate the "Impact on Participation and Autonomy Questionnaire" into Danish (IPAQ-DK), and estimate its internal consistency and test-retest reliability in order to promote participation-based interventions and research. Translation and two successive reliability assessments through test-retest. 137 adults with varying degrees of impairment; of these, 67 participated in the final reliability assessment. The translation followed guidelines set forth by the "European Group for Quality of Life Assessment and Health Measurement". Internal consistency for subscales was estimated by Chronbach's alpha. Weighted kappa coefficients and intraclass correlation coefficients were calculated to assess the test-retest reliability at item and subscale level, respectively. A preliminary reliability assessment revealed residual issues regarding the translation and cultural adaptation of the instrument. The revised version (IPAQ-DK) was subsequently subjected to a similar assessment demonstrating Chronbach's alpha values from 0.698 to 0.817. Weighted kappa ranged from 0.370 to 0.880; 78% of these values were higher than 0.600. The intraclass correlation coefficient covered values from 0.701 to 0.818. IPAQ-DK is a useful instrument for identifying person-perceived participation restrictions and satisfaction with participation. Further studies of IPAQ-DK's floor/ceiling effects and responsiveness to change are recommended, and whether there is a need for further linguistic improvement of certain items.
Shisslak, C M; Renger, R; Sharpe, T; Crago, M; McKnight, K M; Gray, N; Bryson, S; Estes, L S; Parnaby, O G; Killen, J; Taylor, C B
1999-03-01
To describe the development, test-retest reliability, internal consistency, and convergent validity of the McKnight Risk Factor Survey-III (MRFS-III). The MRFS-III was designed to assess a number of potential risk and protective factors for the development of disordered eating in preadolescent and adolescent girls. Several versions of the MRFS were pilot tested before the MRFS-III was administered to a sample of 651 4th through 12th- grade girls to establish its psychometric properties. Most of the test-retest reliability coefficients of individual items on the MRFS-III were r > .40. Alpha coefficients for each risk and protective factor domain on the MRFS-III were also computed. The majority of these coefficients were r > .60. High convergent validity coefficients were obtained for specific items on the MRFS-III and measures of self-esteem (Rosenberg Self-Esteem Scale) and weight concerns (Weight Concerns Scale). The test-retest reliability, internal consistency, and convergent validity of the MRFS-III suggest that it is a useful new instrument to assess potential risk and protective factors for the development of disordered eating in preadolescent and adolescent girls.
Ban, Ilija; Troelsen, Anders; Kristensen, Morten Tange
2016-10-01
The Constant score (CS) has been the primary endpoint in most studies on clavicle fractures. However, the CS was not developed to assess patients with clavicle fractures. Our aim was to examine inter-rater reliability and agreement of the CS in patients with clavicle fractures. The secondary aim was to estimate the correlation between the CS and the Disabilities of the Arm, Shoulder and Hand score and the internal consistency of the 2 scores. On the basis of sample sizing, 36 patients (31 male and 5 female patients; mean age, 41.3 years) with clavicle fractures underwent standardized CS assessment at a mean of 6.8 weeks (SD, 1.0 weeks) after injury. Reliability and agreement of the CS were determined by 2 raters. The interclass correlation coefficient (ICC2,1), standard error of measurement, minimal detectable change, Cronbach α coefficient, and Pearson correlation coefficient were estimated. Inter-rater reliability of the total CS was excellent (interclass correlation coefficient, 0.94; 95% confidence interval, 0.88-0.97), with no systematic difference between the 2 raters (P = .75). The standard error of measurement (measurement error at the group level) was 4.9, whereas the minimal detectable change (smallest change needed to indicate a real change for an individual) was 13.6 CS points. The internal consistency of the 10 CS items was good, with a Cronbach α of .85, and we found a strong correlation (r = -0.92) between the CS and Disabilities of the Arm, Shoulder and Hand score. The CS was found to be reliable for assessing patients with clavicle fractures, especially at the group level. With high inter-rater reliability and agreement, in addition to good internal consistency, the standardized CS used in this study can be used for comparison of results from different settings. Copyright © 2016 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Is Coefficient Alpha Robust to Non-Normal Data?
Sheng, Yanyan; Sheng, Zhaohui
2011-01-01
Coefficient alpha has been a widely used measure by which internal consistency reliability is assessed. In addition to essential tau-equivalence and uncorrelated errors, normality has been noted as another important assumption for alpha. Earlier work on evaluating this assumption considered either exclusively non-normal error score distributions, or limited conditions. In view of this and the availability of advanced methods for generating univariate non-normal data, Monte Carlo simulations were conducted to show that non-normal distributions for true or error scores do create problems for using alpha to estimate the internal consistency reliability. The sample coefficient alpha is affected by leptokurtic true score distributions, or skewed and/or kurtotic error score distributions. Increased sample sizes, not test lengths, help improve the accuracy, bias, or precision of using it with non-normal data. PMID:22363306
Development of the Anxiety Scale for Natural Disaster: Examination of its Reliability
NASA Astrophysics Data System (ADS)
Matsumoto, Miki; Yatabe, Ryuichi
The objective of present study was to develop the a nxiety scale for natural disaster, and to examineits reliability. We developed the 14 items for the anxiety scale based on anticipated damage of Nankai earthquake in Ehime prefecture. The subjects consist of 391 people in Yawatahama city, Ehime prefecture. Firstly, we analyzed the latent factors which influenced the anxiety for natural disaster by using the factor analysis method. Secondly, we cal culated Cronbach's coefficient alpha. The result of the factor analysis confirmed the three factors such as "anxiety for lifeline damage", "anxiety for second ary disaster" and "fear for others". Cronbach's coefficient alpha for each factor showed the high interna l consistency reliability. We considered that each factor could prove to be a valuable tool for researc h about the person's anxiety for natural disaster.
Koritar, Priscila; Philippi, Sonia Tucunduva; Alvarenga, Marle dos Santos; Santos, Bernardo dos
2014-08-01
The scope of this study was to show the cross-cultural adaptation and validation of the Health and Taste Attitude Scale in Portuguese. The methodology included translation of the scale; evaluation of conceptual, operational and item-based equivalence by 14 experts and 51 female undergraduates; semantic equivalence and measurement assessment by 12 bilingual women by the paired t-test, the Pearson correlation coefficient and the coefficient intraclass correlation; internal consistency and test-retest reliability by Cronbach's alpha and intraclass correlation coefficient, respectively, after application on 216 female undergraduates; assessment of discriminant and concurrent validity via the t-test and Spearman's correlation coefficient, respectively, in addition to Confirmatory Factor and Exploratory Factor Analysis. The scale was considered adequate and easily understood by the experts and university students and presented good internal consistency and reliability (µ 0.86, ICC 0.84). The results show that the scale is valid and can be used in studies with women to better understand attitudes related to taste.
ERIC Educational Resources Information Center
Schumacker, Randall E.; Smith, Everett V., Jr.
2007-01-01
Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…
Reliability of the Test of Integrated Language and Literacy Skills (TILLS).
Mailend, Marja-Liisa; Plante, Elena; Anderson, Michele A; Applegate, E Brooks; Nelson, Nickola W
2016-07-01
As new standardized tests become commercially available, it is critical that clinicians have access to the information about a test's psychometric properties, including aspects of reliability. The purpose of the three studies reported in this article was to investigate the reliability of a new test, the Test of Integrated Language and Literacy Skills (TILLS), with consideration of both internal and external sources of measurement error. The TILLS was administered to children aged 6;0-18;11 years. The participants varied in terms of their language and literacy skills and included children with typical language development as well as those diagnosed with language or learning disability. The sample of children also varied in terms of their racial and socioeconomic backgrounds. Study 1 (N = 1056) assessed the internal consistency of TILLS calculating the coefficient omega for each subtest. Study 2 (N = 103) and Study 3 (N = 39) used the intra-class correlation coefficients to report on test-retest and inter-rater reliability respectively. The results indicate strong internal consistency and inter-rater reliability for all subtests of TILLS. The test-retest reliability was strong for all but one subtest, for which the intra-class correlation coefficient was in the acceptable range. This article provides clinicians with essential scientific information that supports the internal and external reliability of a new test of oral and written language skills, the TILLS. Information about reliability is critical for guiding the selection of an appropriate diagnostic tool amongst a number of options. © 2016 Royal College of Speech and Language Therapists.
Health Service Quality Scale: Brazilian Portuguese translation, reliability and validity.
Rocha, Luiz Roberto Martins; Veiga, Daniela Francescato; e Oliveira, Paulo Rocha; Song, Elaine Horibe; Ferreira, Lydia Masako
2013-01-17
The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson's correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach's alpha coefficient; the intraclass (ICC) and Pearson's correlation coefficients were used for test-retest reliability. One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson's correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson's correlation coefficient was 0.89 and ICC was 0.90. The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality.
Nakano, Hideki; Kodama, Takayuki; Ukai, Kazumasa; Kawahara, Satoru; Horikawa, Shiori; Murata, Shin
2018-01-01
In this study, we aimed to (1) translate the English version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ), which assesses motor imagery ability, into Japanese, and (2) investigate the reliability and validity of the Japanese KVIQ. We enrolled 28 healthy adults in this study. We used Cronbach’s alpha coefficients to assess reliability reflected by the internal consistency. Additionally, we assessed validity reflected by the criterion-related validity between the Japanese KVIQ and the Japanese version of the Movement Imagery Questionnaire-Revised (MIQ-R) with Spearman’s rank correlation coefficients. The Cronbach’s alpha coefficients for the KVIQ-20 were 0.88 (Visual) and 0.91 (Kinesthetic), which indicates high reliability. There was a significant positive correlation between the Japanese KVIQ-20 (Total) and the Japanese MIQ-R (Total) (r = 0.86, p < 0.01). Our results suggest that the Japanese KVIQ is an assessment that is a reliable and valid index of motor imagery ability. PMID:29724042
Nakano, Hideki; Kodama, Takayuki; Ukai, Kazumasa; Kawahara, Satoru; Horikawa, Shiori; Murata, Shin
2018-05-02
In this study, we aimed to (1) translate the English version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ), which assesses motor imagery ability, into Japanese, and (2) investigate the reliability and validity of the Japanese KVIQ. We enrolled 28 healthy adults in this study. We used Cronbach’s alpha coefficients to assess reliability reflected by the internal consistency. Additionally, we assessed validity reflected by the criterion-related validity between the Japanese KVIQ and the Japanese version of the Movement Imagery Questionnaire-Revised (MIQ-R) with Spearman’s rank correlation coefficients. The Cronbach’s alpha coefficients for the KVIQ-20 were 0.88 (Visual) and 0.91 (Kinesthetic), which indicates high reliability. There was a significant positive correlation between the Japanese KVIQ-20 (Total) and the Japanese MIQ-R (Total) (r = 0.86, p < 0.01). Our results suggest that the Japanese KVIQ is an assessment that is a reliable and valid index of motor imagery ability.
A Reliability Generalization Meta-Analysis of Coefficient Alpha for the Maslach Burnout Inventory
ERIC Educational Resources Information Center
Wheeler, Denna L.; Vassar, Matt; Worley, Jody A.; Barnes, Laura L. B.
2011-01-01
The purpose of this study was to synthesize internal consistency reliability for the subscale scores on the Maslach Burnout Inventory (MBI). The authors addressed three research questions: (a) What is the mean subscale score reliability for the MBI across studies? (b) What factors are associated with observed variance in MBI subscale score…
ERIC Educational Resources Information Center
Green, Samuel B.; Yang, Yanyun
2009-01-01
A method is presented for estimating reliability using structural equation modeling (SEM) that allows for nonlinearity between factors and item scores. Assuming the focus is on consistency of summed item scores, this method for estimating reliability is preferred to those based on linear SEM models and to the most commonly reported estimate of…
Sun, Wei; Chou, Chih-Ping; Stacy, Alan W; Ma, Huiyan; Unger, Jennifer; Gallaher, Peggy
2007-02-01
Cronbach's a is widely used in social science research to estimate the internal consistency of reliability of a measurement scale. However, when items are not strictly parallel, the Cronbach's a coefficient provides a lower-bound estimate of true reliability, and this estimate may be further biased downward when items are dichotomous. The estimation of standardized Cronbach's a for a scale with dichotomous items can be improved by using the upper bound of coefficient phi. SAS and SPSS macros have been developed in this article to obtain standardized Cronbach's a via this method. The simulation analysis showed that Cronbach's a from upper-bound phi might be appropriate for estimating the real reliability when standardized Cronbach's a is problematic.
Wu, Y Z; Wang, W J; Feng, N P; Chen, B; Li, G C; Liu, J W; Liu, H L; Yang, Y Y
2016-07-06
To evaluate the validity, reliability, and acceptability of the brief version of the self-management knowledge, attitude, and behavior (KAB) assessment scale for diabetes patients. Diabetes patients who were managed at the Xinkaipu Community Health Service Center of Tianxin in Changsha, Hunan Province were selected for survey by cluster sampling. A total of 350 diabetes patients were surveyed using the brief scale to collect data on knowledge, attitudes, and behaviors of self-management. Content validity was evaluated by Pearson correlation coefficient between the brief scale and subscales of knowledge, attitude, and behavior. Structure validity was evaluated by factor analysis, and discrimination validity was evaluated by an independent sample t-test between the high-score and low-score groups. Reliability was tested by internal consistency reliability and split-half reliability. The evaluation indexes of internal consistency reliability were Cronbach's α coefficients, θ coefficient, and Ω coefficient. Acceptability was evaluated by valid response rate and completion time of the brief scale. A total of 346(98.9%) valid questionnaires were returned, with average survey time of (11.43±3.4) minutes. Average score of the brief scale was 78.85 ± 11.22; scores of the knowledge, attitude, and behavior subscales were 16.45 ± 4.42, 21.33 ± 2.03, and 41.07 ± 8.34, respectively. Pearson correlation coefficients between the brief scale and the knowledge, attitude, and behavior subscales were 0.92, 0.42, and 0.60, respectively; P-values were all less than 0.01, indicating that the face validity and content validity of the brief scale were achieved to a good level. The common factor cumulative variance contribution rate of the brief scale and three subscales was from 53.66% to 61.75%, which achieved more than 50% of the approved standard. There were 11 common factors; 41 of the total 42 items had factor loadings above 0.40 in their relevant common factor, indicating that the brief scale and three subscales had good construct validity. Patients were divided into a high-score group and a low-score group, then scores of the brief scale and three subscales were compared between the groups using a t-test. The results were all significant, indicating that the brief scale and three subscales had good discriminate validity. Mean scores of the brief scale and three subscales of the high-score group were 91.55±6.81, 19.51±2.17, 22.74±1.88, and 49.30±6.20, respectively; these were higher than the low-score group (65.89±5.79, 12.29±4.76, 20.22±1.88, and 33.39±6.17, respectively) with t-values 27.76, 13.31, 9.20, and 17.56 (P-values were less than 0.001). The Cronbach's α coefficient, θ coefficient, Ω coefficient, and split-half reliability of the brief scale were 0.83, 0.87, 0.96, and 0.84, respectively. These values for the three subscales were all above 0.70, except for the θ coefficient of the attitude subscale with 0.64, indicating that the brief scale and three subscales had acceptable internal consistency reliability. The brief version of the diabetes self-management knowledge, attitude, and behavior assessment scale showed good acceptability, validity, and reliability, to responsibly evaluate self-management KAB among patients with diabetes.
González-Ortiz, Ailema Janeth; Arce-Santander, Celene Viridiana; Vega-Vega, Olynka; Correa-Rotter, Ricardo; Espinosa-Cuevas, María de Los Angeles
2014-10-04
The protein-energy wasting syndrome (PEW) is a condition of malnutrition, inflammation, anorexia and wasting of body reserves resulting from inflammatory and non-inflammatory conditions in patients with chronic kidney disease (CKD).One way of assessing PEW, extensively described in the literature, is using the Malnutrition Inflammation Score (MIS). To assess the reliability and consistency of MIS for diagnosis of PEW in Mexican adults with CKD on hemodialysis (HD). Study of diagnostic tests. A sample of 45 adults with CKD on HD were analyzed during the period June-July 2014.The instrument was applied on 2 occasions; the test-retest reliability was calculated using the Intraclass Correlation Coefficient (ICC); the internal consistency of the questionnaire was analyzed using Cronbach's αcoefficient. A weighted Kappa test was used to estimate the validity of the instrument; the result was subsequently compared with the Bilbrey nutritional index (BNI). The reliability of the questionnaires, evaluated in the patient sample, was ICC=0.829.The agreement between MIS observations was considered adequate, k= 0.585 (p <0.001); when comparing it with BNI, a value of k = 0.114 was obtained (p <0.001).In order to estimate the tendency, a correlation test was performed. The r² correlation coefficient was 0.488 (P <0.001). MIS has adequate reliability and validity for diagnosing PEW in the population with chronic kidney disease on HD. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Bayesian Meta-Analysis of Coefficient Alpha
ERIC Educational Resources Information Center
Brannick, Michael T.; Zhang, Nanhua
2013-01-01
The current paper describes and illustrates a Bayesian approach to the meta-analysis of coefficient alpha. Alpha is the most commonly used estimate of the reliability or consistency (freedom from measurement error) for educational and psychological measures. The conventional approach to meta-analysis uses inverse variance weights to combine…
Reliability and validity of a questionnaire for self-assessment of complete dentures.
Komagamine, Yuriko; Kanazawa, Manabu; Kaiba, Yoshinori; Sato, Yusuke; Minakuchi, Shunsuke
2014-05-02
Demand for complete denture treatment is expected to rise over several decades. However, to date, no questionnaire on complete dentures, as evaluated by edentulous patients, has been shown to be reliable and valid. This study sought to assess the reliability and validity of Patient's Denture Assessment (PDA), which provides a multidimensional evaluation of dentures among edentulous patients. Patients, who had new complete dentures fabricated at the University Hospital of Dentistry, Tokyo Medical and Dental University through 2009 to 2010, were enrolled. The reliability of the PDA was determined by examining internal consistency and test-retest reliability. Internal consistency for all of the question items and the six subscales was measured using Cronbach's α and average inter-item correlation coefficients among 93 participants. For 33 of these participants, test-retest reliability was determined at a 2 month-interval using the interclass correlation coefficients (ICCs) and 95% confidence interval for the summary scores and the six subscale scores. The PDA was validated in 93 participants by examining the difference in the summary score and the six subscale scores of the PDA before and after replacement with new dentures by the paired t-test. Ability to detect change was also tested in 93 patients using effect size. The Cronbach's α for the PDA ranged from 0.56 to 0.93. The average inter-item correlation coefficients ranged from 0.28 to 0.83. ICCs for the PDA ranged from 0.37 to 0.83. The paired t-test showed a significant difference between the summary score and the six subscale scores before and after replacement with new dentures (p < 0.05) and the effect size was 0.97. The PDA demonstrated good reliability by assessing internal consistency and test-retest reliability. In addition, the PDA demonstrated good validity by assessing discriminant validity. Thus, the PDA could help dentists obtain a detailed understanding of the patients' perceptions in using their dentures.
Papadopoulou, Soultana L.; Exarchakos, Georgios; Christodoulou, Dimitrios; Theodorou, Stavroula; Beris, Alexandre; Ploumis, Avraam
2016-01-01
Introduction The Ohkuma questionnaire is a validated screening tool originally used to detect dysphagia among patients hospitalized in Japanese nursing facilities. Objective The purpose of this study is to evaluate the reliability and validity of the adapted Greek version of the Ohkuma questionnaire. Methods Following the steps for cross-cultural adaptation, we delivered the validated Ohkuma questionnaire to 70 patients (53 men, 17 women) who were either suffering from dysphagia or not. All of them completed the questionnaire a second time within a month. For all of them, we performed a bedside and VFSS study of dysphagia and asked participants to undergo a second VFSS screening, with the exception of nine individuals. Statistical analysis included measurement of internal consistency with Cronbach's α coefficient, reliability with Cohen's Kappa, Pearson's correlation coefficient and construct validity with categorical components, and One-Way Anova test. Results According to Cronbach's α coefficient (0.976) for total score, there was high internal consistency for the Ohkuma Dysphagia questionnaire. Test-retest reliability (Cohen's Kappa) ranged from 0.586 to 1.00, exhibiting acceptable stability. We also estimated the Pearson's correlation coefficient for the test-retest total score, which reached high levels (0.952; p = 0.000). The One-Way Anova test in the two measurement times showed statistically significant correlation in both measurements (p = 0.02 and p = 0.016). Conclusion The adapted Greek version of the questionnaire is valid and reliable and can be used for the screening of dysphagia in the Greek-speaking patients. PMID:28050209
Papadopoulou, Soultana L; Exarchakos, Georgios; Christodoulou, Dimitrios; Theodorou, Stavroula; Beris, Alexandre; Ploumis, Avraam
2017-01-01
Introduction The Ohkuma questionnaire is a validated screening tool originally used to detect dysphagia among patients hospitalized in Japanese nursing facilities. Objective The purpose of this study is to evaluate the reliability and validity of the adapted Greek version of the Ohkuma questionnaire. Methods Following the steps for cross-cultural adaptation, we delivered the validated Ohkuma questionnaire to 70 patients (53 men, 17 women) who were either suffering from dysphagia or not. All of them completed the questionnaire a second time within a month. For all of them, we performed a bedside and VFSS study of dysphagia and asked participants to undergo a second VFSS screening, with the exception of nine individuals. Statistical analysis included measurement of internal consistency with Cronbach's α coefficient, reliability with Cohen's Kappa, Pearson's correlation coefficient and construct validity with categorical components, and One-Way Anova test. Results According to Cronbach's α coefficient (0.976) for total score, there was high internal consistency for the Ohkuma Dysphagia questionnaire. Test-retest reliability (Cohen's Kappa) ranged from 0.586 to 1.00, exhibiting acceptable stability. We also estimated the Pearson's correlation coefficient for the test-retest total score, which reached high levels (0.952; p = 0.000). The One-Way Anova test in the two measurement times showed statistically significant correlation in both measurements ( p = 0.02 and p = 0.016). Conclusion The adapted Greek version of the questionnaire is valid and reliable and can be used for the screening of dysphagia in the Greek-speaking patients.
Maizura, Husna; Masilamani, Retneswari; Aris, Tahir
2009-04-01
This small, cross-sectional study assessed the reliability of 3 scales from the Job Content Questionnaire (JCQ)-decision latitude, psychological job demand, and social support-in a group of office workers in a multinational company in Kuala Lumpur. A universal sample of 30 white-collar workers from a department of the company self-administered the English version of the JCQ comprising 21 core items selected from the full recommended version of 49 items on-site. Reliability (internal consistency) was evaluated using Cronbach's alpha coefficients for each scale. Corrected item-total correlation was presented for each and every item. Cronbach's alpha coefficients were acceptable for decision latitude (.76) and social support (.79) but slightly lower for psychological job demand (.64). Values for all item-total correlations for all 3 scales were greater than .3. In conclusion, this study suggests that the JCQ is a reliable scale for assessing job stress in this group of workers.
Myer, Gregory D; Wordeman, Samuel C; Sugimoto, Dai; Bates, Nathaniel A; Roewer, Benjamin D; Medina McKeon, Jennifer M; DiCesare, Christopher A; Di Stasi, Stephanie L; Barber Foss, Kim D; Thomas, Staci M; Hewett, Timothy E
2014-05-01
Multi-center collaborations provide a powerful alternative to overcome the inherent limitations to single-center investigations. Specifically, multi-center projects can support large-scale prospective, longitudinal studies that investigate relatively uncommon outcomes, such as anterior cruciate ligament injury. This project was conceived to assess within- and between-center reliability of an affordable, clinical nomogram utilizing two-dimensional video methods to screen for risk of knee injury. The authors hypothesized that the two-dimensional screening methods would provide good-to-excellent reliability within and between institutions for assessment of frontal and sagittal plane biomechanics. Nineteen female, high school athletes participated. Two-dimensional video kinematics of the lower extremity during a drop vertical jump task were collected on all 19 study participants at each of the three facilities. Within-center and between-center reliability were assessed with intra- and inter-class correlation coefficients. Within-center reliability of the clinical nomogram variables was consistently excellent, but between-center reliability was fair-to-good. Within-center intra-class correlation coefficient for all nomogram variables combined was 0.98, while combined between-center inter-class correlation coefficient was 0.63. Injury risk screening protocols were reliable within and repeatable between centers. These results demonstrate the feasibility of multi-site biomechanical studies and establish a framework for further dissemination of injury risk screening algorithms. Specifically, multi-center studies may allow for further validation and optimization of two-dimensional video screening tools. 2b.
Validation of the MISSCARE-BRASIL survey - A tool to assess missed nursing care.
Siqueira, Lillian Dias Castilho; Caliri, Maria Helena Larcher; Haas, Vanderlei José; Kalisch, Beatrice; Dantas, Rosana Aparecida Spadoti
2017-12-21
to analyze the metric validity and reliability properties of the MISSCARE-BRASIL survey. methodological research conducted by assessing construct validity and reliability via confirmatory factor analysis, known-groups validation, convergent construct validation, analysis of internal consistency and test-retest reliability. The sample consisted of 330 nursing professionals, of whom 86 participated in the retest phase. of the 330 participants, 39.7% were aides, 33% technicians, 20.9% nurses, and 6.4% nurses with administrative roles. Confirmatory factorial analysis demonstrated that the Brazilian Portuguese version of the instrument is adequately adjusted to the dimensional structure the scale authors originally proposed. The correlation between "satisfaction with position/role" and "satisfaction with teamwork" and the survey's missed care variables was moderate (Spearman's coefficient =0.35; p<0.001). The results of the Student's t-test indicated known-group validity. Professionals from closed units reported lower levels of missed care in comparison with the other units. The reliability showed a strong correlation, with the exception of "institutional management/leadership style" (intraclass correlation coefficient (ICC)=0.15; p=0.04). The internal consistency was adequate (Cronbach's alpha was greater than 0.70). the MISSCARE-BRASIL was valid and reliable in the group studied. The application of the MISSCARE-BRASIL can contribute to identifying solutions for missed nursing care.
Bos, Nanne; Sturms, Leontien M; Stellato, Rebecca K; Schrijvers, Augustinus J P; van Stel, Henk F
2015-10-01
Patients' experiences are an indicator of health-care performance in the accident and emergency department (A&E). The Consumer Quality Index for the Accident and Emergency department (CQI A&E), a questionnaire to assess the quality of care as experienced by patients, was investigated. The internal consistency, construct validity and discriminative capacity of the questionnaire were examined. In the Netherlands, twenty-one A&Es participated in a cross-sectional survey, covering 4883 patients. The questionnaire consisted of 78 questions. Principal components analysis determined underlying domains. Internal consistency was determined by Cronbach's alpha coefficients, construct validity by Pearson's correlation coefficients and the discriminative capacity by intraclass correlation coefficients and reliability of A&E-level mean scores (G-coefficient). Seven quality domains emerged from the principal components analysis: information before treatment, timeliness, attitude of health-care professionals, professionalism of received care, information during treatment, environment and facilities, and discharge management. Domains were internally consistent (range: 0.67-0.84). Five domains and the 'global quality rating' had the capacity to discriminate among A&Es (significant intraclass correlation coefficient). Four domains and the 'global quality rating' were close to or above the threshold for reliably demonstrating differences among A&Es. The patients' experiences score on the domain timeliness showed the largest range between the worst- and best-performing A&E. The CQI A&E is a validated survey to measure health-care performance in the A&E from patients' perspective. Five domains regarding quality of care aspects and the 'global quality rating' had the capacity to discriminate among A&Es. © 2013 John Wiley & Sons Ltd.
ERIC Educational Resources Information Center
Saupe, Joe L.; Eimers, Mardy T.
2013-01-01
The purpose of this paper is to explore differences in the reliabilities of cumulative college grade point averages (GPAs), estimated for unweighted and weighted, one-semester, 1-year, 2-year, and 4-year GPAs. Using cumulative GPAs for a freshman class at a major university, we estimate internal consistency (coefficient alpha) reliabilities for…
Ahlström, Isabell; Hellström, Karin; Emtner, Margareta; Anens, Elisabeth
2015-03-01
To examine the test-retest reliability of the Swedish translated version of the Exercise Self-Efficacy Scale (S-ESES) in people with neurological disease and to examine internal consistency. Test-retest study. A total of 30 adults with neurological diseases including: Parkinson's disease; Multiple Sclerosis; Cervical Dystonia; and Charcot-Marie-Tooth disease. The S-ESES was sent twice by surface mail. Completion interval mean was 16 days apart. Weighted kappa, intraclass correlation coefficient 2,1 [ICC (2,1)], standard error of measurement (SEM), also expressed as a percentage value (SEM%), and Cronbach's alpha were calculated. The relative reliability of the test-retest results showed substantial agreement measured using weighted kappa (MD = 0.62) and a very high-reliability ICC (2,1) (0.92). Absolute reliability measured using SEM was 5.3 and SEM% was 20.7. Excellent internal consistency was shown, with an alpha coefficient of 0.91 (test 1) and 0.93 (test 2). The S-ESES is recommended for use in research and in clinical work for people with neurological diseases. The low-absolute reliability, however, indicates a limited ability to measure changes on an individual level.
Wang, W J; Dong, J; Ren, Z P; Chen, B; He, W; Li, W D; Hao, Z W
2016-07-06
To evaluate the validity, reliability, and acceptability of the scale of knowledge, attitude, and behavior of lifestyle intervention in a diabetes high-risk population (HILKAB), and provide scientific evidence for its usage. By convenient sampling, we selected 406 individuals at high risk for diabetes for survey using the HILKAB. Pearson correlation coefficient, factor analysis, independent sampling, and t-test for high- and low-score groups were used to evaluate the content validity, construct validity, and discriminant validity of the scale. Reliability of the scale was evaluated by internal consistency, which included Cronbach's α coefficient, θ coefficient, Ω coefficient, and split-half reliability. Scale acceptability was evaluated by acceptance rate and completion time of the survey. In this study, 366 questionnaires (90.1%) was qnalified and the completion time was (8.62±2.79) minutes. Scores for knowledge, attitude, and behavior were 10.60±3.73, 26.56±3.58, 17.09±9.74, respectively. The scale had good face validity and content validity. The correlation coefficient of items and the dimension to which they belong was between 0.25 and 0.97, and the correlation coefficient of three dimensions and the entire scale was between 0.64 and 0.91, all with P<0.001. Factor analysis of the scale extracted eight common factors. The cumulative variance contribution rate was 65.23%, thereby reaching the 50% approved standard. Of 30 items there were 29 items with factor loadings ≥0.40, indicating the scale had good construct validity. For the high-score group, scores for knowledge, attitude, and behavior dimensions were 13.89±2.55, 29.56± 2.46, 28.05 ± 2.93, respectively, which were higher than those for the low-score group (7.67 ± 2.78, 23.89 ± 3.35, 6.25 ± 3.13); t-values were 55.14, 119.40, 95.29, respectively, with P<0.001. The scale consisted of three dimensions: knowledge, attitude, and behavior. The Cronbach's α coefficient was between 0.84 and 0.92, the θ coefficient was between 0.85 and 0.96, the Ω coefficient was between 0.90 and 0.94, and the split-half reliability was between 0.77 and 0.95, reaching the 0.70 standard letter. The validity, reliability, and acceptability of the HILKAB scale were satisfactory for use in a population at high risk of diabetes.
Yalın Sapmaz, Şermin; Özek Erkuran, Handan; Ergin, Dilek; Öztürk, Masum; Şen Celasin, Nesrin; Karaarslan, Duygu; Aydemir, Ömer
2018-02-23
Background/aim: This study aimed to assess the validity and reliability of the Turkish version of the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form. Materials and methods: The study sample consisted of 32 patients treated in a child psychiatry unit and diagnosed with generalized anxiety disorder and 98 healthy volunteers who were attending middle or high school during the study period. For the assessment, the Screen for Child Anxiety and Related Emotional Disorders (SCARED) was also used along with the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form. Results: Regarding reliability analyses, the Cronbach alpha internal consistency coefficient was calculated as 0.932. The test-retest correlation coefficient was calculated as r = 0.707. As for construct validity, one factor that could explain 62.6% of the variance was obtained and this was consistent with the original construct of the scale. As for concurrent validity, the scale showed a high correlation with SCARED. Conclusion: It was concluded that Turkish version of the DSM-5 Generalized Anxiety Disorder Severity Scale - Child Form could be utilized as a valid and reliable tool both in clinical practice and for research purposes.
The Cardiff Acne Disability Index (CADI): linguistic and cultural validation in Serbian.
Jankovic, Slavenka; Vukicevic, Jelica; Djordjevic, Sanja; Jankovic, Janko; Marinkovic, Jelena; Basra, Mohammad K A
2013-02-01
The aims of this study were to translate the Cardiff Acne Disability Index (CADI) into Serbian and to assess its validity and reliability in Serbian acne patients. The CADI was translated and linguistically validated into Serbian according to published guidelines. This version of CADI, along with the Serbian version of Children's Dermatology Life Quality Index (CDLQI) and a short demographic questionnaire, was administrated to a cohort of secondary school pupils. The Global Acne Grading Score was used to measure the clinical severity of acne. The internal consistency reliability of the Serbian version of CADI was assessed by Cronbach's alpha coefficient while its concurrent validity was assessed by Spearman's correlation coefficient. Construct validity was examined by factor analysis. A total of 465 pupils completed questionnaires. Self-reported acne was present in 76% of pupils (353/465). The Serbian version of CADI showed high internal consistency reliability (Cronbach's alpha coefficient = 0.79). The mean item-total correlation coefficient was 0.74 with a range of 0.53-0.81. The concurrent validity of the scale was supported by a moderate but highly significant correlation with the CDLQI (Spearman's rho = 0.66; P < 0.001). Factor analysis revealed the presence of two dimensions underlying the factor structure of the scale. The Serbian version of the CADI is a reliable, valid, and valuable tool for assessing the impact of acne on the quality of life of Serbian-speaking patients.
Gupta, Aayush; Sharma, Yugal K; Dash, K; Verma, Sampurna
2015-01-01
Acne vulgaris is known to impair many aspects of the quality of life (QoL) of its patients. To translate the Cardiff Acne Disability Index (CADI) from English into Hindi and to assess its validity and reliability in Hindi speaking patients with acne from India. Hindi version of CADI, translated and linguistically validated as per published international guidelines, along with a previously translated Hindi version of dermatology life quality index (DLQI) and a demographic questionnaire were administered to acne patients. The internal consistency reliability of the Hindi version of CADI and its concurrent validity were assessed by Cronbach's alpha co-efficient and Spearman's correlation co-efficient respectively. Construct validity was examined by factor analysis. Statistical analysis was carried out using the Statistical Package for the Social Sciences (SPSS) version 20 (SPSS Inc., Chicago, IL, USA) for Windows. One hundred Hindi speaking patients with various grades of acne participated in the study. Hindi version of CADI showed high internal consistency reliability (Cronbach's alpha co-efficient = 0.722). Mean item-to-total correlation co-efficient ranged from 0.502 to 0.760. Concurrent validity of the scale was supported by a significant correlation with the Hindi DLQI. Factor analysis revealed the presence of two dimensions underlying the factor structure of the scale. Hindi CADI is equivalent to the original English version and constitutes a reliable and valid tool for clinical assessment of the impact of acne on QoL.
The reliability and validity of a Japanese version of symptom checklist 90 revised
Tomioka, Mitsunao; Shimura, Midori; Hidaka, Mikio; Kubo, Chiharu
2008-01-01
Objective To examine the validity and reliability of a Japanese version of the Symptom Checklist 90 Revised (SCL-90-R (J)). Methods The English SCL-90-R was translated to Japanese and the Japanese version confirmed by back-translation. To determine the factor validity and internal consistency of the nine primary subscales, 460 people from the community completed SCL-90-R(J). Test-retest reliability was examined for 104 outpatients and 124 healthy undergraduate students. The convergent-discriminant validity was determined for 80 inpatients who replied to both SCL-90-R(J) and the Minnesota Multiphasic Personality Inventory (MMPI). Results The correlation coefficients between the nine primary subscales and items were .26 to .78. Cronbach's alpha coefficients were from .76 (Phobic Anxiety) to .86 (Interpersonal Sensitivity). Pearson's correlation coefficients between test-retest scores were from .81 (Psychoticism) to .90 (Somatization) for the outpatients and were from .64 (Phobic Anxiety) to .78 (Paranoid Ideation) for the students. Each of the nine primary subscales correlated well with their corresponding constructs in the MMPI. Conclusion We confirmed the validity and reliability of SCL-90-R(J) for the measurement of individual distress. The nine primary subscales were consistent with the items of the original English version. PMID:18957078
Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam
2016-03-03
The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria, completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two way random effect model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95 % confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SRMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r 0.69) CONCLUSIONS: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.
Neurology objective structured clinical examination reliability using generalizability theory
Park, Yoon Soo; Lukas, Rimas V.; Brorson, James R.
2015-01-01
Objectives: This study examines factors affecting reliability, or consistency of assessment scores, from an objective structured clinical examination (OSCE) in neurology through generalizability theory (G theory). Methods: Data include assessments from a multistation OSCE taken by 194 medical students at the completion of a neurology clerkship. Facets evaluated in this study include cases, domains, and items. Domains refer to areas of skill (or constructs) that the OSCE measures. G theory is used to estimate variance components associated with each facet, derive reliability, and project the number of cases required to obtain a reliable (consistent, precise) score. Results: Reliability using G theory is moderate (Φ coefficient = 0.61, G coefficient = 0.64). Performance is similar across cases but differs by the particular domain, such that the majority of variance is attributed to the domain. Projections in reliability estimates reveal that students need to participate in 3 OSCE cases in order to increase reliability beyond the 0.70 threshold. Conclusions: This novel use of G theory in evaluating an OSCE in neurology provides meaningful measurement characteristics of the assessment. Differing from prior work in other medical specialties, the cases students were randomly assigned did not influence their OSCE score; rather, scores varied in expected fashion by domain assessed. PMID:26432851
Neurology objective structured clinical examination reliability using generalizability theory.
Blood, Angela D; Park, Yoon Soo; Lukas, Rimas V; Brorson, James R
2015-11-03
This study examines factors affecting reliability, or consistency of assessment scores, from an objective structured clinical examination (OSCE) in neurology through generalizability theory (G theory). Data include assessments from a multistation OSCE taken by 194 medical students at the completion of a neurology clerkship. Facets evaluated in this study include cases, domains, and items. Domains refer to areas of skill (or constructs) that the OSCE measures. G theory is used to estimate variance components associated with each facet, derive reliability, and project the number of cases required to obtain a reliable (consistent, precise) score. Reliability using G theory is moderate (Φ coefficient = 0.61, G coefficient = 0.64). Performance is similar across cases but differs by the particular domain, such that the majority of variance is attributed to the domain. Projections in reliability estimates reveal that students need to participate in 3 OSCE cases in order to increase reliability beyond the 0.70 threshold. This novel use of G theory in evaluating an OSCE in neurology provides meaningful measurement characteristics of the assessment. Differing from prior work in other medical specialties, the cases students were randomly assigned did not influence their OSCE score; rather, scores varied in expected fashion by domain assessed. © 2015 American Academy of Neurology.
Validity and cross-cultural adaptation of the persian version of the oxford elbow score.
Ebrahimzadeh, Mohammad H; Kachooei, Amir Reza; Vahedi, Ehsan; Moradi, Ali; Mashayekhi, Zeinab; Hallaj-Moghaddam, Mohammad; Azami, Mehran; Birjandinejad, Ali
2014-01-01
Oxford Elbow Score (OES) is a patient-reported questionnaire used to assess outcomes after elbow surgery. The aim of this study was to validate and adapt the OES into Persian language. After forward-backward translation of the OES into Persian, a total number of 92 patients after elbow surgeries completed the Persian OES along with the Persian DASH and SF-36. To assess test-retest reliability, 31 randomly selected patients (34%) completed the Persian OES again after three days while abstaining from all forms of therapeutic regimens. Reliability of the Persian OES was assessed by measuring intraclass correlation coefficient (ICC) for test-retest reliability and Cronbach's alpha for internal consistency. Spearman's correlation coefficient was used to test the construct validity. Cronbach's alpha coefficient was 0.92 showing excellent reliability. Cronbach's alpha for function, pain, and social-psychological subscales was 0.95, 0.86, and 0.85, respectively. Intraclass correlation coefficient (ICC) was 0.85 for the overall questionnaire and 0.90, 0.76, and 0.75 for function, pain, and social-psychological subscales, respectively. Construct validity was confirmed as the Spearman correlation between OES and DASH was 0.80. Persian OES is a valid and reliable patient-reported outcome measure to assess postsurgical elbow status in Persian speaking population.
[Reliability and Validity of the Scale for Homophobia in Medicine Students].
Campo-Arias, Adalberto; Lafaurie, María Mercedes; Gaitán-Duarte, Hernando G
2012-12-01
There are several scales to quantify homophobia in different populations. However, the reliability and validity of these instruments among Colombian students are unknown. Consequently, this work is intended to assess reliability (inner consistency) as well as the validity of the Scale for Homophobia in Medicine students from a private university in Bogotá (Colombia). Methodological study with 199 Medicine students from 1st to 5th semester that filled out the Homophobia Scale form, the general welfare questionnaire, the Attitude Towards Gays and Lesbians Scale (ATGL), WHO-5 (divergent validity) and the Francis Scale of Attitude Toward Christianity (nomologic validity). Pearson's correlations were computed, the Cronbach's alfa coefficient, the omega coefficient (construct's reliability) and confirmatory factorial analysis. The Scale for Homophobia showed an alpha Cronbach coefficient of 0,785, an omega coefficient of 0,790 and a Pearson correlation with the ATGL of 0,844; with WHO-5, -0,059; and a Francis Scale of Attitude Toward Christianity, 0,187. The Scale toward Homophobia exhibited a relevant factor of 44,7% of the total variance. The Scale for Homophobia showed acceptable reliability and validity. New studies should investigate the stability of the scale and the nomologic validity regarding other constructs. Copyright © 2012 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
[Development of skill scale for communication skill measurement of pharmacist].
Teramachi, Hitomi; Komada, Natsuki; Tanizawa, Katsuya; Kuzuya, Yumi; Tsuchiya, Teruo
2011-04-01
To purpose of this study was to develop a pharmacist communication skill scale. A 38 items scale was made and 283 pharmacists responded. The original questionnaire consisted of 38 items, with 1-5 graded Likert scale. Completed responses of 228 pharmacists data were used for testing the reliability and the validity of this scale. The first group of items from the original questionnaire were 38, and finally 38 original items were chosen for investigation of content validity, correlation coefficient and commonality. From factor analysis, four factors were chosen among the 31 items as follows: patient respect reception skill, problem discovery and solution skill, positive approach skill, feelings processing skill. The correlation coefficient between this original scale and the KiSS-18 (Social Skill) received high score (r=0.694). The reliability of this scale showed high internal consistency (Cronbach α coefficient=0.951), so the result of test for the validity of this scale supports high content validity. Thus we propose adoption of pharmacist communication skill scale to carry a brief eponymous name as TePSS-31. The above findings indicate that this developed scale possess adequate validity and reliability for practical use.
ERIC Educational Resources Information Center
Lucas, Richard E.; Donnellan, M. Brent
2012-01-01
Life satisfaction is often assessed using single-item measures. However, estimating the reliability of these measures can be difficult because internal consistency coefficients cannot be calculated. Existing approaches use longitudinal data to isolate occasion-specific variance from variance that is either completely stable or variance that…
Artilheiro, Mariana Cunha; Fávero, Francis Meire; Caromano, Fátima Aparecida; Oliveira, Acary de Souza Bulle; Carvas, Nelson; Voos, Mariana Callil; Sá, Cristina Dos Santos Cardoso de
2017-12-08
The Jebsen-Taylor Test evaluates upper limb function by measuring timed performance on everyday activities. The test is used to assess and monitor the progression of patients with Parkinson disease, cerebral palsy, stroke and brain injury. To analyze the reliability, internal consistency and validity of the Jebsen-Taylor Test in people with Muscular Dystrophy and to describe and classify upper limb timed performance of people with Muscular Dystrophy. Fifty patients with Muscular Dystrophy were assessed. Non-dominant and dominant upper limb performances on the Jebsen-Taylor Test were filmed. Two raters evaluated timed performance for inter-rater reliability analysis. Test-retest reliability was investigated by using intraclass correlation coefficients. Internal consistency was assessed using the Cronbach alpha. Construct validity was conducted by comparing the Jebsen-Taylor Test with the Performance of Upper Limb. The internal consistency of Jebsen-Taylor Test was good (Cronbach's α=0.98). A very high inter-rater reliability (0.903-0.999), except for writing with an Intraclass correlation coefficient of 0.772-1.000. Strong correlations between the Jebsen-Taylor Test and the Performance of Upper Limb Module were found (rho=-0.712). The Jebsen-Taylor Test is a reliable and valid measure of timed performance for people with Muscular Dystrophy. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
[Construction of a psychological aging scale for healthy people].
Lin, Fei; Long, Yao; Zeng, Ni; Wu, Lei; Huang, Helang
2017-04-28
To construct a psychological aging scale, and to provide a tool and indexes for scientific evaluation on aging. Methods: The age-related psychological items were collected through literature screening and expert interview. The importance, feasibilityand the degree of authority for the psychological index system were graded by two rounds of Delphi method. Using analytic hierarchy process, the weight of dimensions and items were determined. The analysis for internal consistency reliability, correlation and exploratory factor was performed to evaluate the reliability and validity of the scales. Results: By two rounds of Delphi method, 17 experts offered the results as follows: the coefficient of expert authorities was 0.88±0.06, the coordination coefficients for the importance and feasibility in second round were 0.456 (P<0.01) and 0.666 (P<0.01), respectively. The consistency was good. The psychological aging scale for healthy people included 4 dimensions as follows: cognitive function, emotion, personality and motivation. The weight coefficients for the 4 dimensions were 0.338, 0.250, 0.166 and 0.258, respectively. The Cronbach's α coefficient for the scale was 0.822, the reliability was 0.817, the content validity index (CVI) was 0.847, and the cumulative contribution rate for the 5 factors was51.42%. Conclusion: The psychological aging scale is satisfied, which can provide reference for the evaluation for aging. The indicators were representative and well-recognized.
[Validating the Spanish version of the Nursing Activities Score].
Sánchez-Sánchez, M M; Arias-Rivera, S; Fraile-Gamo, M P; Thuissard-Vasallo, I J; Frutos-Vivar, F
2015-01-01
Validating workload scores ensures that they are appropriate for the purpose for which they were developed. To validate the Nursing Activities Score (NAS) Spanish version. Observational and prospective study. 1,045 patients who were admitted to a medical-surgical unit and a serious burns unit in 2006 were included. The nurse in charge assessed patient workloads by Nine Equivalent of Nursing Manpower use Score and NAS. To assess the internal consistency of the measurements of NAS, item-test correlations, Cronbach's α and Cronbach's α corrected by omitting each of the items were calculated. The intraobserver and interobserver reliability were assessed with the intraclass correlation coefficient by viewing recordings and Kappa (interobserver reliability) was estimated. For the analysis of internal validity, a factorial principal components analysis was performed. Convergent validity was assessed using the Spearman correlation coefficient values obtained from the Nine Equivalent of Nursing Manpower use Score and Spanish-NAS scales. For internal consistency, 164 questionnaires were analysed and a Cronbach's α of 0.373 was calculated. The intraclass correlation coefficient for intraobserver reliability estimate was 0.837 (95% IC: 0.466-0.950) and 0.662 (95% IC: 0.033-0.882) for interobserver reliability. The estimated kappa was 0.371. For internal validity, exploratory factor analysis showed that the first item explained 58.9% of the variance of the questionnaire. For convergent validity 1006 questionnaires were included and a Spearman correlation coefficient of 0.746 was observed. The psychometric properties of Spanish-NAS are acceptable. Copyright © 2014 Elsevier España, S.L.U. y SEEIUC. All rights reserved.
Stoyanova, Rumyana; Dimova, Rositsa; Tarnovska, Miglena; Boeva, Tatyana
2018-05-20
Patient safety (PS) is one of the essential elements of health care quality and a priority of healthcare systems in most countries. Thus the creation of validated instruments and the implementation of systems that measure patient safety are considered to be of great importance worldwide. The present paper aims to illustrate the process of linguistic validation, cross-cultural verification and adaptation of the Bulgarian version of the Hospital Survey on Patient Safety Culture (B-HSOPSC) and its test-retest reliability. The study design is cross-sectional. The HSOPSC questionnaire consists of 42 questions, grouped in 12 different subscales that measure patient safety culture. Internal con-sistency was assessed using Cronbach's alpha. The Wilcoxon signed-rank test and the split-half method were used; the Spear-man-Brown coefficient was calculated. The overall Cronbach's alpha for B-HSOPSC is 0.918. Subscales 7 Staffing and 12 Overall perceptions of safety had the lowest coefficients. The high reliability of the instrument was confirmed by the Split-half method (0.97) and ICC-coefficient (0.95). The lowest values of Spearmen-Broun coefficients were found in items A13 and A14. The study offers an analysis of the results of the linguistic validation of the B-HSOPSC and its test-retest reliability. The psychometric characteristics of the questions revealed good validity and reliability, except two questions. In the future, the instrument will be administered to the target population in the main study so that the psychometric properties of the instrument can be verified.
Verification of reliability and validity of a Japanese version of the Rathus Assertiveness Schedule.
Suzuki, Eiko; Kanoya, Yuka; Katsuki, Takeshi; Sato, Chifumi
2007-07-01
To verify the reliability and validity of a Japanese version of the Rathus Assertiveness Schedule in novice nurses to contribute to nursing management. An adequate scale is needed to measure the assertiveness and the effect of assertion training for Japanese nurses and to compare them with those in other countries. Rathus Assertiveness Schedule was adapted to Japanese with back-translation and its validity was examined in 989 novice nurses. The Japanese version showed a high coefficient of reliability in a split-half reliability test (r=0.76; P<0.01). The coefficient of reliability of Cronbach's alpha was high (r=0.84; P<0.01) indicating high internal consistency. The similarity with the concept of stress coping was shown. We extracted eight principal factors using factor analysis with varimax rotation. Elements of these factors were similar to those of the original Rathus Assertiveness Schedule. The Japanese version of Rathus Assertiveness Schedule was verified.
Yalin Sapmaz, Şermin; Ergin, Dilek; Şen Celasin, Nesrin; Karaarslan, Duygu; Öztürk, Masum; Özek Erkuran, Handan; Köroğlu, Ertuğrul; Aydemir, Ömer
2017-12-01
This study aimed to assess the validity and reliability of the Turkish version of the Diagnostic and statistical manual of Mental Disorders. (5 th ed.) (DSM-5) Social Anxiety Disorder Severity Scale- Child Form. The scale was prepared by carrying out the translation and back translation of the DSM-5 Social Anxiety Disorder Severity Scale - Child Form. The study group consisted of 31 patients that had been treated in a child psychiatry unit and diagnosed with social anxiety disorder and 99 healthy volunteers that were attending middle or high school during the study period. For the assessment, the Screen for Child Anxiety and Related Emotional Disorders (SCARED) was also used along with the DSM-5 Social Anxiety Disorder Severity Scale - Child Form. Regarding reliability analyses, Cronbach's alpha internal consistency coefficient was calculated as 0.941, while item-total score correlation coefficients were measured between 0.566 and 0.866. A test-retest correlation coefficient was calculated as r=0.711. As for construct validity, one factor that could explain 66.0 % of the variance was obtained. As for concurrent validity, the scale showed a high correlation with the SCARED. It was concluded that the Turkish version of the DSM-5 Social Anxiety Disorder Severity Scale - Child Form could be utilized as a valid and reliable tool both in clinical practice and for research purposes.
The reliability of the Hendrich Fall Risk Model in a geriatric hospital.
Heinze, Cornelia; Halfens, Ruud; Dassen, Theo
2008-12-01
Aims and objectives. The purpose of this study was to test the interrater reliability of the Hendrich Fall Risk Model, an instrument to identify patients in a hospital setting with a high risk of falling. Background. Falls are a serious problem in older patients. Valid and reliable fall risk assessment tools are required to identify high-risk patients and to take adequate preventive measures. Methods. Seventy older patients were independently and simultaneously assessed by six pairs of raters made up of nursing staff members. Consensus estimates were calculated using simple percentage agreement and consistency estimates using Spearman's rho and intra class coefficient. Results. Percentage agreement ranged from 0.70 to 0.92 between the six pairs of raters. Spearman's rho coefficients were between 0.54 and 0.80 and the intra class coefficients were between 0.46 and 0.92. Conclusions. Whereas some pairs of raters obtained considerable interobserver agreement and internal consistency, the others did not. Therefore, it is concluded that the Hendrich Fall Risk Model is not a reliable instrument. The use of more unambiguous operationalized items is preferred. Relevance to clinical practice. In practice, well operationalized fall risk assessment tools are necessary. Observer agreement should always be investigated after introducing a standardized measurement tool. © 2008 The Authors. Journal compilation © 2008 Blackwell Publishing Ltd.
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish
Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan
2015-01-01
Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions.Psycometric properties were analyzed including internal consistency, measured with Cronbach´s Alpha, test-retest reliability at 15 days with the interclass correlation coefficient. Results: The internal consistency, validation, was an Alpha of 0,808, evaluated as good. The test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and it´s cultural adaptation to Argentinian-Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.
Kim, Hee-Ju; Abraham, Ivo
2017-01-01
Evidence is needed on the clinicometric properties of single-item or short measures as alternatives to comprehensive measures. We examined whether two single-item fatigue measures (i.e., Likert scale, numeric rating scale) or a short fatigue measure were comparable to a comprehensive measure in reliability (i.e., internal consistency and test-retest reliability) and validity (i.e., convergent, concurrent, and predictive validity) in Korean young adults. For this quantitative study, we selected the Functional Assessment of Chronic Illness Therapy-Fatigue for the comprehensive measure and the Profile of Mood States-Brief, Fatigue subscale for the short measure; and constructed two single-item measures. A total of 368 students from four nursing colleges in South Korea participated. We used Cronbach's alpha and item-total correlation for internal consistency reliability and intraclass correlation coefficient for test-retest reliability. We assessed Pearson's correlation with a comprehensive measure for convergent validity, with perceived stress level and sleep quality for concurrent validity and the receiver operating characteristic curve for predictive validity. The short measure was comparable to the comprehensive measure in internal consistency reliability (Cronbach's alpha=0.81 vs. 0.88); test-retest reliability (intraclass correlation coefficient=0.66 vs. 0.61); convergent validity (r with comprehensive measure=0.79); concurrent validity (r with perceived stress=0.55, r with sleep quality=0.39) and predictive validity (area under curve=0.88). Single-item measures were not comparable to the comprehensive measure. A short fatigue measure exhibited similar levels of reliability and validity to the comprehensive measure in Korean young adults. Copyright © 2016 Elsevier Ltd. All rights reserved.
Simões, Luan; Teixeira-Salmela, Luci Fuscaldi; Magalhães, Lívia; Stuge, Britt; Laurentino, Glória; Wanderley, Elaine; Barros, Raphaela; Lemos, Andrea
2018-04-24
The purpose of this study was to evaluate test-retest reliability, construct validity, and internal consistency of the Brazilian version of the Pelvic Girdle Questionnaire (PGQ-Brazil). Analysis of the measurement properties was carried out in 4 steps. Step 1 was the pilot study, on which basis 4 hypotheses were formulated. These hypotheses were tested during the next step (construct validity, step 2) by completion of the questionnaire by the 2 groups (in pain [n = 105] and not in pain [n = 52]). For implementation of the PGQ-Brazil in the group with pain, we calculated the internal consistency (step 3) and, 7 days later, test-retest reliability (step 4) by re-application of the instrument in this group. First, the PGQ-Brazil was able to discriminate between these groups (construct validity). Second, test-retest reliability (intraclass correlation coefficients for Activities subscale [0.97 with 95% confidence interval of 0.95-0.98] and Symptoms subscale [0.98 with 95% confidence interval of 0.97-0.98] and κ coefficient between 0.50 and 0.89 for the items) was found to be good; the Bland-Altman test indicated satisfactory agreement. The Rasch analysis indicated good internal consistency, and the instrument's ability to divide the participants into at least 3 levels of skills was confirmed. In contrast, a ceiling effect was observed, as 24% of pregnant women exhibited skills superior to what the PGQ-Brazil could evaluate. The PGQ-Brazil had good internal consistency, test-retest reliability, and construct validity in assessment of limitations in activities and symptoms of pregnant women with pelvic girdle pain. Copyright © 2018. Published by Elsevier Inc.
Nurses' Empowerment Scale for ICU patients' families: an instrument development study.
Li, Hong; Liu, Ya-Lan; Qiu, Li; Chen, Qiao-Ling; Wu, Jing-Bing; Chen, Li-Li; Li, Na
2016-09-01
Family members provide essential support for ICU patients, contributing to their mental and physical recovery. Empowering ICU patients' families may help them overcome inadequacies and meet their own and patients' acknowledged needs. Nursing should understand and address patients' families' empowerment status. To develop a tool, the Nurses' Empowerment Scale for Intensive Care Unit (ICU) Patients' Families (NESIPF), to help ICU nursing staff assess the empowerment status of patients' families. Four-phase instrument development study. A 19-item instrument was initially generated based on literature review and interviews with family members of ICU patients. The Delphi research method was applied to gain expert opinion and consensus via rounds of questionnaires. A panel of 27 experts experienced in critical care medicine, nursing and psychology participated in two Delphi rounds and their input helped formulate an 18-item pretest instrument. Families of 20 patients were recruited to examine instrument readability. After a 2-week interval, another 20 patients' families were recruited to examine test-retest reliability. Two hundred questionnaires were then administered and analysed to examine the instrument's construct validity, criterion-related validity and internal consistency. Expert authority coefficients of two Delphi rounds reached 0·89 and 0·91. Kendall' W coefficients of 0·113 (P < 0·001) in round 1 and 0·220 (P < 0·001) in round 2 indicated slight to fair agreement among experts. Content validity index (CVI) reached 1·0 for 12 items; the CVI for item 13 was <0·7 so it was excluded. Cronbach's α coefficient was 0·92, indicating acceptable internal consistency reliability. The coefficient of internal consistency of each dimension was 0·717-0·921. The Pearson correlation coefficient >0·9 (P < 0·05) showed an acceptable test-retest reliability. The instrument has acceptable reliability and validity and can assess the empowerment status of families of critically ill patients. Knowledge of families' empowerment status may help to address their psychological needs and their ability to provide family support. © 2014 British Association of Critical Care Nurses.
ERIC Educational Resources Information Center
Hatami, Gissou; Motamed, Niloofar; Ashrafzadeh, Mahshid
2010-01-01
Validity and reliability of Persian adaptation of MSLSS in the 12-18 years, middle and high school students (430 students in grades 6-12 in Bushehr port, Iran) using confirmatory factor analysis by means of LISREL statistical package were checked. Internal consistency reliability estimates (Cronbach's coefficient [alpha]) were all above the…
ERIC Educational Resources Information Center
Liukkonen, Jarmo; Leskinen, Esko
1999-01-01
Analyzed the reliability and validity of scores of 557 14-year-old Finnish male soccer players on the children's version of the Perception of Success Questionnaire (G. Roberts and others, 1998). Internal consistency coefficients for the two subscales' scores were high, and scores on both scales had strong construct validity. (LSD)
Reliability and validity of the Dutch pediatric Voice Handicap Index.
Veder, Laura; Pullens, Bas; Timmerman, Marieke; Hoeve, Hans; Joosten, Koen; Hakkesteegt, Marieke
2017-05-01
The pediatric voice handicap index (pVHI) has been developed to provide a better insight into the parents' perception of their child's voice related quality of life. The purpose of the present study was to validate the Dutch pVHI by evaluating its internal consistency and reliability. Furthermore, we determined the optimal cut-off point for a normal pVHI score. All items of the English pVHI were translated into Dutch. Parents of children in our dysphonic and control group were asked to fill out the questionnaire. For the test re-test analysis we used a different study group who filled out the pVHI twice as part of a large follow up study. Internal consistency was analyzed through Cronbach's α coefficient. The test-retest reliability was assessed by determining Pearson's correlation coefficient. Mann-Whitney test was used to compare the scores of the questionnaire of the control group with the dysphonic group. By calculating receiver operating characteristic (ROC) curves, sensitivity and specificity we were able to set a cut-off point. We obtained data from 122 asymptomatic children and from 79 dysphonic children. The scores of the questionnaire significantly differed between both groups. The internal consistency showed an overall Cronbach α coefficient of 0.96 and an excellent test-retest reliability of the total pVHI questionnaire with a Pearson's correlation coefficient of 0.90. A cut-off point for the total pVHI questionnaire was set at 7 points with a specificity of 85% and sensitivity of 100%. A cut-off point for the VAS score was set at 13 with a specificity of 93% and sensitivity of 97%. The Dutch pVHI is a valid and reliable tool for the assessment of children with voice problems. By setting a cut-off point for the score of the total pVHI questionnaire of 7 points and the VAS score of 13, the pVHI might be used as a screening tool to assess dysphonic complaints and the pVHI might be a useful and complementary tool to identify children with dysphonia. Copyright © 2017 Elsevier B.V. All rights reserved.
Effect of knee angle on neuromuscular assessment of plantar flexor muscles: A reliability study
Cornu, Christophe; Jubeau, Marc
2018-01-01
Introduction This study aimed to determine the intra- and inter-session reliability of neuromuscular assessment of plantar flexor (PF) muscles at three knee angles. Methods Twelve young adults were tested for three knee angles (90°, 30° and 0°) and at three time points separated by 1 hour (intra-session) and 7 days (inter-session). Electrical (H reflex, M wave) and mechanical (evoked and maximal voluntary torque, activation level) parameters were measured on the PF muscles. Intraclass correlation coefficients (ICC) and coefficients of variation were calculated to determine intra- and inter-session reliability. Results The mechanical measurements presented excellent (ICC>0.75) intra- and inter-session reliabilities regardless of the knee angle considered. The reliability of electrical measurements was better for the 90° knee angle compared to the 0° and 30° angles. Conclusions Changes in the knee angle may influence the reliability of neuromuscular assessments, which indicates the importance of considering the knee angle to collect consistent outcomes on the PF muscles. PMID:29596480
Psychometric properties of the Exercise Benefits/Barriers Scale in Mexican elderly women
Enríquez-Reyna, María Cristina; Cruz-Castruita, Rosa María; Ceballos-Gurrola, Oswaldo; García-Cadena, Cirilo Humberto; Hernández-Cortés, Perla Lizeth; Guevara-Valtier, Milton Carlos
2017-01-01
ABSTRACT Objective: analyze and assess the psychometric properties of the subscales in the Spanish version of the Exercise Benefits/Barriers Scale in an elderly population in the Northeast of Mexico. Method: methodological study. The sample consisted of 329 elderly associated with one of the five public centers for senior citizens in the metropolitan area of Northeast Mexico. The psychometric properties included the assessment of the Cronbach's alpha coefficient, the Kaiser Meyer Olkin coefficient, the inter-item correlation, exploratory and confirmatory factor analysis. Results: in the principal components analysis, two components were identified based on the 43 items in the scale. The item-total correlation coefficient of the exercise benefits subscale was good. Nevertheless, the coefficient for the exercise barriers subscale revealed inconsistencies. The reliability and validity were acceptable. The confirmatory factor analysis revealed that the elimination of items improved the goodness of fit of the baseline scale, without affecting its validity or reliability. Conclusion: the Exercise Benefits/Barriers subscale presented satisfactory psychometric properties for the Mexican context. A 15-item short version is presented with factorial structure, validity and reliability similar to the complete scale. PMID:28591306
Imaeda, Toshihiko; Uchiyama, Shigeharu; Wada, Takuro; Okinaga, Shuji; Sawaizumi, Takuya; Omokawa, Shohei; Momose, Toshimitsu; Moritomo, Hisao; Gotani, Hiroyuki; Abe, Yukio; Nishida, Jun; Kanaya, Fuminori
2010-07-01
The Patient-Rated Wrist Evaluation is a regionspecific, self-administered questionnaire consisting of a pain scale (PRWE-P) and a functional scale (PRWE-F), with the latter consisting of specific function (PRWE-SF) and usual function (PRWE-UF). The PRWE was cross-culturally adapted from the original English version by the Impairment Evaluation Committee, Japanese Society for Surgery of the Hand (JSSH). The purpose of this study was to test the reliability, validity, and responsiveness of the Japanese version of PRWE (PRWE-J). A consecutive series of 117 patients with wrist disorders completed the PRWE-J, the JSSH version of the Disabilities of the Arm, Shoulder, and Hand (DASH-JSSH) questionnaire and the 36-Item Short-Form Health Survey (SF-36). Of the 117 patients, 71 were reassessed for test-retest reliability 1 or 2 weeks later. Reliability was investigated by reproducibility and internal consistency. To analyze the validity, a factor analysis (principal axis factoring) of PRWE-J and correlation coefficients between PRWE-J and DASH-JSSH were obtained. Responsiveness was examined by calculating the standardized response mean (SRM) (mean change/SD) and effect size (mean change/SD of baseline value) after open surgery in 50 patients. Cronbach's alpha coefficients for PRWE-P, PRWE-F, and PRWE were 0.90, 0.95, and 0.95, respectively. The intraclass correlation coefficients (ICCs) for the same were 0.86, 0.93, and 0.92, respectively. Unidimensionality of PRWE-P was con-firmed. Bidimensionality of PRWE-F was confirmed and separated clearly into PRWE-SF and PRWE-UF. The correlation coefficients between PRWE-P and PRWE-F or DASH-JSSH were 0.63 or 0.63, respectively. The correlation coefficient between PRWE-F and DASH-JSSH was 0.80. The correlation coefficients between DASH-JSSH and PRWE-SF or PRWE-UF were 0.76 or 0.73, respectively. Moderate correlation was observed in "physical functioning" for SF-36 and PRWE-SF (r = -0.46), PRWE-F (r = -0.46), or PRWE (r = -0.46). The SRMs/effect sizes of PRWE-P, PRWE-F, or PRWE were respectively excellent: 1.7/2.2, 1.2/1.3, 1.6/1.9. The PRWE-J has evaluation capacities equivalent to those of the original PRWE.
Perez, Concepcion; Galvez, Rafael; Huelbes, Silvia; Insausti, Joaquin; Bouhassira, Didier; Diaz, Silvia; Rejas, Javier
2007-01-01
Background This study assesses the validity and reliability of the Spanish version of DN4 questionnaire as a tool for differential diagnosis of pain syndromes associated to a neuropathic (NP) or somatic component (non-neuropathic pain, NNP). Methods A study was conducted consisting of two phases: cultural adaptation into the Spanish language by means of conceptual equivalence, including forward and backward translations in duplicate and cognitive debriefing, and testing of psychometric properties in patients with NP (peripheral, central and mixed) and NNP. The analysis of psychometric properties included reliability (internal consistency, inter-rater agreement and test-retest reliability) and validity (ROC curve analysis, agreement with the reference diagnosis and determination of sensitivity, specificity, and positive and negative predictive values in different subsamples according to type of NP). Results A sample of 164 subjects (99 women, 60.4%; age: 60.4 ± 16.0 years), 94 (57.3%) with NP (36 with peripheral, 32 with central, and 26 with mixed pain) and 70 with NNP was enrolled. The questionnaire was reliable [Cronbach's alpha coefficient: 0.71, inter-rater agreement coefficient: 0.80 (0.71–0.89), and test-retest intra-class correlation coefficient: 0.95 (0.92–0.97)] and valid for a cut-off value ≥ 4 points, which was the best value to discriminate between NP and NNP subjects. Discussion This study, representing the first validation of the DN4 questionnaire into another language different than the original, not only supported its high discriminatory value for identification of neuropathic pain, but also provided supplemental psychometric validation (i.e. test-retest reliability, influence of educational level and pain intensity) and showed its validity in mixed pain syndromes. PMID:18053212
Yang, Baoqi; Chen, Guo; Yang, Qing; Yan, Xiaoxiao; Zhang, Zhaoxia; Murrell, Dédée F; Zhang, Furen
2017-02-02
The autoimmune bullous diseases quality of life (ABQOL) questionnaire was recently developed by an Australian group and has been validated in Australian and North American patient cohorts. It is a 17-item, multidimensional, self-administered English questionnaire. The study aimed to validate the Chinese version of the ABQOL questionnaire and evaluate the reliability in Chinese patients. The Chinese version of the ABQOL questionnaire was produced by forward-backward translation and cross-cultural adaptation of the original English version. The ABQOL questionnaire was then distributed to a total of 101 patients with autoimmune bullous diseases (AIBDs) together with the Dermatology Life Quality Index (DLQI) and the 36-item Short Form Health Survey (SF-36). Validity was analyzed across a range of indices and reliability was assessed using internal consistency and test-retest methods. The Chinese version of the ABQOL questionnaire has a high internal consistency (Cronbach's alpha coefficient, 0.88) and test-retest reliability (the intraclass correlation coefficient, 0.87). Face and content validity were satisfactory. Convergent validity testing showed that the correlation coefficients for the ABQOL and DLQI was 0.77 and for the ABQOL and SF-36 was -0.62. In terms of discriminant validity, there was no significant difference between the proportions of insensitive items in ABQOL and DLQI (p = 0.236). There was no significant difference between the proportions of insensitive items in ABQOL and SF-36 (p = 0.823). The Chinese version of the ABQOL questionnaire has adequate validity and reliability. It may constitute a useful instrument to measure disease burden in Chinese patients with AIBDs.
Vespa, Anna; Giulietti, Maria Velia; Spatuzzi, Roberta; Fabbietti, Paolo; Meloni, Cristina; Gattafoni, Pisana; Ottaviani, Marica
2017-06-01
This study aimed at assessing the reliability and construct validity of Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS) on Italian sample. 353 Italian participants: 58.9% affected by different diseases and 41.1% healthy subjects. The results of descriptive statistics of internal consistency reliabilities (Chronbach's coefficient) of the BMMRS revealed a remarkable consistency and reliability of different scales DSE, SpC, SC, CSC, VB, SPY-WELL and a good Inter-Class Correlations ≥70 maintaining a good stability of the measures over the time. BMMRS is a useful inventory for the evaluation of the principal spiritual dimensions.
Test-retest reliability of the safe driving behavior measure for community-dwelling elderly drivers.
Song, Chiang-Soon; Lee, Joo-Hyun; Han, Sang-Woo
2016-06-01
[Purpose] The Safe Driving Behavior Measure (SDBM) is a self-report measurement tools that assesses the safe-driving behaviors of the elderly. The purpose of this study was to evaluate the test-retest reliability of the SDBM among community-dwelling elderly drivers. [Subjects and Methods] A total of sixty-one community-dwelling elderly were enrolled to investigate the reliability of the SDBM. The SDBM was assessed in two sessions that were conducted three days apart in a quiet and well-organized assessment room. That test-retest reliability of overall scores and three domain scores of the SDBM were statistically evaluated using intraclass correlation coefficients [ICC (2.1)]. Pearson correlation coefficients were used to quantify bivariate associations among the three domains of the SDBM. [Results] The SDBM demonstrated excellent rest-retest reliability for community-dwelling elderly drivers. The Cronbach alpha coefficients of the three domains of person-vehicle (0.979), person-environment (0.944), and person-vehicle-environment (0.971) of the SDBM indicate high internal consistency. [Conclusion] The results of this study suggest that the SDBM is a reliable measure for evaluating the safe- driving of automobiles by community-dwelling elderly, and is adequate for detecting changes in scores in clinical settings.
Gunaydin, Gurkan; Citaker, Seyit; Meray, Jale; Cobanoglu, Gamze; Gunaydin, Ozge Ece; Hazar Kanik, Zeynep
2016-11-01
Validation of a self-report questionnaire. The purpose of this study was to investigate adaptation, validity, and reliability of the Turkish version of the Bournemouth Questionnaire. Low back pain is one of the most frequent disorders leading to activity limitation. This pain affects most of people in their lives. The most important point to evaluate patient's functional abilities and to decide a successful therapy procedure is to manage the assessment questionnaires precisely. One hundred ten patients with chronic low back pain were included in present study. To assess reliability, test-retest and internal consistency analyses were applied. The results of test-retest analysis were assessed by using Intraclass Correlation Coefficient method (95% confidence interval). For internal consistency, Cronbach alpha value was calculated. Validity of the questionnaire was assessed in terms of construct validity. For construct validity, factor analysis and convergent validity were tested. For convergent validity, total points of the Bournemouth Questionnaire were assessed with the total points of Quebec Back Pain Disability Scale and Roland Morris Disability Questionnaire by using Pearson correlation coefficient analysis. Cronbach alpha value was found 0.914, showing that this questionnaire has high internal consistency. The results of test-retest analysis were varying between 0.851 and 0.927, which shows that test-retest results are highly correlated. Factor analysis test indicated that this questionnaire had one factor. Pearson correlation coefficient of the Bournemouth Questionnaire with Roland Morris Disability Questionnaire was calculated 0.703 and it was found with Quebec Back Pain Disability Scale is 0.659. These results showed that the Bournemouth Questionnaire is very good correlated with Roland Morris Disability Questionnaire and Quebec Back Pain Disability Scale. The Turkish version of the Bournemouth Questionnaire is valid and reliable. 3.
İlçin, Nursen; Gürpınar, Barış; Bayraktar, Deniz; Savcı, Sema; Çetin, Pınar; Sarı, İsmail; Akkoç, Nurullah
2016-01-01
[Purpose] This study describes the cultural adaptation, validation, and reliability of the Turkish version of the Pain Catastrophizing Scale in patients with ankylosing spondylitis. [Methods] The validity of the Turkish version of the Pain Catastrophizing Scale was assessed by evaluating data quality (missing data and floor and ceiling effects), principal components analysis, internal consistency (Cronbach’s alpha), and construct validity (Spearman’s rho). Reproducibility analyses included standard measurement error, minimum detectable change, limits of agreement, and intraclass correlation coefficients. [Results] Sixty-four adult patients with ankylosing spondylitis with a mean age of 42.2 years completed the study. Factor analysis revealed that all questionnaire items could be grouped into two factors. Excellent internal consistency was found, with a Chronbach’s alpha value of 0.95. Reliability analyses showed an intraclass correlation coefficient (95% confidence interval) of 0.96 for the total score. There was a low correlation coefficient between the Turkish version of the Pain Catastrophizing Scale and body mass index, pain levels at rest and during activity, health-related quality of life, and fear and avoidance behaviors. [Conclusion] The results of this study indicate that the Turkish version of the Pain Catastrophizing Scale is a valid and reliable clinical and research tool for patients with ankylosing spondylitis. PMID:26957778
Baradaran, Aslan; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza
2016-04-01
Prospective study. We aimed to validate the Persian version of the modified Oswestry disability questionnaire (MODQ) in patients with low back pain. Modified Oswestry low back pain disability questionnaire is a well-known condition-specific outcome measure that helps quantify disability in patients with lumbar syndromes. To test the validity in a pilot study, the Persian MODQ was administered to 25 individuals with low back pain. We then enrolled 200 consecutive patients with low back pain to fill the Persian MODQ as well as the short form 36 (SF-36) questionnaire. Convergent validity of the MODQ was tested using the Spearman's correlation coefficient between the MODQ and SF-36 subscales. Intraclass correlation coefficient (ICC) and Cronbach's α coefficient were measured to test the reliability between test and retest and internal consistency of all items, respectively. ICC for individual items ranged from 0.43 to 0.80 showing good reliability and reproducibility of each individual item. Cronbach's α coefficient was 0.69 showing good internal consistency across all 10 items of the Persian MODQ. Total MODQ score showed moderate to strong correlation with the eight subscales and the two domains of the SF-36. The highest correlation was between the MODQ and the physical functioning subscale of the SF-36 (r=-0.54, p<0.001) and the physical component domain of the SF-36 (r=-0.55, p<0.001) showing that MODQ is measuring what it is supposed to measure in terms of disability and physical function. Persian version of the MODQ is a valid and reliable tool for the assessment of the disability following low back pain.
Nair, Rahul; Tsakos, Georgios; Yee Ting Fai, Robert
2016-12-01
To cross-culturally adapt the oral impacts on daily performance (OIDP) and assess its reliability and validity on Chinese-speaking community dwelling elderly Singaporeans. There are no previous reports of valid oral health-related quality of life instruments for elderly Singaporeans or perceived conditions associated with impacts reported in OIDP among the Singaporean elders. The OIDP was translated from English to Chinese and then back translated. The OIDP questionnaire along with questions related to overall quality of life and self-rated dental health was administered to 202 Chinese-speaking elderly Singaporeans by trained interviewers, and it was repeated after 1 month. Test-retest reliability was assessed using intraclass correlation coefficient; internal consistency was established using Cronbach's alpha, and construct validity using correlation coefficients with self-reported oral health-related and global quality of life measures. In addition, Kruskal-Wallis tests assessed differences in the OIDP score between different subjective health and global quality of life groups. The median age of participants was 75 years. About 19% reported oral impacts and difficulty eating was the most prevalent oral impact. Internal consistency was good with a Cronbach's alpha of 0.75, and the intraclass correlation coefficient was 0.75 (0.67-0.81). OIDP was significantly correlated with all measures of self-reported oral health and global ratings of quality of life, with correlation coefficients ranging between 0.15 and 0.52. Groups with worse perceptions about their health and quality of life had significantly higher OIDP scores. The OIDP showed successful reliability and validity for its use among Chinese-speaking older Singaporeans. © 2015 John Wiley & Sons A/S and The Gerodontology Association. Published by John Wiley & Sons Ltd.
Cheung, Kenneth M C; Senkoylu, Alpaslan; Alanay, Ahmet; Genc, Yasemin; Lau, Sarah; Luk, Keith D
2007-05-01
Validation study to define validity and reliability of an adapted and translated questionnaire. Assessment of the concurrent validity and reliability of a Chinese version of SRS-22 outcome instrument. No valid health-related quality of life (HRQL) outcome instrument exists for patients with spinal deformity in Chinese. The modified SRS-22 questionnaire was proven to be an appropriate outcome instrument in English, and has already been translated and validated in several other languages. The English version of the SRS-22 questionnaire was adapted to Chinese according to the International Quality of Life Assessment Project guidelines. To assess reliability, 48 subjects with adolescent idiopathic scoliosis (mean age, 16.5 years) filled the questionnaire on 2 separate occasions (Group 1). To assess concurrent validity, 50 subjects (mean age, 21 years) filled in the same questionnaire and a previously validated Chinese version of the Short Form-36 (SF36) questionnaire (Group 2). Internal consistency, reproducibility and concurrent validity were determined with Cronbach's alpha coefficient, interclass correlation coefficient and Pearson correlation coefficient, respectively. Cronbach's alpha coefficient for the 4 major domains (function/activity, pain, self-image/appearance and mental health) were high. Intraclass correlation was also excellent for all domains. For concurrent validity, excellent correlation was found in 1 domain, good in 12 domains, moderate in 3 domains, and poor in 1 domain of the 17 relevant domains. Both cultural adaptation and linguistic translation are essential in any attempt to use a HRQL questionnaire across cultures. The Chinese version of the SRS-22 outcome instrument has satisfactory internal consistency and excellent reproducibility. It is ready for use in clinical studies on idiopathic scoliosis in Chinese-speaking societies.
Reliable scar scoring system to assess photographs of burn patients.
Mecott, Gabriel A; Finnerty, Celeste C; Herndon, David N; Al-Mousawi, Ahmed M; Branski, Ludwik K; Hegde, Sachin; Kraft, Robert; Williams, Felicia N; Maldonado, Susana A; Rivero, Haidy G; Rodriguez-Escobar, Noe; Jeschke, Marc G
2015-12-01
Several scar-scoring scales exist to clinically monitor burn scar development and maturation. Although scoring scars through direct clinical examination is ideal, scars must sometimes be scored from photographs. No scar scale currently exists for the latter purpose. We modified a previously described scar scale (Yeong et al., J Burn Care Rehabil 1997) and tested the reliability of this new scale in assessing burn scars from photographs. The new scale consisted of three parameters as follows: scar height, surface appearance, and color mismatch. Each parameter was assigned a score of 1 (best) to 4 (worst), generating a total score of 3-12. Five physicians with burns training scored 120 representative photographs using the original and modified scales. Reliability was analyzed using coefficient of agreement, Cronbach alpha, intraclass correlation coefficient, variance, and coefficient of variance. Analysis of variance was performed using the Kruskal-Wallis test. Color mismatch and scar height scores were validated by analyzing actual height and color differences. The intraclass correlation coefficient, the coefficient of agreement, and Cronbach alpha were higher for the modified scale than those of the original scale. The original scale produced more variance than that in the modified scale. Subanalysis demonstrated that, for all categories, the modified scale had greater correlation and reliability than the original scale. The correlation between color mismatch scores and actual color differences was 0.84 and between scar height scores and actual height was 0.81. The modified scar scale is a simple, reliable, and useful scale for evaluating photographs of burn patients. Copyright © 2015 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Wise, Edward A.; Streiner, David L.; Walfish, Steven
2010-01-01
This article provides a review of the literature to determine the most frequently used personality tests. Based on this review, internal consistency and test-retest reliability coefficients from the test manuals for the Minnesota Multiphasic Personality Inventory-2 (MMPI-2), Millon Clinical Multiaxial Inventory-III (MCMI-III), and Personality…
ERIC Educational Resources Information Center
Martinkova, Patricia; Goldhaber, Dan
2015-01-01
Inter-rater reliability, commonly assessed by intra-class correlation coefficient ICC, is an important index for describing the extent to which there is consistency amongst two or more raters in assigned measures. In organizational research, the data structure is often hierarchical and designs deviate substantially from the ideal of a balanced…
The reliability and validity of fatigue measures during multiple-sprint work: an issue revisited.
Glaister, Mark; Howatson, Glyn; Pattison, John R; McInnes, Gill
2008-09-01
The ability to repeatedly produce a high-power output or sprint speed is a key fitness component of most field and court sports. The aim of this study was to evaluate the validity and reliability of eight different approaches to quantify this parameter in tests of multiple-sprint performance. Ten physically active men completed two trials of each of two multiple-sprint running protocols with contrasting recovery periods. Protocol 1 consisted of 12 x 30-m sprints repeated every 35 seconds; protocol 2 consisted of 12 x 30-m sprints repeated every 65 seconds. All testing was performed in an indoor sports facility, and sprint times were recorded using twin-beam photocells. All but one of the formulae showed good construct validity, as evidenced by similar within-protocol fatigue scores. However, the assumptions on which many of the formulae were based, combined with poor or inconsistent test-retest reliability (coefficient of variation range: 0.8-145.7%; intraclass correlation coefficient range: 0.09-0.75), suggested many problems regarding logical validity. In line with previous research, the results support the percentage decrement calculation as the most valid and reliable method of quantifying fatigue in tests of multiple-sprint performance.
Barzegar-Bafrooei, Ebrahim; Bakhtiary, Jalal; Khatoonabadi, Ahmad Reza; Fatehi, Farzad; Maroufizadeh, Saman; Fathali, Mojtaba
2016-01-01
Background: Dysphagia as a common condition affecting many aspects of the patient’s life. The Dysphagia Handicap Index (DHI) is a reliable self-reported questionnaire developed specifically to measure the impact of dysphagia on the patient’s quality of life. The aim of this study was to translate the questionnaire to Persian and to measure its validity and reliability in patients with neurogenic oropharyngeal dysphagia. Methods: A formal forward-backward translation of DHI was performed based on the guidelines for the cross-cultural adaptation of self-report measures. A total of 57 patients with neurogenic dysphagia who were referred to the neurology clinics of Tehran University of Medical Sciences, Iran, participated in this study. Internal consistency reliability of the DHI was examined using Cronbach’s alpha, and test-retest reliability of the scale was evaluated using intraclass correlation coefficient (ICC). Results: The internal consistency of the Persian DHI (P-DHI) was considered to be good; Cronbach’s alpha coefficient for the total P-DHI was 0.88. The test-retest reliability for the total and three subscales of the P-DHI ranged from 0.95 to 0.98 using ICC. Conclusion: The P-DHI demonstrated a good reliability, and it can be a valid instrument for evaluating the dysphagia effects on quality of life among Persian language population. PMID:27648173
Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling
2017-12-01
Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ 2 /df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, p<0.001). The newly developed DRSES has proven good reliability and validity. It could therefore be used as an assessment tool to evaluate self-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.
Kadar, Masne; Ibrahim, Suhaili; Razaob, Nor Afifi; Chai, Siaw Chui; Harun, Dzalani
2018-02-01
The Lawton Instrumental Activities of Daily Living Scale is a tool often used to assess independence among elderly at home. Its suitability to be used with the elderly population in Malaysia has not been validated. This current study aimed to assess the validity and reliability of the Lawton Instrumental Activities of Daily Living Scale - Malay Version to Malay speaking elderly in Malaysia. This study was divided into three phases: (1) translation and linguistic validity involving both forward and backward translations; (2) establishment of face validity and content validity; and (3) establishment of reliability involving inter-rater, test-retest and internal consistency analyses. Data used for these analyses were obtained by interviewing 65 elderly respondents. Percentages of Content Validity Index for 4 criteria were from 88.89 to 100.0. The Cronbach α coefficient for internal consistency was 0.838. Intra-class Correlation Coefficient of inter-rater reliability and test-retest reliability was 0.957 and 0.950 respectively. The result shows that the Lawton Instrumental Activities of Daily Living Scale - Malay Version has excellent reliability and validity for use with the Malay speaking elderly people in Malaysia. This scale could be used by professionals to assess functional ability of elderly who live independently in community. © 2018 Occupational Therapy Australia.
Reliability and Validity of an Internet-based Questionnaire Measuring Lifetime Physical Activity
De Vera, Mary A.; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek
2010-01-01
Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005–2006. Reliability was examined using a test-retest study. Validity was examined in a 2-part study consisting of 1) comparisons with previously validated instruments measuring similar constructs, the Lifetime Total Physical Activity Questionnaire (LT-PAQ) and the Chasan-Taber Physical Activity Questionnaire (CT-PAQ), and 2) a priori hypothesis tests of constructs measured by the L-PAQ. The L-PAQ demonstrated good reliability, with intraclass correlation coefficients ranging from 0.67 (household activity) to 0.89 (sports/recreation). Comparison between the L-PAQ and the LT-PAQ resulted in Spearman correlation coefficients ranging from 0.41 (total activity) to 0.71 (household activity); comparison between the L-PAQ and the CT-PAQ yielded coefficients of 0.58 (sports/recreation), 0.56 (household activity), and 0.50 (total activity). L-PAQ validity was further supported by observed relations between the L-PAQ and sociodemographic variables, consistent with a priori hypotheses. Overall, the L-PAQ is a useful instrument for assessing multiple domains of lifetime physical activity with acceptable reliability and validity. PMID:20876666
Cañón-Montañez, Wilson; Oróstegui-Arenas, Myriam
2015-01-01
To determine the reliability (internal consistency, inter-rater reproducibility and level of agreement) of nursing outcome: "Knowledge: cardiac disease management (1830)" of the version published in Spanish, in outpatients with heart failure. A reliability study was conducted on 116 outpatients with heart failure. Six indicators of nursing outcome were operationalized. All participants were assessed simultaneously by two evaluators. Three evaluation periods were defined: initial (at baseline), final (a month later), and follow-up (two months later). Internal consistency by Cronbach alpha coefficient, inter-rater reproducibility with intraclass correlation coefficient of reproducibility or agreement and level agreement using the 95% limits of Bland and Altman. Cronbach's alpha was 0.83 (95% CI: 0.77 - 0.89) in the final evaluation, and follow-up values of 0.85 (95% CI: 0.82-0.89) and 0.83 (95% CI: 0.78 - 0.88) were found for the first and second evaluator, respectively. The intraclass correlation coefficient showed values greater 0.9 in the three evaluation periods in both the random and mixed model. The Bland-Altman 95% limits of agreement were close to zero in the three evaluations performed. The questionnaire operationalized to assess the nursing outcome: "Knowledge: cardiac disease management (1830)" in its Spanish version, is a reliable method to measure skills and knowledge in outpatients with heart failure in the Colombian context. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Reliability and validity of an internet-based questionnaire measuring lifetime physical activity.
De Vera, Mary A; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek
2010-11-15
Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005-2006. Reliability was examined using a test-retest study. Validity was examined in a 2-part study consisting of 1) comparisons with previously validated instruments measuring similar constructs, the Lifetime Total Physical Activity Questionnaire (LT-PAQ) and the Chasan-Taber Physical Activity Questionnaire (CT-PAQ), and 2) a priori hypothesis tests of constructs measured by the L-PAQ. The L-PAQ demonstrated good reliability, with intraclass correlation coefficients ranging from 0.67 (household activity) to 0.89 (sports/recreation). Comparison between the L-PAQ and the LT-PAQ resulted in Spearman correlation coefficients ranging from 0.41 (total activity) to 0.71 (household activity); comparison between the L-PAQ and the CT-PAQ yielded coefficients of 0.58 (sports/recreation), 0.56 (household activity), and 0.50 (total activity). L-PAQ validity was further supported by observed relations between the L-PAQ and sociodemographic variables, consistent with a priori hypotheses. Overall, the L-PAQ is a useful instrument for assessing multiple domains of lifetime physical activity with acceptable reliability and validity.
2010-01-01
Background As a result of scientific and medical professionals gaining interest in Stress and Health Related Quality of Life (HRQL), the aim of our research is, thus, to validate into Spanish the German questionnaire Bad Sobernheim Stress Questionnaire (BSSQ) (mit Korsett), for adolescents wearing braces. Methods The methodology used adheres to literature on trans-cultural adaptation by doing a translation and a back translation; it involved 35 adolescents, ages ranging between 10 and 16, with Adolescent Idiopathic Scoliosis (AIS) and wearing the same kind of brace (Rigo System Chêneau Brace). The materials used were a socio-demographics data questionnaire, the SRS-22 and the Spanish version of BSSQ(brace).es. The statistical analysis calculated the reliability (test-retest reliability and internal consistency) and the validity (convergent and construct validity) of the BSSQ (brace).es. Results BSSQ(brace).es is reliable because of its satisfactory internal consistency (Cronbach's alpha coefficient was 0.809, p < 0.001) and temporal stability (test-retest method with a Pearson correlation coefficient of 0.902 (p < 0.01)). It demonstrated convergent validity with SRS-22 since the Pearson correlation coefficient was 0.656 (p < 0.01). By undertaking an Exploratory Principal Components Analysis, a latent structure was found based on two Components which explicate the variance at 60.8%. Conclusions BSSQ (brace).es is reliable and valid and can be used with Spanish adolescents to assess the stress level caused by the brace. PMID:20633253
Hallegraeff, Joannes M; van der Schans, Cees P; Krijnen, Wim P; de Greef, Mathieu H G
2013-02-01
The eight-item Brief Illness Perception Questionnaire is used as a screening instrument in physical therapy to assess mental defeat in patients with acute low back pain, besides patient perception might determine the course and risk for chronic low back pain. However, the psychometric properties of the Brief Illness Perception Questionnaire in common musculoskeletal disorders like acute low back pain have not been adequately studied. Patients' perceptions vary across different populations and affect coping styles. Thus, our aim was to determine the internal consistency, test-retest reliability and validity of the Dutch language version of the Brief Illness Perception Questionnaire in acute non-specific low back pain patients in primary care physical therapy. A non-experimental cross-sectional study with two measurements was performed. Eighty-four acute low back pain patients, in multidisciplinary health care center in Dutch primary care with a sample mean (SD) age of 42 (12) years, participated in the study. Internal consistency (Cronbach's α) and test-retest procedures (Intraclass Correlation Coefficients and limits of agreement) were evaluated at a one-week interval. The concurrent validity of the Brief Illness Perception Questionnaire was examined by using the Mental Health Component of the Short Form 36 Health Survey. The Cronbach's α for internal consistency was 0.73 (95% CI, 0.67 - 0.83); and the Intraclass Correlation Coefficient test-retest reliability was acceptable: 0.72 (95% CI, 0.53 - 0.82), however, the limits of agreement were large. The Intraclass Correlation Coefficient measuring concurrent validity 0.65 (95% CI, 0.46 - 0.80). The Dutch version of the Brief Illness Perception Questionnaire is an appropriate instrument for measuring patients' perceptions in acute low back pain patients, showing acceptable internal consistency and reliability. Concurrent validity is adequate, however, the instrument may be unsuitable for detecting changes in low back pain perception over time.
The Validation of a Case-Based, Cumulative Assessment and Progressions Examination
Coker, Adeola O.; Copeland, Jeffrey T.; Gottlieb, Helmut B.; Horlen, Cheryl; Smith, Helen E.; Urteaga, Elizabeth M.; Ramsinghani, Sushma; Zertuche, Alejandra; Maize, David
2016-01-01
Objective. To assess content and criterion validity, as well as reliability of an internally developed, case-based, cumulative, high-stakes third-year Annual Student Assessment and Progression Examination (P3 ASAP Exam). Methods. Content validity was assessed through the writing-reviewing process. Criterion validity was assessed by comparing student scores on the P3 ASAP Exam with the nationally validated Pharmacy Curriculum Outcomes Assessment (PCOA). Reliability was assessed with psychometric analysis comparing student performance over four years. Results. The P3 ASAP Exam showed content validity through representation of didactic courses and professional outcomes. Similar scores on the P3 ASAP Exam and PCOA with Pearson correlation coefficient established criterion validity. Consistent student performance using Kuder-Richardson coefficient (KR-20) since 2012 reflected reliability of the examination. Conclusion. Pharmacy schools can implement internally developed, high-stakes, cumulative progression examinations that are valid and reliable using a robust writing-reviewing process and psychometric analyses. PMID:26941435
Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Boyle, Christopher; Yaghoobzadeh, Ameneh; Tahmasbi, Bahram; Rassool, G Hussein; Taebei, Mozhgan; Soleimani, Mohammad Ali
2018-04-01
This study aimed to determine the factor structure of the spiritual well-being among a sample of the Iranian veterans. In this methodological research, 211 male veterans of Iran-Iraq warfare completed the Paloutzian and Ellison spiritual well-being scale. Maximum likelihood (ML) with oblique rotation was used to assess domain structure of the spiritual well-being. The construct validity of the scale was assessed using confirmatory factor analysis (CFA), convergent validity, and discriminant validity. Reliability was evaluated with Cronbach's alpha, Theta (θ), and McDonald Omega (Ω) coefficients, intra-class correlation coefficient (ICC), and construct reliability (CR). Results of ML and CFA suggested three factors which were labeled "relationship with God," "belief in fate and destiny," and "life optimism." The ICC, coefficients of the internal consistency, and CR were >.7 for the factors of the scale. Convergent validity and discriminant validity did not fulfill the requirements. The Persian version of spiritual well-being scale demonstrated suitable validity and reliability among the veterans of Iran-Iraq warfare.
Sexual behaviors among club drug users: prevalence and reliability
Shacham, Enbal; Cottler, Linda B.
2013-01-01
HIV prevention efforts require a focus on reducing high risk sexual behavior. Because these are self-reported, assessments that reduce memory bias and improve elicitation of data are needed. As part of a multi-site psychometric study of club drug use, abuse, and dependence, data were collected with a test-retest design that measured the reliability of the Washington University Risk Behavior Assessment for Club Drugs (WU-RBA-CD). Reliability was assessed separately by sex via kappa coefficients and intraclass correlation coefficients (ICC); z tests compared coefficients by sex. A total of 603 participants were interviewed by independent assessors with 5 days in between interviews. Reliability for all 51 items of the sexual activity section of the WU-RBA-CD ranged from .23 to 1.00; 71% (n = 36) of items resulted in moderate to high reliability (.55–1.00). Number of lifetime sex partners was consistently reported for same-sex partners for both men and women and opposite-sex partners. Items with high reliability included reporting ever being under the influence of ecstasy (.87) or GHB (.87) while having sex. Items with lower reliability included those that queried the determinants of condom use (.45–.82) and about behaviors and attitudes experienced while using drugs (.23–.87). Very few sex differences were revealed in the reliability of reported sexual activities. Overall, the WU-RBA-CD performed with fairly high reliability rates. Assessing situations of when, how, and why individuals use condoms may offer the clearest evaluation of determinants of sexual behaviors, yet those items are not as reliable. PMID:19757011
Shou, Juan; Ren, Limin; Wang, Haitang; Yan, Fei; Cao, Xiaoyun; Wang, Hui; Wang, Zhiliang; Zhu, Shanzhu; Liu, Yao
2016-04-01
The 12-item Short-Form Health Survey (SF-12) is the abridged practical version of SF-36. This cross-sectional study was aimed to assess the reliability and validity of SF-12 for the health status of Chinese community elderly population. The Chinese community elderly people in Xujiahui district of Shanghai were investigated. The internal consistency reliability was assessed using Cronbach's alpha and split-half reliability coefficients. Construct validity was analyzed using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Spearman's correlation coefficient (ρ) was used for the evaluation of criterion, convergent, and discriminant validity with Spearman's ρ ≥ 0.4 as satisfactory. Comparisons of the SF-12 summary scores among populations that differed in demographics were performed for discriminant validity. Total 1343 individuals aged ≥60 and <85 years old (response rate: 91.3 %) were analyzed. The Cronbach's α value (0.910) and the split-half reliability coefficient (0.812) reflected satisfactory internal consistency reliability of SF-12. EFA extracted a two-factor model (physical and mental health). About 60.7 % of the total variance was explained by the two factors. CFA showed that the two-factor solution provided a good fit to the data. Good convergent validity and discriminant validity of SF-12 were proved by the correction analyses (Spearman's ρ > 0.4) and the comparisons of the SF-12 summary scores among populations (P < 0.05). SF-12 summary scores were significantly correlated with the SF-36 summary scores (Spearman's ρ > 0.4, P < 0.05). In conclusion, SF-12 had satisfactory reliability and validity in measuring health status of Chinese community elderly population in Xujiahui district of Shanghai.
Coefficient alpha and interculture test selection.
Thurber, Steven; Kishi, Yasuhiro
2014-04-01
The internal consistency reliability of a measure can be a focal point in an evaluation of the potential adequacy of an instrument for adaptation to another cultural setting. Cronbach's alpha (α) coefficient is often used as the statistical index for such a determination. However, alpha presumes a tau-equivalent test and may constitute an inaccurate population estimate for multidimensional tests. These notions are expanded and examined with a Japanese version of a questionnaire on nursing attitudes toward suicidal patients, originally constructed in Sweden using the English language. The English measure was reported to have acceptable internal consistency (α) albeit the dimensionality of the questionnaire was not addressed. The Japanese scale was found to lack tau-equivalence. An alternative to alpha, "composite reliability," was computed and found to be below acceptable standards in magnitude and precision. Implications for research application of the Japanese instrument are discussed. © The Author(s) 2012.
Perspectives in adolescent risk-taking through instrument development.
Busen, N H; Kouzekanani, K
2000-01-01
Understanding the high-risk adolescent's perception of risk taking is essential for health professionals to determine appropriate interventions. The purpose of this study was to examine the psychometric properties of the revised Adolescent Risk-Taking Instrument (ARTI) designed to measure the high-risk adolescent's perception of risk taking. This study also examined the variables that are most predictive of social adaptation and risk taking. An ex post facto design was used to standardize data collection and to assess the psychometric properties of the revised ARTI. The nonprobability sample consisted of 167 adolescents attending school in an urban, health-underserved area. Exploratory factor analysis supported construct validity, and Chronbach's Coefficient Alpha supported internal consistency reliability. The reliability coefficient for the risk taking and social adaptation constructs were .80 and .77, respectively. Current perspectives on adolescent risk taking and implications for the use of the ARTI in clinical practice are addressed.
Mahdavi, Mohammad Ebrahim; Pourbakht, Akram; Parand, Akram; Jalaie, Shohreh
2018-03-01
Evaluation of dichotic listening to digits is a common part of many studies for diagnosis and managing auditory processing disorders in children. Previous researchers have verified test-retest relative reliability of dichotic digits results in normal children and adults. However, detecting intervention-related changes in the ear scores after dichotic listening training requires information regarding trial-to-trial typical variation of individual ear scores that is estimated using indices of absolute reliability. Previous studies have not addressed absolute reliability of dichotic listening results. To compare the results of the Persian randomized dichotic digits test (PRDDT) and its relative and absolute indices of reliability between typical achieving (TA) and learning-disabled (LD) children. A repeated measures observational study. Fifteen LD children were recruited from a previously performed study with age range of 7-12 yr. The control group consisted of 15 TA schoolchildren with age range of 8-11 yr. The Persian randomized dichotic digits test was administered on the children under free recall condition in two test sessions 7-12 days apart. We compared the average of the ear scores and ear advantage between TA and LD children. Relative indices of reliability included Pearson's correlation and intraclass correlation (ICC 2,1 ) coefficients and absolute reliability was evaluated by calculation of standard error of measurement (SEM) and minimal detectable change (MDC) using the raw ear scores. The Pearson correlation coefficient indicated that in both groups of children the ear scores of test and retest sessions were strongly and positively (greater than +0.8) correlated. The ear scores showed excellent ICC coefficient of consistency (0.78-0.82) and fair to excellent ICC coefficient of absolute agreement (0.62-0.74) in TA children and excellent ICC coefficients of consistency and absolute agreement in LD children (0.76-0.87). SEM and SEM% of the ear scores in TA children were 1.46 and 1.44% for the right ear and 4.68 and 5.47% for the left ear. SEM and SEM% of the ear scores in LD children were 4.55 and 5.88% for the right ear to 7.56 and 12.81% for the left ear. MDC and MDC% of the ear scores in TA children varied from 4.03 and 3.99% for the right ear to 12.93 and 15.13% for the left ear. MDC and MDC% of the ear scores in LD children varied from 12.57 and 16.25% for the right ear to 20.89 and 35.39% for the left ear. The LD children indicated test-retest relative reliability as high as TA children in the ear scores measured by PRDDT. However, within-subject variations of the ear scores calculated by indices of absolute reliability were considerably higher in LD children versus TA children. The results of the current study could have implications for detecting real training-related changes in the ear scores. American Academy of Audiology
Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H
2017-08-15
Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error (<6%). Most of the outcome measures reported high internal consistency (α > .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
Pisconti, Fernando; Mahmoud Smaili Santos, Suhaila; Lopes, Josiane; Rosa Cardoso, Jefferson; Lopes Lavado, Edson
2017-11-29
The Exercise Self-Efficacy scale (ESES) is a reliable measure, in the English language, of exercise self-efficacy in individuals with spinal cord injury. The aim of this study was to culturally adjust and validate the Exercise Self-Efficacy scale in the Portuguese language. The Exercise Self-Efficacy scale was applied to 76 subjects, with three-month intervals (three applications in total). The reliability was appraised using the intra-class correlation coefficient and Bland-Altman methods, and the internal consistency was evaluated using Cronbach´s alpha. The Exercise Self-Efficacy scale was correlated with the domains of the Quality of life Questionnaire SF-36 and Functional Independence Measure and tested using the Spearman rho coefficient. The Exercise Self-Efficacy scale-Brazil presented good internal consistency (alpha 1 = 0.856; alpha 2 = 0.855; alpha 3 = 0.822) and high reliability in the test-retest (intra-class correlation coefficient = 0.97). There was a strong correlation between the Exercise Self-Efficacy scale-Brazil and the SF-36 only in the functional capacity domain (rho = 0.708). There were no changes in Exercise Self-Efficacy scale-Brazil scores between the three applications (p = 0.796). The validation of the Exercise Self-Efficacy scale questionnaire permits the assessor to use it reliably in Portuguese speaking countries, since it is the first instrument measuring self-efficacy specifically during exercises in individuals with spinal cord injury. Furthermore, the questionnaire can be used as an instrument to verify the effectiveness of interventions that use exercise as an outcome. The results of the Brazilian version of the Exercise Self-Efficacy scale support its use as a reliable and valid measurement of exercise self-efficacy for this population.
Test-retest reliability of the Military Pre-training Questionnaire.
Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D
2010-09-01
Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
Brosseau, Lucie; Laroche, Chantal; Guitard, Paulette; King, Judy; Poitras, Stéphane; Casimiro, Lynn; Barette, Julie Alexandra; Cardinal, Dominique; Cavallo, Sabrina; Laferrière, Lucie; Martini, Rose; Champoux, Nicholas; Taverne, Jennifer; Paquette, Chanyque; Tremblay, Sébastien; Sutton, Ann; Galipeau, Roseline; Tourigny, Jocelyne; Toupin-April, Karine; Loew, Laurianne; Demers, Catrine; Sauvé-Schenk, Katrine; Paquet, Nicole; Savard, Jacinthe; Lagacé, Josée; Pharand, Denyse; Vaillancourt, Véronique
2017-01-01
Objectives: The primary objective was to produce a French-Canadian translation of AMSTAR (a measurement tool to assess systematic reviews) and to examine the validity of the translation's contents. The secondary and tertiary objectives were to assess the inter-rater reliability and factorial construct validity of this French-Canadian version of AMSTAR. Methods: A modified approach to Vallerand's methodology (1989) for cross-cultural validation was used. 1 First, a parallel back-translation of AMSTAR 2 was performed, by both professionals and future professionals. Next, a first committee of experts (P1) examined the translations to create a first draft of the French-Canadian version of the AMSTAR tool. This draft was then evaluated and modified by a second committee of experts (P2). Following that, 18 future professionals (master's students in physiotherapy) rated this second draft of the instrument for clarity using a seven-point scale (1: very clear; 7: very ambiguous). Lastly, the principal co-investigators then reviewed the problematic elements and proposed final changes. Four independent raters used this French-Canadian version of AMSTAR to assess 20 systematic reviews that were published in French after the year 2000. An intraclass correlation coefficient (ICC) and kappa coefficient were calculated to measure the tool's inter-rater reliability. A Cronbach's alpha coefficient was also calculated to measure internal consistency. In addition, factor analysis was used to evaluate construct validity in order to determine the number of dimensions. Results: The statements on the final version of the AMSTAR tool received an average ambiguity rating of between 1.0 and 1.4. No statement received an average rating below 1.4, which indicates a high level of clarity. Inter-rater reliability ( n =4) for the instrument's total score was moderate, with an intraclass correlation coefficient of 0.61 (95% confidence interval [CI]: 0.29, 0.97). Inter-rater reliability for 82% of the individual items was good, according to the kappa values obtained. Internal consistency was excellent, with a Cronbach's alpha coefficient of 0.91 (95% CI: 0.83, 0.99). The French-Canadian version of AMSTAR is a unidimensional tool, as confirmed by factor analysis and community values greater than 0.30. Conclusion: A valid French-Canadian version of AMSTAR was created using this rigorous five-step process. This version is unidimensional, with moderate inter-rater reliability for the elements overall, and with excellent internal consistency. This tool could be valuable to French-Canadian professionals and researchers, and could also be of interest to the international Francophone community.
ERIC Educational Resources Information Center
Aypay, Ayse
2011-01-01
The primary purpose of this study was to adapt the Teaching-learning Approaches Questionnaire. The working group of the study consisted of 341 student-teachers. The results indicated that the factor structure is partially consistent with the model. Cronbach reliability coefficient for the whole instrument was 0.71, while sub-scale reliabilities…
Fung, Christina Hoi Ling; Nguyen, Michelle; Moineddin, Rahim; Colantonio, Angela; Wiseman-Hakes, Catherine
2014-06-01
The Daily Cognitive Communicative and Sleep Profile (DCCASP) is a seven-item instrument that captures daily subjective sleep quality, perceived mood, cognitive, and communication functions. The objective of this study was to evaluate the reliability and validity of the DCCASP. The DCCASP was self-administered daily to a convenience sample of young adults (n = 54) for two two-week blocks, interspersed with a two-week rest period. Afterwards, participants completed the Pittsburgh Sleep Quality Index (PSQI). Internal consistency and criterion validity were calculated by Cronbach's α coefficient, Concordance Correlation Coefficient (CCC), and Spearman rank (rs) correlation coefficient, respectively. Results indicated high internal consistency (Cronbach-s α = 0.864-0.938) among mean ratings of sleep quality on the DCCASP. There were significant correlations between mean ratings of sleep quality and all domains (rs=0.38-0.55, p<0.0001). Criterion validity was established between mean sleep quality ratings on the DCCASP and PSQI (rs=0.40, p<0.001). The DCCASP is a reliable and valid self-report instrument to monitor daily sleep quality and perceived mood, cognitive, and communication functions over time, amongst a normative sample of young adults. Further studies on its psychometric properties are necessary to clarify its utility in a clinical population. Copyright © 2014 John Wiley & Sons, Ltd.
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire.
Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos
2006-03-04
Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test - retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81-0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test - retest reliability) was found 0.66 (95% CI: 0.62-0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36-0.89). This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting.
Mousavian, Alireza; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Omidi-Kashani, Farzad; Kachooei, Amir Reza
2015-12-01
In this study, we aimed to translate and test the validity and reliablity of the Persian version of the Manchester-Oxford Foot Questionnaire in foot and ankle patients. We translated the Manchester-Oxford Foot Questionnaire to Persian language according to the accepted guidelines, then assessed the psychometric properties including the validity and reliability on 308 patients with long-standing foot and ankle problems. To test the reliability, we calculated the intra-class correlation coefficient (ICC) for test-retest reliability and measured Cronbach's alpha to test the internal consistency. To test the construct validity of the Manchester-Oxford Foot Questionnaire we also administered the Short-Form 36 to patients. Construct validity was supported by significant correlation with SF36 subscales except for pain subscale of the persian MOXFQ with mental health of the SF36 (r=0.207). Intraclass correlation coefficient was 0.79 for the total MOXFQ and ranged from 0.83 to 0.89 for the three subscales. Cronbach's alpha for pain, walking/standing, and social interaction was 0.86, 0.88, and 0.89, respectively, and was 0.79 for the total MOXFQ showing good internal consistency in each domain. The Persian Manchester-Oxford Foot Questionnaire health scoring system is a valid and reliable patient-reported instrument for foot and ankle problems. Copyright © 2015. Published by Elsevier Ltd.
Martínez-Gómez, David; Martínez-de-Haro, Vicente; Pozo, Tamara; Welk, Gregory J; Villagra, Ariel; Calle, Marisa E; Marcos, Ascensión; Veiga, Oscar L
2009-01-01
Questionnaires are feasible instruments to assess physical activity (PA) in large samples. The aim of the current study was to evaluate the reliability and validity of the PAQ-A questionnaire in Spanish adolescents using the measurement of PA by accelerometer as criterion. In a sample of 82 adolescents, aged 12 to 17 years, 1-week PAQ-A test-retest was administered. Reliability was analyzed by the Intraclass Correlation Coefficient (ICC) and the internal consistency by the Cronbach's alpha Coefficient. Two hundred thirty-two adolescents, aged 13-17 years, completed the PAQ-A and wore the ActiGraph GT1M accelerometer during 7-days. The PAQ-A was compared against total PA and moderate to vigorous PA (MVPA) obtained by the accelerometer. Test-retest reliability showed ICC = 0.71 for the final score of PAQ-A. Internal consistency was alpha = 0.65 in the first self-report, alpha = 0.67 in the retest in 82 adolescents sample, and alpha = 0.74 in the 232 adolescents sample. The PAQ-A was moderately correlated with total PA (rho = 0.39) and MVPA (rho= 0.34) assessed by the accelerometer. The PAQ-A obtained significantly moderate correlations in boys but not in girls against the accelerometer. The PAQ-A questionnaire shows an adequate reliability and a reasonable validity for assessing PA in Spanish adolescents.
Kevern, Mark A.; Beecher, Michael; Rao, Smita
2014-01-01
Context: Athletes who participate in throwing and racket sports consistently demonstrate adaptive changes in glenohumeral-joint internal and external rotation in the dominant arm. Measurements of these motions have demonstrated excellent intrarater and poor interrater reliability. Objective: To determine intrarater reliability, interrater reliability, and standard error of measurement for shoulder internal rotation, external rotation, and total arc of motion using an inclinometer in 3 testing procedures in National Collegiate Athletic Association Division I baseball and softball athletes. Design: Cross-sectional study. Setting: Athletic department. Patients or Other Participants Thirty-eight players participated in the study. Shoulder internal rotation, external rotation, and total arc of motion were measured by 2 investigators in 3 test positions. The standard supine position was compared with a side-lying test position, as well as a supine test position without examiner overpressure. Results: Excellent intrarater reliability was noted for all 3 test positions and ranges of motion, with intraclass correlation coefficient values ranging from 0.93 to 0.99. Results for interrater reliability were less favorable. Reliability for internal rotation was highest in the side-lying position (0.68) and reliability for external rotation and total arc was highest in the supine-without-overpressure position (0.774 and 0.713, respectively). The supine-with-overpressure position yielded the lowest interrater reliability results in all positions. The side-lying position had the most consistent results, with very little variation among intraclass correlation coefficient values for the various test positions. Conclusions: The results of our study clearly indicate that the side-lying test procedure is of equal or greater value than the traditional supine-with-overpressure method. PMID:25188316
Kim, J K; Lim, H M
2015-02-01
The purpose of this study was to translate and culturally adapt the Carpal Tunnel Questionnaire to produce an equivalent Korean version. A total of 53 patients completed the Korean version of the Carpal Tunnel Questionnaire pre-operatively and 3 months after open carpal tunnel release. All 53 also completed the Korean version of the Disabilities of Arm, Shoulder, and Hand questionnaire pre-operatively and 3 months post-operatively. Reliability was measured by determining the test-retest reliability and internal consistency. Test-retest reliability was assessed using intraclass correlation coefficients and paired t-tests, and internal consistency using Cronbach's alpha coefficients. Pearson correlation analysis was carried out on the Korean version of the Carpal Tunnel Questionnaire scores and the Korean version of the Disabilities of Arm, Shoulder, and Hand scores to assess construct validity. Responsiveness was evaluated using effect sizes and standardized response means. The reliability of the Korean version of the Carpal Tunnel Questionnaire was good. The scores in the Korean version of the Disabilities of Arm, Shoulder, and Hand strongly correlated with the scores in the Korean version of the Carpal Tunnel Questionnaire. Standardized response mean and effect size were both large for the Korean version of the Carpal Tunnel Questionnaire. The study shows that the Korean version of the Carpal Tunnel Questionnaire is a reliable, valid and responsive instrument for measuring outcomes in carpal tunnel syndrome. © The Author(s) 2014.
Martignon, Stefania; Bautista-Mendoza, Gloria; González-Carrera, María; Lafaurie-Villamil, Gloria; Morales, Veicy; Santamaría, Ruth
2008-01-01
Designing three instruments for evaluating oral health knowledge, attitudes and practice in parents/caregivers of low social-economic status 0-5 year-olds. Evaluating the instruments' reliability in terms of internal consistency and analysing items. Three instruments were constructed for evaluating low social-economic status 0-5 year-olds' parents/caregivers' oral health knowledge, attitudes and practice in the municipality of Usaquén , Bogotá , Colombia . 47 parents/caregivers were given a test establishing the instrument's reliability in terms of internal consistency and the adults' level of knowledge, attitudes and practice. A sub-sample was qualitatively analysed (content verification and understanding). Reliability was evaluated using Cronbach's alpha coefficient. Items were analysed for improving constructing and understanding the questions, taking four criteria into account: corrected homogeneity index (CHI), response trend, correlation between items and qualitative analysis. Cronbach's alpha coefficient for knowledge, attitudes and practice was 0,82, 0,80 and 0,62, respectively. Participants' level of knowledge, attitudes and practice was acceptable (60 %, 55 % and 91 %, respectively). This study found two out of the three evaluated instruments to be reliable (knowledge and attitudes); all three of them were then redesigned. The resulting instruments represent a valuable tool which can be used in future studies for describing and evaluating preventative programmes.
Reliability of an interactive computer program for advance care planning.
Schubart, Jane R; Levi, Benjamin H; Camacho, Fabian; Whitehead, Megan; Farace, Elana; Green, Michael J
2012-06-01
Despite widespread efforts to promote advance directives (ADs), completion rates remain low. Making Your Wishes Known: Planning Your Medical Future (MYWK) is an interactive computer program that guides individuals through the process of advance care planning, explaining health conditions and interventions that commonly involve life or death decisions, helps them articulate their values/goals, and translates users' preferences into a detailed AD document. The purpose of this study was to demonstrate that (in the absence of major life changes) the AD generated by MYWK reliably reflects an individual's values/preferences. English speakers ≥30 years old completed MYWK twice, 4 to 6 weeks apart. Reliability indices were assessed for three AD components: General Wishes; Specific Wishes for treatment; and Quality-of-Life values (QoL). Twenty-four participants completed the study. Both the Specific Wishes and QoL scales had high internal consistency in both time periods (Knuder Richardson formula 20 [KR-20]=0.83-0.95, and 0.86-0.89). Test-retest reliability was perfect for General Wishes (κ=1), high for QoL (Pearson's correlation coefficient=0.83), but lower for Specific Wishes (Pearson's correlation coefficient=0.57). MYWK generates an AD where General Wishes and QoL (but not Specific Wishes) statements remain consistent over time.
Reliability of an Interactive Computer Program for Advance Care Planning
Levi, Benjamin H.; Camacho, Fabian; Whitehead, Megan; Farace, Elana; Green, Michael J
2012-01-01
Abstract Despite widespread efforts to promote advance directives (ADs), completion rates remain low. Making Your Wishes Known: Planning Your Medical Future (MYWK) is an interactive computer program that guides individuals through the process of advance care planning, explaining health conditions and interventions that commonly involve life or death decisions, helps them articulate their values/goals, and translates users' preferences into a detailed AD document. The purpose of this study was to demonstrate that (in the absence of major life changes) the AD generated by MYWK reliably reflects an individual's values/preferences. English speakers ≥30 years old completed MYWK twice, 4 to 6 weeks apart. Reliability indices were assessed for three AD components: General Wishes; Specific Wishes for treatment; and Quality-of-Life values (QoL). Twenty-four participants completed the study. Both the Specific Wishes and QoL scales had high internal consistency in both time periods (Knuder Richardson formula 20 [KR-20]=0.83–0.95, and 0.86–0.89). Test-retest reliability was perfect for General Wishes (κ=1), high for QoL (Pearson's correlation coefficient=0.83), but lower for Specific Wishes (Pearson's correlation coefficient=0.57). MYWK generates an AD where General Wishes and QoL (but not Specific Wishes) statements remain consistent over time. PMID:22512830
Alsous, Mervat; Alhalaiqa, Fadwa; Abu Farha, Rana; Abdel Jalil, Mariam; McElnay, James; Horne, Robert
2017-01-01
Objectives to evaluate the reliability and discriminant validity of Arabic translation of the Medication Adherence Report Scale (MARS) and the Beliefs about Medication Questionnaire-specific (BMQ-specific). Methods Having developed Arabic translations of the study instruments, a cross-sectional study was carried out between March and October 2015 in two multidisciplinary governmental hospitals in Jordan. An expert panel monitored the forward and backward translation of the MARS and BMQ. Standard Arabic was used (with no specific dialect inclusion) to allow greater generalisability across Arabic speaking countries. Once the Arabic translations of the questionnaires were developed they were tested for consistency, validity and reliability on a group of children with chronic diseases and their parents. Results A total of 258 parents and 208 children were included in the study. The median age of participated children and parents was 15 years and 42 years respectively. Principle component analysis of all questionnaires indicated that all had good construct validity as they clearly measured one construct. The questionnaires were deemed reliable based on the results of Cronbach alpha coefficient. Furthermore, reliability of the questionnaires was demonstrated by test-retest intraclass correlation coefficients (ICC) which ranged from good to excellent for all scales (ICC>0.706). The Pearson correlation coefficient ranged from 0.546–0.805 for the entire sample which indicated a significant moderate to strong positive correlation between MARS and BMQ items at time 1 and 2. Reported adherence was greater than 59% using MARS-children and MARS-parents scales, and was correlated with beliefs in necessity and independent of the concerns regarding medications. Conclusion The Arabic translations of both BMQ and MARS for use in children and their parents have good internal consistency and proved to be valid and reliable tools that can be used by researchers in clinical practice to measure adherence and beliefs about medications in Arabic speaking patient populations. PMID:28192467
Alsous, Mervat; Alhalaiqa, Fadwa; Abu Farha, Rana; Abdel Jalil, Mariam; McElnay, James; Horne, Robert
2017-01-01
to evaluate the reliability and discriminant validity of Arabic translation of the Medication Adherence Report Scale (MARS) and the Beliefs about Medication Questionnaire-specific (BMQ-specific). Having developed Arabic translations of the study instruments, a cross-sectional study was carried out between March and October 2015 in two multidisciplinary governmental hospitals in Jordan. An expert panel monitored the forward and backward translation of the MARS and BMQ. Standard Arabic was used (with no specific dialect inclusion) to allow greater generalisability across Arabic speaking countries. Once the Arabic translations of the questionnaires were developed they were tested for consistency, validity and reliability on a group of children with chronic diseases and their parents. A total of 258 parents and 208 children were included in the study. The median age of participated children and parents was 15 years and 42 years respectively. Principle component analysis of all questionnaires indicated that all had good construct validity as they clearly measured one construct. The questionnaires were deemed reliable based on the results of Cronbach alpha coefficient. Furthermore, reliability of the questionnaires was demonstrated by test-retest intraclass correlation coefficients (ICC) which ranged from good to excellent for all scales (ICC>0.706). The Pearson correlation coefficient ranged from 0.546-0.805 for the entire sample which indicated a significant moderate to strong positive correlation between MARS and BMQ items at time 1 and 2. Reported adherence was greater than 59% using MARS-children and MARS-parents scales, and was correlated with beliefs in necessity and independent of the concerns regarding medications. The Arabic translations of both BMQ and MARS for use in children and their parents have good internal consistency and proved to be valid and reliable tools that can be used by researchers in clinical practice to measure adherence and beliefs about medications in Arabic speaking patient populations.
Hellweg, Stephanie; Schuster-Amft, Corina
2016-07-19
Agitation is frequently observed during early recovery after traumatic brain injury (TBI). Agitated behaviour often interferes with a goal-orientated rehabilitation and can be a substantial hindrance to therapy. Despite the relatively high occurance of agitation in TBI population there is no objective assessement in German (G) available. An existing scale with excellent psychometric properties is the "Agitated Behavior Scale (ABS)" developed by Corrigan in 1989. The aim of the study was to translate the Agitated Behavior Scale (ABS) into German (ABS-G) and investigate the inter- and intrarater reliability and internal consistency in patients with moderate to severe TBI. A formal nine-step translation and cross-cultural adaptation procedure (TCCA) was applied. Subsequently a prospective observational patient study was conducted. To examine the interrater reliability and internal consistency, two therapists rated 20 patients independently after a therapy session. This procedure was repeated twice on a weekly basis. The intrarater reliability was assessed through video recordings from three patients. Nine raters scored the demonstrated behaviour on the videotape with the ABS-G independently twice within one month. The inter- and intrarater reliability were evaluated with the Spearman rank correlation coefficient and the quadratic weighted kappa. The internal consistency was tested with Cronbach's alpha. Behaviour of 20 patients (18 males; mean age 41 ± 20.7; mean Functional Independence Measure (FIM) cognitive score on admission 7.1 ± 4.04; mean ABS-G score at first observation 17.3 ± 2.83) was assessed threefold. Interrater reliability yielded a correlation coefficient for ABS-G total score of all 60 paired observations of r s 0.845 and a weighted Kappa of 0.738. Intrarater reliability for ABS-G total score ranged between r s 0.719 and 0.953 and showed a weighted Kappa between 0.871 and 0.953. Cronbach's alpha indicated moderate internal consistency with 0.661. This study demonstrates that the ABS-G is a reliable instrument for evaluating agitation in patients with moderate to severe TBI. Hereby it would be possible to monitor agitation objectively and optimise the management of agitated patients according to international recommendations.
Estimation of Reliability Coefficients Using the Test Information Function and Its Modifications.
ERIC Educational Resources Information Center
Samejima, Fumiko
1994-01-01
The reliability coefficient is predicted from the test information function (TIF) or two modified TIF formulas and a specific trait distribution. Examples illustrate the variability of the reliability coefficient across different trait distributions, and results are compared with empirical reliability coefficients. (SLD)
Interobserver Reliability of the Total Body Score System for Quantifying Human Decomposition.
Dabbs, Gretchen R; Connor, Melissa; Bytheway, Joan A
2016-03-01
Several authors have tested the accuracy of the Total Body Score (TBS) method for quantifying decomposition, but none have examined the reliability of the method as a scoring system by testing interobserver error rates. Sixteen participants used the TBS system to score 59 observation packets including photographs and written descriptions of 13 human cadavers in different stages of decomposition (postmortem interval: 2-186 days). Data analysis used a two-way random model intraclass correlation in SPSS (v. 17.0). The TBS method showed "almost perfect" agreement between observers, with average absolute correlation coefficients of 0.990 and average consistency correlation coefficients of 0.991. While the TBS method may have sources of error, scoring reliability is not one of them. Individual component scores were examined, and the influences of education and experience levels were investigated. Overall, the trunk component scores were the least concordant. Suggestions are made to improve the reliability of the TBS method. © 2016 American Academy of Forensic Sciences.
Keessen, Paul; Maaskant, Jolanda; Visser, Bart
2018-08-01
The standardized Mensendieck test (SMT) was developed to quantify posture, movement, gait, and respiration. In the hands of an experienced therapist, the SMT is proven to be a reliable tool. It is unclear whether posture, movement, gait, and respiration are related to the degree of functional disability in patients with chronic pain. The objective of this study was to assess the reliability and convergent validity of the SMT in a heterogeneous sample of 50 patients with chronic pain. Internal consistency was determined by Cronbach's α and interrater reliability by the intraclass correlation coefficient (ICC). Convergent validity was assessed by determining the Spearman rank correlation coefficient between the movement quality measured in the SMT and functional limitation measured on the disability rating index (DRI). The internal consistency was Cronbach's α 0.91. Substantial reliability was found for the items: movement (ICC = 0.68), gait (ICC = 0.69), sitting posture (ICC = 0.63), and respiration (ICC = 0.64). Insufficient reliability was found for standing posture (ICC = 0.23). A moderate correlation was found between average test score SMT and the DRI (r = -0.37) and respiration and DRI (r = -0.45). The SMT is a reasonably reliable tool to assess movement, gait, sitting posture, and respiration. None of the items in the domain standing posture has sufficient reliability. A thorough study of this domain should be considered. The results show little evidence for convergent validity. Several items of the SMT correlated moderately with functional limitation with the DRI. These items were global movement, hip flexion, pelvis rotation, and all respiration items.
Apivatgaroon, Adinun; Angthong, Chayanin; Sanguanjit, Prakasit; Chernchujit, Bancha
2016-10-01
To develop a Thai version of the Kujala score and show the evaluation of the validity and reliability of the score. The Thai version of the Kujala score was developed using the forward-backward translation protocol. The 49 PFPS patients answered the Thai version of questionnaires including the Kujala score, Short Form-36 (SF-36) and International Knee Documentation Committee (IKDC) Subjective Knee Form. The validity between the scores has been tested. The reliability was assessed using test-retest reliability and internal consistency. The Thai version of the Kujala score showed a good correlation with Thai IKDC Subjective Knee Form (Pearson's correlation coefficient; r = 0.74: p < 0.01) and moderate correlation with the Thai SF-36 subscales of physical component summary, total score and role physical (r = 0.586, 0.571 and 0.524, respectively: p < 0.01). The test-retest reliability was excellent with an intra-class correlation coefficient of 0.908 (p < 0.001; 95% CI [0.842-0.947]). The internal consistency was strong with Cronbach's alpha of 0.952 (p < 0.001). No floor and ceiling effects were observed. The Thai version of the Kujala score has shown good validity and reliability. This score can be effectively used for evaluating Thai patients with patellofemoral pain syndrome. Implications for Rehabilitation The Kujala score is a self-administered questionnaire for patients with patellofemoral pain syndrome (PFPS). The validity and reliability of the Thai version of Kujala are compatible with other versions (Turkish, Chinese and Persian version). The Thai version of Kujala has been shown to have validity and reliability in Thai PFPS patients and can be used for clinical evaluation and also in the research work.
Development of the Seasonal Migrant Agricultural Worker Stress Scale in Sanliurfa, Southeast Turkey.
Simsek, Zeynep; Ersin, Fatma; Kirmizitoprak, Evin
2016-01-01
Stress is one of the main causes of health problems, especially mental disorders. These health problems cause a significant amount of ability loss and increase cost. It is estimated that by 2020, mental disorders will constitute 15% of the total disease burden, and depression will rank second only after ischemic heart disease. Environmental experiences are paramount in increasing the liability of mental disorders in those who constantly face sustained high levels of stress. The objective of this study was to develop a stress scale for seasonal migrant agricultural workers aged 18 years and older. The sample consisted of 270 randomly selected seasonal migrant agricultural workers. The average age of the participants was 33.1 ± 14, and 50.7% were male. The Cronbach alpha coefficient and test-retest methods were used for reliability analyses. Although the factor analysis was performed for the structure validity of the scale, the Kaiser-Meyer-Olkin coefficient and Bartlett test were used to determine the convenience of the data for the factor analysis. In the reliability analyses, the Cronbach alpha coefficient of internal consistency was calculated as .96, and the test-retest reliability coefficient was .81. In the exploratory factor analysis for validity of the scale, four factors were obtained, and the factors represented workplace physical conditions (25.7% of the total variance), workplace psychosocial and economic factors (19.3% of the total variance), workplace health problems (15.2% of the total variance), and school problems (10.1% of the total variance). The four factors explained 70.3% of the total variance. As a result of the expert opinions and analyses, a stress scale with 48 items was developed. The highest score to be obtained from the scale was 144, and the lowest score was 0. The increase in the score indicates the increase in the stress levels. The findings show that the scale is a valid and reliable assessment instrument that can be used in epidemiological research and planning interventions.
Validation of the Chinese Version of the Quality of Nursing Work Life Scale
Fu, Xia; Xu, Jiajia; Song, Li; Li, Hua; Wang, Jing; Wu, Xiaohua; Hu, Yani; Wei, Lijun; Gao, Lingling; Wang, Qiyi; Lin, Zhanyi; Huang, Huigen
2015-01-01
Quality of Nursing Work Life (QNWL) serves as a predictor of a nurse’s intent to leave and hospital nurse turnover. However, QNWL measurement tools that have been validated for use in China are lacking. The present study evaluated the construct validity of the QNWL scale in China. A cross-sectional study was conducted conveniently from June 2012 to January 2013 at five hospitals in Guangzhou, which employ 1938 nurses. The participants were asked to complete the QNWL scale and the World Health Organization Quality of Life abbreviated version (WHOQOL-BREF). A total of 1922 nurses provided the final data used for analyses. Sixty-five nurses from the first investigated division were re-measured two weeks later to assess the test-retest reliability of the scale. The internal consistency reliability of the QNWL scale was assessed using Cronbach’s α. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC). Criterion-relation validity was assessed using the correlation of the total scores of the QNWL and the WHOQOL-BREF. Construct validity was assessed with the following indices: χ2 statistics and degrees of freedom; relative mean square error of approximation (RMSEA); the Akaike information criterion (AIC); the consistent Akaike information criterion (CAIC); the goodness-of-fit index (GFI); the adjusted goodness of fit index; and the comparative fit index (CFI). The findings demonstrated high internal consistency (Cronbach’s α = 0.912) and test-retest reliability (interclass correlation coefficient = 0.74) for the QNWL scale. The chi-square test (χ2 = 13879.60, df [degree of freedom] = 813 P = 0.0001) was significant. The RMSEA value was 0.091, and AIC = 1806.00, CAIC = 7730.69, CFI = 0.93, and GFI = 0.74. The correlation coefficient between the QNWL total scores and the WHOQOL-BREF total scores was 0.605 (p<0.01). The QNWL scale was reliable and valid in Chinese-speaking nurses and could be used as a clinical and research instrument for measuring work-related factors among nurses in China. PMID:25950838
Measuring family-centred practices of professionals in early intervention services in Taiwan.
Kang, L-J; Palisano, R J; Simeonsson, R J; Hwang, A-W
2017-09-01
Family-centred practices emphasize professional supports for forming partnerships with families in early intervention. The Measure of Processes of Care for Service Providers (MPOC-SP) measures the perceptions of paediatric service providers in supporting children and families. This study aimed to establish reliability of the Chinese version of the MPOC-SP (C-MPOC-SP) and to examine professional perceptions of family-centred practices in relation to professional discipline and years of experience. A convenience sample of 94 physical therapists, occupational therapists, speech-language pathologists, social workers and early childhood educators completed the C-MPOC-SP. Thirty-seven professionals completed the measure a second time within 2-4 weeks for test-retest reliability. Internal consistency and test-retest reliability were examined by Cronbach's α and intra-class correlation coefficient. Comparisons were made across professional disciplines by multivariate analyses of variance followed by analyses of variance. Relationships between years of experience and ratings of family-centred practices were examined by Pearson's correlation coefficients (r). Cronbach's α for items on each of the four scales of the C-MPOC-SP ranged from 0.80 to 0.92, indicating adequate internal consistency. Intra-class correlation coefficient between the initial and repeat completion of the C-MPOC-SP for each scale ranged from 0.56 to 0.77, indicating adequate to excellent test-retest reliability. Mean ratings for the Communicating Specific Information were significantly higher for physical therapists, occupational therapists and speech-language pathologists than for social workers (P = 0.001). The C-MPOC-SP scores were positively correlated with years of experience for all four scales (r = 0.23-0.38; P < 0.05). This study established adequate internal consistency and adequate to excellent test-retest reliability of the C-MPOC-SP in measuring perceptions of family centeredness of early intervention service providers. Cross-discipline differences were found in communicating specific information about the child. Higher perceptions of family centeredness were associated with more years of experience. The results support the utility of the C-MPOC-SP in professional education and programme evaluation of early intervention services in Taiwan. © 2017 John Wiley & Sons Ltd.
Reliability and validity of the Nurse Practitioners' Roles and Competencies Scale.
Lin, Li-Chun; Lee, Sheuan; Ueng, Steve Wen-Neng; Tang, Woung-Ru
2016-01-01
The objective of this study was to test the reliability and construct validity of the Nurse Practitioners' Roles and Competencies Scale. The role of nurse practitioners has attracted international attention. The advanced nursing role played by nurse practitioners varies with national conditions and medical environments. To date, no suitable measurement tool has been available for assessing the roles and competencies of nurse practitioners in Asian countries. Secondary analysis of data from three studies related to nurse practitioners' role competencies. We analysed data from 563 valid questionnaires completed in three studies to identify the factor structure of the Nurse Practitioners' Roles and Competencies Scale. To this end, we performed exploratory factor analysis using principal component analysis extraction with varimax orthogonal rotation. The internal consistency reliabilities of the overall scale and its subscales were examined using Cronbach's alpha coefficient. The scale had six factors: professionalism, direct care, clinical research, practical guidance, medical assistance, as well as leadership and reform. These factors explained 67·5% of the total variance in nurse practitioners' role competencies. Cronbach's alpha coefficient for the overall scale was 0·98, and those of its subscales ranged from 0·83-0·97. The internal consistency reliability and construct validity of the Nurse Practitioners' Roles and Competencies Scale were good. The high internal consistency reliabilities suggest item redundancy, which should be minimised by using item response theory to enhance the applicability of this questionnaire for future academic and clinical studies. The Nurse Practitioners' Roles and Competencies Scale can be used as a tool for assessing the roles and competencies of nurse practitioners in Taiwan. Our findings can also serve as a reference for other Asian countries to develop the nurse practitioner role. © 2015 John Wiley & Sons Ltd.
Elfering, Achim; Cronenberg, Sonja; Grebner, Simone; Tamcan, Oezguer; Müller, Urs
2017-12-01
A newly developed questionnaire assessing limitations in activity of daily living (LADL-Q) that should improve assessment of LADL is tested in a large population-based validation study. This survey was paper-based. Overall, 16,634 individuals who were representative of the working population in the German-speaking part of Switzerland participated in the study. Item analysis was used the final version of the LADL-Q to four items per subscale that correspond to potential problems in three body regions (back and neck, upper extremities, lower extremities). Analysis included tests for reliability, internal consistency, dimensionality and convergent validity. Test-retest reliability coefficients after 2 weeks ranged from 0.82 to 0.99 (Mdn = 0.87), with no item having a coefficient below 0.60. The median item-total coefficients ranged between moderate and good. Correlation coefficients between LADL-Q subscales and three validated clinical instruments (Western Ontario and McMaster Universities osteoarthritis index, shoulder pain disability index, Oswestry) ranged from 0.63 to 0.81. In structural equation modeling the three subscales were significantly related with two important outcomes in occupational rehabilitation: self-reported general health and daily task performance. The new LADL-Q is a brief, reliable and valid tool for assessment of LADL in studies on musculoskeletal health.
Stucki, G; Sangha, O; Stucki, S; Michel, B A; Tyndall, A; Dick, W; Theiler, R
1998-03-01
To compare the metric properties and validity of German versions of the WOMAC (Western Ontario and McMaster Universities) and a self-administered questionnaire-format of the Lequesne-Algofunctional-Index in patients with osteoarthritis (OA) of the lower extremities. Cross-sectional analysis of the instruments' internal consistency (Cronbach's coefficient alpha) and construct validity (correlation with radiological OA-severity and limitation in range-of-motion) in ambulatory patients and patients before hip arthroplasty. Test-retest reliability was assessed on a subsample after 10 days. Data from 51 patients out of 91 contacted could be analyzed. Twenty-nine patients had knee and 22 patients had hip OA. Both the WOMAC and Lequesne OA-indices and their scales or sections had a satisfactory test-retest reliability (Intraclass correlation coefficient 0.43-0.96). All scales of the WOMAC were internally consistent (Cronbach's coefficient alpha 0.81-0.96) and associated with radiological OA-severity and joint range of motion. However, only the function but not the symptom sections (Cronbach's coefficient alpha knee: 0.55; hip: 0.63) of the self-administered Lequesne OA index were internally consistent for both, patients with knee and hip OA. Also, the symptom components were not or only weakly associated with radiological OA-severity and joint range of motion. Although our results are based on a German version using a self-report format we may caution using the self-administered Lequesne OA index without prior testing of its metric properties and validity.
Kaneko, Mei; Sato, Iori; Soejima, Takafumi; Kamibeppu, Kiyoko
2014-09-01
The purpose of the study is to develop a Japanese version of the Pediatric Quality of Life Inventory (PedsQL) Generic Core Scales Young Adult Version (PedsQL-YA-J) and determine the feasibility, reliability, and validity of the scales. Translation equivalence and content validity were verified using back-translation and cognitive debriefing tests. A total of 428 young adults recruited from one university, two vocational schools, or five companies completed questionnaires. We determined questionnaire feasibility, internal consistency, and test-retest reliability; checked concurrent validity against the Center for Epidemiologic Studies Depression Scale (CES-D); determined convergent and discriminant validity with the Medical Outcome Study 36-item Short Form Health Survey (SF-36); described known-groups validity with regard to subjective symptoms, illness or injury requiring regular medical visits, and depression; and verified factorial validity. All scales were internally consistent (Cronbach's coefficient alpha = 0.77-0.86); test-retest reliability was acceptable (intraclass correlation coefficient = 0.57-0.69); and all scales were concurrently valid with depression (Pearson's correlation coefficient = 0.43-0.57). The scales convergent and discriminant validity with the SF-36 and CES-D were acceptable. Evaluation of known-groups validity confirmed that the Physical Functioning scale was sensitive for subjective symptoms, the Emotional Functioning scale for depression, and the Work/School Functioning scale for illness or injury requiring regular medical visits. Exploratory factor analysis found a six-factor structure consistent with the assumed structure (cumulative proportion = 57.0%). The PedsQL-YA-J is suitable for assessing health-related quality of life in young adults in education, employment, or training, and for clinical trials and epidemiological research.
Testing the Difference between Reliability Coefficients Alpha and Omega
ERIC Educational Resources Information Center
Deng, Lifang; Chan, Wai
2017-01-01
Reliable measurements are key to social science research. Multiple measures of reliability of the total score have been developed, including coefficient alpha, coefficient omega, the greatest lower bound reliability, and others. Among these, the coefficient alpha has been most widely used, and it is reported in nearly every study involving the…
Rask, Marie; Oscarsson, Marie; Ludwig, Neil; Swahnberg, Katarina
2017-04-04
Cervical dysplasia is a precancerous condition, which has been shown to create anxiety in women. To be able to investigate these women's health-related quality of life, a disease-specific instrument is required. There does not seem to be a Swedish version of an instrument to screen for this specific disease. Therefore, this study aims to translate and cross-culturally adapt the Functional Assessment of Chronic Illness Therapy - Cervical Dysplasia (FACIT-CD) into a Swedish context and evaluate its linguistic validity and reliability. The Functional Assessment of Chronic Illness Therapy (FACIT) translation methodology was used, which consists of several steps including pilot testing of the FACIT-CD instrument through cognitive debriefing interviews. Ten women diagnosed with cervical dysplasia participated in the cognitive debriefing interviews. The internal consistency reliability of the Swedish FACIT-CD was estimated by Cronbach's alpha coefficient. Homogeneity of the items was evaluated by corrected item-total correlations. The sample consists of 34 women who were diagnosed with cervical dysplasia. The translation and cross-cultural adaptation went smoothly without any problems for the majority of the items. The cognitive debriefing interviews indicated that the Swedish FACIT-CD consists of relevant items, is easy to understand and complete, and has unambiguous and comprehensive response categories. The translation and cross-cultural adaptation resulted in a Swedish FACIT-CD, which is conceptually and semantically equivalent to the English version and linguistically valid. The total scale of the Swedish FACIT-CD exhibited good internal consistency reliability with a Cronbach's alpha coefficient of 0.84, and all of the subscales exhibited acceptable value between 0.71 and 0.81 except the Relationships subscale, which had a value of 0.67. Finally, all but four items exceeded the acceptable level for the corrected item-total correlations of ≥ 0.20. The Swedish FACIT-CD is conceptually and semantically equivalent to the English version and linguistically valid; further, it exhibits good internal consistency reliability.
Validation of general job satisfaction in the Korean Labor and Income Panel Study.
Park, Shin Goo; Hwang, Sang Hee
2017-01-01
The purpose of this study is to assess the validity and reliability of general job satisfaction (JS) in the Korean Labor and Income Panel Study (KLIPS). We used the data from the 17th wave (2014) of the nationwide KLIPS, which selected a representative panel sample of Korean households and individuals aged 15 or older residing in urban areas. We included in this study 7679 employed subjects (4529 males and 3150 females). The general JS instrument consisted of five items rated on a scale from 1 (strongly disagree) to 5 (strongly agree). The general JS reliability was assessed using the corrected item-total correlation and Cronbach's alpha coefficient. The validity of general JS was assessed using confirmatory factor analysis (CFA) and Pearson's correlation. The corrected item-total correlations ranged from 0.736 to 0.837. Therefore, no items were removed. Cronbach's alpha for general JS was 0.925, indicating excellent internal consistency. The CFA of the general JS model showed a good fit. Pearson's correlation coefficients for convergent validity showed moderate or strong correlations. The results obtained in our study confirm the validity and reliability of general JS.
Ando, Yukako; Kataoka, Tsuyoshi; Okamura, Hitoshi; Tanaka, Katsutoshi; Kobayashi, Toshio
2013-12-01
The purpose of this research is to verify the reliability and validity of a job stressor scale for nurses caring for patients with intractable neurological diseases. A mail survey was conducted using a self-report questionnaire. The subjects were 263 nurses and assistant nurses working in wards specializing in intractable neurological diseases. The response rate was 71.9% (valid response rate, 66.2%). With regard to reliability, internal consistency and stability were assessed. Internal consistency was examined via Cronbach's alpha. For stability, the test-retest method was performed and stability was examined via intraclass correlation coefficients. With regard to validity, factor validity, criterion-related validity, and content validity were assessed. Exploratory factor analysis was used for factor validity. For criterion-related validity, an existing scale was used as an external criterion; concurrent validity was examined via Spearman's rank correlation coefficients. As a result of analysis, there were 26 items in the scale created with an eight factor structure. Cronbach's a for the 26 items was 0.90; with the exception of two factors, alpha for all of the individual sub-factors was high at 0.7 or higher. The intraclass correlation coefficient for the 26 items was 0.89 (p < 0.001). With regard to criterion-related validity, concurrent validity was confirmed and the correlation coefficient with an external criterion was 0.73 (p < 0.001). For content validity, subjects who responded that "The questionnaire represents a stressor well or to a degree" accounted for 81% of the total responses. Reliability and validity were confirmed, so the scale created in the current research is a usable scale.
Transcultural validation of the Oxford Shoulder Score for the French-speaking population.
Tuton, D; Barbe, C; Salmon, J-H; Dramé, M; Nérot, C; Ohl, X
2016-09-01
Patient-reported outcome measures (PROMs) have been gaining in popularity over the last decade. The Oxford Shoulder Score (OSS) is a well-established self-administered questionnaire for shoulder evaluation adapted for the English-speaking population. The aim of the present study was to develop a translation and a transcultural adaptation of the OSS and to assess its validity in native French-speaker patients with shoulder pain. The translation process was carried out following a translation/back-translation methodology by two translators. All patients completed the French OSS, the Subjective Shoulder Value (SSV), and the Constant score. Internal consistency was tested using Cronbach's α coefficient. Validity was assessed by calculating the Pearson correlation coefficient between the OSS and the Constant score and the SSV. One hundred forty-four patients suffering from degenerative or inflammatory diseases of the shoulder were included in this study. The average time required to complete the French OSS was 2min and 45s. Seventy patients were asked to complete the questionnaire twice (test/retest reliability). Internal consistency was high with Cronbach's α coefficient=0.93. The intraclass correlation coefficient was 0.91 (95% CI: 0.88-0.94) for test/retest reliability. The French OSS score was significantly correlated with the Constant-Murley score (r=0.73 and P<0.0001) and with the SSV (r=0.68 and P<0.0001). The present study shows that the French version of the OSS is reliable, valid, and reproducible. The sensitivity to change now needs to be evaluated. This score was adapted to the French-speaking population for the self-assessment of patients with degenerative or inflammatory disorders of the shoulder. Level 1, Test of previously developed criteria, diagnostic test study. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Imaeda, Toshihiko; Uchiyama, Shigeharu; Toh, Satoshi; Wada, Takuro; Okinaga, Shuji; Sawaizumi, Takuya; Nishida, Jun; Kusunose, Koichi; Omokawa, Shohei
2007-01-01
The Carpal Tunnel Syndrome Instrument (CTSI) is a disease-specific, self-administered questionnaire that consists of a symptom severity scale (SS) and a functional status scale (FS). The CTSI was cross-culturally adapted and developed by the Impairment Evaluation Committee, Japanese Society for Surgery of the Hand (JSSH). The purpose of this study was to test the reliability, validity, and responsiveness of the Japanese version of the CTSI (CTSI-JSSH). A consecutive series of 87 patients with carpal tunnel syndrome completed the CTSI-JSSH, the JSSH version of the Disability of the Arm, Shoulder, and Hand questionnaire (DASH-JSSH), and the 36-Item Short-Form Health Survey (SF-36). Seventy-two of the patients were reassessed for test-retest reliability 1 or 2 weeks later. Reliability was investigated by the reproducibility and the internal consistency. To analyze the validity, a factor analysis (principal axis factoring) of the CTSI-JSSH and the correlation coefficients between the CTSI-JSSH and DASH-JSSH were obtained. The responsiveness was examined by calculating the standardized response mean (SRM; mean change/SD) and effect size (mean change/SD of baseline value) after carpal tunnel release in 42 patients. Cronbach's alpha coefficients for the CTSI-JSSH-SS and the CTSI-JSSH-FS were 0.84 and 0.90, respectively, and the intraclass correlation coefficients were 0.82 and 0.83, respectively. The unidimensionality of the CTSI-JSSH-SS was barely confirmed; the unidimensionality of the CTSI-JSSH-FS was confirmed. The correlation coefficients between the CTSI-JSSH-FS and the CTSI-JSSH-SS or DASH-JSSH were 0.58 and 0.80, respectively. The correlation coefficient between the CTSI-JSSH-SS and DASH-JSSH was 0.54. The correlation coefficients between the subscales of SF-36 and the CTSI-JSSH-SS or the CTSI-JSSH-FS ranged from -0.23 to -0.66 and from -0.19 to -0.63, respectively. The SRMs/effect sizes of the CTSI-JSSH-SS and the CTSI-JSSH-FS were -0.85/-0.99 and -0.70/-0.61, which indicated that they were more than moderately sensitive. The CTSI-JSSH has sufficient reliability, validity, and responsiveness to assess the health status in carpal tunnel syndrome.
Covariate-free and Covariate-dependent Reliability.
Bentler, Peter M
2016-12-01
Classical test theory reliability coefficients are said to be population specific. Reliability generalization, a meta-analysis method, is the main procedure for evaluating the stability of reliability coefficients across populations. A new approach is developed to evaluate the degree of invariance of reliability coefficients to population characteristics. Factor or common variance of a reliability measure is partitioned into parts that are, and are not, influenced by control variables, resulting in a partition of reliability into a covariate-dependent and a covariate-free part. The approach can be implemented in a single sample and can be applied to a variety of reliability coefficients.
Validation of Turkish version of brief negative symptom scale.
Polat Nazlı, Irmak; Ergül, Ceylan; Aydemir, Ömer; Chandhoke, Swati; Üçok, Alp; Gönül, Ali Saffet
2016-11-01
Negative symptoms in schizophrenia have been assessed by many instruments. However, a current consensus on these symptoms has been built and new tools, such as the Brief Negative Symptom Scale (BNSS), are generated. This study aimed to evaluate reliability and validity of the Turkish version of BNSS. The scale was translated to Turkish and backtranslated to English. After the approval of the translation, 75 schizophrenia patients were interviewed with BNSS, Positive and Negative Syndrome Scale (PANSS), Calgary Depression Scale for Schizophrenia (CDSS) and Extrapyramidal Symptom Rating Scale (ESRS). Reliability and validity analyses were then calculated. In the reliability analysis, the Cronbach's alpha coefficient was 0.96 and item-total score correlation coefficients were between 0.655-0.884. The intraclass correlation coefficient was 0.665. The inter-rater reliability was 0.982 (p < 0.0001). In the validity analysis, the total score of BNSS-TR was correlated with PANSS Total Score, Positive Symptoms Subscale, Negative Symptoms Subscale, and General Psychopathology Subscale. CDSS and ESRS were not correlated with BNSS-TR. The factor structure of the scale was consisting the same items as in the original version. Our study confirms that the Turkish version of BNSS is an applicable tool for the evaluation of negative symptoms in schizophrenia.
Development and validation of a fatigue assessment scale for U.S. construction workers.
Zhang, Mingzong; Sparer, Emily H; Murphy, Lauren A; Dennerlein, Jack T; Fang, Dongping; Katz, Jeffrey N; Caban-Martinez, Alberto J
2015-02-01
To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Using a two-phased approach, we first identified items (first phase) for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n = 11) and focus groups (three groups with six workers each) with construction workers. The second phase included assessment for the reliability, validity, and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n = 144). Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales ("Lethargy" and "Bodily Ailment"). During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW [0.91], Lethargy [0.86] and Bodily Ailment [0.84]) and acceptable test-retest reliability (Pearson Correlations Coefficients: 0.59-0.68; Intraclass Correlation Coefficients: 0.74-0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. © 2015 Wiley Periodicals, Inc.
Baker, Richard S; Bazargan, Mohsen; Calderón, José L; Hays, Ron D
2006-08-01
To compare the psychometric performance of Spanish versions of the 25-item National Eye Institute Visual Function Questionnaire (NEI VFQ-25) and the NEI VFQ-39 administered to Latino patients with the psychometric performance of the standard English NEI VFQ-25 and NEI VFQ-39 administered to non-Latino patients. Clinic-based cross-sectional survey. Four hundred three patients (160 Latinos and 243 non-Latinos) recruited from general ophthalmology clinics of an urban public hospital over a 6-month period. Structured face-to-face interviews were conducted in Spanish and English to collect data for the NEI VFQ-25 and NEI VFQ-39. We calculated the mean, standard deviation, and percentage of participants having the minimum (floor) and maximum (ceiling) possible score for each item and scale. Internal consistency reliability of the NEI VFQ-25 and NEI VFQ-39 was estimated using the Cronbach alpha and average inter-item correlation. Construct validity for the instruments was assessed by comparing scores for participants classified as having normal versus impaired visual acuity. Instrument scales for general health; general vision; ocular pain; near activities; distance activities; vision-specific social functioning, mental health, role difficulties, and dependency; driving; color vision; and peripheral vision. Internal consistency reliability was significantly lower in the Spanish version than in the English version for 3 scales of the NEI VFQ-25. More importantly, 3 scales in the Spanish version manifested inadequate reliability (alpha< or =0.70), compared with only 1 inadequately reliable subscale in the English version. Reliability coefficients associated with the Spanish NEI VFQ-39 scales exceeded commonly accepted minimum standards. Comparison of reliability coefficients between Latino and non-Latino subgroups demonstrated statistically significant differences for 4 scales: Ocular Pain, Mental Health, Role Difficulties, and Dependency. In each case, the Latino group had the lower internal consistency reliability. However, only for the Ocular Pain subscale was reliability both significantly lower and inadequate (alpha<0.70). Overall performance of the NEI VFQ in Latino populations is adequate. However, in the absence of modifications to improve the reliability of specific Spanish version subscales, comparisons between Latino and non-Latino subgroups using the NEI VFQ must be interpreted with appropriate caution.
Spanish validation of the social stigma scale: Community Attitudes towards Mental Illness.
Ochoa, Susana; Martínez-Zambrano, Francisco; Vila-Badia, Regina; Arenas, Oti; Casas-Anguera, Emma; García-Morales, Esther; Villellas, Raúl; Martín, José Ramón; Pérez-Franco, María Belén; Valduciel, Tamara; García-Franco, Mar; Miguel, Jose; Balsera, Joaquim; Pascual, Gemma; Julia, Eugènia; Casellas, Diana; Haro, Josep Maria
2016-01-01
The stigma against people with mental illness is very high. In Spain there are currently no tools to assess this construct. The aim of this study was to validate the Spanish version of the Community Attitudes towards Mental Illness questionnaire in an adolescent population, and determining its internal consistency and temporal stability. Another analysis by gender will be also performed. A translation and back-translation of the Community Attitudes towards Mental Illness was performed. A total of 150 students of between 14 and 18 years-old were evaluated with this tool in two stages. Internal consistency was tested using Cronbach α; and intraclass correlation coefficient was used for test-retest reliability. Gender-stratified analyses were also performed. The Cronbach α was 0.861 for the first evaluation and 0.909 for the second evaluation. The values of the intraclass correlation coefficient ranged from 0.775 to 0.339 in the item by item analysis, and between 0.88 and 0.81 in the subscales. In the segmentation by gender, it was found that girls scored between 0.797 and 0.863 in the intraclass correlation coefficient, and boys scored between 0.889 and 0.774. In conclusion, the Community Attitudes towards Mental Illness is a reliable tool for the assessment of social stigma. Although reliable results have been found for boys and girls, our results found some gender differences in the analysis. Copyright © 2014 SEP y SEPB. Published by Elsevier España. All rights reserved.
Sun, Zhi-Jing; Zhu, Lan; Liang, Maolian; Xu, Tao; Lang, Jing-He
2016-08-01
WeChat is a promising tool for capturing electronic data; however, no research has examined its use. This study evaluates the reliability and feasibility of WeChat for administering the Pelvic Floor Impact Questionnaire Short Form 7 questionnaire to women with pelvic floor disorders. Sixty-eight pelvic floor rehabilitation women were recruited between June and December 2015 and crossover randomized to two groups. All participants completed two questionnaire formats. One group completed the paper version followed by the WeChat version; the other group completed the questionnaires in reverse order. Two weeks later, each group completed the two versions in reverse order. The WeChat version's reliability was assessed using intraclass correlation coefficients and test-retest reliability. Forty-two women (61.8%) preferred the WeChat to the paper format, eight (11.8%) preferred the paper format, and 18 (26.5%) had no preference. The younger women preferred WeChat. Completion time was 116.5 (61.3) seconds for the WeChat version and 133.4 (107.0) seconds for the paper version, with no significant difference (P = 0.145). Age and education did not impact completion time (P > 0.05). Consistency between the WeChat and paper versions was excellent. The intraclass correlation coefficients of the Pelvic Floor Impact Questionnaire Short Form 7 and the three subscales ranged from 0.915 to 0.980. The Bland-Altman analysis and linear regression results also showed high consistency. The test-retest study had a Pearson's correlation coefficient of 0.908, demonstrating a strong correlation. WeChat-based questionnaires were well accepted by women with pelvic floor disorders and had good data quality and reliability.
Maciel, João; Infante, Paulo; Ribeiro, Susana; Ferreira, André; Silva, Artur C; Caravana, Jorge; Carvalho, Manuel G
2014-11-01
The prevalence of obesity has increased worldwide. An assessment of the impact of obesity on health-related quality of life (HRQoL) requires specific instruments. The Moorehead-Ardelt Quality of Life Questionnaire II (MA-II) is a widely used instrument to assess HRQoL in morbidly obese patients. The objective of this study was to translate and validate a Portuguese version of the MA-II.The study included forward and backward translations of the original MA-II. The reliability of the Portuguese MA-II was estimated using the internal consistency and test-retest methods. For validation purposes, the Spearman's rank correlation coefficient was used to evaluate the correlation between the Portuguese MA-II and the Portuguese versions of two other questionnaires, the 36-item Short Form Health Survey (SF-36) and the Impact of Weight on Quality of Life-Lite (IWQOL-Lite).One hundred and fifty morbidly obese patients were randomly assigned to test the reliability and validity of the Portuguese MA-II. Good internal consistency was demonstrated by a Cronbach's alpha coefficient of 0.80, and a very good agreement in terms of test-retest reliability was recorded, with an overall intraclass correlation coefficient (ICC) of 0.88. The total sums of MA-II scores and each item of MA-II were significantly correlated with all domains of SF-36 and IWQOL-Lite. A statistically significant negative correlation was found between the MA-II total score and BMI. Moreover, age, gender and surgical status were independent predictors of MA-II total score.A reliable and valid Portuguese version of the MA-II was produced, thus enabling the routine use of MA-II in the morbidly obese Portuguese population.
Demoulin, Christophe; Brasseur, Pauline; Roussel, Nathalie; Brereton, Clara; Humblet, Fabienne; Flynn, Daniel; Van Beveren, Julien; Osinsky, Thomas; Donneau, Anne-Françoise; Crielaard, Jean-Michel; Vanderthommen, Marc; Bruyère, Olivier
2017-11-01
Pain physiology education is an important component in the management of patients with chronic musculoskeletal pain. The Neurophysiology of Pain Questionnaire (NPQ) was developed in English to assess pain physiology knowledge in patients. This study aimed to translate the NPQ into French (NPQ-Fr) and to investigate the main psychometric properties of the NPQ-Fr. The translation was performed using the best practice translation guidelines. One hundred and one French-speaking patients with chronic non-specific spinal pain completed the NPQ-Fr to assess its acceptability and presence of floor/ceiling effects and test its dimensionality. The construct validity was tested by comparing the patients' NPQ-Fr scores to those of 17 physiotherapists and investigating its correlation with subscales of the Short Form-36 questionnaire. The reliability (i.e., internal consistency and test-retest reliability) was also investigated. To test the test-retest reliability, 70 patients were asked to complete the NPQ-Fr twice with one week in between. Regarding the NPQ-Fr psychometric properties: 1) acceptability was good; 2) internal consistency reached a Cronbach α-coefficient of 0.44; 3) no floor and ceiling effects were observed in patients; 4) a principal factor analysis generated three major factors; 5) construct validity was good; and 6) reliability was acceptable (intraclass correlation coefficient = 0.644; standard error of measurement = 1.5). The NPQ-Fr has satisfactory basic psychometric properties in patients with chronic spinal pain.
Development and validation of the Chinese version of dry eye related quality of life scale.
Zheng, Bang; Liu, Xiao-Jing; Sun, Yue-Qian Fiona; Su, Jia-Zeng; Zhao, Yang; Xie, Zheng; Yu, Guang-Yan
2017-07-17
To develop the Chinese version of quality of life scale for dry eye patients based on the Impact of Dry Eye on Everyday Life (IDEEL) questionnaire and to assess the reliability and validity of the developed scale. The original IDEEL was adapted cross-culturally to Chinese language and further developed following standard procedures. A total of 100 Chinese patients diagnosed with dry eye syndrome were included to investigate the psychometric properties of the Chinese version of scale. Psychometric tests included internal consistency (Cronbach's ɑ coefficients), construct validity (exploratory factor analysis), and known-groups validity (the analysis of variance). The Chinese version of Dry Eye Related Quality of Life (CDERQOL) Scale contains 45 items classified into 5 domains. Good to excellent internal consistency reliability was demonstrated for all 5 domains (Cronbach's ɑ coefficients range from 0.716 to 0.913). Construct validity assessment indicated a consistent factorial structure of the CDERQOL scale with hypothesized construct, with the exception of "Dry Eye Symptom-Bother" domain. All domain scores were detected with significant difference across three severity groups of dry eye patients (P < 0.05) except for "Satisfaction with Treatment" domain, indicating good known-groups validity. The results indicated that the CDERQOL scale is a reliable and valid instrument for patients with dry eye syndrome among Chinese population, and could be used as a supplementary diagnostic and treatment-effectiveness measure.
Judging in Rhythmic Gymnastics at Different Levels of Performance.
Leandro, Catarina; Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta
2017-12-01
This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach's alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges' difficulty scores, the Kendall's coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach's alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level.
Judging in Rhythmic Gymnastics at Different Levels of Performance
Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta
2017-01-01
Abstract This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach’s alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges’ difficulty scores, the Kendall’s coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach’s alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level. PMID:29339996
Reliability and validity of the Japanese version of the Resilience Scale and its short version.
Nishi, Daisuke; Uehara, Ritei; Kondo, Maki; Matsuoka, Yutaka
2010-11-17
The clinical relevance of resilience has received considerable attention in recent years. The aim of this study is to demonstrate the reliability and validity of the Japanese version of the Resilience Scale (RS) and short version of the RS (RS-14). The original English version of RS was translated to Japanese and the Japanese version was confirmed by back-translation. Participants were 430 nursing and university psychology students. The RS, Center for Epidemiologic Studies Depression Scale (CES-D), Rosenberg Self-Esteem Scale (RSES), Social Support Questionnaire (SSQ), Perceived Stress Scale (PSS), and Sheehan Disability Scale (SDS) were administered. Internal consistency, convergent validity and factor loadings were assessed at initial assessment. Test-retest reliability was assessed using data collected from 107 students at 3 months after baseline. Mean score on the RS was 111.19. Cronbach's alpha coefficients for the RS and RS-14 were 0.90 and 0.88, respectively. The test-retest correlation coefficients for the RS and RS-14 were 0.83 and 0.84, respectively. Both the RS and RS-14 were negatively correlated with the CES-D and SDS, and positively correlated with the RSES, SSQ and PSS (all p < 0.05), although the correlation between the RS and CES-D was somewhat lower than that in previous studies. Factor analyses indicated a one-factor solution for RS-14, but as for RS, the result was not consistent with previous studies. This study demonstrates that the Japanese version of RS has psychometric properties with high degrees of internal consistency, high test-retest reliability, and relatively low concurrent validity. RS-14 was equivalent to the RS in internal consistency, test-retest reliability, and concurrent validity. Low scores on the RS, a positive correlation between the RS and perceived stress, and a relatively low correlation between the RS and depressive symptoms in this study suggest that validity of the Japanese version of the RS might be relatively low compared with the original English version.
Escobar, A; Quintana, J M; Bilbao, A; Azkárate, J; Güenaga, J I
2002-11-01
The aim of this study was to validate a translated version of the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) questionnaire in Spanish patients with hip or knee osteoarthritis (OA). The WOMAC questionnaire and the SF-36 were administered to a sample of 269 patients on the waiting list for hip or knee replacement. We studied the convergent validity and the item-scale correlation using Pearson's correlation coefficient and Spearman's pi. For the reliability study we used another sample of 58 patients who received the WOMAC twice within 15 days. The Pearson's, Spearman's pi, and intraclass correlation coefficients were calculated. Internal consistency was measured by Cronbach's alpha. The responsiveness study was carried out by resending the two questionnaires to all patients 6 months after surgical intervention; responsiveness was measured by means of the paired t-test, the effect size I and the standardised response mean. The Pearson's coefficients for the convergent validity ranged from -0.52 to -0.63. The coefficients obtained for the item-scale correlation of the pain area were 0.74 or higher, 0.91 or higher for stiffness, and 0.61 or higher for function. When measuring the test-retest reliability, the coefficients ranged from 0.66 to 0.81. Internal consistency yielded a Cronbach's alpha ranging from 0.81 to 0.93. The responsiveness showed an effect size I ranging from 1.5 to 2.2 in patients who underwent hip replacement; for those who underwent knee replacement the range was 1 to 1.8. The standardised response mean ranged from 1.3 to 1.9 for patients with hip OA; those with knee OA ranged from 0.8 to 1.5. The Spanish version of WOMAC is a valid, reliable and responsive instrument in patients with hip or knee OA.
Assessment of the reliability of protein-protein interactions and protein function prediction.
Deng, Minghua; Sun, Fengzhu; Chen, Ting
2003-01-01
As more and more high-throughput protein-protein interaction data are collected, the task of estimating the reliability of different data sets becomes increasingly important. In this paper, we present our study of two groups of protein-protein interaction data, the physical interaction data and the protein complex data, and estimate the reliability of these data sets using three different measurements: (1) the distribution of gene expression correlation coefficients, (2) the reliability based on gene expression correlation coefficients, and (3) the accuracy of protein function predictions. We develop a maximum likelihood method to estimate the reliability of protein interaction data sets according to the distribution of correlation coefficients of gene expression profiles of putative interacting protein pairs. The results of the three measurements are consistent with each other. The MIPS protein complex data have the highest mean gene expression correlation coefficients (0.256) and the highest accuracy in predicting protein functions (70% sensitivity and specificity), while Ito's Yeast two-hybrid data have the lowest mean (0.041) and the lowest accuracy (15% sensitivity and specificity). Uetz's data are more reliable than Ito's data in all three measurements, and the TAP protein complex data are more reliable than the HMS-PCI data in all three measurements as well. The complex data sets generally perform better in function predictions than do the physical interaction data sets. Proteins in complexes are shown to be more highly correlated in gene expression. The results confirm that the components of a protein complex can be assigned to functions that the complex carries out within a cell. There are three interaction data sets different from the above two groups: the genetic interaction data, the in-silico data and the syn-express data. Their capability of predicting protein functions generally falls between that of the Y2H data and that of the MIPS protein complex data. The supplementary information is available at the following Web site: http://www-hto.usc.edu/-msms/AssessInteraction/.
Yalin Sapmaz, Şermin; Özek Erkuran, Handan; Yalin, Nefize; Önen, Özlem; Öztekin, Siğnem; Kavurma, Canem; Köroğlu, Ertuğrul; Aydemir, Ömer
2017-12-01
This study aimed to assess the validity and reliability of the Turkish version of Diagnostic and Statistical Manual of Mental Disorders (DSM-5) Level 2 Anger Scale. The scale was prepared by translation and back translation of DSM-5 Level 2 Anger Scale. Study groups consisted of a clinical sample of cases diagnosed with depressive disorder and treated in a child and adolescent psychiatry unit and a community sample. The study was continued with 218 children and 160 parents. In the assessment process, child and parent forms of DSM-5 Level 2 Anger Scale and Children's Depression Inventory and Strengths and Difficulties Questionnaire-Parent Form were used. In the reliability analyses, the Cronbach alpha internal consistency coefficient values were found very high regarding child and parent forms. Item-total score correlation coefficients were high and very high, respectively, for child and parent forms indicating a statistical significance. As for construct validity, one factor was maintained for each form and was found to be consistent with the original form of the scale. As for concurrent validity, the child form of the scale showed significant correlation with Children's Depression Inventory, while the parent form showed significant correlation with Strengths and Difficulties Questionnaire-Parent Form. It was found that the Turkish version of DSM-5 Level 2 Anger Scale could be utilized as a valid and reliable tool both in clinical practice and for research purposes.
Fearon, A M; Ganderton, C; Scarvell, J M; Smith, P N; Neeman, T; Nash, C; Cook, J L
2015-12-01
Greater trochanteric pain syndrome (GTPS) is common, resulting in significant pain and disability. There is no condition specific outcome score to evaluate the degree of severity of disability associated with GTPS in patients with this condition. To develop a reliable and valid outcome measurement capable of evaluating the severity of disability associated with GTPS. A phenomenological framework using in-depth semi structured interviews of patients and medical experts, and focus groups of physiotherapists was used in the item generation. Item and format clarification was undertaken via piloting. Multivariate analysis provided the basis for item reduction. The resultant VISA-G was tested for reliability with the inter class co-efficient (ICC), internal consistency (Cronbach's Alpha), and construct validity (correlation co-efficient) on 52 naïve participants with GTPS and 31 asymptomatic participants. The resultant outcome measurement tool is consistent in style with existing tendinopathy outcome measurement tools, namely the suite of VISA scores. The VISA-G was found to be have a test-retest reliability of ICC2,1 (95% CI) of 0.827 (0.638-0.923). Internal consistency was high with a Cronbach's Alpha of 0.809. Construct validity was demonstrated: the VISA-G measures different constructs than tools previously used in assessing GTPS, the Harris Hip Score and the Oswestry Disability Index (Spearman Rho:0.020 and 0.0205 respectively). The VISA-G did not demonstrate any floor or ceiling effect in symptomatic participants. The VISA-G is a reliable and valid score for measuring the severity of disability associated GTPS. Copyright © 2015 Elsevier Ltd. All rights reserved.
Mortsiefer, Achim; Immecke, Janine; Rotthoff, Thomas; Karger, André; Schmelzer, Regine; Raski, Bianca; Schmitten, Jürgen In der; Altiner, Attila; Pentzek, Michael
2014-06-01
To evaluate the summative assessment (OSCE) of a communication training programme for dealing with challenging doctor-patient encounters in the 4th study year. Our OSCE consists of 4 stations (breaking bad news, guilt and shame, aggressive patients, shared decision making), using a 4-item global rating (GR) instrument. We calculated reliability coefficients for different levels, discriminability of single items and interrater reliability. Validity was estimated by gender differences and accordance between GR and a checklist. In a pooled sample of 456 students in 3 OSCEs over 3 terms, total reliability was α=0.64, reliability coefficients for single stations were >0.80, and discriminability in 3 of 4 stations was within the range of 0.4-0.7. Except for one station, interrater reliability was moderate to strong. Reliability on item level was poor and pointed to some problems with the use of the GR. The application of the GR on regular undergraduate medical education shows moderate reliability in need of improvement and some traits of validity. Ongoing development and evaluation is needed with particular regard to the training of the examiners. Our CoMeD-OSCE proved suitable for the summative assessment of communication skills in challenging doctor-patient encounters. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Psychometric properties of the Symptom Status Questionnaire-Heart Failure.
Heo, Seongkum; Moser, Debra K; Pressler, Susan J; Dunbar, Sandra B; Mudd-Martin, Gia; Lennie, Terry A
2015-01-01
Many patients with heart failure (HF) experience physical symptoms, poor health-related quality of life (HRQOL), and high rates of hospitalization. Physical symptoms are associated with HRQOL and are major antecedents of hospitalization. However, reliable and valid physical symptom instruments have not been established. Therefore, this study examined the psychometric properties of the Symptom Status Questionnaire-Heart Failure (SSQ-HF) in patients with HF. Data on symptoms using the SSQ-HF were collected from 249 patients (aged 61 years, 67% male, 45% in New York Heart Association functional class III/IV). Internal consistency reliability was assessed using Cronbach's α. Item homogeneity was assessed using item-total and interitem correlations. Construct validity was assessed using factor analysis and testing hypotheses on known relationships. Data on depressive symptoms (Beck Depression Inventory II), HRQOL (Minnesota Living With Heart Failure Questionnaire), and event-free survival were collected to test known relationships. Internal consistency reliability was supported: Cronbach's α was .80. Item-total correlation coefficients and interitem correlation coefficients were acceptable. Factor analysis supported the construct validity of the instrument. More severe symptoms were associated with more depressive symptoms, poorer HRQOL, and more risk for hospitalization, emergency department visit, or death, controlling for covariates. The findings of this study support the reliability and validity of the SSQ-HF. Clinicians and researchers can use this instrument to assess physical symptoms in patients with HF.
Rodrigues, Marcelo F; Michel-Crosato, Edgard; Cardoso, Jefferson R; Traebert, Jefferson
2009-06-01
Cross-cultural translation and psychometric testing. To translate and cross-culturally adapt the Quebec Back Pain Disability Scale (QDS) to Brazilian Portuguese and to examine its validity and reliability. Current literature shows the need to adopt reliable and internationally standardized methods for the analysis of low back pain. To our knowledge, this specific questionnaire has not been translated and validated for Portuguese-speaking patients. The translation and cross-cultural adaptation of the QDS were developed in agreement with internationally recommended methodology, and the resulting product was evaluated in this study with 54 consecutive patients. Internal consistency was obtained through Cronbach's alpha; reliability was estimated through the intraclass correlation coefficient and the Bland and Altman agreement (d = mean difference). Validity was determined by correlating the scores of the Brazil-QDS with the Brazilian version of the Roland-Morris Questionnaire and Visual Analogue Pain Scale by means of the Spearman rank correlation coefficient. The internal consistency obtained was excellent (Cronbach's alpha = 0.97). Intraobserver and interobserver reliability were considered strong (ICC = 0.93-d = 0.68 and 0.96-d = 0.57, respectively). The correlation with Brazilian Roland-Morris Questionnaire and with the Visual Analogue Scale was high (r = 0.857; r = 0.758, respectively). The data showed that the process of translation and cross-cultural adaptation were successful and that the adapted instrument demonstrated excellent psychometric properties.
Reliability of the detailed assessment of speed of handwriting on Flemish children.
Simons, Johan; Probst, Michel
2014-01-01
This study evaluates the reliability of the Detailed Assessment of Speed of Handwriting (DASH) in a Dutch-speaking sample of children. The sample included 650 boys and 513 girls (age range = 9-16 years). Handwriting speed measurements were obtained using the DASH. Interrater agreement, test-retest reliability, and internal consistency were calculated; gender and age effects were analyzed. Interrater agreement shows excellent reliability with intraclass correlation coefficients of at least 0.94. Test-retest correlations ranged from r = 0.65 to r = 0.81. The internal consistency measures, calculated with Cronbach's alpha, were between 0.88 and 0.94. Both gender and age have a significant effect on handwriting speed, with F (7.1144) = 17.43 (P < .001) for gender and F (7.1144) = 21.8 (P < .001) for age. The DASH is a reliable assessment tool to evaluate handwriting speed of Dutch-speaking children. There is a tendency of girls to write faster than boys.
Bengi, Göksel; Yalçın, Mustafa; Akpınar, Hale; Keskinoğlu, Pembe; Ellidokuz, Hülya
2015-07-01
There are few specific evaluation forms for evaluating the quality of life among patients with chronic constipation. Our study aimed to determine the validity and reliability of the translated Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire for the Turkish population because evidence of its reliability and validity is required to justify its use in other studies and clinical practice. This study included 154 patients with constipation who were treated at the Department of Gastroenterology, Dokuz Eylül University Hospital between January and June 2012. The translated PAC-QOL questionnaire was completed by patients at the clinic and also at a 2-week follow-up to test its reliability. Cronbach's alpha coefficient (internal consistency) was 0.91 (good) for the translated PAC-QOL questionnaire. Time validity was evaluated using the intraclass correlation coefficient (ICC) method, and the ICC value for all questions was confirmed as 0.68 at the 2-week follow-up. The validity of the tool in the study group was evaluated using factor analysis, and the results were highly significant (Kaiser-Meyer-Olkin value: 0.857; Bartlett's test: p=0.001). Questions were categorized according to six factors based on the factor analysis, and these factors explained 65.1% of the total variation. For hypothesis verification of the tool, the correlation coefficient for PAC-QOL and PAC Symptoms (PAC-SYM) was r=0.577 (p<0.001), whereas the correlation coefficient for PAC-QOL and constipation severity score was r=0.457 (p<0.001). The PAC-QOL questionnaire was reliable, although not valid because of the limited sample group.
Kaux, Jean-François; Delvaux, François; Schaus, Jean; Demoulin, Christophe; Locquet, Médéa; Buckinx, Fanny; Beaudart, Charlotte; Dardenne, Nadia; Van Beveren, Julien; Croisier, Jean-Louis; Forthomme, Bénédicte; Bruyère, Olivier
Translation and validation of algo-functional questionnaire. The lateral elbow tendinopathy is a common injury in tennis players and physical workers. The Patient-Rated Tennis Elbow Evaluation (PRTEE) Questionnaire was specifically designed to measure pain and functional limitations in patients with lateral epicondylitis (tennis elbow). First developed in English, this questionnaire has since been translated into several languages. The aims of the study were to translate and cross-culturally adapt the PRTEE questionnaire into French and to evaluate the reliability and validity of this translated version of the questionnaire (PRTEE-F). The PRTEE was translated and cross-culturally adapted into French according to international guidelines. To assess the reliability and validity of the PRTEE-F, 115 participants were asked twice to fill in the PRTEE-F, and once the Disabilities of Arm, Shoulder and Hand Questionnaire (DASH) and the Short Form Health Survey (SF-36). Internal consistency (using Cronbach's alpha), test-retest reliability (using intraclass correlation coefficient (ICC), standard error of measurement and minimal detectable change), and convergent and divergent validity (using the Spearman's correlation coefficients respectively with the DASH and with some subscales of the SF-36) were assessed. The PRTEE was translated into French without any problems. PRTEE-F showed a good test-retest reliability for the overall score (ICC 0.86) and for each item (ICC 0.8-0.96) and a high internal consistency (Cronbach's alpha = 0.98). The correlation analyses revealed high correlation coefficients between PRTEE-F and DASH (convergent validity) and, as expected, a low or moderate correlation with the divergent subscales of the SF-36 (discriminant validity). There was no floor or ceiling effect. The PRTEE questionnaire was successfully cross-culturally adapted into French. The PRTEE-F is reliable and valid for evaluating French-speaking patients with lateral elbow tendinopathy. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Read, Paul J; Oliver, Jon L; Croix, Mark Ba De Ste; Myer, Gregory D; Lloyd, Rhodri S
2016-12-01
Read, P, Oliver, JL, Croix, MD, Myer, GD, and Lloyd, RS. Consistency of field-based measures of neuromuscular control using force-plate diagnostics in elite male youth soccer players. J Strength Cond Res 30(12): 3304-3311, 2016-Deficits in neuromuscular control during movement patterns such as landing are suggested pathomechanics that underlie sport-related injury. A common mode of assessment is measurement of landing forces during jumping tasks; however, these measures have been used less frequently in male youth soccer players, and reliability data are sparse. The aim of this study was to examine the reliability of a field-based neuromuscular control screening battery using force-plate diagnostics in this cohort. Twenty-six pre-peak height velocity (PHV) and 25 post-PHV elite male youth soccer players completed a drop vertical jump (DVJ), single-leg 75% horizontal hop and stick (75%HOP), and single-leg countermovement jump (SLCMJ). Measures of peak landing vertical ground reaction force (pVGRF), time to stabilization, time to pVGRF, and pVGRF asymmetry were recorded. A test-retest design was used, and reliability statistics included change in mean, intraclass correlation coefficient, and coefficient of variation (CV). No significant differences in mean score were reported for any of the assessed variables between test sessions. In both groups, pVGRF and asymmetry during the 75%HOP and SLCMJ demonstrated largely acceptable reliability (CV ≤ 10%). Greater variability was evident in DVJ pVGRF and all other assessed variables, across the 3 protocols (CV range = 13.8-49.7%). Intraclass correlation coefficient values ranged from small to large and were generally higher in the post-PHV players. The results of this study suggest that pVGRF and asymmetry can be reliably assessed using a 75%HOP and SLCMJ in this cohort. These measures could be used to support a screening battery for elite male youth soccer players and for test-retest comparison.
Yun, Young-Ju; Shin, Yong-Beom; Kim, Soo-Yeon; Shin, Myung-Jun; Kim, Ra-Jin; Oh, Tae-Young
2016-07-01
[Purpose] The purpose of this study was to develop the Korean version of the PedsQL(TM) 3.0 Cerebral Palsy Module to evaluate the health-related quality of life of children with cerebral palsy and to test the reliability and validity. [Subjects and Methods] The study included 108 caregivers of children with cerebral palsy aged 2 to 4 years and 72 caregivers of children aged 5 to 7 years, who visited multiple sites between February and August 2015. The Translation Commission performed the first translation with the approval of the Mapi Research Trust Company to create a Korean-version of the PedsQL(TM). Afterwards, back-translation was performed by one translator specializing in health and medical treatment who was a native English-speaker fluent in Korean, and one native Korean-speaker fluent in English. The consistency of each question was confirmed and a translation-integrated version was created. Test components were explained to caregivers during a one-on-one interview; caregivers then completed the PedsQL(TM) questionnaire and a Pediatric Evaluation Disability Inventory (PEDI) questionnaire. Subjects contributing to test-retest measures were asked to repeat the PedsQL questionnaire one week later and return it by mail. To assess data quality for the survey question results, non-response rate, ceiling effect, and floor effect were analyzed. Test-retest reliability and internal consistency reliability were assessed. For test-retest reliability, an intraclass correlation coefficient (ICC) was calculated, and for internal consistency reliability, Cronbach's alpha was used. To test criterion-related validity, Pearson's correlation coefficient was used. [Results] The content validity of the PedsQL 3.0 Cerebral Palsy Module was high for both age groups, and demonstrated significant internal consistency (>0.7) in all areas. For test-retest reliability, both groups demonstrated a significant ICC (>0.61). Correlation with the PEDI was statistically significant in all areas except pain and hurt. [Conclusion] The Korean version of the PedsQL(TM) 3.0 Cerebral Palsy Module was found to be reliable and valid, and is expected to contribute greatly to the evaluation of the quality of life of children with cerebral palsy.
Trathitiphan, Warayos; Paholpak, Permsak; Sirichativapee, Winai; Wisanuyotin, Taweechok; Laupattarakasem, Pat; Sukhonthamarn, Kamolsak; Jeeravipoolvarn, Polasak; Kosuwon, Weerachai
2016-10-01
HOOS was developed as an extension of the Western Ontario and McMaster Universities' Osteoarthritis Index questionnaire for measuring symptoms and functional limitations related to the hip(s) of patients with osteoarthritis. To determine the validity and reliability of the Thai version of the Hip disability and Osteoarthritis Outcome Score (HOOS) vis-à-vis hip osteoarthritis, the original HOOS was translated into a Thai version of HOOS, according to international recommendations. Patients with hip osteoarthritis (n = 57; 25 males) were asked to complete the Thai version of HOOS twice: once then again after a 3-week interval. The test-retest reliability was analyzed using the intraclass correlation coefficient (ICC). Internal consistencies were analyzed using Cronbach's alpha, while the construct validity was tested by comparing the Thai HOOS with the Thai modified SF-36 and calculating the Spearman's rank correlation coefficients. The Thai HOOS produced good reliability (i.e., the ICC was greater than 0.9 in all five subscales). All of the Cronbach's alpha showed that the Thai HOOS had high internal consistency (Cronbach's alpha greater than 0.8), especially for the pain and ADL subscales (0.89 and 0.90, respectively). The Spearman's rank correlation for all five subscales of the Thai HOOS had moderate correlation with the Bodily Pain subscale of the Thai SF-36. The pain subscale of the Thai HOOS had a high correlation with the Vitality and Social Function subscales of the Thai SF-36 (r = 0.55 and 0.54)-with which the symptom subscale had a moderate correlation. The Thai version of HOOS had excellent internal consistency, excellent test-retest reliability, and good construct validity. It can be used as a reliable tool for assessing quality of life for patients with hip osteoarthritis in Thailand.
Standards and reliability in evaluation: when rules of thumb don't apply.
Norcini, J J
1999-10-01
The purpose of this paper is to identify situations in which two rules of thumb in evaluation do not apply. The first rule is that all standards should be absolute. When selection decisions are being made or when classroom tests are given, however, relative standards may be better. The second rule of thumb is that every test should have a reliability of .80 or better. Depending on the circumstances, though, the standard error of measurement, the consistency of pass/fail classifications, and the domain-referenced reliability coefficients may be better indicators of reproducibility.
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire
Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos
2006-01-01
Background Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. Methods The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test – retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. Results The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81–0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test – retest reliability) was found 0.66 (95% CI: 0.62–0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36–0.89). Conclusion This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting. PMID:16515708
Validity and reliability of the Turkish Migraine Disability Assessment (MIDAS) questionnaire.
Ertaş, Mustafa; Siva, Aksel; Dalkara, Turgay; Uzuner, Nevzat; Dora, Babür; Inan, Levent; Idiman, Fethi; Sarica, Yakup; Selçuki, Deniz; Sirin, Hadiye; Oğuzhanoğlu, Atilla; Irkeç, Ceyla; Ozmenoğlu, Mehmet; Ozbenli, Taner; Oztürk, Musa; Saip, Sabahattin; Neyal, Münife; Zarifoğlu, Mehmet
2004-09-01
The aim of this study is to assess the comprehensibility, internal consistency, patient-physician reliability, test-retest reliability, and validity of Turkish version of Migraine Disability Assessment (MIDAS) questionnaire in patients with headache. MIDAS questionnaire has been developed by Stewart et al and shown to be reliable and valid to determine the degree of disability caused by migraine. This study was designed as a national multicenter study to demonstrate the reliability and validity of Turkish version of MIDAS questionnaire. Patients applying to 17 Neurology Clinics in Turkey were evaluated at the baseline (visit 1), week 4 (visit 2), and week 12 (visit 3) visits in terms of disease severity and comprehensibility, internal consistency, test-retest reliability, and validity of MIDAS. Since the severity of the disease has been found to change significantly at visit 2 compared to visit 1, test-retest reliability was assessed using the MIDAS scores of a subgroup of patients whose disease severity remained unchanged (up to +/-3 days difference in the number of days with headache between visits 1 and 2). A total of 306 patients (86.2% female, mean age: 35.0 +/- 9.8 years) were enrolled into the study. A total of 65.7%, 77.5%, 82.0% of patients reported that "they had fully understood the MIDAS questionnaire" in visits 1, 2, and 3, respectively. A highly positive correlation was found between physician and patient and the applied total MIDAS scores in all three visits (Spearman correlation coefficients were R= 0.87, 0.83, and 0.90, respectively, P <.001). Internal consistency of MIDAS was assessed using Cronbach's alpha and was found at acceptable (>0.7) or excellent (>0.8) levels in both patient and physician applied MIDAS scores, respectively. Total MIDAS score showed good test-retest reliability (R= 0.68). Both the number of days with headache and the total MIDAS scores were positively correlated at all visits with correlation coefficients between 0.47 and 0.63. There was also a moderate degree of correlation (R= 0.54) between the total MIDAS score at week 12 and the number of days with headache at visit 2 + visit 3, which quantify headache-related disability over a 3-month period similar to MIDAS questionnaire. These findings demonstrated that the Turkish translation is equivalent to the English version of MIDAS in terms of internal consistency, test-retest reliability, and validity. Physicians can reliably use the Turkish translation of the MIDAS questionnaire in defining the severity of illness and its treatment strategy when applied as a self-administered report by migraine patients themselves.
Psychometrics Matter in Health Behavior: A Long-term Reliability Generalization Study.
Pickett, Andrew C; Valdez, Danny; Barry, Adam E
2017-09-01
Despite numerous calls for increased understanding and reporting of reliability estimates, social science research, including the field of health behavior, has been slow to respond and adopt such practices. Therefore, we offer a brief overview of reliability and common reporting errors; we then perform analyses to examine and demonstrate the variability of reliability estimates by sample and over time. Using meta-analytic reliability generalization, we examined the variability of coefficient alpha scores for a well-designed, consistent, nationwide health study, covering a span of nearly 40 years. For each year and sample, reliability varied. Furthermore, reliability was predicted by a sample characteristic that differed among age groups within each administration. We demonstrated that reliability is influenced by the methods and individuals from which a given sample is drawn. Our work echoes previous calls that psychometric properties, particularly reliability of scores, are important and must be considered and reported before drawing statistical conclusions.
Coefficient Alpha: A Reliability Coefficient for the 21st Century?
ERIC Educational Resources Information Center
Yang, Yanyun; Green, Samuel B.
2011-01-01
Coefficient alpha is almost universally applied to assess reliability of scales in psychology. We argue that researchers should consider alternatives to coefficient alpha. Our preference is for structural equation modeling (SEM) estimates of reliability because they are informative and allow for an empirical evaluation of the assumptions…
Reliable and valid assessment of Lichtenstein hernia repair skills.
Carlsen, C G; Lindorff-Larsen, K; Funch-Jensen, P; Lund, L; Charles, P; Konge, L
2014-08-01
Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia repair, (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. The internal consistency of the items was high (Cronbach's alpha = 0.97). The inter-rater reliability was very good with an intra-class correlation coefficient (ICC) = 0.93. Generalizability analysis showed a coefficient above 0.8 even with one rater. The coefficient improved to 0.92 if three raters were used. One-way analysis of variance found a significant difference between the three groups which indicates construct validity, p < 0.001. Lichtenstein hernia repair skills can be assessed blindly by a single rater in a reliable and valid fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment of trainees performing Lichtenstein hernia repair to ensure that the objectives of competency-based surgical training are met.
Lin, Yu-Hua; Wang, Liching Sung
2010-08-01
The purpose of this study was to assess the reliability and validity of a Chinese version of the revised nurses professional values scale (NPVS-R). The convenient sampling method, including senior undergraduate nursing students (n=110) and clinical nurses (n=223), was applied to recruit appropriate samples from southern Taiwan. The revised nurses professional values scale (NPVS-R) was used in this study. Content validity, construct validity, internal consistency, and reliability were assessed. The final sample consisted of 286 subjects. three factors were detected in the results, accounting for 60.12% of the explained variance. The first factor was titled professionalism, and included 13 items. The second factor was named caring, and consisted of seven items. Activism was the third factor, which included six items. Overall Cronbach's alpha coefficient was 0.90, taken from values for each of the three factors of 0.88, 0.90, and 0.81, respectively. The Chinese version of the NPVS-R can be considered a reliable and valid scale for assigning values that can mark professionalism in Taiwanese nurses. Copyright 2009 Elsevier Ltd. All rights reserved.
An Improved Internal Consistency Reliability Estimate.
ERIC Educational Resources Information Center
Cliff, Norman
1984-01-01
The proposed coefficient is derived by assuming that the average Goodman-Kruskal gamma between items of identical difficulty would be the same for items of different difficulty. An estimate of covariance between items of identical difficulty leads to an estimate of the correlation between two tests with identical distributions of difficulty.…
Fatehi, Zahra; Baradaran, Hamid Reza; Asadpour, Mohamad; Rezaeian, Mohsen
2017-01-01
Background: Individuals' listening styles differs based on their characters, professions and situations. This study aimed to assess the validity and reliability of Listening Styles Profile- Revised (LSP- R) in Iranian students. Methods: After translating into Persian, LSP-R was employed in a sample of 240 medical and nursing Persian speaking students in Iran. Statistical analysis was performed to test the reliability and validity of the LSP-R. Results: The study revealed high internal consistency and good test-retest reliability for the Persian version of the questionnaire. The Cronbach's alpha coefficient was 0.72 and intra-class correlation coefficient 0.87. The means for the content validity index and the content validity ratio (CVR) were 0.90 and 0.83, respectively. Exploratory factor analysis (EFA) yielded a four-factor solution accounted for 60.8% of the observed variance. Majority of medical students (73%) as well as majority of nursing students (70%) stated that their listening styles were task-oriented. Conclusion: In general, the study finding suggests that the Persian version of LSP-R is a valid and reliable instrument for assessing listening styles profile in the studied sample.
Short version of the Depression Anxiety Stress Scale-21: is it valid for Brazilian adolescents?
da Silva, Hítalo Andrade; dos Passos, Muana Hiandra Pereira; de Oliveira, Valéria Mayaly Alves; Palmeira, Aline Cabral; Pitangui, Ana Carolina Rodarti; de Araújo, Rodrigo Cappato
2016-01-01
ABSTRACT Objective To evaluate the interday reproducibility, agreement and validity of the construct of short version of the Depression Anxiety Stress Scale-21 applied to adolescents. Methods The sample consisted of adolescents of both sexes, aged between 10 and 19 years, who were recruited from schools and sports centers. The validity of the construct was performed by exploratory factor analysis, and reliability was calculated for each construct using the intraclass correlation coefficient, standard error of measurement and the minimum detectable change. Results The factor analysis combining the items corresponding to anxiety and stress in a single factor, and depression in a second factor, showed a better match of all 21 items, with higher factor loadings in their respective constructs. The reproducibility values for depression were intraclass correlation coefficient with 0.86, standard error of measurement with 0.80, and minimum detectable change with 2.22; and, for anxiety/stress: intraclass correlation coefficient with 0.82, standard error of measurement with 1.80, and minimum detectable change with 4.99. Conclusion The short version of the Depression Anxiety Stress Scale-21 showed excellent values of reliability, and strong internal consistency. The two-factor model with condensation of the constructs anxiety and stress in a single factor was the most acceptable for the adolescent population. PMID:28076595
Fuller, Joel T; Archer, Jane; Buckley, Jonathan D; Tsiros, Margarita D; Thewlis, Dominic
2016-01-01
To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5%; limits of agreement [LOA] ≤4.2%). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2%; LOA ≤11.9%). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2%; LOA ≤23.6%) and fourth metatarsals (ICC ≥0.67; CV ≤9.6%; LOA ≤27.5%). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals.
Development and evaluation of the nurse quality of communication with patient questionnaire.
Vuković, Mira; Gvozdenović, Branislav S; Stamatović-Gajić, Branka; Ilić, Miodrag; Gajić, Tomislav
2010-01-01
Nurse/patient relationship as a complex interrelation or as an interaction of the factor patient and factor nurse has been a subject of a number of studies during the past ten years. Nurse/patient communication is a special entity, usually observed within a framework of the wider nurse/patient relationship. In that regard, we wanted to develop a standardized questionnaire that could reliably measure the quality of communication between nurse and patient, and be used by nurses. The main goal of this study was to develop and evaluate construct validity of the Nurse Quality of Communication with Patient Questionnaire (NQCPQ), as well as to evaluate its reliability. The goal was also to establish a measure of inter-raters reliability, using two repeated measurements of results by items and scores of the NQCPQ, on the same observed units by two assessors. The starting NQCPQ that consists of 25 items, was filled in by two groups of nurses. Each nurse was questioned during morning and afternoon shifts, in order to evaluate their communication with hospitalized patients, using marks from 1 to 6. To evaluate construct validity, we used the analysis of main components, while reliability was assessed using intraclass correlation coefficient and Cronbach-alpha coefficient. To evaluate interraters reliability, we used Pearson correlation coefficient. Using a group of 118 patients, we explained 86% of the unknown, regarding the investigated phenomenon (communication nurse/patient), using one component by which we separated 6 items of the questionnaire. Inter-item correlation (alpha) in this component was 0.96. Pearson correlation coefficient was highly significant, value 0.7 by item, and correlation coefficient for scores at repeated measurements was 0.84. NQCPQ is 6-item instrument with high construct validity. It can be used to measure quality of nurse/patient communication in a simple, fast and reliable way. It could contribute to more adequate research and defining of this problem, and as such could be used in studies of interaction of psychometric, clinical, biochemical, socio-cultural, demographic and other parameters as well.
Alqarni, Ayidh M; Vennu, Vishal; Alshammari, Sulaiman A; Bindawas, Saad M
2018-01-01
Older adults are the fastest growing population group worldwide. Regular physical activity (PA) is reported to reduce the risk of health conditions and improve personal well-being. Few validated instruments can be used to measure the PA levels among older adults in Saudi Arabia. The Physical Activity Scale for the Elderly (PASE) is used worldwide for evaluating the PA levels of the elderly in epidemiological studies. However, this scale has not been translated into Arabic. This study aimed to cross-culturally adapt the PASE into Arabic language and evaluate its reliability and validity among community-dwelling older adults in Saudi Arabia. This study was a cross-sectional one following Beaton guidelines to translate and perform cultural adaptation, as well as test the reliability and validity of the PASE Arabic version (PASE-A). Elderly (N=74) people from both genders, who lived in a community dwelling in Riyadh city, were selected from several primary health care centers. The study used Cronbach's alpha coefficient to assess the internal consistency reliability, while intraclass correlation coefficient (ICC 2,1 ) was used for test-retest reliability and the Spearman's rank correlation coefficient ( r ) was used to evaluate the correlation among PASE-A and grip strength, Timed Up and Go test, body mass index, and fat percentage. Out of 74 older adults, 59 (79.7%) completed the PASE-A questionnaire twice. The internal consistency of the PASE-A components was good (Cronbach's alpha 0.70-0.75), and the reliability of the components was excellent (ICC 2,1 0.90-0.98). A higher PASE-A score was associated with higher grip strength ( r =0.28, p =0.05) and with shorter Timed Up and Go test times ( r =-0.45, p =0.01). The PASE-A version was easy, understandable, and relevant for Saudi older adults' culture. This scale was a reliable and valid tool for evaluating and assessing the PA level among community-dwelling older adults in Saudi Arabia.
Tatewaki, Yasuko; Higano, Shuichi; Taki, Yasuyuki; Thyreau, Benjamin; Murata, Takaki; Mugikura, Shunji; Ito, Daisuke; Takase, Kei; Takahashi, Shoki
2014-01-01
Quantitative signal targeting with alternating radiofrequency labeling of arterial regions (QUASAR) is a recent spin labeling technique that could improve the reliability of brain perfusion measurements. Although it is considered reliable for measuring gray matter as a whole, it has never been evaluated regionally. Here we assessed this regional reliability. Using a 3-Tesla Philips Achieva whole-body system, we scanned four times 10 healthy volunteers, in two sessions 2 weeks apart, to obtain QUASAR images. We computed perfusion images and ran a voxel-based analysis within all brain structures. We also calculated mean regional cerebral blood flow (rCBF) within regions of interest configured for each arterial territory distribution. The mean CBF over whole gray matter was 37.74 with intraclass correlation coefficient (ICC) of .70. In white matter, it was 13.94 with an ICC of .30. Voxel-wise ICC and coefficient-of-variation maps showed relatively lower reliability in watershed areas and white matter especially in deeper white matter. The absolute mean rCBF values were consistent with the ones reported from PET, as was the relatively low variability in different feeding arteries. Thus, QUASAR reliability for regional perfusion is high within gray matter, but uncertain within white matter. © 2014 The Authors. Journal of Neuroimaging published by the American Society of Neuroimaging.
Tatewaki, Yasuko; Higano, Shuichi; Taki, Yasuyuki; Thyreau, Benjamin; Murata, Takaki; Mugikura, Shunji; Ito, Daisuke; Takase, Kei; Takahashi, Shoki
2014-01-01
BACKGROUND AND PURPOSE Quantitative signal targeting with alternating radiofrequency labeling of arterial regions (QUASAR) is a recent spin labeling technique that could improve the reliability of brain perfusion measurements. Although it is considered reliable for measuring gray matter as a whole, it has never been evaluated regionally. Here we assessed this regional reliability. METHODS Using a 3-Tesla Philips Achieva whole-body system, we scanned four times 10 healthy volunteers, in two sessions 2 weeks apart, to obtain QUASAR images. We computed perfusion images and ran a voxel-based analysis within all brain structures. We also calculated mean regional cerebral blood flow (rCBF) within regions of interest configured for each arterial territory distribution. RESULTS The mean CBF over whole gray matter was 37.74 with intraclass correlation coefficient (ICC) of .70. In white matter, it was 13.94 with an ICC of .30. Voxel-wise ICC and coefficient-of-variation maps showed relatively lower reliability in watershed areas and white matter especially in deeper white matter. The absolute mean rCBF values were consistent with the ones reported from PET, as was the relatively low variability in different feeding arteries. CONCLUSIONS Thus, QUASAR reliability for regional perfusion is high within gray matter, but uncertain within white matter. PMID:25370338
Moghadam, Manije; Salavati, Mahyar; Sahaf, Robab; Rassouli, Maryam; Moghadam, Mojgan; Kamrani, Ahmad Ali Akbari
2018-03-01
After forward-backward translation, the LSS was administered to 334 Persian speaking, cognitively healthy elderly aged 60 years and over recruited through convenience sampling. To analyze the validity of the model's constructs and the relationships between the constructs, a confirmatory factor analysis followed by PLS analysis was performed. The Construct validity was further investigated by calculating the correlations between the LSS and the "Short Form Health Survey" (SF-36) subscales measuring similar and dissimilar constructs. The LSS was re-administered to 50 participants a month later to assess the reliability. For the eight-factor model of the life satisfaction construct, adequate goodness of fit between the hypothesized model and the model derived from the sample data was attained (positive and statistically significant beta coefficients, good R-squares and acceptable GoF). Construct validity was supported by convergent and discriminant validity, and correlations between the LSS and SF-36 subscales. Minimum Intraclass Correlation Coefficient level of 0.60 was exceeded by all subscales. Minimum level of reliability indices (Cronbach's α, composite reliability and indicator reliability) was exceeded by all subscales. The Persian-version of the Life Satisfaction Scale is a reliable and valid instrument, with psychometric properties which are consistent with the original version.
Development and Validation of a Fatigue Assessment Scale for U.S. Construction Workers
Zhang, Mingzong; Sparer, Emily H.; Murphy, Lauren A.; Dennerlein, Jack T.; Fang, Dongping; Katz, Jeffrey N.; Caban-Martinez, Alberto J.
2015-01-01
Objective To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Methods Using a two-phased approach, we first identified items for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n=11) and focus groups (3 groups with 6 workers each) with construction workers. The second phase included assessment for the reliability, validity and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n=144). Results Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales (“Lethargy” and “Bodily Ailment”).. During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW (0.91), Lethargy (0.86) and Bodily Ailment (0.84)) and acceptable test-retest reliability (Pearson Correlations Coefficients: 0.59–0.68; Intraclass Correlation Coefficients: 0.74–0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. Conclusions The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. PMID:25603944
Benz, Thomas; Lehmann, Susanne; Gantenbein, Andreas R; Sandor, Peter S; Stewart, Walter F; Elfering, Achim; Aeschlimann, André G; Angst, Felix
2018-03-09
The Migraine Disability Assessment (MIDAS) is a brief questionnaire and measures headache-related disability. This study aimed to translate and cross-culturally adapt the original English version of the MIDAS to German and to test its reliability. The standardized translation process followed international guidelines. The pre-final version was tested for clarity and comprehensibility by 34 headache sufferers. Test-retest reliability of the final version was quantified by 36 headache patients completing the MIDAS twice with an interval of 48 h. Reliability was determined by intraclass correlation coefficients and internal consistency by Cronbach's α. All steps of the translation process were followed, documented and approved by the developer of the MIDAS. The expert committee discussed in detail the complex phrasing of the questions that refer to one to another, especially exclusion of headache-days from one item to the next. The German version contains more active verb sentences and prefers the perfect to the imperfect tense. The MIDAS scales intraclass correlation coefficients ranged from 0.884 to 0.994 and was 0.991 (95% CI: 0.982-0.995) for the MIDAS total score. Cronbach's α for the MIDAS as a whole was 0.69 at test and 0.67 at retest. The translation process was challenged by the comprehensibility of the questionnaire. The German version of the MIDAS is a highly reliable instrument for assessing headache related disability with moderate internal consistency. Provided validity testing of the German MIDAS is successful, it can be recommended for use in clinical practice as well as in research.
Vanneste, S; Plazier, M; van der Loo, E; Ost, J; Meeus, O; Van de Heyning, P; De Ridder, D
2011-01-01
Up to 30% of the adult population experiences tinnitus at some point in life. The aim of the present study was to validate the Mini-Tinnitus Questionnaire (TQ) in a Dutch-speaking population for measuring tinnitus-related distress and compare it with the extended version normally used in clinical practice and research. We assessed 181 patients at the Tinnitus Research Initiative clinic of Antwerp University Hospital. Twelve items from the TQ chosen by Hiller and Goebel based on the optimal combination of high item correlation, reliability, and sensitivity were selected and correlated to the different subscale and global scores of the TQ. Internal consistency was evaluated using Cronbach's alpha coefficient, and the Guttman split-half coefficient was used to confirm reliability. Correlation to the global TQ score was .93, internal consistency was .87, and reliability was .89. This study further revealed that the Mini-TQ correlates better with the different subscales of the TQ in the Dutch-speaking population. The convergence validity was confirmed, ensuring that this new instrument measures distress. In addition, the norms suggested by Hiller and Goebel were verified and established. Based on these results, the Mini-TQ is recommended as a valid instrument for evaluating tinnitus-related distress in Dutch-speaking populations for a compact, quick, and economical assessment.
ERIC Educational Resources Information Center
Onwuegbuzie, Anthony J.; Daniel, Larry G.
The purposes of this paper are to identify common errors made by researchers when dealing with reliability coefficients and to outline best practices for reporting and interpreting reliability coefficients. Common errors that researchers make are: (1) stating that the instruments are reliable; (2) incorrectly interpreting correlation coefficients;…
Quek, Kia Fatt; Chua, Chong Beng; Razack, Azad Hassan; Low, Wah Yun; Loh, Chit Sin
2005-01-01
The purpose of the present study was to validate the Mandarin version of the International Prostate Symptom Score (Mand-IPSS) in a Malaysian population. The validity and reliability were studied in patients with lower urinary tract symptoms (LUTS; benign prostatic hyperplasia [BPH] group) and without LUTS (control group). Test-retest methodology was used to assess the reliability while Cronbach alpha was used to assess the internal consistency. Sensitivity to change was used to express the effect size index in the preintervention versus post-intervention score in patients with LUTS who underwent transurethral resection of the prostate. For the control group and BPH group, the internal consistency was excellent and a high degree of internal consistency was observed for all seven items (Cronbach alpha = 0.86-0.98 and 0.90-0.98, respectively). Test-retest correlation coefficients for all items were highly significant. Intraclass correlation coefficient (ICC) was high for the control (ICC = 0.93-0.99) and BPH group (ICC = 0.91-0.99). The sensitivity and specificity showed a high degree of sensitivity and specificity to the effects of treatment. A high degree of significance between baseline and post-treatment scores was observed across all seven items in the BPH group but not in the control group. The Mand-IPSS is a suitable, reliable, valid and sensitive instrument to measure clinical change in the Malaysian population.
Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza
2015-01-01
We aimed to validate a cross-culturally adapted version of the Persian Michigan Hand Outcomes Questionnaire (MHOQ). We followed the Beaton's guideline to translate the questionnaire to Persian. We administered the final version to 223 patients among which 79 patients returned 3 days later to respond to the Persian MHOQ for the second time. In the first visit, respondents also filled the Disabilities of the Arm Shoulder and Hand (DASH) and rated the pain based on the Visual Analogue Scale (VAS). Cronbach's alpha for the total MHOQ was 0.79 which showed good internal consistency. Intraclass correlation coefficient (ICC) for the total MHOQ was 0.84 which demonstrated good reliability between test and retest. The absolute correlation coefficient between total MHOQ and the DASH was as high as 0.74. Persian version of the MHOQ proved to be a reliable and valid instrument to be implemented among Persian population with the hand and wrist disorders.
Development and reliability testing of the Worksite and Energy Balance Survey.
Hoehner, Christine M; Budd, Elizabeth L; Marx, Christine M; Dodson, Elizabeth A; Brownson, Ross C
2013-01-01
Worksites represent important venues for health promotion. Development of psychometrically sound measures of worksite environments and policy supports for physical activity and healthy eating are needed for use in public health research and practice. Assess the test-retest reliability of the Worksite and Energy Balance Survey (WEBS), a self-report instrument for assessing perceptions of worksite supports for physical activity and healthy eating. The WEBS included items adapted from existing surveys or new items on the basis of a review of the literature and expert review. Cognitive interviews among 12 individuals were used to test the clarity of items and further refine the instrument. A targeted random-digit-dial telephone survey was administered on 2 occasions to assess test-retest reliability (mean days between time periods = 8; minimum = 5; maximum = 14). Five Missouri census tracts that varied by racial-ethnic composition and walkability. Respondents included 104 employed adults (67% white, 64% women, mean age = 48.6 years). Sixty-three percent were employed at worksites with less than 100 employees, approximately one-third supervised other people, and the majority worked a regular daytime shift (75%). Test-retest reliability was assessed using Spearman correlations for continuous variables, Cohen's κ statistics for nonordinal categorical variables, and 1-way random intraclass correlation coefficients for ordinal categorical variables. Test-retest coefficients ranged from 0.41 to 0.97, with 80% of items having reliability coefficients of more than 0.6. Items that assessed participation in or use of worksite programs/facilities tended to have lower reliability. Reliability of some items varied by gender, obesity status, and worksite size. Test-retest reliability and internal consistency for the 5 scales ranged from 0.84 to 0.94 and 0.63 to 0.84, respectively. The WEBS items and scales exhibited sound test-retest reliability and may be useful for research and surveillance. Further evaluation is needed to document the validity of the WEBS and associations with energy balance outcomes.
Lehotkay, R; Saraswathi Devi, T; Raju, M V R; Bada, P K; Nuti, S; Kempf, N; Carminati, G Galli
2015-03-01
In this study realised in collaboration with the department of psychology and parapsychology of Andhra University, validation of the Aberrant Behavior Checklist-Community (ABC-C) in Telugu, the official language of Andhra Pradesh, one of India's 28 states, was carried out. To assess the factor validity and reliability of this Telugu version, 120 participants with moderate to profound intellectual disability (94 men and 26 women, mean age 25.2, SD 7.1) were rated by the staff of the Lebenshilfe Institution for Mentally Handicapped in Visakhapatnam, Andhra Pradesh, India. Rating data were analysed with a confirmatory factor analysis. The internal consistency was estimated by Cronbach's alpha. To confirm the test-retest reliability, 50 participants were rated twice with an interval of 4 weeks, and 50 were rated by pairs of raters to assess inter-rater reliability. Confirmatory factor analysis revealed that the root mean square error of approximation (RMSEA) was equal to 0.06, the comparative fit index (CFI) was equal to 0.77, and the Tucker Lewis index (TLI) was equal to 0.77, which indicated that the model with five correlated factors had a good fit. Coefficient alpha ranged from 0.85 to 0.92 across the five subscales. Spearman's rank correlation coefficients for inter-rater reliability tests ranged from 0.65 to 0.75, and the correlations for test-retest reliability ranged from 0.58 to 0.76. All reliability coefficients were statistically significant (P < 0.01). The factor validity and reliability of Telugu version of the ABC-C evidenced factor validity and reliability comparable to the original English version and appears to be useful for assessing behaviour disorders in Indian people with intellectual disabilities. © 2014 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
What to Do With "Moderate" Reliability and Validity Coefficients?
Post, Marcel W
2016-07-01
Clinimetric studies may use criteria for test-retest reliability and convergent validity such that correlation coefficients as low as .40 are supportive of reliability and validity. It can be argued that moderate (.40-.60) correlations should not be interpreted in this way and that reliability coefficients <.70 should be considered as indicative of unreliability. Convergent validity coefficients in the .40 to .60 or .40 to .70 range should be considered as indications of validity problems, or as inconclusive at best. Studies on reliability and convergent should be designed in such a way that it is realistic to expect high reliability and validity coefficients. Multitrait multimethod approaches are preferred to study construct (convergent-divergent) validity. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Mentiplay, Benjamin F; Perraton, Luke G; Bower, Kelly J; Pua, Yong-Hao; McGaw, Rebekah; Heywood, Sophie; Clark, Ross A
2015-07-16
The revised Xbox One Kinect, also known as the Microsoft Kinect V2 for Windows, includes enhanced hardware which may improve its utility as a gait assessment tool. This study examined the concurrent validity and inter-day reliability of spatiotemporal and kinematic gait parameters estimated using the Kinect V2 automated body tracking system and a criterion reference three-dimensional motion analysis (3DMA) marker-based camera system. Thirty healthy adults performed two testing sessions consisting of comfortable and fast paced walking trials. Spatiotemporal outcome measures related to gait speed, speed variability, step length, width and time, foot swing velocity and medial-lateral and vertical pelvis displacement were examined. Kinematic outcome measures including ankle flexion, knee flexion and adduction and hip flexion were examined. To assess the agreement between Kinect and 3DMA systems, Bland-Altman plots, relative agreement (Pearson's correlation) and overall agreement (concordance correlation coefficients) were determined. Reliability was assessed using intraclass correlation coefficients, Cronbach's alpha and standard error of measurement. The spatiotemporal measurements had consistently excellent (r≥0.75) concurrent validity, with the exception of modest validity for medial-lateral pelvis sway (r=0.45-0.46) and fast paced gait speed variability (r=0.73). In contrast kinematic validity was consistently poor to modest, with all associations between the systems weak (r<0.50). In those measures with acceptable validity, the inter-day reliability was similar between systems. In conclusion, while the Kinect V2 body tracking may not accurately obtain lower body kinematic data, it shows great potential as a tool for measuring spatiotemporal aspects of gait. Copyright © 2015 Elsevier Ltd. All rights reserved.
Rauseo Vera, Mayra; Gutiérrez-González, Luis Arturo; Maldonado, Irama; Al Snih, Soham
2017-09-21
Spondyloarthropathies (SpA) are disabling diseases with a prevalence of 1.9% in the general population. The indices designed for monitoring the disease should be valid, reliable and cross-culturally adapted for decision-making concerning the appropriate treatment. Changing an adjective or pronoun in a self-administered questionnaire could be the big difference in condensing an idea in a few words and transmitting that concept to all those who share the same language. To develop a Venezuelan version of the original English version of the BASDAI/BASFI and to evaluate its reliability and validity in Venezuelan patients with SpA. Certified linguists were needed for the translation of a Venezuelan version of the BASDAI/BASFI. The evaluation of reliability and validity was performed by calculating correlation coefficients in addition to Cronbach's alpha correlation between the BASDAI score and the clinical parameters (for example: erythrocyte sedimentation rate, C-reactive protein, modified Schöber test, occiput-to-wall distance and enthesis count). We studied 40 patients including 31 men (77.5%) and 9 women (22.5%). The mean age was 35.9 years ± standard deviation (SD) 12.01 and the disease duration was 11.5 years (± SD 9.5). The most common diagnoses were undifferentiated spondyloarthritis (45%), ankylosing spondylitis (27.5%) and psoriatic arthritis (20%). The incidences of reactive arthritis, ankylosing spondylitis and juvenile Reiter's syndrome were 2.5% each. The test-retest reliability of the BASDAI and BASFI was high (R = 0.99 and 0.99, respectively; P<.0001). The internal consistency for the BASDAI was high (Cronbach's alpha = 0.88; P=.002) and the intraclass correlation coefficient for internal consistency: 0.9867 (P=.001). Internal consistency for the BASFI: Cronbach's alpha = 0.7985 (P=.002), intraclass correlation coefficient for internal consistency: 0.9055 (P=.001). Construct validity of the BASDAI was high for general well-being of the patient (R = 0.84) and for enthesis count (R = 0.84). Low back pain showed moderate correlation with BASDAI (R = 0.69; P<.0001) and the erythrocyte sedimentation rate showed a low correlation (R = 0.39683; P=.0112). The Venezuelan version of the BASDAI/BASFI could be used in clinical research to assess and evaluate the course of disease activity in Venezuelan SpA patients. Copyright © 2017 Elsevier España, S.L.U. and Sociedad Española de Reumatología y Colegio Mexicano de Reumatología. All rights reserved.
Kepler, Christopher K; Vaccaro, Alexander R; Koerner, John D; Dvorak, Marcel F; Kandziora, Frank; Rajasekaran, Shanmuganathan; Aarabi, Bizhan; Vialle, Luiz R; Fehlings, Michael G; Schroeder, Gregory D; Reinhold, Maximilian; Schnake, Klaus John; Bellabarba, Carlo; Cumhur Öner, F
2016-04-01
The aims of this study were (1) to demonstrate the AOSpine thoracolumbar spine injury classification system can be reliably applied by an international group of surgeons and (2) to delineate those injury types which are difficult for spine surgeons to classify reliably. A previously described classification system of thoracolumbar injuries which consists of a morphologic classification of the fracture, a grading system for the neurologic status and relevant patient-specific modifiers was applied to 25 cases by 100 spinal surgeons from across the world twice independently, in grading sessions 1 month apart. The results were analyzed for classification reliability using the Kappa coefficient (κ). The overall Kappa coefficient for all cases was 0.56, which represents moderate reliability. Kappa values describing interobserver agreement were 0.80 for type A injuries, 0.68 for type B injuries and 0.72 for type C injuries, all representing substantial reliability. The lowest level of agreement for specific subtypes was for fracture subtype A4 (Kappa = 0.19). Intraobserver analysis demonstrated overall average Kappa statistic for subtype grading of 0.68 also representing substantial reproducibility. In a worldwide sample of spinal surgeons without previous exposure to the recently described AOSpine Thoracolumbar Spine Injury Classification System, we demonstrated moderate interobserver and substantial intraobserver reliability. These results suggest that most spine surgeons can reliably apply this system to spine trauma patients as or more reliably than previously described systems.
Trouli, Marianna N; Vernon, Howard T; Kakavelakis, Kyriakos N; Antonopoulou, Maria D; Paganas, Aristofanis N; Lionis, Christos D
2008-07-22
Neck pain is a highly prevalent condition resulting in major disability. Standard scales for measuring disability in patients with neck pain have a pivotal role in research and clinical settings. The Neck Disability Index (NDI) is a valid and reliable tool, designed to measure disability in activities of daily living due to neck pain. The purpose of our study was the translation and validation of the NDI in a Greek primary care population with neck complaints. The original version of the questionnaire was used. Based on international standards, the translation strategy comprised forward translations, reconciliation, backward translation and pre-testing steps. The validation procedure concerned the exploration of internal consistency (Cronbach alpha), test-retest reliability (Intraclass Correlation Coefficient, Bland and Altman method), construct validity (exploratory factor analysis) and responsiveness (Spearman correlation coefficient, Standard Error of Measurement and Minimal Detectable Change) of the questionnaire. Data quality was also assessed through completeness of data and floor/ceiling effects. The translation procedure resulted in the Greek modified version of the NDI. The latter was culturally adapted through the pre-testing phase. The validation procedure raised a large amount of missing data due to low applicability, which were assessed with two methods. Floor or ceiling effects were not observed. Cronbach alpha was calculated as 0.85, which was interpreted as good internal consistency. Intraclass correlation coefficient was found to be 0.93 (95% CI 0.84-0.97), which was considered as very good test-retest reliability. Factor analysis yielded one factor with Eigenvalue 4.48 explaining 44.77% of variance. The Spearman correlation coefficient (0.3; P = 0.02) revealed some relation between the change score in the NDI and Global Rating of Change (GROC). The SEM and MDC were calculated as 0.64 and 1.78 respectively. The Greek version of the NDI measures disability in patients with neck pain in a reliable, valid and responsive manner. It is considered a useful tool for research and clinical settings in Greek Primary Health Care.
Trouli, Marianna N; Vernon, Howard T; Kakavelakis, Kyriakos N; Antonopoulou, Maria D; Paganas, Aristofanis N; Lionis, Christos D
2008-01-01
Background Neck pain is a highly prevalent condition resulting in major disability. Standard scales for measuring disability in patients with neck pain have a pivotal role in research and clinical settings. The Neck Disability Index (NDI) is a valid and reliable tool, designed to measure disability in activities of daily living due to neck pain. The purpose of our study was the translation and validation of the NDI in a Greek primary care population with neck complaints. Methods The original version of the questionnaire was used. Based on international standards, the translation strategy comprised forward translations, reconciliation, backward translation and pre-testing steps. The validation procedure concerned the exploration of internal consistency (Cronbach alpha), test-retest reliability (Intraclass Correlation Coefficient, Bland and Altman method), construct validity (exploratory factor analysis) and responsiveness (Spearman correlation coefficient, Standard Error of Measurement and Minimal Detectable Change) of the questionnaire. Data quality was also assessed through completeness of data and floor/ceiling effects. Results The translation procedure resulted in the Greek modified version of the NDI. The latter was culturally adapted through the pre-testing phase. The validation procedure raised a large amount of missing data due to low applicability, which were assessed with two methods. Floor or ceiling effects were not observed. Cronbach alpha was calculated as 0.85, which was interpreted as good internal consistency. Intraclass correlation coefficient was found to be 0.93 (95% CI 0.84–0.97), which was considered as very good test-retest reliability. Factor analysis yielded one factor with Eigenvalue 4.48 explaining 44.77% of variance. The Spearman correlation coefficient (0.3; P = 0.02) revealed some relation between the change score in the NDI and Global Rating of Change (GROC). The SEM and MDC were calculated as 0.64 and 1.78 respectively. Conclusion The Greek version of the NDI measures disability in patients with neck pain in a reliable, valid and responsive manner. It is considered a useful tool for research and clinical settings in Greek Primary Health Care. PMID:18647393
Sathira-Angkura, Vera; Pithankuakul, Kongkit; Sakulpipatana, Susana; Piyaskulkaew, Chaiwat; Kunakornsawat, Sombat
2012-04-20
Cross-sectional observational study to investigate psychometric properties of an adapted Thai version of the refined Scoliosis Research Society-22 (SRS-22) questionnaire. To evaluate the reliability and validity of the adapted Thai version of the refined SRS-22 questionnaire. The SRS-22 questionnaire is a valid instrument for assessing the health-related quality of life for patients with adolescent idiopathic scoliosis. Recently, the questionnaire has been translated and validated in many languages for non-English-speaking countries. Translation/retranslation of the English version of the SRS-22 was conducted, and the cross-cultural adaptation process was performed. The Thai version SRS-22 and previously validated Thai version Short-Form survey version 2.0 (SF-36V2) questionnaires were administered to 77 patients with adolescent idiopathic scoliosis who had surgical treatment. Fifty-eight patients (52 adolescent girls) had filled out the first set of questionnaires. Thirty patients of the first-time responders completed the second set of questionnaires. The mean age at the time of operation was 14.6 years and the mean age at the time of the final follow-up was 18.7 years. The mean preoperative scoliosis curve magnitude was 55.4° (range, 30°-95°) and postoperative curve magnitude was 20.1° (range, 0°-60°). Internal consistency was determined with Cronbach α coefficient. Intraclass correlation coefficient was used for test-retest reliability. Concurrent validity was evaluated by comparing SRS-22 domains with relevant domains in the SF-36V2 questionnaire, using the Pearson correlation coefficient. The mean overall Cronbach α coefficient of the adapted Thai version SRS-22 was 0.76. The 2 of corresponding domains (mental health = 0.80 and self-image = 0.83) had satisfactory internal consistency and the remaining domains (pain = 0.78; function/activity = 0.74; and satisfaction = 0.76) were good. The intraclass correlation coefficient for 5 domains was ranged from 0.79 to 0.90, which demonstrated the satisfactory test/retest reproducibility. The concurrent validity, determined by the Pearson correlation coefficient between SRS-22 and SF-36V2 domains, had a good correlation for 15 relevant comparisons (r = 0.50-0.75). The adapted Thai version of the SRS-22 questionnaire had validity and reliability, which can be used to assess the outcome of treatment among Thai-speaking patients with adolescent idiopathic scoliosis.
Shafeei, Asrin; Mokhtarinia, Hamid Reza; Maleki-Ghahfarokhi, Azam; Piri, Leila
2017-08-01
Observational study. To cross-culturally translate the Orebro Musculoskeletal Pain Screening Questionnaire (OMPQ) into Persian and then evaluate its psychometric properties (reliability, validity, ceiling, and flooring effects). To the authors' knowledge, prior to this study there has been no validated instrument to screen the risk of chronicity in Persian-speaking patients with low back pain (LBP) in Iran. The OMPQ was specifically developed as a self-administered screening tool for assessing the risk of LBP chronicity. The forward-backward translation method was used for the translation and cross-cultural adaptation of the original questionnaire. In total, 202 patients with subacute LBP completed the OMPQ and the pain disability questionnaire (PDQ), which was used to assess convergent validity. 62 patients completed the OMPQ a week later as a retest. Slight changes were made to the OMPQ during the translation/cultural adaptation process; face validity of the Persian version was obtained. The Persian OMPQ showed excellent test-retest reliability (intraclass correlation coefficient=0.89). Its internal consistency was 0.71, and its convergent validity was confirmed by good correlation coefficient between the OMPQ and PDQ total scores ( r =0.72, p <0.05). No ceiling or floor effects were observed. The Persian version of the OMPQ is acceptable for the target society in terms of face validity, construct validity, reliability, and consistency. It is therefore considered a useful instrument for screening Iranian patients with LBP.
A Generalizability Analysis of Score Consistency for the Balanced Inventory of Desirable Responding
ERIC Educational Resources Information Center
Vispoel, Walter P.; Tao, Shuqin
2013-01-01
Our goal in this investigation was to evaluate the reliability of scores from the Balanced Inventory of Desirable Responding (BIDR) more comprehensively than in prior research using a generalizability-theory framework based on both dichotomous and polytomous scoring of items. Generalizability coefficients accounting for specific-factor, transient,…
A Note on the Incremental Validity of Aggregate Predictors.
ERIC Educational Resources Information Center
Day, H. D.; Marshall, David
Three computer simulations were conducted to show that very high aggregate predictive validity coefficients can occur when the across-case variability in absolute score stability occurring in both the predictor and criterion matrices is quite small. In light of the increase in internal consistency reliability achieved by the method of aggregation…
[Transcultural adaptation of the Antifat Attitudes Test to Brazilian Portuguese].
Obara, Angélica Almeida; Alvarenga, Marle Dos Santos
2018-05-01
Obese individuals are often blamed for their own condition and the targets of discrimination and prejudice. The scope of this study is to describe the cross-cultural adaptation to Brazilian Portuguese and the validation of the Antifat Attitudes Test - specifically developed for evaluation of negative attitudes toward the obese individual. The scale has 34 statements distributed in three subscales - Social/Character Disparagement (15 items), Physical/Romantic Unattractiveness (10 items) and Weight Control/Blame (9 items). The method involved the translation of the scale; evaluation of the conceptual, operational and item equivalence; evaluation of the semantic equivalence using the paired t test, the Pearson correlation coefficient and the intraclass correlation coefficient (ICC); internal consistency evaluation (Cronbach's alpha) and test-retest reliability (ICC) and Confirmatory Factor Analysis - after application in 340 college students in the area of health. The results showed good global internal consistency and reliability (α 0.85; CCI 0.83), and factor analysis showed that the original subscales can be kept in the adaptation, and therefore the scale adapted to the Brazilian-Portuguese version is valid and useful in studies to explore negative attitudes toward obese individuals.
Chatterji, Madhabi
2002-01-01
This study examines validity of data generated by the School Readiness for Reforms: Leader Questionnaire (SRR-LQ) using an iterative procedure that combines classical and Rasch rating scale analysis. Following content-validation and pilot-testing, principal axis factor extraction and promax rotation of factors yielded a five factor structure consistent with the content-validated subscales of the original instrument. Factors were identified based on inspection of pattern and structure coefficients. The rotated factor pattern, inter-factor correlations, convergent validity coefficients, and Cronbach's alpha reliability estimates supported the hypothesized construct properties. To further examine unidimensionality and efficacy of the rating scale structures, item-level data from each factor-defined subscale were subjected to analysis with the Rasch rating scale model. Data-to-model fit statistics and separation reliability for items and persons met acceptable criteria. Rating scale results suggested consistency of expected and observed step difficulties in rating categories, and correspondence of step calibrations with increases in the underlying variables. The combined approach yielded more comprehensive diagnostic information on the quality of the five SRR-LQ subscales; further research is continuing.
Alwi, N; Harun, D; Omar, B; Ahmad, M; Zagan, M; Leonard, J H
2015-01-01
Caregivers face challenges to adapt while handling individual with learning disabilities (LD). The Family Crisis Oriented Personal Evaluation Scale (F-COPES) is a widely used instrument to measure coping strategies among caregivers. The current study performed cross cultural translation of F-COPES in Malay language. This study aims to examine the reliability by testing internal consistency of Malay version of F-COPES which is developed through back to back translation method from original English version. The Malay version of F-COPES was administered among 30 caregivers. The reliability of F-COPES in Malay version is good with Cronbach's alpha coefficient value of 0.79. The internal consistency on sub domains of F-COPES such as reframing, acquiring social support and seeking spiritual support also acceptable with Cronbach's alpha values 0.67, 0.74, and 0.80, respectively. The Malay version of F-COPES is a reliable tool to evaluate the coping strategies adopted by the caregivers of individual with LD.
ERIC Educational Resources Information Center
Padilla, Miguel A.; Divers, Jasmin
2016-01-01
Coefficient omega and alpha are both measures of the composite reliability for a set of items. Unlike coefficient alpha, coefficient omega remains unbiased with congeneric items with uncorrelated errors. Despite this ability, coefficient omega is not as widely used and cited in the literature as coefficient alpha. Reasons for coefficient omega's…
Papadopoulos, Costas; Constantinou, Antonis; Cheimonidou, Areti-Zoi; Stasinopoulos, Dimitrios
2017-04-01
To cross-culturally adapt and validate the Greek version of the Kujala anterior knee pain scale (KAKPS). The Greek KAKPS was translated from the original English version following standard forward and backward translation procedures. The survey was then conducted in clinical settings by a questionnaire comprising the Greek KAKPS and patellofemoral pain syndrome (PFPS) severity scale. A total of 130 (62 women and 68 men) Greek-reading patients between 18 and 45 years old with anterior knee pain (AKP) for at least four weeks were recruited from physical therapy clinics. To establish test-retest reliability, the patients were asked to complete the KAKPS at initial visit and 2-3 days after the initial visit. The Greek version of the PFPS severity scale was also administered once at initial visit. Internal consistency of the translated instrument was measured using Cronbach's α. An intraclass correlation coefficient was used to assess the test-retest reliability of the KAKPS. Concurrent validity was measured by correlating the KAKPS with the PFPS severity scale using Pearson's correlation coefficient. The results showed that the Greek KAKPS has good internal consistency (Cronbach's α = 0.942), test-retest reliability (ICC = 0.921) and concurrent validity (r > 0.7). This study has shown that the Greek KAKPS has good internal consistency, test-retest reliability and concurrent validity when correlated with the PFPS severity scale in adult patients with AKP for at least four weeks. Implications for rehabilitation The Greek version of the KAKPS has been found to be reliable and valid when used in adult patients with AKP for at least four weeks. The results of the psychometric characteristics were compatible with those of the original English version. The KAKPS could be applied in a Greek-speaking population to assess functional limitations and symptoms in patients aged 18-45 years old with AKP for at least four weeks.
Development and psychometric evaluation of the Professional Practice Environment (PPE) scale.
Erickson, Jeanette Ives; Duffy, Mary E; Gibbons, M Patricia; Fitzmaurice, Joan; Ditomassi, Marianne; Jones, Dorothy
2004-01-01
To describe the Professional Practice Environment (PPE) scale, its conceptual development and psychometric evaluation, and its uses in measuring eight characteristics of the professional practice environment in an acute care setting. The 38-item PPE Scale was validated on a sample of 849 professional practice staff at the Massachusetts General Hospital in Boston. Psychometric analysis included: item analysis, principal components analysis (PCA) with varimax rotation and Kaiser normalization, and internal consistency reliability using Cronbach's alpha coefficient. Eight components were shown, confirming the original conceptually derived model's structure and accounting for 61% of explained variance. Cronbach's alpha coefficients for the eight PPE subscales ranged from .78 to .88. Findings showed the 38-item PPE Scale was reliable and valid for use in health outcomes research to examine the professional practice environment of staff working in acute care settings.
Intraday and Interday Reliability of Ultra-Short-Term Heart Rate Variability in Rugby Union Players.
Nakamura, Fábio Y; Pereira, Lucas A; Esco, Michael R; Flatt, Andrew A; Moraes, José E; Cal Abad, Cesar C; Loturco, Irineu
2017-02-01
Nakamura, FY, Pereira, LA, Esco, MR, Flatt, AA, Moraes, JE, Cal Abad, CC, and Loturco, I. Intraday and interday reliability of ultra-short-term heart rate variability in rugby union players. J Strength Cond Res 31(2): 548-551, 2017-The aim of this study was to examine the intraday and interday reliability of ultra-short-term vagal-related heart rate variability (HRV) in elite rugby union players. Forty players from the Brazilian National Rugby Team volunteered to participate in this study. The natural log of the root mean square of successive RR interval differences (lnRMSSD) assessments were performed on 4 different days. The HRV was assessed twice (intraday reliability) on the first day and once per day on the following 3 days (interday reliability). The RR interval recordings were obtained from 2-minute recordings using a portable heart rate monitor. The relative reliability of intraday and interday lnRMSSD measures was analyzed using the intraclass correlation coefficient (ICC). The typical error of measurement (absolute reliability) of intraday and interday lnRMSSD assessments was analyzed using the coefficient of variation (CV). Both intraday (ICC = 0.96; CV = 3.99%) and interday (ICC = 0.90; CV = 7.65%) measures were highly reliable. The ultra-short-term lnRMSSD is a consistent measure for evaluating elite rugby union players, in both intraday and interday settings. This study provides further validity to using this shortened method in practical field conditions with highly trained team sports athletes.
[Test for assessing levels of alcohol consumption in Bucaramanga, Colombia: design and validation].
Herrán, Oscar F; Ardila, María F; Barba, Diana M
2008-03-01
Excessive alcohol intake can pose a serious problem in public health. The development of instruments to classify the consumers correctly is the first stage in the epidemiologic investigation. The internal validity and the reliability was evaluated for a test of problematic alcohol consumption (CP-alcohol) in Bucaramanga, Colombia. 2005--2006. This work provides a measure that is internally consistent and improved reliability of diagnostic technology. Six hundred one subjects between 18 and 60 years participated in the test for CP-alcohol on two occasions. At the same time, a survey on biological variables (VB), socioeconomic (VSE) and dietary (D) was administered. The internal consistency of CP-alcohol was evaluated by calculating the coefficient alpha of Cronbach, and the reliability with coefficients of Spearman and Cohens Kappa. To evaluate the associations among problematic consumption, VB, VSE, D and the risk of alcoholism, the prevalence ratios were calculated using binomial regression. The frequency of problematic alcohol consumption was of 46.9 (CI 42.9-50.9). Men presented an increased frequency of problematic alcohol use 1.6 times that of women (p<0.001). The coefficient alpha of Cronbach was moderate for all the questions of the test (minimum 0.41, maximum 0.61). In the first application of CP-alcohol, Cronbachs alpha was 0.63, and, in the second, 0.49. Spearmans correlation coefficient was of 0.87 (CI 0.84-0.90) for the population-for men 0.86 (CI 0.82-0.90) and for women 0.86 (CI 0.82-0.90). The Kappas obtained were very good, 0.70 to 0.89. Sex, pleasure provided by alcoholic drinks , risk of alcoholism according to Cut Down on Drinking, Annoyed by Criticism, Guilty Feeling, and Eye Opener (CAGE) and the quantity of consumed alcohol were all correlated with problematic consumption. CP-alcohol is a useful test for investigating the epidemiology of health problems associated with alcohol use.
Intrarater Reliability and Other Psychometrics of the Health Promoting Activities Scale (HPAS).
Muskett, Rachel; Bourke-Taylor, Helen; Hewitt, Alana
The Health Promoting Activities Scale (HPAS) measures the self-rated frequency with which adults participate in activities that promote health. We evaluated the internal consistency, construct validity, and intrarater reliability of the HPAS with a cohort of mothers (N = 56) of school-age children. We used an online survey that included the HPAS and measures of mental and physical health. Statistical analysis included intraclass correlation coefficients (ICCs), measurement error, error range, limits of agreement, and minimum detectable change (MDC). The HPAS showed good internal consistency (Cronbach's α = .73). Construct validity was supported by a significant difference in HPAS scores among participants grouped by physical activity level; no other differences were significant. Results included a high aggregate ICC of .90 and an MDC of 5 points. Our evaluation of the HPAS revealed good reliability and stability, suggesting suitability for ongoing evaluation as an outcome measure. Copyright © 2017 by the American Occupational Therapy Association, Inc.
Ward, Dianne S; Mazzucca, Stephanie; McWilliams, Christina; Hales, Derek
2015-09-26
Early care and education (ECE) centers are important settings influencing young children's diet and physical activity (PA) behaviors. To better understand their impact on diet and PA behaviors as well as to evaluate public health programs aimed at ECE settings, we developed and tested the Environment and Policy Assessment and Observation - Self-Report (EPAO-SR), a self-administered version of the previously validated, researcher-administered EPAO. Development of the EPAO-SR instrument included modification of items from the EPAO, community advisory group and expert review, and cognitive interviews with center directors and classroom teachers. Reliability and validity data were collected across 4 days in 3-5 year old classrooms in 50 ECE centers in North Carolina. Center teachers and directors completed relevant portions of the EPAO-SR on multiple days according to a standardized protocol, and trained data collectors completed the EPAO for 4 days in the centers. Reliability and validity statistics calculated included percent agreement, kappa, correlation coefficients, coefficients of variation, deviations, mean differences, and intraclass correlation coefficients (ICC), depending on the response option of the item. Data demonstrated a range of reliability and validity evidence for the EPAO-SR instrument. Reporting from directors and classroom teachers was consistent and similar to the observational data. Items that produced strongest reliability and validity estimates included beverages served, outside time, and physical activity equipment, while items such as whole grains served and amount of teacher-led PA had lower reliability (observation and self-report) and validity estimates. To overcome lower reliability and validity estimates, some items need administration on multiple days. This study demonstrated appropriate reliability and validity evidence for use of the EPAO-SR in the field. The self-administered EPAO-SR is an advancement of the measurement of ECE settings and can be used by researchers and practitioners to assess the nutrition and physical activity environments of ECE settings.
Translation, Adaptation and Cross Language Validation of Tinnitus Handicap Inventory in Urdu.
Aqeel, Muhammad; Ahmed, Ammar
2017-12-01
Tinnitus is characterized as a perception of numerous auditory sounds in absence of external stimulus. Tinnitus can have a considerable consequence on a person's quality of life, and is considered to be very complicated to quantify. The aim of this study was to investigate the reliability and validity of Urdu translation of the Tinnitus Handicap Inventory (THI) in Pakistan. It was designed to assess the presence of various auditory sounds without the external stimulus. Scale consisted of 25 items having three subscales functional, emotional, and catastrophic. The study comprised into two stages, preliminary and main studies. The results of preliminary study revealed that the overall scale had high internal consistency [alpha coefficient of Urdu version of THI (THI-U)= 0.99, alpha coefficient of English version of THI=0.98]. The overall scale had test-retest correlation over a fifteen days period of interval (0.99). Main study was performed on 110 tinnitus patients. The results of main study showed that the internal consistency and reliability of Urdu version was (α=0.93). The THI-U and its subscales demonstrated good internal consistency reliability ( α =0.81 to 0.86). High to moderate correlations were noted between tinnitus symptom ratings. A confirmatory factor analysis was used to validate the three subscales of THI-U, and high inter-correlations were found between the subscales also results revealed that a three-factor model for the THI-U was most tenable. The results displayed that the confirmatory factor analysis confirmed to validate the three subscales of THI-U. THI-U might present important information about precise facets of tinnitus distress along with diagnostic interviews in clinical practice.
The psychometric properties of exercise benefits/barriers scale among women
Farahani, Leila Amiri; Parvizy, Soroor; Mohammadi, Eesa; Asadi-Lari, Mohsen; Kazemnejad, Anoshiravan; Hasanpoor-Azgahdy, Seyede Batool; Taghizadeh, Ziba
2017-01-01
Background and objective Despite the numerous health benefits of regular physical activity (PA), physical inactivity is a major health issue among women. The goal of the current study was to measure the validity and reliability assessment of the exercise benefits/barriers scale among women between the ages of 18 and 65 years. This study was carried out among women residing in Khoramroudi neighborhood in Tehran between December 2013 and February 2014. Methods In this descriptive, methodological study, 278 women residing in Khoramroudi neighborhood in Tehran between December 2013 and February 2014 completed three questionnaires: the demographic data form, the Exercise Benefits/Barriers Scale. The construct validity, internal consistency, and stability of the study were measured by confirmatory factor analyses, Cronbach’s alpha, and Spearman Brown correlation coefficient by using SPSS 21 and LISREL 8.80, respectively. Results The confirmatory factor analysis showed the Persian version of EBBS was structured well. The Cronbach’s alpha coefficients for the total scale and its subscales were 0.927, 0.94 and 0.82, respectively. Spearman Brown correlation coefficient also showed good test-retest reliability. Conclusion The results of this study verified the reliability and validity of the applied instrument and introduced it as a tool to measure the benefits and barriers of physical activity among Iranian women. PMID:28894535
The psychometric properties of exercise benefits/barriers scale among women.
Farahani, Leila Amiri; Parvizy, Soroor; Mohammadi, Eesa; Asadi-Lari, Mohsen; Kazemnejad, Anoshiravan; Hasanpoor-Azgahdy, Seyede Batool; Taghizadeh, Ziba
2017-07-01
Despite the numerous health benefits of regular physical activity (PA), physical inactivity is a major health issue among women. The goal of the current study was to measure the validity and reliability assessment of the exercise benefits/barriers scale among women between the ages of 18 and 65 years. This study was carried out among women residing in Khoramroudi neighborhood in Tehran between December 2013 and February 2014. In this descriptive, methodological study, 278 women residing in Khoramroudi neighborhood in Tehran between December 2013 and February 2014 completed three questionnaires: the demographic data form, the Exercise Benefits/Barriers Scale. The construct validity, internal consistency, and stability of the study were measured by confirmatory factor analyses, Cronbach's alpha, and Spearman Brown correlation coefficient by using SPSS 21 and LISREL 8.80, respectively. The confirmatory factor analysis showed the Persian version of EBBS was structured well. The Cronbach's alpha coefficients for the total scale and its subscales were 0.927, 0.94 and 0.82, respectively. Spearman Brown correlation coefficient also showed good test-retest reliability. The results of this study verified the reliability and validity of the applied instrument and introduced it as a tool to measure the benefits and barriers of physical activity among Iranian women.
Grover, S; Chakrabarti, S; Ghormode, D; Dutt, A; Kate, N; Kulhara, P
2011-12-01
The Involvement Evaluation Questionnaire (IEQ) is a comprehensive, conceptually valid and reliable means of assessing caregiver burden. However, its psychometric properties have rarely been examined in non-European settings. The aim of the present study was to evaluate the psychometric properties of an Indian translation of the IEQ (Hindi-IEQ). The European Union (English) version of IEQ was translated into Hindi and reviewed by a group of experts and caregivers for translation accuracy, cultural appropriateness, and for relevance and acceptability of items and constructs. The Hindi-IEQ was then administered to 162 primary caregivers of patients with severe mental illnesses. Eighteen caregivers completed both the English and Hindi versions to check the level of agreement between them. Another 27 completed the Hindi-IEQ twice, a week apart, to evaluate its test-retest reliability. Factor structure of the Hindi-IEQ was examined using an exploratory, principal components and factor analysis. Pearson's correlation coefficients were significant for 24 items, while intraclass correlation coefficients were significant for 28 of the 31 items (P < 0.05), indicating a satisfactory level of agreement between the Hindi and English versions. Test-retest reliability for all items of the Hindi-IEQ was adequate, with kappa values ranging from 0.46 to 0.95 and intraclass correlation coefficients from 0.76 to 1.00. Internal consistency (Cronbach's alpha = 0.89) and the split-half reliability (Spearman-Brown coefficient = 0.68) of the Hindi-IEQ were also satisfactory. However, several differences were noted in the factor structure and distribution of scores of the Hindi-IEQ, which were quite unlike that of the European Union version. The similarities and differences between the 2 versions of the IEQ indicated that sociocultural factors could influence assessment of caregiver burden across different cultures.
The Trunk Impairment Scale - modified to ordinal scales in the Norwegian version.
Gjelsvik, Bente; Breivik, Kyrre; Verheyden, Geert; Smedal, Tori; Hofstad, Håkon; Strand, Liv Inger
2012-01-01
To translate the Trunk Impairment Scale (TIS), a measure of trunk control in patients after stroke, into Norwegian (TIS-NV), and to explore its construct validity, internal consistency, intertester and test-retest reliability. TIS was translated according to international guidelines. The validity study was performed on data from 201 patients with acute stroke. Fifty patients with stroke and acquired brain injury were recruited to examine intertester and test-retest reliability. Construct validity was analyzed with exploratory and confirmatory factor analysis and item response theory, internal consistency with Cronbach's alpha test, and intertester and test-retest reliability with kappa and intraclass correlation coefficient tests. The back-translated version of TIS-NV was validated by the original developer. The subscale Static sitting balance was removed. By combining items from the subscales Dynamic sitting balance and Coordination, six ordinal superitems (testlets) were constructed. The TIS-NV was renamed the modified TIS-NV (TIS-modNV). After modifications the TIS-modNV fitted well to a locally dependent unidimensional item response theory model. It demonstrated good construct validity, excellent internal consistency, and high intertester and test-retest reliability for the total score. This study supports that the TIS-modNV is a valid and reliable scale for use in clinical practice and research.
ERIC Educational Resources Information Center
Kim, Seonghoon; Feldt, Leonard S.
2010-01-01
The primary purpose of this study is to investigate the mathematical characteristics of the test reliability coefficient rho[subscript XX'] as a function of item response theory (IRT) parameters and present the lower and upper bounds of the coefficient. Another purpose is to examine relative performances of the IRT reliability statistics and two…
Bazzo, Stefania; Battistella, Giuseppe; Riscica, Patrizia; Moino, Giuliana; Dal Pozzo, Giuseppe; Bottarel, Mery; Geromel, Mariasole; Czerwinsky, Loredana
2015-01-01
Alcohol consumption during pregnancy can result in a range of harmful effects on the developing foetus and newborn, called Fetal Alcohol Spectrum Disorders (FASD). The identification of pregnant women who use alcohol enables to provide information, support and treatment for women and the surveillance of their children. The AUDIT-C (the shortened consumption version of the Alcohol Use Disorders Identification Test) is used for investigating risky drinking with different populations, and has been applied to estimate alcohol use and risky drinking also in antenatal clinics. The aim of the study was to investigate the reliability of a self-report Italian version of the AUDIT-C questionnaire to detect alcohol consumption during pregnancy, regardless of its use as a screening tool. The questionnaire was filled in by two independent consecutive series of pregnant women at the 38th gestation week visit in the two birth locations of the Local Health Authority of Treviso (Italy), during the years 2010 and 2011 (n=220 and n=239). Reliability analysis was performed using internal consistency, item-total score correlations, and inter-item correlations. The "discriminatory power" of the test was also evaluated. Results. Overall, about one third of women recalled alcohol consumption at least once during the current pregnancy. The questionnaire had an internal consistency of 0.565 for the group of the year 2010, of 0.516 for the year 2011, and of 0.542 for the overall group. The highest item total correlations' coefficient was 0.687 and the highest inter-item correlations' coefficient was 0.675. As for the discriminatory power of the questionnaire, the highest Ferguson's delta coefficient was 0.623. These findings suggest that the Italian self-report version of the AUDIT-C possesses unsatisfactory reliability to estimate alcohol consumption during pregnancy when used as self-report questionnaire in an obstetric setting.
Okamoto, Nozomi; Hisashige, Akinori; Tanaka, Yuu; Kurumatani, Norio
2013-01-01
The 15D is a self-administered questionnaire for assessment of health-related quality of life, which contains 15 questions with 5 response options each. This study was conducted to evaluate the reliability and validity of the Japanese 15D. The subjects were 430 community-dwelling elderly people. Each item of the 15D was scored on a 5-point Likert scale, with level 1 being the best, score 1. Reliability was assessed by determination of the internal consistency and test-retest reliability. Criterion-based validity was assessed using the Japanese version of the Nottingham Health Profile (NHP) and Tokyo Metropolitan Institute of Gerontology Index of Competence (TMIG index). Acceptability was assessed by inquiring about the time required to complete the questionnaire and the burden felt in responding to it. The answers of 423 individuals who responded to all items were analyzed. The median time required to complete the questionnaire was 5.0 minutes, and the proportion of subjects who indicated that the questionnaire was easy to complete was 98.3%. The Cronbach's alpha coefficients for all 15 items in the 2 surveys were 0.793 and 0.792, respectively. The intraclass correlation coefficients for the 15 items ranged from 0.44 to 0.72. In the relationship between the 15D and the NHP, the correlation coefficients between the corresponding domains were higher than those between non-corresponding domains. The prevalence of disability in higher-level functional capacity was higher in the "level 2 to 5" group than in the "level 1" group. The Japanese version of the 15D showed sufficient internal consistency and moderate repeatability. Because of the short time required to complete the Japanese 15D and the significant relationships between the scores on the 15D and the NHP, and between the 15D and higher-level functional capacity, the acceptability and validity of the Japanese 15D were considered to be sufficient.
Rushton, Paula W; Smith, Emma M; Miller, William C; Kirby, R Lee; Daoust, Geneviève
2018-01-31
The aim of this study was to evaluate the internal consistency, test-retest reliability and responsiveness of the Self-Efficacy in Assessing, Training and Spotting manual wheelchair skills (SEATS-M) and Self-Efficacy in Assessing, Training and Spotting power wheelchair skills (SEATS-P). A 2-week test-retest design was used with a convenience sample of occupational and physical therapists who worked at a provincial rehabilitation centre (inpatient and outpatient services). Sixteen participants completed the SEATS-M and 18 participants completed the SEATS-P. For the SEATS-M assessment, training, spotting and documentation sections, Cronbach's alpha coefficients ranged from 0.90 to 0.97, the 2-week intraclass correlation coefficients (ICC 1,1 ) ranged from 0.81 to 0.95, the standard error of measurements (SEM) ranged from 5.06 to 8.70 and the smallest real differences (SRD) ranged from 6.24 to 8.18. For the SEATS-P assessment, training, spotting and documentation sections, Cronbach's alpha coefficients ranged from 0.83 to 0.92, the ICCs ranged from 0.72 to 0.86, the SEMs ranged from 4.54 to 8.91 and the SRDs ranged from 5.90 to 8.27. There is preliminary evidence that both the SEATS-M and the SEATS-P have high internal consistency, good test-retest reliability and support for responsiveness. These tools can be used in evaluating clinician self-efficacy with assessing, training, spotting and documenting wheelchair skills included on the Wheelchair Skills Test. Implications for Rehabilitation There is preliminary evidence that the SEATS-M and SEATS-P are reliable and responsive outcome measures that can be used to evaluate the self-efficacy of clinicians to administer the Wheelchair Skills Program. Measurement of clinicians' self-efficacy in this area of practice may enable an enhanced understanding of the areas in which clinicians lack self-efficacy, thereby informing the development of improved knowledge translation interventions.
Validation of the Turkish version of the Breast Reduction Assessed Severity Scale.
Kececi, Yavuz; Sir, Emin; Zengel, Baha
2013-01-01
Measuring patient-reported outcomes has become increasingly important in cosmetic and reconstructive breast surgery. There is no validated questionnaire in Turkish to evaluate quality-of-life issues for patients with mammary hypertrophy. The authors describe the reliability and validity of a translated Breast Reduction Assessed Severity Scale (BRASS) in evaluating Turkish patients. The BRASS, developed by Sigurdson et al, was translated into Turkish adhering strictly to the guidelines of questionnaire translations. Statistical analysis was carried out with Cronbach's α to test the internal consistency and intraclass correlation coefficient for test-retest reliability. Exploratory factor analysis was carried out using principal component analysis with oblimin rotation to test its construct validity. Correlations between subscales identified in the factor analysis and corresponding domains in the Short Form-36 and Rosenberg Self-Esteem Scale were analyzed. The total instrument was found to have an α coefficient of 0.92 and subscale α coefficients ranging from 0.76 to 0.87. Intraclass correlation coefficient was 0.93 for the total scale and ranged from 0.81 to 0.91 for the subscales. Exploratory factor analysis resulted in a 5-factor structure: physical implications, body pain, physical appearance, poor self-concept, and negative social interactions. With this study, the reliability and validity of the Turkish version of the BRASS were revealed. This translated version can be used to evaluate the effect of mammary hypertrophy on quality of life in Turkish patients.
Reliability of Use, Abuse, and Dependence of Four Types of Inhalants in Adolescents and Young Adults
Ridenour, Ty A.; Bray, Bethany C.; Cottler, Linda B.
2007-01-01
Inhalants, as a class of drugs, consists of heterogeneous substances that include some of the most dangerous drugs on a per use basis. Research on inhalant abuse has lagged behind other drugs partly because of the need for a diagnostic instrument of different types of inhalants. This study was conducted to obtain reliability estimates for the new Substance Abuse Module DSM-IV inhalants diagnoses for four types of inhalants: aerosols, gases, nitrites, and solvents as well as different diagnostic configurations of inhalant use. Participants were 162 community sample adolescents or young adults (mean age = 20.3 years, SD = 2.4). Two-thirds of the sample was male and 83.3% was Caucasian. Kappas and intraclass correlation coefficients were computed to estimate test-retest reliabilities. Results suggested (a) abuse was more common than dependence (34.6% vs. 12.3%), (b) reliabilities of abuse criteria and diagnosis were good to excellent across subtypes, and (c) reliabilities of dependence criteria and diagnoses were poor to good across subtypes. Alternative configurations of DSM-IV criteria that were consistent with previous research on adolescents provided excellent reliabilities across subtypes of inhalants. Moreover, 11.1% of participants experienced inhalants withdrawal. PMID:17576041
Nazary-Moghadam, Salman; Zeinalzadeh, Afsaneh; Salavati, Mahyar; Almasi, Simin; Negahban, Hossein
2017-01-01
The aim of the present study was to culturally adapt and evaluate reliability and validity of Health Assessment Questionnaire-Disability Index (HAQ-DI) in Iranian patients with rheumatoid arthritis (RA). 234 patients with RA for validation study, Eighty-six participants for reliability study. Test-retest relative reliability and internal consistency of Persian version of HAQ-DI were examined by intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. Additionally, HAQ-DI construct validity (Spearman's correlation) was examined using Persian version of Short-Form 36 Health survey (SF-36), activity and severity parameters. Persian version of HAQ-DI total score showed excellent test-retest reliability (ICC = 0.98) and internal consistency (Cronbach's alpha = 0.95). Spearman's correlations between the total PHAQ-DI score and activity and severity parameters were above 0.55. Correlation between PHAQ-DI and SF-36 Physical Health were higher as compared with SF-36 Mental Health. Persian version of HAQ-DI is a reliable and valid culturally-adapted instrument in order to measure functional limitations in Iranian people with RA. Copyright © 2016 Elsevier Ltd. All rights reserved.
Scarponi, Letizia; de Felicio, Claudia Maria; Sforza, Chiarella; Pimenta Ferreira, Claudia Lucia; Ginocchio, Daniela; Pizzorni, Nicole; Barozzi, Stefania; Mozzanica, Francesco; Schindler, Antonio
2018-05-30
To evaluate the reliability, validity, and responsiveness of the Italian OMES (I-OMES). The study consisted of 3 phases: (1) internal consistency and reliability, (2) validity, and (3) responsiveness analysis. The recruited population included 27 patients with orofacial myofunctional disorders (OMD) and 174 healthy volunteers. Forty-seven subjects, 18 healthy and all recruited patients with OMD were assessed for inter-rater and test-retest reliability analysis. I-OMES and Nordic Orofacial Test - Screening (NOT-S) scores of the patients were correlated for concurrent validity analysis. I-OMES scores from 27 patients with OMD and 27 age- and gender-matched healthy subjects were compared to investigate construct validity. I-OMES scores before and after successful swallowing rehabilitation in patients were compared for responsiveness analysis. Adequate internal consistency (Cronbach α = 0.71) and strong inter-rater and test-retest reliability (intraclass coefficient correlation = 0.97 and 0.98, respectively) were found. I-OMES and NOT-S scores significantly and inversely correlated (r = -0.38). A statistical significance (p < 0.001) was found between the pathological group and the control group for the total I-OMES score. The mean I-OMES score improved from 90 (78-102) to 99 (89-103) after myofunctional rehabilitation (p < 0.001). The I-OMES is a reliable and valid tool to evaluate OMD. © 2018 S. Karger AG, Basel.
ERIC Educational Resources Information Center
Weber, Deborah A.
Greater understanding and use of confidence intervals is central to changes in statistical practice (G. Cumming and S. Finch, 2001). Reliability coefficients and confidence intervals for reliability coefficients can be computed using a variety of methods. Estimating confidence intervals includes both central and noncentral distribution approaches.…
Kolokotroni, Philippa; Anagnostopoulos, Fotios; Missitzis, Ioannis
2017-07-01
The study and measurement of psychosocial adjustment is important for evaluating patients' well-being, and assessing the illness's course, treatment's success, and patients' recovery. In this study, internal consistency reliability and construct validity of the Greek version of the Psychosocial Adjustment to Illness Scale-Self-Report (PAIS-SR) were examined. Demographic and psychosocial data were collected from a sample of 243 women with breast cancer, recruited from September 2011 to December 2012. With some exceptions in specific items, the original conceptually-derived PAIS-SR subscales emerged in a seven-factor solution. Social Environment, Job and Household Duties, and Psychological Distress accounted for more of the total variance than other subscales. PAIS-SR showed good internal consistency reliability, with Cronbach's alpha coefficients >0.62. Correlations of PAIS-SR domains with measures of quality of life and posttraumatic stress symptoms supported the convergent validity of the PAIS-SR and its significance for cancer research. The Greek version of the PAIS-SR has acceptable internal consistency reliability and construct validity, as well as satisfactory convergent validity. Results provide some suggestions for the development of programs to evaluate adjustment status and implement psychosocial interventions among breast cancer survivors.
Incorporation of detailed eye model into polygon-mesh versions of ICRP-110 reference phantoms
NASA Astrophysics Data System (ADS)
Tat Nguyen, Thang; Yeom, Yeon Soo; Kim, Han Sung; Wang, Zhao Jun; Han, Min Cheol; Kim, Chan Hyeong; Lee, Jai Ki; Zankl, Maria; Petoussi-Henss, Nina; Bolch, Wesley E.; Lee, Choonsik; Chung, Beom Sun
2015-11-01
The dose coefficients for the eye lens reported in ICRP 2010 Publication 116 were calculated using both a stylized model and the ICRP-110 reference phantoms, according to the type of radiation, energy, and irradiation geometry. To maintain consistency of lens dose assessment, in the present study we incorporated the ICRP-116 detailed eye model into the converted polygon-mesh (PM) version of the ICRP-110 reference phantoms. After the incorporation, the dose coefficients for the eye lens were calculated and compared with those of the ICRP-116 data. The results showed generally a good agreement between the newly calculated lens dose coefficients and the values of ICRP 2010 Publication 116. Significant differences were found for some irradiation cases due mainly to the use of different types of phantoms. Considering that the PM version of the ICRP-110 reference phantoms preserve the original topology of the ICRP-110 reference phantoms, it is believed that the PM version phantoms, along with the detailed eye model, provide more reliable and consistent dose coefficients for the eye lens.
Validation of the Turkish Cervical Cancer and Human Papilloma Virus Awareness Questionnaire.
Özdemir, E; Kısa, S
2016-09-01
The aim of this study was to determine the validity and reliability of the 'Cervical Cancer and Human Papilloma Virus Awareness Questionnaire' among fertility age women by adapting the scale into Turkish. Cervical cancer is the fourth most commonly form seen among women. Death from cervical cancer ranks third among causes and is one of the most preventable forms of cancer. This cross-sectional study included 360 women from three family health centres between January 5 and June 25, 2014. Internal consistency showed that the Kuder-Richardson 21 reliability coefficient in the first part was 0.60, Cronbach's alpha reliability coefficient was 0.61 in the second part. The Kaiser-Meyer-Olkin value of the items on the scale was 0.712. The Barlett test was significant. The confirmatory factor analysis indicated that the model matched the data adequately. This study shows that the Turkish version of the instrument is a valid and reliable tool to evaluate knowledge, perceptions and preventive behaviours of women regarding human papilloma virus and cervical cancer. Nurses who work in the clinical and primary care settings need to screen, detect and refer women who may be at risk from cervical cancer. © 2016 International Council of Nurses.
Gorgos, Kara S; Wasylyk, Nicole T; Van Lunen, Bonnie L; Hoch, Matthew C
2014-04-01
Joint mobilizations are commonly used by clinicians to decrease pain and restore joint arthrokinematics following musculoskeletal injury. The force applied during a joint mobilization treatment is subjective to the individual clinician but may have an effect on patient outcomes. The purpose of this systematic review was to critically appraise and synthesize the studies which examined the reliability of clinicians' force application during joint mobilization. A systematic search of PubMed and EBSCO Host databases from inception to March 1, 2013 was conducted to identify studies assessing the reliability of force application during joint mobilizations. Two reviewers utilized the Quality Appraisal of Reliability Studies (QAREL) assessment tool to determine the quality of included studies. The relative reliability of the included studies was examined through intraclass correlation coefficients (ICC) to synthesize study findings. All results were collated qualitatively with a level of evidence approach. A total of seven studies met the eligibility and were included. Five studies were included that assessed inter-clinician reliability, and six studies were included that assessed intra-clinician reliability. The overall level of evidence for inter-clinician reliability was strong for poor-to-moderate reliability (ICC = -0.04 to 0.70). The overall level of evidence for intra-clinician reliability was strong for good reliability (ICC = 0.75-0.99). This systematic review indicates there is variability in force application between clinicians but individual clinicians apply forces consistently. The results of this systematic review suggest innovative instructional methods are needed to improve consistency and validate the forces applied during of joint mobilization treatments. This is particularly evident for improving the consistency of force application across clinicians. Copyright © 2014 Elsevier Ltd. All rights reserved.
Boer, Annemarie; Dutmer, Alisa L; Schiphorst Preuper, Henrica R; van der Woude, Lucas H V; Stewart, Roy E; Deyo, Richard A; Reneman, Michiel F; Soer, Remko
2017-10-01
Validation study with cross-sectional and longitudinal measurements. To translate the US National Institutes of Health (NIH)-minimal dataset for clinical research on chronic low back pain into the Dutch language and to test its validity and reliability among people with chronic low back pain. The NIH developed a minimal dataset to encourage more complete and consistent reporting of clinical research and to be able to compare studies across countries in patients with low back pain. In the Netherlands, the NIH-minimal dataset has not been translated before and measurement properties are unknown. Cross-cultural validity was tested by a formal forward-backward translation. Structural validity was tested with exploratory factor analyses (comparative fit index, Tucker-Lewis index, and root mean square error of approximation). Hypothesis testing was performed to compare subscales of the NIH dataset with the Pain Disability Index and the EurQol-5D (Pearson correlation coefficients). Internal consistency was tested with Cronbach α and test-retest reliability at 2 weeks was calculated in a subsample of patients with Intraclass Correlation Coefficients and weighted Kappa (κω). In total, 452 patients were included of which 52 were included for the test-retest study. factor analysis for structural validity pointed into the direction of a seven-factor model (Cronbach α = 0.78). Factors and total score of the NIH-minimal dataset showed fair to good correlations with Pain Disability Index (r = 0.43-0.70) and EuroQol-5D (r = -0.41 to -0.64). Reliability: test-retest reliability per item showed substantial agreement (κω=0.65). Test-retest reliability per factor was moderate to good (Intraclass Correlation Coefficient = 0.71). The Dutch language version measurement properties of the NIH-minimal were satisfactory. N/A.
Anees, Muhammad; Ibrahim, Muhammad; Imtiaz, Marium; Batool, Shazia; Elahi, Irfan; Malik, Muzammil Riaz
2016-08-01
To translate, validate and assess the reliability of kidney disease quality of life - short form (KDQOL-SF-36) in Urdu, national language of Pakistan. Amulticentric descriptive cross-sectional study. Department of Nephrology, Mayo Hospital, Lahore, from February to July 2015. Patients of end-stage renal disease (ESRD) on maintenance hemodialysis (MHD) for more than three months, were included in the study. Patients of ESRD not on dialysis, and those with acute renal failure were excluded. The English version of KDQOL-SF-36 was translated in Urdu and then translated back in English; further validation was done by a senior professor of Punjab University, Lahore. One hundred and thirty patients were included in the study. Fifty patients were from Mayo Hospital, 35 from Shalamar Hospital and 50 from Shaikh Zayed Hospital, Lahore. The internal consistency reliability coefficient for overall scale was 0.84. Twelve sub-scales (symptoms, effect of kidney disease, burden of kidney disease, cognitive function, quality of social interaction, sexual function, social support, physical functioning, role physical, pain, emotional well-being and role emotional) had more than 0.70 internal consistency reliability coefficient. Overall mean score of the domains i.e kidney disease component score (KDCS), physical component score (PCS), and mental component score (MCS) was 60.62 ±17.61, 43.12 ±19.54, and 49.27 ±14.52, respectively. Asignificant positive relationship was observed between KDCS and MCS domains, KDCS and PCS domains, PCS, and MCS domains. The Urdu version of KDQOL-SF-36 is a reliable and valid version to measure QOLin kidney disease patients on dialysis in Pakistan.
2012-01-01
Background This study aimed to investigate the reliability and validity of the Iranian version of the Pediatric Quality of Life Inventory™ 4.0 (PedsQL™ 4.0) Generic Core Scales in children. Methods A standard forward and backward translation procedure was used to translate the US English version of the PedsQL™ 4.0 Generic Core Scales for children into the Iranian language (Persian). The Iranian version of the PedsQL™ 4.0 Generic Core Scales was completed by 503 healthy and 22 chronically ill children aged 8-12 years and their parents. The reliability was evaluated using internal consistency. Known-groups discriminant comparisons were made, and exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were conducted. Results The internal consistency, as measured by Cronbach's alpha coefficients, exceeded the minimum reliability standard of 0.70. All monotrait-multimethod correlations were higher than multitrait-multimethod correlations. The intraclass correlation coefficients (ICC) between the children self-report and parent proxy-reports showed moderate to high agreement. Exploratory factor analysis extracted six factors from the PedsQL™ 4.0 for both self and proxy reports, accounting for 47.9% and 54.8% of total variance, respectively. The results of the confirmatory factor analysis for 6-factor models for both self-report and proxy-report indicated acceptable fit for the proposed models. Regarding health status, as hypothesized from previous studies, healthy children reported significantly higher health-related quality of life than those with chronic illnesses. Conclusions The findings support the initial reliability and validity of the Iranian version of the PedsQL™ 4.0 as a generic instrument to measure health-related quality of life of children in Iran. PMID:22221765
Amiri, Parisa; Eslamian, Ghazaleh; Mirmiran, Parvin; Shiva, Niloofar; Jafarabadi, Mohammad Asghari; Azizi, Fereidoun
2012-01-05
This study aimed to investigate the reliability and validity of the Iranian version of the Pediatric Quality of Life Inventory™ 4.0 (PedsQL™ 4.0) Generic Core Scales in children. A standard forward and backward translation procedure was used to translate the US English version of the PedsQL™ 4.0 Generic Core Scales for children into the Iranian language (Persian). The Iranian version of the PedsQL™ 4.0 Generic Core Scales was completed by 503 healthy and 22 chronically ill children aged 8-12 years and their parents. The reliability was evaluated using internal consistency. Known-groups discriminant comparisons were made, and exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were conducted. The internal consistency, as measured by Cronbach's alpha coefficients, exceeded the minimum reliability standard of 0.70. All monotrait-multimethod correlations were higher than multitrait-multimethod correlations. The intraclass correlation coefficients (ICC) between the children self-report and parent proxy-reports showed moderate to high agreement. Exploratory factor analysis extracted six factors from the PedsQL™ 4.0 for both self and proxy reports, accounting for 47.9% and 54.8% of total variance, respectively. The results of the confirmatory factor analysis for 6-factor models for both self-report and proxy-report indicated acceptable fit for the proposed models. Regarding health status, as hypothesized from previous studies, healthy children reported significantly higher health-related quality of life than those with chronic illnesses. The findings support the initial reliability and validity of the Iranian version of the PedsQL™ 4.0 as a generic instrument to measure health-related quality of life of children in Iran.
Çelik, Derya; Can, Canan; Aslan, Yasemin; Ceylan, Hasan Huseyin; Bilsel, Kerem; Ozdincler, Arzu Razak
2014-01-01
The Harris Hip Score (HHS) developed to assess function and pain from the perspective of patients hip pathologies. The purpose of this study was to translate and culturally adapt the HHS into Turkish, and thereby determine the reliability and validity of the translated version. The HHS was translated into Turkish in accordance with the stages recommended by Beaton. The measurement properties of the HHS were tested in 80 patients; 52 males, mean age 51 years (range 21-75 years) suffering from different hip pathologies. The test-retest reliability was tested in 58 patients; 28 males mean age, 52 years (range 30-73 years) after an interval of seven days. The Cronbach's Alpha was used to assess internal consistency and the intra-class correlation coefficient (ICC) was used to estimate the test-retest reliability. Patients were asked to answer the Oxford Hip Score (OHS), the Western Ontario and McMaster Universities Arthritis Index (WOMAC), the VAS and the Short Form-36 (SF-36) for the validity of the estimation. The Turkish version of the HHS showed sufficient internal consistency (Cronbach's alpha,0.70) and test-retest reliability (ICC = 0.91). The correlation coefficients between the HHS, the WOMAC and the OHS were 0.64 and 0.89 respectively. The highest correlations between the HHS and SF-36 were with the physical function scale (r = 0.72), and the lowest correlations were with the mental function scale (r = 0.10). We observed no floor or ceiling effects. The Turkish version of the HHS has sufficient reliability and validity to measure patient-reported outcome for Turkish-speaking individuals with a variety of hip disorders.
The reliability of the Adelaide in-shoe foot model.
Bishop, Chris; Hillier, Susan; Thewlis, Dominic
2017-07-01
Understanding the biomechanics of the foot is essential for many areas of research and clinical practice such as orthotic interventions and footwear development. Despite the widespread attention paid to the biomechanics of the foot during gait, what largely remains unknown is how the foot moves inside the shoe. This study investigated the reliability of the Adelaide In-Shoe Foot Model, which was designed to quantify in-shoe foot kinematics and kinetics during walking. Intra-rater reliability was assessed in 30 participants over five walking trials whilst wearing shoes during two data collection sessions, separated by one week. Sufficient reliability for use was interpreted as a coefficient of multiple correlation and intra-class correlation coefficient of >0.61. Inter-rater reliability was investigated separately in a second sample of 10 adults by two researchers with experience in applying markers for the purpose of motion analysis. The results indicated good consistency in waveform estimation for most kinematic and kinetic data, as well as good inter-and intra-rater reliability. The exception is the peak medial ground reaction force, the minimum abduction angle and the peak abduction/adduction external hindfoot joint moments which resulted in less than acceptable repeatability. Based on our results, the Adelaide in-shoe foot model can be used with confidence for 24 commonly measured biomechanical variables during shod walking. Copyright © 2017 Elsevier B.V. All rights reserved.
Nyitray, Alan G; Harris, Robin B; Abalos, Andrew T; Nielson, Carrie M; Papenfuss, Mary; Giuliano, Anna R
2010-12-01
Accurate knowledge about human sexual behaviors is important for increasing our understanding of human sexuality; however, there have been few studies assessing the reliability of sexual behavior questionnaires designed for community samples of adult men. A test-retest reliability study was conducted on a questionnaire completed by 334 men who had been recruited in Tucson, Arizona. Reliability coefficients and refusal rates were calculated for 39 non-sexual and sexual behavior questionnaire items. Predictors of unreliable reporting for lifetime number of female sexual partners were also assessed. Refusal rates were generally low, with slightly higher refusal rates for questions related to immigration, income, the frequency of sexual intercourse with women, lifetime number of female sexual partners, and the lifetime number of male anal sex partners. Kappa and intraclass correlation coefficients were substantial or almost perfect for all non-sexual and sexual behavior items. Reliability dropped somewhat, but was still substantial, for items that asked about household income and the men's knowledge of their sexual partners' health, including abnormal Pap tests and prior sexually transmitted diseases (STD). Age and lifetime number of female sexual partners were independent predictors of unreliable reporting while years of education was inversely associated with unreliable reporting. These findings among a community sample of adult men are consistent with other test-retest reliability studies with populations of women and adolescents.
Yang, Nan; Waddington, Gordon; Adams, Roger; Han, Jia
2018-05-01
Quantitative assessments of handedness and footedness are often required in studies of human cognition and behaviour, yet no reliable Chinese versions of commonly used handedness and footedness questionnaires are available. Accordingly, the objective of the present study was to translate the Edinburgh Handedness Inventory (EHI) and the Waterloo Footedness Questionnaire-Revised (WFQ-R) into Mandarin Chinese and to evaluate the reliability and validity of these translated versions in healthy Chinese people. In the first stage of the study, Chinese versions of the EHI and WFQ-R were produced from a process of translation, back translation and examination, with necessary cultural adaptations. The second stage involved determining the reliability and validity of the translated EHI and WFQ-R for the Chinese population. One hundred and ten Chinese participants were tested online, and the results showed that the Cronbach's alpha coefficient of internal consistency was 0.877 for the translated EHI and 0.855 for the translated WFQ-R. Another 170 Chinese participants were tested and re-tested after a 30-day interval. The intra-class correlation coefficients showed high reliability, 0.898 for the translated EHI and 0.869 for the translated WFQ-R. This preliminary validation study found the translated versions to be reliable and valid tools for assessing handedness and footedness in this population.
Mi, Misa; Moseley, James L; Green, Michael L
2012-02-01
Many residency programs offer training in evidence-based medicine (EBM). However, these curricula often fail to achieve optimal learning outcomes, perhaps because they neglect various contextual factors in the learning environment. We developed and validated an instrument to characterize the environment for EBM learning and practice in residency programs. An EBM Environment Scale was developed following scale development principles. A survey was administered to residents across six programs in primary care specialties at four medical centers. Internal consistency reliability was analyzed with Cronbach's coefficient alpha. Validity was assessed by comparing predetermined subscales with the survey's internal structure as assessed via factor analysis. Scores were also compared for subgroups based on residency program affiliation and residency characteristics. Out of 262 eligible residents, 124 completed the survey (response rate 47%). The overall mean score was 3.89 (standard deviation=0.56). The initial reliability analysis of the 48-item scale had a high reliability coefficient (Cronbach α=.94). Factor analysis and further item analysis resulted in a shorter 36-item scale with a satisfactory reliability coefficient (Cronbach α=.86). Scores were higher for residents with prior EBM training in medical school (4.14 versus 3.62) and in residency (4.25 versus 3.69). If further testing confirms its properties, the EBM Environment Scale may be used to understand the influence of the learning environment on the effectiveness of EBM training. Additionally, it may detect changes in the EBM learning environment in response to programmatic or institutional interventions.
Psychometric assessment of a scale to measure bonding workplace social capital
Tsutsumi, Akizumi; Inoue, Akiomi; Odagiri, Yuko
2017-01-01
Objectives Workplace social capital (WSC) has attracted increasing attention as an organizational and psychosocial factor related to worker health. This study aimed to assess the psychometric properties of a newly developed WSC scale for use in work environments, where bonding social capital is important. Methods We assessed the psychometric properties of a newly developed 6-item scale to measure bonding WSC using two data sources. Participants were 1,650 randomly selected workers who completed an online survey. Exploratory factor analyses were conducted. We examined the item–item and item–total correlations, internal consistency, and associations between scale scores and a previous 8-item measure of WSC. We evaluated test–retest reliability by repeating the survey with 900 of the respondents 2 weeks later. The overall scale reliability was quantified by an intraclass coefficient and the standard error of measurement. We evaluated convergent validity by examining the association with several relevant workplace psychosocial factors using a dataset from workers employed by an electrical components company (n = 2,975). Results The scale was unidimensional. The item–item and item–total correlations ranged from 0.52 to 0.78 (p < 0.01) and from 0.79 to 0.89 (p < 0.01), respectively. Internal consistency was good (Cronbach’s α coefficient: 0.93). The correlation with the 8-item scale indicated high criterion validity (r = 0.81) and the scale showed high test–retest reliability (r = 0.74, p < 0.01). The intraclass coefficient and standard error of measurement were 0.74 (95% confidence intervals: 0.71–0.77) and 4.04 (95% confidence intervals: 1.86–6.20), respectively. Correlations with relevant workplace psychosocial factors showed convergent validity. Conclusions The results confirmed that the newly developed WSC scale has adequate psychometric properties. PMID:28662058
Lonjon, Guillaume; Ilharreborde, Brice; Odent, Thierry; Moreau, Sébastien; Glorion, Christophe; Mazda, Keyvan
2014-01-01
Outcome study to determine the internal consistency, reproducibility, and concurrent validity of the French-Canadian version of the Scoliosis Research Society 22 (SRS-22 fcv) patient questionnaire in France. To determine whether the SRS-22 fcv can be used in a population from France. The SRS-22 has been translated and validated in multiple countries, notably in the French-Canadian language in Quebec, Canada. Use of SRS-22 fcv seems appropriate for evaluating adolescent idiopathic scoliosis in France. However, French-Canadian French is noticeably different from the French spoken in France, and no study has investigated the use of a French-Canadian version of a health-quality questionnaire in another French population. The methods used for validating the SRS-22 fcv in Quebec were adopted for use with a group of 200 adolescents with idiopathic scoliosis and 60 healthy adolescents in France. Reliability and reproducibility were measured by the Cronbach α and intraclass correlation coefficient (ICC), construct validity by factorial analysis, concurrent validity by the Short-Form of the survey, and discriminant validity by analysis of variance and multivariate linear regression. In France, the SRS-22 fcv showed good global internal consistency (Cronbach α = 0.87, intraclass correlation coefficient = 0.92), a coherent factorial structure, and high correlation coefficients between the SRS-22 fcv and Short-Form of the survey (P < 0.001). However, reliability and validity were slightly less than that for the instrument's original validation and the validation of the SRS-22 fcv in Quebec. These differences could be explained by language and cultural differences. The SRS-22 fcv is relevant for use in France, but further development and validation of a specific French questionnaire remain necessary to improve the assessment of functional outcomes of adolescents with scoliosis in France. N/A.
Haidar, Rachid K; Kassak, Kassem; Masrouha, Karim; Ibrahim, Kamal; Mhaidli, Hani
2015-09-01
Cross-sectional validation and reliability assessment study of Arabic version of Scoliosis Research Society-22 (SRS-22r) Questionnaire. To develop and validate the Arabic version of the SRS-22r questionnaire. The diagnosis and treatment of adolescent idiopathic scoliosis may influence patient quality of life. SRS-22r is an internationally validated questionnaire used to assess function/activity, pain, self-image, and mental health of patients with scoliosis. It has been translated into several languages but not into Arabic language. Therefore, a valid health-related quality-of-life outcome questionnaire for patients with spinal deformity is still lacking in Arabic language. The English version of SRS-22r questionnaire was translated, back-translated, and culturally adapted to Arabic language. Then, 81 patients with idiopathic adolescent scoliosis were allocated randomly into either the reliability testing group (group 1) or the validity testing group (group 2). Group 1 patients completed Arabic version of SRS-22r questionnaire twice with 1-week interval in-between. Cronbach α and intraclass correlation coefficient were measured to determine internal consistency and temporal reliability. Group 2 patients completed the Arabic version of SRS-22r questionnaire and the previously validated Arabic version of 36-Item Short Form Health Survey (Short Form-36) questionnaire concurrently, and Pearson correlation coefficient was obtained to assess validity. Content analysis, internal consistency reliability, test/retest reproducibility (intraclass correlation coefficient range: 0.82-0.90), and test of concurrent validity showed satisfactory results. Function/activity and satisfaction with management domains had a lower Cronbach α (0.58 and 0.44, respectively, vs. 0.71-0.85 range for others). Self-image/appearance and satisfaction with management had a lower correlation with domains of the 36-Item Short Form Health Survey. An Arabic version of the SRS-22r questionnaire has been developed and validated. This questionnaire will aid health care workers and researchers in evaluation of patient perception of the deformity, satisfaction with treatment, and quality of life in Arabic-speaking populations. 3.
Ko, Jupil; Rosen, Adam B; Brown, Cathleen N
2015-12-01
The Cumberland Ankle Instability Tool (CAIT) is a valid and reliable patient reported outcome used to assess the presence and severity of chronic ankle instability (CAI). The CAIT has been cross-culturally adapted into other languages for use in non-English speaking populations. However, there are no valid questionnaires to assess CAI in individuals who speak Korean. The purpose of this study was to translate, cross-culturally adapt, and validate the CAIT, for use in a Korean-speaking population with CAI. Cross-cultural reliability study. The CAIT was cross-culturally adapted into Korean according to accepted guidelines and renamed the Cumberland Ankle Instability Tool-Korean (CAIT-K). Twenty-three participants (12 males, 11 females) who were bilingual in English and Korean were recruited and completed the original and adapted versions to assess agreement between versions. An additional 168 national level Korean athletes (106 male, 62 females; age = 20.3 ± 1.1 yrs), who participated in ≥ 90 minutes of physical activity per week, completed the final version of the CAIT-K twice within 14 days. Their completed questionnaires were assessed for internal consistency, test-retest reliability, criterion validity, and construct validity. For bilingual participants, intra-class correlation coefficients (ICC2,1) between the CAIT and the CAIT-K for test-retest reliability were 0.95 (SEM=1.83) and 0.96 (SEM=1.50) in right and left limbs, respectively. The Cronbach's alpha coefficients were 0.92 and 0.90 for the CAIT-K in right and left limbs, respectively. For native Korean speakers, the CAIT-K had high internal consistency (Cronbach's α=0.89) and intra-class correlation coefficient (ICC2,1 = 0.94, SEM=1.72), correlation with the physical component score (rho=0.70, p = 0.001) of the Short-Form Health Survey (SF-36), and the Kaiser-Meyer-Olkin score was 0.87. The original CAIT was translated, cross-culturally adapted, and validated from English to Korean. The CAIT-K appears to be valid and reliable and could be useful in assessing the Korean speaking population with CAI.
Evidence of Validity for the Japanese Version of the Foot and Ankle Ability Measure
Uematsu, Daisuke; Suzuki, Hidetomo; Sasaki, Shogo; Nagano, Yasuharu; Shinozuka, Nobuyuki; Sunagawa, Norihiko; Fukubayashi, Toru
2015-01-01
Context: The Foot and Ankle Ability Measure (FAAM) is a valid, reliable, and self-reported outcome instrument for the foot and ankle region. Objective: To provide evidence for translation, cross-cultural adaptation, validity, and reliability of the Japanese version of the FAAM (FAAM-J). Design: Cross-sectional study. Setting: Collegiate athletic training/sports medicine clinical setting. Patients or Other Participants: Eighty-three collegiate athletes. Main Outcome Measure(s): All participants completed the Activities of Daily Living and Sports subscales of the FAAM-J and the Physical Functioning and Mental Health subscales of the Japanese version of the Short Form-36v2 (SF-36). Also, 19 participants (23%) whose conditions were expected to be stable completed another FAAM-J 2 to 6 days later for test-retest reliability. We analyzed the scores of those subscales for convergent and divergent validity, internal consistency, and test-retest reliability. Results: The Activities of Daily Living and Sports subscales of the FAAM-J had correlation coefficients of 0.86 and 0.75, respectively, with the Physical Functioning section of the SF-36 for convergent validity. For divergent validity, the correlation coefficients with Mental Health of the SF-36 were 0.29 and 0.27 for each subscale, respectively. Cronbach α for internal consistency was 0.99 for the Activities of Daily Living and 0.98 for the Sports subscale. A 95% confidence interval with a single measure was ±8.1 and ±14.0 points for each subscale. The test-retest reliability measures revealed intraclass correlation coefficient values of 0.87 for the Activities of Daily Living and 0.91 for the Sports subscales with minimal detectable changes of ±6.8 and ±13.7 for the respective subscales. Conclusions: The FAAM was successfully translated for a Japanese version, and the FAAM-J was adapted cross-culturally. Thus, the FAAM-J can be used as a self-reported outcome measure for Japanese-speaking individuals; however, the scores must be interpreted with caution, especially when applied to different populations and other types of injury than those included in this study. PMID:25310247
Togari, Taisuke; Yamazaki, Yoshihiko; Koide, Syotaro; Miyata, Ayako
2006-01-01
In community and workplace health plans, the Perceived Health Competence Scale (PHCS) is employed as an index of health competency. The purpose of this research was to examine the reliability and validity of a modified Japanese PHCS. Interviews were sought with 3,000 randomly selected Japanese individuals using a two-step stratified method. Valid PHCS responses were obtained from 1,910 individuals, yielding a 63.7% response rate. Reliability was assessed using Cronbach's alpha coefficient (henceforth, alpha) to evaluate internal consistency, and by employing item-total correlation and alpha coefficient analyses to assess the effect of removal of variables from the model. To examine content validity, we assessed the correlation between the PHCS score and four respondent attribute characteristics, that is, sex, age, the presence of chronic disease, and the existence of chronic disease at age 18. The correlation between PHCS score and commonly employed healthy lifestyle indices was examined to assess construct validity. General linear model statistical analysis was employed. The modified Japanese PHCS demonstrated a satisfactory alpha coefficient of 0.869. Moreover, reliability was confirmed by item-total correlation and alpha coefficient analyses after removal of variables from the model. Differences in PHCS scores were seen between individuals 60 years and older, and younger individuals. These with current chronic disease, or who had had a chronic disease at age 18, tended to have lower PHCS scores. After controlling for the presence of current or age 18 chronic disease, age, and sex, significant correlations were seen between PHCS scores and tobacco use, dietary habits, and exercise, but not alcohol use or frequency of medical consultation. This study supports the reliability and validity, and hence supports the use, of the modified Japanese PHCS. Future longitudinal research is needed to evaluate the predictive power of modified Japanese PHCS scores, to examine factors influencing the development of perceived health competence, and to assess the effects of interventions on perceived health competence.
ERIC Educational Resources Information Center
Barchard, Kimberly A.
2012-01-01
This article introduces new statistics for evaluating score consistency. Psychologists usually use correlations to measure the degree of linear relationship between 2 sets of scores, ignoring differences in means and standard deviations. In medicine, biology, chemistry, and physics, a more stringent criterion is often used: the extent to which…
Aksoy Derya, Yeşim; Timur Taşhan, Sermin; Duman, Mesude; Durgun Ozan, Yeter
2018-07-01
The purpose of this study was to create a Turkish version of the Pregnancy-Related Anxiety Questionnaire-Revised 2 (PRAQR2), which was revised for application to multiparous and primiparous pregnancy, and to explore its psychometric characteristics in multiparous and primiparous pregnancy. This study was methodologically designed to assess the reliability and validity of the PRAQ-R2. The study was carried out in the obstetrics clinic of a training and research hospital in Malatya. A total of 616 healthy pregnant women (399 multiparous and 217 primiparous) constituted the sample of the study. The cultural adaptation process of the questionnaire was conducted in three phases: language validity, content validity, and pilot application. Exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were used to test the construct validity of the questionnaire. The reliability of the PRAQ-R2 was evaluated with Cronbach's alpha internal consistency coefficient, item-total correlation, test-retest analysis, and parallel forms reliability. The EFA revealed that the PRAQ-R2 consists of 10 items for the multiparous group and 11 for the primiparous group after adding the item ``I am anxious about the delivery because I have never experienced one before.'' The CFA for both groups supported the three-factor questionnaire yielded by the EFA. Good fit index values were obtained in both groups. Cronbach's alpha internal consistency coefficient ranged from 0.81 to 0.93 for the multiparous group and 0.87 to 0.94 for the primiparous group for the complete PRAQ-R2 and each of its subdimensions. In addition, the item-total correlation, test-retest analysis, and parallel forms reliability of the questionnaire were highly correlated. The PRAQ-R2 is a valid and reliable instrument that can be used to evaluate the level of anxiety in Turkish pregnant women irrespective of parity. The use of the PRAQ-R2 in prenatal healthcare services will contribute to the early diagnosis, treatment, and management of pregnancy-related anxiety. Copyright © 2018 Elsevier Ltd. All rights reserved.
Charafeddine, Lama; Sinno, Durriyah; Ammous, Farah; Yassin, Walid; Al-Shaar, Laila; Mikati, Mohamad A
2013-09-01
Early detection of developmental delay is essential to initiate early intervention. The Ages and Stages Questionnaires (ASQ) correlate well with physician's assessment and have high predictive value. No such tool exists in Arabic. Translate and test the applicability and reliability of Arabic translated Ages and Stages Questionnaires (A-ASQ) in an Arabic speaking population. 733 healthy children were assessed. ASQ-II for 10 age groups (4-60 months) were translated to Arabic, back translations and cultural adaptation were performed. Test-retest reliability and internal consistency were evaluated using Pearson Correlation Coefficient (CC) and Cronbach's alpha (Cα). Mean scores per domain were compared to US normative scores using t-test. A-ASQ, after culturally relevant adaptations, was easily administered for 4-36 months age groups but not for 4-5 year old due to numerous cultural differences in the later. For the 4-36 month age groups Pearson CC ranged from 0.345 to 0.833. The internal consistency coefficients Cα scores ranged from 0.111 to 0.816. Significant differences were found in the mean domain scores of all age groups between Lebanese and US normative sample (p-value <0.001) with some exceptions in gross motor, fine motor and personal social domains. A-ASQ was easily translated and administered with acceptable internal consistency and reliability in the younger age groups. It proved to be culturally sensitive, which should be taken into consideration when adapting such tool to non-western populations. Copyright © 2013 European Paediatric Neurology Society. Published by Elsevier Ltd. All rights reserved.
Melguizo-Herrera, Estela; Álvarez-Romero, Yuleysi; Cabarcas-Mendoza, Mayerlin Vanessa; Calvo-Rodríguez, Rossy Stefanie; Flórez-Almanza, Jeomaidis; Moadie-Contreras, Olga Patricia; Campo-Arias, Adalberto
2015-01-01
There are many stereotypes and prejudices about the sexual lives of the elderly. However, there are no validated and reliable tools for measuring these in the Latin-American context. To determine the internal consistency, dimensionality, differential item functioning (DIF) by gender and stability of the Attitudes towards Sexuality in the Elderly Questionnaire (ASEQ) in adults over 60 years-old in Cartagena, Colombia. A validation study was designed that included a sample of 130 participants without cognitive impairment attending a Life Center. The ages ranged between 60 and 90 years (mean, 73.7±8.0), and there were 61.5% females. Internal consistency was calculated using Cronbach alpha and McDonald omega, exploratory factor analysis (EFA) (dimensionality), DIF by gender (item response theory) with Kendall correlation, and stability (reproducibility) with Pearson correlation and intraclass correlation coefficient (ICC). The ASEQ showed high internal consistency on the first application (α=.83 and ω=.87) and in the second one (α=.85 and ω=.89). AFE showed two salient factors (prejudices and limitations) that explained 42.6% of the total variance. The IDF presented appropriate coefficients, with the exception of item 14 that showed a high value (τ=.37). ASEQ showed high stability (r=.82 and ICC=.89; 95% confidence interval, 0.83- 0.92; P<.001). ASEQ is a two-dimensional and reliable scale in older adults attending a Life Center in Cartagena, Colombia. New studies are required to evaluate the performance in a representative sample. Copyright © 2014 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Panah, Sara Hojat; Baharlouie, Hamze; Rezaeian, Zahra Sadat; Hawker, Gilian
2016-01-01
The present study aimed to translate and evaluate the reliability and validity of the Persian version of the 11-item Intermittent and Constant Osteoarthritis Pain (ICOAP) measure in Iranian subjects with Knee Osteoarthritis (KOA). The ICOAP questionnaire was translated according to the Manufacturers Alliance for Productivity and Innovation (MAPI) protocol. The procedure consisted of forward and backward translation, as well as the assessment of the psychometric properties of the Persian version of the questionnaire. A sample of 230 subjects with KOA was asked to complete the Persian versions of ICOAP and Knee injury and Osteoarthritis Outcome Score (KOOS). The ICOAP was readministered to forty subjects five days after the first visit. Test-retest reliability was assessed using Intraclass Correlation Coefficient (ICC), and internal consistency was assessed by Cronbach's alpha and item-total correlation. The correlation between ICOAP and KOOS was determined using Spearman's correlation coefficient. Subjects found the Persian-version of the ICOAP to be clear, simple, and unambiguous, confirming its face validity. Spearman correlations between ICOAP total and subscale scores with KOOS scores were between 0.5 and 0.7, confirming construct validity. Cronbach's alpha, used to assess internal consistency, was 0.89, 0.93, and 0.92 for constant pain, intermittent pain, and total pain scores, respectively. The ICC was 0.90 for constant pain and 0.91 for the intermittent pain and total pain score. The Persian version of the ICOAP is a reliable and valid outcome measure that can be used in Iranian subjects with KOA.
Assessing the Reliability of Student Evaluations of Teaching: Choosing the Right Coefficient
ERIC Educational Resources Information Center
Morley, Donald
2014-01-01
Many of the studies used to support the claim that student evaluations of teaching are reliable measures of teaching effectiveness have frequently calculated inappropriate reliability coefficients. This paper points to three coefficients that would be appropriate depending on if student evaluations were used for formative or summative purposes.…
Development of the Assessment of Belief Conflict in Relationship-14 (ABCR-14).
Kyougoku, Makoto; Teraoka, Mutsumi; Masuda, Noriko; Ooura, Mariko; Abe, Yasushi
2015-01-01
Nurses and other healthcare workers frequently experience belief conflict, one of the most important, new stress-related problems in both academic and clinical fields. In this study, using a sample of 1,683 nursing practitioners, we developed The Assessment of Belief Conflict in Relationship-14 (ABCR-14), a new scale that assesses belief conflict in the healthcare field. Standard psychometric procedures were used to develop and test the scale, including a qualitative framework concept and item-pool development, item reduction, and scale development. We analyzed the psychometric properties of ABCR-14 according to entropy, polyserial correlation coefficient, exploratory factor analysis, confirmatory factor analysis, average variance extracted, Cronbach's alpha, Pearson product-moment correlation coefficient, and multidimensional item response theory (MIRT). The results of the analysis supported a three-factor model consisting of 14 items. The validity and reliability of ABCR-14 was suggested by evidence from high construct validity, structural validity, hypothesis testing, internal consistency reliability, and concurrent validity. The result of the MIRT offered strong support for good item response of item slope parameters and difficulty parameters. However, the ABCR-14 Likert scale might need to be explored from the MIRT point of view. Yet, as mentioned above, there is sufficient evidence to support that ABCR-14 has high validity and reliability. The ABCR-14 demonstrates good psychometric properties for nursing belief conflict. Further studies are recommended to confirm its application in clinical practice.
Eren, Nurhan
2014-12-01
In this study, we aimed to develop two reliable and valid assessment instruments for investigating the level of difficulties mental health workers experience while working with patients with personality disorders and the attitudes they develop tt the patients. The research was carried out based on the general screening model. The study sample consisted of 332 mental health workers in several mental health clinics of Turkey, with a certain amount of experience in working with personality disorders, who were selected with a random assignment method. In order to collect data, the Personal Information Questionnaire, Difficulty of Working with Personality Disorders Scale (PD-DWS), and Attitudes Towards Patients with Personality Disorders Scale (PD-APS), which are being examined for reliability and validity, were applied. To determine construct validity, the Adjective Check List, Maslach Burnout Inventory, and State and Trait Anxiety Inventory were used. Explanatory factor analysis was used for investigating the structural validity, and Cronbach alpha, Spearman-Brown, Guttman Split-Half reliability analyses were utilized to examine the reliability. Also, item reliability and validity computations were carried out by investigating the corrected item-total correlations and discriminative indexes of the items in the scales. For the PD-DWS KMO test, the value was .946; also, a significant difference was found for the Bartlett sphericity test (p<.001). The computed test-retest coefficient reliability was .702; the Cronbach alpha value of the total test score was .952. For PD-APS KMO, the value was .925; a significant difference was found in Bartlett sphericity test (p<.001); the computed reliability coefficient based on continuity was .806; and the Cronbach alpha value of the total test score was .913. Analyses on both scales were based on total scores. It was found that PD-DWS and PD-APS have good psychometric properties, measuring the structure that is being investigated, are compatible with other scales, have high levels of internal reliability between their items, and are consistent across time. Therefore, it was concluded that both scales are valid and reliable instruments.
Testing the reliability of the Fall Risk Screening Tool in an elderly ambulatory population.
Fielding, Susan J; McKay, Michael; Hyrkas, Kristiina
2013-11-01
To identify and test the reliability of a fall risk screening tool in an ambulatory outpatient clinic. The Fall Risk Screening Tool (Albert Lea Medical Center, MN, USA) was scripted for an interview format. Two interviewers separately screened a convenience sample of 111 patients (age ≥ 65 years) in an ambulatory outpatient clinic in a northeastern US city. The interviewers' scoring of fall risk categories was similar. There was good internal consistency (Cronbach's α = 0.834-0.889) and inter-rater reliability [intra-class correlation coefficients (ICC) = 0.824-0.881] for total, Risk Factor and Client's Health Status subscales. The Physical Environment scores indicated acceptable internal consistency (Cronbach's α = 0.742) and adequate reliability (ICC = 0.688). Two Physical Environment items (furniture and medical equipment condition) had low reliabilities [Kappa (K) = 0.323, P = 0.08; K = -0.078, P = 0.648), respectively. The scripted Fall Risk Screening Tool demonstrated good reliability in this sample. Rewording two Physical Environment items will be considered. A reliable instrument such as the scripted Fall Risk Screening Tool provides a standardised assessment for identifying high fall risk patients. This tool is especially useful because it assesses personal, behavioural and environmental factors specific to community-dwelling patients; the interview format also facilitates patient-provider interaction. © 2013 John Wiley & Sons Ltd.
Measurement properties of the WOMAC LK 3.1 pain scale.
Stratford, P W; Kennedy, D M; Woodhouse, L J; Spadoni, G F
2007-03-01
The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) is applied extensively to patients with osteoarthritis of the hip or knee. Previous work has challenged the validity of its physical function scale however an extensive evaluation of its pain scale has not been reported. Our purpose was to estimate internal consistency, factorial validity, test-retest reliability, and the standard error of measurement (SEM) of the WOMAC LK 3.1 pain scale. Four hundred and seventy-four patients with osteoarthritis of the hip or knee awaiting arthroplasty were administered the WOMAC. Estimates of internal consistency (coefficient alpha), factorial validity (confirmatory factor analysis), and the SEM based on internal consistency (SEM(IC)) were obtained. Test-retest reliability [Type 2,1 intraclass correlation coefficients (ICC)] and a corresponding SEM(TRT) were estimated on a subsample of 36 patients. Our estimates were: internal consistency alpha=0.84; SEM(IC)=1.48; Type 2,1 ICC=0.77; SEM(TRT)=1.69. Confirmatory factor analysis failed to support a single factor structure of the pain scale with uncorrelated error terms. Two comparable models provided excellent fit: (1) a model with correlated error terms between the walking and stairs items, and between night and sit items (chi2=0.18, P=0.98); (2) a two factor model with walking and stairs items loading on one factor, night and sit items loading on a second factor, and the standing item loading on both factors (chi2=0.18, P=0.98). Our examination of the factorial structure of the WOMAC pain scale failed to support a single factor and internal consistency analysis yielded a coefficient less than optimal for individual patient use. An alternate strategy to summing the five-item responses when considering individual patient application would be to interpret item responses separately or to sum only those items which display homogeneity.
Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario
2012-04-01
Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
Reliability and validity of the de Morton Mobility Index in individuals with sub-acute stroke.
Braun, Tobias; Marks, Detlef; Thiel, Christian; Grüneberg, Christian
2018-02-04
To establish the validity and reliability of the de Morton Mobility Index (DEMMI) in patients with sub-acute stroke. This cross-sectional study was performed in a neurological rehabilitation hospital. We assessed unidimensionality, construct validity, internal consistency reliability, inter-rater reliability, minimal detectable change and possible floor and ceiling effects of the DEMMI in adult patients with sub-acute stroke. The study included a total sample of 121 patients with sub-acute stroke. We analysed validity (n = 109) and reliability (n = 51) in two sub-samples. Rasch analysis indicated unidimensionality with an overall fit to the model (chi-square = 12.37, p = 0.577). All hypotheses on construct validity were confirmed. Internal consistency reliability (Cronbach's alpha = 0.94) and inter-rater reliability (intraclass correlation coefficient = 0.95; 95% confidence interval: 0.92-0.97) were excellent. The minimal detectable change with 90% confidence was 13 points. No floor or ceiling effects were evident. These results indicate unidimensionality, sufficient internal consistency reliability, inter-rater reliability, and construct validity of the DEMMI in patients with a sub-acute stroke. Advantages of the DEMMI in clinical application are the short administration time, no need for special equipment and interval level data. The de Morton Mobility Index, therefore, may be a useful performance-based bedside test to measure mobility in individuals with a sub-acute stroke across the whole mobility spectrum. Implications for Rehabilitation The de Morton Mobility Index (DEMMI) is an unidimensional measurement instrument of mobility in individuals with sub-acute stroke. The DEMMI has excellent internal consistency and inter-rater reliability, and sufficient construct validity. The minimal detectable change of the DEMMI with 90% confidence in stroke rehabilitation is 13 points. The lack of any floor or ceiling effects on hospital admission indicates applicability across the whole mobility spectrum of patients with sub-acute stroke.
Reliability, validity, and significance of assessment of sense of contribution in the workplace.
Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito
2014-01-29
The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%-80.2%). Fifty-four workers were included in the analysis of test-retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach's α coefficients in men and women were 0.85 and 0.86, respectively) and test-retest reliability (intraclass correlation coefficient = 0.91). Significant (p < 0.001), positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS's convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05) and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort-reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
Psychometric Properties of the Persian Translation of the Sexual Quality of Life–Male Questionnaire
Maasoumi, Raziyeh; Mokarami, Hamidreza; Nazifi, Morteza; Stallones, Lorann; Taban, Abrahim; Yazdani Aval, Mohsen; Samimi, Kazem
2016-01-01
Sexual dysfunction has been demonstrated to be related to a poor quality of life. These dysfunctions are especially prevalent among men. This cross-sectional study aimed to investigate the psychometric properties of the Persian translation of the Sexual Quality of Life–Male (SQOL-M), translated and adapted to measure sexual quality of life among Iranian men. Forward–backward procedures were applied in translating the original SQOL-M into Persian, and then the psychometric properties of the Persian translation of the SQOL-M were studied. A total of 181 participants (23-60 years old) were included in the study. Validity was assessed by construct validity using confirmatory factor analysis, convergent validity, and content validity. The international index of erectile function (IIEF) and the work ability index were used to study the convergent validity. Reliability was evaluated through internal consistency and test–retest reliability analyses. The results from confirmatory factor analysis confirmed a one-factor solution for the Persian version of the SQOL-M. Content validity of the translated measure was endorsed by 10 specialists. Pearson correlations indicated that work ability index score, dimensions of the IIEF, and the IIEF total score were positively correlated with the Persian version of the SQOL-M (p < .001). Reliability evaluation indicated a high internal consistency and test–retest reliability. The Cronbach’s alpha coefficient and intraclass correlation coefficients were .96 and .95, respectively. Results indicated that the Persian version of the SQOL-M has good to excellent psychometric properties and can be used to assess the sexual quality of life among Iranian men. PMID:26856758
Psychometric Properties of the Persian Translation of the Sexual Quality of Life-Male Questionnaire.
Maasoumi, Raziyeh; Mokarami, Hamidreza; Nazifi, Morteza; Stallones, Lorann; Taban, Abrahim; Yazdani Aval, Mohsen; Samimi, Kazem
2017-05-01
Sexual dysfunction has been demonstrated to be related to a poor quality of life. These dysfunctions are especially prevalent among men. This cross-sectional study aimed to investigate the psychometric properties of the Persian translation of the Sexual Quality of Life-Male (SQOL-M), translated and adapted to measure sexual quality of life among Iranian men. Forward-backward procedures were applied in translating the original SQOL-M into Persian, and then the psychometric properties of the Persian translation of the SQOL-M were studied. A total of 181 participants (23-60 years old) were included in the study. Validity was assessed by construct validity using confirmatory factor analysis, convergent validity, and content validity. The international index of erectile function (IIEF) and the work ability index were used to study the convergent validity. Reliability was evaluated through internal consistency and test-retest reliability analyses. The results from confirmatory factor analysis confirmed a one-factor solution for the Persian version of the SQOL-M. Content validity of the translated measure was endorsed by 10 specialists. Pearson correlations indicated that work ability index score, dimensions of the IIEF, and the IIEF total score were positively correlated with the Persian version of the SQOL-M ( p < .001). Reliability evaluation indicated a high internal consistency and test-retest reliability. The Cronbach's alpha coefficient and intraclass correlation coefficients were .96 and .95, respectively. Results indicated that the Persian version of the SQOL-M has good to excellent psychometric properties and can be used to assess the sexual quality of life among Iranian men.
Reliability, Validity, and Significance of Assessment of Sense of Contribution in the Workplace
Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito
2014-01-01
The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%–80.2%). Fifty-four workers were included in the analysis of test–retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach’s α coefficients in men and women were 0.85 and 0.86, respectively) and test–retest reliability (intraclass correlation coefficient = 0.91). Significant (p < 0.001), positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS’s convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05) and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort–reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine. PMID:24481035
Ruiz, Begoña; Urzúa, Iván; Cabello, Rodrigo; Rodríguez, Gonzalo; Espelid, Ivar
2013-01-01
To translate and validate a Spanish version of the "Questionnaire on the treatment of approximal and occlusal caries" as a method of collecting information about treatment decisions on caries management in Chilean primary health care services. The original questionnaire proposed by Espelid et al. was translated into Spanish using the forward-backward translation technique. Subsequently, validation of the Spanish version was undertaken. Data were collected from two separate samples; first, from 132 Spanish-speaking dentists recruited from primary health care services and second, from 21 individuals characterised as cariologists. Internal consistency was evaluated by the generation of Cronbach's alpha, test-retest reliability was evaluated by Cohen's kappa, convergent validity was evaluated by comparing the total scale scores to a global evaluation of treatment trends and discriminant validity was evaluated by investigating the differences in total scale scores between the Spanish-speaking dentist and cariologist samples. Cronbach's alpha indicated an internal consistency of 0.63 for the entire scale. Cohen's kappa correlation coefficient expressed a test-retest reliability of 0.83. Convergent validity determined a Pearson's correlation coefficient of 0.24 (p < 0.01). The comparison of proportions (chi-squared) indicated that discriminant validity was statistically significant (p < 0.01), using a one-tailed test. The Spanish version of the "Questionnaire on the treatment of approximal and occlusal caries" is a valid and reliable instrument for collecting information regarding treatment decisions in cariology. The clinical relevance of this study is to acquire a reliable instrument that allows for the determination of treatment decisions in Spanish-speaking dentists.
[KON-2006--Neurotic Personality Questionnaire].
Aleksandrowicz, Jerzy W; Klasa, Katarzyna; Sobański, Jerzy A; Stolarska, Dorota
2007-01-01
Construction of a questionnaire describing personality traits connected to the occurrence and persistence of neurotic disorders. Responses of 794 patients (before treatment) and 520 persons from the control group on items of the constructed personality questionnaire and the symptom checklist "0". Analyses of subscales reliability and item-scale correlations, test-retest and split-half reliability. Factor analyses estimating internal reliability of the questionnaire. Cross-validation with the KO"0". symptom checklist Psychometric properties of KON-2006 questionnaire indicate that it is consistent and reliable enough. Validity analyses indicate a large probability that the X-KON coefficient informs on personality dysfunctions related to neurotic disorders. The Neurotic Personality Questionnaire KON-2006 may serve to estimate personality traits connected to the occurrence and persistence of neurotic disorders as well as changes resulting from psychotherapy.
The technological influence on health professionals' care: translation and adaptation of scales1
Almeida, Carlos Manuel Torres; Almeida, Filipe Nuno Alves dos Santos; Escola, Joaquim José Jacinto; Rodrigues, Vitor Manuel Costa Pereira
2016-01-01
Objectives: in this study, two research tools were validated to study the impact of technological influence on health professionals' care practice. Methods: the following methodological steps were taken: bibliographic review, selection of the scales, translation and cultural adaptation and analysis of psychometric properties. Results: the psychometric properties of the scale were assessed based on its application to a sample of 341 individuals (nurses, physicians, final-year nursing and medical students). The validity, reliability and internal consistency were tested. Two scales were found: Caring Attributes Questionnaire (adapted) with a Cronbach's Alpha coefficient of 0.647 and the Technological Influence Questionnaire (adapted) with an Alpha coefficient of 0.777. Conclusions: the scales are easy to apply and reveal reliable psychometric properties, an additional quality as they permit generalized studies on a theme as important as the impact of technological influence in health care. PMID:27143537
Psychometric properties of the Thai Spiritual Well-Being Scale.
Chaiviboontham, Suchira; Phinitkhajorndech, Noppawan; Hanucharurnkul, Somchit; Noipiang, Thaniya
2016-04-01
The purpose of this study was to investigate the psychometric properties of the modified Thai Spiritual Well-Being Scale in patients with advanced cancer. This cross-sectional study was employed to investigate psychometric properties. Some 196 participants from three tertiary hospitals in Bangkok and suburban Thailand were asked to complete a Personal Information Questionnaire (PIQ), The Memorial Symptom Assessment Scale (MSAS), and the Spiritual Well-Being Scale (SWBS). Validity was determined by known-group, concurrent, and constructs validity. Reliability was estimated using internal consistency by Cronbach's α coefficients. Three factors were extracted: so-called existential well-being, religious well-being, and peacefulness accounted for 71.44% of total variance. The Cronbach's α coefficients for total SWB, EWB, RWB, and peacefulness were 0.96, 0.94, and 0.93, respectively. These findings indicate that the Thai SWBS is a valid and reliable instrument, and it presented one more factor than the original version.
Alghadir, Ahmad; Anwer, Shahnawaz; Iqbal, Zaheen Ahmed; Alsanawi, Hisham Abdulaziz
2016-01-01
We adapted the reduced Western Ontario and McMaster Universities Osteoarthritis (WOMAC) index for the Arabic language and tested its metric properties in patients with knee osteoarthritis (OA). One hundred and twenty-one consecutive patients who were referred for physiotherapy to the outpatient department were asked to answer the Arabic version of the reduced WOMAC index (ArWOMAC). After the completion of the ArWOMAC, the intensity of knee pain and general health status were assessed using the visual analog scale (VAS) and the 12-item short form health survey (SF-12), respectively. A second assessment was performed at least 48 h after the first session to assess test-retest reliability. The test-retest reliability was quantified using the intra-class correlation coefficient (ICC), and Cronbach's alpha was calculated to assess the internal consistency of the Arabic questionnaire. The construct validity was assessed using Spearman rank correlation coefficients. The total ArWOMAC scale and pain and function subscales were internally consistent with Cronbach's coefficient alpha of 0.91, 0.89 and 0.90, respectively. Test-retest reliability was good to excellent with ICC of 0.91, 0.89 and 0.90, respectively. SF-12 and VAS score significantly correlated with ArWOMAC index (p < 0.01), which support the construct validity. The standard error of measurement (SEM) of the total scale was 2.94, based on repeated measurements for test-retest. The minimum detectable change based on the SEM for test-retest was 8.15. The ArWOMAC index is a reliable and valid instrument for evaluating the severity of knee OA, with metric properties in agreement with the original version. Although, the reduced WOMAC index has been clinically utilized within the Saudi population, the Arabic version of this instrument is not validated for an Arab population to measure lower limb functional disability caused by OA. The Arabic version of reduced WOMAC (ArWOMAC) index is a reliable and valid scale to measure lower limb functional disability in patients with knee OA. The ArWOMAC index could be suitable in Saudi Arabia and other Arab countries where the language, culture and the life style are similar.
Fan, Yahui; Zhang, Shaoru; Li, Yan; Li, Yuelu; Zhang, Tianhua; Liu, Weiping; Jiang, Hualin
2018-05-08
TB outbreaking in schools is extremely complex, and presents a major challenge for public health. Understanding the knowledge, attitudes and practices among student TB patients in such settings is fundamental when it comes to decreasing future TB cases. The objective of this study was to develop a Knowledge, Attitudes and Practices Questionnaire among Student Tuberculosis Patients (STBP-KAPQ), and evaluate its psychometric properties. This study was conducted in three stages: item construction, pilot testing in 10 student TB patients and psychometric testing, including reliability and validity. The item pool for the questionnaire was compiled from literature review and early individual interviews. The questionnaire items were evaluated by the Delphi method based on 12 experts. Reliability and validity were assessed using student TB patients (n = 416) and healthy students (n = 208). Reliability was examined with internal consistency reliability and test-retest reliability. Content validity was calculated by content validity index (CVI); Construct validity was examined using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA); The Public Tuberculosis Knowledge, Attitudes and Practices Questionnaire (PTB-KAPQ) was applied to evaluate criterion validity; As concerning discriminant validity, T-test was performed. The final STBP-KAPQ consisted of three dimensions and 25 items. Cronbach's α coefficient and intraclass correlation coefficient (ICC) was 0.817 and 0.765, respectively. Content validity index (CVI) was 0.962. Seven common factors were extracted by principal factor analysis and varimax rotation, with a cumulative contribution of 66.253%. The resulting CFA model of the STBP-KAPQ exhibited an appropriate model fit (χ2/df = 1.74, RMSEA = 0.082, CFI = 0.923, NNFI = 0.962). STBP-KAPQ and PTB-KAPQ had a strong correlation in the knowledge part, and the correlation coefficient was 0.606 (p < 0.05). Discriminant validity was supported through a significant difference between student TB patients and healthy students across all domains (p < 0.05). An instrument, "Knowledge, Attitudes and Practices Questionnaire among Student Tuberculosis Patients (STBP-KAPQ)" was developed. Psychometric testing indicated that it had adequate validity and reliability for use in KAP researches with student TB patients in China. The new tool might help public health researchers evaluate the level of KAP in student TB patients, and it could also be used to examine the effects of TB health education.
Bower, W F; Vlantis, A C; Chung, T M L; Cheung, S K C; Bjordal, K; van Hasselt, C A
2009-07-01
High convergent and discriminant validity between subscales was achieved after the translation of EORTC QLQ-H&N35 into Cantonese. Most subscales were assessing distinct components of quality of life (QoL). The study aimed to translate the EORTC QLQ-H&N35 cancer module into Cantonese and to confirm validity and reliability for use in a Hong Kong head and neck (H&N) cancer population. An ethnocentric forward-backward translation of EORTC QLQ-H&N35 was conducted by bilingual head and neck health professionals. Discrepancies were identified and problematic wording and concepts revised. Further review preceded pilot testing in 119 postoperative H&N cancer patients. Internal consistency within each subscale, convergent and discriminant validity to check the item relevance and item representativeness within and between subscales were examined. Mean and standard deviations of each subscale and single item and Cronbach's alpha coefficients for subscales were calculated. Six of seven subscales achieved standard reliability (Cronbach's alpha coefficient >0.7). Correlation coefficients between an item and its own subscale were significantly higher than the coefficients with other subscales. Scaling success was found in all subscales. Pearson's correlation coefficient between subscales was <0.70, except between the subscales swallowing and trouble with social eating (r = 0.795), and speech problems and social contact (r = 0.754).
Reliability and Validity of the Turkish Version of the Gastrointestinal Symptom Rating Scale.
Turan, Nuray; Aşt, Türkinaz Atabek; Kaya, Nurten
The purpose of this methodological study is to investigate the validity and reliability of the Turkish version of the Gastrointestinal Symptom Rating Scale (GSRS). The scale was adapted to the Turkish language via backward translation. Content validity was examined by referring to experts. Reliability was examined via test-retest reliability and internal consistency, and validity was examined with divergent and convergent validity. The Epworth Sleepiness Scale (ESS) and the Marlowe-Crowne Social Desirability Scale (MCSDS) were used for divergent validity. As for convergent validity, the Constipation Severity Instrument (CSI) and the Patient Assessment of Constipation Quality of Life Scale (PAC-QOLQ) were utilized. The relationship between the GSRS and the health-related quality of life (36-item short-form health survey [SF-36]) was also analyzed. The study population consisted of patients in orthopedic clinic who volunteered to participate. Test-retest reliability was examined with the participation of 30 patients; internal consistency and validity were examined with 150 patients. Test-retest reliability correlation coefficients of the GSRS varied from 0.39 to 0.87 for all items. For internal consistency, the GSRS's item total correlation was found to be 0.17-0.67, and Cronbach α was 0.82 for all items. There was a positive linear significant correlation between the GSRS, CSI, and PAC-QOLQ. There was no significant correlation between the GSRS, MCSDS, and ESS. Higher GSRS scores inversely correlated with general quality of life (SF-36). The Turkish version of the GSRS has been found to be a reliable and valid instrument for assessing patients' gastrointestinal symptoms. Therefore, this instrument can be confidently used with Turkish individuals.
The Brazilian version of the effort-reward imbalance questionnaire to assess job stress.
Chor, Dóra; Werneck, Guilherme Loureiro; Faerstein, Eduardo; Alves, Márcia Guimarães de Mello; Rotenberg, Lúcia
2008-01-01
The effort-reward imbalance (ERI) model has been used to assess the health impact of job stress. We aimed at describing the cross-cultural adaptation of the ERI questionnaire into Portuguese and some psychometric properties, in particular internal consistency, test-retest reliability, and factorial structure. We developed a Brazilian version of the ERI using a back-translation method and tested its reliability. The test-retest reliability study was conducted with 111 health workers and University staff. The current analyses are based on 89 participants, after exclusion of those with missing data. Reproducibility (interclass correlation coefficients) for the "effort", "'reward", and "'overcommitment"' dimensions of the scale was estimated at 0.76, 0.86, and 0.78, respectively. Internal consistency (Cronbach's alpha) estimates for these same dimensions were 0.68, 0.78, and 0.78, respectively. The exploratory factorial structure was fairly consistent with the model's theoretical components. We conclude that the results of this study represent the first evidence in favor of the application of the Brazilian Portuguese version of the ERI scale in health research in populations with similar socioeconomic characteristics.
Latvala, E; Saranto, K; Pekkala, E
2004-10-01
The main purpose of the project was to develop computerized instruments that could be used by nurses and patients to assess their cooperation and mutual contributions to care. This paper presents a part of the project: the reliability and validity testing phase of a process of instrument development. To test the validity and reliability of the instruments, data were collected with questionnaires from nurses (n = 146) and patients (n = 286). The validity evaluated as construct validity and the reliability evaluated as internal consistency of the instruments were quite good. Construct validity was tested by factor analysis, and internal consistency was tested by Cronbach's alpha coefficient, which varied from 0.69 to 0.79. The instruments, which consisted of a software application that can be operated in a www environment, were meant to be used as tools in the psychiatric nursing context for assessing the cooperation between the nurses and patients and the patient's participation in his/her care. Furthermore, the computer programme can be used as a tool for developing and assessing the patient orientation in nursing.
Correcting Coefficient Alpha for Correlated Errors: Is [alpha][K]a Lower Bound to Reliability?
ERIC Educational Resources Information Center
Rae, Gordon
2006-01-01
When errors of measurement are positively correlated, coefficient alpha may overestimate the "true" reliability of a composite. To reduce this inflation bias, Komaroff (1997) has proposed an adjusted alpha coefficient, ak. This article shows that ak is only guaranteed to be a lower bound to reliability if the latter does not include correlated…
Stucki, G; Meier, D; Stucki, S; Michel, B A; Tyndall, A G; Dick, W; Theiler, R
1996-01-01
The WOMAC (Western Ontario and McMaster Universities) Osteoarthritis Index is a tested questionnaire to assess symptoms and physical functional disability. We adapted the WOMAC for the German language and tested its metric properties, test-retest reliability and validity in 51 patients with knee and hip OA. All WOMAC scales (pain, stiffness, function) were internally consistent with Cronbach's coefficient alpha ranging from 0.80 to 0.96. Test-retest reliability was satisfactory with intraclass correlation coefficients ranging from 0.55 to 0.74. All scales and the global index calculated as the mean of scale scores had a bimodal distribution and a slight ceiling effect. As hypothesized the WOMAC scales were associated with radiological OA-severity and limitations of range-of-motion. Patients with more severe symptoms and functional disability perceived more limitations in their roles at home and at work. The presented German version of the WOMAC is a reliable and valid instrument for the assessment of symptoms and physical functional disability in patients with knee and hip OA.
Evaluation of Reliability Coefficients for Two-Level Models via Latent Variable Analysis
ERIC Educational Resources Information Center
Raykov, Tenko; Penev, Spiridon
2010-01-01
A latent variable analysis procedure for evaluation of reliability coefficients for 2-level models is outlined. The method provides point and interval estimates of group means' reliability, overall reliability of means, and conditional reliability. In addition, the approach can be used to test simple hypotheses about these parameters. The…
Bergeron, Lise; Smolla, Nicole; Berthiaume, Claude; Renaud, Johanne; Breton, Jean-Jacques; St-Georges, Marie; Morin, Pauline; Zavaglia, Elissa; Labelle, Réal
2017-03-01
The Dominic Interactive for Adolescents-Revised (DIA-R) is a multimedia self-report screen for 9 mental disorders, borderline personality traits, and suicidality defined by the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders ( DSM-5). This study aimed to examine the reliability and the validity of this instrument. French- and English-speaking adolescents aged 12 to 15 years ( N = 447) were recruited from schools and clinical settings in Montreal and were evaluated twice. The internal consistency was estimated by Cronbach alpha coefficients and the test-retest reliability by intraclass correlation coefficients. Cutoff points on the DIA-R scales were determined by using clinically relevant measures for defining external validation criteria: the Schedule for Affective Disorders and Schizophrenia for School-Aged Children, the Beck Hopelessness Scale, and the Abbreviated-Diagnostic Interview for Borderlines. Receiver operating characteristic (ROC) analyses provided accuracy estimates (area under the ROC curve, sensitivity, specificity, likelihood ratio) to evaluate the ability of the DIA-R scales to predict external criteria. For most of the DIA-R scales, reliability coefficients were excellent or moderate. High or moderate accuracy estimates from ROC analyses demonstrated the ability of the DIA-R thresholds to predict psychopathological conditions. These thresholds were generally capable to discriminate between clinical and school subsamples. However, the validity of the obsessions/compulsions scale was too low. Findings clearly support the reliability and the validity of the DIA-R. This instrument may be useful to assess a wide range of adolescents' mental health problems in the continuum of services. This conclusion applies to all scales, except the obsessions/compulsions one.
[An instrument in Spanish to evaluate the performance of clinical teachers by students].
Bitran, Marcela; Mena, Beltrán; Riquelme, Arnoldo; Padilla, Oslando; Sánchez, Ignacio; Moreno, Rodrigo
2010-06-01
The modernization of clinical teaching has called for the creation of faculty development programs, and the design of suitable instruments to evaluate clinical teachers' performance. To report the development and validation of an instrument in Spanish designed to measure the students' perceptions of their clinical teachers' performance and to provide them with feedback to improve their teaching practices. In a process that included the active participation of authorities, professors in charge of courses and internships, clinical teachers, students and medical education experts, we developed a 30-item questionnaire called MEDUC30 to evaluate the performance of clinical teachers by their students. The internal validity was assessed by factor analysis of 5214 evaluations of 265 teachers, gathered from 2004 to 2007. The reliability was measured with the Cronbach's alpha coefficient and the generalizability coefficient (g). MEDUC30 had good content and construct validity. Its internal structure was compatible with four factors: patient-centered teaching, teaching skills, assessment skills and learning climate, and it proved to be consistent with the structure anticipated by the theory. The scores were highly reliable (Cronbach's alpha: 0.97); five evaluations per teacher were sufficient to reach a reliability coefficient (g) of 0.8. MEDUC30 is a valid, reliable and useful instrument to evaluate the performance of clinical teachers. To our knowledge, this is the first instrument in Spanish for which solid validity and reliability evidences have been reported. We hope that MEDUC30 will be used to improve medical education in Spanish-speaking medical schools, providing teachers a specific feedback upon which to improve their pedagogical practice, and authorities with valuable information for the assessment of their faculty.
Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine
2014-09-04
There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver's burden across emotional, physical, social, financial, sleep, and time aspects. Primary objectives of this study were to evaluate psychometric properties of IADCQ administered on the Web and to determine most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was performed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of IADCQ can be used to measure AD caregiver's burden. Scales-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2. The study findings suggest the IADCQ has appropriate psychometric characteristics as a unidimensional, Web-based measure of AD caregiver burden and is supported by strong model fit statistics from CFA, high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity. Additional validation of the IADCQ is warranted to ensure invariance between the paper-based and Web-based administration and to determine an appropriate responder definition.
Terry, Leann; Kelley, Ken
2012-11-01
Composite measures play an important role in psychology and related disciplines. Composite measures almost always have error. Correspondingly, it is important to understand the reliability of the scores from any particular composite measure. However, the point estimates of the reliability of composite measures are fallible and thus all such point estimates should be accompanied by a confidence interval. When confidence intervals are wide, there is much uncertainty in the population value of the reliability coefficient. Given the importance of reporting confidence intervals for estimates of reliability, coupled with the undesirability of wide confidence intervals, we develop methods that allow researchers to plan sample size in order to obtain narrow confidence intervals for population reliability coefficients. We first discuss composite reliability coefficients and then provide a discussion on confidence interval formation for the corresponding population value. Using the accuracy in parameter estimation approach, we develop two methods to obtain accurate estimates of reliability by planning sample size. The first method provides a way to plan sample size so that the expected confidence interval width for the population reliability coefficient is sufficiently narrow. The second method ensures that the confidence interval width will be sufficiently narrow with some desired degree of assurance (e.g., 99% assurance that the 95% confidence interval for the population reliability coefficient will be less than W units wide). The effectiveness of our methods was verified with Monte Carlo simulation studies. We demonstrate how to easily implement the methods with easy-to-use and freely available software. ©2011 The British Psychological Society.
Mantarova, Stefka G; Velcheva, Irena V; Georgieva, Spaska O; Stambolieva, Katerina I
2013-01-01
The last twenty years have witnessed a surge of interest in the autonomic symptoms in Parkinson's disease (PD) and the possibilities to diagnose and treat them. The specialized questionnaire assessing the autonomic symptoms in Parkinson's disease (SCOPA-AUT) has been validated and available in English, Dutch and Spanish. In this study we aim at evaluating the validity, reliability and applicability of the Bulgarian version of SCOPA-AUT (SCOPA-AUT-BG). The study included 55 patients with idiopathic PD (mean age 64.4 +/- 8.9 yrs), and 40 healthy controls (mean age 58.5 +/- 9.4 yrs). Clinical severity and disease stage were assessed by United Parkinson's disease rating scale (UPRDS) and Hoen and Yahr (H&Y). Thirty-two of the PD patients completed SCOPA-AUT-BG again after a 7-day interval. Questionnaire reliability was analyzed by determining the internal consistency, homogeneity, discriminatory and construct validity and test-retest reliability. Analyses showed good internal consistency of the summary evaluation of SCOPA-AUT-BG (coefficient alpha of Cronbach = 0.79), which indicates the high reliability of the questionnaire. The lowest Cronbach's alpha coefficient (0.53) was found for the subscale "cardiovascular functions". A dominant role belongs to the subscales for gastrointestinal and urinary functions (Cronbach's Alpha > 0.7), where a significantly high correlation of PD with the UPDRS scale was observed. We found high test-retest reliability based on the responses associated with dysfunction of the gastrointestinal, urinary, thermoregulatory and pupillary autonomic systems. The correlation of the results of SCOPA-AUT-BG with UPDRS is higher than that with H&Y, and the construct validity is high except for the cardiovascular and pupillomotor functions subscales. The results of this study show that SCOPA-AUT-BG is a valid and reliable specialized questionnaire to evaluate autonomic function in patients with Parkinson's disease. Using it allows for more detailed clinical evaluation of these patients and justifies the need to refer them to specialized examination of autonomic functions.
Development and psychometric testing of the Cancer Knowledge Scale for Elders.
Su, Ching-Ching; Chen, Yuh-Min; Kuo, Bo-Jein
2009-03-01
To develop the Cancer Knowledge Scale for Elders and test its validity and reliability. The number of elders suffering from cancer is increasing. To facilitate cancer prevention behaviours among elders, they shall be educated about cancer-related knowledge. Prior to designing a programme that would respond to the special needs of elders, understanding the cancer-related knowledge within this population was necessary. However, extensive review of the literature revealed a lack of appropriate instruments for measuring cancer-related knowledge. A valid and reliable cancer knowledge scale for elders is necessary. A non-experimental methodological design was used to test the psychometric properties of the Cancer Knowledge Scale for Elders. Item analysis was first performed to screen out items that had low corrected item-total correlation coefficients. Construct validity was examined with a principle component method of exploratory factor analysis. Cancer-related health behaviour was used as the criterion variable to evaluate criterion-related validity. Internal consistency reliability was assessed by the KR-20. Stability was determined by two-week test-retest reliability. The factor analysis yielded a four-factor solution accounting for 49.5% of the variance. For criterion-related validity, cancer knowledge was positively correlated with cancer-related health behaviour (r = 0.78, p < 0.001). The KR-20 coefficients of each factor were 0.85, 0.76, 0.79 and 0.67 and 0.87 for the total scale. Test-retest reliability over a two-week period was 0.83 (p < 0.001). This study provides evidence for content validity, construct validity, criterion-related validity, internal consistency and stability of the Cancer Knowledge Scale for Elders. The results show that this scale is an easy-to-use instrument for elders and has adequate validity and reliability. The scale can be used as an assessment instrument when implementing cancer education programmes for elders. It can also be used to evaluate the effects of education programmes.
Multanen, Juhani; Honkanen, Mikko; Häkkinen, Arja; Kiviranta, Ilkka
2018-05-22
The Knee Injury and Osteoarthritis Outcome Score (KOOS) is a commonly used knee assessment and outcome tool in both clinical work and research. However, it has not been formally translated and validated in Finnish. The purpose of this study was to translate and culturally adapt the KOOS questionnaire into Finnish and to determine its validity and reliability among Finnish middle-aged patients with knee injuries. KOOS was translated and culturally adapted from English into Finnish. Subsequently, 59 patients with knee injuries completed the Finnish version of KOOS, Western Ontario and McMaster Osteoarthritis Index (WOMAC), Short-Form 36 Health Survey (SF-36) and Numeric Pain Rating Scale (Pain-NRS). The same KOOS questionnaire was re-administered 2 weeks later. Psychometric assessment of the Finnish KOOS was performed by testing its construct validity and reliability by using internal consistency, test-retest reliability and measurement error. The floor and ceiling effects were also examined. The cross-cultural adaptation revealed only minor cultural differences and was well received by the patients. For construct validity, high to moderate Spearman's Correlation Coefficients were found between the KOOS subscales and the WOMAC, SF-36, and Pain-NRS subscales. The Cronbach's alpha was from 0.79 to 0.96 for all subscales indicating acceptable internal consistency. The test-retest reliability was good to excellent, with Intraclass Correlation Coefficients ranging from 0.73 to 0.86 for all KOOS subscales. The minimal detectable change ranged from 17 to 34 on an individual level and from 2 to 4 on a group level. No floor or ceiling effects were observed. This study yielded an appropriately translated and culturally adapted Finnish version of KOOS which demonstrated good validity and reliability. Our data indicate that the Finnish version of KOOS is suitable for assessment of the knee status of Finnish patients with different knee complaints. Further studies are needed to evaluate the predictive ability of KOOS in the Finnish population.
The Effects of Methods of Imputation for Missing Values on the Validity and Reliability of Scales
ERIC Educational Resources Information Center
Cokluk, Omay; Kayri, Murat
2011-01-01
The main aim of this study is the comparative examination of the factor structures, corrected item-total correlations, and Cronbach-alpha internal consistency coefficients obtained by different methods used in imputation for missing values in conditions of not having missing values, and having missing values of different rates in terms of testing…
Vertically aligned CNT-Cu nano-composite material for stacked through-silicon-via interconnects.
Sun, Shuangxi; Mu, Wei; Edwards, Michael; Mencarelli, Davide; Pierantoni, Luca; Fu, Yifeng; Jeppson, Kjell; Liu, Johan
2016-08-19
For future miniaturization of electronic systems using 3D chip stacking, new fine-pitch materials for through-silicon-via (TSV) applications are likely required. In this paper, we propose a novel carbon nanotube (CNT)/copper nanocomposite material consisting of high aspect ratio, vertically aligned CNT bundles coated with copper. These bundles, consisting of hundreds of tiny CNTs, were uniformly coated by copper through electroplating, and aspect ratios as high as 300:1 were obtained. The resistivity of this nanomaterial was found to be as low as ∼10(-8) Ω m, which is of the same order of magnitude as the resistivity of copper, and its temperature coefficient was found to be only half of that of pure copper. The main advantage of the composite TSV nanomaterial is that its coefficient of thermal expansion (CTE) is similar to that of silicon, a key reliability factor. A finite element model was set up to demonstrate the reliability of this composite material and thermal cycle simulations predicted very promising results. In conclusion, this composite nanomaterial appears to be a very promising material for future 3D TSV applications offering both a low resistivity and a low CTE similar to that of silicon.
Carvalho, Hudson W de; Andreoli, Sérgio B; Lara, Diogo R; Patrick, Christopher J; Quintana, Maria Inês; Bressan, Rodrigo A; Melo, Marcelo F de; Mari, Jair de J; Jorge, Miguel R
2013-01-01
Positive and negative affect are the two psychobiological-dispositional dimensions reflecting proneness to positive and negative activation that influence the extent to which individuals experience life events as joyful or as distressful. The Positive and Negative Affect Schedule (PANAS) is a structured questionnaire that provides independent indexes of positive and negative affect. This study aimed to validate a Brazilian interview-version of the PANAS by means of factor and internal consistency analysis. A representative community sample of 3,728 individuals residing in the cities of São Paulo and Rio de Janeiro, Brazil, voluntarily completed the PANAS. Exploratory structural equation model analysis was based on maximum likelihood estimation and reliability was calculated via Cronbach's alpha coefficient. Our results provide support for the hypothesis that the PANAS reliably measures two distinct dimensions of positive and negative affect. The structure and reliability of the Brazilian version of the PANAS are consistent with those of its original version. Taken together, these results attest the validity of the Brazilian adaptation of the instrument.
Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana
2017-10-19
Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) was administered randomly in order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was carried out among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p < 0.05. The physical health, psychological health, social relationship and environment domains on WHOQoL-BREF with correlation coefficient that ranged from 0.214 to 0.360 showed significant correlation with similar domains on SS-QoL(Y). Dissimilar domains between the two scales had r values from 0.035 to 0.366. Discriminant validity of SS-QoL(Y) showed that items' r value ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha =0.61 to 0.82) for domains scores. These correlations were also significant for the domains and overall scores (p < 0.05). There were no significant differences across different age groups or gender for the domains or overall scores of SS-QoL(Y). Discriminant and known-group validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Iwata, Shintaro; Uehara, Kosuke; Ogura, Koichi; Akiyama, Toru; Shinoda, Yusuke; Yonemoto, Tsukasa; Kawai, Akira
2016-09-01
The Musculoskeletal Tumor Society (MSTS) scoring system is a widely used functional evaluation tool for patients treated for musculoskeletal tumors. Although the MSTS scoring system has been validated in English and Brazilian Portuguese, a Japanese version of the MSTS scoring system has not yet been validated. We sought to determine whether a Japanese-language translation of the MSTS scoring system for the lower extremity had (1) sufficient reliability and internal consistency, (2) adequate construct validity, and (3) reasonable criterion validity compared with the Toronto Extremity Salvage Score (TESS) and SF-36 using psychometric analysis. The Japanese version of the MSTS scoring system was developed using accepted guidelines, which included translation of the English version of the MSTS into Japanese by five native Japanese bilingual musculoskeletal oncology surgeons and integrated into one document. One hundred patients with a diagnosis of intermediate or malignant bone or soft tissue tumors located in the lower extremity and who had undergone tumor resection with or without reconstruction or amputation participated in this study. Reliability was evaluated by test-retest analysis, and internal consistency was established by Cronbach's alpha coefficient. Construct validity was evaluated using the principal factor analysis and Akaike information criterion network. Criterion validity was evaluated by comparing the MSTS scoring system with the TESS and SF-36. Test-retest analysis showed a high intraclass correlation coefficient (0.92; 95% CI, 0.88-0.95), indicating high reliability of the Japanese version of the MSTS scoring system, although a considerable ceiling effect was observed, with 23 patients (23%) given the maximum score. Cronbach's alpha coefficient was 0.87 (95% CI, 0.82-0.90), suggesting a high level of internal consistency. Factor analysis revealed that all items had high loading values and communalities; we identified a central role for the items "walking" and "gait" according to the Akaike information criterion network. The total MSTS score was correlated with that of the TESS (r = 0.81; 95% CI, 0.73-0.87; p < 0.001) and the physical component summary and physical functioning of the SF-36. The Japanese-language translation of the MSTS scoring system for the lower extremity has sufficient reliability and reasonable validity. Nevertheless, the observation of a ceiling effect suggests poor ability of this system to discriminate from among patients who have a high level of function.
The validity and reliability of tinnitus handicap inventory Thai version.
Limviriyakul, Siriporn; Supavanich, Walop
2012-11-01
Demonstrate the reliability and validity of the Tinnitus Handicap Inventory Thai Version (THI-T), a self-report measure of tinnitus. A cross-sectional psychometric validation study was used to determine internal consistency reliability and validity of the Tinnitus Handicap Inventory Thai Version at the Otoneurology clinic at Tertiary care center The cross-cultural adaptation of the Tinnitus Handicapped Inventory English version (Newman et al, 1996) was translated into Thai version following the steps indicated by Guillemin et al. The reliability was constructed by using Cronbach's coefficient alpha. The validity was analyzed by the correlation between Tinnitus Handicap Inventory Thai version and the 36-items short form health survey and visual analog scale using Spearman and Pearson test. The result showed good internal consistency reliabilities of total, functional, emotional, and catastrophic scale (a = 0.902, 0.804, 0.831 and 0.661, respectively) of Tinnitus Handicap Inventory Thai Version. Spearman correlation showed the significant correlation of Tinnitus Handicap Inventory to 36-items short form health survey and visual analog scale. Tinnitus Handicap Inventory Thai Version will be a vigorous tool in evaluating tinnitus patients as well as monitoring the progress of their symptoms.
Hypertension Knowledge-Level Scale (HK-LS): a study on development, validity and reliability.
Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin
2012-03-01
This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥ 18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensions encompassed 60.3% of the total variance. Cronbach alpha coefficients were 0.82 for the entire scale and 0.92, 0.59, 0.67, 0.77, 0.72, and 0.76 for the sub-dimensions of definition, medical treatment, drug compliance, lifestyle, diet, and complications, respectively. The scale ensured internal consistency in reliability and construct validity, as well as stability over time. Significant relationships were found between knowledge score and age, gender, educational level, and history of hypertension of the participants. No correlation was found between knowledge score and working at an income-generating job. The present scale, developed to measure the knowledge level of hypertension among Turkish adults, was found to be valid and reliable.
Narin, Selnur; Unver, Bayram; Bakırhan, Serkan; Bozan, Ozgür; Karatosun, Vasfi
2014-01-01
The purpose of this study was to adapt the English version of the Hospital for Special Surgery (HSS) knee score for use in a Turkish population and to evaluate its validity, reliability and cultural adaptation. Standard forward-back translation of the HSS knee score was performed and the Turkish version was applied in 73 patients. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Mini-Mental State Examination and sit-to-stand test were also performed and analyzed. Internal consistency reliability was tested using Cronbach's alpha. The intraclass correlation coefficient (ICC) was used to calculate the test-retest reliability at one-week intervals. Validity was assessed by calculating the Pearson correlation between the HSS, WOMAC and sit-to-stand test scores. The ICC ranged from 0.98 to 0.99 with high internal consistency (Cronbach's alpha: 0.87). The WOMAC score correlated with total HSS score (r: -0.80, p<0.001) and sit-to-stand score (r: 0.12, p: 0.312). The Turkish version of the HSS knee score is reliable and valid in evaluating the total knee arthroplasty in Turkish patients.
Understanding a Widely Misunderstood Statistic: Cronbach's "Alpha"
ERIC Educational Resources Information Center
Ritter, Nicola L.
2010-01-01
It is important to explore score reliability in virtually all studies, because tests are not reliable. The present paper explains the most frequently used reliability estimate, coefficient alpha, so that the coefficient's conceptual underpinnings will be understood. Researchers need to understand score reliability because of the possible impact…
ERIC Educational Resources Information Center
Black, Ryan A.; Yang, Yanyun; Beitra, Danette; McCaffrey, Stacey
2015-01-01
Estimation of composite reliability within a hierarchical modeling framework has recently become of particular interest given the growing recognition that the underlying assumptions of coefficient alpha are often untenable. Unfortunately, coefficient alpha remains the prominent estimate of reliability when estimating total scores from a scale with…
Doig, Emmah; Prescott, Sarah; Fleming, Jennifer; Cornwell, Petrea; Kuipers, Pim
2016-01-01
To examine the internal reliability and test-retest reliability of the Client-Centeredness of Goal Setting (C-COGS) scale. The C-COGS scale was administered to 42 participants with acquired brain injury after completion of multidisciplinary goal planning. Internal reliability of scale items was examined using item-partial total correlations and Cronbach's α coefficient. The scale was readministered within a 1-mo period to a subsample of 12 participants to examine test-retest reliability by calculating exact and close percentage agreement for each item. After examination of item-partial total correlations, test items were revised. The revised items demonstrated stronger internal consistency than the original items. Preliminary evaluation of test-retest reliability was fair, with an average exact percent agreement across all test items of 67%. Findings support the preliminary reliability of the C-COGS scale as a tool to evaluate and promote client-centered goal planning in brain injury rehabilitation. Copyright © 2016 by the American Occupational Therapy Association, Inc.
Factor structure of the Bulimia Test--Revised in college women from four ethnic groups.
Fernandez, Senaida; Malcarne, Vanessa L; Malacrne, Vanessa L; Wilfley, Denise E; McQuaid, John
2006-07-01
The factor structure of the Bulimia Test--Revised (BULIT-R) was investigated using confirmatory factor analysis (CFA) and exploratory factor analysis (EFA). The sample consisted of 2,671 female college students (African American, Asian American, Caucasian American, and Latino American). Reliability coefficients were excellent across groups. African Americans scored significantly lower on the BULIT-R than Caucasian Americans. Across groups, CFA and EFA results suggest a six-factor solution is most appropriate. Consistent across groups were factors representing bingeing, body image, purging, and extreme weight loss behaviors, while few differences were observed across groups. These findings suggest that the measure is reliable and valid for use with diverse ethnic groups. Future research should focus on culturally salient psychological correlates of disordered eating in diverse ethnic groups.
Dąbrowska, Marta; Krakowiak, Karolina; Radlińska, Olga; Rybka, Aleksandra; Grabczak, Elżbieta M; Maskey-Warzęchowska, Marta; Korczyński, Piotr; Birring, Surinder S; Krenke, Rafał
2016-01-01
The Leicester Cough Questionnaire (LCQ) is one of the few specific quality-of-life questionnaires (QOLQ) dedicated to measuring the impact of chronic cough on patients' health/condition. The aim of the study was to validate the Polish version of the LCQ. The LCQ was translated forward and backward. The Polish version of the LCQ was tested on 35 patients suffering from chronic cough (23 women, median age 60 years, nonor ex-smokers, median cough duration of 23 weeks). Its validity was tested by comparison to a visual analogue scale (VAS) of cough intensity and other health questionnaires (hospital anxiety and depression scale - HADS, Euro-Quality of Life Questionnaire - EQ5D, St. George's Respiratory Questionnaire - SGRQ). The internal reliability of the Polish version of the LCQ was determined using the Cronbach alpha coefficient and its repeatability by the intraclass consistency coefficient. The translation of the LCQ into Polish was accepted by the author of the original LCQ. The Cronbach's alpha coefficient for total LCQ was 0.89, and reached 0.82, 0.86 and 0.78 for the physical, psychological and social domain, respectively. There were significant negative correlations between cough severity measured by VAS, the results of the EQ5D and SGRQ and the Polish version of the LCQ. The intraclass correlation coefficient of the test-retest reliability was significant (0.99). The Polish version of the LCQ has been validated and is a reliable tool to measure the impact of chronic cough on quality of life of patients with chronic cough.
Hu, Guo-Qing; Rao, Ke-Qin; Sun, Zhen-Qiu
2008-12-01
To develop a capacity questionnaire in public health emergency for Chinese local governments. Literature reviews, conceptual modelling, stake-holder analysis, focus group, interview, and Delphi technique were employed together to develop the questionnaire. Classical test theory and case study were used to assess the reliability and validity. (1) A 2-dimension conceptual model was built. A preparedness and response capacity questionnaire in public health emergency with 10 dimensions and 204 items, was developed. (2) Reliability and validity results. Internal consistency: except for dimension 3 and 8, the Cronbach's alpha coefficient of other dimensions was higher than 0.60. The alpha coefficients of dimension 3 and dimension 8 were 0.59 and 0.39 respectively; Content validity: the questionnaire was recognized by the investigatees; Construct validity: the Spearman correlation coefficients among the 10 dimensions fluctuated around 0.50, ranging from 0.26 to 0.75 (P<0.05); Discrimination validity: comparisons of 10 dimensions among 4 provinces did not show statistical significance using One-way analysis of variance (P>0.05). Criterion-related validity: case study showed significant difference among the 10 dimensions in Beijing between February 2003 (before SARS event) and November 2005 (after SARS event). The preparedness and response capacity questionnaire in public health emergency is a reliable and valid tool, which can be used in all provinces and municipalities in China.
Praxedes, Marcus Fernando da Silva; de Abreu, Mauro Henrique Nogueira Guimarães; Paiva, Saul Martins; Mambrini, Juliana Vaz de Melo; Marcolino, Milena Soriano; Martins, Maria Auxiliadora Parreiras
2016-06-24
The aim of this study was to evaluate the psychometric properties of the Brazilian version of the Oral Anticoagulation Knowledge (OAK) Test. This study, conducted in an anticoagulation clinic, included 201 Brazilian participants aged over 18 years, who had been using warfarin for more than two months. The reliability of the instrument was evaluated by assessing internal consistency (Kuder-Richardson coefficient) and reproducibility (test-retest reliability). The validity was evaluated by hypothesizing that there would be a positive correlation of moderate to strong intensity between the correctness levels of the OAK Test and time within therapeutic range (TTR) values, which is a measure used to evaluate the quality of oral anticoagulation. The instrument exhibited good psychometric properties. The total a Kuder-Richardson coefficient value was 0.818 and intraclass correlation coefficient was 0.967. The validity revealed a strong positive correlation between the values of the level of knowledge, as measured by the OAK Test and the TTR values (rs = 0.780). The instrument proved to be a reliable and valid tool for evaluating the knowledge of Brazilian patients on oral anticoagulation therapy with warfarin. This instrument may be incorporated into the practice of health care for substantiating the structuring of educational activities to ensure the improvement of knowledge about the use of warfarin, thereby increasing the effectiveness and safety of treatment.
Ibrahim, Edward F; Petrou, Charalambos; Galanos, Antonis
2015-01-01
Background The purpose of the present study was to validate the Functional Shoulder Score (FSS), a new patient-reported outcome score specifically designed to evaluate patients with rotator cuff disorders. Methods One hundred and nineteen patients were assessed using two shoulder scoring systems [the FSS and the Constant–Murley Score (CMS)] at 3 weeks pre- and 6 months post-arthroscopic rotator cuff surgery. The reliability, validity, responsiveness and interpretability of the FSS were evaluated. Results Reliability analysis (test–retest) showed an intraclass correlation coefficient value of 0.96 [95% confidence interval (CI) = 0.92 to 0.98]. Internal consistency analysis revealed a Cronbach's alpha coefficient of 0.93. The Pearson correlation coefficient FSS-CMS was 0.782 pre-operatively and 0.737 postoperatively (p < 0.0005). There was a statistically significant increase in FSS scores postoperatively, an effect size of 3.06 and standardized response mean of 2.80. The value for minimal detectable change was ±8.38 scale points (based on a 90% CI) and the minimal clinically important difference for improvement was 24.7 ± 5.4 points. Conclusions The FSS is a patient-reported outcome measure that can easily be incorporated into clinical practice, providing a quick, reliable, valid and practical measure for rotator cuff problems. The questionnaire is highly sensitive to clinical change. PMID:27582986
Reliability of the social skills rating system in a group of Iranian children.
Shahim, S
2001-12-01
The purpose of this study was to investigate reliability of the Social Skills Rating Systems of Gresham and Elliott for use in Iran. The sample consisted of 304 students aged 6 to 12 years, selected from the elementary schools in Shiraz, Iran. Parents' and teachers' ratings of social skills and behavioural problems and self-rating of social skills were applied in this study. Pearson correlations between parents' and teachers' ratings were low to moderate. Correlations between social skills subdomains and behavioural problems subdomains were low to high. Cronbach coefficients alpha were satisfactory for the two subdomains.
Béliard, Sophie; Coudert, Mathieu; Valéro, René; Charbonnier, Laurie; Duchêne, Emilie; Allaert, François André; Bruckert, Éric
2012-12-01
The purpose of our study was to develop and validate a short food frequency questionnaire which could assess the nutritional lifestyles of hypercholesterolemic patients consulting in daily practice. The questionnaire explores 11 nutrient categories. Hundred and thirty-one patients were recruited for the construct validity and 58 patients for the external validity in La Pitié Hospital, Paris. The reference method used was the diet history. To measure the internal consistency and to test the sensibility to change on a large scale, the questionnaire was used in an observational study conducted in Spain in 1048 moderate hypercholesterolemic patients. Psychometric analyses included construct validity, internal consistency, test-retest reliability, external validity and sensibility to change. Validation of the questionnaire indicated a good internal consistency (Cronbach Coefficient Alpha at 0.69) and test-retest reliability (intraclass correlation coefficient=0.89). The correlation between the scores of the FFQ and those of the diet history was significant with a Pearson correlation coefficient at 0.3 (P=0.029). The comparison between the ranking of the patients showed an agreement of 72% with a kappa of 0.48 [0.10; 0.69]. The sensibility to change was good with a score evolution improving one and four months after nutrition advices: 28.2% of patients ranked in group 1 at inclusion versus 61.3% (P<0.0001) at one month and 75.2% (P<0.0001) at four months. In conclusion, we developed and validated a food questionnaire for hypercholesterolemic patients, which can be used as a therapeutic education tool in daily practice or in clinical research. Copyright © 2012. Published by Elsevier Masson SAS.
Validation of the Physical Activity Questionnaire for Older Children (PAQ-C) among Chinese Children.
Wang, Jing Jing; Baranowski, Tom; Lau, Wc Patrick; Chen, Tzu An; Pitkethly, Amanda Jane
2016-03-01
This study initially validates the Chinese version of the Physical Activity Questionnaire for Older Children (PAQ-C), which has been identified as a potentially valid instrument to assess moderate-to-vigorous physical activity (MVPA) in children among diverse racial groups. The psychometric properties of the PAQ-C with 742 Hong Kong Chinese children were assessed with the scale's internal consistency, reliability, test-retest reliability, confirmatory factory analysis (CFA) in the overall sample, and multistep invariance tests across gender groups as well as convergent validity with body mass index (BMI), and an accelerometry-based MVPA. The Cronbach alpha coefficient (α=0.79), composite reliability value (ρ=0.81), and the intraclass correlation coefficient (α=0.82) indicate the satisfactory reliability of the PAQ-C score. The CFA indicated data fit a single factor model, suggesting that the PAQ-C measures only one construct, on MVPA over the previous 7 days. The multiple-group CFAs suggested that the factor loadings and variances and covariances of the PAQ-C measurement model were invariant across gender groups. The PAQ-C score was related to accelerometry-based MVPA (r=0.33) and inversely related to BMI (r=-0.18). This study demonstrates the reliability and validity of the PAQ-C in Chinese children. Copyright © 2016 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
Validation of the French version of the Burn Specific Health Scale-Brief (BSHS-B) questionnaire.
Gandolfi, S; Auquit-Auckbur, I; Panunzi, S; Mici, E; Grolleau, J-L; Chaput, B
2016-11-01
The Burn Specific Health Scale-Brief questionnaire is a widely validated tool for estimating the health related quality of life and for assessing the best multidisciplinary management of burn patients. The aim of this study was to translate the BSHS-B into French and to investigate its reliability and validity. According to the procedure proposed by the Scientific Advisory Committee of the Medical Outcomes Trust, the Burn Specific Health Scale-Brief (BSHS-B) was translated from the English version into French. In order to test the reliability of the French version of the BSHS-B, 53 burn patients French speakers completed the BSHS-B and SF-36 questionnaires from two to four years after burn. Ten of them have been re-tested at 6 months after the first evaluation. To evaluate clinical utility of the BSHS-F, internal consistency, construct validity (using SF-36) and stability in time were assessed using Cronbach's alpha statistic, Spearman rank test, and intra-class correlation coefficient respectively. The French version of the BSHS-B Cronbach's alpha coefficient was 0.93 and was >0.80 for all the sub-domains. French version of the BSHS-B and the SF-36 were positively correlated, all the associations were statistically significant (p<0.01). Intra-class correlation coefficients for test-retest ranged between 0.95 and 0.99 for the sub-domains. The intra-class correlation coefficient (ICC) for the total score was 0.98. The French version of the BSHS-B shows a robust rate of internal consistency, construct validity and stability in time, supporting its application in routine clinical practice as well as in international studies. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.
Schuler, M; Musekamp, G; Bengel, J; Schwarze, M; Spanier, K; Gutenbrunner, Chr; Ehlebracht-König, I; Nolte, S; Osborne, R H; Faller, H
2014-11-01
To assess stable effects of self-management programs, measurement instruments should primarily capture the attributes of interest, for example, the self-management skills of the measured persons. However, measurements of psychological constructs are always influenced by both aspects of the situation (states) and aspects of the person (traits). This study tests whether the Health Education Impact Questionnaire (heiQ™), an instrument assessing a wide range of proximal outcomes of self-management programs, is primarily influenced by person factors instead of situational factors. Furthermore, measurement invariance over time, changes in traits and predictors of change for each heiQ™ scale were examined. Subjects were N = 580 patients with rheumatism, asthma, orthopedic conditions or inflammatory bowel disease, who filled out the heiQ™ at the beginning, the end of and 3 months after a disease-specific inpatient rehabilitation program in Germany. Structural equation modeling techniques were used to estimate latent trait-change models and test for measurement invariance in each heiQ™ scale. Coefficients of consistency, occasion specificity and reliability were computed. All scales showed scalar invariance over time. Reliability coefficients were high (0.80-0.94), and consistency coefficients (0.49-0.79) were always substantially higher than occasion specificity coefficients (0.14-0.38), indicating that the heiQ™ scales primarily capture person factors. Trait-changes with small to medium effect sizes were shown in five scales and were affected by sex, age and diagnostic group. The heiQ™ can be used to assess stable effects in important outcomes of self-management programs over time, e.g., changes in self-management skills or emotional well-being.
Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti
2017-01-01
to obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. at the initial stage, the GERD-QOL questionnaire was first translated into Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the researcher team and therefore, an Indonesian version of GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of GERD-QOL questionnaire and the SF 36 questionnaire. The validity was evaluated using a method of construct validity and external validity, and reliability can be tested by the method of internal consistency and test retest. the Indonesian version of GERD-QOL questionnaire had a good internal consistency reliability with a Cronbach Alpha of 0.687-0.842 and a good test retest reliability with an intra-class correlation coefficient of 0.756-0.936; p<0.05). The questionnaire had also been demonstrated to have a good validity with a proven high correlation to each question of SF-36 (p<0.05). the Indonesian version of GERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
Longo Imedio, Isabel; Serra-Guillén, Carlos
2016-01-01
While there are questionnaires for evaluating the effects of skin cancer on patient quality of life, there are no specific questionnaires available in Spanish for evaluating quality of life in patients with actinic keratosis. The aim of this study was to translate and culturally adapt the Actinic Keratosis Quality of Life (AKQoL) questionnaire into Spanish. The original questionnaire was translated into Spanish following the guidelines for the cross-cultural adaptation of self-report measures. Several measures of general reliability and validity were calculated, including Cronbach α for internal consistency and the Spearman rank-order correlation coefficient and a Bland-Altman plot for test-retest reliability. To test concurrent validity, we used the Pearson correlation coefficient to measure the correlation between AKQoL and Skindex-29 scores. The final version of the questionnaire was administered to 621 patients with actinic keratosis, who scored a mean (SD) of 5.25 (4.73) points (total possible score, 0-25). The Cronbach α reliability coefficient analysis was 0.84. The correlation between the mean (SD) score on the Skindex-29 (1.87 [4.07]) and on the AKQoL (1.97 [2.98] was 0.344 (P=.002, Spearman's rho), with a proportion of shared variance of 11.8%. The translation, cross-cultural adaptation, and validation of the original AKQoL produced a reliable, easily understandable questionnaire for evaluating the impact of actinic keratosis on the quality of life of patients in our setting. Copyright © 2016 AEDV. Published by Elsevier España, S.L.U. All rights reserved.
The reliability of multidimensional neuropsychological measures: from alpha to omega.
Watkins, Marley W
To demonstrate that Coefficient omega, a model-based estimate, is more a more appropriate index of reliability than coefficient alpha for the multidimensional scales that are commonly employed by neuropsychologists. As an illustration, a structural model of an overarching general factor and four first-order factors for the WAIS-IV based on the standardization sample of 2200 participants was identified and omega coefficients were subsequently computed for WAIS-IV composite scores. Alpha coefficients were ≥ .90 and omega coefficients ranged from .75 to .88 for WAIS-IV factor index scores, indicating that the blend of general and group factor variance in each index score created a reliable multidimensional composite. However, the amalgam of variance from general and group factors did not allow the precision of Full Scale IQ (FSIQ) and factor index scores to be disentangled. In contrast, omega hierarchical coefficients were low for all four factor index scores (.10-.41), indicating that most of the reliable variance of each factor index score was due to the general intelligence factor. In contrast, the omega hierarchical coefficient for the FSIQ score was .84. Meaningful interpretation of WAIS-IV factor index scores as unambiguous indicators of group factors is imprecise, thereby fostering unreliable identification of neurocognitive strengths and weaknesses, whereas the WAIS-IV FSIQ score can be interpreted as a reliable measure of general intelligence. It was concluded that neuropsychologists should base their clinical decisions on reliable scores as indexed by coefficient omega.
Rating scales for dystonia in cerebral palsy: reliability and validity.
Monbaliu, E; Ortibus, E; Roelens, F; Desloovere, K; Deklerck, J; Prinzie, P; de Cock, P; Feys, H
2010-06-01
This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). Three raters independently scored videotapes of 10 patients (five males, five females; mean age 13 y 3 mo, SD 5 y 2 mo, range 5-22 y). One patient each was classified at levels I-IV in the Gross Motor Function Classification System and six patients were classified at level V. Reliability was measured by (1) intraclass correlation coefficient (ICC) for interrater reliability, (2) standard error of measurement (SEM) and smallest detectable difference (SDD), and (3) Cronbach's alpha for internal consistency. Validity was assessed by Pearson's correlations among the three scales used and by content analysis. Moderate to good interrater reliability was found for total scores of the three scales (ICC: BADS=0.87; BFMMS=0.86; UDRS=0.79). However, many subitems showed low reliability, in particular for the UDRS. SEM and SDD were respectively 6.36% and 17.72% for the BADS, 9.88% and 27.39% for the BFMMS, and 8.89% and 24.63% for the UDRS. High internal consistency was found. Pearson's correlations were high. Content validity showed insufficient accordance with the new CP definition and classification. Our results support the internal consistency and concurrent validity of the scales; however, taking into consideration the limitations in reliability, including the large SDD values and the content validity, further research on methods of assessment of dystonia is warranted.
Development of the Facial Skin Care Index: A Health-Related Outcomes Index for Skin Cancer Patients
Matthews, B. Alex; Rhee, John S.; Neuburg, Marcy; Burzynski, Mary L.; Nattinger, Ann B.
2006-01-01
BACKGROUND Existing health-related quality-of-life (HRQOL) tools do not appear to capture patients' specific skin cancer concerns. OBJECTIVE To describe the conceptual foundation, item generation, reduction process, and reliability testing for the Facial Skin Cancer Index (FSCI), a HRQOL outcomes tool for skin cancer researchers and clinicians. METHODS Participants in Phases I to III consisted of adult patients (N = 134) diagnosed with biopsy-proven nonmelanoma cervicofacial skin cancer. Data were collected via self-report surveys and clinical records. RESULTS Seventy-one distinct items were generated in Phase I and rated for their importance by an independent sample during Phase II; 36 items representing six theoretical HRQOL domains were retained. Test–retest I results indicated that four subscales showed adequate reliability coefficients (α = 0.60 to 0.91). Twenty-six items remained for test–retest II. Results indicated excellent internal consistency for emotional, social, appearance, and modified financial/work subscales (range 0.79 to 0.95); test–retest correlation coefficients were consistent across time (range 0.81 to 0.97; lifestyle omitted). CONCLUSION Pretesting afforded the opportunity to select items that optimally met our a priori conceptual and psychometric criteria for high data quality. Phase IV testing (validity and sensitivity before surgery and 4 months after Mohs micrographic surgery) for the 20-item FSCI is under way. PMID:16875475
Psychometric properties of the Thought-Action Fusion Scale in a Turkish sample.
Yorulmaz, Orçun; Yilmaz, A Esin; Gençöz, Tülin
2004-10-01
The aim of the present study was to reveal the cross-cultural utility of the Thought-Action Fusion Scale (TAFS; J. Anxiety Disord. 10 (1996) 379). Thought-action fusion (TAF) refers to the tendency to overvalue the significance and the consequences of thoughts. Two hundred and fifty one undergraduate Turkish students participated in the current study. The reliability and validity analyses of the Turkish version of the scale indicated that the TAFS had adequate psychometric properties in a Turkish sample. Consistent with the original TAF, the Turkish version of TAFS revealed two subscales as TAF-Likelihood and TAF-Morality. Reliability analysis showed that TAF Scale and its factors had adequate internal consistencies and split-half reliability coefficients. Confirming the expectations, TAFS scores were found to be significantly and positively correlated with obsessive-compulsive symptoms, responsibility, and guilt measures. Moreover, it was found that people with high obsessive-compulsive symptoms had higher TAFS scores than those with low symptoms.
The Reliability and Validity of Prostate Cancer Fatalism Inventory in Turkish Language.
Aydoğdu, Nihal Gördes; Çapık, Cantürk; Ersin, Fatma; Kissal, Aygul; Bahar, Zuhal
2017-10-01
This study aimed to conduct the reliability and validity study of the Prostate Cancer Fatalism Inventory in Turkish language. The study carried out in methodological type and consisted of 171 men. The ages of the participants ranged between 40 and 82. The content validity index was determined to be 0.80, Kaiser-Meyer-Olkin value 0.825, Bartlett's test X 2 = 750.779 and p = 0.000. Then the principal component analysis was applied to the 15-item inventory. The inventory consisted of one dimension, and the load factors were over 0.30 for all items. The explained variance of the inventory was found 33.3 %. The Kuder-Richardson-20 coefficient was determined to be 0.849 and the item-total correlations ranged between 0.335 and 0.627. The Prostate Cancer Fatalism Inventory was a reliable and valid measurement tool in Turkish language. Integrating psychological strategies for prostate cancer screening may be required to strengthen the positive effects of nursing education.
An instrument for assessment of videotapes of general practitioners' performance.
Cox, J; Mulholland, H
1993-01-01
OBJECTIVES--To identify those important characteristics of doctors' and patients' behaviour that distinguish between "good" and "bad" consultations when viewed on videotape; to use these characteristics to develop a reliable instrument for assessing general practitioners' performance in their own consultations. DESIGN--Questionnaires completed by patients, general practitioner trainers, and general practitioner trainees. Reliability of draft instrument tested by general practitioner trainers. SETTING--All vocational training schemes for general practice in the Northern region of England. SUBJECTS--First stage: 76 patients in seven groups, 108 general practice trainers in 12 groups, and 122 general practice trainees in 10 groups. Second stage: 85 general practice trainers in 12 groups. MAIN OUTCOME MEASURES--Trainers' ratings of importance; alpha coefficients of draft instrument by trainee, group, and consultation. RESULTS--6890 characteristics of good and bad consultations were consolidated into a draft assessment instrument consisting of 46 pairs of definitions separated by six point bipolar scales. Nine statement pairs given low importance ratings by trainers were eliminated, reducing the instrument to 37 statement pairs. To test reliability, general practitioner trainers used the instrument to assess three consultations. With the exception of one group of trainers, all alpha coefficients exceeded the acceptable level of 0.80. CONCLUSION--The instrument produced is reliable for assessing general practitioners' performance in their own consultations. PMID:8490501
Dehghan, Parvin; Asghari-Jafarabadi, Mohammad; Salekzamani, Shabnam
2015-01-01
Background: The aim of this study was to assess the validity, reliability and feasibility of eating behavior pattern questionnaire (EBPQ) in female university students. Methods: In this study, after forward-backward translation, the questionnaire was reviewed by a panel of nutritionists and a psychologist and further thirty participants for the content validity measurement. The translated and modified questionnaire was completed by 225 female students of Tabriz University in 2013. Principle axis factoring, confirmatory factor analysis and known group analysis were conducted for construct, convergent and discriminant validity. Internal consistency and test–retest reliability were assessed by Cronbach’s α coefficient and intra-class correlation coefficient (ICC). Ceiling and floor effects were also performed for evaluating the feasibility of the instrument. Results: By using exploratory factor analysis, nine factors were extracted. Confirmatory factor analysis confirmed the convergent validity. Cronbach ’s αand ICC were ranged between 0.55 to 0.78 and 0.67 to 0.89, respectively. The significant difference for some three subscales between diabetes and healthy subjects determined the discriminant validity. No ceiling and floor effects were found. Conclusion: Our findings demonstrate the initial validity, reliability and feasibility of the Iranian version of EBPQ as a useful tool for eating behavior studies in young females. PMID:26290828
[Testing reliability and validity of reduced substitutes for leadership scales(rd-SLS)].
Kim, Jeong-Hee
2005-10-01
This paper was conducted to test the reliability and validity of rd-SLS, developed by Podsakoff, et al. (1993) which measured 'substitutes for leadership'. The subjects were 345 nurses in 5 general hospitals. Cronbach's and the Guttman split-half coefficient were used to test the reliability of rd-SLS. Factor analysis, and the correlations of the rv-SLS and SLS with rd-SLS were used for convergent and discriminant validity. Cronbach's data was 0.76 and the Guttman split-half coefficient was 0.52. Twelve factors evolved by factor analysis, which explained 70.4% of the total variance. This result was similar to previous study results. However, 'Indifference toward organizational rewards'-related items were classified two factors. It was not clear t hat the rd-SLS consisted of 13 concepts(factors). The correlations of the rv-SLS and SLS with the rd-SLS were 0.93 and 0.87 respectively. The rd-SLS showed a moderate degree of validity and reliability. Thus, it is recommended to use the rd-SLS in general nursing organizations for screening for leadership substitutes. In addition, it is necessary to clarify the concept of organizational rewards. In a further study, the factor structure of the rd-SLS may be considered.
Roque, Fátima; Soares, Sara; Breitenfeld, Luiza; Gonzalez-Gonzalez, Cristian; Figueiras, Adolfo; Herdeiro, Maria Teresa
2014-01-01
To develop and evaluate the reliability of a self-administered questionnaire designed to assess the attitudes and knowledge of community pharmacists in Portugal about microbial resistance and the antibiotic dispensing process. This study was divided into the following three stages: (1) design of the questionnaire, which included a literature review and a qualitative study with focus-group sessions; (2) assessment of face and content validity, using a panel of experts and a pre-test of community pharmacists; and, (3) pilot study and reliability analysis, which included a test-retest study covering fifty practising pharmacists based at community pharmacies in five districts situated in Northern Portugal. Questionnaire reproducibility was quantified using the intraclass correlation coefficient (ICC; 95% confidence interval) computed by means of one-way analysis of variance (ANOVA). Internal consistency was evaluated using Cronbach's alpha. The correlation coefficients were fair to good (ICC>0.4) for all statements (scale-items) regarding knowledge of and attitudes to antibiotic resistance, and ranged from fair to good to excellent for statements about situations in which pharmacists acknowledged that antibiotics were sometimes dispensed without a medical prescription (ICC>0.8). Cronbach's alpha for this section was 0.716. The questionnaire designed in this study is valid and reliable in terms of content validity, face validity and reproducibility.
Uggioni, Paula Lazzarin; Salay, Elisabete
2012-04-01
Validated and reliable instruments for measuring consumer attitudes regarding food quality certifications are lacking, but the measurement of consumer attitude could be an important tool for understanding consumer behavior. Thus the objective of this study was to develop an instrument for measuring consumer attitudes regarding private food safety certifications for commercial restaurants. To this end, the following steps were carried out: development of the interview items; complete pilot testing; item analyses (influence of social desirability and total-item correlation); reliability test (internal consistency and test-retest); and validity assessment (content and discriminative validity and exploratory and confirmatory factor analysis). The subjects, all over the age of 18 and drawn from six non-probabilistic samples (n=7-350) in the city of Campinas, Brazil, were all subjected to an interview. The final scale included 24 items and had a Cronbach's alpha coefficient of 0.79 and a content validation coefficient of 0.99, both within acceptable limits. The confirmatory factor analysis validated a model with five factors and the final instrument discriminated reasonably well between the groups and showed satisfactory reproducibility (r=0.955). Furthermore, the scale validity and reliability were satisfactory, suggesting it could also be applied to future studies. Copyright © 2011 Elsevier Ltd. All rights reserved.
The intra-individual reproducibility of flash-evoked potentials in a sample of children.
Schellberg, D; Gasser, T; Köhler, W
1987-07-01
Visual evoked potentials (VEPs) to flash stimuli were recorded twice from 26 children aged 10-13 years, with an intersession interval of about 10 months. Test-retest reliability was poor for recordings taken from scalp locations overlying non-specific cortex and somewhat better for specific cortex. The size of consistency coefficients (i.e. correlations within session) showed that noise and artefacts were not the decisive factors which lower reliability. A comparison with retest correlations of broad band parameters of the EEG at rest for the same sample showed, to our surprise, smaller retest reliability for VEP parameters. Variability of the VEP in children over time seems to be a substantial as its well-known inter-individual variability.
Portuguese Version of the Pain Beliefs and Perceptions Inventory: A Multicenter Validation Study.
Azevedo, Luís Filipe; Sampaio, Rute; Camila Dias, Cláudia; Romão, José; Lemos, Laurinda; Agualusa, Luís; Vaz-Serra, Sílvia; Patto, Teresa; Costa-Pereira, Altamiro; Castro-Lopes, José Manuel
2017-07-01
We aimed to perform the translation, cultural adaptation, and validation of the Pain Beliefs and Perceptions Inventory (PBPI) for the European Portuguese language and chronic pain population. This is a longitudinal multicenter validation study. A Portuguese version of the PBPI (PBPI-P) was created through a process of translation, back translation, and expert panel evaluation. The PBPI-P was administered to a total of 122 patients from 13 chronic pain clinics in Portugal, at baseline and after 7 days. Internal consistency and test-retest reliability were assessed by Cronbach's alpha (α) and intraclass correlation coefficient (ICC). Construct (convergent and discriminant) validity was assessed based on a set of previously developed theoretical hypotheses about interrelations between the PBPI-P and other measures. Exploratory and confirmatory factor analyses were performed to test the theoretical structure of the PBPI-P. The internal consistency and test-retest reliability coefficients for each respective subscale were α = 0.620 and ICC = 0.801 for mystery; α = 0.744 and ICC = 0.841 for permanence; α = 0.778 and ICC = 0.791 for constancy; and α = 0.764 and ICC = 0.881 for self-blame. Exploratory and confirmatory factor analysis revealed a four-factor structure (performance, constancy, self-blame, and mystery) that explained 63% of the variance. The construct validity of the PBPI-P was shown to be adequate, with more than 90% of the previously defined hypotheses regarding interrelations with other measures confirmed. The PBPI-P has been shown to be adequate and to have excellent reliability, internal consistency, and validity. It may contribute to a better pain assessment and is suitable for research and clinical use. © 2016 World Institute of Pain.
Campiñez Navarro, Manuel; Pérula de Torres, Luis Ángel; Bosch Fontcuberta, Josep M; Barragán Brun, Nieves; Arbonies Ortiz, Juan Carlos; Novo Rodríguez, Jesús Manuel; Bóveda Fontán, Julia; Martín Alvarez, Remedios; Prados Castillejo, Jose Antonio; Rivas Doutreleau, Gabriela Renée; Domingo Peña, Carmen; Castro Moreno, Jaime Jesús; Romero Rodríguez, Esperanza María
2016-09-01
Motivational interviewing (MI) is a collaborative, goal-oriented method to help patients change behaviour. Tools that are often used to measure MI are the motivational interviewing skills code' (MISC), the 'motivational interviewing treatment integrity' (MITI) and the 'behaviour change counselling index' (BECCI). The first two instruments have not been designed to be used in primary healthcare (PHC) settings. The BECCI actually is time-consuming. The motivational interviewing assessment scale (MIAS, 'EVEM' in Spanish) was developed to measure MI in PHC encounters as an alternative to the previous instruments. To validate MIAS as an instrument to assess the quality of MI in PHC settings. (a) Sixteen experts in MI participated in the design, face and consensus validity, using a Delphi-type methodology. (b) 27 PHC centres located in Spain. four experts in MI tested its psychometric properties with 332 video recordings coming from the Dislip-EM study (consultations provided by 37 practitioners). dimensionality, internal consistency, reliability (intra-class correlation coefficient-ICC), sensitivity to change and convergent validity with the BECCI scale. A 14-item scale was obtained after the validation process. Factor analysis: two factors explained 76.6% of the total variance. Internal consistency, α = 0.99. Reliability: intra-rater ICC = 0.96; inter-rater ICC = 0.97. Sensitivity to change: means before and after training were 23.63 versus 38.57 (P < 0.001). Spearman's coefficient between the MIAS and the BECCI scale was 0.98 (P < 0.001). The MIAS is a consistent and reliable instrument to assess the use of MI in PHC settings. [Box: see text].
Hadi, Azlihanis Abdul; Naing, Nyi Nyi; Daud, Aziah; Nordin, Rusli
2006-11-01
This study was conducted to assess the reliability and construct validity of the Malay version of Job Content Questionnaire (JCQ) among secondary school teachers in Kota Bharu, Kelantan. A total of 68 teachers consented to participate in the study and were administered the Malay version of JCQ. Reliability was determined using Cronbach's alpha for internal consistency whilst construct validity was assessed using factor analysis. The results indicated that Cronbach's alpha coefficients revealed decision latitude (0.75), psychological job demand (0.50) and social support (0.84). Factor analysis showed three meaningful common factors that could explain the construct of Karasek's demand-control-social support model. The study suggests the JCQ scales are reliable and valid tools for assessing job stress in school teachers.
Murray, Nicholas P.; Hunfalvay, Melissa; Bolte, Takumi
2017-01-01
Purpose The purpose of this study was to determine the reliability of interpupillary distance (IPD) and pupil diameter (PD) measures using an infrared eye tracker and central point stimuli. Validity of the test compared to known clinical tools was determined, and normative data was established against which individuals can measure themselves. Methods Participants (416) across various demographics were examined for normative data. Of these, 50 were examined for reliability and validity. Validity for IPD measured the test (RightEye IPD/PD) against the PL850 Pupilometer and the Essilor Digital CRP. For PD, the test was measured against the Rosenbaum Pocket Vision Screener (RPVS). Reliability was analyzed with intraclass correlation coefficients (ICC) between trials with Cronbach's alpha (CA) and the standard error of measurement for each ICC. Convergent validity was investigated by calculating the bivariate correlation coefficient. Results Reliability results were strong (CA > 0.7) for all measures. High positive significant correlations were found between the RightEye IPD test and the PL850 Pupilometer (P < 0.001) and Essilor Digital CRP (P < 0.001) and for the RightEye PD test and the RPVS (P < 0.001). Conclusions Using infrared eye tracking and the RightEye IPD/PD test stimuli, reliable and accurate measures of IPD and PD were found. Results from normative data showed an adequate comparison for people with normal vision development. Translational Relevance Results revealed a central point of fixation may remove variability in examining PD reliably using infrared eye tracking when consistent environmental and experimental procedures are conducted. PMID:28685104
Fiori, Simona; Cioni, Giovanni; Klingels, Katrjin; Ortibus, Els; Van Gestel, Leen; Rose, Stephen; Boyd, Roslyn N; Feys, Hilde; Guzzetta, Andrea
2014-09-01
To describe the development of a novel rating scale for classification of brain structural magnetic resonance imaging (MRI) in children with cerebral palsy (CP) and to assess its interrater and intrarater reliability. The scale consists of three sections. Section 1 contains descriptive information about the patient and MRI. Section 2 contains the graphical template of brain hemispheres onto which the lesion is transposed. Section 3 contains the scoring system for the quantitative analysis of the lesion characteristics, grouped into different global scores and subscores that assess separately side, regions, and depth. A larger interrater and intrarater reliability study was performed in 34 children with CP (22 males, 12 females; mean age at scan of 9 y 5 mo [SD 3 y 3 mo], range 4 y-16 y 11 mo; Gross Motor Function Classification System level I, [n=22], II [n=10], and level III [n=2]). Very high interrater and intrarater reliability of the total score was found with indices above 0.87. Reliability coefficients of the lobar and hemispheric subscores ranged between 0.53 and 0.95. Global scores for hemispheres, basal ganglia, brain stem, and corpus callosum showed reliability coefficients above 0.65. This study presents the first visual, semi-quantitative scale for classification of brain structural MRI in children with CP. The high degree of reliability of the scale supports its potential application for investigating the relationship between brain structure and function and examining treatment response according to brain lesion severity in children with CP. © 2014 Mac Keith Press.
Development and validation of the Japanese version of cognitive flexibility scale.
Oshiro, Keiko; Nagaoka, Sawako; Shimizu, Eiji
2016-05-17
Various instruments have been developed to assess cognitive flexibility, which is an important construct in psychology. Among these, the self-report cognitive flexibility scale (CFS) is particularly popular for use with English speakers; however, there is not yet a Japanese version of this scale. This study reports on the development of a Japanese version of the cognitive flexibility scale (CFS-J), and the assessment of its internal consistency, test-retest reliability, and validities. We used the standard translation-back-translation process to develop the Japanese wording of the items and tested these using a sample of 335 eligible participants who did not have a mental illness, were aged 18 years or older, and lived in the suburbs of Tokyo. Participants included office workers, public servants, and college students; 71.6 % were women and 64.8 % were students. The translated scale's internal consistency reliability was assessed by calculating Cronbach's alpha and McDonald's omega, and test-retest reliability was assessed with 107 eligible participants via intra-class correlation coefficient (ICC) and Spearman's correlation of coefficient. Exploratory factory analysis (EFA) and correlations with other scales were used to examine the factor-based and concurrent validities of the CFS-J. Results indicated that the CFS-J has good internal consistency (Cronbach's alpha = 0.847, McDonald's omega = 0.871) and acceptable test-retest reliability (Spearman's = 0.687, ICC = 0.689). EFA provided evidence that the CFS-J has a one-factor structure and factor loadings were generally appropriate. The total CFS-J score was significantly and positively correlated with the cognitive flexibility inventory-Japanese version and its two subscales, along with the cognitive control scale and the positive subscale of the short Japanese version of the automatic thought questionnaire-revised (ATQ-R); further, it had a significantly negative correlation with the negative subscale of the ATQ-R (ps < 0.001). This study developed a Japanese version of the cognitive flexibility scale and confirmed its reliability and validity among a sample of people with no current mental illness, who were living in the suburbs of Tokyo.
Uehara, Kosuke; Ogura, Koichi; Akiyama, Toru; Shinoda, Yusuke; Iwata, Shintaro; Kobayashi, Eisuke; Tanzawa, Yoshikazu; Yonemoto, Tsukasa; Kawano, Hirotaka; Kawai, Akira
2017-09-01
The Musculoskeletal Tumor Society (MSTS) scoring system developed in 1993 is a widely used disease-specific evaluation tool for assessment of physical function in patients with musculoskeletal tumors; however, only a few studies have confirmed its reliability and validity. The aim of this study was to validate the MSTS scoring system for the upper extremity (MSTS-UE) in Japanese patients with musculoskeletal tumors for use by others in research. Does the MSTS-UE have: (1) sufficient reliability and internal consistency; (2) adequate construct validity; and (3) reasonable criterion validity in comparison to the Toronto Extremity Salvage Score (TESS) or SF-36? Reliability was performed using test-retest analysis, and internal consistency was evaluated with Cronbach's alpha coefficient. Construct validity was evaluated using a scree plot to confirm the construct number and the Akaike information criterion network. Criterion validity was evaluated by comparing the MSTS-UE with the TESS and SF-36. The test-retest reliability with intraclass correlation coefficient (0.95; 95% CI, 0.91-0.97) was excellent, and internal consistency with Cronbach's α (0.7; 95% CI, 0.53-0.81) was acceptable. There were no ceiling and floor effects. The Akaike Information Criterion network showed that lifting ability, pain, and dexterity played central roles among the components. The MSTS-UE showed substantial correlation with the TESS scoring scale (r = 0.75; p < 0.001) and fair correlation with the SF-36 physical component summary (r = 0.37; p = 0.007). Although the MSTS-UE showed slight correlation with the SF-36 mental component summary, the emotional acceptance component of the MSTS-UE showed fair correlation (r = 0.29; p = 0.039). We can conclude that the MSTS is not an adequate measure of general health-related quality of life; however, this system was designed mainly to be a simple measure of function in a single extremity. To evaluate the mental state of patients with musculoskeletal tumors in the upper extremity, further study is needed.
ERIC Educational Resources Information Center
Morgan, Grant B.; Zhu, Min; Johnson, Robert L.; Hodge, Kari J.
2014-01-01
Common estimators of interrater reliability include Pearson product-moment correlation coefficients, Spearman rank-order correlations, and the generalizability coefficient. The purpose of this study was to examine the accuracy of estimators of interrater reliability when varying the true reliability, number of scale categories, and number of…
Cross-cultural Adaption and Validation of the Danish Voice Handicap Index.
Sorensen, Jesper Roed; Printz, Trine; Mehlum, Camilla Slot; Heidemann, Christian Hamilton; Groentved, Aagot Moeller; Godballe, Christian
2018-02-02
We aimed to assess psychometric properties, including internal consistency, reliability, and clinical validity of the Danish version of the Voice Handicap Index (VHI). A cross-sectional survey study was carried out. For validation, the existing nonvalidated Danish version of the VHI was used. Data from 208 patients with voice disorders of different etiology (neurogenic, functional, and structural) and a control group of 85 vocally healthy individuals were included. A test-retest reliability analysis of 42 patients and 45 control persons was performed. The internal consistency, test-retest reliability, and clinical validity of the questionnaire were assessed. Internal consistency was high with a Cronbach α >0.90 for both the patient and control group. Test-retest reliability measured as intraclass correlation coefficient was good with 0.93 (95% confidence interval [95% confidence interval]: 0.87-0.96) for patients and 0.78 (95% confidence interval: 0.63-0.87) for the control group which indicates sufficient reliability of the questionnaire. The Danish VHI has good clinical validity as it has a strong correlation between patient's perception of the severity of their voice disorder and the VHI score from the Spearman correlation of 0.69. The existing Danish version of the VHI has been thoroughly validated and found to be in line with the original VHI from Jacobsen et al. It showed good internal consistency, test-retest reliability, and clinical validity. It is suitable for use in daily practice and in research projects as it is able to assess patients' perception of their voice disorder severity. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh
2015-05-01
The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at University Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts were identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regards to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass coefficient correlation range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It showed high internal consistency, thereby providing evidence that the CCEVI has moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
Maddali Bongi, S; Del Rosso, A; Miniati, I; Galluccio, F; Landi, G; Tai, G; Matucci-Cerinic, M
2012-09-01
In systemic sclerosis (SSc), mouth and face involvement leads to problems in oral health-related quality of life (OHRQoL). Mouth Handicap in Systemic Sclerosis scale (MHISS) is a 12-item questionnaire specifically quantifying mouth disability in SSc, organized in 3 subscales. Our aim was to validate Italian version of MHISS, by assessing its test-retest reliability and internal and external consistency in Italian SSc patients. Forty SSc patients (7 dSSc, 33 lSSc; age and disease duration: 57.27 ± 11.41, 9.4 ± 4.4 years; 22 with sicca syndrome) were evaluated with MHISS. MHISS was translated following a forward-backward translation procedure, with independent translations and counter-translation. Test-retest reliability was evaluated, comparing the results of two administrations, with intraclass correlation coefficient (ICC). Internal consistency was assessed by Cronbach's α and external consistency by comparison with mouth opening. MHISS has a good test-retest reliability (ICC: 0.93) and internal consistency (Cronbach's α:0.99). A good external consistency was confirmed by correlation with mouth opening (rho: -0,3869, p: 0.0137). Total MHISS score was 17.65 ± 5.20, with scores of subscale 1 (reduced mouth opening) of 6.60 ± 2.85 and scores of subscales 2 (sicca syndrome) and 3 (aesthetic concerns) of 7.82 ± 2.59 and 3.22 ± 1.14. Total and subscale 2 scores are higher in dSSc than in lSSc. This result may be due to the higher presence of sicca syndrome in dSSc than in lSSc (p = 0.0109). Our results support validity and reliability in Italian SSc patients of MHISS, specifically measuring SSc OHRQoL.
Cross-cultural adaptation and validation of the Turkish version of Oxford hip score.
Tuğay, Baki Umut; Tuğay, Nazan; Güney, Hande; Hazar, Zeynep; Yüksel, İnci; Atilla, Bülent
2015-06-01
The purpose of this study was to translate the Oxford hip score (OHS) into Turkish and to evaluate the psychometric properties by testing the internal consistency, reproducibility, construct validity, and responsiveness in patients with hip osteoarthritis (OA). Oxford hip score was translated and culturally adapted according to the guidelines in the literature. Seventy patients (mean age 61.45 ± 9.29 years) with hip osteoarthritis participated in the study. Patients completed the Turkish Oxford hip score (OHS-TR), the Short-Form 36 (SF-36), and Western Ontario and McMaster Universities Index (WOMAC). Internal consistency was tested using Cronbach's α coefficient. Patients completed OHS-TR questionnaire twice in 7 days for determining the reproducibility. Correlation between the total results of both tests was determined by the Pearson correlation coefficient and intraclass correlation coefficient (ICC). Validity was assessed by calculating the Pearson correlation coefficient between the OHS-TR and WOMAC and SF-36 scores. Floor and ceiling effects were analyzed. The internal consistency was high (Cronbach's α 0.93). The construct validity showed a significant correlation between the OHS-TR and WOMAC and related SF-36 domains (p < 0.001). The ICC's ranged between 0.80 and 0.99. There was no floor or ceiling effect in total OHS-TR score. The OHS-TR questionnaire is valid, reliable, and responsive for the Turkish-speaking patients with hip OA.
Jones, G L; Morrell, C J; Cooke, J M; Speier, D; Anumba, D; Stewart-Brown, S
2011-09-01
To develop and psychometrically evaluate two questionnaires measuring both positive and negative postnatal health of mothers (M-PHI) and fathers (F-PHI) during the first year of parenting. The M-PHI and the F-PHI were developed in four stages. Stage 1: Postnatal women's focus group (M-PHI) and postnatal fathers' postal questionnaire (F-PHI); Stage 2: Qualitative interviews; Stage 3: Pilot postal survey and main postal survey; and Stage 4: Test-retest postal survey. The M-PHI consisted of a 29-item core questionnaire with six main scales and five conditional scales. The F-PHI consisted of a 27-item questionnaire with six main scales. All scales achieved good internal reliability (Cronbach's α 0.66-0.87 for M-PHI, 0.72-0.90 for F-PHI). Intraclass correlation coefficients demonstrated high test-retest reliability (0.60-0.88). Correlation coefficients supported the criterion validity of the M-PHI and the F-PHI when tested against the Short-Form-12 (SF-12), Edinburgh Postnatal Depression Scale (EPDS) and the Warwick and Edinburgh Mental Well-Being Scale (WEMWBS). The M-PHI and F-PHI are valid, reliable, parent-generated instruments. These unique instruments will be invaluable for practitioners wishing to promote family-centred care and for trialists and other researchers requiring a validated instrument to measure both positive and negative health during the first postnatal year, as to date no such measurement has existed.
Psychometric Properties of the Persian Version of the Simple Shoulder Test (SST) Questionnaire.
Ebrahimzadeh, Mohammad H; Vahedi, Ehsan; Baradaran, Aslan; Birjandinejad, Ali; Seyyed-Hoseinian, Seyyed-Hadi; Bagheri, Farshid; Kachooei, Amir Reza
2016-10-01
To validate the Persian version of the simple shoulder test in patients with shoulder joint problems. Following Beaton`s guideline, translation and back translation was conducted. We reached to a consensus on the Persian version of SST. To test the face validity in a pilot study, the Persian SST was administered to 20 individuals with shoulder joint conditions. We enrolled 148 consecutive patients with shoulder problem to fill the Persian SST, shoulder specific measure including Oxford shoulder score (OSS) and two general measures including DASH and SF-36. To measure the test-retest reliability, 42 patients were randomly asked to fill the Persian-SST for the second time after one week. Cronbach's alpha coefficient was used to demonstrate internal consistency over the 12 items of Persian-SST. ICC for the total questionnaire was 0.61 showing good and acceptable test-retest reliability. ICC for individual items ranged from 0.32 to 0.79. The total Cronbach's alpha was 0.84 showing good internal consistency over the 12 items of the Persian-SST. Validity testing showed strong correlation between SST and OSS and DASH. The correlation with OSS was positive while with DASH scores was negative. The correlation was also good to strong with all physical and most mental subscales of the SF-36. Correlation coefficient was higher with DASH and OSS in compare to SF-36. Persian version of SST found to be valid and reliable instrument for shoulder joint pain and function assessment in Iranian population.
Development of the Assessment of Belief Conflict in Relationship-14 (ABCR-14)
Kyougoku, Makoto; Teraoka, Mutsumi; Masuda, Noriko; Ooura, Mariko; Abe, Yasushi
2015-01-01
Purpose Nurses and other healthcare workers frequently experience belief conflict, one of the most important, new stress-related problems in both academic and clinical fields. Methods In this study, using a sample of 1,683 nursing practitioners, we developed The Assessment of Belief Conflict in Relationship-14 (ABCR-14), a new scale that assesses belief conflict in the healthcare field. Standard psychometric procedures were used to develop and test the scale, including a qualitative framework concept and item-pool development, item reduction, and scale development. We analyzed the psychometric properties of ABCR-14 according to entropy, polyserial correlation coefficient, exploratory factor analysis, confirmatory factor analysis, average variance extracted, Cronbach’s alpha, Pearson product-moment correlation coefficient, and multidimensional item response theory (MIRT). Results The results of the analysis supported a three-factor model consisting of 14 items. The validity and reliability of ABCR-14 was suggested by evidence from high construct validity, structural validity, hypothesis testing, internal consistency reliability, and concurrent validity. The result of the MIRT offered strong support for good item response of item slope parameters and difficulty parameters. However, the ABCR-14 Likert scale might need to be explored from the MIRT point of view. Yet, as mentioned above, there is sufficient evidence to support that ABCR-14 has high validity and reliability. Conclusion The ABCR-14 demonstrates good psychometric properties for nursing belief conflict. Further studies are recommended to confirm its application in clinical practice. PMID:26247356
A Psychometric Analysis of Quality of Life Tools in Lung Cancer Patients Who Smoke
Browning, Kristine K.; Ferketich, Amy K.; Otterson, Gregory A.; Reynolds, Nancy R.; Wewers, Mary Ellen
2009-01-01
Lung cancer is the leading cause of cancer death for both men and women in the United States. Patient quality of life (QOL) prior to cancer treatment is known to be a strong predictor of survival and toleration of treatment toxicities. A lung cancer patient’s self-assessment of QOL is highly valued among clinicians as it guides treatment-related decisions and impacts clinical outcomes. Smokers are known to report a lower QOL. Limited research has been conducted on QOL outcomes in lung cancer patients who continue to smoke. To assess QOL, a reliable and valid QOL measure specific to lung cancer is required. The Functional Assessment of Cancer Therapy-Lung Cancer (FACT-L) and Lung Cancer Symptom Scale (LCSS) are instruments that specifically examine QOL among lung cancer patients. The LCSS is a focused QOL instrument that includes physical and functional domains of QOL and disease symptomatology. The FACT-L is a broader QOL instrument that includes physical, functional, social and emotional domains and disease symptomatology. Both are psychometrically valid and are widely used in the literature, but have not been exclusively evaluated in smokers. Furthermore, there is no ‘gold standard’ instrument since there has never been a correlation study to compare estimates of reliability and validity between these instruments. The purpose of this study is to report the internal consistency and convergence validity of the FACT-L and the LCSS among newly diagnosed lung cancer patients who smoke. This data were collected and analyzed from a larger study examining smoking behavior among newly diagnosed lung cancer patients (n=51). Descriptive statistics were calculated on the FACT-L and LCSS scores, internal consistency was assessed by estimating Cronbach’s alpha coefficients, and Pearson correlation coefficients were estimated between the two scales. Internal consistency coefficients demonstrated good reliability for both scales, and the two instruments demonstrated a strong correlation, suggesting good convergence validity. Either of these instruments are appropriate measures for QOL in lung cancer patients who smoke. Given the conceptual difference between the two instruments, it is important to carefully consider the research aims when selecting the appropriate QOL measurement instrument. PMID:19181418
Nessen, Thomas; Demmelmaier, Ingrid; Nordgren, Birgitta; Opava, Christina H
2015-01-01
The aim of the present study was to investigate aspects of reliability and validity of the Exercise Self-Efficacy Scale (ESES-S) in a rheumatoid arthritis (RA) population. A total of 244 people with RA participating in a physical activity study were included. The six-item ESES-S, exploring confidence in performing exercise, was assessed for test-retest reliability over 4-6 months, and for internal consistency. Construct validity investigated correlation with similar and other constructs. An intraclass correlation coefficient (ICC) of 0.59 (95% CI 0.37-0.73) was found for 84 participants with stable health perceptions between measurement occasions. Cronbach's alpha coefficients of 0.87 and 0.89 were found at the first and second measurements. Corrected item-total correlation single ESES-S items ranged between 0.53 and 0.73. Construct convergent validity for the ESES-S was partly confirmed by correlations with health-enhancing physical activity and outcome expectations respectively (Pearson's r = 0.18, p < 0.01). Construct divergent validity was confirmed by the absence of correlations with age or gender. No floor or ceiling effects were found for ESES-S. The results indicate that the ESES-S has moderate test-retest reliability and respectable internal consistency in people with RA. Construct validity was partially supported in the present sample. Further research on construct validity of the ESES-S is recommended. Physical exercise is crucial for management of symptoms and co-morbidity in rheumatoid arthritis. Self-efficacy for exercise is important to address in rehabilitation as it regulates exercise motivation and behavior. Measurement properties of self-efficacy scales need to be assessed in specific populations and different languages.
Saez de la Fuente, Javier; Such Diaz, Ana; Cañamares-Orbis, Irene; Ramila, Estela; Izquierdo-Garcia, Elsa; Esteban, Concepcion; Escobar-Rodríguez, Ismael
2016-11-01
The most widely used validated instrument to assess the complexity of medication regimens is the Medication Regimen Complexity Index (MRCI). This study aimed to translate, adapt, and validate a reliable version of the MRCI adapted to Spanish (MRCI-E). The cross-cultural adaptation process consisted of an independent translation by 3 clinical pharmacists and a backtranslation by 2 native English speakers. A reliability analysis was conducted on 20 elderly randomly selected patients. Two clinical pharmacists calculated the MRCI-E from discharge treatments and 2 months later. For the validity analysis, the sample was augmented to 60 patients. Convergent validity was assessed by analyzing the correlation between the number of medications; discriminant validity was stratified by gender; and predictive validity was determined by analyzing the ability to predict readmission and mortality at 3 and 6 months. The MRCI-E retained the original structure of 3 sections. The reliability analysis demonstrated an excellent internal consistency (Cronbach's α=0.83), and the intraclass correlation coefficient exceeded 0.9 in all cases. The correlation coefficient with the number of medications was 0.883 ( P<0.001). No significant differences were found when stratified by gender (3.6; 95%CI=-2.9 to 10.2; P=0.27). Patients who were readmitted at 3 months had a higher MRCI-E score (10.7; 95%CI=4.4 to 17.2; P=0.001). The differences remained significant in patients readmitted at 6 months, but differences in mortality were not detected. The MRCI-E retains the reliability and validity of the original index and provides a suitable tool to assess the complexity of medication regimens in Spanish.
Cross-Cultural Adaptation and Validation of the Back Beliefs Questionnaire to the Arabic Language.
Alamrani, Samia; Alsobayel, Hana; Alnahdi, Ali H; Moloney, Niamh; Mackey, Martin
2016-06-01
Translation, cross-cultural adaptation, and psychometric testing. To translate the Back Beliefs Questionnaire (BBQ) into Arabic and investigate its psychometric properties in an Arabic-speaking sample of individuals with low back pain (LBP). Back pain beliefs are associated with pain chronicity and disability in people with LBP. The BBQ is a recognized and frequently used tool for measuring these beliefs. To date the BBQ has not been translated into Arabic. The English version of the BBQ was translated and culturally adapted into Arabic (BBQ-Ar) according to published guidelines. The BBQ-Ar was then tested in a sample of 115 Arabic-speaking individuals with LBP. Reliability was evaluated through internal consistency (Cronbach α) and test-retest reliability (intraclass correlation coefficient), the latter in a subgroup of 25. Construct validity was assessed using exploratory factor analysis and by examining the correlation between the BBQ-Ar, the Oswestry Disability Index and a Numerical Pain Rating Scale. Internal consistency of the BBQ-Ar was good (Cronbach α = 0.77). Test-retest reliability was good (intraclass correlation coefficient [2,1] = 0.88). Exploratory factor analysis revealed a three-factor structure, explaining 46% of total variance, with the first factor alone explaining 24%. Eight of the nine scoring items were loaded on the first factor thus forming a unidimensional scale. A significant negative correlation was found between Oswestry Disability Index and BBQ-Ar scores (r = -0.307; P < 0.01), whereas no significant correlation was found between BBQ-Ar and Pain Rating Scale scores. No floor or celling effects were observed. The BBQ-Ar is a valid and reliable tool that can be used to assess back pain beliefs in Arabic-speaking individuals. N/A.
Chen, Yu-Ming; He, Li-Ping; Mai, Jin-Cheng; Hao, Yuan-Tao; Xiong, Li-Hua; Chen, Wei-Qing; Wu, Jiang-Nan
2008-06-01
To evaluate the reliability and validity of parent proxy-report scales of Pediatric Quality of Life Inventory Version 4.0 (PedsQL 4.0) Generic Core Scales, the Chinese Version. 3493 school students aged 6-18 years were recruited using multistage cluster sampling method. Health-related quality of life was assessed using the above-mentioned PedsQL 4.0 scales. The internal consistency was assessed, using Cronbach's a coefficient, while its validity was tested through correlation analysis, t-test and exploratory factor analysis. The internal consistency reliability for Total Scale Score (Cronbach's alpha = 0.90), Physical Health Summary Score (alpha= 0.81), and Psychosocial Health Summary Score (alpha= 0.89) were excellent. Six major factors were extracted by factor analysis which basically matched the designed structure of the original version accounting for nearly 66% of the variance. The total Scale Score significantly decreased by 3.5 to 13.3 (P < 0.05) in children and adolescents who had diseases including cold, skin hypersensitiveness, food allergy, courbature or arthralgia, breathlessness with a frequency of 6 times or more per year or had asthma as compared to those with lower frequency (< or = 5 times/y) of the diseases or without asthma. We found moderate to high correlations between items and the subscales. Correlation coefficients ranged between 0.45 to 0.84 (P < 0.01). The reliability and validity of the parent proxy-report scales of PedsQL 4.0 Generic Core Scales of the Chinese Version were as good as the original version. Our findings suggested that the scales could be applied to evaluate the health-related quality of life in childhood children in similar Chinese regions to Guangzhou.
McAlinden, Colm; Pesudovs, Konrad; Moore, Jonathan E
2010-11-01
To develop an instrument to measure subjective quality of vision: the Quality of Vision (QoV) questionnaire. A 30-item instrument was designed with 10 symptoms rated in each of three scales (frequency, severity, and bothersome). The QoV was completed by 900 subjects in groups of spectacle wearers, contact lens wearers, and those having had laser refractive surgery, intraocular refractive surgery, or eye disease and investigated with Rasch analysis and traditional statistics. Validity and reliability were assessed by Rasch fit statistics, principal components analysis (PCA), person separation, differential item functioning (DIF), item targeting, construct validity (correlation with visual acuity, contrast sensitivity, total root mean square [RMS] higher order aberrations [HOA]), and test-retest reliability (two-way random intraclass correlation coefficients [ICC] and 95% repeatability coefficients [R(c)]). Rasch analysis demonstrated good precision, reliability, and internal consistency for all three scales (mean square infit and outfit within 0.81-1.27; PCA >60% variance explained by the principal component; person separation 2.08, 2.10, and 2.01 respectively; and minimal DIF). Construct validity was indicated by strong correlations with visual acuity, contrast sensitivity and RMS HOA. Test-retest reliability was evidenced by a minimum ICC of 0.867 and a minimum 95% R(c) of 1.55 units. The QoV Questionnaire consists of a Rasch-tested, linear-scaled, 30-item instrument on three scales providing a QoV score in terms of symptom frequency, severity, and bothersome. It is suitable for measuring QoV in patients with all types of refractive correction, eye surgery, and eye disease that cause QoV problems.
Abdolahi, Amir; Scoglio, Nicholas; Killoran, Annie; Dorsey, Ray; Biglan, Kevin M.
2013-01-01
Background By permitting remote assessments of patients and research participants, telemedicine has the potential to reshape clinical care and clinical trials for Parkinson disease. While the majority of the motor Unified Parkinson’s Disease Rating Scale (UPDRS) items can be conducted visually, rigidity and retropulsion pull testing require hands-on assessment by the rater and are less feasible to perform remotely in patients' homes. Methods In a secondary data analysis of the Comparison of the Agonist pramipexole vs. Levodopa on Motor complications in Parkinson’s Disease (CALM-PD) study, a randomized clinical trial, we assessed the cross-sectional (baseline and 2 years) and longitudinal (change from baseline to 2 years) reliability of a modified motor UPDRS (removing rigidity and retropulsion items) compared to the standard motor UPDRS (all items) using intraclass correlation coefficients (ICC), stratified by treatment group. Internal consistency of the modified UPDRS (mUPDRS) was measured using Cronbach’s alpha, and concurrent validity was assessed using Pearson’s correlation coefficient (r) between the standard motor UPDRS and mUPDRS. Results The mUPDRS versus standard motor UPDRS is cross-sectionally (ICC ≥ 0.92) and longitudinally (ICC ≥ 0.92) reliable for both treatment groups. High internal consistencies were also observed (α ≥ 0.96). The mUPDRS had high concurrent validity with the standard UPDRS at both time points and longitudinally (r ≥ 0.93, p < 0.0001). Conclusions A modified version of the motor UPDRS without rigidity and retropulsion pull testing is reliable and valid and may lay the foundation for its use in remote assessments of patients and research participants. PMID:23102808
Abdolahi, Amir; Scoglio, Nicholas; Killoran, Annie; Dorsey, E Ray; Biglan, Kevin M
2013-02-01
By permitting remote assessments of patients and research participants, telemedicine has the potential to reshape clinical care and clinical trials for Parkinson disease. While the majority of the motor Unified Parkinson's Disease Rating Scale (UPDRS) items can be conducted visually, rigidity and retropulsion pull testing require hands-on assessment by the rater and are less feasible to perform remotely in patients' homes. In a secondary data analysis of the Comparison of the Agonist pramipexole vs. Levodopa on Motor complications in Parkinson's Disease (CALM-PD) study, a randomized clinical trial, we assessed the cross-sectional (baseline and 2 years) and longitudinal (change from baseline to 2 years) reliability of a modified motor UPDRS (removing rigidity and retropulsion items) compared to the standard motor UPDRS (all items) using intraclass correlation coefficients (ICC), stratified by treatment group. Internal consistency of the modified UPDRS (mUPDRS) was measured using Cronbach's alpha, and concurrent validity was assessed using Pearson's correlation coefficient (r) between the standard motor UPDRS and mUPDRS. The mUPDRS versus standard motor UPDRS is cross-sectionally (ICC ≥ 0.92) and longitudinally (ICC ≥ 0.92) reliable for both treatment groups. High internal consistencies were also observed (α ≥ 0.96). The mUPDRS had high concurrent validity with the standard UPDRS at both time points and longitudinally (r ≥ 0.93, p < 0.0001). A modified version of the motor UPDRS without rigidity and retropulsion pull testing is reliable and valid and may lay the foundation for its use in remote assessments of patients and research participants. Copyright © 2012 Elsevier Ltd. All rights reserved.
Abanto, Jenny; Albites, Ursula; Bönecker, Marcelo; Paiva, Saul M; Castillo, Jorge L; Aguilar-Gálvez, Denisse
2015-12-01
The lack of a Family Impact Scale (FIS) in Spanish language limits its use as an indicator in Spanish-speaking countries and precludes comparisons with data from other cultural and ethnic groups. The purpose of this study was therefore to adapt the FIS cross-culturally to the Peruvian Spanish language and assess its reliability and validity. In order to translate and adapt the FIS cross-culturally, it was answered by 60 parents in two pilot tests, after which it was tested on 200 parents of children aged 11 to 14 years who were clinically examined for dental caries experience and malocclusions. Internal consistency was assessed by Cronbach's alpha coefficient while repeat administration of the FIS on the same 200 parents enabled the test-retest reliability to be assessed via intraclass correlation coefficient (ICC). Construct and discriminant validity were based on associations of the FIS with global ratings of oral health and clinical groups, respectively. Mean (standard deviation) FIS total score was 5.20 (5.86). Internal consistency was confirmed by Cronbach's alpha 0.84. Test-retest reliability revealed excellent reproducibility (ICC = 0.96). Construct validity was good, demonstrating statistically significant associations between total FIS score and global ratings of oral health (p=0.007) and overall wellbeing (p=0.002), as well as for the subscale scores (p<0.05) with exception of the financial burden subscale. The FIS was also able to discriminate between children with and without dental caries experience and malocclusions (p<0.05). Satisfactory psychometric results for the Peruvian Spanish FIS confirm it as a reliable, valid instrument for assessing the impact on the family caused by children's oral conditions. Sociedad Argentina de Investigación Odontológica.
Karataş, Tuğba; Özen, Şükrü; Kutlutürkan, Sevinç
2017-01-01
Objective: The main aim of this study was to investigate the factor structure and psychometric properties of the Brief Illness Perception Questionnaire (BIPQ) in Turkish cancer patients. Methods: This methodological study involved 135 cancer patients. Statistical methods included confirmatory or exploratory factor analysis and Cronbach alpha coefficients for internal consistency. Results: The values of fit indices are within the acceptable range. The alpha coefficients for emotional illness representations, cognitive illness representations, and total scale are 0.83, 0.80, and 0.85, respectively. Conclusions: The results confirm the two-factor structure of the Turkish BIPQ and demonstrate its reliability and validity. PMID:28217734
Reliability of self-reported smoking history and age at initial tobacco use.
Huerta, Michael; Chodick, Gabriel; Balicer, Ran D; Davidovitch, Nadav; Grotto, Itamar
2005-08-01
Many studies use questionnaires to determine smoking status and age of smoking onset. This study aimed to determine the reliability of self-reported smoking history and age of smoking initiation. The proportion of inconsistent answers and correlation coefficients of reported age of initial smoking were measured by an answer-reanswer analysis of questionnaires in an ongoing, two-step, population-based survey of health behavior. Interviews were conducted on the day of recruitment to and the day of discharge from mandatory military service in Israel among a sample of 25,437 young men and women recruited between 1986 and 2000. Of 7276 participants reporting current or past smoking upon recruitment, 559 (7.7%) reported never having smoked upon discharge, thus demonstrating prima facie inconsistency. Variables significantly associated with reliable reporting in a multivariate logistic regression model were female gender (P = 0.04) and more than 4 years of military service (P < 0.01). 6010 subjects who reported a positive smoking history at both recruitment and discharge were available for analysis of reliability of reported age at smoking onset. Intraclass correlation coefficients for recruitment/discharge consistency in reported age at first cigarette were 0.73 (95% CI: 0.71-0.74) and 0.76 (95% CI: 0.74-0.78) for men and women, respectively. Eastern origin, lower subject education level, and lower paternal education level were also associated with lower reliability. Our results showed a relatively high level of answer-reanswer reliability, with some variance attributable to personal characteristics. These results suggest that self-reported age at onset of tobacco use is practical and reliable in normative, young adult populations. However, time elapsed between questionnaires and demographic and lifestyle characteristics may affect reliability rates, and thus should be carefully regarded in future studies.
Kobayashi, Sarah; Peduto, Anthony; Simic, Milena; Fransen, Marlene; Refshauge, Kathryn; Mah, Jean; Pappas, Evangelos
2018-04-01
This work aimed to assess inter-rater reliability and agreement of a magnetic resonance imaging (MRI)-based Kellgren and Lawrence (K&L) grading for patellofemoral joint osteoarthritis (OA) and to validate it against the MRI Osteoarthritis Knee Score (MOAKS). MRI scans from people aged 45 to 75 years with chronic knee pain participating in a randomised clinical trial evaluating dietary supplements were utilised. Fifty participants were randomly selected and scored using the MRI-based K&L grading using axial and sagittal MRI scans. Raters conducted inter-rater reliability, blinded to clinical information, radiology reports and other rater results. Intra- and inter-rater reliability and agreement were evaluated using the intra-class correlation coefficient (ICC) and Cohen's weighted kappa. There was a 2-week interval between the first and second readings for intra-rater reliability. Validity was assessed using the MOAKS and evaluated using Spearman's correlation coefficient. Intra-rater reliability of the K&L system was excellent: ICC 0.91 (95% CI 0.82-0.95); weighted kappa (ĸ = 0.69). Inter-rater reliability was high (ICC 0.88; 95% CI 0.79-0.93), while agreement between raters was moderate (ĸ = 0.49-0.57). Validity analysis demonstrated a strong correlation between the total MOAKS features score and the K&L grading system (ρ = 0.62-0.67) but weak correlations when compared with individual MOAKS features (ρ = 0.19-0.61). The high reliability and good agreement show consistency in grading the severity of patellofemoral OA with the MRI-based K&L score. Our validity results suggest that the scale may be useful, particularly in the clinical environment. Future research should validate this method against clinical findings.
Development and validation of a Malawian version of the primary care assessment tool.
Dullie, Luckson; Meland, Eivind; Hetlevik, Øystein; Mildestvedt, Thomas; Gjesdal, Sturla
2018-05-16
Malawi does not have validated tools for assessing primary care performance from patients' experience. The aim of this study was to develop a Malawian version of Primary Care Assessment Tool (PCAT-Mw) and to evaluate its reliability and validity in the assessment of the core primary care dimensions from adult patients' perspective in Malawi. A team of experts assessed the South African version of the primary care assessment tool (ZA-PCAT) for face and content validity. The adapted questionnaire underwent forward and backward translation and a pilot study. The tool was then used in an interviewer administered cross-sectional survey in Neno district, Malawi, to test validity and reliability. Exploratory factor analysis was performed on a random half of the sample to evaluate internal consistency, reliability and construct validity of items and scales. The identified constructs were then tested with confirmatory factor analysis. Likert scale assumption testing and descriptive statistics were done on the final factor structure. The PCAT-Mw was further tested for intra-rater and inter-rater reliability. From the responses of 631 patients, a 29-item PCAT-Mw was constructed comprising seven multi-item scales, representing five primary care dimensions (first contact, continuity, comprehensiveness, coordination and community orientation). All the seven scales achieved good internal consistency, item-total correlations and construct validity. Cronbach's alpha coefficient ranged from 0.66 to 0.91. A satisfactory goodness of fit model was achieved (GFI = 0.90, CFI = 0.91, RMSEA = 0.05, PCLOSE = 0.65). The full range of possible scores was observed for all scales. Scaling assumptions tests were achieved for all except the two comprehensiveness scales. Intra-class correlation coefficient (ICC) was 0.90 (n = 44, 95% CI 0.81-0.94, p < 0.001) for intra-rater reliability and 0.84 (n = 42, 95% CI 0.71-0.96, p < 0.001) for inter-rater reliability. Comprehensive metric analyses supported the reliability and validity of PCAT-Mw in assessing the core concepts of primary care from adult patients' experience. This tool could be used for health service research in primary care in Malawi.
Liu, Zifeng; Yuan, Lianxiong; Huang, Yixiang; Zhang, Lingling; Luo, Futian
2016-01-01
Objective We aimed to develop a questionnaire for quantitative evaluation of the autonomy of public hospitals in China. Method An extensive literature review was conducted to select possible items for inclusion in the questionnaire, which was then reviewed by 5 experts. After a two-round Delphi method, we distributed the questionnaire to 404 secondary and tertiary hospitals in Guangdong Province, China, and 379 completed questionnaires were collected. The final questionnaire was then developed on the basis of the results of exploratory and confirmatory factor analysis. Results Analysis suggested that all internal consistency reliabilities exceeded the minimum reliability standard of 0.70 for the α coefficient. The overall scale coefficient was 0.87, and 6 subscale coefficients were 0.92 (strategic management), 0.81 (budget and expenditure), 0.85 (financing), 0.75 (financing, medical management), 0.86 (human resources) and 0.86 (accountability). Correlation coefficients between and among items and their hypothesised subscales were higher than those with other subscales. The value of average variance extracted (AVE) was higher than 0.5, the value of construct reliability (CR) was higher than 0.7, and the square roots of the AVE of each subscale were larger than the correlation of the specific subscale with the other subscales, supporting the convergent and discriminant validity of the Chinese version of the Hospital Autonomy Questionnaire (CVHAQ). The model fit indices were all acceptable: χ2/df=1.73, Goodness of Fit Index (GFI) = 0.93, Adjusted Goodness of Fit Index (AGFI) = 0.91, Non-Normed Fit Index (NNFI) = 0.96, Comparative Fit Index (CFI) = 0.97, Root Mean Square Error of Approximation (RMSEA) = 0.04, Standardised Root Mean Square Residual (SRMR) = 0.07. Conclusions This study demonstrated the reliability and validity of a CVHAQ and provides a quantitative method for the assessment of hospital autonomy. PMID:26911587
Ghavidel Parsa, Banafsheh; Amir Maafi, Alireza; Haghdoost, Afrooz; Arabi, Yasaman; Khojamli, Monire; Chatrnour, Gelayol; Bidari, Ali
2014-02-01
The Revised Fibromyalgia Impact Questionnaire (FIQR), an updated version of the Fibromyalgia Impact Questionnaire (FIQ) achieved a better balance among different domains (i.e., function, overall impact, and symptom severity) and attempts to address the limitations of FIQ. As there is no Persian version of the FIQR available, we aimed to investigate the validity and reliability of a Persian translation of the FIQR in Iranian patients. After translating the FIQR into Persian, it was administered to 77 female patients with fibromyalgia syndrome. All of the patients filled out the questionnaire together with a Persian version of the FIQ, short form-12 (SF-12). The tender-point count was also calculated. One week later, FM patients filled out the Persian FIQR at their second visit. Reliability was analyzed by internal consistency and reproducibility including Cronbach's α coefficient and intra-class correlation coefficient. Construct validity was evaluated by Spearman's correlation coefficient and Pearson's correlation coefficient. Statistical analysis was performed using SPSS for Windows version 17.0. All patients included in this study were female, and the mean age was 38.23 ± 10.68 years. The total scores of the FIQR and FIQ were 49.77 ± 18.27 and 54.05 ± 14.00 that were closely correlated (r = 0.63, p < 0.01), and each of the three domains of the Persian FIQR was also correlated well with the three related FIQ domains (r = 0.36-0.63, p < 0.01). Also some significant inverse correlations of FIQR with quality-of-life (assessed by SF-12) domains and items were found. Cronbach's α was 0.87 for FIQR in the first visit. The Persian FIQR showed adequate reliability and validity. This instrument can be used in the clinical evaluation of Iranian patients with fibromyalgia.
Development and validation of sustainability criteria of administrative green schools in Iran.
Meiboudi, Hossein; Lahijanian, Akramolmolok; Shobeiri, Seyed Mohammad; Jozi, Seyed Ali; Azizinezhad, Reza
2017-07-15
Environmental responsibility in school has led to the emergence of a variety of criteria to administer green schools' contributions to sustainability. Sustainability criteria of administrative green schools need validity, reliability and norms. The aim of the current study was to develop and validate assessment criteria for green schools in Iran based on the role of academia. A national survey was conducted to obtain data on sustainability criteria initiatives for green schools and the Iranian profile was defined. An initial pool of 71 items was generated and after its first edition, 63 items were selected to comprise the sustainability criteria. Engineering-architectural and behavioral aspects of this sustainability criteria were evaluated through a sample of 1218 graduate students with environmental degrees from Iran's universities. Exploratory factor analysis using principal components and promax rotation method showed that these 9 criteria have simple structures and are consistent with the theoretical framework. The reliability coefficients of subscales ranged between 0.62 (participation) and 0.84 (building location and position). The study's survey of correlation coefficients between items and subscales illustrated that those coefficients varied between 0.24 and 0.68. Copyright © 2017 Elsevier Ltd. All rights reserved.
Quantified Risk Ranking Model for Condition-Based Risk and Reliability Centered Maintenance
NASA Astrophysics Data System (ADS)
Chattopadhyaya, Pradip Kumar; Basu, Sushil Kumar; Majumdar, Manik Chandra
2017-06-01
In the recent past, risk and reliability centered maintenance (RRCM) framework is introduced with a shift in the methodological focus from reliability and probabilities (expected values) to reliability, uncertainty and risk. In this paper authors explain a novel methodology for risk quantification and ranking the critical items for prioritizing the maintenance actions on the basis of condition-based risk and reliability centered maintenance (CBRRCM). The critical items are identified through criticality analysis of RPN values of items of a system and the maintenance significant precipitating factors (MSPF) of items are evaluated. The criticality of risk is assessed using three risk coefficients. The likelihood risk coefficient treats the probability as a fuzzy number. The abstract risk coefficient deduces risk influenced by uncertainty, sensitivity besides other factors. The third risk coefficient is called hazardous risk coefficient, which is due to anticipated hazards which may occur in the future and the risk is deduced from criteria of consequences on safety, environment, maintenance and economic risks with corresponding cost for consequences. The characteristic values of all the three risk coefficients are obtained with a particular test. With few more tests on the system, the values may change significantly within controlling range of each coefficient, hence `random number simulation' is resorted to obtain one distinctive value for each coefficient. The risk coefficients are statistically added to obtain final risk coefficient of each critical item and then the final rankings of critical items are estimated. The prioritization in ranking of critical items using the developed mathematical model for risk assessment shall be useful in optimization of financial losses and timing of maintenance actions.
Meta-Analysis of Coefficient Alpha
ERIC Educational Resources Information Center
Rodriguez, Michael C.; Maeda, Yukiko
2006-01-01
The meta-analysis of coefficient alpha across many studies is becoming more common in psychology by a methodology labeled reliability generalization. Existing reliability generalization studies have not used the sampling distribution of coefficient alpha for precision weighting and other common meta-analytic procedures. A framework is provided for…
ERIC Educational Resources Information Center
Morley, Donald D.
2012-01-01
The vast majority of the research on student evaluation of instruction has assessed the reliability of groups of courses and yielded either a single reliability coefficient for the entire group, or grouped reliability coefficients for each student evaluation of teaching (SET) item. This manuscript argues that these practices constitute a form of…
Feng, Lifen; Zhang, Yingfen; Chen, Ruoqing; Hao, Yuantao
2011-08-07
Health-related quality of life (HRQOL) has been recognized as an important health outcome measurement for pediatric patients. One of the most promising instruments in measuring pediatric HRQOL emerged in recent years is the Pediatric Quality of Life Inventory (PedsQL™). The PedsQL™ 3.0 Asthma Module, one of the PedsQL™disease-specific scales, was designed to measure HRQOL dimensions specifically tailored for pediatric asthma. The present study is aimed to evaluate the psychometric properties of the Chinese version of the PedsQL™ 3.0 Asthma Module. The PedsQL™ 3.0 Asthma Module was translated into Chinese following the PedsQL™ Measurement Model Translation Methodology. The Chinese version scale was administered to 204 children with asthma and 337 parents of children with asthma from four Triple A hospitals. The psychometric properties were then evaluated. The percentage of missing value for each item of the scale ranged from 0.00% to 8.31%. All child self-report subscales and parent proxy-report subscales approached or exceeded the minimum reliability standard of 0.70 for alpha coefficient, except 3 subscales of Young Child (aged 5-7) self-report (alphas ranging from 0.59 to 0.68). Test-retest reliability was satisfactory with intraclass correlation coefficients (ICCs) which exceeded the recommended standard of 0.80 in all subscales. Correlation coefficients between items and their hypothesized subscales were higher than those with other subscales. The PedsQL™ 3.0 Asthma Module distinguished between outpatients and inpatients. Patients with mild asthma reported higher scores than those with moderate/severe asthma in majority of subscales. The intercorrelations among the PedsQL™ 3.0 Asthma Module subscales and the PedsQL™ 4.0 Generic Core Scales were in medium to large effect size. The child self-report scores were consistent with the parent proxy-report scores. The Chinese version of the PedsQL™ 3.0 Asthma Module has acceptable psychometric properties, except the internal consistency reliability for Young Child (aged 5-7) self-report. Further studies should be focused on testing responsiveness of the Chinese version scale in longitudinal studies, evaluating the reliability and validity of the scale for the patients with severe asthma or teens independently, and assessing HRQOL of children with asthma in other areas.
Tsuji, Naoko; Kakee, Naoko; Ishida, Yasushi; Asami, Keiko; Tabuchi, Ken; Nakadate, Hisaya; Iwai, Tsuyako; Maeda, Miho; Okamura, Jun; Kazama, Takuro; Terao, Yoko; Ohyama, Wataru; Yuza, Yuki; Kaneko, Takashi; Manabe, Atsushi; Kobayashi, Kyoko; Kamibeppu, Kiyoko; Matsushima, Eisuke
2011-04-10
The PedsQL 3.0 Cancer Module is a widely used instrument to measure pediatric cancer specific health-related quality of life (HRQOL) for children aged 2 to 18 years. We developed the Japanese version of the PedsQL Cancer Module and investigated its reliability and validity among Japanese children and their parents. Participants were 212 children with cancer and 253 of their parents. Reliability was determined by internal consistency using Cronbach's coefficient alpha and test-retest reliability using intra-class correlation coefficient (ICC). Validity was assessed through factor validity, convergent and discriminant validity, concurrent validity, and clinical validity. Factor validity was examined by exploratory factor analysis. Convergent and discriminant validity were examined by multitrait scaling analysis. Concurrent validity was assessed using Spearman's correlation coefficients between the Cancer Module and Generic Core Scales, and the comparison of the scores of child self-reports with those of other self-rating depression scales for children. Clinical validity was assessed by comparing the on- and off- treatment scores using Kruskal-Wallis and Mann-Whitney U tests. Cronbach's coefficient alpha was over 0.70 for the total scale and over 0.60 for each subscale by age except for the 'pain and hurt' subscale for children aged 5 to 7 years. For test-retest reliability, the ICC exceeded 0.70 for the total scale for each age. Exploratory factor analysis demonstrated sufficient factorial validity. Multitrait scaling analysis showed high success rates. Strong correlations were found between the reports by children and their parents, and the scores of the Cancer Module and the Generic Core Scales except for 'treatment anxiety' subscales for child reports. The Depression Self-Rating Scale for Children (DSRS-C) scores were significantly correlated with emotional domains and the total score of the cancer module. Children who had been off treatment over 12 months demonstrated significantly higher scores than those on treatment. The results demonstrate the reliability and validity of the Japanese version of the PedsQL Cancer Module among Japanese children.
One Iota Fills the Quota: A Paradox in Multifacet Reliability Coefficients.
ERIC Educational Resources Information Center
Conger, Anthony J.
1983-01-01
A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
Arunakul, Marut; Arunakul, Preeyaphan; Suesiritumrong, Chakhrist; Angthong, Chayanin; Chernchujit, Bancha
2015-06-01
Self-administered questionnaires have become an important aspect for clinical outcome assessment of foot and ankle-related problems. The Foot and Ankle Ability Measure (FAAM) subjective form is a region-specific questionnaire that is widely used and has sufficient validity and reliability from previous studies. Translate the original English version of FAAM into a Thai version and evaluate the validity and reliability of Thai FAAM in patients with foot and ankle-related problems. The FAAM subjective form was translated into Thai using forward-backward translation protocol. Afterward, reliability and validity were tested. Following responses from 60 consecutive patients on two questionnaires, the Thai FAAM subjective form and the short form (SF)-36, were used. The validity was tested by correlating the scores from both questionnaires. The reliability was adopted by measuring the test-retest reliability and internal consistency. Thai FAAM score including activity of daily life (ADL) and Sport subscale demonstrated the sufficient correlations with physical functioning (PF) and physical composite score (PCS) domains of the SF-36 (statistically significant with p < 0.001 level and ≥ 0.5 values). The result of reliability revealed highly intra-class correlation coefficient as 0.8 and 0.77, respectively from test-retest study. The internal consistency was strong (Cronbach alpha = 0.94 and 0.88, respectively). The Thai version of FAAM subjective form retained the characteristics of the original version and has proved a reliable evaluation instrument for patients with foot and ankle-related problems.
Babbin, Steven F.; Yin, Hui-Qing; Rossi, Joseph S.; Redding, Colleen A.; Paiva, Andrea L.; Velicer, Wayne F.
2015-01-01
The Self-Efficacy Scale for Sun Protection consists of two correlated factors with three items each for Sunscreen Use and Avoidance. This study evaluated two crucial psychometric assumptions, factorial invariance and scale reliability, with a sample of adults (N = 1356) participating in a computer-tailored, population-based intervention study. A measure has factorial invariance when the model is the same across subgroups. Three levels of invariance were tested, from least to most restrictive: (1) Configural Invariance (nonzero factor loadings unconstrained); (2) Pattern Identity Invariance (equal factor loadings); and (3) Strong Factorial Invariance (equal factor loadings and measurement errors). Strong Factorial Invariance was a good fit for the model across seven grouping variables: age, education, ethnicity, gender, race, skin tone, and Stage of Change for Sun Protection. Internal consistency coefficient Alpha and factor rho scale reliability, respectively, were .84 and .86 for Sunscreen Use, .68 and .70 for Avoidance, and .78 and .78 for the global (total) scale. The psychometric evidence demonstrates strong empirical support that the scale is consistent, has internal validity, and can be used to assess population-based adult samples. PMID:26457203
Frame-of-reference training for simulation-based intraoperative communication assessment.
Gardner, Aimee K; Russo, Michael A; Jabbour, Ibrahim I; Kosemund, Matthew; Scott, Daniel J
2016-09-01
The purpose of this study was to examine the impact of frame-of-reference (FOR) training on assessments of intraoperative communication skills and identify areas of need to inform curricular efforts. Simulation instructors (M.D., Ph.D., Research Fellow, Simulation Technician) underwent a 2-hour FOR training session with the operating room communication instrument. They then independently rated communication skills of 19 PGY1s who participated in a team-based simulation. Residents completed self-assessments via video review of the scenario. Intraclass correlation coefficients were used to examine inter-rater reliability. Relationships between trained raters and resident scores were assessed with Pearson correlation coefficients and paired sample t tests. Inter-reliability after FOR training was .91. The correlation between trained rater scores and resident evaluations was nonsignificant. Residents significantly underestimated their intraoperative communication skills (P < .05). Use of names, closed loop communication, and sharing information with team members demonstrated consistently low ratings among all residents. These findings reveal that a number of individuals can be trained to reliably rate resident intraoperative communication performance and that residents tend to under-rate their communication skills. Copyright © 2016 Elsevier Inc. All rights reserved.
Mercier, Catherine; Roche, Sylvain; Gaillard, Ségolène; Kassai, Behrouz; Arzimanoglou, Alexis; Herbillon, Vania; Roy, Pascal; Rheims, Sylvain
2016-05-01
Attention deficit hyperactivity disorder (ADHD) is a well-known comorbidity in children with epilepsy. In English-speaking countries, the scores of the original ADHD-rating scale IV are currently used as main outcomes in various clinical trials in children with epilepsy. In French-speaking countries, several French versions are in use though none has been fully validated yet. We sought here for a partial validation of a French version of the ADHD-RS IV regarding construct validity, internal consistency (i.e., scale reliability), item reliability, and responsiveness in a group of French children with ADHD and epilepsy. The study involved 167 children aged 6-15years in 10 French neuropediatric units. The factorial structure and item reliability were assessed with a confirmatory factorial analysis for ordered categorical variables. The dimensions' internal consistency was assessed with Guttman's lambda 6 coefficient. The responsiveness was assessed by the change in score under methylphenidate and in comparison with a control group. The results confirmed the original two-dimensional factorial structure (inattention, hyperactivity/impulsivity) and showed a satisfactory reliability of most items, a good dimension internal consistency, and a good responsiveness of the total score and the two subscores. The studied French version of the ADHD-RS IV is thus validated regarding construct validity, reliability, and responsiveness. It can now be used in French-speaking countries in clinical trials of treatments involving children with ADHD and epilepsy. The full validation requires further investigations. Copyright © 2016 Elsevier Inc. All rights reserved.
Sari Motlagh, Reza; Hajebrahimi, Sakineh; Sadeghi-Bazargani, Homayoun; Joodi Tutunsaz, Javad
2015-05-01
Overactive bladder syndrome is a common syndrome in the world in both men and women. Correct diagnosis and accurate measurement of symptoms severity and also quality of life of patients is necessary to ensure proper treatment and to facilitate sound relationships among patients, researchers and doctors. The International Consultation on Incontinence Questionnaire in Over Active Bladder (ICIQ-OAB) questionnaire is a concise and strong tool to evaluate the symptoms of OAB and their effects on patients' quality of life and treatment results. The objective of this study was to translate and validate a simple and strong tool that could be used in clinics and research. First, the original British English questionnaire was translated into Persian by two bilingual and originally Persian-speaking translators. Then the Persian version was back translated to English and a native English speaker studied and compared the questionnaire with the original version. At the end, the translated and corrected Persian version was finalized by a research team. Content validity of the items and ensuring that the questions could convey the main concept to readers was assessed through Modified Content Validity Index (MCVI). Reliability was calculated by Cronbach's α coefficient. Internal Consistency of the questionnaire with the calculation of Kendall correlation coefficient were evaluated by performing test-retest in 50 participants. The modified content validity index was > 0.78 for all of the questions. Cronbach's α coefficient was calculated 0.76 for all of the participants. Kendall correlation coefficient was calculated for test-re-test assessment 0.66. Both of which indicates the reliability of this questionnaire. Persian version of ICIQ-OAB questionnaire is a simple and strong tool for research, treatment and screening purposes. © 2014 Wiley Publishing Asia Pty Ltd.
Bergeron, Lise; Berthiaume, Claude; St-Georges, Marie; Piché, Geneviève; Smolla, Nicole
2013-08-01
As no single informant can be considered the gold standard of child psychopathology, interviewing of children regarding their own symptoms is necessary. Our study focused on the reliability, validity, and clinical use of the Dominic Interactive (DI), a multimedia self-report screen to assess symptoms for the most frequent Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision, mental disorders in school-aged children. A sample of 585 children aged 6 to 11 years from the community and psychiatric clinics was used to analyze the internal consistency, the test-retest estimate of reliability, and the criterion-related validity of the DI against the referral status. In addition, cross-informant correlation coefficients between this instrument (child report) and the Child Symptom Inventory (parent report) were explored in a subsample of 292 participants. For the total sample, Cronbach alpha coefficients ranged from 0.63 to 0.91. Test-retest kappas varied from 0.42 to 0.62 for categories based on cut-off points, except for specific phobias. Intraclass correlation coefficients ranged from 0.70 to 0.81 for symptom scales. The DI discriminated between referred and non-referred children in psychiatric clinics for all symptom scales. Significant cross-informant correlation coefficients were higher for the externalizing symptoms (0.35 to 0.48) than the internalizing symptoms (0.14 to 0.27). Findings of our study reasonably support adequate psychometric properties of the DI. This instrument offers a developmentally sensitive screening method to obtain unique information from young children about their mental health problems in front-line services, psychiatric clinics, and research settings.
Cohen, Harvey J.; Popat, Rita A.; Halamek, Louis P.
2015-01-01
Abstract Background: Interventions to improve pediatric trainee education in palliative care have been limited by a lack of reliable and valid tools for measuring effectiveness. Objective: We developed a questionnaire to measure pediatric fellows' self-efficacy (comfort), knowledge, and perceived adequacy of prior medical education. We measured the questionnaire's reliability and validity. Methods: The questionnaire contains questions regarding self-efficacy (23), knowledge (10), fellow's perceived adequacy of prior medical education (6), and demographics. The survey was developed with palliative care experts, and sent to fellows in U.S. pediatric cardiology, critical care, hematology/ oncology, and neonatal-perinatal medicine programs. Measures of reliability, internal consistency, and validity were calculated. Results: One hundred forty-seven fellows completed the survey at test and retest. The self-efficacy and medical education questionnaires showed high internal consistency of 0.95 and 0.84. The test-retest reliability for the Self-Efficacy Summary Score, measured by intraclass correlation coefficient (ICC) and weighted kappa, was 0.78 (item range 0.44–0.81) and 0.61 (item range 0.36–0.70), respectively. For the Adequacy of Medical Education Summary Score, ICC was 0.85 (item range 0.6–0.78) and weighted kappa was 0.63 (item range 0.47–0.62). Validity coefficients for these two questionnaires were 0.88 and 0.92. Fellows answered a mean of 8.8/10 knowledge questions correctly; percentage agreement ranged from 65% to 99%. Conclusions: This questionnaire is capable of assessing self-efficacy and fellow-perceived adequacy of their prior palliative care training. We recommend use of this tool for fellowship programs seeking to evaluate fellow education in palliative care, or for research studies assessing the effectiveness of a palliative care educational intervention. PMID:26185912
Devoogdt, Nele; De Groef, An; Hendrickx, Ad; Damstra, Robert; Christiaansen, Anke; Geraerts, Inge; Vervloesem, Nele; Vergote, Ignace; Van Kampen, Marijke
2014-05-01
Patients may develop primary (congenital) or secondary (acquired) lymphedema, causing significant physical and psychosocial problems. To plan treatment for lymphedema and monitor a patient's progress, swelling, and problems in functioning associated with lymphedema development should be assessed at baseline and follow-up. The purpose of this study was to investigate the reliability (test-retest, internal consistency, and measurement variability) and validity (content and construct) of data obtained with the Lymphoedema Functioning, Disability and Health Questionnaire for Lower Limb Lymphoedema (Lymph-ICF-LL). This was a multicenter, cross-sectional study. The Lymph-ICF-LL is a descriptive, evaluative tool containing 28 questions about impairments in function, activity limitations, and participation restrictions in patients with lower limb lymphedema. The questionnaire has 5 domains: physical function, mental function, general tasks/household activities, mobility activities, and life domains/social life. The reliability and validity of the Lymph-ICF-LL were examined in 30 participants with objective lower limb lymphedema. Intraclass correlation coefficients for test-retest reliability ranged from .69 to .94, and Cronbach alpha coefficients for internal consistency ranged from .82 to .97. Measurement variability was acceptable (standard error of measurement=5.9-12.6). Content validity was good because all questions were understandable for 93% of participants, the scoring system (visual analog scale) was clear, and the questionnaire was comprehensive for 90% of participants. Construct validity was good. All hypotheses for assessing convergent validity and divergent validity were accepted. The known-groups validity and responsiveness of the Dutch Lymph-ICF-LL and the cross-cultural validity of the English version of the Lymph-ICF-LL were not investigated. The Lymph-ICF-LL is a Dutch questionnaire with evidence of reliability and validity for assessing impairments in function, activity limitations, and participation restrictions in people with primary or secondary lower limb lymphedema.
Page, Stephen J; Hade, Erinn; Persch, Andrew
2015-01-01
There remains a need for a quickly administered, stroke-specific, bedside measure of active wrist and finger movement for the expanding stroke population. The wrist stability and hand mobility scales of the upper extremity Fugl-Meyer Assessment (w/h UE FM) constitute a valid, reliable measure of paretic UE impairment in patients with active wrist and finger movement. The aim of this study was to determine performance on the w/h UE FM in a stable cohort of survivors of stroke with only palpable movement in their paretic wrist flexors. A single-center cohort study was conducted. Thirty-two individuals exhibiting stable, moderate upper extremity hemiparesis (15 male, 17 female; mean age=56.6 years, SD=10.1; mean time since stroke=4.6 years, SD=5.8) participated in the study, which was conducted at an outpatient rehabilitation clinic in the midwestern United States. The w/h UE FM and Action Research Arm Test (ARAT) were administered twice. Intraclass correlation coefficients (ICCs), Cronbach alpha, and ordinal alpha were computed to determine reliability, and Spearman rank correlation coefficients and Bland-Altman plots were computed to establish validity. Intraclass correlation coefficients for the w/h UE FM and ARAT were .95 and .99, respectively. The w/h UE FM intrarater reliability and internal consistency were greater than .80, and concurrent validity was greater than .70. This also was the first stroke rehabilitative study to apply ordinal alpha to examine internal consistency values, revealing w/h UE FM levels greater than .85. Concurrent validity findings were corroborated by Bland-Altman plots. It appears that the w/h UE FM is a promising tool to measure distal upper extremity movement in patients with little active paretic wrist and finger movement. This finding widens the segment of patients on whom the w/h UE FM can be effectively used and addresses a gap, as commonly used measures necessitate active distal upper extremity movement. © 2015 American Physical Therapy Association.
Ghisi, Gabriela Lima de Melo; Dos Santos, Rafaella Zulianello; Bonin, Christiani Batista Decker; Roussenq, Suellen; Grace, Sherry L; Oh, Paul; Benetti, Magnus
2014-01-01
To translate, culturally adapt and psychometrically validate the Information Needs in Cardiac Rehabilitation (INCR) tool to Portuguese. The identification of information needs is considered the first step to improve knowledge that ultimately could improve health outcomes. The Portuguese version generated was tested in 300 cardiac rehabilitation patients (CR) (34% women; mean age = 61.3 ± 2.1 years old). Test-retest reliability was assessed using intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and the criterion validity was assessed with regard to patients' education and duration in CR. All 9 subscales were considered internally consistent (á > 0.7). Significant differences between mean total needs and educational level (p < 0.05) and duration in CR (p = 0.03) supported criterion validity. The overall mean (4.6 ± 0.4), as well as the means of the 9 subscales were high (emergency/safety was the greatest need). The Portuguese INCR was demonstrated to have sufficient reliability, consistency and validity. Copyright © 2014 Elsevier Inc. All rights reserved.
Lei, Pingguang; Lei, Guanghe; Tian, Jianjun; Zhou, Zengfen; Zhao, Miao; Wan, Chonghua
2014-10-01
This paper is aimed to develop the irritable bowel syndrome (IBS) scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-IBS) by the modular approach and validate it by both classical test theory and generalizability theory. The QLICD-IBS was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interview, and quantitative statistical procedures. One hundred twelve inpatients with IBS were used to provide the data measuring QOL three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability, and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t tests and also G studies and D studies of generalizability theory analysis. Multi-trait scaling analysis, correlation, and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. Test-retest reliability coefficients (Pearson r and intra-class correlation (ICC)) for the overall score and all domains were higher than 0.80; the internal consistency α for all domains at two measurements were higher than 0.70 except for the social domain (0.55 and 0.67, respectively). The overall score and scores for all domains/facets had statistically significant changes after treatments with moderate or higher effect size standardized response mean (SRM) ranging from 0.72 to 1.02 at domain levels. G coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-IBS has good validity, reliability, responsiveness, and some highlights and can be used as the quality of life instrument for patients with IBS.
Lambron, Julien; Rakotonjanahary, Josué; Loisel, Didier; Frampas, Eric; De Carli, Emilie; Delion, Matthieu; Rialland, Xavier; Toulgoat, Frédérique
2016-02-01
Magnetic resonance (MR) images from children with optic pathway glioma (OPG) are complex. We initiated this study to evaluate the accuracy of MR imaging (MRI) interpretation and to propose a simple and reproducible imaging classification for MRI. We randomly selected 140 MRIs from among 510 MRIs performed on 104 children diagnosed with OPG in France from 1990 to 2004. These images were reviewed independently by three radiologists (F.T., 15 years of experience in neuroradiology; D.L., 25 years of experience in pediatric radiology; and J.L., 3 years of experience in radiology) using a classification derived from the Dodge and modified Dodge classifications. Intra- and interobserver reliabilities were assessed using the Bland-Altman method and the kappa coefficient. These reviews allowed the definition of reliable criteria for MRI interpretation. The reviews showed intraobserver variability and large discrepancies among the three radiologists (kappa coefficient varying from 0.11 to 1). These variabilities were too large for the interpretation to be considered reproducible over time or among observers. A consensual analysis, taking into account all observed variabilities, allowed the development of a definitive interpretation protocol. Using this revised protocol, we observed consistent intra- and interobserver results (kappa coefficient varying from 0.56 to 1). The mean interobserver difference for the solid portion of the tumor with contrast enhancement was 0.8 cm(3) (limits of agreement = -16 to 17). We propose simple and precise rules for improving the accuracy and reliability of MRI interpretation for children with OPG. Further studies will be necessary to investigate the possible prognostic value of this approach.
Flosadottir, Vala; Roos, Ewa M; Ageberg, Eva
2017-09-01
The Activity Rating Scale (ARS) for disorders of the knee evaluates the level of activity by the frequency of participation in 4 separate activities with high demands on knee function, with a score ranging from 0 (none) to 16 (pivoting activities 4 times/wk). To translate and cross-culturally adapt the ARS into Swedish and to assess measurement properties of the Swedish version of the ARS. Cohort study (diagnosis); Level of evidence, 2. The COSMIN guidelines were followed. Participants (N = 100 [55 women]; mean age, 27 years) who were undergoing rehabilitation for a knee injury completed the ARS twice for test-retest reliability. The Knee injury and Osteoarthritis Outcome Score (KOOS), Tegner Activity Scale (TAS), and modernized Saltin-Grimby Physical Activity Level Scale (SGPALS) were administered at baseline to validate the ARS. Construct validity and responsiveness of the ARS were evaluated by testing predefined hypotheses regarding correlations between the ARS, KOOS, TAS, and SGPALS. The Cronbach alpha, intraclass correlation coefficients, absolute reliability, standard error of measurement, smallest detectable change, and Spearman rank-order correlation coefficients were calculated. The ARS showed good internal consistency (α ≈ 0.96), good test-retest reliability (intraclass correlation coefficient >0.9), and no systematic bias between measurements. The standard error of measurement was less than 2 points, and the smallest detectable change was less than 1 point at the group level and less than 5 points at the individual level. More than 75% of the hypotheses were confirmed, indicating good construct validity and good responsiveness of the ARS. The Swedish version of the ARS is valid, reliable, and responsive for evaluating the level of activity based on the frequency of participation in high-demand knee sports activities in young adults with a knee injury.
Azari, Nadia; Soleimani, Farin; Vameghi, Roshanak; Sajedi, Firoozeh; Shahshahani, Soheila; Karimi, Hossein; Kraskian, Adis; Shahrokhi, Amin; Teymouri, Robab; Gharib, Masoud
2017-01-01
Bayley Scales of infant & toddler development is a well-known diagnostic developmental assessment tool for children aged 1-42 months. Our aim was investigating the validity & reliability of this scale in Persian speaking children. The method was descriptive-analytic. Translation- back translation and cultural adaptation was done. Content & face validity of translated scale was determined by experts' opinions. Overall, 403 children aged 1 to 42 months were recruited from health centers of Tehran, during years of 2013-2014 for developmental assessment in cognitive, communicative (receptive & expressive) and motor (fine & gross) domains. Reliability of scale was calculated through three methods; internal consistency using Cronbach's alpha coefficient, test-retest and interrater methods. Construct validity was calculated using factor analysis and comparison of the mean scores methods. Cultural and linguistic changes were made in items of all domains especially on communication subscale. Content and face validity of the test were approved by experts' opinions. Cronbach's alpha coefficient was above 0.74 in all domains. Pearson correlation coefficient in various domains, were ≥ 0.982 in test retest method, and ≥0.993 in inter-rater method. Construct validity of the test was approved by factor analysis. Moreover, the mean scores for the different age groups were compared and statistically significant differences were observed between mean scores of different age groups, that confirms validity of the test. The Bayley Scales of Infant and Toddler Development is a valid and reliable tool for child developmental assessment in Persian language children.
Eustachian Tube Mucosal Inflammation Scale Validation Based on Digital Video Images.
Kivekäs, Ilkka; Pöyhönen, Leena; Aarnisalo, Antti; Rautiainen, Markus; Poe, Dennis
2015-12-01
The most common cause for Eustachian tube dilatory dysfunction is mucosal inflammation. The aim of this study was to validate a scale for Eustachian tube mucosal inflammation, based on digital video clips obtained during diagnostic rigid endoscopy. A previously described four-step scale for grading the degree of inflammation of the mucosa of the Eustachian tube lumen was used for this validation study. A tutorial for use of the scale, including static images and 10 second video clips, was presented to 26 clinicians with various levels of experience. Each clinician then reviewed 35 short digital video samples of Eustachian tubes from patients and rated the degree of inflammation. A subset of the clinicians performed a second rating of the same video clips at a subsequent time. Statistical analysis of the ratings provided inter- and intrarater reliability scores. Twenty-six clinicians with various levels of experience rated a total of 35 videos. Thirteen clinicians rated the videos twice. The overall correlation coefficient for the rating of inflammation severity was relatively good (0.74, 95% confidence interval, 0.72-0.76). The intralevel correlation coefficient for intrarater reliability was high (0.86). For those who rated videos twice, the intralevel correlation coefficient improved after the first rating (0.73, to 0.76), but improvement was not statistically significant. The inflammation scale used for Eustachian tube mucosal inflammation is reliable and this scale can be used with a high level of consistency by clinicians with various levels of experience.
Measuring the Cobb angle with the iPhone in kyphoses: a reliability study.
Jacquot, Frederic; Charpentier, Axelle; Khelifi, Sofiane; Gastambide, Daniel; Rigal, Regis; Sautet, Alain
2012-08-01
Smartphones have gained widespread use in the healthcare field to fulfill a variety of tasks. We developed a small iPhone application to take advantage of the built-in position sensor to measure angles in a variety of spinal deformities. We present a reliability study of this tool in measuring kyphotic angles. Radiographs taken from 20 different patients' charts were presented to a panel of six operators at two different times. Radiographs were measured with the protractor and the iPhone application and statistical analysis was applied to measure intraclass correlation coefficients between both measurement methods, and to measure intra- and interobserver reliability The intraclass correlation coefficient calculated between methods (i.e. CobbMeter application on the iPhone versus standard method with the protractor) was 0.963 for all measures, indicating excellent correlation was obtained between the CobbMeter application and the standard method. The interobserver correlation coefficient was 0.965. The intraobserver ICC was 0.977, indicating excellent reproductibility of measurements at different times for all operators. The interobserver ICC between fellowship trained senior surgeons and general orthopaedic residents was 0.989. Consistently, the ICC for intraobserver and interobserver correlations was higher with the CobbMeter application than with the regular protractor method. This difference was not statistically significant. Measuring kyphotic angles with the iPhone application appears to be a valid procedure and is in no way inferior to the standard way of measuring the Cobb angle in kyphotic deformities.
Validation of an adapted arabic version of fibromyalgia syndrome impact questionnaire.
El-Naby, Mai Abd; Hefny, Mohamed Ahmed; Fahim, Ayman Ekram; Awadalla, Magdy Ahmed
2013-10-01
Fibromyalgia (FM) is the most common chronic pain syndrome encountered in medical practice, affecting females more than males, and the estimated prevalence of FM in Egypt is 1.3 %. The aim was to translate and adapt the Fibromyalgia Impact Questionnaire (FIQ) into Arabic and assess reliability and validity. The Arabic version of Fibromyalgia Impact Questionnaire (FIQ-A) was adapted following the forward/backward translation approach. Fifty-one female patients with FM were studied to assess psychometric properties of the FIQ-A. Reliability was analyzed by the correlation coefficient between test and retest. Internal consistency was checked by the Cronbach's alpha coefficient. Construct validity was assessed comparing FIQ-A with Health Assessment Questionnaire (HAQ), Health Assessment Questionnaire of Fibromyalgia (FHAQ), The Medical Outcome Survey Short-Form-36 (SF-36), and the Total Visual Analog Scale (TVAS) for FM symptom, and feasibility was assessed by the time taken in completing the FIQ-A and the proportion of patients completed the questionnaire. Patients studied were 33.2 ± 9.8 years old. Translation was concordant. Adaptation affected 4 sub-items of physical function. Test-retest correlation coefficient was 0.89 for total FIQ-A and Cronbach's alpha was 0.76. Excellent to good statistically significant correlations (p < 0.05) were found between the FIQ-A items and HAQ, FHAQ, and SF-36. The FIQ-A is a reliable, valid for measuring health status and physical function in Arabic-speaking FM patients.
Tarescavage, Anthony M; Scheman, Judith; Ben-Porath, Yossef S
2015-06-01
The purpose of the current study was to investigate the reliability and concurrent validity of Minnesota Multiphasic Personality Inventory (MMPI)-2-Restructured Form (2-RF) (Ben-Porath & Tellegen, 2008/2011) scores in a sample of 811 chronic low back pain patients (346 males, 529 females) beginning treatment in a short-term interdisciplinary pain rehabilitation program. We calculated internal consistency coefficients, mean-item correlations, and SEM for all substantive scales, as well as zero-order correlations with collateral medical record information and self-report testing. Results indicated reliability and validity for most of the MMPI-2-RF substantive scales. Implications of these findings and limitations of this study are discussed. (c) 2015 APA, all rights reserved).
Vaingankar, Janhavi Ajit; Subramaniam, Mythily; Abdin, Edimansyah; Picco, Louisa; Chua, Boon Yiang; Eng, Goi Khia; Sambasivam, Rajeswari; Shafie, Saleha; Zhang, Yunjue; Chong, Siow Ann
2014-06-01
The 47-item positive mental health (PMH) instrument measures the level of PMH in multiethnic adult Asian populations. This study aimed to (1) develop a short PMH instrument and (2) establish its validity and reliability among the adult Singapore population. Two separate studies were conducted among adult community-dwelling Singapore residents of Chinese, Malay or Indian ethnicity where participants completed self-administered questionnaires. In the first study, secondary data analysis was conducted using confirmatory factor analysis (CFA) to shorten the PMH instrument. In the second study, the newly developed short PMH instrument and other scales were administered to 201 residents to establish its factor structure, validity and reliability. A 20-item short PMH instrument fulfilling a higher-order six-factor structure was developed following secondary analysis. The mean age of the participants in the second study was 41 years and about 53% were women. One item with poor factor loading was further removed to generate a 19-item version of the PMH instrument. CFA demonstrated a first-order six-factor model of the short PMH instrument. The PMH-19 instrument and its subscales fulfilled criterion validity hypotheses. Internal consistency and test-retest reliability of the PMH-19 instrument were high (Cronbach's α coefficient = 0.87; intraclass correlation coefficient = 0.93, respectively). The 19-item PMH instrument is multidimensional, valid and reliable, and most importantly, with its reduced administration time, the short PMH instrument can be used to measure and evaluate PMH in Asian communities.
Kim, Chung-Il; Han, Dong-Wook; Park, Il-Hyeok
2014-04-01
The Test of Gross Motor Development-II (TGMD-II) is a frequently used assessment tool for measuring motor ability. The purpose of this study is to investigate the reliability and validity of TGMD-II's weighting scores (by comparing pre-weighted TGMD-II scores with post ones) as well as examine applicability of the TGMD-II on Korean preschool children. A total of 121 Korean children (three kindergartens) participated in this study. There were 65 preschoolers who were 5-years-old (37 boys and 28 girls) and 56 preschoolers who were 6-years-old (34 boys and 22 girls). For internal consistency, reliability, and construct validity, only one researcher evaluated all of the children using the TGMD-II in the following areas: running; galloping; sliding; hopping; leaping; horizontal jumping; overhand throwing; underhand rolling; striking a stationary ball; stationary dribbling; kicking; and catching. For concurrent validity, the evaluator measured physical fitness (strength, flexibility, power, agility, endurance, and balance). The key findings were as follows: first, the reliability coefficient and the validity coefficient between pre-weighted and post-weighted TGMD-II scores were quite similar. Second, the research showed adequate reliability and validity of the TGMD-II for Korean preschool children. The TGMD-II is a proper instrument to test Korean children's motor development. Yet, applying relative weighting on the TGMD-II should be a point of consideration. Copyright © 2014 Elsevier Ltd. All rights reserved.
Reliability and Validity of the PAQ-C Questionnaire to Assess Physical Activity in Children.
Benítez-Porres, Javier; López-Fernández, Iván; Raya, Juan Francisco; Álvarez Carnero, Sabrina; Alvero-Cruz, José Ramón; Álvarez Carnero, Elvis
2016-09-01
Physical activity (PA) assessment by questionnaire is a cornerstone in the field of sport epidemiology studies. The Physical Activity Questionnaire for Children (PAQ-C) has been used widely to assess PA in healthy school populations. The aim of this study was to evaluate the reliability and validity of the PAQ-C questionnaire in Spanish children using triaxial accelerometry as criterion. Eighty-three (N = 46 boys, N = 37 girls) healthy children (age 10.98 ± 1.17 years, body mass index 19.48 ± 3.51 kg/m(2) ) were volunteers and completed the PAQ-C twice and wore an accelerometer for 8 consecutive days. Reliability was analyzed by the intraclass correlation coefficient (ICC) and the internal consistency by the Cronbach's α coefficient. The PAQ-C was compared against total PA and moderate to vigorous PA (MVPA) obtained by accelerometry. Test-retest reliability showed an ICC = 0.96 for the final score of PAQ-C. Small differences between first and second questionnaire administration were detected. Few and low correlations (rho = 0.228-0.278, all ps < .05) were observed between PAQ-C and accelerometry. The highest correlation was observed for item 9 (rho = 0.311, p < .01). PAQ-C had a high reliability but a questionable validity for assessing total PA and MVPA in Spanish children. Therefore, PA measurement in children should not be limited only to self-report measurements. © 2016, American School Health Association.
A Turkish Version of the Critical-Care Pain Observation Tool: Reliability and Validity Assessment.
Aktaş, Yeşim Yaman; Karabulut, Neziha
2017-08-01
The study aim was to evaluate the validity and reliability of the Critical-Care Pain Observation Tool in critically ill patients. A repeated measures design was used for the study. A convenience sample of 66 patients who had undergone open-heart surgery in the cardiovascular surgery intensive care unit in Ordu, Turkey, was recruited for the study. The patients were evaluated by using the Critical-Care Pain Observation Tool at rest, during a nociceptive procedure (suctioning), and 20 minutes after the procedure while they were conscious and intubated after surgery. The Turkish version of the Critical-Care Pain Observation Tool has shown statistically acceptable levels of validity and reliability. Inter-rater reliability was supported by moderate-to-high-weighted κ coefficients (weighted κ coefficient = 0.55 to 1.00). For concurrent validity, significant associations were found between the scores on the Critical-Care Pain Observation Tool and the Behavioral Pain Scale scores. Discriminant validity was also supported by higher scores during suctioning (a nociceptive procedure) versus non-nociceptive procedures. The internal consistency of the Critical-Care Pain Observation Tool was 0.72 during a nociceptive procedure and 0.71 during a non-nociceptive procedure. The validity and reliability of the Turkish version of the Critical-Care Pain Observation Tool was determined to be acceptable for pain assessment in critical care, especially for patients who cannot communicate verbally. Copyright © 2016 American Society of PeriAnesthesia Nurses. Published by Elsevier Inc. All rights reserved.
Hagiwara, Akiko; Ito, Naomi; Sawai, Kazuhiko; Kazuma, Keiko
2008-09-01
In Japan, there are no valid and reliable physical activity questionnaires for elderly people. In this study, we translated the Physical Activity Scale for the Elderly (PASE) into Japanese and assessed its validity and reliability. Three hundred and twenty-five healthy and elderly subjects over 65 years were enrolled. Concurrent validity was evaluated by Spearman's rank correlation coefficient between PASE scores and an accelerometer (waking steps and energy expenditure), a physical activity questionnaire for adults in general (the Japan Arteriosclerosis Longitudinal Study Physical Activity Questionnaire, JALSPAQ), grip strength, mid-thigh muscle area per bodyweight, static valance and bodyfat percentage. Reliability was evaluated by the test-retest method over a period of 3-4 weeks. The mean PASE score in this study was 114.9. The PASE score was significantly correlated with walking steps (rho = 0.17, P = 0.014), energy expenditure (rho = 0.16, P = 0.024), activity measured with the JALSPAQ (rho = 0.48, P < 0.001), mid-thigh muscle area per bodyweight (rho = 0.15, P = 0.006) and static balance (rho = 0.19, P = 0.001). The proportion of consistency in the response between the first and second surveys was adequately high. The intraclass correlation coefficient for the PASE score was 0.65. The Japanese version of PASE was shown to have acceptable validity and reliability. The PASE is useful to measure the physical activity of elderly people in Japan.
Reliability and Validity of Athletes Disability Index Questionnaire.
Noormohammadpour, Pardis; Hosseini Khezri, Alireza; Farahbakhsh, Farzin; Mansournia, Mohammad Ali; Smuck, Matthew; Kordi, Ramin
2018-03-01
The purpose of this study was to evaluate validity and reliability of a new proposed questionnaire for assessment of functional disability in athletes with low back pain (LBP). Validity and reliability study. Elite athletes participating in different fields of sports. Participants were 165 male and female athletes (between 12 and 50 years old) with LBP. Athlete Disability Index (ADI) Questionnaire which is developed by the authors for assessing LBP-related disability in athletes, Oswestry Disability Index (ODI), and the Roland-Morris Disability Questionnaire (RDQ). Self-reported responses were collected regarding LBP-related disability through ADI, ODI, and RDQ. The test-retest reliability was strong, and intraclass correlation value ranged between 0.74 and 0.94. The Cronbach alpha coefficient value of 0.91 (P < 0.001) demonstrated excellent internal consistency of the questionnaire. The correlation coefficient between ADI and ODI was r = 0.918 (P < 0.0001), between ADI and RDQ was r = 0.669 (P < 0.0001), and between ADI and visual analog scale was r = 0.626 (P < 0.001). According to ODI and RDQ, disability levels were mild in the large majority of subjects (91.5% and 86.0%, respectively). Alternatively, disability assessments by the ADI did not cluster at the mild level and ranged more broadly from mild to very high. The ADI is a reliable and valid instrument for assessing disability in athletes with LBP. Compared with the available LBP disability questionnaires used in the general population, ADI can more precisely stratify the disability levels of athletes due to LBP.
NASA Astrophysics Data System (ADS)
Fisher, W. P., Jr.; Elbaum, B.; Coulter, A.
2010-07-01
Reliability coefficients indicate the proportion of total variance attributable to differences among measures separated along a quantitative continuum by a testing, survey, or assessment instrument. Reliability is usually considered to be influenced by both the internal consistency of a data set and the number of items, though textbooks and research papers rarely evaluate the extent to which these factors independently affect the data in question. Probabilistic formulations of the requirements for unidimensional measurement separate consistency from error by modelling individual response processes instead of group-level variation. The utility of this separation is illustrated via analyses of small sets of simulated data, and of subsets of data from a 78-item survey of over 2,500 parents of children with disabilities. Measurement reliability ultimately concerns the structural invariance specified in models requiring sufficient statistics, parameter separation, unidimensionality, and other qualities that historically have made quantification simple, practical, and convenient for end users. The paper concludes with suggestions for a research program aimed at focusing measurement research more on the calibration and wide dissemination of tools applicable to individuals, and less on the statistical study of inter-variable relations in large data sets.
Reliability of adapted version of Italian Label tobacco Impact Index for the adolescent: ALII.
Guerra, F; Mannocci, A; Colamesta, V; De Luca, G; Fiore, M; Firenze, A; Ferrara, M; Langiano, E; De Vito, E; Bonaccorsi, G; La Torre, G
2017-01-01
The aim of this study is to assess the reliability of the Adolescent Label Impact Index (ALII) , it is an adolescent adapted version of Italian LII of the tobacco products warnings. A sample including students aged 13-15 years was considered. The ALII is constructed by 4 items: salience, harm, quitting and forgo. The questionnaire was self-administered to study participants twice with 3 days between each administration (T1 and T2) to measure reliability. The internal consistency using Cronbach's alpha and Corrected Item-Total Correlations (CITC) and the test-retest reliability applying Pearson's correlation were computed. Cronbach's alpha ranges from 0.625 at T1 to 0.715 at T2. The "salience" resulted the item with the lowest CITC value (=0.281). The Pearson's coefficient was r=0.909 (p<0.001). The instruments is low in cost and easy to administer and analyses in a setting people aged 13-15 years. The ALII shown an acceptable consistency and excellent stability over time. However, attention has to be paid when the ALII is administered to the no smoking teens and who has never seen the tobacco product labels to allow an appropriate interpretation of the data collected.
Clark, S; Rose, D J
2001-04-01
To establish reliability estimates of the 75% Limits of Stability Test (75% LOS test) when administered to community-dwelling older adults with a history of falls. Generalizability theory was used to estimate both the relative contribution of identified error sources to the total measurement error and generalizability coefficients. A random effects repeated-measures analysis of variance (ANOVA) was used to assess consistency of LOS test movement variables across both days and targets. A motor control research laboratory in a university setting. Fifty community-dwelling older adults with 2 or more falls in the previous year. Spatial and temporal measures of dynamic balance derived from the 75% LOS test included average movement velocity, maximum center of gravity (COG) excursion, end-point COG excursion, and directional control. Estimated generalizability coefficients for 2 testing days ranged from.58 to.87. Total variance in LOS test measures attributable to inconsistencies in day-to-day test performance (Day and Subject x Day facets) ranged from 2.5% to 8.4%. The ANOVA results indicated that no significant differences were observed in the LOS test variables across the 2 testing days. The 75% LOS test administered to older adult fallers on 2 consecutive days provides consistent and reliable measures of dynamic balance.
Synkinesis assessment in facial palsy: validation of the Dutch Synkinesis Assessment Questionnaire.
Kleiss, Ingrid J; Beurskens, Carien H G; Stalmeier, Peep F M; Ingels, Koen J A O; Marres, Henri A M
2016-06-01
The objective of this study is to validate an existing health-related quality of life questionnaire for patients with synkinesis in facial palsy for implementation in the Dutch language and culture. The Synkinesis Assessment Questionnaire was translated into the Dutch language using a forward-backward translation method. A pilot test with the translated questionnaire was performed in 10 patients with facial palsy and 10 normal subjects. Finally, cross-cultural adaption was accomplished at our outpatient clinic for facial palsy. Analyses for internal consistency, test-retest reliability, and construct validity were performed. Sixty-six patients completed the Dutch Synkinesis Assessment Questionnaire and the Dutch Facial Disability Index. Cronbach's α, representing internal consistency, was 0.80. Test-retest reliability was 0.53 (Spearman's correlation coefficient, P < 0.01). Correlations with the House-Brackmann score, Sunnybrook score, Facial Disability Index physical function, and social/well-being function were -0.29, 0.20, -0.29, and -0.32, respectively. Correlation with the Sunnybrook synkinesis subscore was 0.50 (Spearman's correlation coefficient). The Dutch Synkinesis Assessment Questionnaire shows good psychometric values and can be implemented in the management of Dutch-speaking patients with facial palsy and synkinesis in the Netherlands. Translation of the instrument into other languages may lead to widespread use, making evaluation, and comparison possible among different providers.
Validation of a Persian version of the Fibromyalgia Impact Questionnaire (FIQ-P).
Bidari, Ali; Hassanzadeh, Morteza; Mohabat, Mohamad-Farzam; Talachian, Elham; Khoei, Effat Merghati
2014-02-01
The aim of this study is to translate, adapt, and validate a Persian version of the Fibromyalgia (FM) Impact Questionnaire (FIQ-P). The FIQ-P was adapted following the translation and back-translation approach; then, it was administered to thirty females with FM. Participants also completed two other validated questionnaires, the Medical Outcome Survey Short Form-36 (SF-36) and the Beck Depression Inventory (BDI). Internal consistency within the FIQ-P items and its test-retest reliability were assessed with Cronbach's alpha and Spearman's correlation coefficient, respectively. Construct validity was analyzed by Spearman's r when correlating the FIQ-P to other questionnaires. The translated version was concordant. Adaptation affected two sub-items of physical function. Participants' mean age ± standard deviation was 40.4 ± 9.0 years. Internal consistency proved good with α = 0.80. Test-retest coefficient ranged from 0.50 for the item "work days missed" to 0.79 for all FIQ-P items. Fair and statistically significant (P < 0.01) correlations were found between the FIQ-P items and two other questionnaires, SF-36 (r = -0.57) and BDI (r = 0.53). We concluded that the FIQ-P is a valid and reliable instrument for measuring health status of Persian-speaking FM patients.
Öksüz, Çigdem; Alemdaroglu, Ipek; Kilinç, Muhammed; Abaoğlu, Hatice; Demirci, Cevher; Karahan, Sevilay; Yilmaz, Oznur; Yildirim, Sibel Aksu
2017-10-01
This study was performed to examine the reliability and validity of the Turkish version of ABILHAND-Kids questionnaire which assesses manual functions of children with neuromuscular diseases (NMDs). A cross sectional survey study design and Rasch analysis were used to assess the reliability and validity of the Turkish version of scale. Ninety-three children with different neuromuscular disorders and their parents were included in the study. The scale was applied to the parents with face-to-face interview twice; on their first visit and after an interval of 15 days. The test-retest reliability was assessed with intraclass correlation coefficient (ICC), and internal consistency of the multi-item subscales by calculating Cronbach alpha values. Brooke Upper Extremity Functional Classification (BUEFC) and Wee-Functional Independency Measurement (Wee-FIM) were correlated to determine the construct validity. The ICC value for the test/retest reliability was 0.94. The internal consistency was 0.81. Floor (1.1%) and ceiling (11.8%) effects were not significant. There were moderate correlations between the Turkish version of ABILHAND-Kids and Wee-FIM (0.67) and BUEFC (-0.37). Rasch analysis indicated good item fit, unidimensionality, and model fit. The Turkish version of ABILHAND-Kids questionnaire was found to be a reliable and valid scale for the assessment of the manual ability of children with NMDs.
Validity and reliability of the Utrecht Work Engagement Scale-Student Version in Sri Lanka.
Wickramasinghe, Nuwan Darshana; Dissanayake, Devani Sakunthala; Abeywardena, Gihan Sajiwa
2018-05-04
The present study was aimed at assessing the validity and the reliability of the Sinhala version of the Utrecht Work Engagement Scale-Student Version (UWES-S) among collegiate cycle students in Sri Lanka. The 17-item UWES-S was translated to Sinhala and the judgmental validity was assessed by a multi-disciplinary panel of experts. Construct validity of the UWES-S was appraised by using multi-trait scaling analysis and exploratory factor analysis (EFA) on data obtained from a sample of 194 grade thirteen students in the Kurunegala district, Sri Lanka. Reliability of the UWES-S was assessed by using internal consistency and test-retest reliability. Except for item 13, all other items showed good psychometric properties in judgemental validity, item-convergent validity and item-discriminant validity. EFA using principal component analysis with Oblimin rotation, suggested a three-factor solution (including vigor, dedication and absorption subscales) explaining 65.4% of the total variance for the 16-item UWES-S (with item 13 deleted). All three subscales show high internal consistency with Cronbach's α coefficient values of 0.867, 0.819, and 0.903 and test-retest reliability was high (p < 0.001). Hence, the Sinhala version of the 16-item UWES-S is a valid and a reliable instrument to assess work engagement among collegiate cycle students in Sri Lanka.
Asgari, Fatemeh; Haghdoost, Faraidoon; Masjedi, Samaneh Sadat; Manouchehri, Navid; Banihashemi, Mahboobeh; Ghorbani, Abbas; Najafi, Mohammad Reza; Saadatnia, Mohammad; Lipton, Richard B.
2014-01-01
Introduction. MIDAS is a valid and reliable short questionnaire for assessment of headache related disability. Linguistic validation of Persian MIDAS and assessment of psychometric properties between tension type headache (TTH) and migraine were the aims of this study. Methods. Patients with migraine or TTH were included. At the first visit, we administered a headache symptom questionnaire, MIDAS, and SF-36. Patients filled out MIDAS in second and third visit within three and eight weeks after base line visit. Internal consistency (Cronbach α) and test-retest reproducibility (Spearman correlation coefficient) were used to assess reliability. Convergent validity and MIDAS capability to differentiate between chronic and episodic headaches (migraine and TTH) were also assessed. Results. The 267 participants had episodic migraine (EM-64%), chronic migraine (CM-13.5%), episodic TTH (ETTH-13.5%), and chronic TTH (CTTH-9). Internal consistency reliability was 0.8 for the entire sample, 0.72 for TTH, and 0.82 for migraine. Test-retest reliability for all questions between visit 1 and visit 2 varied from 0.54 to 0.71. Convergent validity was assessed using SF-36 as an external referent. Patients with episodic headaches (EM and ETTH) had significantly lower MIDAS scores than chronic headaches (CM and CTTH). Conclusion. Persian MIDAS is a valid and reliable questionnaire for migraine and TTH that can differentiate between episodic headache and chronic headache. PMID:24527462
Peterson, Eleanor B; Calhoun, Aaron W; Rider, Elizabeth A
2014-09-01
With increased recognition of the importance of sound communication skills and communication skills education, reliable assessment tools are essential. This study reports on the psychometric properties of an assessment tool based on the Kalamazoo Consensus Statement Essential Elements Communication Checklist. The Gap-Kalamazoo Communication Skills Assessment Form (GKCSAF), a modified version of an existing communication skills assessment tool, the Kalamazoo Essential Elements Communication Checklist-Adapted, was used to assess learners in a multidisciplinary, simulation-based communication skills educational program using multiple raters. 118 simulated conversations were available for analysis. Internal consistency and inter-rater reliability were determined by calculating a Cronbach's alpha score and intra-class correlation coefficients (ICC), respectively. The GKCSAF demonstrated high internal consistency with a Cronbach's alpha score of 0.844 (faculty raters) and 0.880 (peer observer raters), and high inter-rater reliability with an ICC of 0.830 (faculty raters) and 0.89 (peer observer raters). The Gap-Kalamazoo Communication Skills Assessment Form is a reliable method of assessing the communication skills of multidisciplinary learners using multi-rater methods within the learning environment. The Gap-Kalamazoo Communication Skills Assessment Form can be used by educational programs that wish to implement a reliable assessment and feedback system for a variety of learners. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
SPSS Macros for Assessing the Reliability and Agreement of Student Evaluations of Teaching
ERIC Educational Resources Information Center
Morley, Donald D.
2009-01-01
This article reports and demonstrates two SPSS macros for calculating Krippendorff's alpha and intraclass reliability coefficients in repetitive situations where numerous coefficients are needed. Specifically, the reported SPSS macros were used to evaluate the interrater agreement and reliability of student evaluations of teaching in thousands of…
An interrater reliability study of the Braden scale in two nursing homes.
Kottner, Jan; Dassen, Theo
2008-10-01
Adequate risk assessment is essential in pressure ulcer prevention. Assessment scales were designed to support practitioners in identifying persons at pressure ulcer risk. The Braden scale is one of the most extensively studied risk assessment instruments, although the majority of studies focused on validity rather than reliability. The first aim was to measure the interrater reliability of the Braden scale and its individual items. The second aim was to study different statistical approaches regarding interrater reliability estimation. An interrater reliability study was conducted in two German nursing homes. Residents (n = 152) from 8 units were assessed twice. The raters were trained nurses with a work experience ranging from 0.5 to 30 years. Data were analysed using an overall percentage of agreement, weighted and unweighted kappa and the intraclass correlation coefficient. Differences between nurses rating the overall Braden score ranged from 0 up to 9 points. Interrater reliability expressed by the intraclass correlation coefficient ranged from 0.73 (95% CI 0.26 - 0.91) to 0.95 (95% CI 0.87 - 0.98). Calculated intraclass correlation coefficients for individual items ranged from 0.06 (95% CI -0.31 to 0.48) to 0.97 (95% CI 0.93-0.99) with the lowest values being measured for the items "sensory perception" and "nutrition". There was no association between work experience and the level of interrater reliability. With two exceptions, simple kappa-values were always lower than weighted kappa-values and intraclass correlation coefficients. Although the calculated interrater reliability coefficients for the total Braden score were high in some cases, several clinically relevant differences occurred between the nurses. Due to interrater reliability being very low for the items "sensory perception" and "nutrition", it is doubtful if their assessment contributes to any valid results. The calculation of weighted kappa or intraclass correlation coefficients is the most appropriate interrater reliability estimates.
Koho, P; Aho, S; Kautiainen, H; Pohjolainen, T; Hurri, H
2014-12-01
To estimate the internal consistency, test-retest reliability and comparability of paper and computer versions of the Finnish version of the Tampa Scale of Kinesiophobia (TSK-FIN) among patients with chronic pain. In addition, patients' personal experiences of completing both versions of the TSK-FIN and preferences between these two methods of data collection were studied. Test-retest reliability study. Paper and computer versions of the TSK-FIN were completed twice on two consecutive days. The sample comprised 94 consecutive patients with chronic musculoskeletal pain participating in a pain management or individual rehabilitation programme. The group rehabilitation design consisted of physical and functional exercises, evaluation of the social situation, psychological assessment of pain-related stress factors, and personal pain management training in order to regain overall function and mitigate the inconvenience of pain and fear-avoidance behaviour. The mean TSK-FIN score was 37.1 [standard deviation (SD) 8.1] for the computer version and 35.3 (SD 7.9) for the paper version. The mean difference between the two versions was 1.9 (95% confidence interval 0.8 to 2.9). Test-retest reliability was 0.89 for the paper version and 0.88 for the computer version. Internal consistency was considered to be good for both versions. The intraclass correlation coefficient for comparability was 0.77 (95% confidence interval 0.66 to 0.85), indicating substantial reliability between the two methods. Both versions of the TSK-FIN demonstrated substantial intertest reliability, good test-retest reliability, good internal consistency and acceptable limits of agreement, suggesting their suitability for clinical use. However, subjects tended to score higher when using the computer version. As such, in an ideal situation, data should be collected in a similar manner throughout the course of rehabilitation or clinical research. Copyright © 2014 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Reliability and validity of the Turkish version of the Berg Balance Scale.
Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu
2008-01-01
The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (p<0.0001) and 0.97 (p<0.0001), respectively. Chronbach alpha of the Turkish version of the BBS was 0.98. The test-retest reliability (ICC) of the Turkish version of the BBS was determined as 0.98 for the total score, and ranged from 0.86-0.99 for individual items. In terms of validity, the Turkish version of the BBS was correlated with the MBI (in positive direction) and TUG (in negative direction) (r=0.67 p<0.0001; r=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
Development of a questionnaire to evaluate asthma control in Japanese asthma patients.
Tohda, Yuji; Hozawa, Soichiro; Tanaka, Hiroshi
2018-01-01
The asthma control questionnaires used in Japan are Japanese translations of those developed outside Japan, and have some limitations; a questionnaire designed to optimally evaluate asthma control levels for Japanese may be necessary. The present study was conducted to validate the Japan Asthma Control Survey (JACS) questionnaire in Japanese asthma patients. A total of 226 adult patients with mild to severe persistent asthma were enrolled and responded to the JACS questionnaire, asthma control questionnaire (ACQ), and Mini asthma quality of life questionnaire (Mini AQLQ) at Weeks 0 and 4. The reliability, validity, and sensitivity/responsiveness of the JACS questionnaire were evaluated. The intra-class correlation coefficients (ICCs) were within the range of 0.55-0.75 for all JACS scores, indicating moderate/substantial reproducibility. For internal consistency, Cronbach's alpha coefficients ranged from 0.76 to 0.92 in total and subscale scores, which were greater than the lower limit of internal consistency. As for factor validity, the cumulative contribution ratio of four main factors was 0.66. For criterion-related validity, the correlation coefficients between the JACS total score and ACQ5, ACQ6, and Mini AQLQ scores were -0.78, -0.78, and 0.77, respectively, showing a significant correlation (p < 0.0001). The JACS questionnaire was validated in terms of reliability and validity. It will be necessary to evaluate the therapeutic efficacy measured by the JACS questionnaire and calculate cutoff values for the asthma control status in a higher number of patients. UMIN000016589. Copyright © 2017 Japanese Society of Allergology. Production and hosting by Elsevier B.V. All rights reserved.
Swiss (German) Version of the Actinic Keratosis Quality of Life questionnaire.
Meier, Larissa S; Schubert, Maria; Göksu, Yasemin; Esmann, Solveig; Vinding, Gabrielle R; Jemec, Gregor B E; Hofbauer, Günther F L
2018-04-18
Actinic keratosis (AK) is a sun-induced skin lesion that may progress to invasive squamous cell carcinoma of the skin. Recently, the Actinic Keratosis Quality of Life questionnaire (AKQoL) was designed for patients with AK in Denmark as a specific quality of life instrument for AK patients. The objective of this study was to adapt the AKQoL for the German language region of Switzerland and to evaluate its psychometric properties (validity, reliability). Translation and cultural adaptation of the questionnaire were assessed by using the technique of cognitive interviewing. During the translation process, 34 patients with AK from the Department of Dermatology, University Hospital Zurich, were interviewed in 3 sessions of cognitive interviewing. The translated questionnaire was then distributed together with the Dermatology Life Quality Index (DLQI) to a second group of 113 patients for validation and reliability testing. Within this group, we measured the internal consistency by the Cronbach coefficient α and Spearman correlation coefficient between the AKQoL and the DLQI. The problems encountered during the translation process led to changes in 5 categories as described by Epstein: stylistic changes, change in breadth, change in actual meaning, change in frequency and time frame, change in intensity. We found a Cronbach α of 0.82, an acceptable internal consistency. The Spearman correlation coefficient between total scores of AKQoL and DLQI was 0.57. We culturally adapted and validated a Swiss (German) version of the AKQoL questionnaire applicable for the population of a university center in Switzerland to measure and monitor the quality of life in patients with AK. © 2018 S. Karger AG, Basel.
Han, Paul K J; Joekes, Katherine; Mills, Greg; Gutheil, Caitlin; Smith, Kahsi; Cochran, Nancy E; Elwyn, Glyn
2016-12-01
To develop and evaluate a brief observational measure of clinical risk communication competence. A 4-item checklist-type measure, the BRISK (Brief Risk Information Skill) Scale, was developed by selecting and refining items from a more comprehensive measure of clinical risk communication competence. Six volunteer raters received brief training on the measure and then used the BRISK Scale to evaluate 52 video-recorded encounters between 2nd-year medical students and standardized patients conducted as part of an Observed Structured Clinical Examination (OSCE) involving a risk communication task. Internal consistency reliability, inter-rater reliability, and criterion validity were assessed. Raters reported no difficulties using the BRISK Scale; scores across all raters and subjects ranged from 0 to 16 with a mean score of 6.49 (SD=3.17). The BRISK Scale showed good internal consistency reliability (α=0.64), and inter-rater reliability at the scale level (Intraclass Correlation Coefficient (ICC)=0.79 for consistency, and 0.75 for absolute agreement) and individual-item level (ICC range: 0.62-.91). Novice raters' BRISK Scale scores were highly correlated (r=0.84, p<0.01) with expert raters' scores on the Risk Communication Content measure, a more comprehensive measure of risk communication competence. The BRISK Scale is a promising new brief observational measure of clinical risk communication competence. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Le, Minh Thi Hong; Tran, Thach Duc; Holton, Sara; Nguyen, Huong Thanh; Wolfe, Rory; Fisher, Jane
2017-01-01
To assess the internal consistency, latent structure and convergent validity of the Depression, Anxiety and Stress Scale-21 (DASS-21) among adolescents in Vietnam. An anonymous, self-completed questionnaire was conducted among 1,745 high school students in Hanoi, Vietnam between October, 2013 and January, 2014. Confirmatory factor analyses were performed to assess the latent structure of the DASS-21. Factorial invariance between girls and boys was examined. Cronbach alphas and correlation coefficients between DASS-21 factor scores and the domain scores of the Duke Health Profile Adolescent Vietnamese validated version (ADHP-V) were calculated to assess DASS-21 internal consistency and convergent validity. A total of 1,606/ 1,745 (92.6%) students returned the questionnaire. Of those, 1,387 students provided complete DASS-21 data. The scale demonstrated adequate internal consistency (Cronbach α: 0.761 to 0.906). A four-factor model showed the best fit to the data. Items loaded significantly on a common general distress factor, the depression, and the anxiety factors, but few on the stress factor (p<0.05). DASS-21 convergent validity was confirmed with moderate correlation coefficients (-0.47 to -0.66) between its factor scores and the ADHP-V mental health related domains. The DASS-21 is reliable and suitable for use to assess symptoms of common mental health problems, especially depression and anxiety among Vietnamese adolescents. However, its ability in detecting stress among these adolescents may be limited. Further research is warrant to explore these results.
Bayesian Meta-Analysis of Cronbach's Coefficient Alpha to Evaluate Informative Hypotheses
ERIC Educational Resources Information Center
Okada, Kensuke
2015-01-01
This paper proposes a new method to evaluate informative hypotheses for meta-analysis of Cronbach's coefficient alpha using a Bayesian approach. The coefficient alpha is one of the most widely used reliability indices. In meta-analyses of reliability, researchers typically form specific informative hypotheses beforehand, such as "alpha of…
ERIC Educational Resources Information Center
Wilson, Celia M.
2010-01-01
Research pertaining to the distortion of the squared canonical correlation coefficient has traditionally been limited to the effects of sampling error and associated correction formulas. The purpose of this study was to compare the degree of attenuation of the squared canonical correlation coefficient under varying conditions of score reliability.…
Jafari, Najmeh; Zamani, Ahmadreza; Lazenby, Mark; Farajzadegan, Ziba; Emami, Hamid; Loghmani, Amir
2013-02-01
The Functional Assessment of Chronic Illness Therapy-Spiritual Well-Being (FACIT-Sp) scale is a valid and reliable instrument to provide an inclusive measure of spirituality in research and clinical practice. The aim of this study was to translate and investigate the reliability and validity of the Persian version of the FACIT-Sp. The 12 item spiritual well-being subscale of the FACIT-Sp Version 4 was translated into the Persian language, Farsi, using the FACIT translation methodology. The questionnaire was administered to a diverse sample of 153 patients in treatment for cancer. Internal consistency was assessed by Cronbach's α coefficient, confirmatory factor analysis (CFA) was applied to assess construct validity, and regression analysis was used to assess the predictor role of the FACIT-Sp in health-related quality of life (HRQOL). Cronbach's α reliability coefficient for the FACIT-Sp subscales ranged from 0.72 to 0.90. The CFA generally replicated the original conceptualization of the three subscales of the FACIT-Sp12 (Peace, Meaning, and Faith). All three subscales significant predicted HRQOL. The Persian version of the FACIT-Sp scale is a reliable and valid tool for the clinical assessment of, and research into, the spiritual well-being of Muslim Iranian and Farsi-speaking patients in other regions of the world who are in treatment for cancer.
Martín-Nieto, A; Atín-Arratibel, M Á; Bravo-Llatas, C; Moreno-Bermejo, M I; Martín-Casas, P
2018-06-08
The aim of this study was to develop and validate a Spanish-language version of the Scale for Contraversive Pushing, used to diagnose and measure pusher behaviour in stroke patients. Translation-back translation was used to create the Spanish-language Scale for Contraversive Pushing; we subsequently evaluated its validity and reliability by administering it to a sample of patients. We also analysed its sensitivity to change in patients identified as pushers who received neurological physiotherapy. Experts indicated that the content of the scale was valid. Internal consistency was very good (Cronbach's alpha of 0.94). The intraclass correlation coefficient showed high intra- and interobserver reliability (0.999 and 0.994, respectively). The Kappa and weighted Kappa coefficients were used to measure the reliability of each item; the majority obtained values above 0.9. Lastly, the differences between baseline and final evaluations of pushers were significant (paired sample t test), showing that the scale is sensitive to changes obtained through physical therapy. The Spanish-language version of the Scale for Contraversive Pushing is valid and reliable for measuring pusher behaviour in stroke patients. In addition, it is able to evaluate the ongoing changes in patients who have received physical therapy. Copyright © 2018 Sociedad Española de Neurología. Publicado por Elsevier España, S.L.U. All rights reserved.
Saad, Karen Ruggeri; Colombo, Alexandra S; João, Silvia M Amado
2009-01-01
The purpose of this study was to investigate the reliability and validity of photogrammetry in measuring the lateral spinal inclination angles. Forty subjects (32 female and 8 males) with a mean age of 23.4 +/- 11.2 years had their scoliosis evaluated by radiographs of their trunk, determined by the Cobb angle method, and by photogrammetry. The statistical methods used included Cronbach alpha, Pearson/Spearman correlation coefficients, and regression analyses. The Cronbach alpha values showed that the photogrammetric measures showed high internal consistency, which indicated that the sample was bias free. The radiograph method showed to be more precise with intrarater reliabilities of 0.936, 0.975, and 0.945 for the thoracic, lumbar, and thoracolumbar curves, respectively, and interrater reliabilities of 0.942 and 0.879 for the angular measures of the thoracic and thoracolumbar segments, respectively. The regression analyses revealed a high determination coefficient although limited to the adjusted linear model between the radiographic and photographic measures. It was found that with more severe scoliosis, the lateral curve measures obtained with the photogrammetry were for the thoracic and lumbar regions (R = 0.619 and 0.551). The photogrammetric measures were found to be reproducible in this study and could be used as supplementary information to decrease the number of radiographs necessary for the monitoring of scoliosis.
Internal consistency and stability of the CANTAB neuropsychological test battery in children.
Syväoja, Heidi J; Tammelin, Tuija H; Ahonen, Timo; Räsänen, Pekka; Tolvanen, Asko; Kankaanpää, Anna; Kantomaa, Marko T
2015-06-01
The Cambridge Neuropsychological Test Automated Battery (CANTAB) is a computer-assessed test battery widely use in different populations. The internal consistency and 1-year stability of CANTAB tests were examined in school-age children. Two hundred-thirty children (57% girls) from five schools in the Jyväskylä school district in Finland participated in the study in spring 2011. The children completed the following CANTAB tests: (a) visual memory (pattern recognition memory [PRM] and spatial recognition memory [SRM]), (b) executive function (spatial span [SSP], Stockings of Cambridge [SOC], and intra-extra dimensional set shift [IED]), and (c) attention (reaction time [RTI] and rapid visual information processing [RVP]). Seventy-four children participated in the follow-up measurements (64% girls) in spring 2012. Cronbach's alpha reliability coefficient was used to estimate the internal consistency of the nonhampering test, and structural equation models were applied to examine the stability of these tests. The reliability and the stability could not be determined for IED or SSP because of the nature of these tests. The internal consistency was acceptable only in the RTI task. The 1-year stability was moderate-to-good for the PRM, RTI, and RVP. The SSP and IED showed a moderate correlation between the two measurement points. The SRM and the SOC tasks were not reliable or stable measures in this study population. For research purposes, we recommend using structural equation modeling to improve reliability. The results suggest that the reliability and the stability of computer-based test batteries should be confirmed in the target population before using them for clinical or research purposes. (c) 2015 APA, all rights reserved).
Validity and Reliability of the Persian Version of the Dysphagia Handicap Index (DHI).
Asadollahpour, Faezeh; Baghban, Kowsar; Asadi, Mozhgan
2015-05-01
The Dysphagia Handicap Index (DHI) is one of the instruments used for measuring a dysphagic patient's self-assessment. In some ways, it reflects the patient's quality of life. Although it has been recognized and widely applied in English speaking populations, it has not been used in its present forms in Persian speaking countries. The purpose of this study was to adapt a Persian version of the DHI and to evaluate its validity, consistency, and reliability in the Persian population with oropharyngeal dysphagia. Some stages for cross-cultural adaptation were performed, which consisted in translation, synthesis, back translation, review by an expert committee, and final proof reading. The generated Persian DHI was administered to 85 patients with oropharyngeal dysphagia and 89 control subjects at Zahedan city between May 2013 and August 2013. The patients and control subjects answered the same questionnaire 2 weeks later to verify the test-retest reliability. Internal consistency and test-retest reliability were evaluated. The results of the patients and the control group were compared. The Persian DHI showed good internal consistency (Cronbach's alpha coefficients range from 0.82 to 0.94). Also, good test-retest reliability was found for the total scores of the Persian DHI (r=0.89). There was a significant difference between the DHI scores of the control group and those of the oropharyngeal dysphagia group (P‹0.001). The Persian version of the DHI achieved Face and translation validity. This study demonstrated that the Persian DHI is a valid tool for self-assessment of the handicapping effects of dysphagia on the physical, functional, and emotional aspects of patient life and can be a useful tool for screening and treatment planning for the Persian-speaking dysphagic patients, regardless of the cause or the severity of the dysphagia.
Aguiar, A S; Bataglion, C; Visscher, C M; Bevilaqua Grossi, D; Chaves, T C
2017-07-01
Fear of movement (kinesiophobia) seems to play an important role in the development of chronic pain. However, for temporomandibular disorders (TMD), there is a scarcity of studies about this topic. The Tampa Scale for Kinesiophobia for TMD (TSK/TMD) is the most widely used instrument to measure fear of movement and it is not available in Brazilian Portuguese. The purpose of this study was to culturally adapt the TSK/TMD to Brazilian Portuguese and to assess its psychometric properties regarding internal consistency, reliability, and construct and structural validity. A total of 100 female patients with chronic TMD participated in the validation process of the TSK/TMD-Br. The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Spearman's rank correlation for construct validity and confirmatory factor analysis (CFA) for structural validity. CFA endorsed the pre-specified model with two domains and 12-items (Activity Avoidance - AA/Somatic Focus - SF) and all items obtained a loading factor greater than 0·4. Acceptable levels of reliability were found (ICC > 0·75) for all questions and domains of the TSK/TMD-Br. For internal consistency, Cronbach's α of 0·78 for both domains were found. Moderate correlations (0·40 < r < 0.60) were observed for 84% of the analyses conducted between TSK/TMD-Br scores versus catastrophising, depression and jaw functional limitation. TSK/TMD-Br 12 items and two-factor demonstrated sound psychometric properties (transcultural validity, reliability, internal consistency and structural validity). In such a way, the instrument can be used in clinical settings and for research purposes. © 2017 John Wiley & Sons Ltd.
Slootweg, Irene A.; Lombarts, Kiki M. J. M. H.; Boerebach, Benjamin C. M.; Heineman, Maas Jan; Scherpbier, Albert J. J. A.; van der Vleuten, Cees P. M.
2014-01-01
Background Teamwork between clinical teachers is a challenge in postgraduate medical training. Although there are several instruments available for measuring teamwork in health care, none of them are appropriate for teaching teams. The aim of this study is to develop an instrument (TeamQ) for measuring teamwork, to investigate its psychometric properties and to explore how clinical teachers assess their teamwork. Method To select the items to be included in the TeamQ questionnaire, we conducted a content validation in 2011, using a Delphi procedure in which 40 experts were invited. Next, for pilot testing the preliminary tool, 1446 clinical teachers from 116 teaching teams were requested to complete the TeamQ questionnaire. For data analyses we used statistical strategies: principal component analysis, internal consistency reliability coefficient, and the number of evaluations needed to obtain reliable estimates. Lastly, the median TeamQ scores were calculated for teams to explore the levels of teamwork. Results In total, 31 experts participated in the Delphi study. In total, 114 teams participated in the TeamQ pilot. The median team response was 7 evaluations per team. The principal component analysis revealed 11 factors; 8 were included. The reliability coefficients of the TeamQ scales ranged from 0.75 to 0.93. The generalizability analysis revealed that 5 to 7 evaluations were needed to obtain internal reliability coefficients of 0.70. In terms of teamwork, the clinical teachers scored residents' empowerment as the highest TeamQ scale and feedback culture as the area that would most benefit from improvement. Conclusions This study provides initial evidence of the validity of an instrument for measuring teamwork in teaching teams. The high response rates and the low number of evaluations needed for reliably measuring teamwork indicate that TeamQ is feasible for use by teaching teams. Future research could explore the effectiveness of feedback on teamwork in follow up measurements. PMID:25393006
Slootweg, Irene A; Lombarts, Kiki M J M H; Boerebach, Benjamin C M; Heineman, Maas Jan; Scherpbier, Albert J J A; van der Vleuten, Cees P M
2014-01-01
Teamwork between clinical teachers is a challenge in postgraduate medical training. Although there are several instruments available for measuring teamwork in health care, none of them are appropriate for teaching teams. The aim of this study is to develop an instrument (TeamQ) for measuring teamwork, to investigate its psychometric properties and to explore how clinical teachers assess their teamwork. To select the items to be included in the TeamQ questionnaire, we conducted a content validation in 2011, using a Delphi procedure in which 40 experts were invited. Next, for pilot testing the preliminary tool, 1446 clinical teachers from 116 teaching teams were requested to complete the TeamQ questionnaire. For data analyses we used statistical strategies: principal component analysis, internal consistency reliability coefficient, and the number of evaluations needed to obtain reliable estimates. Lastly, the median TeamQ scores were calculated for teams to explore the levels of teamwork. In total, 31 experts participated in the Delphi study. In total, 114 teams participated in the TeamQ pilot. The median team response was 7 evaluations per team. The principal component analysis revealed 11 factors; 8 were included. The reliability coefficients of the TeamQ scales ranged from 0.75 to 0.93. The generalizability analysis revealed that 5 to 7 evaluations were needed to obtain internal reliability coefficients of 0.70. In terms of teamwork, the clinical teachers scored residents' empowerment as the highest TeamQ scale and feedback culture as the area that would most benefit from improvement. This study provides initial evidence of the validity of an instrument for measuring teamwork in teaching teams. The high response rates and the low number of evaluations needed for reliably measuring teamwork indicate that TeamQ is feasible for use by teaching teams. Future research could explore the effectiveness of feedback on teamwork in follow up measurements.
Wan, Chonghua; Li, Hezhan; Fan, Xuejin; Yang, Ruixue; Pan, Jiahua; Chen, Wenru; Zhao, Rong
2014-06-04
Quality of life (QOL) for patients with coronary heart disease (CHD) is now concerned worldwide with the specific instruments being seldom and no one developed by the modular approach. This paper is aimed to develop the CHD scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-CHD) by the modular approach and validate it by both classical test theory and Generalizability Theory. The QLICD-CHD was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interview, pre-testing and quantitative statistical procedures. 146 inpatients with CHD were used to provide the data measuring QOL three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t-tests and also G studies and D studies of Genralizability Theory analysis. Multi-trait scaling analysis, correlation and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. The internal consistency α and test-retest reliability coefficients (Pearson r and Intra-class correlations ICC) for the overall instrument and all domains were higher than 0.70 and 0.80 respectively; The overall and all domains except for social domain had statistically significant changes after treatments with moderate effect size SRM (standardized response mea) ranging from 0.32 to 0.67. G-coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-CHD has good validity, reliability, and moderate responsiveness and some highlights, and can be used as the quality of life instrument for patients with CHD. However, in order to obtain better reliability, the numbers of items for social domain should be increased or the items' quality, not quantity, should be improved.
Kilgus, Stephen P; Riley-Tillman, T Chris; Stichter, Janine P; Schoemann, Alexander M; Bellesheim, Katie
2016-09-01
The purpose of this investigation was to evaluate the reliability of Direct Behavior Ratings-Social Competence (DBR-SC) ratings. Participants included 60 students identified as possessing deficits in social competence, as well as their 23 classroom teachers. Teachers used DBR-SC to complete ratings of 5 student behaviors within the general education setting on a daily basis across approximately 5 months. During this time, each student was assigned to 1 of 2 intervention conditions, including the Social Competence Intervention-Adolescent (SCI-A) and a business-as-usual (BAU) intervention. Ratings were collected across 3 intervention phases, including pre-, mid-, and postintervention. Results suggested DBR-SC ratings were highly consistent across time within each student, with reliability coefficients predominantly falling in the .80 and .90 ranges. Findings further indicated such levels of reliability could be achieved with only a small number of ratings, with estimates varying between 2 and 10 data points. Group comparison analyses further suggested the reliability of DBR-SC ratings increased over time, such that student behavior became more consistent throughout the intervention period. Furthermore, analyses revealed that for 2 of the 5 DBR-SC behavior targets, the increase in reliability over time was moderated by intervention grouping, with students receiving SCI-A demonstrating greater increases in reliability relative to those in the BAU group. Limitations of the investigation as well as directions for future research are discussed herein. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Nascimento-Ferreira, Marcus Vinícius; De Moraes, Augusto César Ferreira; Toazza-Oliveira, Paulo Vinícius; Forjaz, Claudia L M; Aristizabal, Juan Carlos; Santaliesra-Pasías, Alba M; Lepera, Candela; Nascimento-Junior, Walter Viana; Skapino, Estela; Delgado, Carlos Alberto; Moreno, Luis Alberto; Carvalho, Heráclito Barbosa
2018-03-01
The objective of this article is to test the reliability and validity of the new and innovative physical activity (PA) questionnaire. Subsamples from the South American Youth/Child Cardiovascular and Environment Study (SAYCARE) study were included to examine its reliability (children: n = 161; adolescents: n = 177) and validity (children: n = 82; adolescents: n = 60). The questionnaire consists of three dimensions of PA (leisure, active commuting, and school) performed during the last week. To assess its validity, the subjects wore accelerometers for at least 3 days and 8 h/d (at least one weekend day). The reliability was analyzed by correlation coefficients. In addition, Bland-Altman analysis and a multilevel regression were applied to estimate the measurement bias, limits of agreement, and influence of contextual variables. In children, the questionnaire showed consistent reliability (ρ = 0.56) and moderate validity (ρ = 0.46), and the contextual variable variance explained 43.0% with -22.9 min/d bias. In adolescents, the reliability was higher (ρ = 0.76) and the validity was almost excellent (ρ = 0.88), with 66.7% of the variance explained by city level with 16.0 min/d PA bias. The SAYCARE PA questionnaire shows acceptable (in children) to strong (in adolescents) reliability and strong validity in the measurement of PA in the pediatric population from low- to middle-income countries. © 2018 The Obesity Society.
Zhao, Gao-feng; Zhang, Qiang; Pang, Yan; Ren, Zheng-jia; Peng, Dan; Jiang, Guo-guo; Liu, Shan-ming; Chen, Ying; Geng, Ting; Zhang, Shu-sen; Yang, Yan-chun; Deng, Hong
2009-11-01
To explore the reliability and validity of the Children's Impact of Event Scale (Chinese version, CRIES-13) and to determine the value and the optimal cutoff point of the score of CRIES-13 in screening posttraumatic stress disorder (PTSD), so as to provide evidence for PTSD prevention and identify children at risk in Wenchuan earthquake areas. A total of 253 children experienced the Wenchuan earthquake were tested through Stratified random cluster sampling. The authors examined CRIES-13's internal consistency, discriminative validity and predictive value of the cut-off. PTSD was assessed with the DSM-IV criteria. Area under the curve while sensitivity, specificity and Youden index were computed based on the receiver operating characteristic curve analysis. Optimal cutoff point was determined by the maximum of Youden index. 20.9% of the subjects were found to have met the DSM-IV criteria for PTSD 7 months after the Wenchuan earthquake accident. The Cronbach's coefficient of CRIES-13 was 0.903 and the mean inter-item correlation coefficients ranged from 0.283 to 0.689, the correlation coefficient of the three factors with the total scale scores ranged from 0.836 to 0.868 while the correlation coefficient among the three factors ranged from 0.568 to 0.718, PTSD cases indicated much higher scores than non-PTSD cases, the Youden index reached maximum value when the total score approached 18 in CRIES-13 with sensitivity and specificity as 81.1% and 76.5% respectively. Consistency check showed that there were no significant differences between the results of CRIES-13 score >/= 32 and clinical diagnosis (Kappa = 0.529) from the screening program. CRIES-13 appeared to be a reliable and valid measure for assessing the posttraumatic stress symptoms among children after the earthquake accident in the Wenchuan area. The CRIES-13 seemed to be a useful self-rating diagnostic instrument for survivors with PTSD symptoms as a clinical concern by using a 18 cut-off in total score. Consistency check showed that there was no significant difference between the screening result of CRIES-13 score >/= 32 and clinical diagnosis.
Validity and reliability of a new ankle dorsiflexion measurement device.
Gatt, Alfred; Chockalingam, Nachiappan
2013-08-01
The assessment of the maximum ankle dorsiflexion angle is an important clinical examination procedure. Evidence shows that the traditional goniometer is highly unreliable, and various designs of goniometers to measure the maximum ankle dorsiflexion angle rely on the application of a known force to obtain reliable results. Hence, an innovative ankle dorsiflexion measurement device was designed to make this measurement more reliable by holding the foot in a selected posture without the application of a known moment. To report on the comprehensive validity and reliability testing carried out on the new device. Following validity testing, four different trials to test reliability of the ankle dorsiflexion measurement device were performed. These trials included inter-rater and intra-rater testings with a controlled moment, intra-rater reliability testing with knees flexed and extended without a controlled moment, intra-rater testing with a patient population, and inter-rater reliability testing between four raters of varying experience without controlling moment. All raters were blinded. A series of trials to test intra-rater and inter-rater reliabilities. Intra-rater reliability intraclass correlation coefficient was 0.98 and inter-rater reliability intraclass correlation coefficient (2,1) was 0.953 with a controlled moment. With uncontrolled moment, very high reliability for intra-tester was also achieved (intraclass correlation coefficient = 0.94 with knees extended and intraclass correlation coefficient = 0.95 with knees flexed). For the trial investigating test-retest reliability with actual patients, intraclass correlation coefficient of 0.99 was obtained. In the trial investigating four different raters with uncontrolled moment, intraclass correlation coefficient of 0.91 was achieved. The new ankle dorsiflexion measurement device is a valid and reliable device for measuring ankle dorsiflexion in both healthy subjects and patients, with both controlled and uncontrolled moments, even by multiple raters of varying experience when the foot is dorsiflexed to its end of range of motion. An ankle dorsiflexion measuring device has been designed to increase the reliability of ankle dorsiflexion measurement and replace the traditional goniometer. While the majority of similar devices rely on application of a known moment to perform this measurement, it has been shown that this is not required with the new ankle dorsiflexion measurement device and, rather, foot posture should be taken into consideration as this affects the maximum ankle dorsiflexion angle.
Gale, T C E; Roberts, M J; Sice, P J; Langton, J A; Patterson, F C; Carr, A S; Anderson, I R; Lam, W H; Davies, P R F
2010-11-01
Assessment centres are an accepted method of recruitment in industry and are gaining popularity within medicine. We describe the development and validation of a selection centre for recruitment to speciality training in anaesthesia based on an assessment centre model incorporating the rating of candidate's non-technical skills. Expert consensus identified non-technical skills suitable for assessment at the point of selection. Four stations-structured interview, portfolio review, presentation, and simulation-were developed, the latter two being realistic scenarios of work-related tasks. Evaluation of the selection centre focused on applicant and assessor feedback ratings, inter-rater agreement, and internal consistency reliability coefficients. Predictive validity was sought via correlations of selection centre scores with subsequent workplace-based ratings of appointed trainees. Two hundred and twenty-four candidates were assessed over two consecutive annual recruitment rounds; 68 were appointed and followed up during training. Candidates and assessors demonstrated strong approval of the selection centre with more than 70% of ratings 'good' or 'excellent'. Mean inter-rater agreement coefficients ranged from 0.62 to 0.77 and internal consistency reliability of the selection centre score was high (Cronbach's α=0.88-0.91). The overall selection centre score was a good predictor of workplace performance during the first year of appointment. An assessment centre model based on the rating of non-technical skills can produce a reliable and valid selection tool for recruitment to speciality training in anaesthesia. Early results on predictive validity are encouraging and justify further development and evaluation.
Khalili, Robabe; Sirati Nir, Masoud; Ebadi, Abbas; Tavallai, Abbas; Habibi, Mehdi
2017-04-01
The Cohen Perceived Stress Scale is being used widely in various countries. The present study evaluated the validity and reliability of the Cohen 10-item Perceived Stress Scale (PSS-10) in assessing tension headache, migraine, and stress-related diseases in Iran. This study is a methodological and cross-sectional descriptive investigation of 100 patients with chronic headache admitted to the pain clinic of Baqiyatallah Educational and Therapeutic Center. Convenience sampling was used for subject selection. PSS psychometric properties were evaluated in two stages. First, the standard scale was translated. Then, the face validity, content, and construct of the translated version were determined. The average age of participants was 38 years with a standard deviation (SD) of 13.2. As for stress levels, 12% were within the normal range, 36% had an intermediate level, and 52% had a high level of stress. The face validity and scale content were remarkable, and the KMO coefficient was 0.82. Bartlett's test yielded 0.327 which was statistically significant (p<0.0001) representing the quality of the sample. In factor analysis of the scale, the two elements of "coping" and "distress" were determined. A Cronbach's Alpha coefficient of 0.72 was obtained. This confirmed the remarkable internal consistency and stability of the scale through repeated measure tests (0.93). The Persian PSS-10 has good internal consistency and reliability. The availability of a validated Persian PSS-10 would indicate a link between stress and chronic headache. Copyright © 2017 Elsevier B.V. All rights reserved.
Pourmomeny, Abbas Ali; Ghanei, Behnaz; Alizadeh, Farshid
2018-05-01
Assessment instruments are essential for research, allowing diagnosis and evaluating treatment outcomes in subjects with lower urinary tract disorders of both genders. The purpose of this study was to translate the Male Lower Urinary Tract Symptoms (MLUTS) Questionnaire and determine its psychometric properties in Persian subjects. After getting permission from the International Consultation on Incontinence Modular Questionnaire (ICIQ) web site, the forward and backward translation of the MLUTS questionnaire were carried out by researcher team. The content/face validity, construct validity and reliability were assessed in a sample of MLUTS Iranian patients by measuring with the Cronbach's alpha test. In total, 121 male patients were included in the study. The mean age of the patients was 60.5 years. Cronbach alpha value was 0.757, consecrated the internal consistency of the form (r > 0.7). The internal consistency of each question was examined separately and found to be over 0.7. For the evaluation of reliability test-retest was done, the test was administered to 20% of the patients for a second time with an interval of 1-2 weeks. The intraclass correlation coefficient (ICC) score was 0.901. The Correlation coefficient between the MLUTS and International Prostate Symptoms Score (IPSS) was 0.879. ICIQ-MLUTS is a robust instrument, which can be used for evaluating male LUTS in Persian patients. We believe that the Persian version of the MLUTS is an important tool for research and clinical setting. © 2017 John Wiley & Sons Australia, Ltd.
Kohara, Aiko; Han, ChangWan; Kwon, HaeJin; Kohzuki, Masahiro
2015-11-01
The improvement of the quality of life (QOL) of children with disabilities has been considered important. Therefore, the Special Needs Education Assessment Tool (SNEAT) was developed based on the concept of QOL to objectively evaluate the educational outcome of children with disabilities. SNEAT consists of 11 items in three domains: physical functioning, mental health, and social functioning. This study aimed to verify the reliability and construct validity of SNEAT using 93 children collected from the classes on independent activities of daily living for children with disabilities in Okinawa Prefecture between October and November 2014. Survey data were collected in a longitudinal prospective cohort study. The reliability of SNEAT was verified via the internal consistency method and the test-pretest method; both the coefficient of Cronbach's α and the intra-class correlation coefficient were over 0.7. The validity of SNEAT was also verified via one-way repeated-measures ANOVA and the latent growth curve model. The scores of all the items and domains and the total scores obtained from one-way repeated-measures ANOVA were the same as the predicted scores. SNEAT is valid based on its goodness-of-fit values obtained using the latent growth curve model, where the values of comparative fit index (0.983) and root mean square error of approximation (0.062) were within the goodness-of-fit range. These results indicate that SNEAT has high reliability and construct validity and may contribute to improve QOL of children with disabilities in the classes on independent activities of daily living for children with disabilities.
Kang, Lin-Ju; Yen, Chia-Feng; Bedell, Gary; Simeonsson, Rune J; Liou, Tsan-Hon; Chi, Wen-Chou; Liu, Shu-Wen; Liao, Hua-Fang; Hwang, Ai-Wen
2015-03-01
Measurement of children's participation and environmental factors is a key component of the assessment in the new Disability Evaluation System (DES) in Taiwan. The Child and Adolescent Scale of Environment (CASE) was translated into Traditional Chinese (CASE-C) and used for assessing environmental factors affecting the participation of children and youth with disabilities in the DES. The aim of this study was to validate the CASE-C. Participants were 614 children and youth aged 6.0-17.9 years with disabilities, with the largest condition group comprised of children with intellectual disability (61%). Internal structure, internal consistency, test-retest reliability, convergent validity, and discriminant (known group) validity were examined using exploratory factor analyses, Cronbach's α coefficient, intra-class correlation coefficients (ICC), correlation analyses, and univariate ANOVAs. A three-factor structure (Family/Community Resources, Assistance/Attitude Supports, and Physical Design Access) of the CASE-C was produced with 38% variance explained. The CASE-C had adequate internal consistency (Cronbach's α=.74-.86) and test-retest reliability (ICCs=.73-.90). Children and youth with disabilities who had higher levels of severity of impairment encountered more environmental barriers and those experiencing more environmental problems also had greater restrictions in participation. The CASE-C scores were found to distinguish children on the basis of disability condition and impairment severity, but not on the basis of age or sex. The CASE-C is valid for assessing environmental problems experienced by children and youth with disabilities in Taiwan. Copyright © 2014 Elsevier Ltd. All rights reserved.
A Flexible Latent Class Approach to Estimating Test-Score Reliability
ERIC Educational Resources Information Center
van der Palm, Daniël W.; van der Ark, L. Andries; Sijtsma, Klaas
2014-01-01
The latent class reliability coefficient (LCRC) is improved by using the divisive latent class model instead of the unrestricted latent class model. This results in the divisive latent class reliability coefficient (DLCRC), which unlike LCRC avoids making subjective decisions about the best solution and thus avoids judgment error. A computational…
ERIC Educational Resources Information Center
Gadermann, Anne M.; Guhn, Martin; Zumbo, Bruno D.
2012-01-01
This paper provides a conceptual, empirical, and practical guide for estimating ordinal reliability coefficients for ordinal item response data (also referred to as Likert, Likert-type, ordered categorical, or rating scale item responses). Conventionally, reliability coefficients, such as Cronbach's alpha, are calculated using a Pearson…
Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients
ERIC Educational Resources Information Center
Andersson, Björn; Xin, Tao
2018-01-01
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Bataclan, Rommel P; Dial, Ma Antonietta D
2009-10-01
Chronic kidney disease is the 10th leading cause of death among Filipinos. Those with chronic kidney disease are exposed to stressors which effect their daily lives. Therefore, assessment of health-related quality of life is important in these patients. The objective of the present study was to translate the Kidney Disease Quality of Life--Short Form version 1.3 (KDQOL-SF ver. 1.3) into Filipino and measure its validity and reliability. Translation and cultural adaptation began with two translations into Filipino, with reconciliation of the forward translators. Pretesting with 10 renal patients, review by experts (nephrologist, translator and dialysis nurse) and back-translation was also done. The final questionnaire was administered to 80 patients with chronic renal disease undergoing haemodialysis for at least 3 months, who could understand Filipino, and were without life-threatening or terminal conditions at the time of the test. A convenience sample of 30 patients from the group had a repeat test 10-14 days after to determine test-retest reliability. Test-retest reliability was assessed by intraclass correlation coefficient and internal consistency reliability was measured by determining the Cronbach's alpha value. Validity was measured using Pearson's correlation between the overall health rating scale and the items from the questionnaire. All of the items showed good test-retest reliability (intraclass correlation coefficient >0.40), ranging from 0.58 (social interaction) to 0.98 (role--emotional). Internal consistency reliability values were acceptable, with Cronbach's alpha ranging from 0.60 (cognitive function) to 0.80 (physical functioning and role--physical). Regarding construct validity, overall health rating in kidney disease-targeted scales was significantly correlated with symptoms/problems, effects of kidney disease and burden of kidney disease. All items in the SF 36 scales had significant correlation with overall health rating (P < 0.05) except for role--emotional. The Filipino version of the Kidney Disease Quality of Life--Short Form can be used to evaluate the health-related quality of life of Filipinos with chronic renal disease on haemodialysis.
Validity and reliability of the Diagnostic Adaptive Behaviour Scale.
Tassé, M J; Schalock, R L; Balboni, G; Spreat, S; Navas, P
2016-01-01
The Diagnostic Adaptive Behaviour Scale (DABS) is a new standardised adaptive behaviour measure that provides information for evaluating limitations in adaptive behaviour for the purpose of determining a diagnosis of intellectual disability. This article presents validity evidence and reliability data for the DABS. Validity evidence was based on comparing DABS scores with scores obtained on the Vineland Adaptive Behaviour Scale, second edition. The stability of the test scores was measured using a test and retest, and inter-rater reliability was assessed by computing the inter-respondent concordance. The DABS convergent validity coefficients ranged from 0.70 to 0.84, while the test-retest reliability coefficients ranged from 0.78 to 0.95, and the inter-rater concordance as measured by intraclass correlation coefficients ranged from 0.61 to 0.87. All obtained validity and reliability indicators were strong and comparable with the validity and reliability coefficients of the most commonly used adaptive behaviour instruments. These results and the advantages of the DABS for clinician and researcher use are discussed. © 2015 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Bond, Frank W; Hayes, Steven C; Baer, Ruth A; Carpenter, Kenneth M; Guenole, Nigel; Orcutt, Holly K; Waltz, Tom; Zettle, Robert D
2011-12-01
The present research describes the development and psychometric evaluation of a second version of the Acceptance and Action Questionnaire (AAQ-II), which assesses the construct referred to as, variously, acceptance, experiential avoidance, and psychological inflexibility. Results from 2,816 participants across six samples indicate the satisfactory structure, reliability, and validity of this measure. For example, the mean alpha coefficient is .84 (.78-.88), and the 3- and 12-month test-retest reliability is .81 and .79, respectively. Results indicate that AAQ-II scores concurrently, longitudinally, and incrementally predict a range of outcomes, from mental health to work absence rates, that are consistent with its underlying theory. The AAQ-II also demonstrates appropriate discriminant validity. The AAQ-II appears to measure the same concept as the AAQ-I (r=.97) but with better psychometric consistency. Copyright © 2011. Published by Elsevier Ltd.
González Diaz, Sandra Nora; Cruz, Alfredo Arias; González González, Arya Yannel; Félix Berumen, José Alfredo; Weinmann, Alejandra Macías
2010-01-01
asthma is one of the most common chronic childhood diseases; is increasing in prevalence and an important cause of school absenteeism. Previous studies have failed to evaluate knowledge about asthma among elementary school teachers worldwide because of the lack of validated questionnaires. to validate a questionnaire about asthma knowledge for elementary school teachers in Monterrey, Nuevo Leon. an observational, cross sectional, descriptive study, from February to December 2004, by applying a questionnaire to a group of elementary school teachers in Monterrey, Nuevo Leon. The questionnaire is a translation and adaptation to the questionnaire of 13 questions used to assess the knowledge about asthma among parents, according to the National Asthma Education Program of US. a total of 179 questionnaires were applied, in which 6 of the 13 questions were answered correctly by more than 90% of the teachers. The internal consistency reliability was adequate with a Cronbach a coefficient of 0.75. in order to obtain reliable data using questionnaires, these must undergo a validation process. Our questionnaire got validation because of the reliability shown according to the internal consistency analysis.
A Protocol for Advanced Psychometric Assessment of Surveys
Squires, Janet E.; Hayduk, Leslie; Hutchinson, Alison M.; Cranley, Lisa A.; Gierl, Mark; Cummings, Greta G.; Norton, Peter G.; Estabrooks, Carole A.
2013-01-01
Background and Purpose. In this paper, we present a protocol for advanced psychometric assessments of surveys based on the Standards for Educational and Psychological Testing. We use the Alberta Context Tool (ACT) as an exemplar survey to which this protocol can be applied. Methods. Data mapping, acceptability, reliability, and validity are addressed. Acceptability is assessed with missing data frequencies and the time required to complete the survey. Reliability is assessed with internal consistency coefficients and information functions. A unitary approach to validity consisting of accumulating evidence based on instrument content, response processes, internal structure, and relations to other variables is taken. We also address assessing performance of survey data when aggregated to higher levels (e.g., nursing unit). Discussion. In this paper we present a protocol for advanced psychometric assessment of survey data using the Alberta Context Tool (ACT) as an exemplar survey; application of the protocol to the ACT survey is underway. Psychometric assessment of any survey is essential to obtaining reliable and valid research findings. This protocol can be adapted for use with any nursing survey. PMID:23401759
ERIC Educational Resources Information Center
Barnette, J. Jackson
2005-01-01
An Excel program developed to assist researchers in the determination and presentation of confidence intervals around commonly used score reliability coefficients is described. The software includes programs to determine confidence intervals for Cronbachs alpha, Pearson r-based coefficients such as those used in test-retest and alternate forms…
Diagnosing cysts with correlation coefficient images from 2-dimensional freehand elastography.
Booi, Rebecca C; Carson, Paul L; O'Donnell, Matthew; Richards, Michael S; Rubin, Jonathan M
2007-09-01
We compared the diagnostic potential of using correlation coefficient images versus elastograms from 2-dimensional (2D) freehand elastography to characterize breast cysts. In this preliminary study, which was approved by the Institutional Review Board and compliant with the Health Insurance Portability and Accountability Act, we imaged 4 consecutive human subjects (4 cysts, 1 biopsy-verified benign breast parenchyma) with freehand 2D elastography. Data were processed offline with conventional 2D phase-sensitive speckle-tracking algorithms. The correlation coefficient in the cyst and surrounding tissue was calculated, and appearances of the cysts in the correlation coefficient images and elastograms were compared. The correlation coefficient in the cysts was considerably lower (14%-37%) than in the surrounding tissue because of the lack of sufficient speckle in the cysts, as well as the prominence of random noise, reverberations, and clutter, which decorrelated quickly. Thus, the cysts were visible in all correlation coefficient images. In contrast, the elastograms associated with these cysts each had different elastographic patterns. The solid mass in this study did not have the same high decorrelation rate as the cysts, having a correlation coefficient only 2.1% lower than that of surrounding tissue. Correlation coefficient images may produce a more direct, reliable, and consistent method for characterizing cysts than elastograms.
Mieritz, Rune M; Bronfort, Gert; Jakobsen, Markus D; Aagaard, Per; Hartvigsen, Jan
2014-09-01
A basic premise for any instrument measuring spinal motion is that reliable outcomes can be obtained on a relevant sample under standardized conditions. The purpose of this study was to assess the overall reliability and measurement error of regional spinal sagittal plane motion in patients with chronic low back pain (LBP), and then to evaluate the influence of body mass index, examiner, gender, stability of pain, and pain distribution on reliability and measurement error. This study comprises a test-retest design separated by 7 to 14 days. The patient cohort consisted of 220 individuals with chronic LBP. Kinematics of the lumbar spine were sampled during standardized spinal extension-flexion testing using a 6-df instrumented spatial linkage system. Test-retest reliability and measurement error were evaluated using interclass correlation coefficients (ICC(1,1)) and Bland-Altman limits of agreement (LOAs). The overall test-retest reliability (ICC(1,1)) for various motion parameters ranged from 0.51 to 0.70, and relatively wide LOAs were observed for all parameters. Reliability measures in patient subgroups (ICC(1,1)) ranged between 0.34 and 0.77. In general, greater (ICC(1,1)) coefficients and smaller LOAs were found in subgroups with patients examined by the same examiner, patients with a stable pain level, patients with a body mass index less than below 30 kg/m(2), patients who were men, and patients in the Quebec Task Force classifications Group 1. This study shows that sagittal plane kinematic data from patients with chronic LBP may be sufficiently reliable in measurements of groups of patients. However, because of the large LOAs, this test procedure appears unusable at the individual patient level. Furthermore, reliability and measurement error varies substantially among subgroups of patients. Copyright © 2014 Elsevier Inc. All rights reserved.
Kanehara, Akiko; Kotake, Risa; Miyamoto, Yuki; Kumakura, Yousuke; Morita, Kentaro; Ishiura, Tomoko; Shimizu, Kimiko; Fujieda, Yumiko; Ando, Shuntaro; Kondo, Shinsuke; Kasai, Kiyoto
2017-11-07
Personal recovery is increasingly recognised as an important outcome measure in mental health services. This study aimed to develop a Japanese version of the Questionnaire about the Process of Recovery (QPR-J) and test its validity and reliability. The study comprised two stages that employed the cross-sectional and prospective cohort designs, respectively. We translated the questionnaire using a standard translation/back-translation method. Convergent validity was examined by calculating Pearson's correlation coefficients with scores on the Recovery Assessment Scale (RAS) and the Short-Form-8 Health Survey (SF-8). An exploratory factor analysis (EFA) was conducted to examine factorial validity. We used intraclass correlation and Cronbach's alpha to examine the test-retest and internal consistency reliability of the QPR-J's 22-item full scale, 17-item intrapersonal and 5-item interpersonal subscales. We conducted an EFA along with a confirmatory factor analysis (CFA). Data were obtained from 197 users of mental health services (mean age: 42.0 years; 61.9% female; 49.2% diagnosed with schizophrenia). The QPR-J showed adequate convergent validity, exhibiting significant, positive correlations with the RAS and SF-8 scores. The QPR-J's full version, subscales, showed excellent test-retest and internal consistency reliability, with the exception of acceptable but relatively low internal consistency reliability for the interpersonal subscale. Based on the results of the CFA and EFA, we adopted the factor structure extracted from the original 2-factor model based on the present CFA. The QPR-J is an adequately valid and reliable measure of the process of recovery among Japanese users with mental health services.
Reliability of Hypernasality Rating: Comparison of 3 Different Methods for Perceptual Assessment.
Yamashita, Renata Paciello; Borg, Elisabet; Granqvist, Svante; Lohmander, Anette
2018-01-01
To compare reliability in auditory-perceptual assessment of hypernasality for 3 different methods and to explore the influence of language background. Comparative methodological study. Participants and Materials: Audio recordings of 5-year-old Swedish-speaking children with repaired cleft lip and palate consisting of 73 stimuli of 9 nonnasal single-word strings in 3 different randomized orders. Four experienced speech-language pathologists (2 native speakers of Brazilian-Portuguese and 2 native speakers of Swedish) participated as listeners. After individual training, each listener performed the hypernasality rating task. Each order of stimuli was analyzed individually using the 2-step, VISOR and Borg centiMax scale methods. Comparison of intra- and inter-rater reliability, and consistency for each method within language of the listener and between listener languages (Swedish and Brazilian-Portuguese). Good to excellent intra-rater reliability was found within each listener for all methods, 2-step: κ = 0.59-0.93; VISOR: intraclass correlation coefficient (ICC) = 0.80-0.99; Borg centiMax (cM) scale: ICC = 0.80-1.00. The highest inter-rater reliability was demonstrated for VISOR (ICC = 0.60-0.90) and Borg cM-scale (ICC = 0.40-0.80). High consistency within each method was found with the highest for the Borg cM scale (ICC = 0.89-0.91). There was a significant difference in the ratings between the Swedish and the Brazilian listeners for all methods. The category-ratio scale Borg cM was considered most reliable in the assessment of hypernasality. Language background of Brazilian-Portuguese listeners influenced the perceptual ratings of hypernasality in Swedish speech samples, despite their experience in perceptual assessment of cleft palate speech disorders.
Shape encoding consistency across colors in primate V4
Bushnell, Brittany N.
2012-01-01
Neurons in primate cortical area V4 are sensitive to the form and color of visual stimuli. To determine whether form selectivity remains consistent across colors, we studied the responses of single V4 neurons in awake monkeys to a set of two-dimensional shapes presented in two different colors. For each neuron, we chose two colors that were visually distinct and that evoked reliable and different responses. Across neurons, the correlation coefficient between responses in the two colors ranged from −0.03 to 0.93 (median 0.54). Neurons with highly consistent shape responses, i.e., high correlation coefficients, showed greater dispersion in their responses to the different shapes, i.e., greater shape selectivity, and also tended to have less eccentric receptive field locations; among shape-selective neurons, shape consistency ranged from 0.16 to 0.93 (median 0.63). Consistency of shape responses was independent of the physical difference between the stimulus colors used and the strength of neuronal color tuning. Finally, we found that our measurement of shape response consistency was strongly influenced by the number of stimulus repeats: consistency estimates based on fewer than 10 repeats were substantially underestimated. In conclusion, our results suggest that neurons that are likely to contribute to shape perception and discrimination exhibit shape responses that are largely consistent across colors, facilitating the use of simpler algorithms for decoding shape information from V4 neuronal populations. PMID:22673324
ERIC Educational Resources Information Center
Woodruff, David; Wu, Yi-Fang
2012-01-01
The purpose of this paper is to illustrate alpha's robustness and usefulness, using actual and simulated educational test data. The sampling properties of alpha are compared with the sampling properties of several other reliability coefficients: Guttman's lambda[subscript 2], lambda[subscript 4], and lambda[subscript 6]; test-retest reliability;…
Chiba, Rie; Umeda, Maki; Goto, Kyohei; Miyamoto, Yuki; Yamaguchi, Sosei; Kawakami, Norito
2017-01-01
The Recovery Knowledge Inventory (RKI) is one of the influential scales to assess knowledge and attitude toward recovery-oriented practices among mental health service providers. In the present study, we aimed to develop a Japanese version of RKI and examine the validity and reliability. We translated RKI into Japanese by reference to the guidelines for translating and adapting psychometric scales. A cross-sectional questionnaire survey was conducted with mental health service providers. Of a total of 475 eligible professionals, we used data from the 299 participants without missing value for the analyses (valid response rate = 62.9%). The questionnaire included Japanese RKI, Recovery Attitudes Questionnaire, The positive attitudes scale, and Japanese-language version of the Social Distance Scale. To examine the factorial validity of RKI, explanatory factor analysis and confirmatory factor analysis was employed. Convergent validity was assessed by calculating Pearson's correlation coefficients between the total RKI score and the scores for the other three scales. We also calculated Cronbach's α coefficients for the total score and for each domain of RKI to assess internal consistency reliability. The participants' mean age was 40.4 years and 30.4% were men. 20-item RKI did not provide any adequate or interpretable factor solutions at any number of factors by EFAs. Thus four items (#1, 4, 5, and 13) were subsequently eliminated in stages, then 16-item RKI was employed as a consequence for further analyses. EFA with four factor structures yielded marginally interpretable constitution. Each factor represented the knowledge regarding psychiatric symptoms and recovery; knowledge about the recovery process; the understanding of what is important for recovery; and the understanding of the challenges and responsibility in recovery, respectively. Subsequent CFA suggested good fit to the data. Good convergent validity and understandable internal consistency reliability were also observed. The Japanese 16-item RKI revealed reasonable factorial validity, good convergent validity, and understandable internal consistency reliability among mental health professionals. Japanese cultural settings seemed to influence the four-factor structure in the present study. It can be used for future study in Japan, while future large-scale research is required to ensure robust verification.
Leite, Priscilla Lourenço; Filomensky, Tatiana Zambrano; Black, Donald W; Silva, Adriana Cardoso
2014-08-01
The Yale-Brown Obsessive Compulsive Scale-Shopping Version (YBOCS-SV) is considered the gold standard in the assessment of shopping severity. It is designed to assess cognitions and behaviors relating to compulsive buying behavior. The present study aims to assess the validity of the Brazilian version of this scale. For the study, composed the sample 610 participants: 588 subjects of a general population and 22 compulsive buyers. Factorial analysis was performed to assess the relations and the correlation between the YBOCS-SV, the Compulsive Buying Scale (CBS), and Richmond Compulsive Buying Scale (RCBS), was assessed using Pearson coefficient, for study of convergent and divergent validity. Cronbach's alpha coefficients were used to assess internal consistency. The results show good to excellent psychometric parameters for the YBOCS-SV in its Brazilian version. With regard to correlations, the YBOCS-SV is inversely and proportionally correlated with CBS and the RCBS, indicating that the YBOCS-SV is an excellent instrument for screening compulsive buying. The YBOCS-SV presented high alpha coefficient of Cronbach's alpha (0.92), demonstrating good reliability. The Brazilian version of the YBOCS-SV is indicated to diagnose compulsive buying disorder, and likely use for the purposes intended in the Brazilian population. Copyright © 2014 Elsevier Inc. All rights reserved.
Comparison of three instruments for measuring patient anxiety in a coronary care unit.
Elliott, D
1993-09-01
This paper compares the State-Trait Anxiety Inventory (STAI), Hospital Anxiety and Depression Scale (HAD Scale) and a Linear Analogue Anxiety Scale (LAAS) for evaluating anxiety in patients with acute ischaemic heart disease. The instruments were examined for correlation, reliability and internal consistency. Strong associations were demonstrated at pre-test between the STAI and the other scales. Moderate coefficients between HAD-A and HAD-D/LAAS were also apparent. Lower correlations were found at post-test than at pre-test. At post-test, strong inter-correlations occurred for STAI/LAAS. The HAD Scale demonstrated high test-retest reliability, while the STAI and LAAS were moderate in their reliability in this sample. The adequate correlation between the instruments suggest that each is a valid and appropriate measure of anxiety in this clinical sample.
Brennan, Sue E; McKenzie, Joanne E; Turner, Tari; Redman, Sally; Makkar, Steve; Williamson, Anna; Haynes, Abby; Green, Sally E
2017-01-17
Capacity building strategies are widely used to increase the use of research in policy development. However, a lack of well-validated measures for policy contexts has hampered efforts to identify priorities for capacity building and to evaluate the impact of strategies. We aimed to address this gap by developing SEER (Seeking, Engaging with and Evaluating Research), a self-report measure of individual policymakers' capacity to engage with and use research. We used the SPIRIT Action Framework to identify pertinent domains and guide development of items for measuring each domain. Scales covered (1) individual capacity to use research (confidence in using research, value placed on research, individual perceptions of the value their organisation places on research, supporting tools and systems), (2) actions taken to engage with research and researchers, and (3) use of research to inform policy (extent and type of research use). A sample of policymakers engaged in health policy development provided data to examine scale reliability (internal consistency, test-retest) and validity (relation to measures of similar concepts, relation to a measure of intention to use research, internal structure of the individual capacity scales). Response rates were 55% (150/272 people, 12 agencies) for the validity and internal consistency analyses, and 54% (57/105 people, 9 agencies) for test-retest reliability. The individual capacity scales demonstrated adequate internal consistency reliability (alpha coefficients > 0.7, all four scales) and test-retest reliability (intra-class correlation coefficients > 0.7 for three scales and 0.59 for fourth scale). Scores on individual capacity scales converged as predicted with measures of similar concepts (moderate correlations of > 0.4), and confirmatory factor analysis provided evidence that the scales measured related but distinct concepts. Items in each of these four scales related as predicted to concepts in the measurement model derived from the SPIRIT Action Framework. Evidence about the reliability and validity of the research engagement actions and research use scales was equivocal. Initial testing of SEER suggests that the four individual capacity scales may be used in policy settings to examine current capacity and identify areas for capacity building. The relation between capacity, research engagement actions and research use requires further investigation.
Kim, Eun-Mi; Kim, Sun-Aee; Lee, Ju-Ry; Burlison, Jonathan D; Oh, Eui Geum
2018-02-13
"Second victims" are defined as healthcare professionals whose wellness is influenced by adverse clinical events. The Second Victim Experience and Support Tool (SVEST) was used to measure the second-victim experience and quality of support resources. Although the reliability and validity of the original SVEST have been validated, those for the Korean tool have not been validated. The aim of the study was to evaluate the psychometric properties of the Korean version of the SVEST. The study included 305 clinical nurses as participants. The SVEST was translated into Korean via back translation. Content validity was assessed by seven experts, and test-retest reliability was evaluated by 30 clinicians. Internal consistency and construct validity were assessed via confirmatory factor analysis. The analyses were performed using SPSS 23.0 and STATA 13.0 software. The content validity index value demonstrated validity; item- and scale-level content validity index values were both 0.95. Test-retest reliability and internal consistency reliability were satisfactory: the intraclass consistent coefficient was 0.71, and Cronbach α values ranged from 0.59 to 0.87. The CFA showed a significantly good fit for an eight-factor structure (χ = 578.21, df = 303, comparative fit index = 0.92, Tucker-Lewis index = 0.90, root mean square error of approximation = 0.05). The K-SVEST demonstrated good psychometric properties and adequate validity and reliability. The results showed that the Korean version of SVEST demonstrated the extent of second victimhood and support resources in Korean healthcare workers and could aid in the development of support programs and evaluation of their effectiveness.
Extensive validation of the pain disability index in 3 groups of patients with musculoskeletal pain.
Soer, Remko; Köke, Albère J A; Vroomen, Patrick C A J; Stegeman, Patrick; Smeets, Rob J E M; Coppes, Maarten H; Reneman, Michiel F
2013-04-20
A cross-sectional study design was performed. To validate the pain disability index (PDI) extensively in 3 groups of patients with musculoskeletal pain. The PDI is a widely used and studied instrument for disability related to various pain syndromes, although there is conflicting evidence concerning factor structure, test-retest reliability, and missing items. Additionally, an official translation of the Dutch language version has never been performed. For reliability, internal consistency, factor structure, test-retest reliability and measurement error were calculated. Validity was tested with hypothesized correlations with pain intensity, kinesiophobia, Rand-36 subscales, Depression, Roland-Morris Disability Questionnaire, Quality of Life, and Work Status. Structural validity was tested with independent backward translation and approval from the original authors. One hundred seventy-eight patients with acute back pain, 425 patients with chronic low back pain and 365 with widespread pain were included. Internal consistency of the PDI was good. One factor was identified with factor analyses. Test-retest reliability was good for the PDI (intraclass correlation coefficient, 0.76). Standard error of measurement was 6.5 points and smallest detectable change was 17.9 points. Little correlations between the PDI were observed with kinesiophobia and depression, fair correlations with pain intensity, work status, and vitality and moderate correlations with the Rand-36 subscales and the Roland-Morris Disability Questionnaire. The PDI-Dutch language version is internally consistent as a 1-factor structure, and test-retest reliable. Missing items seem high in sexual and professional items. Using the PDI as a 2-factor questionnaire has no additional value and is unreliable.
Comparing Interrater reliability between eye examination and eye self-examination 1
de Lima, Maria Alzete; Pagliuca, Lorita Marlena Freitag; do Nascimento, Jennara Cândido; Caetano, Joselany Áfio
2017-01-01
Resume Objective: to compare Interrater reliability concerning two eye assessment methods. Method: quasi-experimental study conducted with 324 college students including eye self-examination and eye assessment performed by the researchers in a public university. Kappa coefficient was used to verify agreement. Results: reliability coefficients between Interraters ranged from 0.85 to 0.95, with statistical significance at 0.05. The exams to check for near acuity and peripheral vision presented a reasonable kappa >0.2. The remaining coefficients were higher, ranging from very to totally reliable. Conclusion: comparatively, the results of both methods were similar. The virtual manual on eye self-examination can be used to screen for eye conditions. PMID:29069269
Reliability of translated measures assessing dating violence among Mexican adolescents.
Hokoda, Audrey; Ramos-Lira, Luciana; Celaya, Patricia; Vilhauer, Keleigh; Angeles, Manuel; Ruíz, Serena; Malcarne, Vanessa L; Mora, Marina Duque
2006-02-01
Research on the prevalence and correlates of dating violence in Mexican teens is challenged by the lack of culturally and linguistically appropriate assessment tools. This study modified, translated, and back-translated the Conflict in Adolescent Dating Relationships Inventory (CADRI; Wolfe et al., 2001) and the Attitudes Towards Dating Violence Scales (Price, Byers, & the Dating Violence Research Team, 1999) for Mexican adolescents. Analyses on 307 adolescents (15-18 years old) from Monterrey and Mexicali, Mexico, revealed that most of the translated CADRI subscales and Attitudes Towards Dating Violence Scales had acceptable internal consistency and test-retest reliability coefficients. The study offers some evidence that the measures may be useful in assessing dating violence in Mexican teens.
Testing of the SEE and OEE post-hip fracture.
Resnick, Barbara; Orwig, Denise; Zimmerman, Sheryl; Hawkes, William; Golden, Justine; Werner-Bronzert, Michelle; Magaziner, Jay
2006-08-01
The purpose of this study was to test the reliability and validity of the Self-Efficacy for Exercise (SEE) and the Outcome Expectations for Exercise (OEE) scales in a sample of 166 older women post-hip fracture. There was some evidence of validity of the SEE and OEE based on confirmatory factor analysis and Rasch model testing, criterion based and convergent validity, and evidence of internal consistency based on alpha coefficients and separation indices and reliability based on R2 estimates. Rasch model testing demonstrated that some items had high variability. Based on these findings suggestions are made for how items could be revised and the scales improved for future use.
Translating and validating a Training Needs Assessment tool into Greek
Markaki, Adelais; Antonakis, Nikos; Hicks, Carolyn M; Lionis, Christos
2007-01-01
Background The translation and cultural adaptation of widely accepted, psychometrically tested tools is regarded as an essential component of effective human resource management in the primary care arena. The Training Needs Assessment (TNA) is a widely used, valid instrument, designed to measure professional development needs of health care professionals, especially in primary health care. This study aims to describe the translation, adaptation and validation of the TNA questionnaire into Greek language and discuss possibilities of its use in primary care settings. Methods A modified version of the English self-administered questionnaire consisting of 30 items was used. Internationally recommended methodology, mandating forward translation, backward translation, reconciliation and pretesting steps, was followed. Tool validation included assessing item internal consistency, using the alpha coefficient of Cronbach. Reproducibility (test – retest reliability) was measured by the kappa correlation coefficient. Criterion validity was calculated for selected parts of the questionnaire by correlating respondents' research experience with relevant research item scores. An exploratory factor analysis highlighted how the items group together, using a Varimax (oblique) rotation and subsequent Cronbach's alpha assessment. Results The psychometric properties of the Greek version of the TNA questionnaire for nursing staff employed in primary care were good. Internal consistency of the instrument was very good, Cronbach's alpha was found to be 0.985 (p < 0.001) and Kappa coefficient for reproducibility was found to be 0.928 (p < 0.0001). Significant positive correlations were found between respondents' current performance levels on each of the research items and amount of research involvement, indicating good criterion validity in the areas tested. Factor analysis revealed seven factors with eigenvalues of > 1.0, KMO (Kaiser-Meyer-Olkin) measure of sampling adequacy = 0.680 and Bartlett's test of sphericity, p < 0.001. Conclusion The translated and adapted Greek version is comparable with the original English instrument in terms of validity and reliability and it is suitable to assess professional development needs of nursing staff in Greek primary care settings. PMID:17474989
Bahouq, Hanane; Rostom, Samira; Bahiri, Rachid; Hakkou, Jinane; Aissaoui, Nawal; Hajjaj-Hassouni, Najia
2012-12-01
Fatigue is a frequent symptom during ankylosing spondylitis (AS) often under estimated which needs to be measured properly with respect to its intensity by appropriate measures, such as the multidimensional assessment of fatigue (MAF). The aims of this study were to translate into the classic Arabic version of the MAF questionnaire and to validate its use for assessing fatigue in Moroccan patients with AS. The MAF contains 16 items with a global fatigue index (IGF). The MAF was translated and back-translated to arabic, pretested and reviewed by a committee following the Guillemin criteria (J Clin Epidemiol 46:1417-1432, 1993). It was then validate on 110 Moroccan patients with AS. Reliability for the 3-day test-retest was assessed using internal consistency by Cronbach's alpha coefficient and the intra-class correlation coefficient (ICC). External construct validity was assessed by correlation with pain, activity of disease and other keys variable. The reproducibility of the 15 items was satisfactory with a kappa statistics of agreement superior to 0.6. The ICC for IGF score reproducibility was good and reached 0.98 (IC 95%, 0.96-0.99). The internal consistency was at 0.991 with Cronbach's alpha coefficient. The construct validity showed a positive correlation between MAF and the axial (r = 0.34) and peripheral (r = 0.32) visual analogical scale, the Bath ankylosing spondylitis disease activity index (BASDAI) (r = 0.77), the first item of BASDAI (r = 0.85), the functional disability by the Bath ankylosing spondylitis functional index (r = 0.64), the erythrocyte sedimentation rate (r = 0.43) and the C reactive protein (r = 0.30) (for all P < 0.001). There was no statistical correlation between MAF and the other variables. The Arabic version of the MAF has good comprehensibility, internal consistency, reliability and validity for the evaluation of Arabic speaking patients with AS.
The development, validity, and reliability of the Addiction Profile Index (API).
Ögel, Kültegin; Evren, Cüneyt; Karadağ, Figen; Gürol, Defne Tamar
2012-01-01
The objective of this study was to develop a practical questionnaire for multidimensional assessment of problems associated with alcohol and substance abuse that would also be useful for treatment planning. The Addiction Profile Index (API) is a self-report questionnaire consisting of 37 items and the following 5 subscales: characteristics of substance use; dependency diagnosis; the effects of subsance use on the user; craving; motivation to quit using substances. The study included 345 alcohol and/or substance abusers from 2 addiction treatment clinics and a prison addiction service. The validity of the questionnaire was assessed using the Michigan Alcoholism Screening Test (MAST), Readiness to Change Questionnaire (SOCRATES), Penn Alcohol Craving Scale (PACS), Drug Craving Scale (DCS), Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I), and Addiction Severity Index (ASI). The Cronbach's alpha coefficient for the total API was 0.89 and for the subscales it ranged from 0.63 to 0.86. Item-total correlation coefficients ranged from 0.42 to 0.89. The Spearman Brown split-half method coefficient for the total API was 0.83. In all, 4 factors were obtained using explanatory factor analysis that represented 52.3% of the total variance. The API craving subscale was observed to be consistent with PACS and the API motivation subscale was consistent with SOCRATES. The API total score was strongly correlated with the mean MAST score, and the composite ASI medical status, substance use, legal status, and family social relations subscale scores. Based on ROC analyses, the area under curve was 0.90. With a total API cut-off score of 4, the scale's sensitivity and specificity 0.85 was 0.78, respectively. The findings show that the API is a valid and reliable questionnaire that can be used to measure the severity of different dimensions of substance dependency.
Alpern, Rachelle; Canavan, Maureen E; Thompson, Jennifer T; McNatt, Zahirah; Tatek, Dawit; Lindfield, Tessa; Bradley, Elizabeth H
2013-01-01
Ethiopia is one of 57 countries identified by the World Health Report 2006 as having a severely limited number of health care professionals. In recognition of this shortage, the Ethiopian Federal Ministry of Health, through the Ethiopian Hospital Management Initiative, prioritized the need to improve retention of health care workers. Accordingly, we sought to develop the Satisfaction of Employees in Health Care (SEHC) survey for use in hospitals and health centers throughout Ethiopia. Literature reviews and cognitive interviews were used to generate a staff satisfaction survey for use in the Ethiopian healthcare setting. We pretested the survey in each of the six hospitals and four health centers across Ethiopia (98% response rate). We assessed content validity and convergent validity using factor analysis and examined reliability using the Cronbach alpha coefficients to assess internal consistency. The final survey was comprised of 18 questions about specific aspects of an individual's work and two overall staff satisfaction questions. We found support for content validity, as data from the 18 responses factored into three factors, which we characterized as 1) relationship with management and supervisors, 2) job content, and 3) relationships with coworkers. Summary scores for two factors (relationship with management and supervisors and job content) were significantly associated (P-value, <0.001) with the two overall satisfaction items. Cronbach's alpha coefficients showed good to excellent internal consistency (Cronbach alpha coefficients >0.70) for the items in the three summary scores. The introduction of consistent and reliable measures of staff satisfaction is crucial to understand and improve employee retention rates, which threaten the successful achievement of the Millennium Development Goals in low-income countries. The use of the SEHC survey in Ethiopian healthcare facilities has ample leadership support, which is essential for addressing problems that reduce staff satisfaction and exacerbate excessive workforce shortages.
Italian Validation of Homophobia Scale (HS).
Ciocca, Giacomo; Capuano, Nicolina; Tuziak, Bogdan; Mollaioli, Daniele; Limoncin, Erika; Valsecchi, Diana; Carosa, Eleonora; Gravina, Giovanni L; Gianfrilli, Daniele; Lenzi, Andrea; Jannini, Emmanuele A
2015-09-01
The Homophobia Scale (HS) is a valid tool to assess homophobia. This test is self-reporting, composed of 25 items, which assesses a total score and three factors linked to homophobia: behavior/negative affect, affect/behavioral aggression, and negative cognition. The aim of this study was to validate the HS in the Italian context. An Italian translation of the HS was carried out by two bilingual people, after which an English native translated the test back into the English language. A psychologist and sexologist checked the translated items from a clinical point of view. We recruited 100 subjects aged18-65 for the Italian validation of the HS. The Pearson coefficient and Cronbach's α coefficient were performed to test the test-retest reliability and internal consistency. A sociodemographic questionnaire including the main information as age, geographic distribution, partnership status, education, religious orientation, and sex orientation was administrated together with the translated version of HS. The analysis of the internal consistency showed an overall Cronbach's α coefficient of 0.92. In the four domains, the Cronbach's α coefficient was 0.90 in behavior/negative affect, 0.94 in affect/behavioral aggression, and 0.92 in negative cognition, whereas in the total score was 0.86. The test-retest reliability showed the following results: the HS total score was r = 0.93 (P < 0.0001), behavior/negative affect was r = 0.79 (P < 0.0001), affect/behavioral aggression was r = 0.81 (P < 0.0001), and negative cognition was r = 0.75 (P < 0.0001). The Italian validation of the HS revealed the use of this self-report test to have good psychometric properties. This study offers a new tool to assess homophobia. In this regard, the HS can be introduced into the clinical praxis and into programs for the prevention of homophobic behavior.
Italian Validation of Homophobia Scale (HS)
Ciocca, Giacomo; Capuano, Nicolina; Tuziak, Bogdan; Mollaioli, Daniele; Limoncin, Erika; Valsecchi, Diana; Carosa, Eleonora; Gravina, Giovanni L; Gianfrilli, Daniele; Lenzi, Andrea; Jannini, Emmanuele A
2015-01-01
Introduction The Homophobia Scale (HS) is a valid tool to assess homophobia. This test is self-reporting, composed of 25 items, which assesses a total score and three factors linked to homophobia: behavior/negative affect, affect/behavioral aggression, and negative cognition. Aim The aim of this study was to validate the HS in the Italian context. Methods An Italian translation of the HS was carried out by two bilingual people, after which an English native translated the test back into the English language. A psychologist and sexologist checked the translated items from a clinical point of view. We recruited 100 subjects aged18–65 for the Italian validation of the HS. The Pearson coefficient and Cronbach's α coefficient were performed to test the test–retest reliability and internal consistency. Main Outcome Measures A sociodemographic questionnaire including the main information as age, geographic distribution, partnership status, education, religious orientation, and sex orientation was administrated together with the translated version of HS. Results The analysis of the internal consistency showed an overall Cronbach's α coefficient of 0.92. In the four domains, the Cronbach's α coefficient was 0.90 in behavior/negative affect, 0.94 in affect/behavioral aggression, and 0.92 in negative cognition, whereas in the total score was 0.86. The test–retest reliability showed the following results: the HS total score was r = 0.93 (P < 0.0001), behavior/negative affect was r = 0.79 (P < 0.0001), affect/behavioral aggression was r = 0.81 (P < 0.0001), and negative cognition was r = 0.75 (P < 0.0001). Conclusions The Italian validation of the HS revealed the use of this self-report test to have good psychometric properties. This study offers a new tool to assess homophobia. In this regard, the HS can be introduced into the clinical praxis and into programs for the prevention of homophobic behavior. PMID:26468384
Yao, Min; Yang, Long; Cao, Zuo-Yuan; Cheng, Shao-Dan; Tian, Shuang-Lin; Sun, Yue-Li; Wang, Jing; Xu, Bao-Ping; Hu, Xiao-Chun; Wang, Yong-Jun; Zhang, Ying; Cui, Xue-Jun
2017-09-18
Shoulder pain is a common musculoskeletal disorder in Chinese population, which affects more than 1,3 billion individuals. To the best of our knowledge, there has been no available Chinese-language version of measurements of shoulder pain and disability so far. Moreover, the Constant-Murley score (CMS) questionnaire is a universally recognized patient-reported questionnaire for clinical practice and research. The present study was designed to evaluate a Chinese translational version of CMS and subsequently assess its reliability and validity. The Chinese translational version of CMS was formulated by means of forward-backward translation. Meanwhile, a final review was carried out by an expert committee, followed by conducting a test of the pre-final version. Therefore, the reliability and validity of the Chinese translational version of CMS could be assessed using the internal consistency, construct validity, factor analysis, reliability and floor and ceiling effects. Specifically, the reliability was assessed by testing the internal consistency (Cronbach's α) and test-retest reliability (intraclass coefficient correlation [ICC]), while the construct validity was evaluated via comparison between the Chinese translational version of CMS with visual analog scale (VAS) score and the 36-Item Short Form Health Survey (SF-36, Spearman correlation). The questionnaire was verified to be acceptable after distribution among 120 subjects with unilateral shoulder pain. Factor analysis had revealed a two-factor and 10-item solution. Moreover, the assessment results indicated that the Chinese translational version of CMS questionnaire harbored good internal consistency (Cronbach's α = 0.739) and test-retest reliability (ICC = 0.827). In addition, the Chinese translational version of CMS was moderately correlated with VAS score (r = 0.497) and SF-36 (r = 0.135). No obvious floor and ceiling effects were observed in the Chinese translational version of CMS questionnaire. Chinese translational version of CMS exhibited good reliability, which is relatively acceptable and is likely to be widely used in this population.
NASA Technical Reports Server (NTRS)
Rajkumar, T.; Aragon, Cecilia; Bardina, Jorge; Britten, Roy
2002-01-01
A fast, reliable way of predicting aerodynamic coefficients is produced using a neural network optimized by a genetic algorithm. Basic aerodynamic coefficients (e.g. lift, drag, pitching moment) are modelled as functions of angle of attack and Mach number. The neural network is first trained on a relatively rich set of data from wind tunnel tests of numerical simulations to learn an overall model. Most of the aerodynamic parameters can be well-fitted using polynomial functions. A new set of data, which can be relatively sparse, is then supplied to the network to produce a new model consistent with the previous model and the new data. Because the new model interpolates realistically between the sparse test data points, it is suitable for use in piloted simulations. The genetic algorithm is used to choose a neural network architecture to give best results, avoiding over-and under-fitting of the test data.
Validity and reliability of a Dutch version of the Foot and Ankle Ability Measure.
Weel, Hanneke; Zwiers, Ruben; Azim, Donija; Sierevelt, Inger N; Haverkamp, Daniel; van Dijk, C Niek; Kerkhoffs, Gino M M J
2016-04-01
The aim of the study was to develop a Dutch language version of the Foot and Ankle Ability Measure (FAAM) and evaluate its measurement properties according to the consensus-based standards for the selection of health measurement instruments (COSMIN) definitions. A forward-backward translation procedure was performed and subsequently the Dutch version of the FAAM was evaluated for its reliability and validity in 369 patients with a variety of foot and ankle complaints. The reliability was assessed by calculating the intraclass correlation coefficients (ICC, test-retest reliability), Cronbach's alpha (internal consistency), the standard error of measurement and the minimal detectable change (MDC). Additionally, this was done for athletes. The construct validity was assessed by the use of Spearman's correlation coefficient between FAAM domains and similar and contradictory domains of the Foot and Ankle Outcome Score, Short Form 36 and the Numeric Rating Scale for pain. The ICC of the subscales ranged from 0.62 to 0.86. Cronbach's alpha's minimum was 0.97. At individual level, the MDC ranged from 23.9 to 44.7 and at group level from 2.77 to 4.32. In the subgroup of athletes, the reliability was higher. The hypothesized correlations of the construct validity were supported by an 80% confirmation rate. The Dutch version of the FAAM met adequate measurement properties, although the reliability is not optimal. The FAAM-Sport subscale is more useful in athletes and the FAAM-Sport % seems not to contribute. In athletes with various foot and ankle symptoms, the FAAM can be used for functional assessment and follow-up at group level. For the general population, the FAAM is less appropriate. Diagnostic study, Level I.
Fink, A; Gebhard, B; Erdwiens, S; Haddenhorst, L; Nowak, S
2016-09-01
The introduction of the International Classification of Functioning, Disabilities and Health of the World Health Organization in 2001 made social participation a major rehabilitation outcome and the ultimate goal of rehabilitation services. There is no available instrument to measure the youth participation in leisure activities apart from asking the youth themselves. The goal of this study was to present a German version of the Children's Assessment of Participation and Enjoyment and Preferences for Activities of Children (CAPE/PAC). The CAPE/PAC questionnaire was translated into German, a cultural adaptation process was designed and a reliability study was conducted. One hundred and fifty-two youths with and without disabilities, with a mean age of 15.2 years (standard deviation 1.7), participated in the study. The participants completed CAPE and PAC twice within 4 weeks. Reliability was examined by intraclass correlation coefficients, standard error of measurement, smallest detectable change and Cronbach's alpha. The absolute values of participation differ between the typically developed youth group and those with impairments; the reliability of the CAPE/PAC is comparable in both groups. Intraclass correlation coefficients ranged from 0.43 to 0.74 for the CAPE and from 0.71 to 0.83 for the PAC in all participants. The alpha values for internal consistency ranged from 0.42 to 0.82 for the CAPE and from 0.65 to 0.92 for the PAC. The German version of the PAC showed satisfactory reliability; however, reliability was not satisfactory for all scores of the CAPE, but comparable with versions in other languages. The need for newly developed participation measurements requires further discussion. © 2016 John Wiley & Sons Ltd.
Tan, Christine L; Hassali, Mohamed A; Saleem, Fahad; Shafie, Asrul A; Aljadhey, Hisham; Gan, Vincent B
2015-01-01
(i) To develop the Pharmacy Value-Added Services Questionnaire (PVASQ) using emerging themes generated from interviews. (ii) To establish reliability and validity of questionnaire instrument. Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT) to adopt pharmacy value-added services is predicted by attitudes (ATT), subjective norms (SN), perceived behavioral control (PBC), knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25) was assessed by administrating the questionnaire instrument twice at an interval of one week apart. Internal consistency was measured by Cronbach's alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC). Confirmatory Factor Analysis, CFA (N=410) was conducted to assess construct validity of the PVASQ. The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest) reliability was good. The overall Cronbach' s alpha (N=25) is 0.912 and 0.908 for the two time points. The result of CFA (N=410) showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients' intention to adopt pharmacy value-added services to collect partial medicine supply.
Fang, Jin-Bo; Zhou, Chun-Fen; Huang, Jing; Qiu, Chang-Jian
2018-06-01
The Occupational Fatigue Exhaustion/Recovery Scale (OFER) was designed to assess occupational fatigue in nurses. Although the original English version of this instrument has shown high degrees of reliability and validity, a Chinese version of this scale has yet to be verified. The aim of this study was to evaluate the psychometric properties of the OFER in a population of Chinese nurses. The scale was translated using translation and back-translation. The validities and reliabilities were evaluated on 923 qualified participants using content validity index, concurrent validity, factorial validity, internal consistency reliability, and test-retest reliability. The content validity index for the OFER was .92. The correlation coefficients between the scores of the OFER subscales and the criteria in this study (varying from -.498 to .705) verified that the OFER has acceptable concurrent validity. Principal component analysis and confirmatory factor analysis revealed that three factors correspond to the structure of the original instrument and that recovery mediates the relationship between acute and chronic fatigue. The Cronbach's alpha for the chronic fatigue, acute fatigue, and intershift recovery subscales were .83, .85, and .86, respectively. Test-retest reliabilities with correlation coefficients from .61 to .78 were found in the three subscales. OFER is a reliable and valid instrument for assessing work-related fatigue in Chinese nurses. However, further improvement of the acute fatigue subscale is recommended. The OFER has the potential to elicit information that is useful for assessing fatigue in nurses in China. Furthermore, as it differentiates between acute and chronic fatigue, OFER may be an effective tool for guiding the development and implementation of various, related intervention measures.
Validity and reliability of Optojump photoelectric cells for estimating vertical jump height.
Glatthorn, Julia F; Gouge, Sylvain; Nussbaumer, Silvio; Stauffacher, Simone; Impellizzeri, Franco M; Maffiuletti, Nicola A
2011-02-01
Vertical jump is one of the most prevalent acts performed in several sport activities. It is therefore important to ensure that the measurements of vertical jump height made as a part of research or athlete support work have adequate validity and reliability. The aim of this study was to evaluate concurrent validity and reliability of the Optojump photocell system (Microgate, Bolzano, Italy) with force plate measurements for estimating vertical jump height. Twenty subjects were asked to perform maximal squat jumps and countermovement jumps, and flight time-derived jump heights obtained by the force plate were compared with those provided by Optojump, to examine its concurrent (criterion-related) validity (study 1). Twenty other subjects completed the same jump series on 2 different occasions (separated by 1 week), and jump heights of session 1 were compared with session 2, to investigate test-retest reliability of the Optojump system (study 2). Intraclass correlation coefficients (ICCs) for validity were very high (0.997-0.998), even if a systematic difference was consistently observed between force plate and Optojump (-1.06 cm; p < 0.001). Test-retest reliability of the Optojump system was excellent, with ICCs ranging from 0.982 to 0.989, low coefficients of variation (2.7%), and low random errors (±2.81 cm). The Optojump photocell system demonstrated strong concurrent validity and excellent test-retest reliability for the estimation of vertical jump height. We propose the following equation that allows force plate and Optojump results to be used interchangeably: force plate jump height (cm) = 1.02 × Optojump jump height + 0.29. In conclusion, the use of Optojump photoelectric cells is legitimate for field-based assessments of vertical jump height.
Marcin, James P; Romano, Patrick S; Dharmar, Madan; Chamberlain, James M; Dudley, Nanette; Macias, Charles G; Nigrovic, Lise E; Powell, Elizabeth C; Rogers, Alexander J; Sonnett, Meridith; Tzimenatos, Leah; Alpern, Elizabeth R; Andrews-Dickert, Rebecca; Borgialli, Dominic A; Sidney, Erika; Casper, Charlie; Dean, Jonathan Michael; Kuppermann, Nathan
2018-06-01
To evaluate the consistency, reliability, and validity of an implicit review instrument that measures the quality of care provided to children in the emergency department (ED). Medical records of randomly selected children from 12 EDs in the Pediatric Emergency Care Applied Research Network (PECARN). Eight pediatric emergency medicine physicians applied the instrument to 620 medical records. We determined internal consistency using Cronbach's alpha and inter-rater reliability using the intraclass correlation coefficient (ICC). We evaluated the validity of the instrument by correlating scores with four condition-specific explicit review instruments. Individual reviewers' Cronbach's alpha had a mean of 0.85 with a range of 0.76-0.97; overall Cronbach's alpha was 0.90. The ICC was 0.49 for the summary score with a range from 0.40 to 0.46. Correlations between the quality of care score and the four condition-specific explicit review scores ranged from 0.24 to 0.38. The quality of care instrument demonstrated good internal consistency, moderate inter-rater reliability, high inter-rater agreement, and evidence supporting validity. The instrument could be useful for systems' assessment and research in evaluating the care delivered to children in the ED. © Health Research and Educational Trust.
Atilola, Olayinka; Stevanović, Dejan
2014-04-01
Quality of life (QOL) is a universally accepted concept for measuring the impact of different aspects of life on general well-being. Adaptation of existing QOL instruments to local cultures has been identified as a better strategy than development of new ones. To translate and adapt the Paediatric Quality of Life Inventory™ Version 4.0 Generic Core Scales (PedsQL™) to the Yoruba language and culture and to test the psychometric properties of the adapted instrument among adolescents. Psychometric properties including internal consistency reliability, construct and factorial validity of the Yoruba version of PedsQL™ were evaluated using standard procedures. The self report and proxy scales of the Yoruba PedsQL™ were developed with good cultural relevance and semantic/conceptual equivalence. Results from 527 adolescents revealed a Cronbach's coefficient which exceeded 0.7 for internal consistency reliability for all scores. The healthy subjects reported higher PedsQL™ scores than those with mental health and physical problems, which confirmed construct validity. Confirmatory factor analysis revealed a good model fit for the Psychosocial Health score, but not for the other measures. The Yoruba PedsQL™ is culturally appropriate and with good internal consistency, reliability and construct validity. More work is needed regarding its factorial validity.
Loo, Jo Lin; Ang, Yee Kwang; Yim, Hip Seng
2013-01-01
To describe the development and validation of a cancer awareness questionnaire (CAQ) based on a literature review of previous studies, focusing on cancer awareness and prevention. A total of 388 Chinese undergraduate students in a private university in Kuala Lumpur, Malaysia, were recruited to evaluate the developed self-administered questionnaire. The CAQ consisted of four sections: awareness of cancer warning signs and screening tests; knowledge of cancer risk factors; barriers in seeking medical advice; and attitudes towards cancer and cancer prevention. The questionnaire was evaluated for construct validity using principal component analysis and internal consistency using Cronbach's alpha (α) coefficient. Test-retest reliability was assessed with a 10-14 days interval and measured using Pearson product-moment correlation. The initial 77-item CAQ was reduced to 63 items, with satisfactory construct validity, and a high total internal consistency (Cronbach's α=0.77). A total of 143 students completed the questionnaire for the test-retest reliability obtaining a correlation of 0.72 (p<0.001) overall. The CAQ could provide a reliable and valid measure that can be used to assess cancer awareness among local Chinese undergraduate students. However, further studies among students from different backgrounds (e.g. ethnicity) are required in order to facilitate the use of the cancer awareness questionnaire among all university students.
Hill, Peter B
2015-06-01
Grading of erythema in clinical practice is a subjective assessment that cannot be confirmed using a definitive test; nevertheless, erythema scores are typically measured in clinical trials assessing the response to treatment interventions. Most commonly, ordinal scales are used for this purpose, but the optimal number of categories in such scales has not been determined. This study aimed to compare the reliability and agreement of a four-point and a six-point ordinal scale for the assessment of erythema in digital images of canine skin. Fifteen digital images showing varying degrees of erythema were assessed by specialist dermatologists and laypeople, using either the four-point or the six-point scale. Reliability between the raters was assessed using intraclass correlation coefficients and Cronbach's α. Agreement was assessed using the variation ratio (the percentage of respondents who chose the mode, the most common answer). Intraobserver variability was assessed by comparing the results of two grading sessions, at least 6 weeks apart. Both scales demonstrated high reliability, with intraclass correlation coefficient values and Cronbach's α above 0.99. However, the four-point scale demonstrated significantly superior agreement, with variation ratios for the four-point scale averaging 74.8%, compared with 56.2% for the six-point scale. Intraobserver consistency for the four-point scale was very high. Although both scales demonstrated high reliability, the four-point scale was superior in terms of agreement. For the assessment of erythema in clinical trials, a four-point ordinal scale is recommended. © 2014 ESVD and ACVD.
Lee, Miyoung; Zhu, Weimo; Ackley-Holbrook, Elizabeth; Brower, Diana G; McMurray, Bryan
2014-07-01
It is critical to employ accurate measures when assessing physical activity (PA) barriers in any subpopulation, yet existing measures are not appropriate for persons with blindness or visual impairment (PBVI) due to a lack of validity or reliability evidence. To develop and calibrate a PA barrier scale for PBVI. An expert panel (n = 3) and 18 PBVI were recruited to establish content validity for a PA barriers subscale; 160 PBVI (96 females) completed the scale along with the Physical Activity Scale for Individuals with Physical Disabilities for calibration. To establish construct-related validity evidence, Confirmative factor analysis (CFA) and Rasch analysis were applied. To investigate internal consistency and reliability, Cronbach's alpha and the reliability coefficient (R) were employed, respectively. Following CFA and Rasch analyses, five items were eliminated due to misfits; reliability coefficients were unchanged upon deletion of these items. The barriers perceived by PBVI to have the most negative impact on PA included "lack of self-discipline" (logit = 1.40) and "lack of motivation" (logit = 1.27). "Too many stairs in the exercise facility" (logit = -1.49) was perceived to have the least impact. The newly-developed scale was found to be a valid and reliable tool for evaluating PA barriers in PBVI. To enhance promotion of health-producing levels of PA in PBVI, practitioners should consider applying this new tool as a precursor to programs aimed at improving PA participation in this group. Copyright © 2014 Elsevier Inc. All rights reserved.
Koyama, Utako; Murayama, Nobuko
2011-08-01
This qualitative and quantitative research was conducted to develop an empowerment scale for health promotion volunteers (hereinafter referred to as the ESFHPV), key persons responsible for creating healthy communities. A focus group interview was conducted with four groups of health promotion volunteers from two cities in S Public Health Center of N Prefecture. A qualitative analysis was employed and a 32-item draft scale was created. The reliability and validity of this scale were then evaluated using quantitative methods. A questionnaire survey was conducted in 2009 for all 660 health promotion volunteers across the 2 cities. Of 401 respondents (response rate, 60.8%), 356 (53.9%) provided valid responses and were thus included in the analysis. 1) Internal consistency was confirmed by item-total correlation analysis (I-T analysis), assessment of Cronbach's coefficient alpha for all except one item and good-poor analysis (G-P analysis). Four items were excluded from the 32-item draft scale because of correlation coefficients more than 0.7, leaving 28 items for analysis. 2) Based on the results obtained from the factor analysis performed on the 28 provisional empowerment questions, 28 items were chosen for inclusion in the ESFHPV. These items consisted of four sub-scales, namely 'activity for healthy community' (10 items), 'intention for solving health problems of the community' (10 items), 'democratic organization activity' (four items) and 'growth as individual health promotion volunteers' (four items). 3) The Cronbach's coefficient alpha for the ESFHPV and its four sub-scales were 0.93, 0.88, 0.89, 0.84 and 0.79 respectively. The coefficients of I-T analysis were between 0.33 and 0.69. 4) The health promotion volunteers who attended other community activities demonstrated significantly high scores for the ESFHPV and the four sub-scales. Persons who were above 60 years, had a longer duration of activity as a health promotion volunteer and were housewives showed significantly high scores on the first sub-scale, 'growth as individual health promotion volunteers' To measure the empowerment levels of health promotion volunteers, a 28-item scale was developed and its reliability and validity were confirmed. Health promotion volunteers as well as the public health nurses who assist them can use this scale to assess the empowerment levels of other health promotion volunteers.
Using the CanMEDS roles when interviewing for an ophthalmology residency program.
Hamel, Patrick; Boisjoly, Hélène; Corriveau, Christine; Fallaha, Nicole; Lahoud, Salim; Luneau, Katie; Olivier, Sébastien; Rouleau, Jacinthe; Toffoli, Daniela
2007-04-01
To improve the admissions process for the Université de Montréal (UdeM) ophthalmology residency program, the interview structure was modified to encompass the seven CanMEDS roles introduced by the Royal College of Physicians and Surgeons of Canada (RCPSC). These roles include an applicant's abilities as a communicator, collaborator, manager, health advocate, professional, scholar, and medical expert. In this retrospective pilot study, the records of all applicants were reviewed by 8 members of the admissions committee, with a high intraclass correlation coefficient of 0.814. Four 2-person interview teams were then formed. The first 3 groups asked the applicants specific questions based on 2-3 of the CanMEDS roles, marking their impressions of each candidate on a visual analogue scale. The last group answered candidates' questions about the program but assigned no mark. The intraclass correlations for the teams were 0.900, 0.739, and 0.585, demonstrating acceptable interrater reliability for 2 of the teams. Pearson correlation coefficients between groups of interviewers were considered adequate at 0.562, 0.432, and 0.417 (p < 0.05). For each interviewer, the Pearson correlation coefficient between record marking and interview scoring was either not statistically significant or very low. By basing the 2006 interview process on the CanMEDS roles defined by the RCPSC, information was obtained about the candidates that could not have been retrieved by a review of the medical students' records alone. Reliability analysis confirmed that this new method of conducting interviews provided sound and reliable judging and rating consistency between all members of the admissions committee.
AZARI, Nadia; SOLEIMANI, Farin; VAMEGHI, Roshanak; SAJEDI, Firoozeh; SHAHSHAHANI, Soheila; KARIMI, Hossein; KRASKIAN, Adis; SHAHROKHI, Amin; TEYMOURI, Robab; GHARIB, Masoud
2017-01-01
Objective Bayley Scales of infant & toddler development is a well-known diagnostic developmental assessment tool for children aged 1–42 months. Our aim was investigating the validity & reliability of this scale in Persian speaking children. Materials & Methods The method was descriptive-analytic. Translation- back translation and cultural adaptation was done. Content & face validity of translated scale was determined by experts’ opinions. Overall, 403 children aged 1 to 42 months were recruited from health centers of Tehran, during years of 2013-2014 for developmental assessment in cognitive, communicative (receptive & expressive) and motor (fine & gross) domains. Reliability of scale was calculated through three methods; internal consistency using Cronbach’s alpha coefficient, test-retest and interrater methods. Construct validity was calculated using factor analysis and comparison of the mean scores methods. Results Cultural and linguistic changes were made in items of all domains especially on communication subscale. Content and face validity of the test were approved by experts’ opinions. Cronbach’s alpha coefficient was above 0.74 in all domains. Pearson correlation coefficient in various domains, were ≥ 0.982 in test retest method, and ≥0.993 in inter-rater method. Construct validity of the test was approved by factor analysis. Moreover, the mean scores for the different age groups were compared and statistically significant differences were observed between mean scores of different age groups, that confirms validity of the test. Conclusion The Bayley Scales of Infant and Toddler Development is a valid and reliable tool for child developmental assessment in Persian language children. PMID:28277556
Consistency of Lower-Body Dimensions Using Surface Landmarks and Simple Measurement Tools.
Caia, Johnpaul; Weiss, Lawrence W; Chiu, Loren Z F; Schilling, Brian K; Paquette, Max R
2016-09-01
Caia, J, Weiss, LW, Chiu, LZF, Schilling, BK, and Paquette, MR. Consistency of lower-body dimensions using surface landmarks and simple measurement tools. J Strength Cond Res 30(9): 2600-2608, 2016-Body dimensions may influence various types of physical performance. This study was designed to establish the reliability and precision of bilateral lower-body dimensions using surface anatomic landmarks and either sliding calipers or goniometry. Fifty university students (25 men and 25 women) were measured on 2 separate occasions separated by 48 or 72 hours. A small digital caliper was used to acquire longitudinal dimensions of the feet, whereas a larger broad-blade caliper was used to measure lower-limb, hip, and pelvic dimensions. Quadriceps angle (Q-angle) was determined through surface goniometry. Data for all foot and lower-limb dimensions were both reliable and precise (intraclass correlation coefficient (ICC) ≥0.72, SEM 0.1-0.5 cm). Measures of Q-angle were also reliable and precise (ICC ≥0.85, SEM 0.2-0.4°). Findings from this investigation demonstrate that lower-body dimensions may be reliably and precisely measured through simple practical tests, when surface anatomic landmarks and standardized procedures are used. Although intertester reliability remains to be established, meticulous adherence to specific measurement protocols is likely to yield viable output for lower-body dimensions when more sophisticated methods are unavailable or inappropriate.
Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Lehto, Rebecca H; Allen, Kelly A; Goudarzian, Amir Hossein; Yaghoobzadeh, Ameneh; Soleimani, Mohammad Ali
2017-07-01
Objective: Limited research has examined the psychometric properties of death depression scales in Persian populations with cardiac disease despite the need for valid assessment tools for evaluating depressive symptoms in patients with life-limiting chronic conditions. The present study aimed at evaluating the reliability and validity of the Persian Version of Death Depression Scale - Revised (DDS-R) in Iranian patients who had recent acute myocardial infarction (AMI). Method: This psychometric study was conducted with a convenience sample of 407 patients with AMI diagnosis who completed the Persian version of the DDS-R. The face, content, and construct validity of the scale were ascertained. Internal consistency, test-retest, and construct reliability (CR) were used to assess reliability of the Persian Version of DDS-R. Results: Based on maximum likelihood exploratory factor analysis and consideration of conceptual meaning, a 4-factor solution was identified, explaining 75.89% of the total variance. Goodness-of-fit indices (GFI), Comparative Fit Index (CFI), Normed Fit Index (NFI), Incremental Fit Index (IFI), and Root Mean Square Error of Approximation (RMSEA) in the final DDS-R structure demonstrated the adequacy of the 4-domain structure. The internal consistency, construct reliability, and Intra-class Correlation Coefficients (ICC) were greater than .70. Conclusion: The DDS-R was found to be a valid and reliable assessment tool for evaluating death depression symptoms in Iranian patients with AMI.
Arbab, Dariusch; van Ochten, Johannes H M; Schnurr, Christoph; Bouillon, Bertil; König, Dietmar
2017-12-01
Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures. The intention of this study was to evaluate reliability, validity, responsiveness and minimally important change of the German version of the Hip dysfunction and osteoarthritis outcome score (HOOS). The German HOOS was investigated in 251 consecutive patients before and 6 months after total hip arthroplasty. All patients completed HOOS, Oxford-Hip Score, Short-Form (SF-36) and numeric scales for pain and disability. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German HOOS demonstrated excellent test-retest reliability with intraclass correlation coefficient values > 0.7. Cronbach´s alpha values demonstrated strong internal consistency. As hypothesized, HOOS subscales strongly correlated with corresponding OHS and SF-36 domains. All subscales showed excellent (effect size/standardized response means > 0.8) responsiveness between preoperative assessment and postoperative follow-up. The HOOS and all subdomains showed higher changes than the minimal detectable change which indicates true changes. The German version of the HOOS demonstrated good psychometric properties. It proved to be valid, reliable and responsive to the changes instrument for use in patients with hip osteoarthritis undergoing total hip replacement.
Mota-Anaya, Evelin; Yumpo-Cárdenas, Daniel; Alva-Bravo, Edmundo; Wright-Nunes, Julie; Mayta-Tristán, Percy
2016-08-08
Chronic kidney disease (CKD) affects 50 million people globally. Several studies show the importance of implementing interventions that enhance patients knowledge about their disease. In 2011 the Kidney Disease Knowledge Survey (KiKS) was developed: a questionnaire that assesses the specific knowledge about chronic kidney disease in pre-dialysis patients. To translate to Spanish, culturally adapt and validate the Kidney Disease Knowledge Survey questionnaire in a population of patients with pre-dialysis chronic kidney disease. We carried out a Spanish translation and cross-cultural adaptation of the Kidney Disease Knowledge Survey questionnaire. Subsequently, we determined its validity and reliability. We determined the validity through construct validity; and reliability by evaluating its internal consistency and its intra-observer reliability (test-retest). We found a good internal consistency (Kuder-Richardson = 0.85). The intra-observer reliability was measured by the intra-class correlation coefficient that yielded a value of 0.78 (95% CI: 0.5-1.0). This value indicated a good reproducibility; also, the mean difference of -1.1 test-retest SD 6.0 (p = 0.369) confirms this finding. The translated Spanish version of the Kidney Disease Knowledge Survey is acceptable and equivalent to the original version; it also has a good reliability, validity and reproducibility. Therefore, it can be used in a population of patients with pre-dialysis chronic kidney disease.
Brazilian version of the body dysmorphic disorder examination.
Jorge, Renata Trajano Borges; Sabino Neto, Miguel; Natour, Jamil; Veiga, Daniela Francescato; Jones, Anamaria; Ferreira, Lydia Masako
2008-03-06
Body image improvement is considered to be the main reason for undergoing plastic surgery. The objective was to translate the Body Dysmorphic Disorder Examination (BDDE) into Brazilian Portuguese and to adapt and validate this questionnaire for use in Brazil. Cross-sectional survey, at the Department of Plastic Surgery of Universidade Federal de São Paulo. The BDDE was first translated into Portuguese and then back-translated into English. These translations were then discussed by healthcare professionals in order to establish the final Brazilian version. In a second stage, the validity and reliability of the BDDE were assessed. For this, patients were initially interviewed by two interviewers and subsequently, by only one of these interviewers. On the first occasion, in addition to the BDDE, the body shape questionnaire (BSQ) and the Rosenberg self-esteem scale were also applied. These questionnaires were applied to 90 patients. Six questions were modified during the assessment of cultural equivalence. Cronbach's alpha was 0.89 and the intraclass correlation coefficients for interobserver and test-retest reliability were 0.91 and 0.87, respectively. Pearson's coefficient showed no correlation between the BDDE and the Rosenberg self-esteem scale (0.22), whereas there was a moderate correlation between the BDDE and the BSQ (0.64). The BDDE was successfully translated and adapted, with good internal consistency, reliability and construct validity.
Developing self-concept instrument for pre-service mathematics teachers
NASA Astrophysics Data System (ADS)
Afgani, M. W.; Suryadi, D.; Dahlan, J. A.
2018-01-01
This study aimed to develop self-concept instrument for undergraduate students of mathematics education in Palembang, Indonesia. Type of this study was development research of non-test instrument in questionnaire form. A Validity test of the instrument was performed with construct validity test by using Pearson product moment and factor analysis, while reliability test used Cronbach’s alpha. The instrument was tested by 65 undergraduate students of mathematics education in one of the universities at Palembang, Indonesia. The instrument consisted of 43 items with 7 aspects of self-concept, that were the individual concern, social identity, individual personality, view of the future, the influence of others who become role models, the influence of the environment inside or outside the classroom, and view of the mathematics. The result of validity test showed there was one invalid item because the value of Pearson’s r was 0.107 less than the critical value (0.244; α = 0.05). The item was included in social identity aspect. After the invalid item was removed, Construct validity test with factor analysis generated only one factor. The Kaiser-Meyer-Olkin (KMO) coefficient was 0.846 and reliability coefficient was 0.91. From that result, we concluded that the self-concept instrument for undergraduate students of mathematics education in Palembang, Indonesia was valid and reliable with 42 items.
McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen
2010-01-01
Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p < 0.001-0.004). The evidence-based practice profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
Ratter, Julia; Radlinger, Lorenz; Lucas, Cees
2014-09-01
Are submaximal and maximal exercise tests reliable, valid and acceptable in people with chronic pain, fibromyalgia and fatigue disorders? Systematic review of studies of the psychometric properties of exercise tests. People older than 18 years with chronic pain, fibromyalgia and chronic fatigue disorders. Studies of the measurement properties of tests of physical capacity in people with chronic pain, fibromyalgia or chronic fatigue disorders were included. Studies were required to report: reliability coefficients (intraclass correlation coefficient, alpha reliability coefficient, limits of agreements and Bland-Altman plots); validity coefficients (intraclass correlation coefficient, Spearman's correlation, Kendal T coefficient, Pearson's correlation); or dropout rates. Fourteen studies were eligible: none had low risk of bias, 10 had unclear risk of bias and four had high risk of bias. The included studies evaluated: Åstrand test; modified Åstrand test; Lean body mass-based Åstrand test; submaximal bicycle ergometer test following another protocol other than Åstrand test; 2-km walk test; 5-minute, 6-minute and 10-minute walk tests; shuttle walk test; and modified symptom-limited Bruce treadmill test. None of the studies assessed maximal exercise tests. Where they had been tested, reliability and validity were generally high. Dropout rates were generally acceptable. The 2-km walk test was not recommended in fibromyalgia. Moderate evidence was found for reliability, validity and acceptability of submaximal exercise tests in patients with chronic pain, fibromyalgia or chronic fatigue. There is no evidence about maximal exercise tests in patients with chronic pain, fibromyalgia and chronic fatigue. Copyright © 2014. Published by Elsevier B.V.
The inter and intra rater reliability of the Netball Movement Screening Tool.
Reid, Duncan A; Vanweerd, Rebecca J; Larmer, Peter J; Kingstone, Rachel
2015-05-01
To establish the inter- and intra-rater reliability of the Netball Movement Screening Tool, for screening adolescent female netball players. Inter- and intra-rater reliability study. Forty secondary school netball players were recruited to take part in the study. Twenty subjects were screened simultaneously and independently by two raters to ascertain inter-rater agreement. Twenty subjects were scored by rater one on two occasions, separated by a week, to ascertain intra-rater agreement. Inter and intra-rater agreement was assessed utilising the two-way mixed inter class correlation coefficient and weighted kappa statistics. No significant demographic differences were found between the inter and intra-rater groups of subjects. Inter class correlation coefficients' demonstrated excellent inter-rater (two-way mixed inter class correlation coefficients 0.84, standard error of measurement 0.25) and intra-rater (two-way mixed inter class correlation coefficients 0.96, standard error of measurement 0.13) reliability for the overall Netball Movement Screening Tool score and substantial-excellent (two-way mixed inter class correlation coefficients 1.0-0.65) inter-rater and substantial-excellent intra-rater (two-way mixed inter class correlation coefficients 0.96-0.79) reliability for the component scores of the Netball Movement Screening Tool. Kappa statistic showed substantial to poor inter-rater (k=0.75-0.32) and intra-rater (k=0.77-0.27) agreement for individual tests of the NMST. The Netball Movement Screening Tool may be a reliable screening tool for adolescent netball players; however the individual test scores have low reliability. The screening tool can be administered reliably by raters with similar levels of training in the tool but variable clinical experience. On-going research needs to be undertaken to ascertain whether the Netball Movement Screening Tool is a valid tool in ascertaining increased injury risk for netball players. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Effect of Items Direction (Positive or Negative) on the Reliability in Likert Scale. Paper-11
ERIC Educational Resources Information Center
Gul, Showkeen Bilal Ahmad; Qasem, Mamun Ali Naji; Bhat, Mehraj Ahmad
2015-01-01
In this paper an attempt was made to analyze the effect of items direction (positive or negative) on the Alpha Cronbach reliability coefficient and the Split Half reliability coefficient in Likert scale. The descriptive survey research method was used for the study and sample of 510 undergraduate students were selected by used random sampling…
Ertuğ, Nurcan
2018-06-01
The aim of this study was to determine the validity and reliability of the Turkish version of the V-scale, which measures nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. This validity and reliability study was conducted at a tertiary hospital in Ankara, Turkey, in 2016. A total of 169 ward nurses participated in the study. Exploratory factor analysis, Cronbach's alpha coefficient, and the intraclass correlation coefficient were used to determine the validity and reliability of the scale. A 5-factor, 16-item scale explained 60.823% of the total variance according to the validity analysis. Our version matched the original scale in terms of the number of items and factor structure. Cronbach's alpha coefficient of the Turkish version of the V-scale was 0.764. The test-retest reliability results were 0.855 for the overall intraclass correlation coefficient, and the t-test result was P > 0.05. The V-scale is a reliable and valid instrument to measure Turkish nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. © 2018 John Wiley & Sons Australia, Ltd.
Richter, Jörg
2015-04-01
Methods to assess intervention progress and outcome for frequent use are needed. To provide preliminary information about psychometric properties for the Norwegian version of the Brief Problems Monitor. Cronbach's alpha scores and intra-class correlation coefficients as indicators for internal consistency (reliability) and Pearson correlation coefficients between corresponding subscales of the long and short ASEBA form versions as well as multiple regression coefficients to explore the predictive power of the reduced item-set related to the corresponding scale-scores of the long version were calculated in large, representative data sets of Norwegian children and adolescents. Cronbach's alpha scores of the Norwegian version of the BPM subscales varied between 0.67 (attention BPM-youth) and 0.88 (attention BPM-teacher) and between 0.90 (BPM-youth) and 0.96 (BPM-teacher) for its total problem score. Corresponding subscales from the long versions and the BPM as well as the total problems scores were closely correlated with coefficients of high effect size (all r > 0.80). The variance of the items of the BPM explained about three-quarters or more of the variance in the corresponding subscales of the long version. The Norwegian BPM has good psychometric properties in terms of 1) being acceptable to good internal consistency and in terms of 2) regression coefficients of high effect size from the BPM items to the problem-scale scores of the long versions as validity indicators. Its use in clinical practice and research can be recommended.
Hwang, Huei-Lih; Lin, Huey-Shyan; Wang, Hsiu-Hung
2010-12-01
Death education involves acquiring knowledge, changing behavior, and developing proper views of life in both the affective and the value domains. Critical thinking that is honed through reflecting on life-and-death issues represents a way to reach these goals. Designing assessments able to measure college student content and critical thinking skills related to life-and-death issues is thus important. The Test of Critical Thinking Skills for Life-And-Death content (TCTS-LD) instrument requires the administration of additional tests to assess reliability and validity for future use in the assessment of perceptions on life and death. The purpose of this study was to refine the TCTS-LD. A cross-sectional, descriptive design was used to recruit 715 college students in southern Taiwan. Three structured scales were administered in class to the participants. Data were collected in 2004 and 2006. Confirmatory factor analysis was applied to validate the structure of scales. Examination of the reliability of the three-factor and 15-item scale revealed a Kuder-Richardson coefficient of internal consistency of .54. The split-half reliability coefficients were .47 in the Spearman-Brown correlation and .40 in the intraclass correlation coefficient (ICC). The test-retest reliability coefficients (n = 22) were .58 in Pearson correlation and .56 in ICC. In addition to content validity verification by experts and face validity by students, the validity of this test was assessed using three methods, including (a) a comparable validity rating between this test and the TCTS-A (r = .34, p < .001; (b) a contrast-group technique with different responses to the instrument between those in education and nursing majors (t = 2.71, p < .01), with scores of 10.98 (SD = 2.42) and 9.82 (SD = 2.25), respectively; and (c) a confirmatory factor analysis confirming that TCTS-LD is related to the three dimensions of assumption, evaluation, and induction (χ = 81.800, p = .158, normed chi-square χ/df = 1.169, comparative fit index [CFI] = .976, Tucker-Lewis index = .984, root mean square error of approximation [RMSEA] = 0.015). Three factors explained 31.19% of total variance for the revised TCTS-LD. The revised TCTS-LD scale improved performance and effectiveness to a certain degree. However, reliability and construct validity must be further tested to permit its use as an evaluation tool.
Devcich, Daniel A; Weller, Jennifer; Mitchell, Simon J; McLaughlin, Scott; Barker, Lauren; Rudolph, Jenny W; Raemer, Daniel B; Zammert, Martin; Singer, Sara J; Torrie, Jane; Frampton, Chris Ma; Merry, Alan F
2016-10-01
Realising the full potential of the WHO Surgical Safety Checklist (SSC) to reduce perioperative harm requires the constructive engagement of all operating room (OR) team members during its administration. To facilitate research on SSC implementation, a valid and reliable instrument is needed for measuring OR team behaviours during its administration. We developed a behaviourally anchored rating scale (BARS) for this purpose. We used a modified Delphi process, involving 16 subject matter experts, to compile a BARS with behavioural domains applicable to all three phases of the SSC. We evaluated the instrument in 80 adult OR cases and 30 simulated cases using two medical student raters and seven expert raters, respectively. Intraclass correlation coefficients were calculated to assess inter-rater reliability. Internal consistency and instrument discrimination were explored. Sample size estimates for potential study designs using the instrument were calculated. The Delphi process resulted in a BARS instrument (the WHOBARS) with five behavioural domains. Intraclass correlation coefficients calculated from the OR cases exceeded 0.80 for 80% of the instrument's domains across the SSC phases. The WHOBARS showed high internal consistency across the three phases of the SSC and ability to discriminate among surgical cases in both clinical and simulated settings. Fewer than 20 cases per group would be required to show a difference of 1 point between groups in studies of the SSC, where α=0.05 and β=0.8. We have developed a generic instrument for comprehensively rating the administration of the SSC and informing initiatives to realise its full potential. We have provided data supporting its capacity for discrimination, internal consistency and inter-rater reliability. Further psychometric evaluation is warranted. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Alfadhel, Saud A; Vennu, Vishal; Alnahdi, Ali H; Omar, Mohammed T; Alasmari, Saeed H; AlJafri, Zahra; Bindawas, Saad M
2018-06-07
The Knee Injury Osteoarthritis Outcome Score (KOOS) is a widely used joint-specific measure employed to evaluate pain, symptoms, activities of daily living, recreational activities, and quality of life in patients with knee osteoarthritis (OA). Although the original KOOS has been translated into many languages, a Saudi Arabic version is not available. This study aimed to culturally adapt and evaluate the psychometric properties of the Saudi Arabic version of the KOOS in patients with knee OA. The original KOOS was translated and adapted into Saudi Arabic version over six stages according to the guidelines suggested by Beaton and recommended by the American Association of Orthopedic Surgeons Outcome Committee. Patients diagnosed with knee OA (n = 136) were recruited to examine the psychometric properties, such as internal consistency that was tested using Cronbach's alpha, test-retest reliability that was analyzed using the intra-class correlation coefficient (ICC 2,1 ), and construct validity that examined by testing the correlations between the new version subscales, Form 36 Health Survey subscales, and the Visual Analog Scale, Spearman's correlation coefficient (r s ) was used to measure the correlations. A total of 122 (89.7%) of the 136 participants with knee OA completed the second re-test of new Saudi Arabic version. Excellent internal consistency (Cronbach's alpha = 0.87-0.92) was detected in the subscales of the adapted version, as well as excellent test-retest reliability (ICC 2,1 = 0.92-0.94). The pattern of correlation between the subscales of the Saudi Arabic version of the KOOS, SF-36 domains and the Visual Analog Scale for pain supported the construct validity of the adapted version. The Saudi Arabic version of the KOOS was well accepted and exhibited excellent reliability, internal consistency, and construct validity in Saudi patients with knee OA.
Hamamoto, Yosuke; Ito, Hiromu; Furu, Moritoshi; Ishikawa, Masahiro; Azukizawa, Masayuki; Kuriyama, Shinichi; Nakamura, Shinichiro; Matsuda, Shuichi
2015-09-01
The purposes of this study were to translate the new Knee Society Score (KSS) into Japanese and to evaluate the construct and content validity, test-retest reliability, and internal consistency of the Japanese version of the new KSS. The Japanese version of the KSS was developed according to cross-cultural guidelines by using the "translation-back translation" method to ensure content validity. KSS data were then obtained from patients who had undergone total knee arthroplasty (TKA). The psychometric properties evaluated were as follows: for feasibility, response rate, and floor and ceiling effects; for construct validity, internal consistency using Cronbach's alpha, and correlations with quality of life. Construct validity was evaluated by using Spearman's correlation coefficient to quantify the correlation between the KSS and the Japanese version of the Oxford 12-item Knee Score or Short Form 36 Health Survey (SF-36) questionnaires. The Japanese version of the KSS was sent to 93 consecutive osteoarthritic patients who underwent primary TKA in our institution. Fifty-five patients completed the questionnaires and were included in this study. Neither a floor nor ceiling effect was observed. The reliability proved excellent in the majority of domains, with intraclass correlation coefficients of 0.65-0.88. Internal consistency, assessed by Cronbach's alpha, was good to excellent for all domains (0.78-0.94). All of the four domains of the KSS correlated significantly with the Oxford 12-item Knee Score. The activity and satisfaction domains of the KSS correlated significantly with all and the majority of subscales of the SF-36, respectively, whereas symptoms and expectation domains showed significant correlations only with bodily pain and vitality subscales and with the physical function, bodily pain, and vitality subscales, respectively. The Japanese version of the new KSS is a valid, reliable, and responsive instrument to capture subjective aspects of the functional symptoms and abilities of patients who undergo TKA.
Koo, Terry K; Cohen, Jeffrey H; Zheng, Yongping
2011-11-01
Soft tissue exhibits nonlinear stress-strain behavior under compression. Characterizing its nonlinear elasticity may aid detection, diagnosis, and treatment of soft tissue abnormality. The purposes of this study were to develop a rate-controlled Mechano-Acoustic Indentor System and a corresponding finite element optimization method to extract nonlinear elastic parameters of soft tissue and evaluate its test-retest reliability. An indentor system using a linear actuator to drive a force-sensitive probe with a tip-mounted ultrasound transducer was developed. Twenty independent sites at the upper lateral quadrant of the buttock from 11 asymptomatic subjects (7 men and 4 women from a chiropractic college) were indented at 6% per second for 3 sessions, each consisting of 5 trials. Tissue thickness, force at 25% deformation, and area under the load-deformation curve from 0% to 25% deformation were calculated. Optimized hyperelastic parameters of the soft tissue were calculated with a finite element model using a first-order Ogden material model. Load-deformation response on a standardized block was then simulated, and the corresponding area and force parameters were calculated. Between-trials repeatability and test-retest reliability of each parameter were evaluated using coefficients of variation and intraclass correlation coefficients, respectively. Load-deformation responses were highly reproducible under repeated measurements. Coefficients of variation of tissue thickness, area under the load-deformation curve from 0% to 25% deformation, and force at 25% deformation averaged 0.51%, 2.31%, and 2.23%, respectively. Intraclass correlation coefficients ranged between 0.959 and 0.999, indicating excellent test-retest reliability. The automated Mechano-Acoustic Indentor System and its corresponding optimization technique offers a viable technology to make in vivo measurement of the nonlinear elastic properties of soft tissue. This technology showed excellent between-trials repeatability and test-retest reliability with potential to quantify the effects of a wide variety of manual therapy techniques on the soft tissue elastic properties. Copyright © 2011 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
Validity and Reliability of the Turkish Chronic Pain Acceptance Questionnaire
Akmaz, Hazel Ekin; Uyar, Meltem; Kuzeyli Yıldırım, Yasemin; Akın Korhan, Esra
2018-05-29
Pain acceptance is the process of giving up the struggle with pain and learning to live a worthwhile life despite it. In assessing patients with chronic pain in Turkey, making a diagnosis and tracking the effectiveness of treatment is done with scales that have been translated into Turkish. However, there is as yet no valid and reliable scale in Turkish to assess the acceptance of pain. To validate a Turkish version of the Chronic Pain Acceptance Questionnaire developed by McCracken and colleagues. Methodological and cross sectional study. A simple randomized sampling method was used in selecting the study sample. The sample was composed of 201 patients, more than 10 times the number of items examined for validity and reliability in the study, which totaled 20. A patient identification form, the Chronic Pain Acceptance Questionnaire, and the Brief Pain Inventory were used to collect data. Data were collected by face-to-face interviews. In the validity testing, the content validity index was used to evaluate linguistic equivalence, content validity, construct validity, and expert views. In reliability testing of the scale, Cronbach’s α coefficient was calculated, and item analysis and split-test reliability methods were used. Principal component analysis and varimax rotation were used in factor analysis and to examine factor structure for construct concept validity. The item analysis established that the scale, all items, and item-total correlations were satisfactory. The mean total score of the scale was 21.78. The internal consistency coefficient was 0.94, and the correlation between the two halves of the scale was 0.89. The Chronic Pain Acceptance Questionnaire, which is intended to be used in Turkey upon confirmation of its validity and reliability, is an evaluation instrument with sufficient validity and reliability, and it can be reliably used to examine patients’ acceptance of chronic pain.
Validity and Reliability of the Turkish Chronic Pain Acceptance Questionnaire
Akmaz, Hazel Ekin; Uyar, Meltem; Kuzeyli Yıldırım, Yasemin; Akın Korhan, Esra
2018-01-01
Background: Pain acceptance is the process of giving up the struggle with pain and learning to live a worthwhile life despite it. In assessing patients with chronic pain in Turkey, making a diagnosis and tracking the effectiveness of treatment is done with scales that have been translated into Turkish. However, there is as yet no valid and reliable scale in Turkish to assess the acceptance of pain. Aims: To validate a Turkish version of the Chronic Pain Acceptance Questionnaire developed by McCracken and colleagues. Study Design: Methodological and cross sectional study. Methods: A simple randomized sampling method was used in selecting the study sample. The sample was composed of 201 patients, more than 10 times the number of items examined for validity and reliability in the study, which totaled 20. A patient identification form, the Chronic Pain Acceptance Questionnaire, and the Brief Pain Inventory were used to collect data. Data were collected by face-to-face interviews. In the validity testing, the content validity index was used to evaluate linguistic equivalence, content validity, construct validity, and expert views. In reliability testing of the scale, Cronbach’s α coefficient was calculated, and item analysis and split-test reliability methods were used. Principal component analysis and varimax rotation were used in factor analysis and to examine factor structure for construct concept validity. Results: The item analysis established that the scale, all items, and item-total correlations were satisfactory. The mean total score of the scale was 21.78. The internal consistency coefficient was 0.94, and the correlation between the two halves of the scale was 0.89. Conclusion: The Chronic Pain Acceptance Questionnaire, which is intended to be used in Turkey upon confirmation of its validity and reliability, is an evaluation instrument with sufficient validity and reliability, and it can be reliably used to examine patients’ acceptance of chronic pain. PMID:29843496
Marcos-Garcés, V; Harvat, M; Molina Aguilar, P; Ferrández Izquierdo, A; Ruiz-Saurí, A
2017-08-01
Measurement of collagen bundle orientation in histopathological samples is a widely used and useful technique in many research and clinical scenarios. Fourier analysis is the preferred method for performing this measurement, but the most appropriate staining and microscopy technique remains unclear. Some authors advocate the use of Haematoxylin-Eosin (H&E) and confocal microscopy, but there are no studies comparing this technique with other classical collagen stainings. In our study, 46 human skin samples were collected, processed for histological analysis and stained with Masson's trichrome, Picrosirius red and H&E. Five microphotographs of the reticular dermis were taken with a 200× magnification with light microscopy, polarized microscopy and confocal microscopy, respectively. Two independent observers measured collagen bundle orientation with semiautomated Fourier analysis with the Image-Pro Plus 7.0 software and three independent observers performed a semiquantitative evaluation of the same parameter. The average orientation for each case was calculated with the values of the five pictures. We analyzed the interrater reliability, the consistency between Fourier analysis and average semiquantitative evaluation and the consistency between measurements in Masson's trichrome, Picrosirius red and H&E-confocal. Statistical analysis for reliability and agreement was performed with the SPSS 22.0 software and consisted of intraclass correlation coefficient (ICC), Bland-Altman plots and limits of agreement and coefficient of variation. Interrater reliability was almost perfect (ICC > 0.8) with all three histological and microscopy techniques and always superior in Fourier analysis than in average semiquantitative evaluation. Measurements were consistent between Fourier analysis by one observer and average semiquantitative evaluation by three observers, with an almost perfect agreement with Masson's trichrome and Picrosirius red techniques (ICC > 0.8) and a strong agreement with H&E-confocal (0.7 < ICC < 0.8). Comparison of measurements between the three techniques for the same observer showed an almost perfect agreement (ICC > 0.8), better with Fourier analysis than with semiquantitative evaluation (single and average). These results in nonpathological skin samples were also confirmed in a preliminary analysis in eight scleroderma skin samples. Our results show that Masson's trichrome and Picrosirius red are consistent with H&E-confocal for measuring collagen bundle orientation in histological samples and could thus be used indistinctly for this purpose. Fourier analysis is superior to average semiquantitative evaluation and should keep being used as the preferred method. © 2017 The Authors Journal of Microscopy © 2017 Royal Microscopical Society.
Machado, Roberta Ismael Lacerda; Souto, Lais Medeiros; Freire, Eutilia Andrade Medeiros
2014-01-01
Systemic sclerosis (SSc) is a multisystem disease, autoimmune disorder characterized by a fibroblastic disfunction, with significant impact on quality of life (QoL), measured by instruments or questionnaires that usually were formulated in other languages and in different cultural contexts. Translate into Brazilian Portuguese, cross cultural adaptation and assess the reliability and validity of the Systemic Sclerosis Questionnaire (SySQ). Translation and adaptation: into Portuguese and cross-cultural adaptation was performed in accordance with studies on questionnaire translation methodology into other languages. Reliability: it was analyzed using three interviews with different interviewers, two on the same day (interobserver) and the third within 14 days of the first assessment (intraobserver).Validity was assessed by correlating clinical and quality of life parameters with the domain scores of Sysc. a descriptive analysis of the study sample. Reproducibility was assessed using an intraclass correlation coefficient (ICC). Internal consistency was assessed using Cronbach's alpha coefficient. To assess validity we used Spearman correlation coefficient. Five percent was the level of significance adopted for all statistical tests. In the evaluation of the questionnaires, the results were similar to the original questionnaire, the internal consistency ranging between 0.73 and 0.93 for each item. The interobserver reproducibility was very good for all domains (α = 0.786 to 0.983) and intraobserver agreement was considered very good for general symptoms domain (ICC = 0.916), good for musculoskeletal symptoms domain (ICC = 0.897) and cardiopulmonary domain (ICC = 0.842) and reasonable for gastrointestinal symptoms domain (ICC = 0.686). The Brazilian Portuguese version of SySQ proved to be reproducible and valid for our population, using a recognized methodology for translation and cultural adaptation of questionnaires, as well as to assess the reproducibility and validity.
de los Santos, Gonzalo; Reyes, Pablo; del Castillo, Raúl; Fragola, Claudio; Royuela, Ana
2015-11-01
Our objective was to perform translation, cross-cultural adaptation and validation of the sino-nasal outcome test 22 (SNOT-22) to Spanish language. SNOT-22 was translated, back translated, and a pretest trial was performed. The study included 119 individuals divided into 60 cases, who met diagnostic criteria for chronic rhinosinusitis according to the European Position Paper on Rhinosinusitis 2012; and 59 controls, who reported no sino-nasal disease. Internal consistency was evaluated with Cronbach's alpha test, reproducibility with Kappa coefficient, reliability with intraclass correlation coefficient (ICC), validity with Mann-Whitney U test and responsiveness with Wilcoxon test. In cases, Cronbach's alpha was 0.91 both before and after treatment, as for controls, it was 0.90 at their first test assessment and 0.88 at 3 weeks. Kappa coefficient was calculated for each item, with an average score of 0.69. ICC was also performed for each item, with a score of 0.87 in the overall score and an average among all items of 0.71. Median score for cases was 47, and 2 for controls, finding the difference to be highly significant (Mann-Whitney U test, p < 0.001). Clinical changes were observed among treated patients, with a median score of 47 and 13.5 before and after treatment, respectively (Wilcoxon test, p < 0.001). The effect size resulted in 0.14 in treated patients whose status at 3 weeks was unvarying; 1.03 in those who were better and 1.89 for much better group. All controls were unvarying with an effect size of 0.05. The Spanish version of the SNOT-22 has the internal consistency, reliability, reproducibility, validity and responsiveness necessary to be a valid instrument to be used in clinical practice.
Music therapy career aptitude and generalized self-efficacy in music therapy students.
Lim, Hayoung A; Befi, Cathy M
2014-01-01
While the Music Therapy Career Aptitude Test (MTCAT) provides a measure of student aptitude, measures of perceived self-efficacy may provide additional information about a students' suitability for a music therapy career. As a first step in determining whether future studies examining combined scores from the MTCAT and the Generalized Self-Efficacy (GSE) scale would be useful to help predict academic success in music therapy, we explored the internal reliability of these two measures in a sample of undergraduate students, and the relationship (concurrent validity) of the measures to one another. Eighty undergraduate music therapy students (14 male; 66 female) completed the MTCAT and GSE. To determine internal reliability we conducted tests of normality and calculated Cronbach's Coefficient Alpha for each measure. Pearson correlation coefficients were calculated to ascertain the strength of the relationship between the MTCAT and GSE. MTCAT scores were normally distributed and had high internal consistency (Cronbach's α = 0.706). GSE scores were not normally distributed, but had high internal consistency (Cronbach's α = 0.748). The correlation coefficient analysis revealed that MTCAT and GSE scores were moderately correlated ((r = 0.426, p < 0.0001). MTCAT scores can be used to partially determine perceived self-efficacy in undergraduate music therapy students; however, a more complete picture of student suitability for music therapy may be determined by administering the GSE alongside the MTCAT. Future studies are needed to determine whether combined MTCAT and GSE scores can be used to predict student success in an undergraduate music therapy program. © the American Music Therapy Association 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development and Validation of a Social Capital Questionnaire for Adolescent Students (SCQ-AS)
Paiva, Paula Cristina Pelli; de Paiva, Haroldo Neves; de Oliveira Filho, Paulo Messias; Lamounier, Joel Alves; Ferreira, Efigênia Ferreira e; Ferreira, Raquel Conceição; Kawachi, Ichiro; Zarzar, Patrícia Maria
2014-01-01
Objectives Social capital has been studied due to its contextual influence on health. However, no specific assessment tool has been developed and validated for the measurement of social capital among 12-year-old adolescent students. The aim of the present study was to develop and validate a quick, simple assessment tool to measure social capital among adolescent students. Methods A questionnaire was developed based on a review of relevant literature. For such, searches were made of the Scientific Electronic Library Online, Latin American and Caribbean Health Sciences, The Cochrane Library, ISI Web of Knowledge, International Database for Medical Literature and PubMed Central bibliographical databases from September 2011 to January 2014 for papers addressing assessment tools for the evaluation of social capital. Focus groups were also formed by adolescent students as well as health, educational and social professionals. The final assessment tool was administered to a convenience sample from two public schools (79 students) and one private school (22 students), comprising a final sample of 101 students. Reliability and internal consistency were evaluated using the Kappa coefficient and Cronbach's alpha coefficient, respectively. Content validity was determined by expert consensus as well as exploratory and confirmatory factor analysis. Results The final version of the questionnaire was made up of 12 items. The total scale demonstrated very good internal consistency (Cronbach's alpha: 0.71). Reproducibility was also very good, as the Kappa coefficient was higher than 0.72 for the majority of items (range: 0.63 to 0.97). Factor analysis grouped the 12 items into four subscales: School Social Cohesion, School Friendships, Neighborhood Social Cohesion and Trust (school and neighborhood). Conclusions The present findings indicate the validity and reliability of the Social Capital Questionnaire for Adolescent Students. PMID:25093409
Park, Sung Shin; Kwon, Tack-Kyun; Choi, Seong Hee; Lee, Won Yong; Hong, Young Hye; Jeong, Nyun Gi; Sung, Myung-Whun; Kim, Kwang Hyun
2013-01-01
The aim of this study was to assess the reliability and validity of the Pediatric Voice Handicap Index (pVHI) for cross-cultural adaptation of the Korean version with school age children. The questionnaire was translated into Korean and was completed by 101 Korean parents who have children with or without disordered voice. The Korean version-pVHI scores were obtained with 60 parents of normal children and 41 parents who have children with voice problems. Content validity was verified by five experienced speech-language pathologists with clinical specialization in voice disorders. Internal consistency was calculated through Cronbach's α coefficient and test-retest reliability of the Korean version-pVHI score was determined using Pearson product-moment correlation coefficients. Mann-Whitney U test was used to compare GRBAS with the Korean version-pVHI scores between normal and dysphonia group. The relationship between the parent-reported the Korean version-pVHI total scores and perceptual ratings of voice quality from experts was investigated using Spearman correlation coefficients. The results showed that the Korean version-pVHI provided a high internal consistency (α=0.92) and test-retest reliability of its subscales: total (T) 0.97, functional (F) 0.90, physical (P) 0.95, emotional (E) 0.92. The Korean version-pVHI mean scores in normal group were 1.28 (T), 0.62 (F), 0.35 (P) and 0.32 (E), respectively whereas those of the Korean version-pVHI in children group with dysphonia were 23.13 (T), 8.90 (F), 9.54 (P) and 4.93 (E). Significant differences in the Korean version-pVHI (T, F, P, E) and perceptual evaluation (grade, rough, breathy) between normal and dysphonia group were revealed (P<0.05). Moreover, relatively moderate-to-high correlation between the Korean version-pVHI parameters (T) and perceptual measures (G) was exhibited in children with dysphonia. The subjective Korean version-pVHI can be applicable and useful supplementary tool for evaluating parents' perception of their children's voice dysfunction, identifying multifactors on daily life affecting their children's voice and measuring treatment efficacy before and after therapeutic intervention. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Siu, B W M; Au-Yeung, C C Y; Chan, A W L; Chan, L S Y; Yuen, K K; Leung, H W; Yan, C K; Ng, K K; Lai, A C H; Davies, S; Collins, M
Mapping forensic psychiatric services with the security needs of patients is a salient step in service planning, audit and review. A valid and reliable instrument for measuring the security needs of Chinese forensic psychiatric inpatients was not yet available. This study aimed to develop and validate the Chinese version of the Security Needs Assessment Profile for measuring the profiles of security needs of Chinese forensic psychiatric inpatients. The Security Needs Assessment Profile by Davis was translated into Chinese. Its face validity, content validity, construct validity and internal consistency reliability were assessed by measuring the security needs of 98 Chinese forensic psychiatric inpatients. Principal factor analysis for construct validity provided a six-factor security needs model explaining 68.7% of the variance. Based on the Cronbach's alpha coefficient, the internal consistency reliability was rated as acceptable for procedural security (0.73), and fair for both physical security (0.62) and relational security (0.58). A significant sex difference (p=0.002) in total security score was found. The Chinese version of the Security Needs Assessment Profile is a valid and reliable instrument for assessing the security needs of Chinese forensic psychiatric inpatients. Copyright © 2017 Elsevier Ltd. All rights reserved.
Measurement of cochlear length using the 'A' value for cochlea basal diameter: A feasibility study.
Deep, Nicholas L; Howard, Brittany E; Holbert, Sarah O; Hoxworth, Joseph M; Barrs, David M
2017-07-01
To determine whether the cochlea basal diameter (A value) measurement can be consistently and precisely obtained from high-resolution temporal bone imaging for use in cochlear length estimation. A feasibility study at a tertiary referral center was performed using the temporal bone CTs of 40 consecutive patients. The distance from the round window to the lateral wall was measured for each cochlea by two independent reviewers, a neuroradiologist and an otolaryngologist. The interrater reliability was calculated using the intraclass correlation coefficient (ICC) and the Bland-Altman plot. Forty patients (19 males, 21 females) for a total of 80 cochleae were included. Interrater reliability on the same ear had a high level of agreement by both the ICC and the Bland-Altman plot. ICCs were 0.90 (95% CI: 0.82, 0.94) for the left ear and 0.96 (95% CI: 0.92, 0.98) for the right ear. Bland-Altman plot confirmed interrater reliability with all 96% of measurements falling within the 95% limits of agreement. Measurement between the round window and lateral cochlear wall can be consistently and reliably obtained from high-resolution temporal bone CT scans. Thus, it is feasible to utilize this method to estimate the cochlear length of patients undergoing cochlear implantation.
Psychometric evaluation of the Nursing Stress Scale (NSS) among Chinese nurses in Taiwan.
Lee, Mei-Hua; Holzemer, William L; Faucett, Julia
2007-01-01
The purpose of this study was to translate the Nursing Stress Scale (NSS) into Chinese and test its reliability and validity among Chinese nurses in Taiwan. Potential participants were asked to self-administer a Chinese version of the NSS. The agreement estimation was used to determine the equivalence of the meaning between the Chinese and original English versions and was rated by five bilingual nurses as 92% accurate for the 34 items. The test-retest reliability for the NSS at 2 weeks was .71 (p = .022, n=10). Internal consistency reliability and factor analysis were tested with 770 nurses from 65 inpatient units at a medical center in Taiwan. The internal consistency of the Chinese version of the NSS for an overall coefficient alpha is .91 for the total scale, and ranges from .67 to .79 for the subscales. The Chinese version of the NSS explains 53.77% of the variance in work stressors among Chinese nurses in Taiwan. Overall, the Chinese version of the NSS is internally consistent but may not be stable over 2 weeks. There was adequate evidence of the reliability and validity of the NSS-Chinese as an instrument appropriate to measure work stress among Chinese nurses. The translated NSS could be a useful tool for examining the frequency and major sources of stress experienced by Chinese nurses in hospital settings, and for the development of appropriate interventions for stress reduction.
ERIC Educational Resources Information Center
Dolan, Conor V.; Wicherts, Jelte M.; Molenaar, Peter C. M.
2004-01-01
We consider the question of how variation in the number and reliability of indicators affects the power to reject the hypothesis that the regression coefficients are zero in latent linear regression analysis. We show that power remains constant as long as the coefficient of determination remains unchanged. Any increase in the number of indicators…
Ghisi, Gabriela Lima de Melo; Sandison, Nicole; Oh, Paul
2016-03-01
To develop, pilot test and psychometrically validate a shorter version of the coronary artery disease education questionnaire (CADE-Q), called CADE-Q SV. Based on previous versions of the CADE-Q, cardiac rehabilitation (CR) experts developed 20 items divided into 5 knowledge domains to comprise the first version of the CADE-Q SV. To establish content validity, they were reviewed by an expert panel (N=12). Refined items were pilot-tested in 20 patients, in which clarity was provided. A final version was generated and psychometrically-tested in 132CR patients. Test-retest reliability was assessed via the intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and criterion validity with regard to patients' education and duration in CR. All ICC coefficients meet the minimum recommended standard. All domains were considered internally consistent (α>0.7). Criterion validity was supported by significant differences in mean scores by educational level (p<0.01) and duration in CR (p<0.05). Knowledge about exercise and nutrition was higher than knowledge about medical condition. The CADE-Q SV was demonstrated to have good reliability and validity. This is a short, quick and appropriate tool for application in clinical and research settings, assessing patients' knowledge during CR and as part of education programming. Copyright © 2015. Published by Elsevier Ireland Ltd.
Medina, Maria Del Mar; Carrillo, Alvaro; Polo, Ruben; Fernandez, Borja; Alonso, Daniel; Vaca, Miguel; Cordero, Adela; Perez, Cecilia; Muriel, Alfonso; Cobeta, Ignacio
2017-04-01
Objective To perform translation, cross-cultural adaptation, and validation of the Penn Acoustic Neuroma Quality-of-Life Scale (PANQOL) to the Spanish language. Study Design Prospective study. Setting Tertiary neurotologic referral center. Subjects and Methods PANQOL was translated and translated back, and a pretest trial was performed. The study included 27 individuals diagnosed with vestibular schwannoma. Inclusion criteria were adults with untreated vestibular schwannoma, diagnosed in the past 12 months. Feasibility, internal consistency, test-retest reliability, construct validity, and ceiling and floor effects were assessed for the present study. Results The mean overall score of the PANQOL was 69.21 (0-100 scale, lowest to highest quality of life). Cronbach's α was 0.87. Intraclass correlation coefficient was performed for each item, with an overall score of 0.92. The κ coefficient scores were between moderate and almost perfect in more than 92% of patients. Anxiety and energy domains of the PANQOL were correlated with both physical and mental components of the SF-12. Hearing, balance, and pain domains were correlated with the SF-12 physical component. Facial and general domains were not significantly correlated with any component of the SF-12. Furthermore, the overall score of the PANQOL was correlated with the physical component of the SF-12. Conclusion Feasibility, internal consistency, reliability, and construct validity outcomes in the current study support the validity of the Spanish version of the PANQOL.
Ohashi, Y; Tashiro, K; Itoyama, Y; Nakano, I; Sobue, G; Nakamura, S; Sumino, S; Yanagisawa, N
2001-04-01
Amyotrophic lateral sclerosis(ALS) is progressive, degenerative, fatal disease of the motor neuron. No efficacious therapy is available to slow the progressive loss of function, but several new approaches including neurotrophic factors, antioxidants and glutamate antagonists, are currently being evaluated as potential therapies. Mortality, and/or time to tracheostomy, muscle strength and pulmonary function are used as primary endpoints in clinical trials for treatment of ALS. The effect of new therapies on the quality of patients' lives are also important, so we sought to develop a rating scale to measure it. The revised ALS Functional Rating Scale(ALSFRS-R), which has addition of items to ALSFRS to enhance the ability to assess respiratory symptoms, is an assessment determining the degree of impairment in ALS patients' abilities to function independently in activities of daily living. It consists of 12 items to evaluate bulbar function, motor function and respiratory function and each item is scored from 0(unable) to 4(normal). We translated the English score into Japanese one with minor modification considering the inter cultural difference. And we examined reliability of the translated scale. As a measure of reliability, the intraclass correlation coefficient(ICC) was evaluated for total score and the Kappa coefficient proposed by Cohen and Kraemer was calculated for each item. Moreover, we examined sensitivity to clinical change over time and carried out the factor analysis to analyze the factorial structure. The subjects were 27 ALS patients and each was scored twice for reliability or three times for sensitivity by 2 to 5 neurologists and if possible, nurses. The ICC for total score was 0.97(95% C. I.; 0.94-0.98). Extension of the Kappa coefficients were 0.48 to 1.00 for inter-rater reliability and the averaged Kappa coefficients were 0.63 to 1.00 for intra rater reliability, respectively. Concerning the factorial structure, the contribution of the first factor(the first principal component) were 53.5% principal factor solution. The factor loadings of items were 0.52-0.91 except "salivation" and this factor almost equal to the simple sum of all items was interpreted as the general degree of deterioration. The promax votation revealed the riginally supposed factor structure with 3 factors(groups of items): neuromuscuclar function, respiratory function and bulbar function. The rating scale correlated with Global clinical impression of change(GCIC) scored by neurologists and declined with time, indicating its sensitivity to change. On the bases of these results, ALSFRS-R(Japanese version) is considered to be highly reliable enough for clinical use.
Tran, Thach Duc; Holton, Sara; Nguyen, Huong Thanh; Wolfe, Rory; Fisher, Jane
2017-01-01
Objectives To assess the internal consistency, latent structure and convergent validity of the Depression, Anxiety and Stress Scale-21 (DASS-21) among adolescents in Vietnam. Method An anonymous, self-completed questionnaire was conducted among 1,745 high school students in Hanoi, Vietnam between October, 2013 and January, 2014. Confirmatory factor analyses were performed to assess the latent structure of the DASS-21. Factorial invariance between girls and boys was examined. Cronbach alphas and correlation coefficients between DASS-21 factor scores and the domain scores of the Duke Health Profile Adolescent Vietnamese validated version (ADHP-V) were calculated to assess DASS-21 internal consistency and convergent validity. Results A total of 1,606/ 1,745 (92.6%) students returned the questionnaire. Of those, 1,387 students provided complete DASS-21 data. The scale demonstrated adequate internal consistency (Cronbach α: 0.761 to 0.906). A four-factor model showed the best fit to the data. Items loaded significantly on a common general distress factor, the depression, and the anxiety factors, but few on the stress factor (p<0.05). DASS-21 convergent validity was confirmed with moderate correlation coefficients (-0.47 to -0.66) between its factor scores and the ADHP-V mental health related domains. Conclusions The DASS-21 is reliable and suitable for use to assess symptoms of common mental health problems, especially depression and anxiety among Vietnamese adolescents. However, its ability in detecting stress among these adolescents may be limited. Further research is warrant to explore these results. PMID:28723909
Translation and validation of the German version of the Bournemouth Questionnaire for Neck Pain.
Soklic, Marina; Peterson, Cynthia; Humphreys, B Kim
2012-01-25
Clinical outcome measures are important tools to monitor patient improvement during treatment as well as to document changes for research purposes. The short-form Bournemouth questionnaire for neck pain patients (BQN) was developed from the biopsychosocial model and measures pain, disability, cognitive and affective domains. It has been shown to be a valid and reliable outcome measure in English, French and Dutch and more sensitive to change compared to other questionnaires. The purpose of this study was to translate and validate a German version of the Bournemouth questionnaire for neck pain patients. German translation and back translation into English of the BQN was done independently by four persons and overseen by an expert committee. Face validity of the German BQN was tested on 30 neck pain patients in a single chiropractic practice. Test-retest reliability was evaluated on 31 medical students and chiropractors before and after a lecture. The German BQN was then assessed on 102 first time neck pain patients at two chiropractic practices for internal consistency, external construct validity, external longitudinal construct validity and sensitivity to change compared to the German versions of the Neck Disability Index (NDI) and the Neck Pain and Disability Scale (NPAD). Face validity testing lead to minor changes to the German BQN. The Intraclass Correlation Coefficient for the test-retest reliability was 0.99. The internal consistency was strong for all 7 items of the BQN with Cronbach α's of .79 and .80 for the pre and post-treatment total scores. External construct validity and external longitudinal construct validity using Pearson's correlation coefficient showed statistically significant correlations for all 7 scales of the BQN with the other questionnaires. The German BQN showed greater responsiveness compared to the other questionnaires for all scales. The German BQN is a valid and reliable outcome measure that has been successfully translated and culturally adapted. It is shorter, easier to use, and more responsive to change than the NDI and NPAD.