comprehensibility reliability validity: Topics by Science.gov

Sample records for comprehensibility reliability validity

Measurement of fatigue: Comparison of the reliability and validity of single-item and short measures to a comprehensive measure.

PubMed

Kim, Hee-Ju; Abraham, Ivo

2017-01-01

Evidence is needed on the clinicometric properties of single-item or short measures as alternatives to comprehensive measures. We examined whether two single-item fatigue measures (i.e., Likert scale, numeric rating scale) or a short fatigue measure were comparable to a comprehensive measure in reliability (i.e., internal consistency and test-retest reliability) and validity (i.e., convergent, concurrent, and predictive validity) in Korean young adults. For this quantitative study, we selected the Functional Assessment of Chronic Illness Therapy-Fatigue for the comprehensive measure and the Profile of Mood States-Brief, Fatigue subscale for the short measure; and constructed two single-item measures. A total of 368 students from four nursing colleges in South Korea participated. We used Cronbach's alpha and item-total correlation for internal consistency reliability and intraclass correlation coefficient for test-retest reliability. We assessed Pearson's correlation with a comprehensive measure for convergent validity, with perceived stress level and sleep quality for concurrent validity and the receiver operating characteristic curve for predictive validity. The short measure was comparable to the comprehensive measure in internal consistency reliability (Cronbach's alpha=0.81 vs. 0.88); test-retest reliability (intraclass correlation coefficient=0.66 vs. 0.61); convergent validity (r with comprehensive measure=0.79); concurrent validity (r with perceived stress=0.55, r with sleep quality=0.39) and predictive validity (area under curve=0.88). Single-item measures were not comparable to the comprehensive measure. A short fatigue measure exhibited similar levels of reliability and validity to the comprehensive measure in Korean young adults. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
A Low Vision Reading Comprehension Test.

ERIC Educational Resources Information Center

Watson, G. R.; And Others

1996-01-01

Fifty adults (ages 28-86) with macular degeneration were given the Low Vision Reading Comprehension Assessment (LVRCA) to test its reliability and validity in evaluating the reading comprehension of those with vision impairments. The LVRCA was found to take only nine minutes to administer and was a valid and reliable tool. (CR)
Comprehension of Written Grammar Test: Reliability and Known-Groups Validity Study with Hearing and Deaf and Hard-of-Hearing Students

ERIC Educational Resources Information Center

Cannon, Joanna E.; Hubley, Anita M.; Millhoff, Courtney; Mazlouman, Shahla

2016-01-01

The aim of the current study was to gather validation evidence for the "Comprehension of Written Grammar" (CWG; Easterbrooks, 2010) receptive test of 26 grammatical structures of English print for use with children who are deaf and hard of hearing (DHH). Reliability and validity data were collected for 98 participants (49 DHH and 49…
The Validity and reliability of the Comprehensive Home Environment Survey (CHES).

PubMed

Pinard, Courtney A; Yaroch, Amy L; Hart, Michael H; Serrano, Elena L; McFerren, Mary M; Estabrooks, Paul A

2014-01-01

Few comprehensive measures exist to assess contributors to childhood obesity within the home, specifically among low-income populations. The current study describes the modification and psychometric testing of the Comprehensive Home Environment Survey (CHES), an inclusive measure of the home food, physical activity, and media environment related to childhood obesity. The items were tested for content relevance by an expert panel and piloted in the priority population. The CHES was administered to low-income parents of children 5 to 17 years (N = 150), including a subsample of parents a second time and additional caregivers to establish test-retest and interrater reliabilities. Children older than 9 years (n = 95), as well as parents (N = 150) completed concurrent assessments of diet and physical activity behaviors (predictive validity). Analyses and item trimming resulted in 18 subscales and a total score, which displayed adequate internal consistency (α = .74-.92) and high test-retest reliability (r ≥ .73, ps < .01) and interrater reliability (r ≥ .42, ps < .01). The CHES score and a validated screener for the home environment were correlated (r = .37, p < .01; concurrent validity). CHES subscales were significantly correlated with behavioral measures (r = -.20-.55, p < .05; predictive validity). The CHES shows promise as a valid/reliable assessment of the home environment related to childhood obesity, including healthy diet and physical activity.
A Validity and Reliability Update on the Informal Reading Inventory with Suggestions for Improvement.

ERIC Educational Resources Information Center

Klesius, Janell P.; Homan, Susan P.

1985-01-01

The article reviews validity and reliability studies on the informal reading inventory, a diagnostic instrument to identify reading grade-level placement and strengths and weaknesses in work recognition and comprehension. Gives suggestions to improve the validity and reliability of existing inventories and to evaluate them in newly published…
[Reliability and validity of the Chinese version on Comprehensive Scores for Financial Toxicity based on the patient-reported outcome measures].

PubMed

Yu, H H; Bi, X; Liu, Y Y

2017-08-10

Objective: To evaluate the reliability and validity of the Chinese version on comprehensive scores for financial toxicity (COST), based on the patient-reported outcome measures. Methods: A total of 118 cancer patients were face-to-face interviewed by well-trained investigators. Cronbach's α and Pearson correlation coefficient were used to evaluate reliability. Content validity index (CVI) and exploratory factor analysis (EFA) were used to evaluate the content validity and construct validity, respectively. Results: The Cronbach's α coefficient appeared as 0.889 for the whole questionnaire, with the results of test-retest were between 0.77 and 0.98. Scale-content validity index (S-CVI) appeared as 0.82, with item-content validity index (I-CVI) between 0.83 and 1.00. Two components were extracted from the Exploratory factor analysis, with cumulative rate as 68.04% and loading>0.60 on every item. Conclusion: The Chinese version of COST scale showed high reliability and good validity, thus can be applied to assess the financial situation in cancer patients.
Validity and Reliability of Published Comprehensive Theory of Mind Tests for Normal Preschool Children: A Systematic Review.

PubMed

Ziatabar Ahmadi, Seyyede Zohreh; Jalaie, Shohreh; Ashayeri, Hassan

2015-09-01

Theory of mind (ToM) or mindreading is an aspect of social cognition that evaluates mental states and beliefs of oneself and others. Validity and reliability are very important criteria when evaluating standard tests; and without them, these tests are not usable. The aim of this study was to systematically review the validity and reliability of published English comprehensive ToM tests developed for normal preschool children. We searched MEDLINE (PubMed interface), Web of Science, Science direct, PsycINFO, and also evidence base Medicine (The Cochrane Library) databases from 1990 to June 2015. Search strategy was Latin transcription of 'Theory of Mind' AND test AND children. Also, we manually studied the reference lists of all final searched articles and carried out a search of their references. Inclusion criteria were as follows: Valid and reliable diagnostic ToM tests published from 1990 to June 2015 for normal preschool children; and exclusion criteria were as follows: the studies that only used ToM tests and single tasks (false belief tasks) for ToM assessment and/or had no description about structure, validity or reliability of their tests. METHODological quality of the selected articles was assessed using the Critical Appraisal Skills Programme (CASP). In primary searching, we found 1237 articles in total databases. After removing duplicates and applying all inclusion and exclusion criteria, we selected 11 tests for this systematic review. There were a few valid, reliable and comprehensive ToM tests for normal preschool children. However, we had limitations concerning the included articles. The defined ToM tests were different in populations, tasks, mode of presentations, scoring, mode of responses, times and other variables. Also, they had various validities and reliabilities. Therefore, it is recommended that the researchers and clinicians select the ToM tests according to their psychometric characteristics, validity and reliability.
Validity and Reliability of Published Comprehensive Theory of Mind Tests for Normal Preschool Children: A Systematic Review

PubMed Central

Ziatabar Ahmadi, Seyyede Zohreh; Jalaie, Shohreh; Ashayeri, Hassan

2015-01-01

Objective: Theory of mind (ToM) or mindreading is an aspect of social cognition that evaluates mental states and beliefs of oneself and others. Validity and reliability are very important criteria when evaluating standard tests; and without them, these tests are not usable. The aim of this study was to systematically review the validity and reliability of published English comprehensive ToM tests developed for normal preschool children. Method: We searched MEDLINE (PubMed interface), Web of Science, Science direct, PsycINFO, and also evidence base Medicine (The Cochrane Library) databases from 1990 to June 2015. Search strategy was Latin transcription of ‘Theory of Mind’ AND test AND children. Also, we manually studied the reference lists of all final searched articles and carried out a search of their references. Inclusion criteria were as follows: Valid and reliable diagnostic ToM tests published from 1990 to June 2015 for normal preschool children; and exclusion criteria were as follows: the studies that only used ToM tests and single tasks (false belief tasks) for ToM assessment and/or had no description about structure, validity or reliability of their tests. Methodological quality of the selected articles was assessed using the Critical Appraisal Skills Programme (CASP). Result: In primary searching, we found 1237 articles in total databases. After removing duplicates and applying all inclusion and exclusion criteria, we selected 11 tests for this systematic review. Conclusion: There were a few valid, reliable and comprehensive ToM tests for normal preschool children. However, we had limitations concerning the included articles. The defined ToM tests were different in populations, tasks, mode of presentations, scoring, mode of responses, times and other variables. Also, they had various validities and reliabilities. Therefore, it is recommended that the researchers and clinicians select the ToM tests according to their psychometric characteristics, validity and reliability. PMID:27006666
Application of the Modified Erikson Psychosocial Stage Inventory: 25 Years in Review.

PubMed

Darling-Fisher, Cynthia S

2018-04-01

The Modified Erikson Psychosocial Stage Inventory (MEPSI) is an 80-item, comprehensive measure of psychosocial development based on Erikson's theory with published reliability and validity data. Although designed as a comprehensive measure, some researchers have used individual subscales for specific developmental stages as a measure; however, these subscale reliability scores have not been generally shared. This article reviewed the literature to evaluate the use of the MEPSI: the major research questions, samples/populations studied, and individual subscale and total reliability and validity data. In total, 16 research articles (1990-2011) and 28 Dissertations/Theses (1991-2016) from nursing, social work, psychology, criminal justice, and religious studies met criteria. Results support the MEPSI's global reliability (aggregate scores ranged .89-.99) and validity in terms of consistent patterns of changes observed in the predicted direction. Reliability and validity data for individual subscales were more variable. Limitations of the tool and recommendations for possible revision and future research are addressed.
Validity and reliability of four language mapping paradigms.

PubMed

Wilson, Stephen M; Bautista, Alexa; Yen, Melodie; Lauderdale, Stefanie; Eriksson, Dana K

2017-01-01

Language areas of the brain can be mapped in individual participants with functional MRI. We investigated the validity and reliability of four language mapping paradigms that may be appropriate for individuals with acquired aphasia: sentence completion, picture naming, naturalistic comprehension, and narrative comprehension. Five neurologically normal older adults were scanned on each of the four paradigms on four separate occasions. Validity was assessed in terms of whether activation patterns reflected the known typical organization of language regions, that is, lateralization to the left hemisphere, and involvement of the left inferior frontal gyrus and the left middle and/or superior temporal gyri. Reliability (test-retest reproducibility) was quantified in terms of the Dice coefficient of similarity, which measures overlap of activations across time points. We explored the impact of different absolute and relative voxelwise thresholds, a range of cluster size cutoffs, and limitation of analyses to a priori potential language regions. We found that the narrative comprehension and sentence completion paradigms offered the best balance of validity and reliability. However, even with optimal combinations of analysis parameters, there were many scans on which known features of typical language organization were not demonstrated, and test-retest reproducibility was only moderate for realistic parameter choices. These limitations in terms of validity and reliability may constitute significant limitations for many clinical or research applications that depend on identifying language regions in individual participants.
Cross-cultural adaptation and validation of the Korean Toronto Extremity Salvage Score for extremity sarcoma.

PubMed

Kim, Han-Soo; Yun, JiYeon; Kang, Seungcheol; Han, Ilkyu

2015-07-01

A Korean version of Toronto Extremity Salvage Score (TESS), a widely used disease-specific patient-reported questionnaire for assessing physical function of sarcoma patients, has not been developed. 1) to translate and cross-culturally adapt the TESS into Korean, and 2) to examine its comprehensibility, reliability and validity. TESS was translated into Korean, then translated back into English, and reviewed by a committee to develop the consensus version of the Korean TESS. The Korean TESS was administered to 126 patients to examine its comprehensibility, reliability, and validity. Comprehensibility was high, as the patients rated questions as "easy" or "very easy" in 96% for the TESS lower extremity (LE) and in 97% for the TESS upper extremity (UE). Test-retest reliability with intraclass coefficient (0.874 for LE and 0.979 for UE) and internal consistency with Cronbach's alpha (0.978 for LE and 0.989 for UE) were excellent. Korean TESS correlated with the MSTS score (r = 0.772 for LE and r = 0.635 for UE), and physical functioning domain of EORTC-CLQ C30 (r = 0.840 for LE and r = 0.630 for UE). Our study suggests that Korean version of the TESS is a comprehensible, reliable, and valid instrument to measure patient-reported functional outcome in patients with extremity sarcoma. © 2015 Wiley Periodicals, Inc.
Reliability and validity of the Incontinence Quiz-Turkish version.

PubMed

Kara, Kerime C; Çıtak Karakaya, İlkim; Tunalı, Nur; Karakaya, Mehmet G

2018-01-01

The aim of this study was to investigate the reliability and validity of the Turkish version of the Incontinence Quiz, which was developed by Branch et al. (1994), to assess women's knowledge of and attitudes toward urinary incontinence. Comprehensibility of the Turkish version of the 14-item Incontinence Quiz, which was prepared following translation-back translation procedures, was tested on a pilot group of eight women, and its internal reliability, test-retest reliability and construct validity were assessed in 150 women who attended the gynecology clinics of three hospitals in İçel, Turkey. Physical and sociodemographic characteristics and presence of incontinence complaints were also recorded. Data were analyzed at the 0.05 alpha level, using SPSS version 22. The scale had good reliability and validity. The internal reliability coefficient (Cronbach α) was 0.80, test-retest correlation coefficients were 0.83-0.94; and with regard to construct validity, Kaiser-Meyer-Olkin coefficient was 0.76 and Barlett sphericity test was 562.777 (P = 0.000). Turkish version of the Incontinence Quiz had a four-factor structure, with Eigenvalues ranging from 1.17 to 4.08. The Incontinence Quiz-Turkish version is a highly comprehensible, reliable and valid scale, which may be used to assess Turkish-speaking women's knowledge of and attitudes toward urinary incontinence. © 2017 Japan Society of Obstetrics and Gynecology.
Measuring Speech Comprehensibility in Students with Down Syndrome

PubMed Central

Woynaroski, Tiffany; Camarata, Stephen

2016-01-01

Purpose There is an ongoing need to develop assessments of spontaneous speech that focus on whether the child's utterances are comprehensible to listeners. This study sought to identify the attributes of a stable ratings-based measure of speech comprehensibility, which enabled examining the criterion-related validity of an orthography-based measure of the comprehensibility of conversational speech in students with Down syndrome. Method Participants were 10 elementary school students with Down syndrome and 4 unfamiliar adult raters. Averaged across-observer Likert ratings of speech comprehensibility were called a ratings-based measure of speech comprehensibility. The proportion of utterance attempts fully glossed constituted an orthography-based measure of speech comprehensibility. Results Averaging across 4 raters on four 5-min segments produced a reliable (G = .83) ratings-based measure of speech comprehensibility. The ratings-based measure was strongly (r > .80) correlated with the orthography-based measure for both the same and different conversational samples. Conclusion Reliable and valid measures of speech comprehensibility are achievable with the resources available to many researchers and some clinicians. PMID:27299989
Development and validation of a Malawian version of the primary care assessment tool.

PubMed

Dullie, Luckson; Meland, Eivind; Hetlevik, Øystein; Mildestvedt, Thomas; Gjesdal, Sturla

2018-05-16

Malawi does not have validated tools for assessing primary care performance from patients' experience. The aim of this study was to develop a Malawian version of Primary Care Assessment Tool (PCAT-Mw) and to evaluate its reliability and validity in the assessment of the core primary care dimensions from adult patients' perspective in Malawi. A team of experts assessed the South African version of the primary care assessment tool (ZA-PCAT) for face and content validity. The adapted questionnaire underwent forward and backward translation and a pilot study. The tool was then used in an interviewer administered cross-sectional survey in Neno district, Malawi, to test validity and reliability. Exploratory factor analysis was performed on a random half of the sample to evaluate internal consistency, reliability and construct validity of items and scales. The identified constructs were then tested with confirmatory factor analysis. Likert scale assumption testing and descriptive statistics were done on the final factor structure. The PCAT-Mw was further tested for intra-rater and inter-rater reliability. From the responses of 631 patients, a 29-item PCAT-Mw was constructed comprising seven multi-item scales, representing five primary care dimensions (first contact, continuity, comprehensiveness, coordination and community orientation). All the seven scales achieved good internal consistency, item-total correlations and construct validity. Cronbach's alpha coefficient ranged from 0.66 to 0.91. A satisfactory goodness of fit model was achieved (GFI = 0.90, CFI = 0.91, RMSEA = 0.05, PCLOSE = 0.65). The full range of possible scores was observed for all scales. Scaling assumptions tests were achieved for all except the two comprehensiveness scales. Intra-class correlation coefficient (ICC) was 0.90 (n = 44, 95% CI 0.81-0.94, p < 0.001) for intra-rater reliability and 0.84 (n = 42, 95% CI 0.71-0.96, p < 0.001) for inter-rater reliability. Comprehensive metric analyses supported the reliability and validity of PCAT-Mw in assessing the core concepts of primary care from adult patients' experience. This tool could be used for health service research in primary care in Malawi.
The CPT Reading Comprehension Test: A Validity Study.

ERIC Educational Resources Information Center

Napoli, Anthony R.; Raymond, Lanette A.; Coffey, Cheryl A.; Bosco, Diane M.

1998-01-01

Describes a study done at Suffolk County Community College (New York) that assessed the validity of the College Board's Computerized Placement Test in Reading Comprehension (CPT-R) by comparing test results of 1,154 freshmen with the results of the Degree of Power Reading Test. Results confirmed the CPT-R's reliability in identifying basic…
Developing and Validating Proof Comprehension Tests in Undergraduate Mathematics

ERIC Educational Resources Information Center

Mejía-Ramos, Juan Pablo; Lew, Kristen; de la Torre, Jimmy; Weber, Keith

2017-01-01

In this article, we describe and illustrate the process by which we developed and validated short, multiple-choice, reliable tests to assess undergraduate students' comprehension of three mathematical proofs. We discuss the purpose for each stage and how it benefited the design of our instruments. We also suggest ways in which this process could…
[The Basel Screening Instrument for Psychosis (BSIP): development, structure, reliability and validity].

PubMed

Riecher-Rössler, A; Aston, J; Ventura, J; Merlo, M; Borgwardt, S; Gschwandtner, U; Stieglitz, R-D

2008-04-01

Early detection of psychosis is of growing clinical importance. So far there is, however, no screening instrument for detecting individuals with beginning psychosis in the atypical early stages of the disease with sufficient validity. We have therefore developed the Basel Screening Instrument for Psychosis (BSIP) and tested its feasibility, interrater-reliability and validity. Aim of this paper is to describe the development and structure of the instrument, as well as to report the results of the studies on reliability and validity. The instrument was developed based on a comprehensive search of literature on the most important risk factors and early signs of schizophrenic psychoses. The interraterreliability study was conducted on 24 psychiatric cases. Validity was tested based on 206 individuals referred to our early detection clinic from 3/1/2000 until 2/28/2003. We identified seven categories of relevance for early detection of psychosis and used them to construct a semistructured interview. Interrater-reliability for high risk individuals was high (Kappa .87). Predictive validity was comparable to other, more comprehensive instruments: 16 (32 %) of 50 individuals classified as being at risk for psychosis by the BSIP have in fact developed frank psychosis within an follow-up period of two to five years. The BSIP is the first screening instrument for the early detection of psychosis which has been validated based on transition to psychosis. The BSIP is easy to use by experienced psychiatrists and has a very good interrater-reliability and predictive validity.
Development of a Peer Teaching-Assessment Program and a Peer Observation and Evaluation Tool

PubMed Central

Trujillo, Jennifer M.; Barr, Judith; Gonyeau, Michael; Van Amburgh, Jenny A.; Matthews, S. James; Qualters, Donna

2008-01-01

Objectives To develop a formalized, comprehensive, peer-driven teaching assessment program and a valid and reliable assessment tool. Methods A volunteer taskforce was formed and a peer-assessment program was developed using a multistep, sequential approach and the Peer Observation and Evaluation Tool (POET). A pilot study was conducted to evaluate the efficiency and practicality of the process and to establish interrater reliability of the tool. Intra-class correlation coefficients (ICC) were calculated. Results ICCs for 8 separate lectures evaluated by 2-3 observers ranged from 0.66 to 0.97, indicating good interrater reliability of the tool. Conclusion Our peer assessment program for large classroom teaching, which includes a valid and reliable evaluation tool, is comprehensive, feasible, and can be adopted by other schools of pharmacy. PMID:19325963
Comprehension of Written Grammar Test: Reliability and Known-Groups Validity Study With Hearing and Deaf and Hard-of-Hearing Students.

PubMed

Cannon, Joanna E; Hubley, Anita M; Millhoff, Courtney; Mazlouman, Shahla

2016-01-01

The aim of the current study was to gather validation evidence for the Comprehension of Written Grammar (CWG; Easterbrooks, 2010) receptive test of 26 grammatical structures of English print for use with children who are deaf and hard of hearing (DHH). Reliability and validity data were collected for 98 participants (49 DHH and 49 hearing) in Grades 2-6. The objectives were to: (a) examine 4-week test-retest reliability data; and (b) provide evidence of known-groups validity by examining expected differences between the groups on the CWG vocabulary pretest and main test, as well as selected structures. Results indicated excellent test-retest reliability estimates for CWG test scores. DHH participants performed statistically significantly lower on the CWG vocabulary pretest and main test than the hearing participants. Significantly lower performance by DHH participants on most expected grammatical structures (e.g., basic sentence patterns, auxiliary "be" singular/plural forms, tense, comparatives, and complementation) also provided known groups evidence. Overall, the findings of this study showed strong evidence of the reliability of scores and known group-based validity of inferences made from the CWG. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
The Reliability, Validity, and Usefulness of the Objective Structured Clinical Examination (OSCE) in Dental Education

ERIC Educational Resources Information Center

Graham, Roseanna

2010-01-01

This study evaluated the reliability, validity, and educational usefulness of a comprehensive, multidisciplinary Objective Structured Clinical Examination (OSCE) in dental education. The OSCE was administered to dental students at the Columbia University College of Dental Medicine (CDM) before they entered clinical training. Participants in this…

Technical Adequacy of the easyCBM Grade 2 Reading Measures. Technical Report #1004

ERIC Educational Resources Information Center

Jamgochian, Elisa; Park, Bitnara Jasmine; Nese, Joseph F. T.; Lai, Cheng-Fei; Saez, Leilani; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald

2010-01-01

In this technical report, we provide reliability and validity evidence for the easyCBM[R] Reading measures for grade 2 (word and passage reading fluency and multiple choice reading comprehension). Evidence for reliability includes internal consistency and item invariance. Evidence for validity includes concurrent, predictive, and construct…
A comprehensive scoring system to measure healthy community design in land use plans and regulations.

PubMed

Maiden, Kristin M; Kaplan, Marina; Walling, Lee Ann; Miller, Patricia P; Crist, Gina

2017-02-01

Comprehensive land use plans and their corresponding regulations play a role in determining the nature of the built environment and community design, which are factors that influence population health and health disparities. To determine the level in which a plan addresses healthy living and active design, there is a need for a systematic, reliable and valid method of analyzing and scoring health-related content in plans and regulations. This paper describes the development and validation of a scoring tool designed to measure the strength and comprehensiveness of health-related content found in land use plans and the corresponding regulations. The measures are scored based on the presence of a specific item and the specificity and action-orientation of language. To establish reliability and validity, 42 land use plans and regulations from across the United States were scored January-April 2016. Results of the psychometric analysis indicate the scorecard is a reliable scoring tool for land use plans and regulations related to healthy living and active design. Intraclass correlation coefficients (ICC) scores showed strong inter-rater reliability for total strength and comprehensiveness. ICC scores for total implementation scores showed acceptable consistency among scorers. Cronbach's alpha values for all focus areas were acceptable. Strong content validity was measured through a committee vetting process. The development of this tool has far-reaching implications, bringing standardization of measurement to the field of land use plan assessment, and paving the way for systematic inclusion of health-related design principles, policies, and requirements in land use plans and their corresponding regulations. Copyright © 2016 Elsevier Inc. All rights reserved.
The Outpatient Experience Questionnaire of comprehensive public hospital in China: development, validity and reliability.

PubMed

Hu, Yinhuan; Zhang, Zixia; Xie, Jinzhu; Wang, Guanping

2017-02-01

The objective of this study is to describe the development of the Outpatient Experience Questionnaire (OPEQ) and to assess the validity and reliability of the scale. Literature review, patient interviews, Delphi method and Cross-sectional validation survey. Six comprehensive public hospitals in China. The survey was carried out on a sample of 600 outpatients. Acceptability of the questionnaire was assessed according to the overall response rate, item non-response rate and the average completion time. Correlation coefficients and confirmatory factor analysis were used to test construct validity. Delphi method was used to assess the content validity of the questionnaire. Cronbach's coefficient alpha and split-half reliability coefficient were used to estimate the internal reliability of the questionnaire. The overall response rate was 97.2% and the item non-response rate ranged from 0% to 0.3%. The mean completion time was 6 min. The Spearman correlations of item-total score ranged from 0.466 to 0.765. The results of confirmatory factor analysis showed that all items had factor loadings above 0.40 and the dimension intercorrelation ranged from 0.449 to 0.773, the goodness of fit of the questionnaire was reasonable. The overall authority grade of expert consultation was 0.80 and Kendall's coefficient of concordance W was 0.186. The Cronbach's coefficients alpha of six dimensions ranged from 0.708 to 0.895, the split-half reliability coefficient (Spearman-Brown coefficient) was 0.969. The OPEQ is a promising instrument covering the most important aspects which influence outpatient experiences of comprehensive public hospital in China. It has good evidence for acceptability, validity and reliability. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
A Curriculum-Based Measure of Language Comprehension for Preschoolers: Reliability and Validity of the Assessment of Story Comprehension

ERIC Educational Resources Information Center

Spencer, Trina D.; Goldstein, Howard; Kelley, Elizabeth Spencer; Sherman, Amber; McCune, Luke

2017-01-01

Despite research demonstrating the importance of language comprehension to later reading abilities, curriculum-based measures to assess language comprehension abilities in preschoolers remain lacking. The Assessment of Story Comprehension (ASC) features brief, child-relevant stories and a series of literal and inferential questions with a focus on…
A Curriculum-Based Measure of Language Comprehension for Preschoolers: Reliability and Validity of the Assessment of Story Comprehension

ERIC Educational Resources Information Center

Spencer, Trina D.; Goldstein, Howard; Kelley, Elizabeth Spencer; Sherman, Amber; McCune, Luke

2017-01-01

Despite research demonstrating the importance of language comprehension to later reading abilities, curriculum based measures to assess language comprehension abilities in preschoolers remain lacking. The Assessment of Story Comprehension (ASC) features brief, child-relevant stories and a series of literal and inferential questions with a focus on…
Confirmatory Factor Analysis of the TerraNova Comprehensive Tests of Basic Skills/5

ERIC Educational Resources Information Center

Stevens, Joseph J.; Zvoch, Keith

2007-01-01

Confirmatory factor analysis was used to explore the internal validity of scores on the TerraNova Comprehensive Tests of Basic Skills/5 using samples from a southwestern school district and standardization samples reported by the publisher. One of the strengths claimed for battery-type achievement tests is provision of reliable and valid samples…
The International AIDS Questionnaire-English Version (IAQ-E): Assessing the Validity and Reliability

ERIC Educational Resources Information Center

Davis, Cindy; Sloan, Melissa; MacMaster, Samuel; Hughes, Leslie

2006-01-01

In order to address HIV infection among college students, a comprehensive measure is needed that can be used with samples from culturally diverse populations. Therefore, this paper assessed the reliability and validity of an HIV/AIDS questionnaire that measures fours dimensions of HIV/AIDS awareness--factual knowledge, prejudice, personal risk,…
Validation of the comprehensive feeding practices questionnaire in parents of preschool children in Brazil.

PubMed

Warkentin, Sarah; Mais, Laís Amaral; Latorre, Maria do Rosário Dias de Oliveira; Carnell, Susan; Taddei, José Augusto de Aguiar Carrazedo

2016-07-19

Recent national surveys in Brazil have demonstrated a decrease in the consumption of traditional food and a parallel increase in the consumption of ultra-processed food, which has contributed to a rise in obesity prevalence in all age groups. Environmental factors, especially familial factors, have a strong influence on the food intake of preschool children, and this has led to the development of psychometric scales to measure parents' feeding practices. The aim of this study was to test the validity of a translated and adapted Comprehensive Feeding Practices Questionnaire in a sample of Brazilian preschool-aged children enrolled in private schools. A transcultural adaptation process was performed in order to develop a modified questionnaire (43 items). After piloting, the questionnaire was sent to parents, along with additional questions about family characteristics. Test-retest reliability was assessed in one of the schools. Factor analysis with oblique rotation was performed. Internal reliability was tested using Cronbach's alpha and correlations between factors, discriminant validity using marker variables of child's food intake, and convergent validity via correlations with parental perceptions of perceived responsibility for feeding and concern about the child's weight were also performed. The final sample consisted of 402 preschool children. Factor analysis resulted in a final questionnaire of 43 items distributed over 6 factors. Cronbach alpha values were adequate (0.74 to 0.88), between-factor correlations were low, and discriminant validity and convergent validity were acceptable. The modified CFPQ demonstrated significant internal reliability in this urban Brazilian sample. Scale validation within different cultures is essential for a more comprehensive understanding of parental feeding practices for preschoolers.
Reliability and validity of advanced theory-of-mind measures in middle childhood and adolescence.

PubMed

Hayward, Elizabeth O; Homer, Bruce D

2017-09-01

Although theory-of-mind (ToM) development is well documented for early childhood, there is increasing research investigating changes in ToM reasoning in middle childhood and adolescence. However, the psychometric properties of most advanced ToM measures for use with older children and adolescents have not been firmly established. We report on the reliability and validity of widely used, conventional measures of advanced ToM with this age group. Notable issues with both reliability and validity of several of the measures were evident in the findings. With regard to construct validity, results do not reveal a clear empirical commonality between tasks, and, after accounting for comprehension, developmental trends were evident in only one of the tasks investigated. Statement of contribution What is already known on this subject? Second-order false belief tasks have acceptable internal consistency. The Eyes Test has poor internal consistency. Validity of advanced theory-of-mind tasks is often based on the ability to distinguish clinical from typical groups. What does this study add? This study examines internal consistency across six widely used advanced theory-of-mind tasks. It investigates validity of tasks based on comprehension of items by typically developing individuals. It further assesses construct validity, or commonality between tasks. © 2017 The British Psychological Society.
Reliability and validity of the Computerized Comprehension Task (CCT): data from American English and Mexican Spanish infants*

PubMed Central

FRIEND, MARGARET; KEPLINGER, MELANIE

2017-01-01

Early language comprehension may be one of the most important predictors of developmental risk. The need for performance-based assessment is predicated on limitations identified in the exclusive use of parent report and on the need for a performance measure with which to assess the convergent validity of parent report of comprehension. Child performance data require the development of procedures to facilitate infant attention and compliance. Forty infants (20 at 1;4 and 20 at 1;8) acquiring English completed a standard picture book task and the same task was administered on a touch-sensitive screen. The computerized task significantly improved task attention, compliance and performance. Reliability was high, indicating that infants were not responding randomly. Convergent validity with parent report and 4-month stability was substantial. Preliminary data extending this approach to Mexican-Spanish are presented. Results are discussed in terms of the promise of this technique for clinical and research settings and the potential influences of cultural factors on performance. PMID:18300430
Development and validation of the Multidimensional Home Environment Scale (MHES) for adolescents and their mothers.

PubMed

Tabbakh, Tamara; Freeland-Graves, Jeanne

2016-08-01

The home environment is an important setting for the development of weight status in adolescence. At present a limited number of valid and reliable tools are available to evaluate the weight-related comprehensive home environment of this population. The goal of this research was to develop the Multidimensional Home Environment Scale which measures multiple components of the home. It includes psychological, social, and environmental domains from the perspective of an adolescent and the mother. Items were generated based on a literature review and then assessed for content validity by an expert panel and focus group in the target population. Internal consistency reliability was determined using Cronbach's α. Principal components analysis with varimax rotation was employed for assessment of construct validity. Temporal stability was evaluated using paired sample t-tests and bivariate correlations between responses at two different times, 1-2weeks apart. Associations between adolescent and mother responses were utilized for convergent validity. The final versions contained 32-items for adolescents and 36-items for mothers; these were administered to 218 adolescents and mothers. The subscales on the questionnaires exhibited high construct validity, internal consistency reliability (adolescent: α=0.82, mother: α=0.83) and test-retest reliability (adolescent: r=0.90, p<0.01; mother: r=0.91, p<0.01). Total home environment scores were computed, with greater scores reflecting a better health environment. These results verify the utility of the MHES as a valid and reliable instrument. This promising tool can be utilized to capture the comprehensive home environment of young adolescents (11-14years old). Copyright © 2016 Elsevier Ltd. All rights reserved.
A validation of the construct and reliability of an emotional intelligence scale applied to nursing students1

PubMed Central

Espinoza-Venegas, Maritza; Sanhueza-Alvarado, Olivia; Ramírez-Elizondo, Noé; Sáez-Carrillo, Katia

2015-01-01

OBJECTIVE: The current study aimed to validate the construct and reliability of an emotional intelligence scale. METHOD: The Trait Meta-Mood Scale-24 was applied to 349 nursing students. The process included content validation, which involved expert reviews, pilot testing, measurements of reliability using Cronbach's alpha, and factor analysis to corroborate the validity of the theoretical model's construct. RESULTS: Adequate Cronbach coefficients were obtained for all three dimensions, and factor analysis confirmed the scale's dimensions (perception, comprehension, and regulation). CONCLUSION: The Trait Meta-Mood Scale is a reliable and valid tool to measure the emotional intelligence of nursing students. Its use allows for accurate determinations of individuals' abilities to interpret and manage emotions. At the same time, this new construct is of potential importance for measurements in nursing leadership; educational, organizational, and personal improvements; and the establishment of effective relationships with patients. PMID:25806642
Beyond Screening and Progress Monitoring: An Examination of the Reliability and Concurrent Validity of Maze Comprehension Assessments for Fourth-Grade Students

ERIC Educational Resources Information Center

Brasher, Casey F.

2017-01-01

Reading comprehension assessments often lack instructional utility because they do not accurately pinpoint why a student has difficulty. The varying formats, directions, and response requirements of comprehension assessments lead to differential measurement of underlying skills and contribute to noted amounts of unshared variance among tests. Maze…
Reliability and validity of the C-BiLLT: a new instrument to assess comprehension of spoken language in young children with cerebral palsy and complex communication needs.

PubMed

Geytenbeek, Joke J; Mokkink, Lidwine B; Knol, Dirk L; Vermeulen, R Jeroen; Oostrom, Kim J

2014-09-01

In clinical practice, a variety of diagnostic tests are available to assess a child's comprehension of spoken language. However, none of these tests have been designed specifically for use with children who have severe motor impairments and who experience severe difficulty when using speech to communicate. This article describes the process of investigating the reliability and validity of the Computer-Based Instrument for Low Motor Language Testing (C-BiLLT), which was specifically developed to assess spoken Dutch language comprehension in children with cerebral palsy and complex communication needs. The study included 806 children with typical development, and 87 nonspeaking children with cerebral palsy and complex communication needs, and was designed to provide information on the psychometric qualities of the C-BiLLT. The potential utility of the C-BiLLT as a measure of spoken Dutch language comprehension abilities for children with cerebral palsy and complex communication needs is discussed.
Validity and reliability of Internet-based physiotherapy assessment for musculoskeletal disorders: a systematic review.

PubMed

Mani, Suresh; Sharma, Shobha; Omar, Baharudin; Paungmali, Aatit; Joseph, Leonard

2017-04-01

Purpose The purpose of this review is to systematically explore and summarise the validity and reliability of telerehabilitation (TR)-based physiotherapy assessment for musculoskeletal disorders. Method A comprehensive systematic literature review was conducted using a number of electronic databases: PubMed, EMBASE, PsycINFO, Cochrane Library and CINAHL, published between January 2000 and May 2015. The studies examined the validity, inter- and intra-rater reliabilities of TR-based physiotherapy assessment for musculoskeletal conditions were included. Two independent reviewers used the Quality Appraisal Tool for studies of diagnostic Reliability (QAREL) and the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool to assess the methodological quality of reliability and validity studies respectively. Results A total of 898 hits were achieved, of which 11 articles based on inclusion criteria were reviewed. Nine studies explored the concurrent validity, inter- and intra-rater reliabilities, while two studies examined only the concurrent validity. Reviewed studies were moderate to good in methodological quality. The physiotherapy assessments such as pain, swelling, range of motion, muscle strength, balance, gait and functional assessment demonstrated good concurrent validity. However, the reported concurrent validity of lumbar spine posture, special orthopaedic tests, neurodynamic tests and scar assessments ranged from low to moderate. Conclusion TR-based physiotherapy assessment was technically feasible with overall good concurrent validity and excellent reliability, except for lumbar spine posture, orthopaedic special tests, neurodynamic testa and scar assessment.
The comprehensive care project: measuring physician performance in ambulatory practice.

PubMed

Holmboe, Eric S; Weng, Weifeng; Arnold, Gerald K; Kaplan, Sherrie H; Normand, Sharon-Lise; Greenfield, Sheldon; Hood, Sarah; Lipner, Rebecca S

2010-12-01

To investigate the feasibility, reliability, and validity of comprehensively assessing physician-level performance in ambulatory practice. Ambulatory-based general internists in 13 states participated in the assessment. We assessed physician-level performance, adjusted for patient factors, on 46 individual measures, an overall composite measure, and composite measures for chronic, acute, and preventive care. Between- versus within-physician variation was quantified by intraclass correlation coefficients (ICC). External validity was assessed by correlating performance on a certification exam. Medical records for 236 physicians were audited for seven chronic and four acute care conditions, and six age- and gender-appropriate preventive services. Performance on the individual and composite measures varied substantially within (range 5-86 percent compliance on 46 measures) and between physicians (ICC range 0.12-0.88). Reliabilities for the composite measures were robust: 0.88 for chronic care and 0.87 for preventive services. Higher certification exam scores were associated with better performance on the overall (r = 0.19; p<.01), chronic care (r = 0.14, p = .04), and preventive services composites (r = 0.17, p = .01). Our results suggest that reliable and valid comprehensive assessment of the quality of chronic and preventive care can be achieved by creating composite measures and by sampling feasible numbers of patients for each condition. © Health Research and Educational Trust.
Non-Technical Skills for Surgeons (NOTSS): Critical appraisal of its measurement properties.

PubMed

Jung, James J; Borkhoff, Cornelia M; Jüni, Peter; Grantcharov, Teodor P

2018-02-17

To critically appraise the development and measurement properties, including sensibility, reliability, and validity of the Non-Technical Skills of Surgeons (NOTSS) system. Articles that described development process of the NOTSS system were identified. Relevant primary studies that presented evidence of reliability and validity were identified through a comprehensive literature review. NOTSS was developed through robust item generation and reduction strategies. It was shown to have good content validity, acceptability, and feasibility. Inter-rater reliability increased with greater expertise and number of assessors. Studies demonstrated evidence of cross-sectional construct validity, in that the tool was able to differentiate known groups of varied non-technical skill levels. Evidence of longitudinal construct validity also existed to demonstrate that NOTSS detected changes in non-technical skills before and after targeted training. In populations and settings presented in our critical appraisal, NOTSS provided reliable and valid measurements of intraoperative non-technical skills of surgeons. Copyright © 2018 Elsevier Inc. All rights reserved.
Recommendations to guide revision of the Guides to the Evaluation of Permanent Impairment. American Medical Association.

PubMed

Spieler, E A; Barth, P S; Burton, J F; Himmelstein, J; Rudolph, L

2000-01-26

The American Medical Association's Guides to the Evaluation of Permanent Impairment, Fourth Edition, is the most commonly used tool in the United States for rating permanent impairments for disability systems. The Guides, currently undergoing revision, has been the focus of considerable controversy. Criticisms have focused on 2 areas: internal deficiencies, including the lack of a comprehensive, valid, reliable, unbiased, and evidence-based system for rating impairments; and the way in which workers' compensation systems use the ratings, resulting in inappropriate compensation. We focus on the internal deficiencies and recommend that the Guides remains a tool for evaluation of permanent impairment, not disability. To maintain wide acceptance of the Guides, its authors need to improve the validity, internal consistency, and comprehensiveness of the ratings; document reliability and reproducibility of the results; and make the Guides easily comprehensible and accessible to physicians.
A comprehensive review of the psychometric properties of the Drug Abuse Screening Test.

PubMed

Yudko, Errol; Lozhkina, Olga; Fouts, Adriana

2007-03-01

This article reviews the reliability and the validity of the (10-, 20-, and 28-item) Drug Abuse Screening Test (DAST). The reliability and the validity of the adolescent version of the DAST are also reviewed. An extensive literature review was conducted using the Medline and Psychinfo databases from the years 1982 to 2005. All articles that addressed the reliability and the validity of the DAST were examined. Publications in which the DAST was used as a screening tool but had no data on its psychometric properties were not included. Descriptive information about each version of the test, as well as discussion of the empirical literature that has explored measures of the reliability and the validity of the DAST, has been included. The DAST tended to have moderate to high levels of test-retest, interitem, and item-total reliabilities. The DAST also tended to have moderate to high levels of validity, sensitivity, and specificity. In general, all versions of the DAST yield satisfactory measures of reliability and validity for use as clinical or research tools. Furthermore, these tests are easy to administer and have been used in a variety of populations.
Reliability and Factorial Validity of the Artes de Lenguaje.

ERIC Educational Resources Information Center

Powers, Stephen; And Others

1984-01-01

Spanish speaking first graders were administered the Artes de Lenguage (ADL)--a Spanish, criterion-referenced, language arts test. Reliability analyses indicated the adequacy of three of the four subscales (Phonetic Analysis, Vocabulary Development, Comprehension Skills, and General Skills). A principal factors analysis of the intercorrelation…

Developing an assessment of fire-setting to guide treatment in secure settings: the St Andrew's Fire and Arson Risk Instrument (SAFARI).

PubMed

Long, Clive G; Banyard, Ellen; Fulton, Barbara; Hollin, Clive R

2014-09-01

Arson and fire-setting are highly prevalent among patients in secure psychiatric settings but there is an absence of valid and reliable assessment instruments and no evidence of a significant approach to intervention. To develop a semi-structured interview assessment specifically for fire-setting to augment structured assessments of risk and need. The extant literature was used to frame interview questions relating to the antecedents, behaviour and consequences necessary to formulate a functional analysis. Questions also covered readiness to change, fire-setting self-efficacy, the probability of future fire-setting, barriers to change, and understanding of fire-setting behaviour. The assessment concludes with indications for assessment and a treatment action plan. The inventory was piloted with a sample of women in secure care and was assessed for comprehensibility, reliability and validity. Staff rated the St Andrews Fire and Risk Instrument (SAFARI) as acceptable to patients and easy to administer. SAFARI was found to be comprehensible by over 95% of the general population, to have good acceptance, high internal reliability, substantial test-retest reliability and validity. SAFARI helps to provide a clear explanation of fire-setting in terms of the complex interplay of antecedents and consequences and facilitates the design of an individually tailored treatment programme in sympathy with a cognitive-behavioural approach. Further studies are needed to verify the reliability and validity of SAFARI with male populations and across settings.
Diagnostic Criteria for Temporomandibular Disorders (DC/TMD) for Clinical and Research Applications: Recommendations of the International RDC/TMD Consortium Network* and Orofacial Pain Special Interest Group†

PubMed Central

Schiffman, Eric; Ohrbach, Richard; Truelove, Edmond; Look, John; Anderson, Gary; Goulet, Jean-Paul; List, Thomas; Svensson, Peter; Gonzalez, Yoly; Lobbezoo, Frank; Michelotti, Ambra; Brooks, Sharon L.; Ceusters, Werner; Drangsholt, Mark; Ettlin, Dominik; Gaul, Charly; Goldberg, Louis J.; Haythornthwaite, Jennifer A.; Hollender, Lars; Jensen, Rigmor; John, Mike T.; De Laat, Antoon; de Leeuw, Reny; Maixner, William; van der Meulen, Marylee; Murray, Greg M.; Nixdorf, Donald R.; Palla, Sandro; Petersson, Arne; Pionchon, Paul; Smith, Barry; Visscher, Corine M.; Zakrzewska, Joanna; Dworkin, Samuel F.

2015-01-01

Aims The original Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Axis I diagnostic algorithms have been demonstrated to be reliable. However, the Validation Project determined that the RDC/TMD Axis I validity was below the target sensitivity of ≥ 0.70 and specificity of ≥ 0.95. Consequently, these empirical results supported the development of revised RDC/TMD Axis I diagnostic algorithms that were subsequently demonstrated to be valid for the most common pain-related TMD and for one temporomandibular joint (TMJ) intra-articular disorder. The original RDC/TMD Axis II instruments were shown to be both reliable and valid. Working from these findings and revisions, two international consensus workshops were convened, from which recommendations were obtained for the finalization of new Axis I diagnostic algorithms and new Axis II instruments. Methods Through a series of workshops and symposia, a panel of clinical and basic science pain experts modified the revised RDC/TMD Axis I algorithms by using comprehensive searches of published TMD diagnostic literature followed by review and consensus via a formal structured process. The panel's recommendations for further revision of the Axis I diagnostic algorithms were assessed for validity by using the Validation Project's data set, and for reliability by using newly collected data from the ongoing TMJ Impact Project—the follow-up study to the Validation Project. New Axis II instruments were identified through a comprehensive search of the literature providing valid instruments that, relative to the RDC/TMD, are shorter in length, are available in the public domain, and currently are being used in medical settings. Results The newly recommended Diagnostic Criteria for TMD (DC/TMD) Axis I protocol includes both a valid screener for detecting any pain-related TMD as well as valid diagnostic criteria for differentiating the most common pain-related TMD (sensitivity ≥ 0.86, specificity ≥ 0.98) and for one intra-articular disorder (sensitivity of 0.80 and specificity of 0.97). Diagnostic criteria for other common intra-articular disorders lack adequate validity for clinical diagnoses but can be used for screening purposes. Inter-examiner reliability for the clinical assessment associated with the validated DC/TMD criteria for pain-related TMD is excellent (kappa ≥ 0.85). Finally, a comprehensive classification system that includes both the common and less common TMD is also presented. The Axis II protocol retains selected original RDC/TMD screening instruments augmented with new instruments to assess jaw function as well as behavioral and additional psychosocial factors. The Axis II protocol is divided into screening and comprehensive self-report instrument sets. The screening instruments’ 41 questions assess pain intensity, pain-related disability, psychological distress, jaw functional limitations, and parafunctional behaviors, and a pain drawing is used to assess locations of pain. The comprehensive instruments, composed of 81 questions, assess in further detail jaw functional limitations and psychological distress as well as additional constructs of anxiety and presence of comorbid pain conditions. Conclusion The recommended evidence-based new DC/TMD protocol is appropriate for use in both clinical and research settings. More comprehensive instruments augment short and simple screening instruments for Axis I and Axis II. These validated instruments allow for identification of patients with a range of simple to complex TMD presentations. PMID:24482784
Cross-cultural adaptation and psychometric assessment of the Chinese version of the comprehensive needs assessment tool for cancer caregivers (CNAT-C).

PubMed

Zhang, Yin-Ping; Zhao, Xin-Shuang; Zhang, Bei; Zhang, Lu-Lu; Ni, Chun-Ping; Hao, Nan; Shi, Chang-Bei; Porr, Caroline

2015-07-01

The comprehensive needs assessment tool for cancer caregivers (CNAT-C) is a systematic and comprehensive needs assessment tool for the family caregivers. The purpose of this project was twofold: (1) to adapt the CNAT-C to Mainland China's cultural context and (2) to evaluate the psychometric properties of the newly adapted Chinese CNAT-C. Cross-cultural adaptation of the original CNAT-C was performed according to published guidelines. A pilot study was conducted in Mainland China with 30 Chinese family cancer caregivers. A subsequent validation study was conducted with 205 Chinese cancer caregivers from Mainland China. Construct validity was determined through exploratory and confirmatory factor analyses. Reliability was determined using internal consistency and test-retest reliability. The split-half coefficient for the overall Chinese CNAT-C scale was 0.77. Principal component analysis resulted in an eight-factor structure explaining 68.11 % of the total variance. The comparative fit index (CFI) was 0.91 from the modified model confirmatory factor analysis. The Chi-square divided by degrees of freedom was 1.98, and the root mean squared error of approximation (RMSEA) was 0.079. In relation to the known-group validation, significant differences were found in the Chinese CNAT-C scale according to various caregiver characteristics. Internal consistency was high for the Chinese CNAT-C reaching a Cronbach α value of 0.94. Test-retest reliability was 0.85. The newly adapted Chinese CNAT-C scale possesses adequate validity, test-retest reliability, and internal consistency and therefore may be used to ascertain holistic health and support needs of cancer patients' family caregivers in Mainland China.
Psychometric Properties of the "Miranda Rights Comprehension Instruments" with a Juvenile Justice Sample

ERIC Educational Resources Information Center

Goldstein, Naomi E. Sevin; Romaine, Christina L. Riggs; Zelle, Heather; Kalbeitzer, Rachel; Mesiarik, Constance; Wolbransky, Melinda

2011-01-01

This article describes the psychometric properties of the "Miranda Rights Comprehension Instruments", the revised version of Grisso's "Miranda" instruments. The original instruments demonstrated good reliability and validity in a normative sample. The revised instruments updated the content of the original instruments and were…
Development and content validity testing of a comprehensive classification of diagnoses for pediatric nurse practitioners.

PubMed

Burns, C

1991-01-01

Pediatric nurse practitioners (PNPs) need an integrated, comprehensive classification that includes nursing, disease, and developmental diagnoses to effectively describe their practice. No such classification exists. Further, methodologic studies to help evaluate the content validity of any nursing taxonomy are unavailable. A conceptual framework was derived. Then 178 diagnoses from the North American Nursing Diagnosis Association (NANDA) 1986 list, selected diagnoses from the International Classification of Diseases, the Diagnostic and Statistical Manual, Third Revision, and others were selected. This framework identified and listed, with definitions, three domains of diagnoses: Developmental Problems, Diseases, and Daily Living Problems. The diagnoses were ranked using a 4-point scale (4 = highly related to 1 = not related) and were placed into the three domains. The rating scale was assigned by a panel of eight expert pediatric nurses. Diagnoses that were assigned to the Daily Living Problems domain were then sorted into the 11 Functional Health patterns described by Gordon (1987). Reliability was measured using proportions of agreement and Kappas. Content validity of the groups created was measured using indices of content validity and average congruency percentages. The experts used a new method to sort the diagnoses in a new way that decreased overlaps among the domains. The Developmental and Disease domains were judged reliable and valid. The Daily Living domain of nursing diagnoses showed marginally acceptable validity with acceptable reliability. Six Functional Health Patterns were judged reliable and valid, mixed results were determined for four categories, and the Coping/Stress Tolerance category was judged reliable but not valid using either test. There were considerable differences between the panel's, Gordon's (1987), and NANDA's clustering of NANDA diagnoses. This study defines the diagnostic practice of nurses from a holistic, patient-centered perspective. It is the first study to use quantitative methods to test a diagnostic classification system for nursing. The classification model could also be adapted for other nurse specialties.
Validity and Reliability of the 8-Item Work Limitations Questionnaire.

PubMed

Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C

2017-12-01

Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
The Nature Contact Questionnaire: a measure of healthy workplace exposure.

PubMed

Largo-Wight, Erin; Chen, W William; Dodd, Virginia; Weiler, Robert

2011-01-01

Understanding and promoting healthy workplaces is an important and growing area of interest in occupational health. Nature contact is a central component to the study of and promotion of healthy places. Previous findings suggest that nature contact influences health via stress appraisal process. Currently, there are no known comprehensive valid and reliable measures of nature contact, which presents obstacles to research and worksite health promotion. This study was designed to develop and test an instrument to measure nature contact at work, entitled the Nature Contact Questionnaire (NCQ), 16-item self-reported checklist to measure actual exposure. A sample of 503 (30% response rate) office staff completed the questionnaire. Office staff were sent an email with a link to the electronic survey twice, two weeks apart. Content and construct validity (KMO=0.68), internal consistency (Alpha=0.64), and test-retest reliability (r=0.85, p<0.01) were established. The NCQ is the first known comprehensive, reliable and valid survey to measure nature contact, which allows research to compare forms of nature contact to best inform practice and design of healthy places.
Comprehension of Japanese oral care-related terms among caregivers and nurses, as assessed using a newly developed instrument.

PubMed

Shibata, Satoko; Stegaroiu, Roxana; Nakazawa, Akari; Ohuchi, Akitsugu

2017-03-01

(i) To assess comprehension of oral care-related terms among caregivers and nurses working at long-term care facilities, using a newly developed test; (ii) to analyse the effect of participant characteristics on their comprehension. Effective mutual communication between dental professionals and caregivers/nurses is essential for providing information on daily oral care for institutionalised elders. A 36-item word-knowledge test in Japanese was developed to assess comprehension of oral care-related terms. The test was administered to a convenience sample of 236 nursing staff (198 caregivers and 38 nurses) at six long-term care facilities in Niigata City, Japan, and its reliability and validity were verified. Associations of participant characteristics with their responses were investigated by multiple regression analysis. Mean percentage of correct responses (accuracy rate) for nursing staff was approximately 62% (highest for oral care products and lowest for prosthodontic terms). Test internal reliability was high (Cronbach's alpha >0.8). Concurrent validity (test ability to distinguish between characteristically different groups) was confirmed. Mean accuracy rate was significantly higher among nurses (78.5 ± 19.3%) than among caregivers (58.7 ± 22.8%), and among respondents with interest in oral care (64.2 ± 21.1%) than among those with no such interest (51.5 ± 28.9%). The word-knowledge test was valid and reliable for nursing staff of six long-term care facilities in Niigata City. Their comprehension was low for perioral and intraoral structures, related symptom and disease names, and prosthodontics terms related to oral care. Understanding of oral care-related terms among the nursing staff was related to their occupation and interest in oral care. © 2016 John Wiley & Sons A/S and The Gerodontology Association. Published by John Wiley & Sons Ltd.
The Well-Being 5: Development and Validation of a Diagnostic Instrument to Improve Population Well-being

PubMed Central

Sears, Lindsay E.; Agrawal, Sangeeta; Sidney, James A.; Castle, Patricia H.; Coberley, Carter R.; Witters, Dan; Pope, James E.; Harter, James K.

2014-01-01

Abstract Building upon extensive research from 2 validated well-being instruments, the objective of this research was to develop and validate a comprehensive and actionable well-being instrument that informs and facilitates improvement of well-being for individuals, communities, and nations. The goals of the measure were comprehensiveness, validity and reliability, significant relationships with health and performance outcomes, and diagnostic capability for intervention. For measure development and validation, questions from the Well-being Assessment and Wellbeing Finder were simultaneously administered as a test item pool to over 13,000 individuals across 3 independent samples. Exploratory factor analysis was conducted on a random selection from the first sample and confirmed in the other samples. Further evidence of validity was established through correlations to the established well-being scores from the Well-Being Assessment and Wellbeing Finder, and individual outcomes capturing health care utilization and productivity. Results showed the Well-Being 5 score comprehensively captures the known constructs within well-being, demonstrates good reliability and validity, significantly relates to health and performance outcomes, is diagnostic and informative for intervention, and can track and compare well-being over time and across groups. With this tool, well-being deficiencies within a population can be effectively identified, prioritized, and addressed, yielding the potential for substantial improvements to the health status, performance, and quality of life for individuals and cost savings for stakeholders. (Population Health Management 2014;17:357–365) PMID:24892873
The approved Italian version of the comprehensive assessment of at-risk mental states (CAARMS-ITA): Field test and psychometric features.

PubMed

Pelizza, Lorenzo; Paterlini, Federica; Azzali, Silvia; Garlassi, Sara; Scazza, Ilaria; Pupo, Simona; Simmons, Magenta; Nelson, Barnaby; Raballo, Andrea

2018-04-26

The Comprehensive Assessment of At-Risk Mental States (CAARMS) was specifically developed to assess and detect young people at ultra-high risk (UHR) of developing psychosis. The current study was undertaken to test the reliability and validity of the authorized Italian version of the CAARMS (CAARMS-ITA) in a help-seeking population. Psychometric properties of the CAARMS-ITA were established using a sample of 223 Italian adolescents and young adults aged between 13 and 35 years, who were divided into 3 groups according to the CAARMS criteria: UHR-negative individuals (UHR [-]; n = 64), UHR-positive (UHR [+]; n = 55) and individuals with a first-episode psychosis (FEP; n = 104). The CAARMS-ITA's reliability was tested measuring interrater reliability and internal consistency. Construct validity was tested comparing the Positive and Negative Syndrome Scale (PANSS) and CAARMS-ITA subscale scores across groups (ie, UHR [-], UHR [+] and FEP). For concurrent validity, we studied correlations between symptoms of the CAARMS-ITA and their equivalents in the PANSS. Finally, the predictive validity was examined by following up with UHR [+] individuals. The 12-month transition rate to psychosis was calculated. The CAARMS-ITA showed good interrater reliability. The PANSS "Positive Symptoms" subscale scores in UHR [+] individuals were intermediate between FEP and UHR [-] groups. The positive and negative symptoms scores of the CAARMS-ITA significantly correlated with the corresponding scores of the PANSS. After 12 months, 4 of 41 (9.8%) UHR [+] individuals had transitioned to psychosis. The CAARMS-ITA is a reliable and valid instrument for assessing and detecting at-risk mental states in Italian clinical settings. It also appears to be helpful in the prediction of psychosis transition. © 2018 John Wiley & Sons Australia, Ltd.
The Comprehensive Care Project: Measuring Physician Performance in Ambulatory Practice

PubMed Central

Holmboe, Eric S; Weng, Weifeng; Arnold, Gerald K; Kaplan, Sherrie H; Normand, Sharon-Lise; Greenfield, Sheldon; Hood, Sarah; Lipner, Rebecca S

2010-01-01

Objective To investigate the feasibility, reliability, and validity of comprehensively assessing physician-level performance in ambulatory practice. Data Sources/Study Setting Ambulatory-based general internists in 13 states participated in the assessment. Study Design We assessed physician-level performance, adjusted for patient factors, on 46 individual measures, an overall composite measure, and composite measures for chronic, acute, and preventive care. Between- versus within-physician variation was quantified by intraclass correlation coefficients (ICC). External validity was assessed by correlating performance on a certification exam. Data Collection/Extraction Methods Medical records for 236 physicians were audited for seven chronic and four acute care conditions, and six age- and gender-appropriate preventive services. Principal Findings Performance on the individual and composite measures varied substantially within (range 5–86 percent compliance on 46 measures) and between physicians (ICC range 0.12–0.88). Reliabilities for the composite measures were robust: 0.88 for chronic care and 0.87 for preventive services. Higher certification exam scores were associated with better performance on the overall (r = 0.19; p <.01), chronic care (r = 0.14, p = .04), and preventive services composites (r = 0.17, p = .01). Conclusions Our results suggest that reliable and valid comprehensive assessment of the quality of chronic and preventive care can be achieved by creating composite measures and by sampling feasible numbers of patients for each condition. PMID:20819110
Evaluation of written medicine information: validation of the Consumer Information Rating Form.

PubMed

Koo, Michelle M; Krass, Ines; Aslani, Parisa

2007-06-01

The Consumer Information Rating Form (CIRF) was developed as a direct method for measuring consumers' perceptions of the comprehensibility, utility, and design quality of written medicine information. The validity and reliability of the CIRF were evaluated in a small convenience consumer sample in the US. Its validity and reliability have yet to be established in a larger sample of consumers who are on chronic therapy in different settings. To determine the validity and reliability of the CIRF in Australian consumers on chronic therapy. Consumers read and subsequently evaluated a Consumer Medicine Information (CMI) leaflet for one of their own medications, using an adapted version of the CIRF. The construct validity and internal reliability of the adapted version of the CIRF were tested using principal components analysis (PCA) and Cronbach's alpha, respectively. The adapted CIRF was completed by 282 consumers (aged 19-90 y; median 66; interquartile range 53-75 y; 60.3% females). Most respondents spoke primarily English at home (85.5%), had attained at least secondary education (84%), and had adequate health literacy levels (88.2%). Consumers rated CMI easy to read, understand, and navigate, but less easy to remember and keep. Most also found it to be useful and to contain the right amount of information. The design aspects also scored favorably, although CMI did score relatively poorly in terms of its attractiveness and tone (whether alarming or not). PCA yielded 3 factors (explaining 59.3% of the total variance) identical to those in the original CIRF: comprehensibility, utility, and design quality. All factors demonstrated good reliability (Cronbach's alpha 0.74, 0.92, and 0.75, respectively). The CIRF appears to be a robust instrument for assessing consumers' perceptions of written medicine information. However, validity always needs to be reestablished when using a previously validated measure in a different population.
Development and validation of the Vellore Occupational Therapy Evaluation Scale to assess functioning in people with mental illness.

PubMed

Samuel, Reema; Russell, Paul Ss; Paraseth, Tapan Kumar; Ernest, Sharmila; Jacob, K S

2016-08-26

Available occupational therapy assessment scales focus on specific areas of functioning. There is a need for comprehensive evaluation of diverse aspects of functioning in people with mental illness. To develop a comprehensive assessment scale to evaluate diverse aspects of functioning among people with mental illness and to assess its validity and reliability. Available instruments, which evaluate diverse aspects of functioning in people with mental illness, were retrieved. Relevant items, which evaluate specific functions, were selected by a committee of mental health experts and combined to form a comprehensive instrument. Face and content validity and feasibility were assessed and the new instrument was piloted among 60 patients with mental illness. The final version of the instrument was employed in 151 consecutive clients, between 18 and 60 years of age, who were also assessed using Global Assessment of Functioning (GAF), Occupational Therapy Task Observation Scale (OTTOS), Social Functioning Questionnaire (SFQ), Rosenberg Self Esteem Scale (RSES) and Pai and Kapur Family Burden Interview Schedule (FBIS) by two therapists. The inter-rater reliability and test-retest reliability of the new instrument (Vellore Occupational Therapy Evaluation Scale (VOTES)) were also evaluated. The new scale had good internal consistency (Cronbach's alpha = .817), inter-rater reliability .928 (.877-.958) and test-retest reliability .928 (.868-.961). The correlation between the general behaviour domain (Pearson's Correlation Coefficient [PCC] = -.763, p = .000), task behaviour (PCC = -.829, p = .000), social skills (PCC = -.351, p = .000), intrapersonal skills (PCC = -.208, p = .010), instrumental activities of daily living (IADL) (PCC = -.329, p = .038) and leisure activities (PCC = -.433, p = .005) scores of VOTES with the corresponding domains in the scales used for comparison was statistically significant. The correlation between the total score of VOTES and the total scores of OTTOS, SFQ and RSES was also statistically significant suggesting convergent validity. The correlation between the total score of VOTES with the total score of FBI is not statistically significant, implying good divergent validity. VOTES seems to be a promising tool to assess overall functioning of people with mental illness. © The Author(s) 2016.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

PubMed

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

2018-01-01

The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
Comprehensive Comparison of Self-Administered Questionnaires for Measuring Quantitative Autistic Traits in Adults

ERIC Educational Resources Information Center

Nishiyama, Takeshi; Suzuki, Masako; Adachi, Katsunori; Sumi, Satoshi; Okada, Kensuke; Kishino, Hirohisa; Sakai, Saeko; Kamio, Yoko; Kojima, Masayo; Suzuki, Sadao; Kanne, Stephen M.

2014-01-01

We comprehensively compared all available questionnaires for measuring quantitative autistic traits (QATs) in terms of reliability and construct validity in 3,147 non-clinical and 60 clinical subjects with normal intelligence. We examined four full-length forms, the Subthreshold Autism Trait Questionnaire (SATQ), the Broader Autism Phenotype…
Small Business: Action Needed to Determine Whether DOD’s Comprehensive Subcontracting Plan Test Program Should Be Made Permanent

DTIC Science & Technology

2015-11-01

collected. We determined that the methodologies were valid and the data were reliable for our purposes. In addition, we interviewed DOD officials and...of contracts and cost of labor involved in preparing program documentation, to arrive at the estimates for savings. To validate the data used in the...studies to be reasonable, and the data were sufficiently reliable for our purposes. We interviewed officials from DOD and 5 of the 12 Test Program
The Development and Validation of a Rapid Assessment Tool of Primary Care in China

PubMed Central

Mei, Jie; Liang, Yuan; Shi, LeiYu; Zhao, JingGe; Wang, YuTan; Kuang, Li

2016-01-01

Introduction. With Chinese health care reform increasingly emphasizing the importance of primary care, the need for a tool to evaluate primary care performance and service delivery is clear. This study presents a methodology for a rapid assessment of primary care organizations and service delivery in China. Methods. The study translated and adapted the Primary Care Assessment Tool-Adult Edition (PCAT-AE) into a Chinese version to measure core dimensions of primary care, namely, first contact, continuity, comprehensiveness, and coordination. A cross-sectional survey was conducted to assess the validity and reliability of the Chinese Rapid Primary Care Assessment Tool (CR-PCAT). Eight community health centers in Guangdong province have been selected to participate in the survey. Results. A total of 1465 effective samples were included for data analysis. Eight items were eliminated following principal component analysis and reliability testing. The principal component analysis extracted five multiple-item scales (first contact utilization, first contact accessibility, ongoing care, comprehensiveness, and coordination). The tests of scaling assumptions were basically met. Conclusion. The standard psychometric evaluation indicates that the scales have achieved relatively good reliability and validity. The CR-PCAT provides a rapid and reliable measure of four core dimensions of primary care, which could be applied in various scenarios. PMID:26885509
[Development of a Japanese version of a short form of the Profile of Emotional Competence].

PubMed

Nozaki, Yuki; Koyasu, Masuo

2015-06-01

Emotional competence refers to individual differences in the ability to appropriately identity, understand, express, regulate, and utilize one's own emotions and those of others. This study developed a Japanese version of a short form of the Profile of Emotional Competence, a measure that allows the comprehensive assessment of intra- and interpersonal emotional competence with shorter items, and investigated its reliability and validity. In Study 1, we selected items for a short version and compared it with the full scale in terms of scores, internal consistency, and validity. In Study 2, we examined the short form's test-retest reliability. Results supported the original two-factor model and the measure had adequate reliability and validity. We discuss the construct validity and practical applicability of the short form of the Profile of Emotional Competence.
Validation of the M. D. Anderson Symptom Inventory multiple myeloma module

PubMed Central

2013-01-01

Background The symptom burden associated with multiple myeloma (MM) is often severe. Presently, no instrument comprehensively assesses disease-related and treatment-related symptoms in patients with MM. We sought to validate a module of the M. D. Anderson Symptom Inventory (MDASI) developed specifically for patients with MM (MDASI-MM). Methods The MDASI-MM was developed with clinician input, cognitive debriefing, and literature review, and administered to 132 patients undergoing induction chemotherapy or stem cell transplantation. We demonstrated the MDASI-MM’s reliability (Cronbach α values); criterion validity (item and subscale correlations between the MDASI-MM and the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ-C30) and the EORTC MM module (QLQ-MY20)), and construct validity (differences between groups by performance status). Ratings from transplant patients were examined to demonstrate the MDASI-MM’s sensitivity in detecting the acute worsening of symptoms post-transplantation. Results The MDASI-MM demonstrated excellent correlations with subscales of the 2 EORTC instruments, strong ability to distinguish clinically different patient groups, high sensitivity in detecting change in patients’ performance status, and high reliability. Cognitive debriefing confirmed that the MDASI-MM encompasses the breadth of symptoms relevant to patients with MM. Conclusion The MDASI-MM is a valid, reliable, comprehensive-yet-concise tool that is recommended as a uniform symptom assessment instrument for patients with MM. PMID:23384030
An adaptive semantic matching paradigm for reliable and valid language mapping in individuals with aphasia.

PubMed

Wilson, Stephen M; Yen, Melodie; Eriksson, Dana K

2018-04-17

Research on neuroplasticity in recovery from aphasia depends on the ability to identify language areas of the brain in individuals with aphasia. However, tasks commonly used to engage language processing in people with aphasia, such as narrative comprehension and picture naming, are limited in terms of reliability (test-retest reproducibility) and validity (identification of language regions, and not other regions). On the other hand, paradigms such as semantic decision that are effective in identifying language regions in people without aphasia can be prohibitively challenging for people with aphasia. This paper describes a new semantic matching paradigm that uses an adaptive staircase procedure to present individuals with stimuli that are challenging yet within their competence, so that language processing can be fully engaged in people with and without language impairments. The feasibility, reliability and validity of the adaptive semantic matching paradigm were investigated in sixteen individuals with chronic post-stroke aphasia and fourteen neurologically normal participants, in comparison to narrative comprehension and picture naming paradigms. All participants succeeded in learning and performing the semantic paradigm. Test-retest reproducibility of the semantic paradigm in people with aphasia was good (Dice coefficient = 0.66), and was superior to the other two paradigms. The semantic paradigm revealed known features of typical language organization (lateralization; frontal and temporal regions) more consistently in neurologically normal individuals than the other two paradigms, constituting evidence for validity. In sum, the adaptive semantic matching paradigm is a feasible, reliable and valid method for mapping language regions in people with aphasia. © 2018 Wiley Periodicals, Inc.

Reliability and validity of the transport and physical activity questionnaire (TPAQ) for assessing physical activity behaviour.

PubMed

Adams, Emma J; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C; Cooper, Ashley R; Ogilvie, David

2014-01-01

No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, p<0.001), fair but non-significant agreement for moderate physical activity (r = 0.24, p = 0.09) and fair agreement for MVPA (r = 0.27, p = 0.05). Bland-Altman analysis showed a mean overestimation of MVPA of 87.6 min/week (p = 0.02) (95% limits of agreement -447.1 to +622.3 min/week). The TPAQ provides a more comprehensive assessment of physical activity and travel behaviours and may be suitable for wider use. Its physical activity summary measures have comparable reliability and validity to those of similar existing questionnaires.
Validity and reliability of Chinese version of Adult Carer Quality of Life questionnaire (AC-QoL) in family caregivers of stroke survivors

PubMed Central

Li, Yingshuang; Ding, Chunge

2017-01-01

The Adult Carer Quality of Life questionnaire (AC-QoL) is a reliable and valid instrument used to assess the quality of life (QoL) of adult family caregivers. We explored the psychometric properties and tested the reliability and validity of a Chinese version of the AC-QoL with reliability and validity testing in 409 Chinese stroke caregivers. We used item-total correlation and extreme group comparison to do item analysis. To evaluate its reliability, we used a test-retest reliability approach, intraclass correlation coefficient (ICC), together with Cronbach’s alpha and model-based internal consistency index; to evaluate its validity, we used scale content validity, confirmatory factor analysis (CFA) and exploratory factor analysis (EFA) via principal component analysis with varimax rotation. We found that the CFA did not in fact confirm the original factor model and our EFA yielded a 31-item measure with a five-factor model. In conclusions, although some items performed differently in our analysis of the original English language version and our Chinese language version, our translated AC-QoL is a reliable and valid tool which can be used to assess the quality of life of stroke caregivers in mainland China. Chinese version AC-QoL is a comprehensive and good measurement to understand caregivers and has the potential to be a screening tool to assess QoL of caregiver. PMID:29131845
Validity and reliability of a novel measure of activity performance and participation.

PubMed

Murgatroyd, Phil; Karimi, Leila

2016-01-01

To develop and evaluate an innovative clinician-rated measure, which produces global numerical ratings of activity performance and participation. Repeated measures study with 48 community-dwelling participants investigating clinical sensibility, comprehensiveness, practicality, inter-rater reliability, responsiveness, sensitivity and concurrent validity with Barthel Index. Important clinimetric characteristics including comprehensiveness and ease of use were rated >8/10 by clinicians. Inter-rater reliability was excellent on the summary scores (intraclass correlation of 0.95-0.98). There was good evidence that the new outcome measure distinguished between known high and low functional scoring groups, including both responsiveness to change and sensitivity at the same time point in numerous tests. Concurrent validity with the Barthel Index was fair to high (Spearman Rank Order Correlation 0.32-0.85, p > 0.05). The new measure's summary scores were nearly twice as responsive to change compared with the Barthel Index. Other more detailed data could also be generated by the new measure. The Activity Performance Measure is an innovative outcome instrument that showed good clinimetric qualities in this initial study. Some of the results were strong, given the sample size, and further trial and evaluation is appropriate. Implications for Rehabilitation The Activity Performance Measure is an innovative outcome measure covering activity performance and participation. In an initial evaluation, it showed good clinimetric qualities including responsiveness to change, sensitivity, practicality, clinical sensibility, item coverage, inter-rater reliability and concurrent validity with the Barthel Index. Further trial and evaluation is appropriate.
Development and testing of mobile technology for community park improvements: validity and reliability of the eCPAT application with youth.

PubMed

Besenyi, Gina M; Diehl, Paul; Schooley, Benjamin; Turner-McGrievy, Brie M; Wilcox, Sara; Stanis, Sonja A Wilhelm; Kaczynski, Andrew T

2016-12-01

Creation of mobile technology environmental audit tools can provide a more interactive way for youth to engage with communities and facilitate participation in health promotion efforts. This study describes the development and validity and reliability testing of an electronic version of the Community Park Audit Tool (eCPAT). eCPAT consists of 149 items and incorporates a variety of technology benefits. Criterion-related validity and inter-rater reliability were evaluated using data from 52 youth across 47 parks in Greenville County, SC. A large portion of items (>70 %) demonstrated either fair or moderate to perfect validity and reliability. All but six items demonstrated excellent percent agreement. The eCPAT app is a user-friendly tool that provides a comprehensive assessment of park environments. Given the proliferation of smartphones, tablets, and other electronic devices among both adolescents and adults, the eCPAT app has potential to be distributed and used widely for a variety of health promotion purposes.
Preliminary Evidence for the Validity of the New Test of Everyday Reading Comprehension

ERIC Educational Resources Information Center

Wheldall, Kevin; McMurtry, Sarah

2014-01-01

The Test of Everyday Reading Comprehension (TERC) has recently been presented as an addition to the armoury of tests available for assessing the skills of low-progress readers. While comparison data for students of different ages are presented together with evidence for high test reliability, there is, as yet, no published evidence for its…
Validation of the Arabic Version of the Iowa Infant Feeding Attitude Scale among Lebanese Women.

PubMed

Charafeddine, Lama; Tamim, Hani; Soubra, Marwa; de la Mora, Arlene; Nabulsi, Mona

2016-05-01

There is need in the Arab world for validated instruments that can reliably assess infant feeding attitudes among women. The 17-item Iowa Infant Feeding Attitude Scale (IIFAS) has consistently shown good reliability and validity in different cultures and the ability to predict breastfeeding intention and exclusivity. This study assessed the psychometric properties of the Arabic version of the IIFAS (IIFAS-A). After translating to classical Arabic and back-translating to English, the IIFAS-A was pilot tested among 20 women for comprehension, clarity, length, and cultural appropriateness. The IIFAS-A was then validated among 170 women enrolled in a breastfeeding promotion and support clinical trial in Lebanon. The IIFAS-A showed acceptable internal consistency reliability (Cronbach's α = 0.640), with principal components analysis revealing that it is unidimensional. The 17 items had good interitem reliabilities ranging between 0.599 and 0.665. The number of breastfed children was the only predictor of the overall IIFAS-A score in a multivariate stepwise regression model (β = 1.531, P < .0001). The 17-item IIFAS-A is a reliable and valid instrument for measuring women's infant feeding attitudes in the Arab context. © The Author(s) 2015.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD

PubMed Central

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A

2018-01-01

Purpose The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. Conclusion The TIRE measures of MIP, SMIP and ID have excellent test–retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP. PMID:29805255
Comprehensive comparison of self-administered questionnaires for measuring quantitative autistic traits in adults.

PubMed

Nishiyama, Takeshi; Suzuki, Masako; Adachi, Katsunori; Sumi, Satoshi; Okada, Kensuke; Kishino, Hirohisa; Sakai, Saeko; Kamio, Yoko; Kojima, Masayo; Suzuki, Sadao; Kanne, Stephen M

2014-05-01

We comprehensively compared all available questionnaires for measuring quantitative autistic traits (QATs) in terms of reliability and construct validity in 3,147 non-clinical and 60 clinical subjects with normal intelligence. We examined four full-length forms, the Subthreshold Autism Trait Questionnaire (SATQ), the Broader Autism Phenotype Questionnaire, the Social Responsiveness Scale2-Adult Self report (SRS2-AS), and the Autism-Spectrum Quotient (AQ). The SRS2-AS and the AQ each had several short forms that we also examined, bringing the total to 11 forms. Though all QAT questionnaires showed acceptable levels of test-retest reliability, the AQ and SRS2-AS, including their short forms, exhibited poor internal consistency and discriminant validity, respectively. The SATQ excelled in terms of classical test theory and due to its short length.
Assessment of the Validity of the Research Diagnostic Criteria for Temporomandibular Disorders: Overview and Methodology

PubMed Central

Schiffman, Eric L.; Truelove, Edmond L.; Ohrbach, Richard; Anderson, Gary C.; John, Mike T.; List, Thomas; Look, John O.

2011-01-01

AIMS The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. An overview is presented, including Axis I and II methodology and descriptive statistics for the study participant sample. This paper details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. Validity testing for the Axis II biobehavioral instruments was based on previously validated reference standards. METHODS The Axis I reference standards were based on the consensus of 2 criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion exam reliability was also assessed within study sites. RESULTS Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion exam agreement with reference standards was excellent (k ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). CONCLUSION The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods. PMID:20213028
Validity and reliability of the Turkish Migraine Disability Assessment (MIDAS) questionnaire.

PubMed

Ertaş, Mustafa; Siva, Aksel; Dalkara, Turgay; Uzuner, Nevzat; Dora, Babür; Inan, Levent; Idiman, Fethi; Sarica, Yakup; Selçuki, Deniz; Sirin, Hadiye; Oğuzhanoğlu, Atilla; Irkeç, Ceyla; Ozmenoğlu, Mehmet; Ozbenli, Taner; Oztürk, Musa; Saip, Sabahattin; Neyal, Münife; Zarifoğlu, Mehmet

2004-09-01

The aim of this study is to assess the comprehensibility, internal consistency, patient-physician reliability, test-retest reliability, and validity of Turkish version of Migraine Disability Assessment (MIDAS) questionnaire in patients with headache. MIDAS questionnaire has been developed by Stewart et al and shown to be reliable and valid to determine the degree of disability caused by migraine. This study was designed as a national multicenter study to demonstrate the reliability and validity of Turkish version of MIDAS questionnaire. Patients applying to 17 Neurology Clinics in Turkey were evaluated at the baseline (visit 1), week 4 (visit 2), and week 12 (visit 3) visits in terms of disease severity and comprehensibility, internal consistency, test-retest reliability, and validity of MIDAS. Since the severity of the disease has been found to change significantly at visit 2 compared to visit 1, test-retest reliability was assessed using the MIDAS scores of a subgroup of patients whose disease severity remained unchanged (up to +/-3 days difference in the number of days with headache between visits 1 and 2). A total of 306 patients (86.2% female, mean age: 35.0 +/- 9.8 years) were enrolled into the study. A total of 65.7%, 77.5%, 82.0% of patients reported that "they had fully understood the MIDAS questionnaire" in visits 1, 2, and 3, respectively. A highly positive correlation was found between physician and patient and the applied total MIDAS scores in all three visits (Spearman correlation coefficients were R= 0.87, 0.83, and 0.90, respectively, P <.001). Internal consistency of MIDAS was assessed using Cronbach's alpha and was found at acceptable (>0.7) or excellent (>0.8) levels in both patient and physician applied MIDAS scores, respectively. Total MIDAS score showed good test-retest reliability (R= 0.68). Both the number of days with headache and the total MIDAS scores were positively correlated at all visits with correlation coefficients between 0.47 and 0.63. There was also a moderate degree of correlation (R= 0.54) between the total MIDAS score at week 12 and the number of days with headache at visit 2 + visit 3, which quantify headache-related disability over a 3-month period similar to MIDAS questionnaire. These findings demonstrated that the Turkish translation is equivalent to the English version of MIDAS in terms of internal consistency, test-retest reliability, and validity. Physicians can reliably use the Turkish translation of the MIDAS questionnaire in defining the severity of illness and its treatment strategy when applied as a self-administered report by migraine patients themselves.
Development and Validation of the Caring Loneliness Scale.

PubMed

Karhe, Liisa; Kaunonen, Marja; Koivisto, Anna-Maija

2016-12-01

The Caring Loneliness Scale (CARLOS) includes 5 categories derived from earlier qualitative research. This article assesses the reliability and construct validity of a scale designed to measure patient experiences of loneliness in a professional caring relationship. Statistical analysis with 4 different sample sizes included Cronbach's alpha and exploratory factor analysis with principal axis factoring extraction. The sample size of 250 gave the most useful and comprehensible structure, but all 4 samples yielded underlying content of loneliness experiences. The initial 5 categories were reduced to 4 factors with 24 items and Cronbach's alpha ranging from .77 to .90. The findings support the reliability and validity of CARLOS for the assessment of Finnish breast cancer and heart surgery patients' experiences but as all instruments, further validation is needed.
The Research Diagnostic Criteria for Temporomandibular Disorders. I: overview and methodology for assessment of validity.

PubMed

Schiffman, Eric L; Truelove, Edmond L; Ohrbach, Richard; Anderson, Gary C; John, Mike T; List, Thomas; Look, John O

2010-01-01

The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. The aim of this article is to provide an overview of the project's methodology, descriptive statistics, and data for the study participant sample. This article also details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. The Axis I reference standards were based on the consensus of two criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion examination reliability was also assessed within study sites. Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas > or = 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion examiner agreement with reference standards was excellent (k > or = 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods.
Factor Analysis Methods and Validity Evidence: A Systematic Review of Instrument Development across the Continuum of Medical Education

ERIC Educational Resources Information Center

Wetzel, Angela Payne

2011-01-01

Previous systematic reviews indicate a lack of reporting of reliability and validity evidence in subsets of the medical education literature. Psychology and general education reviews of factor analysis also indicate gaps between current and best practices; yet, a comprehensive review of exploratory factor analysis in instrument development across…
First year progress report on the development of the Texas flexible pavement database.

DOT National Transportation Integrated Search

2008-01-01

Comprehensive and reliable databases are essential for the development, validation, and calibration of any pavement : design and rehabilitation system. These databases should include material properties, pavement structural : characteristics, highway...
Creating High Reliability in Health Care Organizations

PubMed Central

Pronovost, Peter J; Berenholtz, Sean M; Goeschel, Christine A; Needham, Dale M; Sexton, J Bryan; Thompson, David A; Lubomski, Lisa H; Marsteller, Jill A; Makary, Martin A; Hunt, Elizabeth

2006-01-01

Objective The objective of this paper was to present a comprehensive approach to help health care organizations reliably deliver effective interventions. Context Reliability in healthcare translates into using valid rate-based measures. Yet high reliability organizations have proven that the context in which care is delivered, called organizational culture, also has important influences on patient safety. Model for Improvement Our model to improve reliability, which also includes interventions to improve culture, focuses on valid rate-based measures. This model includes (1) identifying evidence-based interventions that improve the outcome, (2) selecting interventions with the most impact on outcomes and converting to behaviors, (3) developing measures to evaluate reliability, (4) measuring baseline performance, and (5) ensuring patients receive the evidence-based interventions. The comprehensive unit-based safety program (CUSP) is used to improve culture and guide organizations in learning from mistakes that are important, but cannot be measured as rates. Conclusions We present how this model was used in over 100 intensive care units in Michigan to improve culture and eliminate catheter-related blood stream infections—both were accomplished. Our model differs from existing models in that it incorporates efforts to improve a vital component for system redesign—culture, it targets 3 important groups—senior leaders, team leaders, and front line staff, and facilitates change management—engage, educate, execute, and evaluate for planned interventions. PMID:16898981
Measuring theory of mind in children. Psychometric properties of the ToM Storybooks.

PubMed

Blijd-Hoogewys, E M A; van Geert, P L C; Serra, M; Minderaa, R B

2008-11-01

Although research on Theory-of-Mind (ToM) is often based on single task measurements, more comprehensive instruments result in a better understanding of ToM development. The ToM Storybooks is a new instrument measuring basic ToM-functioning and associated aspects. There are 34 tasks, tapping various emotions, beliefs, desires and mental-physical distinctions. Four studies on the validity and reliability of the test are presented, in typically developing children (n = 324, 3-12 years) and children with PDD-NOS (n = 30). The ToM Storybooks have good psychometric qualities. A component analysis reveals five components corresponding with the underlying theoretical constructs. The internal consistency, test-retest reliability, inter-rater reliability, construct validity and convergent validity are good. The ToM Storybooks can be used in research as well as in clinical settings.
Adaptation and Validation of the Arabic Version of the Infant Breastfeeding Knowledge Questionnaire among Lebanese Women.

PubMed

Tamim, Hani; Ghandour, Lilian A; Shamsedine, Lama; Charafeddine, Lama; Nasser, Fatima; Khalil, Yvette; Nabulsi, Mona

2016-11-01

Valid instruments that can reliably assess maternal breastfeeding knowledge in Arabic-speaking populations are nonexistent. The availability of such an instrument is essential for investigators working in this field. This study aimed to describe the adaptation and validation of the Arabic Breastfeeding Knowledge Questionnaire (BFK-A) from the original 20-item English version. A translated version of the 20-item BFK was validated among 417 Lebanese women after pilot testing for clarity, comprehension, length, and cultural appropriateness. Exploratory factor analysis was run to examine dimensionality of the instrument and Kuder-Richardson-20 (KR-20) was used to assess its internal consistency. The BFK-A is a unidimensional scale with acceptable internal consistency reliability (KR-20 = 0.652) after the exclusion of 4 items. Higher breastfeeding knowledge levels were strongly and statistically significantly associated with higher mean scores for the validated Arabic Iowa Infant Feeding Attitude Scale ( P < .001), thus confirming its construct validity. The Arabic 16-item BFK-A has an acceptable reliability, similar to the original instrument. Further studies are encouraged to confirm the validity of the 16-item BFK-A among other Arab populations. There is also a need to develop more reliable instruments to use in lactation research in this context.
[Research on the reliability and validity of postural workload assessment method and the relation to work-related musculoskeletal disorders of workers].

PubMed

Qin, D L; Jin, X N; Wang, S J; Wang, J J; Mamat, N; Wang, F J; Wang, Y; Shen, Z A; Sheng, L G; Forsman, M; Yang, L Y; Wang, S; Zhang, Z B; He, L H

2018-06-18

To form a new assessment method to evaluate postural workload comprehensively analyzing the dynamic and static postural workload for workers during their work process to analyze the reliability and validity, and to study the relation between workers' postural workload and work-related musculoskeletal disorders (WMSDs). In the study, 844 workers from electronic and railway vehicle manufacturing factories were selected as subjects investigated by using the China Musculoskeletal Questionnaire (CMQ) to form the postural workload comprehensive assessment method. The Cronbach's α, cluster analysis and factor analysis were used to assess the reliability and validity of the new assessment method. Non-conditional Logistic regression was used to analyze the relation between workers' postural workload and WMSDs. Reliability of the assessment method for postural workload: internal consistency analysis results showed that Cronbach's α was 0.934 and the results of split-half reliability indicated that Spearman-Brown coefficient was 0.881 and the correlation coefficient between the first part and the second was 0.787. Validity of the assessment method for postural workload: the results of cluster analysis indicated that square Euclidean distance between dynamic and static postural workload assessment in the same part or work posture was the shortest. The results of factor analysis showed that 2 components were extracted and the cumulative percentage of variance achieved 65.604%. The postural workload score of the different occupational workers showed significant difference (P<0.05) by covariance analysis. The results of nonconditional Logistic regression indicated that alcohol intake (OR=2.141, 95%CI 1.337-3.428) and obesity (OR=3.408, 95%CI 1.629-7.130) were risk factors for WMSDs. The risk for WMSDs would rise as workers' postural workload rose (OR=1.035, 95%CI 1.022-1.048). There was significant different risk for WMSDs in the different groups of workers distinguished by work type, gender and age. Female workers exhibited a higher prevalence for WMSDs (OR=2.626, 95%CI 1.414-4.879) and workers between 30-40 years of age (OR=1.909, 95%CI 1.237-2.946) as compared with those under 30. This method for comprehensively assessing postural workload is reliable and effective when used in assembling workers, and there is certain relation between the postural workload and WMSDs.
Design and validation of a comprehensive fecal incontinence questionnaire.

PubMed

Macmillan, Alexandra K; Merrie, Arend E H; Marshall, Roger J; Parry, Bryan R

2008-10-01

Fecal incontinence can have a profound effect on quality of life. Its prevalence remains uncertain because of stigma, lack of consistent definition, and dearth of validated measures. This study was designed to develop a valid clinical and epidemiologic questionnaire, building on current literature and expertise. Patients and experts undertook face validity testing. Construct validity, criterion validity, and test-retest reliability was undertaken. Construct validity comprised factor analysis and internal consistency of the quality of life scale. The validity of known groups was tested against 77 control subjects by using regression models. Questionnaire results were compared with a stool diary for criterion validity. Test-retest reliability was calculated from repeated questionnaire completion. The questionnaire achieved good face validity. It was completed by 104 patients. The quality of life scale had four underlying traits (factor analysis) and high internal consistency (overall Cronbach alpha = 0.97). Patients and control subjects answered the questionnaire significantly differently (P < 0.01) in known-groups validity testing. Criterion validity assessment found mean differences close to zero. Median reliability for the whole questionnaire was 0.79 (range, 0.35-1). This questionnaire compares favorably with other available instruments, although the interpretation of stool consistency requires further research. Its sensitivity to treatment still needs to be investigated.
Reliability and Validity of the Transport and Physical Activity Questionnaire (TPAQ) for Assessing Physical Activity Behaviour

PubMed Central

Adams, Emma J.; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C.; Cooper, Ashley R.; Ogilvie, David

2014-01-01

Background No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). Methods The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). Results In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, p<0.001), fair but non-significant agreement for moderate physical activity (r = 0.24, p = 0.09) and fair agreement for MVPA (r = 0.27, p = 0.05). Bland-Altman analysis showed a mean overestimation of MVPA of 87.6 min/week (p = 0.02) (95% limits of agreement −447.1 to +622.3 min/week). Conclusion The TPAQ provides a more comprehensive assessment of physical activity and travel behaviours and may be suitable for wider use. Its physical activity summary measures have comparable reliability and validity to those of similar existing questionnaires. PMID:25215510

Research diagnostic criteria for temporomandibular disorders (RDC/TMD): development of image analysis criteria and examiner reliability for image analysis.

PubMed

Ahmad, Mansur; Hollender, Lars; Anderson, Quentin; Kartha, Krishnan; Ohrbach, Richard; Truelove, Edmond L; John, Mike T; Schiffman, Eric L

2009-06-01

As part of the Multisite Research Diagnostic Criteria For Temporomandibular Disorders (RDC/TMD) Validation Project, comprehensive temporomandibular joint diagnostic criteria were developed for image analysis using panoramic radiography, magnetic resonance imaging (MRI), and computerized tomography (CT). Interexaminer reliability was estimated using the kappa (kappa) statistic, and agreement between rater pairs was characterized by overall, positive, and negative percent agreement. Computerized tomography was the reference standard for assessing validity of other imaging modalities for detecting osteoarthritis (OA). For the radiologic diagnosis of OA, reliability of the 3 examiners was poor for panoramic radiography (kappa = 0.16), fair for MRI (kappa = 0.46), and close to the threshold for excellent for CT (kappa = 0.71). Using MRI, reliability was excellent for diagnosing disc displacements (DD) with reduction (kappa = 0.78) and for DD without reduction (kappa = 0.94) and good for effusion (kappa = 0.64). Overall percent agreement for pairwise ratings was >or=82% for all conditions. Positive percent agreement for diagnosing OA was 19% for panoramic radiography, 59% for MRI, and 84% for CT. Using MRI, positive percent agreement for diagnoses of any DD was 95% and of effusion was 81%. Negative percent agreement was >or=88% for all conditions. Compared with CT, panoramic radiography and MRI had poor and marginal sensitivity, respectively, but excellent specificity in detecting OA. Comprehensive image analysis criteria for the RDC/TMD Validation Project were developed, which can reliably be used for assessing OA using CT and for disc position and effusion using MRI.
Research to Establish the Validity, Reliability, and Clinical Utility of a Comprehensive Language Assessment of Mandarin

ERIC Educational Resources Information Center

Liu, Xueman Lucy; de Villiers, Jill; Ning, Chunyan; Rolfhus, Eric; Hutchings, Teresa; Lee, Wendy; Jiang, Fan; Zhang, Yi Wen

2017-01-01

Purpose: With no existing gold standard for comparison, challenges arise for establishing the validity of a new standardized Mandarin language assessment normed in mainland China. Method: A new assessment, Diagnostic Receptive and Expressive Assessment of Mandarin (DREAM), was normed with a stratified sample of 969 children ages 2;6 (years;months)…
Six Years of Comprehensive, Clinical, Performance-Based Assessment Using Standardized Patients at the Southern Illinois University School of Medicine.

ERIC Educational Resources Information Center

Vu, Nu Viet; And Others

1992-01-01

The use of a performance-based assessment of senior medical students' clinical skills utilizing standardized patients was evaluated, with 6,804 student-patient encounters involving 405 students over 6 years. Results provide evidence for test security, content validity, construct validity, reliability, and test ability to discriminate a wide range…
Validity and reliability testing of the Prenatal Psychosocial Profile.

PubMed

Curry, M A; Campbell, R A; Christian, M

1994-04-01

Two studies of low-income pregnant women (N = 179) were done to examine the validity and reliability of the Prenatal Psychosocial Profile (PPP). The PPP, a composite of the Rosenberg Self-Esteem Scale, the Support Behaviors Inventory, and a newly developed measure of stress, is a brief, comprehensive clinical assessment of psychosocial risk during pregnancy. Construct validity of the stress scale was supported by theoretically predicted negative correlations with self-esteem, partner support, and support from others (N = 91). Convergent validity of the stress scale was demonstrated by a correlation of .71 with the Difficult Life Circumstances Scale. Adequate levels of internal consistency were found. Interrelationships between the four subscales were consistent with the underlying conceptualization, and there was beginning evidence of the factorial independence of the subscales.
The James Supportive Care Screening: integrating science and practice to meet the NCCN guidelines for distress management at a Comprehensive Cancer Center.

PubMed

Wells-Di Gregorio, Sharla; Porensky, Emily K; Minotti, Matthew; Brown, Susan; Snapp, Janet; Taylor, Robert M; Adolph, Michael D; Everett, Sherman; Lowther, Kenneth; Callahan, Kelly; Streva, Devita; Heinke, Vicki; Leno, Debra; Flower, Courtney; McVey, Anne; Andersen, Barbara Lee

2013-09-01

Selecting a measure for oncology distress screening can be challenging. The measure must be brief, but comprehensive, capturing patients' most distressing concerns. The measure must provide meaningful coverage of multiple domains, assess symptom and problem-related distress, and ideally be suited for both clinical and research purposes. From March 2006 to August 2012, the James Supportive Care Screening (SCS) was developed and validated in three phases including content validation, factor analysis, and measure validation. Exploratory factor analyses were completed with 596 oncology patients followed by a confirmatory factor analysis with 477 patients. Six factors were identified and confirmed including (i) emotional concerns; (ii) physical symptoms; (iii) social/practical problems; (iv) spiritual problems; (v) cognitive concerns; and (vi) healthcare decision making/communication issues. Subscale evaluation reveals good to excellent internal consistency, test-retest reliability, and convergent, divergent, and predictive validity. Specificity of individual items was 0.90 and 0.87, respectively, for identifying patients with DSM-IV-TR diagnoses of major depression and generalized anxiety disorder. Results support use of the James SCS to quickly detect the most frequent and distressing symptoms and concerns of cancer patients. The James SCS is an efficient, reliable, and valid clinical and research outcomes measure. Copyright © 2013 John Wiley & Sons, Ltd.
Fifteen-Minute Comprehensive Alcohol Risk Survey: Reliability and Validity Across American Indian and White Adolescents

PubMed Central

Komro, Kelli A; Livingston, Melvin D; Kominsky, Terrence K; Livingston, Bethany J; Garrett, Brady A; Molina, Mildred Maldonado; Boyd, Misty L

2015-01-01

Objective: American Indians (AIs) suffer from significant alcohol-related health disparities, and increased risk begins early. This study examined the reliability and validity of measures to be used in a preventive intervention trial. Reliability and validity across racial/ethnic subgroups are crucial to evaluate intervention effectiveness and promote culturally appropriate evidence-based practice. Method: To assess reliability and validity, we used three baseline surveys of high school students participating in a preventive intervention trial within the jurisdictional service area of the Cherokee Nation in northeastern Oklahoma. The 15-minute alcohol risk survey included 16 multi-item scales and one composite score measuring key proximal, primary, and moderating variables. Forty-four percent of the students indicated that they were AI (of whom 82% were Cherokee), including 23% who reported being AI only (n = 435) and 18% both AI and White (n = 352). Forty-seven percent reported being White only (n = 901). Results: Scales were adequately reliable for the full sample and across race/ethnicity defined by AI, AI/White, and White subgroups. Among the full sample, all scales had acceptable internal consistency, with minor variation across race/ethnicity. All scales had extensive to exemplary test–retest reliability and showed minimal variation across race/ethnicity. The eight proximal and two primary outcome scales were each significantly associated with the frequency of alcohol use during the past month in both the cross-sectional and the longitudinal models, providing support for both criterion validity and predictive validity. For most scales, interpretation of the strength of association and statistical significance did not differ between the racial/ethnic subgroups. Conclusions: The results support the reliability and validity of scales of a brief questionnaire measuring risk and protective factors for alcohol use among AI adolescents, primarily members of the Cherokee Nation. PMID:25486402
Development of a quality-of-life instrument for autoimmune bullous disease: the Autoimmune Bullous Disease Quality of Life questionnaire.

PubMed

Sebaratnam, Deshan F; Hanna, Anna Marie; Chee, Shien-ning; Frew, John W; Venugopal, Supriya S; Daniel, Benjamin S; Martin, Linda K; Rhodes, Lesley M; Tan, Jeremy Choon Kai; Wang, Charles Qian; Welsh, Belinda; Nijsten, Tamar; Murrell, Dédée F

2013-10-01

Quality-of-life (QOL) evaluation is an increasingly important outcome measure in dermatology, with disease-specific QOL instruments being the most sensitive to changes in disease status. To develop a QOL instrument specific to autoimmune bullous disease (AIBD). A comprehensive item generation process was used to build a 45-item pilot Autoimmune Bullous Disease Quality of Life (ABQOL) questionnaire, distributed to 70 patients with AIBD. Experts in bullous disease refined the pilot ABQOL before factor analysis was performed to yield the final ABQOL questionnaire of 17 questions. We evaluated validity and reliability across a range of indices. Australian dermatology outpatient clinics and private dermatology practices. PATIENTS AND EXPOSURE: Patients with a histological diagnosis of AIBD. The development of an AIBD-specific QOL instrument. Face and content validity were established through the comprehensive patient interview process and expert review. In terms of convergent validity, the ABQOL was found to have a moderate correlation with scores on the Dermatology Life Quality Index (R = 0.63) and the General Health subscale of the 36-Item Short Form Health Survey (R = 0.69; P = .009) and low correlation with the Pemphigus Disease Area Index (R = 0.42) and Autoimmune Bullous Disease Skin Disorder Intensity Score (R = 0.48). In terms of discriminant validity, the ABQOL was found to be more sensitive than the Dermatology Life Quality Index (P = .02). The ABQOL was also found to be a reliable instrument evaluated by internal consistency (Cronbach α coefficient, 0.84) and test-retest reliability (mean percentage variation, 0.92). The ABQOL has been shown to be a valid and reliable instrument that may serve as an end point in clinical trials. Future work should include incorporating patient weighting on questions to further increase content validity and translation of the measure to other languages. anzctr.org.au Identifier: ACTRN12612000750886.
Psychometric testing of the modified Care Dependency Scale among hospitalized school-aged children in Germany.

PubMed

Tork, Hanan; Lohrmann, Christa; Dassen, Theo

2008-03-01

The objectives of this study were to examine the psychometric properties of the modified Care Dependency Scale in a pediatric setting and to explore the extent of dependency of school-aged children regarding their self-care. The data were collected from 130 hospitalized children, aged 6-12 years. The reliability was determined by Cronbach's alpha, which showed a high level of consistency. The subsequent inter-rater reliability revealed moderate-to-substantial agreement. The criterion-related validity was tested by comparing the sum scores of the Care Dependency Scale for Paediatrics and the Visual Analog Scale. Factor analysis was used to investigate the construct validity and resulted in a one-factor solution. In conclusion, this study provides evidence that the Care Dependency Scale for Paediatrics is a valid and reliable measure that offers a comprehensive assessment from a nursing perspective and enables nurses to help children acquire independence.
The 1-min Screening Test for Reading Problems in College Students: Psychometric Properties of the 1-min TIL.

PubMed

Fernandes, Tânia; Araújo, Susana; Sucena, Ana; Reis, Alexandra; Castro, São Luís

2017-02-01

Reading is a central cognitive domain, but little research has been devoted to standardized tests for adults. We, thus, examined the psychometric properties of the 1-min version of Teste de Idade de Leitura (Reading Age Test; 1-min TIL), the Portuguese version of Lobrot L3 test, in three experiments with college students: typical readers in Experiment 1A and B, dyslexic readers and chronological age controls in Experiment 2. In Experiment 1A, test-retest reliability and convergent validity were evaluated in 185 students. Reliability was >.70, and phonological decoding underpinned 1-min TIL. In Experiment 1B, internal consistency was assessed by presenting two 45-s versions of the test to 19 students, and performance in these versions was significantly associated (r = .78). In Experiment 2, construct validity, criterion validity and clinical utility of 1-min TIL were investigated. A multiple regression analysis corroborated construct validity; both phonological decoding and listening comprehension were reliable predictors of 1-min TIL scores. Logistic regression and receiver operating characteristics analyses revealed the high accuracy of this test in distinguishing dyslexic from typical readers. Therefore, the 1-min TIL, which assesses reading comprehension and potential reading difficulties in college students, has the necessary psychometric properties to become a useful screening instrument in neuropsychological assessment and research. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Development and Preliminary Validation of a Comprehensive Questionnaire to Assess Women’s Knowledge and Perception of the Current Weight Gain Guidelines during Pregnancy

PubMed Central

Ockenden, Holly; Gunnell, Katie; Giles, Audrey; Nerenberg, Kara; Goldfield, Gary; Manyanga, Taru; Adamo, Kristi

2016-01-01

The aim of this study was to develop and validate an electronic questionnaire, the Electronic Maternal Health Survey (EMat Health Survey), related to women’s knowledge and perceptions of the current gestational weight gain guidelines (GWG), as well as pregnancy-related health behaviours. Constructs addressed within the questionnaire include self-efficacy, locus of control, perceived barriers, and facilitators of physical activity and diet, outcome expectations, social environment and health practices. Content validity was examined using an expert panel (n = 7) and pilot testing items in a small sample (n = 5) of pregnant women and recent mothers (target population). Test re-test reliability was assessed among a sample (n = 71) of the target population. Reliability scores were calculated for all constructs (r and intra-class correlation coefficients (ICC)), those with a score of >0.5 were considered acceptable. The content validity of the questionnaire reflects the degree to which all relevant components of excessive GWG risk in women are included. Strong test-retest reliability was found in the current study, indicating that responses to the questionnaire were reliable in this population. The EMat Health Survey adds to the growing body of literature on maternal health and gestational weight gain by providing the first comprehensive questionnaire that can be self-administered and remotely accessed. The questionnaire can be completed in 15–25 min and collects useful data on various social determinants of health and GWG as well as associated health behaviours. This online tool may assist researchers by providing them with a platform to collect useful information in developing and tailoring interventions to better support women in achieving recommended weight gain targets in pregnancy. PMID:27916921
Iranian Health Literacy Questionnaire (IHLQ): An Instrument for Measuring Health Literacy in Iran.

PubMed

Haghdoost, Ali Akbar; Rakhshani, Fatemeh; Aarabi, Mohsen; Montazeri, Ali; Tavousi, Mahmoud; Solimanian, Atoosa; Sarbandi, Fatemeh; Namdar, Hosein; Iranpour, Abedin

2015-06-01

Promoting Health Literacy (HL) is considered as an important goal in strategic plans of many countries. In spite of the necessity for access to valid, reliable and native HL instruments, the number of such instruments in the Persian language is scarce. Moreover, there is no good estimation of HL status in Iran. The aim of this study was to provide a valid, reliable and native instrument to measure and monitor community HL in Iran and also, to provide an estimation of HL status in two Iranian provinces. By applying the multistage cluster sampling, 1080 respondents (540 from each gender) were recruited from Kerman and Mazandaran provinces of Iran, from February to June 2014 to participate in this cross-sectional study. The development of the Iranian Health Literacy Questionnaire (IHLQ) was initiated with a comprehensive review of the literature. Then, face, content and construct validity as well as reliability were determined. Internal consistency and test-retest reliability (ICC) of the factors was in the range of 0.71 to 0.96 and 0.73 to 0.86, respectively. In order to construct validity, Exploratory Factor Analysis (EFA) Kaiser-Meyer-Olkin (KMO) = 0.95 and Bartlett's test result of 3.017 with P < 0.001) with varimax rotation was used. Optimal reduced solution, including 36 items and seven factors, was found in EFA. Five of the factors identified were reading/comprehension skills, individual empowerment, communication/decision-making skills, social empowerment and health knowledge. It was concluded that IHLQ might be a practical and useful tool for investigating HL for Persian language speakers around the world. Since HL is dynamic and its instruments should be regularly revised, further studies are recommended to assess HL with application of IHLQ to detect its potential imperfections.
Developing an interactive mobile phone self-report system for self-management of hypertension. Part 2: content validity and usability.

PubMed

Bengtsson, Ulrika; Kjellgren, Karin; Höfer, Stefan; Taft, Charles; Ring, Lena

2014-10-01

Self-management support tools using technology may improve adherence to hypertension treatment. There is a need for user-friendly tools facilitating patients' understanding of the interconnections between blood pressure, wellbeing and lifestyle. This study aimed to examine comprehension, comprehensiveness and relevance of items, and further to evaluate the usability and reliability of an interactive hypertension-specific mobile phone self-report system. Areas important in supporting self-management and candidate items were derived from five focus group interviews with patients and healthcare professionals (n = 27), supplemented by a literature review. Items and response formats were drafted to meet specifications for mobile phone administration and were integrated into a mobile phone data-capture system. Content validity and usability were assessed iteratively in four rounds of cognitive interviews with patients (n = 21) and healthcare professionals (n = 4). Reliability was examined using a test-retest. Focus group analyses yielded six areas covered by 16 items. The cognitive interviews showed satisfactory item comprehension, relevance and coverage; however, one item was added. The mobile phone self-report system was reliable and perceived easy to use. The mobile phone self-report system appears efficiently to capture information relevant in patients' self-management of hypertension. Future studies need to evaluate the effectiveness of this tool in improving self-management of hypertension in clinical practice.
Validation of the Narcissistic Admiration and Rivalry Questionnaire Short Scale (NARQ-S) in convenience and representative samples.

PubMed

Leckelt, Marius; Wetzel, Eunike; Gerlach, Tanja M; Ackerman, Robert A; Miller, Joshua D; Chopik, William J; Penke, Lars; Geukes, Katharina; Küfner, Albrecht C P; Hutteman, Roos; Richter, David; Renner, Karl-Heinz; Allroggen, Marc; Brecheen, Courtney; Campbell, W Keith; Grossmann, Igor; Back, Mitja D

2018-01-01

Due to increased empirical interest in narcissism across the social sciences, there is a need for inventories that can be administered quickly while also reliably measuring both the agentic and antagonistic aspects of grandiose narcissism. In this study, we sought to validate the factor structure, provide representative descriptive data and reliability estimates, assess the reliability across the trait spectrum, and examine the nomological network of the short version of the Narcissistic Admiration and Rivalry Questionnaire (NARQ-S; Back et al., 2013). We used data from a large convenience sample (total N = 11,937) as well as data from a large representative sample (total N = 4,433) that included responses to other narcissism measures as well as related constructs, including the other Dark Triad traits, Big Five personality traits, and self-esteem. Confirmatory factor analysis and item response theory were used to validate the factor structure and estimate the reliability across the latent trait spectrum, respectively. Results suggest that the NARQ-S shows a robust factor structure and is a reliable and valid short measure of the agentic and antagonistic aspects of grandiose narcissism. We also discuss future directions and applications of the NARQ-S as a short and comprehensive measure of grandiose narcissism. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Development and Validation of a Questionnaire to Assess Multimorbidity in Primary Care: An Indian Experience.

PubMed

Pati, Sanghamitra; Hussain, Mohammad Akhtar; Swain, Subhashisa; Salisbury, Chris; Metsemakers, Job F M; Knottnerus, J André; van den Akker, Marjan

2016-01-01

Multimorbidity remains an underexplored domain in Indian primary care. We undertook a study to assess the prevalence, correlates, and outcomes of multimorbidity in primary care settings in India. This paper describes the process of development and validation of our data collection tool "Multimorbidity Assessment Questionnaire for Primary Care (MAQ-PC)." An iterative process comprising desk review, chart review, and expert consultations was undertaken to generate the questionnaire. The MAQ-PC contained items on chronic conditions, health care utilization, health related quality of life, disease severity, and sociodemographics. It was first tested with twelve adults for comprehensibility followed by test-retest reliability with 103 patients from four primary care practices. For interrater reliability, two interviewers separately administered the questionnaire to sixteen patients. MAQ-PC displayed strong internal consistency (Cronbach's alpha: 0.69), interrater reliability (Cohen's Kappa: 0.78-1), and test-retest reliability (ICC: 0.970-0.741). Substantial concordance between self-report and physician diagnosis (Scott Kappa: 0.59-1.0) was observed for listed chronic conditions indicating strong concurrent validity. Nearly 54% had one chronic condition and 23.3% had multimorbidity. Our findings demonstrate MAQ-PC to be a valid and reliable measure of multimorbidity in primary care practice and suggest its potential utility in multimorbidity research in India.
Measuring professional satisfaction in Greek nurses: combination of qualitative and quantitative investigation to evaluate the validity and reliability of the Index of Work Satisfaction.

PubMed

Karanikola, Maria N K; Papathanassoglou, Elizabeth D E

2015-02-01

The Index of Work Satisfaction (IWS) is a comprehensive scale assessing nurses' professional satisfaction. The aim of the present study was to explore: a) the applicability, reliability and validity of the Greek version of the IWS and b) contrasts among the factors addressed by IWS against the main themes emerging from a qualitative phenomenological investigation of nurses' professional experiences. A descriptive correlational design was applied using a sample of 246 emergency and critical care nurses. Internal consistency and test-retest reliability were tested. Construct and content validity were assessed by factor analysis, and through qualitative phenomenological analysis with a purposive sample of 12 nurses. Scale factors were contrasted to qualitative themes to assure that IWS embraces all aspects of Greek nurses' professional satisfaction. The internal consistency (α = 0.81) and test-retest (tau = 1, p < 0.0001) reliability were adequate. Following appropriate modifications, factor analysis confirmed the construct validity of the scale and subscales. The qualitative data partially clarified the low reliability of one subscale. The Greek version of the IWS scale is supported for use in acute care. The mixed methods approach constitutes a powerful tool for transferring scales to different cultures and healthcare systems. Copyright © 2014 Elsevier Inc. All rights reserved.
Improving the utility of the fine motor skills subscale of the comprehensive developmental inventory for infants and toddlers: a computerized adaptive test.

PubMed

Huang, Chien-Yu; Tung, Li-Chen; Chou, Yeh-Tai; Chou, Willy; Chen, Kuan-Lin; Hsieh, Ching-Lin

2017-07-27

This study aimed at improving the utility of the fine motor subscale of the comprehensive developmental inventory for infants and toddlers (CDIIT) by developing a computerized adaptive test of fine motor skills. We built an item bank for the computerized adaptive test of fine motor skills using the fine motor subscale of the CDIIT items fitting the Rasch model. We also examined the psychometric properties and efficiency of the computerized adaptive test of fine motor skills with simulated computerized adaptive tests. Data from 1742 children with suspected developmental delays were retrieved. The mean scores of the fine motor subscale of the CDIIT increased along with age groups (mean scores = 1.36-36.97). The computerized adaptive test of fine motor skills contains 31 items meeting the Rasch model's assumptions (infit mean square = 0.57-1.21, outfit mean square = 0.11-1.17). For children of 6-71 months, the computerized adaptive test of fine motor skills had high Rasch person reliability (average reliability >0.90), high concurrent validity (rs = 0.67-0.99), adequate to excellent diagnostic accuracy (area under receiver operating characteristic = 0.71-1.00), and large responsiveness (effect size = 1.05-3.93). The computerized adaptive test of fine motor skills used 48-84% fewer items than the fine motor subscale of the CDIIT. The computerized adaptive test of fine motor skills used fewer items for assessment but was as reliable and valid as the fine motor subscale of the CDIIT. Implications for Rehabilitation We developed a computerized adaptive test based on the comprehensive developmental inventory for infants and toddlers (CDIIT) for assessing fine motor skills. The computerized adaptive test has been shown to be efficient because it uses fewer items than the original measure and automatically presents the results right after the test is completed. The computerized adaptive test is as reliable and valid as the CDIIT.
Assessing Students' Spiritual and Religious Qualities

ERIC Educational Resources Information Center

Astin, Alexander W.; Astin, Helen S.; Lindholm, Jennifer A.

2011-01-01

This paper describes a comprehensive set of 12 new measures for studying undergraduate students' spiritual and religious development. The three measures of spirituality, four measures of "spiritually related" qualities, and five measures of religiousness demonstrate satisfactory reliability, robustness, and both concurrent and predictive validity.…
Reliability and validity of a novel tool to comprehensively assess food and beverage marketing in recreational sport settings.

PubMed

Prowse, Rachel J L; Naylor, Patti-Jean; Olstad, Dana Lee; Carson, Valerie; Mâsse, Louise C; Storey, Kate; Kirk, Sara F L; Raine, Kim D

2018-05-31

Current methods for evaluating food marketing to children often study a single marketing channel or approach. As the World Health Organization urges the removal of unhealthy food marketing in children's settings, methods that comprehensively explore the exposure and power of food marketing within a setting from multiple marketing channels and approaches are needed. The purpose of this study was to test the inter-rater reliability and the validity of a novel settings-based food marketing audit tool. The Food and beverage Marketing Assessment Tool for Settings (FoodMATS) was developed and its psychometric properties evaluated in five public recreation and sport facilities (sites) and subsequently used in 51 sites across Canada for a cross-sectional analysis of food marketing. Raters recorded the count of food marketing occasions, presence of child-targeted and sports-related marketing techniques, and the physical size of marketing occasions. Marketing occasions were classified by healthfulness. Inter-rater reliability was tested using Cohen's kappa (κ) and intra-class correlations (ICC). FoodMATS scores for each site were calculated using an algorithm that represented the theoretical impact of the marketing environment on food preferences, purchases, and consumption. Higher FoodMATS scores represented sites with higher exposure to, and more powerful (unhealthy, child-targeted, sports-related, large) food marketing. Validity of the scoring algorithm was tested through (1) Pearson's correlations between FoodMATS scores and facility sponsorship dollars, and (2) sequential multiple regression for predicting "Least Healthy" food sales from FoodMATS scores. Inter-rater reliability was very good to excellent (κ = 0.88-1.00, p < 0.001; ICC = 0.97, p < 0.001). There was a strong positive correlation between FoodMATS scores and food sponsorship dollars, after controlling for facility size (r = 0.86, p < 0.001). The FoodMATS score explained 14% of the variability in "Least Healthy" concession sales (p = 0.012) and 24% of the variability total concession and vending "Least Healthy" food sales (p = 0.003). FoodMATS has high inter-rater reliability and good validity. As the first validated tool to evaluate the exposure and power of food marketing in recreation facilities, the FoodMATS provides a novel means to comprehensively track changes in food marketing environments that can assist in developing and monitoring the impact of policies and interventions.
Validity and Reliability of the Verbal Numerical Rating Scale for Children Aged 4 to 17 Years With Acute Pain.

PubMed

Tsze, Daniel S; von Baeyer, Carl L; Pahalyants, Vartan; Dayan, Peter S

2018-06-01

The Verbal Numerical Rating Scale is the most commonly used self-report measure of pain intensity. It is unclear how the validity and reliability of the scale scores vary across children's ages. We aimed to determine the validity and reliability of the scale for children presenting to the emergency department across a comprehensive spectrum of age. This was a cross-sectional study of children aged 4 to 17 years. Children self-reported their pain intensity, using the Verbal Numerical Rating Scale and Faces Pain Scale-Revised at 2 serial assessments. We evaluated convergent validity (strong validity defined as correlation coefficient ≥0.60), agreement (difference between concurrent Verbal Numerical Rating Scale and Faces Pain Scale-Revised scores), known-groups validity (difference in score between children with painful versus nonpainful conditions), responsivity (decrease in score after analgesic administration), and reliability (test-retest at 2 serial assessments) in the total sample and subgroups based on age. We enrolled 760 children; 27 did not understand the Verbal Numerical Rating Scale and were removed. Of the remainder, Pearson correlations were strong to very strong (0.62 to 0.96) in all years of age except 4 and 5 years, and agreement was strong for children aged 8 and older. Known-groups validity and responsivity were strong in all years of age. Reliability was strong in all age subgroups, including each year of age from 4 to 7 years. Convergent validity, known-groups validity, responsivity, and reliability of the Verbal Numerical Rating Scale were strong for children aged 6 to 17 years. Convergent validity was not strong for children aged 4 and 5 years. Our findings support the use of the Verbal Numerical Rating Scale for most children aged 6 years and older, but not for those aged 4 and 5 years. Copyright © 2017 American College of Emergency Physicians. Published by Elsevier Inc. All rights reserved.
Cross-cultural translation, adaptation, and psychometric testing of the Roland-Morris disability questionnaire into modern standard Arabic.

PubMed

Maki, Dana; Rajab, Ebrahim; Watson, Paul J; Critchley, Duncan J

2014-12-01

Cross-cultural translation, adaptation, and psychometric testing. To cross-culturally translate and adapt the Roland-Morris Disability Questionnaire (RMDQ) into Modern Standard Arabic and examine its validity with Arabic-speaking patients with low back pain (LBP). The English RMDQ is valid, reliable, and commonly used to assess LBP disability in clinical practice and research. There is no valid and reliable version of the RMDQ in Modern Standard Arabic. The RMDQ was forward translated and back translated. An expert committee of musculoskeletal physiotherapists reviewed the translation. Eight patients with LBP evaluated item-by-item comprehensibility. Ten patients piloted the RMDQ for overall comprehensibility and acceptability. Seventeen bilingual patients tested the agreement of the Arabic and English RMDQs. Two-hundred one patients completed the RMDQ and the visual analogue scale. Sixty-four patients were followed-up for test-retest reliability. Translation of most items was uncontroversial. The expert committee found the Arabic RMDQ clinically and culturally appropriate. They reviewed item 11, addressing bending and kneeling, because this has a clinical significance and cultural/religious implication regarding prayer positions. All patients reported that it was easy to understand and complete. The Arabic RMDQ had high overall agreement with the English RMDQ for the global score (intraclass correlation coefficient [ICC] = 0.925; 0.811-0.972). Kappa statistics showed good item-by-item agreement (none ≤0.30). Mean (SD) RMDQ and visual analog scale scores of 201 patients were 10.53 (4.80) and 5.11 (2.28), respectively. The RMDQ had a low correlation against pain intensity (r = 0.259; P < 0.01). A Cronbach α of 0.729 showed high internal consistency. Test-retest reliability of the Arabic RMDQ was good (ICC = 0.900; 95% confidence interval, 0.753-0.951). Kappa statistics were high for 18 items and fair for 6. The Arabic version of the RMDQ has good comprehensibility and acceptability, high internal consistency and reliability, low correlation against pain intensity, and good agreement with the English RMDQ. We recommend its use with Arabic-speaking patients with LBP. 3.

Validation of the Malay Version of the Parental Bonding Instrument among Malaysian Youths Using Exploratory Factor Analysis.

PubMed

Muhammad, Noor Azimah; Shamsuddin, Khadijah; Omar, Khairani; Shah, Shamsul Azhar; Mohd Amin, Rahmah

2014-01-01

Parenting behaviour is culturally sensitive. The aims of this study were (1) to translate the Parental Bonding Instrument into Malay (PBI-M) and (2) to determine its factorial structure and validity among the Malaysian population. The PBI-M was generated from a standard translation process and comprehension testing. The validation study of the PBI-M was administered to 248 college students aged 18 to 22 years. Participants in the comprehension testing had difficulty understanding negative items. Five translated double negative items were replaced with five positive items with similar meanings. Exploratory factor analysis showed a three-factor model for the PBI-M with acceptable reliability. Four negative items (items 3, 4, 8, and 16) and item 19 were omitted from the final PBI-M list because of incorrect placement or low factor loading (< 0.32). Out of the final 20 items of the PBI-M, there were 10 items for the care factor, five items for the autonomy factor and five items for the overprotection factor. All the items loaded positively on their respective factors. The Malaysian population favoured positive items in answering questions. The PBI-M confirmed the three-factor model that consisted of care, autonomy and overprotection. The PBI-M is a valid and reliable instrument to assess the Malaysian parenting style. Confirmatory factor analysis may further support this finding. Malaysia, parenting, questionnaire, validity.
Development and psychometric validation of a brief comprehensive health status assessment scale in older patients with hematological malignancies: The GAH Scale.

PubMed

Bonanad, S; De la Rubia, J; Gironella, M; Pérez Persona, E; González, B; Fernández Lago, C; Arnan, M; Zudaire, M; Hernández Rivas, J A; Soler, A; Marrero, C; Olivier, C; Altés, A; Valcárcel, D; Hernández, M T; Oiartzabal, I; Fernández Ordoño, R; Arnao, M; Esquerra, A; Sarrá, J; González-Barca, E; González, J; Calvo, X; Nomdedeu, M; García Guiñón, A; Ramírez Payer, A; Casado, A; López, S; Durán, M; Marcos, M; Cruz-Jentoft, A J

2015-09-01

The purpose of this study was to develop a new brief, comprehensive geriatric assessment scale for older patients diagnosed with different hematological malignancies, the Geriatric Assessment in Hematology (GAH scale), and to determine its psychometric properties. The 30-item GAH scale was designed through a multi-step process to cover 8 relevant dimensions. This is an observational study conducted in 363 patients aged≥65years, newly diagnosed with different hematological malignancies (myelodysplasic syndrome/acute myeloblastic leukemia, multiple myeloma, or chronic lymphocytic leukemia), and treatment-naïve. The scale psychometric validation process included the analyses of feasibility, floor and ceiling effect, validity and reliability criteria. Mean time taken to complete the GAH scale was 11.9±4.7min that improved through a learning-curve effect. Almost 90% of patients completed all items, and no floor or ceiling effects were identified. Criterion validity was supported by reasonable correlations between the GAH scale dimensions and three contrast variables (global health visual analogue scale, ECOG and Karnofsky), except for comorbidities. Factor analysis (supported by the scree plot) revealed nine factors that explained almost 60% of the total variance. Moderate internal consistency reliability was found (Cronbach's α: 0.610), and test-retest was excellent (ICC coefficients, 0.695-0.928). Our study suggests that the GAH scale is a valid, internally reliable and a consistent tool to assess health status in older patients with different hematological malignancies. Future large studies should confirm whether the GAH scale may be a tool to improve clinical decision-making in older patients with hematological malignancies. Copyright © 2015 Elsevier Inc. All rights reserved.
The Sickness Impact Profile as a measure of the health status of noncognitively impaired nursing home residents.

PubMed

Rothman, M L; Hedrick, S; Inui, T

1989-03-01

The Sickness Impact Profile (SIP) is a multidimensional, behaviorally based measure of the health status that has been successfully used in a wide range of applications. The characteristics of this measure have not been assessed with nursing home residents. The purpose of this study was to assess the feasibility, reliability (internal consistency), validity, and comprehensiveness of the SIP as a measure of the health status of a selected group of nursing home residents. One hundred sixty-eight veterans residing in community and VA nursing homes responded to a questionnaire consisting of the SIP, Index of Activities of Daily Living, Barthel Index, Life Satisfaction Index Z, and the Philadelphia Geriatric Center Morale Scale. In general, the respondents correctly interpreted instructions; reliability and validity were supported; and the SIP was found to provide a comprehensive assessment of physical function. Adding a measure of psychologic well-being to a study protocol involving this population may, however, provide additional useful information regarding this construct.
An initial reliability and validity study of the Interaction, Communication, and Literacy Skills Audit.

PubMed

El-Choueifati, Nisrine; Purcell, Alison; McCabe, Patricia; Heard, Robert; Munro, Natalie

2014-06-01

Early childhood educators (ECEs) have an important role in promoting positive outcomes for children's language and literacy development. This paper reports the development of a new tool, The Interaction Communication and Literacy (ICL) Skills Audit, and pilots its reliability and validity. Intra- and inter-rater reliability was examined by three speech-language pathologists (SLPs). Five skill areas relating to ECE language and literacy practice were rated. The face and content validity of the ICL Skills Audit was examined by expert SLPs (n = 8) and expert ECEs (n = 4) via questionnaire. The overall intra-rater reliability for the ICL Skills Audit was excellent with percentage close agreement (PCA) of 91-94. Inter-rater agreement was PCA 68-80. Expert SLPs and ECEs agreed that the content was comprehensive and practical. Based on this preliminary study, the ICL Skills Audit appears to be a promising tool that can be used by SLPs and ECEs in collaboration to measure the skills of ECEs in the areas of language and literacy support. Future psychometric and outcome research on the revised ICL Skills Audit is warranted.
Validation of highly reliable, real-time knowledge-based systems

NASA Technical Reports Server (NTRS)

Johnson, Sally C.

1988-01-01

Knowledge-based systems have the potential to greatly increase the capabilities of future aircraft and spacecraft and to significantly reduce support manpower needed for the space station and other space missions. However, a credible validation methodology must be developed before knowledge-based systems can be used for life- or mission-critical applications. Experience with conventional software has shown that the use of good software engineering techniques and static analysis tools can greatly reduce the time needed for testing and simulation of a system. Since exhaustive testing is infeasible, reliability must be built into the software during the design and implementation phases. Unfortunately, many of the software engineering techniques and tools used for conventional software are of little use in the development of knowledge-based systems. Therefore, research at Langley is focused on developing a set of guidelines, methods, and prototype validation tools for building highly reliable, knowledge-based systems. The use of a comprehensive methodology for building highly reliable, knowledge-based systems should significantly decrease the time needed for testing and simulation. A proven record of delivering reliable systems at the beginning of the highly visible testing and simulation phases is crucial to the acceptance of knowledge-based systems in critical applications.
DEVELOPMENT OF MOTIVATION SCALE - CLINICAL VALIDATION WITH ALCOHOL DEPENDENTS

PubMed Central

Neeliyara, Teresa; Nagalakshmi, S.V.

1994-01-01

This study focusses on the development of a comprehensive multi-dimensional scale for assessing motivation for change in the alcohol dependent population. After establishing face validity, the items evolved were administered to a normal sample of 600 male subjects in whom psychiatric illness was ruled out. The data thus obtained was subjected to factor analysis. Six factors were obtained which accounted for 55.2% of variance. These together formed a 80 item five point scale and norms were established on a sample of 600 normal subjects. Further clinical validation was established on 30 alcohol dependent subjects and 30 normals. The status of motivation was found to be inadequate in alcohol dependent individuals as compared to the normals. Split-half reliability was carried out and the tool was found to be highly reliable. PMID:21743674
Reliability and validity of the Symptoms of Depression Questionnaire (SDQ)

PubMed Central

Pedrelli, Paola; Blais, Mark A.; Alpert, Jonathan E.; Shelton, Richard C.; Walker, Rosemary S. W.; Fava, Maurizio

2015-01-01

Current measures for major depressive disorder focus primarily on the assessment of depressive symptoms, while often omitting other common features. However, the presence of comorbid features in the anxiety spectrum influences outcome and may effect treatment. More comprehensive measures of depression are needed that include the assessment of symptoms in the anxiety–depression spectrum. This study examines the reliability and validity of the Symptoms of Depression Questionnaire (SDQ), which assesses irritability, anger attacks, and anxiety symptoms together with the commonly considered symptoms of depression. Analysis of the factor structure of the SDQ identified 5 subscales, including one in the anxiety–depression spectrum, with adequate internal consistency and concurrent validity. The SDQ may be a valuable new tool to better characterize depression and identify and administer more targeted interventions. PMID:25275853
Developing and validating the Communication Function Classification System for individuals with cerebral palsy

PubMed Central

HIDECKER, MARY JO COOLEY; PANETH, NIGEL; ROSENBAUM, PETER L; KENT, RAYMOND D; LILLIE, JANET; EULENBERG, JOHN B; CHESTER, KEN; JOHNSON, BRENDA; MICHALSEN, LAUREN; EVATT, MORGAN; TAYLOR, KARA

2011-01-01

Aim The purpose of this study was to create and validate a Communication Function Classification System (CFCS) for children with cerebral palsy (CP) that can be used by a wide variety of individuals who are interested in CP. This paper reports the content validity, interrater reliability, and test–retest reliability of the CFCS for children with CP. Method An 11-member development team created comprehensive descriptions of the CFCS levels, and four nominal groups comprising 27 participants critiqued these levels. Within a Delphi survey, 112 participants commented on the clarity and usefulness of the CFCS. Interrater reliability was completed by 61 professionals and 68 parents/relatives who classified 69 children with CP aged 2 to 18 years. Test–retest reliability was completed by 48 professionals who allowed at least 2 weeks between classifications. The participants who assessed the CFCS were all relevant stakeholders: adults with CP, parents of children with CP, educators, occupational therapists, physical therapists, physicians, and speech–language pathologists. Results The interrater reliability of the CFCS was 0.66 between two professionals and 0.49 between a parent and a professional. Professional interrater reliability improved to 0.77 for classification of children older than 4 years. The test–retest reliability was 0.82. Interpretation The CFCS demonstrates content validity and shows very good test–retest reliability, good professional interrater reliability, and moderate parent–professional interrater reliability. Combining the CFCS with the Gross Motor Function Classification System and the Manual Ability Classification System contributes to a functional performance view of daily life for individuals with CP, in accordance with the World Health Organization’s International Classification of Functioning, Disability and Health. PMID:21707596
Cross-cultural adaptation and validation of the Ankle Osteoarthritis Scale for use in French-speaking populations.

PubMed

Angers, Magalie; Svotelis, Amy; Balg, Frederic; Allard, Jean-Pascal

2016-04-01

The Ankle Osteoarthritis Scale (AOS) is a self-administered score specific for ankle osteoarthritis (OA) with excellent reliability and strong construct and criterion validity. Many recent randomized multicentre trials have used the AOS, and the involvement of the French-speaking population is limited by the absence of a French version. Our goal was to develop a French version and validate the psychometric properties to assure equivalence to the original English version. Translation was performed according to American Association of Orthopaedic Surgeons (AAOS) 2000 guidelines for cross-cultural adaptation. Similar to the validation process of the English AOS, we evaluated the psychometric properties of the French version (AOS-Fr): criterion validity (AOS-Fr v. Western Ontario and McMaster Universities Arthritis Index [WOMAC] and SF-36 scores), construct validity (AOS-Fr correlation to single heel-lift test), and reliability (AOS-Fr test-retest). Sixty healthy individuals tested a prefinal version of the AOS-Fr for comprehension, leading to modifications and a final version that was approved by C. Saltzman, author of the AOS. We then recruited patients with ankle OA for evaluation of the AOS-Fr psychometric properties. Twenty-eight patients with ankle OA participated in the evaluation. The AOS-Fr showed strong criterion validity (AOS:WOMAC r = 0.709 and AOS:SF-36 r = -0.654) and construct validity (r = 0.664) and proved to be reliable (test-retest intraclass correlation coefficient = 0.922). The AOS-Fr is a reliable and valid score equivalent to the English version in terms of psychometric properties, thus is available for use in multicentre trials.
Development, reliability, and validity testing of Toddler NutriSTEP: a nutrition risk screening questionnaire for children 18-35 months of age.

PubMed

Randall Simpson, Janis; Gumbley, Jillian; Whyte, Kylie; Lac, Jane; Morra, Crystal; Rysdale, Lee; Turfryer, Mary; McGibbon, Kim; Beyers, Joanne; Keller, Heather

2015-09-01

Nutrition is vital for optimal growth and development of young children. Nutrition risk screening can facilitate early intervention when followed by nutritional assessment and treatment. NutriSTEP (Nutrition Screening Tool for Every Preschooler) is a valid and reliable nutrition risk screening questionnaire for preschoolers (aged 3-5 years). A need was identified for a similar questionnaire for toddlers (aged 18-35 months). The purpose was to develop a reliable and valid Toddler NutriSTEP. Toddler NutriSTEP was developed in 4 phases. Content and face validity were determined with a literature review, parent focus groups (n = 6; 48 participants), and experts (n = 13) (phase A). A draft questionnaire was refined with key intercept interviews of 107 parents/caregivers (phase B). Test-retest reliability (phase C), based on intra-class correlations (ICC), Kappa (κ) statistics, and Wilcoxon tests was assessed with 133 parents/caregivers. Criterion validity (phase D) was assessed using Receiver Operating Characteristic (ROC) curves by comparing scores on the Toddler NutriSTEP to a comprehensive nutritional assessment of 200 toddlers with a registered dietitian (RD). The Toddler NutriSTEP was reliable between 2 administrations (ICC = 0.951, F = 20.53, p < 0.001); most questions had moderate (κ ≥ 0.6) or excellent (κ ≥ 0.8) agreement. Scores on the RD nutrition risk rating and the Toddler NutriSTEP were correlated (r = 0.67, p < 0.000). The area under the ROC curve for moderate and high RD risk ratings were 84.6% and 82.7%, respectively. Cut-points of ≥21 (sensitivity 86%; specificity 61%) (moderate risk) and ≥26 (sensitivity 95%; specificity 63%) (high risk) were determined. The Toddler NutriSTEP questionnaire is both reliable and valid for screening for nutritional risk in toddlers.
The Preschool Rating Instrument for Science and Mathematics (PRISM)

ERIC Educational Resources Information Center

Brenneman, Kimberly; Stevenson-Garcia, Judi; Jung, Kwanghee; Frede, Ellen

2011-01-01

Until recently, few valid and reliable assessments were available to measure young children's mathematics and science learning in a "comprehensive" way. Now, a number of mathematics assessments have been developed and subjected to testing (Klein, Starkey, & Wakeley, 2000; Ginsburg, 2008; Clements & Sarama, 2008), and progress has…
Objective structured clinical examination for pharmacy students in Qatar: cultural and contextual barriers to assessment.

PubMed

Wilby, K J; Black, E K; Austin, Z; Mukhalalati, B; Aboulsoud, S; Khalifa, S I

2016-07-10

This study aimed to evaluate the feasibility and psychometric defensibility of implementing a comprehensive objective structured clinical examination (OSCE) on the complete pharmacy programme for pharmacy students in a Middle Eastern context, and to identify facilitators and barriers to implementation within new settings. Eight cases were developed, validated, and had standards set according to a blueprint, and were assessed with graduating pharmacy students. Assessor reliability was evaluated using inter-class coefficients (ICCs). Concurrent validity was evaluated by comparing OSCE results to professional skills course grades. Field notes were maintained to generate recommendations for implementation in other contexts. The examination pass mark was 424 points out of 700 (60.6%). All 23 participants passed. Mean performance was 74.6%. Low to moderate inter-rater reliability was obtained for analytical and global components (average ICC 0.77 and 0.48, respectively). In conclusion, OSCE was feasible in Qatar but context-related validity and reliability concerns must be addressed prior to future iterations in Qatar and elsewhere.
Psychometric survey of nursing competences illustrated with nursing students and apprentices

PubMed

Reichardt, Christoph; Wernecke, Frances; Giesler, Marianne; Petersen-Ewert, Corinna

2016-09-01

Background: The term competences is discussed differently in various disciplines of science. Furthermore there is no international or discipline comprehensive accepted definition of this term. Problem: So far, there are few practical, reliable and valid measuring instruments for a survey of general nursing skills. This article describes the adaptation process of a measuring instrument for medical skills into one for nursing competences. Method: The measurement quality of the questionnaire was audited using a sample of two different courses of studies and regular nursing apprentices. Another research question focused whether the adapted questionnaire is able to detect a change of nursing skills. For the validation of reliability and validity data from the first point of measurement was used (n = 240). The data from the second point of measurement, which was conducted two years later (n = 163), were used to validate, whether the questionnaire is able to detect a change of nursing competences. Results/Conclusions: The results indicate that the adapted version of the questionnaire is reliable and valid. Also the questionnaire was able to detect significant, partly even strong, effects of change in nursing skills (d = 0,17 – 1,04). It was possible to adapt the questionnaire for the measurement of nursing competences.
Training Guide for Observation and Interviewing in Marine Corps Task Analysis. Training Manual 3

DTIC Science & Technology

1975-08-01

McGraw- Hill Book Co., 1954. This is a comprehensive text that treats theory and sta- tistical operations in psychological testing . Chapter 14 deals...with reliability and validity of measures. -44- Reliabilitv and Validity (cont’d.) Cronbach, L., ESSENTIALS OF PSYCHOLOGICAL TESTING , 2nd ed., New...Commandant of the Marine Corps (Code RD) And Monitored By Personnel and Training Research Programs Psychological Sciences Division Office of Naval
A psychometric evaluation of the Rorschach comprehensive system's perceptual thinking index.

PubMed

Dao, Tam K; Prevatt, Frances

2006-04-01

In this study, we investigated evidence for reliability and validity of the Perceptual Thinking Index (PTI; Exner, 2000a, 2000b) among an adult inpatient population. We conducted reliability and validity analyses on 107 patients who met the Diagnostic and Statistical Manual of Mental Disorders (4th ed., text revision; American Psychiatric Association, 2000) criteria for a schizophrenia-spectrum disorder (SSD) or mood disorder with no psychotic features (MD). Results provided support for interrater reliability as well as internal consistency of the PTI. Furthermore, the PTI was an effective index in differentiating SSD patients from patients diagnosed with an MD. Finally, the PTI demonstrated adequate diagnostic statistics that can be useful in the classification of patients diagnosed with SSD and MD. We discuss methodological issues, implications for assessment practice, and directions for future research.
A Comprehensive Observational Coding Scheme for Analyzing Instrumental, Affective, and Relational Communication in Health Care Contexts

PubMed Central

SIMINOFF, LAURA A.; STEP, MARY M.

2011-01-01

Many observational coding schemes have been offered to measure communication in health care settings. These schemes fall short of capturing multiple functions of communication among providers, patients, and other participants. After a brief review of observational communication coding, the authors present a comprehensive scheme for coding communication that is (a) grounded in communication theory, (b) accounts for instrumental and relational communication, and (c) captures important contextual features with tailored coding templates: the Siminoff Communication Content & Affect Program (SCCAP). To test SCCAP reliability and validity, the authors coded data from two communication studies. The SCCAP provided reliable measurement of communication variables including tailored content areas and observer ratings of speaker immediacy, affiliation, confirmation, and disconfirmation behaviors. PMID:21213170
Psychometric Analysis of Role Conflict and Ambiguity Scales in Academia

ERIC Educational Resources Information Center

Khan, Anwar; Yusoff, Rosman Bin Md.; Khan, Muhammad Muddassar; Yasir, Muhammad; Khan, Faisal

2014-01-01

A comprehensive Psychometric Analysis of Rizzo et al.'s (1970) Role Conflict & Ambiguity (RCA) scales were performed after its distribution among 600 academic staff working in six universities of Pakistan. The reliability analysis includes calculation of Cronbach Alpha Coefficients and Inter-Items statistics, whereas validity was determined by…
Development and Implementation of a Food Safety Knowledge Instrument

ERIC Educational Resources Information Center

Byrd-Bredbenner, Carol; Wheatley, Virginia; Schaffner, Donald; Bruhn, Christine; Blalock, Lydia; Maurer, Jaclyn

2007-01-01

Little is known about the food safety knowledge of young adults. In addition, few knowledge questionnaires and no comprehensive, criterion-referenced measure that assesses the full range of food safety knowledge could be identified. Without appropriate, valid, and reliable measures and baseline data, it is difficult to develop and implement…
Bicultural Work Motivation Scale for Asian American College Students

ERIC Educational Resources Information Center

Chen, Yung-Lung; Fouad, Nadya A.

2016-01-01

The bicultural work motivations of Asian Americans have not yet been comprehensively captured by contemporary vocational constructs and scales. For this study, we conducted two studies on the initial reliability and validity of the Bicultural Work Motivation Scale (BWMS) by combining qualitative and quantitative methods. First, a pilot study was…
Toward Objectivity in Diagnosing Learning Disabilities: Refinement of Established Procedures.

ERIC Educational Resources Information Center

Goodman, Marvin; Mina, Elias

Variability in diagnostic procedures and a lack of valid and reliable measures led to the development of a comprehensive battery, which incorporated an operational definition of learning disabilities. The battery consisted of forms for observing these functions: intelligence, academic achievement, gross and fine motor control, visual perception,…

Self-Report Measures of Juvenile Psychopathic Personality Traits: A Comparative Review

ERIC Educational Resources Information Center

Vaughn, Michael G.; Howard, Matthew O.

2005-01-01

The authors evaluated self-report instruments currently being used to assess children and adolescents with psychopathic personality traits with respect to their reliability, validity, and research utility. Comprehensive searches across multiple computerized bibliographic databases were conducted and supplemented with manual searches. A total of 30…
Validation of the Mandarin Chinese version of the Leicester Cough Questionnaire in bronchiectasis.

PubMed

Gao, Y-H; Guan, W-J; Xu, G; Gao, Y; Lin, Z-Y; Tang, Y; Lin, Z-M; Li, H-M; Luo, Q; Zhong, N-S; Birring, S S; Chen, R-C

2014-12-01

The Leicester Cough Questionnaire (LCQ) has been validated for assessing cough-specific health status in bronchiectasis. We translated the LCQ into Mandarin Chinese and investigated its validity, reliability and responsiveness. The LCQ was translated into Mandarin Chinese using the forward-backward translation procedure. A total of 144 out-patients completed the Mandarin Chinese version of the LCQ (LCQ-MC), the Hospital Anxiety and Depression Scale (HADS) and the St George's Respiratory Questionnaire. Reassessments were performed during exacerbations and at 6 months. Concurrent validation, internal consistency, repeatability and responsiveness were determined. Minor cultural adaptations were made to the wording of LCQ-MC. No other difficulties were found during the translation process, with all items easily adapted to acceptable Mandarin Chinese. The questionnaire was not changed in terms of content layout and the order of the questions. In cognitive debriefing interviews, participants reported that the questionnaire was acceptable, relevant, comprehensive and easy to complete. The LCQ-MC showed good concurrent validity, internal consistency and test-retest reliability. Responsiveness was shown by significant changes in LCQ-MC scores between steady state, the first exacerbation and following 2-week antibiotic treatment (both interval changes, P < 0.01) CONCLUSION: The LCQ-MC is a valid, reliable and responsive instrument for determining cough-specific health status in Chinese bronchiectasis patients.
The Computerized Perceptual Motor Skills Assessment: A new visual perceptual motor skills evaluation tool for children in early elementary grades.

PubMed

Howe, Tsu-Hsin; Chen, Hao-Ling; Lee, Candy Chieh; Chen, Ying-Dar; Wang, Tien-Ni

2017-10-01

Visual perceptual motor skills have been proposed as underlying courses of handwriting difficulties. However, there is no evaluation tool currently available to assess these skills comprehensively and to serve as a sensitive measure. The purpose of this study was to validate the Computerized Perceptual Motor Skills Assessment (CPMSA), a newly developed evaluation tool for children in early elementary grades. Its test-retest reliability, concurrent validity, discriminant validity, and responsiveness were examined in 43 typically developing children and 26 children with handwriting difficulty. The CPMSA demonstrated excellent reliability across all subtests with intra-class correlation coefficients (ICCs)≥0.80. Significant moderate correlations between the domains of the CPMSA and corresponding gold standards including Beery VMI, the TVPS-3, and the eye-hand coordination subtest of the DTVP-2 demonstrated good concurrent validity. In addition, the CPMSA showed evidence of discriminant validity in samples of children with and without handwriting difficulty. This article provides evidence in support of the CPMSA. The CPMSA is a reliable, valid, and promising measure of visual perceptual motor skills for children in early elementary grades. Directions for future study and improvements to the assessment are discussed. Copyright © 2017. Published by Elsevier Ltd.
Fear of cancer recurrence: a systematic literature review of self-report measures.

PubMed

Thewes, Belinda; Butow, Phillis; Zachariae, Robert; Christensen, Soren; Simard, Sébastien; Gotay, Carolyn

2012-06-01

Prior research has shown that many cancer survivors experience ongoing fears of cancer recurrence (FCR) and that this chronic uncertainty of health status during and after cancer treatment can be a significant psychological burden. The field of research on FCR is an emerging area of investigation in the cancer survivorship literature, and several standardised instruments for its assessment have been developed. This review aims to identify all available FCR-specific questionnaires and subscales and critically appraise their properties. A systematic review was undertaken to identify instruments measuring FCR. Relevant studies were identified via Medline (1950-2010), CINAHL (1982-2010), PsycINFO (1967-2010) and AMED (1985-2010) databases, reference lists of articles and reviews, grey literature databases and consultation with experts in the field. The Medical Outcomes Trust criteria were used to examine the psychometric properties of the questionnaires. A total of 20 relevant multi-item measures were identified. The majority of instruments have demonstrated reliability and preliminary evidence of validity. Relatively few brief measures (2-10 items) were found to have comprehensive validation and reliability data available. Several valid and reliable longer measures (>10 items) are available. Three have developed short forms that may prove useful as screening tools. This analysis indicated that further refinement and validation of existing instruments is required. Valid and reliable instruments are needed for both research and clinical care. Copyright © 2011 John Wiley & Sons, Ltd.
Development and validation of the impact of dry eye on everyday life (IDEEL) questionnaire, a patient-reported outcomes (PRO) measure for the assessment of the burden of dry eye on patients.

PubMed

Abetz, Linda; Rajagopalan, Krithika; Mertzanis, Polyxane; Begley, Carolyn; Barnes, Rod; Chalmers, Robin

2011-12-08

To develop and validate a comprehensive patient-reported outcomes instrument focusing on the impact of dry eye on everyday life (IDEEL). Development and validation of the IDEEL occurred in four phases: 1) focus groups with 45 dry eye patients to develop a draft instrument, 2) item generation, 3) pilot study to assess content validity in 16 patients and 4) psychometric validation in 210 subjects: 130 with non-Sjögren's keratoconjunctivitis sicca, 32 with Sjögren's syndrome and 48 controls, and subsequent item reduction. Focus groups identified symptoms and the associated bother, the impact of dry eye on daily life and the patients' satisfaction with their treatment as the central concepts in patients' experience of dry eye. Qualitative analysis indicated that saturation was achieved for these concepts and yielded an initial 112-item draft instrument. Patients understood the questionnaire and found the items to be relevant indicating content validity. Patient input, item descriptive statistics and factor analysis identified 55 items that could be deleted. The final 57-item IDEEL assesses dry eye impact constituting 3 modules: dry eye symptom-bother, dry eye impact on daily life comprising impact on daily activities, emotional impact, impact on work, and dry eye treatment satisfaction comprising satisfaction with treatment effectiveness and treatment-related bother/inconvenience. The psychometric analysis results indicated that the IDEEL met the criteria for item discriminant validity, internal consistency reliability, test-retest reliability and floor/ceiling effects. As expected, the correlations between IDEEL and the Dry Eye Questionnaire (a habitual symptom questionnaire) were higher than between IDEEL and Short-Form-36 and EuroQoL-5D, indicating concurrent validity. The IDEEL is a reliable, valid and comprehensive questionnaire relevant to issues that are specific to dry eye patients, and meets current FDA patient-reported outcomes guidelines. The use of this questionnaire will provide assessment of the impact of dry eye on patient dry eye-related quality of life, impact of treatment on patient outcomes in clinical trials, and may aid in treatment effectiveness evaluation.
Evaluation of a Russian version of the oral health literacy instrument (OHLI).

PubMed

Blizniuk, Anastasiya; Ueno, Masayuki; Furukawa, Sayaka; Kawaguchi, Yoko

2014-11-27

Oral health literacy has become a popular research area in the last decade; however, to date no health literacy instruments in the Russian language exist. The objectives of this study were to develop a Russian version of the Oral Health Literacy Instrument (OHLI) and to examine its reliability and validity. A convenience sample of patients who visited the dental division of the district hospital in Belarus was used in the study. The OHLI, created originally in English, was modified to adapt it to characteristics of routine dental services in Belarus and then translated into Russian, followed by back-translation. Participants completed a self-administered socio-demographic questionnaire, an oral health knowledge test and the Russian version of the OHLI (R-OHLI). Bivariate and multivariate statistical analyses, including multiple regression modeling, were performed to examine reliability and validity of the R-OHLI. Participants were 281 adult patients aged from 18 to 60 years, with a mean age of 33.1 ± 12.2; 64.1% of them were women. Cronbach's alpha values for the two sections (reading comprehension and numeracy) and the total R-OHLI were 0.853, 0.815 and 0.895, respectively. The mean total R-OHLI score was 77.2 ± 14.5; the mean reading comprehension and numeracy scores were 39.5 ± 7.5 and 37.8 ± 8.8, respectively. The R-OHLI was significantly correlated to the oral health knowledge test. Pearson's correlation coefficients between the oral health knowledge test and the reading comprehension, numeracy and total R-OHLI were 0.401, 0.258, and 0.363, respectively (p < 0.001). Women, participants with a university degree, and those who visited a dentist at least once a year had significantly (p < 0.05) higher mean scores for each section (reading comprehension, numeracy) and for total R-OHLI compared to their counterparts. The R-OHLI showed good internal consistency and test-retest reliability. It was significantly associated with the oral health knowledge test, socio-demographic and behavioral factors. Therefore, the R-OHLI was proved to be a reliable and valid oral health literacy instrument for Russian-speaking people.
The IDEA Assessment Tool: Assessing the Reporting, Diagnostic Reasoning, and Decision-Making Skills Demonstrated in Medical Students' Hospital Admission Notes.

PubMed

Baker, Elizabeth A; Ledford, Cynthia H; Fogg, Louis; Way, David P; Park, Yoon Soo

2015-01-01

Construct: Clinical skills are used in the care of patients, including reporting, diagnostic reasoning, and decision-making skills. Written comprehensive new patient admission notes (H&Ps) are a ubiquitous part of student education but are underutilized in the assessment of clinical skills. The interpretive summary, differential diagnosis, explanation of reasoning, and alternatives (IDEA) assessment tool was developed to assess students' clinical skills using written comprehensive new patient admission notes. The validity evidence for assessment of clinical skills using clinical documentation following authentic patient encounters has not been well documented. Diagnostic justification tools and postencounter notes are described in the literature (1,2) but are based on standardized patient encounters. To our knowledge, the IDEA assessment tool is the first published tool that uses medical students' H&Ps to rate students' clinical skills. The IDEA assessment tool is a 15-item instrument that asks evaluators to rate students' reporting, diagnostic reasoning, and decision-making skills based on medical students' new patient admission notes. This study presents validity evidence in support of the IDEA assessment tool using Messick's unified framework, including content (theoretical framework), response process (interrater reliability), internal structure (factor analysis and internal-consistency reliability), and relationship to other variables. Validity evidence is based on results from four studies conducted between 2010 and 2013. First, the factor analysis (2010, n = 216) yielded a three-factor solution, measuring patient story, IDEA, and completeness, with reliabilities of .79, .88, and .79, respectively. Second, an initial interrater reliability study (2010) involving two raters demonstrated fair to moderate consensus (κ = .21-.56, ρ =.42-.79). Third, a second interrater reliability study (2011) with 22 trained raters also demonstrated fair to moderate agreement (intraclass correlations [ICCs] = .29-.67). There was moderate reliability for all three skill domains, including reporting skills (ICC = .53), diagnostic reasoning skills (ICC = .64), and decision-making skills (ICC = .63). Fourth, there was a significant correlation between IDEA rating scores (2010-2013) and final Internal Medicine clerkship grades (r = .24), 95% confidence interval (CI) [.15, .33]. The IDEA assessment tool is a novel tool with validity evidence to support its use in the assessment of students' reporting, diagnostic reasoning, and decision-making skills. The moderate reliability achieved supports formative or lower stakes summative uses rather than high-stakes summative judgments.
Lymphoedema Functioning, Disability and Health Questionnaire for Lower Limb Lymphoedema (Lymph-ICF-LL): reliability and validity.

PubMed

Devoogdt, Nele; De Groef, An; Hendrickx, Ad; Damstra, Robert; Christiaansen, Anke; Geraerts, Inge; Vervloesem, Nele; Vergote, Ignace; Van Kampen, Marijke

2014-05-01

Patients may develop primary (congenital) or secondary (acquired) lymphedema, causing significant physical and psychosocial problems. To plan treatment for lymphedema and monitor a patient's progress, swelling, and problems in functioning associated with lymphedema development should be assessed at baseline and follow-up. The purpose of this study was to investigate the reliability (test-retest, internal consistency, and measurement variability) and validity (content and construct) of data obtained with the Lymphoedema Functioning, Disability and Health Questionnaire for Lower Limb Lymphoedema (Lymph-ICF-LL). This was a multicenter, cross-sectional study. The Lymph-ICF-LL is a descriptive, evaluative tool containing 28 questions about impairments in function, activity limitations, and participation restrictions in patients with lower limb lymphedema. The questionnaire has 5 domains: physical function, mental function, general tasks/household activities, mobility activities, and life domains/social life. The reliability and validity of the Lymph-ICF-LL were examined in 30 participants with objective lower limb lymphedema. Intraclass correlation coefficients for test-retest reliability ranged from .69 to .94, and Cronbach alpha coefficients for internal consistency ranged from .82 to .97. Measurement variability was acceptable (standard error of measurement=5.9-12.6). Content validity was good because all questions were understandable for 93% of participants, the scoring system (visual analog scale) was clear, and the questionnaire was comprehensive for 90% of participants. Construct validity was good. All hypotheses for assessing convergent validity and divergent validity were accepted. The known-groups validity and responsiveness of the Dutch Lymph-ICF-LL and the cross-cultural validity of the English version of the Lymph-ICF-LL were not investigated. The Lymph-ICF-LL is a Dutch questionnaire with evidence of reliability and validity for assessing impairments in function, activity limitations, and participation restrictions in people with primary or secondary lower limb lymphedema.
Reliability evaluation of microgrid considering incentive-based demand response

NASA Astrophysics Data System (ADS)

Huang, Ting-Cheng; Zhang, Yong-Jun

2017-07-01

Incentive-based demand response (IBDR) can guide customers to adjust their behaviour of electricity and curtail load actively. Meanwhile, distributed generation (DG) and energy storage system (ESS) can provide time for the implementation of IBDR. The paper focus on the reliability evaluation of microgrid considering IBDR. Firstly, the mechanism of IBDR and its impact on power supply reliability are analysed. Secondly, the IBDR dispatch model considering customer’s comprehensive assessment and the customer response model are developed. Thirdly, the reliability evaluation method considering IBDR based on Monte Carlo simulation is proposed. Finally, the validity of the above models and method is studied through numerical tests on modified RBTS Bus6 test system. Simulation results demonstrated that IBDR can improve the reliability of microgrid.
Validation of the Malay Version of the Parental Bonding Instrument among Malaysian Youths Using Exploratory Factor Analysis

PubMed Central

MUHAMMAD, Noor Azimah; SHAMSUDDIN, Khadijah; OMAR, Khairani; SHAH, Shamsul Azhar; MOHD AMIN, Rahmah

2014-01-01

Background: Parenting behaviour is culturally sensitive. The aims of this study were (1) to translate the Parental Bonding Instrument into Malay (PBI-M) and (2) to determine its factorial structure and validity among the Malaysian population. Methods: The PBI-M was generated from a standard translation process and comprehension testing. The validation study of the PBI-M was administered to 248 college students aged 18 to 22 years. Results: Participants in the comprehension testing had difficulty understanding negative items. Five translated double negative items were replaced with five positive items with similar meanings. Exploratory factor analysis showed a three-factor model for the PBI-M with acceptable reliability. Four negative items (items 3, 4, 8, and 16) and item 19 were omitted from the final PBI-M list because of incorrect placement or low factor loading (< 0.32). Out of the final 20 items of the PBI-M, there were 10 items for the care factor, five items for the autonomy factor and five items for the overprotection factor. All the items loaded positively on their respective factors. Conclusion: The Malaysian population favoured positive items in answering questions. The PBI-M confirmed the three-factor model that consisted of care, autonomy and overprotection. The PBI-M is a valid and reliable instrument to assess the Malaysian parenting style. Confirmatory factor analysis may further support this finding. Keywords: Malaysia, parenting, questionnaire, validity PMID:25977634
Cross-cultural validation and psychometric evaluation of the Participation and Environment Measure for Children and Youth in Korea.

PubMed

Jeong, Yunwha; Law, Mary; Stratford, Paul; DeMatteo, Carol; Kim, Hwan

2016-11-01

To develop the Korean version of the Participation and Environment Measure for Children and Youth (KPEM-CY) and examine its psychometric properties. The PEM-CY was cross-culturally translated into Korean using a specific guideline: pre-review of participation items, forward/backward translation, expert committee review, pre-test of the KPEM-CY and final review. To establish internal consistency, test-retest reliability and construct validity of the KPEM-CY, 80 parents of children with disabilities aged 5-13 years were recruited in South Korea. Across the home, school and community settings, 76% of participation items and 29% of environment items were revised to improve their fit with Korean culture. Internal consistency was moderate to excellent (0.67-0.92) for different summary scores. Test-retest reliability was excellent (>0.75) in the summary scores of participation frequency and extent of involvement across the three settings and moderate to excellent (0.53-0.95) in all summary scores at home. Child's age, type of school and annual income were the factors that significantly influenced specific dimensions of participation and environment across all settings. Results indicated that the KPEM-CY is equivalent to the original PEM-CY and has initial evidence of reliability and validity for use with Korean children with disabilities. Implications for rehabilitation Because 'participation' is a key outcome of the rehabilitation, measuring comprehensive participation of children with disabilities is necessary. The PEM-CY is a parent-report survey measure to assess comprehensive participation of children and youth and environment, which affect their participation, at home, school and in the community. A cross-cultural adaptation process is mandatory to adapt the measurement tool to a new culture or country. The Korean PEM-CY has both reliability and validity and can therefore generate useful clinical data for Korean children with disabilities.
Iranian Health Literacy Questionnaire (IHLQ): An Instrument for Measuring Health Literacy in Iran

PubMed Central

Haghdoost, Ali Akbar; Rakhshani, Fatemeh; Aarabi, Mohsen; Montazeri, Ali; Tavousi, Mahmoud; Solimanian, Atoosa; Sarbandi, Fatemeh; Namdar, Hosein; Iranpour, Abedin

2015-01-01

Background: Promoting Health Literacy (HL) is considered as an important goal in strategic plans of many countries. In spite of the necessity for access to valid, reliable and native HL instruments, the number of such instruments in the Persian language is scarce. Moreover, there is no good estimation of HL status in Iran. Objectives: The aim of this study was to provide a valid, reliable and native instrument to measure and monitor community HL in Iran and also, to provide an estimation of HL status in two Iranian provinces. Patients and Methods: By applying the multistage cluster sampling, 1080 respondents (540 from each gender) were recruited from Kerman and Mazandaran provinces of Iran, from February to June 2014 to participate in this cross-sectional study. The development of the Iranian Health Literacy Questionnaire (IHLQ) was initiated with a comprehensive review of the literature. Then, face, content and construct validity as well as reliability were determined. Results: Internal consistency and test-retest reliability (ICC) of the factors was in the range of 0.71 to 0.96 and 0.73 to 0.86, respectively. In order to construct validity, Exploratory Factor Analysis (EFA) Kaiser-Meyer-Olkin (KMO) = 0.95 and Bartlett’s test result of 3.017 with P < 0.001) with varimax rotation was used. Optimal reduced solution, including 36 items and seven factors, was found in EFA. Five of the factors identified were reading/comprehension skills, individual empowerment, communication/decision-making skills, social empowerment and health knowledge. Conclusions: It was concluded that IHLQ might be a practical and useful tool for investigating HL for Persian language speakers around the world. Since HL is dynamic and its instruments should be regularly revised, further studies are recommended to assess HL with application of IHLQ to detect its potential imperfections. PMID:26290752
Report on Survey Implementation. Research Triangle Institute, Caliber Associates and Human Resources Research Organization

DTIC Science & Technology

1994-03-01

DISTRIBUTION /AVAILABILITY STATEMENT 12b. DISTRIBUTION CODE Approved for public release; distribution is unlimited. 13. ABSTRACT ( Maximum 200 words) This...different factors influence degree of readiness. The Army currently does not have an operational set of reliable, comprehensive, and valid measures of...the p.ro . . ....iky th .. ... - ou... ,,,,,,,uni.,.,ucceaa I&Ar COm.FI.,,, AIO W.Wa .’,U.J M-SS-oU, (b) the variable would be a valid indicator of
Digitised audio questionnaire for assessment of informed consent comprehension in a low-literacy African research population: development and psychometric evaluation

PubMed Central

Afolabi, Muhammed O; Bojang, Kalifa; D'Alessandro, Umberto; Ota, Martin O C; Imoukhuede, Egeruan B; Ravinetto, Raffaella; Larson, Heidi J; McGrath, Nuala; Chandramohan, Daniel

2014-01-01

Objective To develop and psychometrically evaluate an audio digitised tool for assessment of comprehension of informed consent among low-literacy Gambian research participants. Setting We conducted this study in the Gambia where a high illiteracy rate and absence of standardised writing formats of local languages pose major challenges for research participants to comprehend consent information. We developed a 34-item questionnaire to assess participants’ comprehension of key elements of informed consent. The questionnaire was face validated and content validated by experienced researchers. To bypass the challenge of a lack of standardised writing formats, we audiorecorded the questionnaire in three major Gambian languages: Mandinka, Wolof and Fula. The questionnaire was further developed into an audio computer-assisted interview format. Participants The digitised questionnaire was administered to 250 participants enrolled in two clinical trials in the urban and rural areas of the Gambia. One week after first administration, the questionnaire was readministered to half of the participants who were randomly selected. Participants were eligible if enrolled in the parent trials and could speak any of the three major Gambian languages. Outcome measure The primary outcome measure was reliability and validity of the questionnaire. Results Item reduction by factor analysis showed that 21 of the question items have strong factor loadings. These were retained along with five other items which were fundamental components of informed consent. The 26-item questionnaire has high internal consistency with a Cronbach's α of 0.73–0.79 and an intraclass correlation coefficient of 0.94 (95% CI 0.923 to 0.954). Hypotheses testing also showed that the questionnaire has a positive correlation with a similar questionnaire and discriminates between participants with and without education. Conclusions We have developed a reliable and valid measure of comprehension of informed consent information for the Gambian context, which might be easily adapted to similar settings. This is a major step towards engendering comprehension of informed consent information among low-literacy participants. PMID:24961716
Digitised audio questionnaire for assessment of informed consent comprehension in a low-literacy African research population: development and psychometric evaluation.

PubMed

Afolabi, Muhammed O; Bojang, Kalifa; D'Alessandro, Umberto; Ota, Martin O C; Imoukhuede, Egeruan B; Ravinetto, Raffaella; Larson, Heidi J; McGrath, Nuala; Chandramohan, Daniel

2014-06-24

To develop and psychometrically evaluate an audio digitised tool for assessment of comprehension of informed consent among low-literacy Gambian research participants. We conducted this study in the Gambia where a high illiteracy rate and absence of standardised writing formats of local languages pose major challenges for research participants to comprehend consent information. We developed a 34-item questionnaire to assess participants' comprehension of key elements of informed consent. The questionnaire was face validated and content validated by experienced researchers. To bypass the challenge of a lack of standardised writing formats, we audiorecorded the questionnaire in three major Gambian languages: Mandinka, Wolof and Fula. The questionnaire was further developed into an audio computer-assisted interview format. The digitised questionnaire was administered to 250 participants enrolled in two clinical trials in the urban and rural areas of the Gambia. One week after first administration, the questionnaire was readministered to half of the participants who were randomly selected. Participants were eligible if enrolled in the parent trials and could speak any of the three major Gambian languages. The primary outcome measure was reliability and validity of the questionnaire. Item reduction by factor analysis showed that 21 of the question items have strong factor loadings. These were retained along with five other items which were fundamental components of informed consent. The 26-item questionnaire has high internal consistency with a Cronbach's α of 0.73-0.79 and an intraclass correlation coefficient of 0.94 (95% CI 0.923 to 0.954). Hypotheses testing also showed that the questionnaire has a positive correlation with a similar questionnaire and discriminates between participants with and without education. We have developed a reliable and valid measure of comprehension of informed consent information for the Gambian context, which might be easily adapted to similar settings. This is a major step towards engendering comprehension of informed consent information among low-literacy participants. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
A Dynamic Speech Comprehension Test for Assessing Real-World Listening Ability.

PubMed

Best, Virginia; Keidser, Gitte; Freeston, Katrina; Buchholz, Jörg M

2016-07-01

Many listeners with hearing loss report particular difficulties with multitalker communication situations, but these difficulties are not well predicted using current clinical and laboratory assessment tools. The overall aim of this work is to create new speech tests that capture key aspects of multitalker communication situations and ultimately provide better predictions of real-world communication abilities and the effect of hearing aids. A test of ongoing speech comprehension introduced previously was extended to include naturalistic conversations between multiple talkers as targets, and a reverberant background environment containing competing conversations. In this article, we describe the development of this test and present a validation study. Thirty listeners with normal hearing participated in this study. Speech comprehension was measured for one-, two-, and three-talker passages at three different signal-to-noise ratios (SNRs), and working memory ability was measured using the reading span test. Analyses were conducted to examine passage equivalence, learning effects, and test-retest reliability, and to characterize the effects of number of talkers and SNR. Although we observed differences in difficulty across passages, it was possible to group the passages into four equivalent sets. Using this grouping, we achieved good test-retest reliability and observed no significant learning effects. Comprehension performance was sensitive to the SNR but did not decrease as the number of talkers increased. Individual performance showed associations with age and reading span score. This new dynamic speech comprehension test appears to be valid and suitable for experimental purposes. Further work will explore its utility as a tool for predicting real-world communication ability and hearing aid benefit. American Academy of Audiology.
Comprehensive Design Reliability Activities for Aerospace Propulsion Systems

NASA Technical Reports Server (NTRS)

Christenson, R. L.; Whitley, M. R.; Knight, K. C.

2000-01-01

This technical publication describes the methodology, model, software tool, input data, and analysis result that support aerospace design reliability studies. The focus of these activities is on propulsion systems mechanical design reliability. The goal of these activities is to support design from a reliability perspective. Paralleling performance analyses in schedule and method, this requires the proper use of metrics in a validated reliability model useful for design, sensitivity, and trade studies. Design reliability analysis in this view is one of several critical design functions. A design reliability method is detailed and two example analyses are provided-one qualitative and the other quantitative. The use of aerospace and commercial data sources for quantification is discussed and sources listed. A tool that was developed to support both types of analyses is presented. Finally, special topics discussed include the development of design criteria, issues of reliability quantification, quality control, and reliability verification.
Assessing patient-centered care: one approach to health disparities education.

PubMed

Wilkerson, LuAnn; Fung, Cha-Chi; May, Win; Elliott, Donna

2010-05-01

Patient-centered care has been described as one approach to cultural competency education that could reduce racial and ethnic health disparities by preparing providers to deliver care that is respectful and responsive to the preferences of each patient. In order to evaluate the effectiveness of a curriculum in teaching patient-centered care (PCC) behaviors to medical students, we drew on the work of Kleinman, Eisenberg, and Good to develop a scale that could be embedded across cases in an objective structured clinical examination (OSCE). To compare the reliability, validity, and feasibility of an embedded patient-centered care scale with the use of a single culturally challenging case in measuring students' use of PCC behaviors as part of a comprehensive OSCE. A total of 322 students from two California medical schools participated in the OSCE as beginning seniors. Cronbach's alpha was used to assess the internal consistency of each approach. Construct validity was addressed by establishing convergent and divergent validity using the cultural challenge case total score and OSCE component scores. Feasibility assessment considered cost and training needs for the standardized patients (SPs). Medical students demonstrated a moderate level of patient-centered skill (mean = 63%, SD = 11%). The PCC Scale demonstrated an acceptable level of internal consistency (alpha = 0.68) over the single case scale (alpha = 0.60). Both convergent and divergent validities were established through low to moderate correlation coefficients. The insertion of PCC items across multiple cases in a comprehensive OSCE can provide a reliable estimate of students' use of PCC behaviors without incurring extra costs associated with implementing a special cross-cultural OSCE. This approach is particularly feasible when an OSCE is already part of the standard assessment of clinical skills. Reliability may be increased with an additional investment in SP training.
Personality traits in companion dogs-Results from the VIDOPET.

PubMed

Turcsán, Borbála; Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A; Huber, Ludwig; Riemer, Stefanie

2018-01-01

Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs' personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years-a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners' assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs.
Personality traits in companion dogs—Results from the VIDOPET

PubMed Central

Wallis, Lisa; Virányi, Zsófia; Range, Friederike; Müller, Corsin A.; Huber, Ludwig; Riemer, Stefanie

2018-01-01

Individual behavioural differences in pet dogs are of great interest from a basic and applied research perspective. Most existing dog personality tests have specific (practical) goals in mind and so focused only on a limited aspect of dogs’ personality, such as identifying problematic (aggressive or fearful) behaviours, assessing suitability as working dogs, or improving the results of adoption. Here we aimed to create a comprehensive test of personality in pet dogs that goes beyond traditional practical evaluations by exposing pet dogs to a range of situations they might encounter in everyday life. The Vienna Dog Personality Test (VIDOPET) consists of 15 subtests and was performed on 217 pet dogs. A two-step data reduction procedure (principal component analysis on each subtest followed by an exploratory factor analysis on the subtest components) yielded five factors: Sociability-obedience, Activity-independence, Novelty seeking, Problem orientation, and Frustration tolerance. A comprehensive evaluation of reliability and validity measures demonstrated excellent inter- and intra-observer reliability and adequate internal consistency of all factors. Moreover the test showed good temporal consistency when re-testing a subsample of dogs after an average of 3.8 years—a considerably longer test-retest interval than assessed for any other dog personality test, to our knowledge. The construct validity of the test was investigated by analysing the correlations between the results of video coding and video rating methods and the owners’ assessment via a dog personality questionnaire. The results demonstrated good convergent as well as discriminant validity. To conclude, the VIDOPET is not only a highly reliable and valid tool for measuring dog personality, but also the first test to show consistent behavioural traits related to problem solving ability and frustration tolerance in pet dogs. PMID:29634747

Translation and validation of the Arab version of the Late-Life Function and Disability Instrument: a cross sectional study.

PubMed

Elboim-Gabyzon, Michal; Agmon, Maayan; Azaiza, Faisal; Laufer, Yocheved

2015-04-24

The Late-Life Function and Disability Instrument (LLFDI) provides a comprehensive, reliable, and valid assessment of physical function and disability in community-dwelling adults. There does not appear to be a validated, comprehensive instrument for assessing function and disability in Arabic. The objective of the present study was to translate and culturally adapt the LLFDI to Arabic, and to determine its test-retest reliability and validity. The LLFDI was translated to Arabic through a forward and backward translation process, and approved by a bilingual committee of experts. Sixty-one (26 male and 35 female) Arabic speaking, healthy, older adults, ages 65-88, living in northern Israel participated in the study. To determine test-retest reliability, the questionnaire was administered twice to 41 subjects with a 6 to 8day interval. Construct validity was examined by correlating the LLFDI responses with the 10-item physical function (PF-10) subscales of the General Health Survey (SF-36), with the physical component of SF-36 (SF-36 PCS), and with two performance measures, the Berg Balance Scale (BBS) and Time Up and Go (TUG) test. Additionally, gender and fall related differences in the LLFDI were also examined. Internal consistency (Cronbach's alpha) was good to excellent (0.77 to 0.97). Test-retest agreement was good to very good (function component: 0.86-0.93, disability component: 0.77-0.93). Correlation with the SF-36 PCS and PF-10 was moderate to strong for both LLFDI components (function, r = 0.53-0.65 and r = 0.57-0.63, and LLFDI disability, r = 0.57-0.76 and 0.53-0.73, respectively). Significant, moderate-to-strong correlations between the LLFDI and BBS (r = 0.73-0.87) and a significant, moderate, negative correlation between LLFDI and TUG test (r = -0.59- -0.68) were noted. The standard error of measure was 6-12%, and the smallest real difference was 18-33%. Discriminative validity for both gender and fall status were also demonstrated. The Arabic version of the LLFDI is a highly reliable and valid instrument for assessing function and disability in community dwelling, Arab older adults. The translated instrument has a discriminative ability between genders and between fallers and non-fallers. The translated instrument may be used in clinical settings and for research purposes.
Validity and reliability of a self-report instrument to assess social support and physical environmental correlates of physical activity in adolescents

PubMed Central

2012-01-01

Background The purpose of this study was to examine the internal consistency, test-retest reliability, construct validity and predictive validity of a new German self-report instrument to assess the influence of social support and the physical environment on physical activity in adolescents. Methods Based on theoretical consideration, the short scales on social support and physical environment were developed and cross-validated in two independent study samples of 9 to 17 year-old girls and boys. The longitudinal sample of Study I (n = 196) was recruited from a German comprehensive school, and subjects in this study completed the questionnaire twice with a between-test interval of seven days. Cronbach’s alphas were computed to determine the internal consistency of the factors. Test-retest reliability of the latent factors was assessed using intra-class coefficients. Factorial validity of the scales was assessed using principle components analysis. Construct validity was determined using a cross-validation technique by performing confirmatory factor analysis with the independent nationwide cross-sectional sample of Study II (n = 430). Correlations between factors and three measures of physical activity (objectively measured moderate-to-vigorous physical activity (MVPA), self-reported habitual MVPA and self-reported recent MVPA) were calculated to determine the predictive validity of the instrument. Results Construct validity of the social support scale (two factors: parental support and peer support) and the physical environment scale (four factors: convenience, public recreation facilities, safety and private sport providers) was shown. Both scales had moderate test-retest reliability. The factors of the social support scale also had good internal consistency and predictive validity. Internal consistency and predictive validity of the physical environment scale were low to acceptable. Conclusions The results of this study indicate moderate to good reliability and construct validity of the social support scale and physical environment scale. Predictive validity was only confirmed for the social support scale but not for the physical environment scale. Hence, it remains unclear if a person’s physical environment has a direct or an indirect effect on physical activity behavior or a moderation function. PMID:22928865
Validity and reliability of a self-report instrument to assess social support and physical environmental correlates of physical activity in adolescents.

PubMed

Reimers, Anne K; Jekauc, Darko; Mess, Filip; Mewes, Nadine; Woll, Alexander

2012-08-29

The purpose of this study was to examine the internal consistency, test-retest reliability, construct validity and predictive validity of a new German self-report instrument to assess the influence of social support and the physical environment on physical activity in adolescents. Based on theoretical consideration, the short scales on social support and physical environment were developed and cross-validated in two independent study samples of 9 to 17 year-old girls and boys. The longitudinal sample of Study I (n = 196) was recruited from a German comprehensive school, and subjects in this study completed the questionnaire twice with a between-test interval of seven days. Cronbach's alphas were computed to determine the internal consistency of the factors. Test-retest reliability of the latent factors was assessed using intra-class coefficients. Factorial validity of the scales was assessed using principle components analysis. Construct validity was determined using a cross-validation technique by performing confirmatory factor analysis with the independent nationwide cross-sectional sample of Study II (n = 430). Correlations between factors and three measures of physical activity (objectively measured moderate-to-vigorous physical activity (MVPA), self-reported habitual MVPA and self-reported recent MVPA) were calculated to determine the predictive validity of the instrument. Construct validity of the social support scale (two factors: parental support and peer support) and the physical environment scale (four factors: convenience, public recreation facilities, safety and private sport providers) was shown. Both scales had moderate test-retest reliability. The factors of the social support scale also had good internal consistency and predictive validity. Internal consistency and predictive validity of the physical environment scale were low to acceptable. The results of this study indicate moderate to good reliability and construct validity of the social support scale and physical environment scale. Predictive validity was only confirmed for the social support scale but not for the physical environment scale. Hence, it remains unclear if a person's physical environment has a direct or an indirect effect on physical activity behavior or a moderation function.
Assessing Learning, Quality and Engagement in Learning Objects: The Learning Object Evaluation Scale for Students (LOES-S)

ERIC Educational Resources Information Center

Kay, Robin H.; Knaack, Liesel

2009-01-01

Learning objects are interactive web-based tools that support the learning of specific concepts by enhancing, amplifying, and/or guiding the cognitive processes of learners. Research on the impact, effectiveness, and usefulness of learning objects is limited, partially because comprehensive, theoretically based, reliable, and valid evaluation…
A Model for Pain Behavior in Individuals with Intellectual and Developmental Disabilities

ERIC Educational Resources Information Center

Meir, Lotan; Strand, Liv Inger; Alice, Kvale

2012-01-01

The dearth of information on the pain experience of individuals with intellectual and developmental disabilities (IDD) calls for a more comprehensive understanding of pain in this population. The Non-Communicating Adults Pain Checklist (NCAPC) is an 18-item behavioral scale that was recently found to be reliable, valid, sensitive and clinically…
Perceived Personal and Social Competence: Development of Valid and Reliable Measures

ERIC Educational Resources Information Center

Fetro, Joyce V.; Rhodes, Darson L.; Hey, David W.

2010-01-01

During the last 20 years, youth programming has shifted from risk reduction to youth development. While numerous instruments exist to measure selected individual characteristics/competencies among youth, a comprehensive instrument to measure four constructs of personal and social skills could not be identified. The purpose of this study was to…
Measuring Graph Comprehension, Critique, and Construction in Science

ERIC Educational Resources Information Center

Lai, Kevin; Cabrera, Julio; Vitale, Jonathan M.; Madhok, Jacquie; Tinker, Robert; Linn, Marcia C.

2016-01-01

Interpreting and creating graphs plays a critical role in scientific practice. The K-12 Next Generation Science Standards call for students to use graphs for scientific modeling, reasoning, and communication. To measure progress on this dimension, we need valid and reliable measures of graph understanding in science. In this research, we designed…
Force Project Technology Presentation to the NRCC

DTIC Science & Technology

2014-02-04

Functional Bridge components Smart Odometer Adv Pretreatment Smart Bridge Multi-functional Gap Crossing Fuel Automated Tracking System Adv...comprehensive matrix of candidate composite material systems and textile reinforcement architectures via modeling/analyses and testing. Product(s...Validated Dynamic Modeling tool based on parametric study using material models to reliably predict the textile mechanics of the hose
[The Basel Interview for Psychosis (BIP): structure, reliability and validity].

PubMed

Riecher-Rössler, A; Ackermann, T; Uttinger, M; Ittig, S; Koranyi, S; Rapp, C; Bugra, H; Studerus, E

2015-02-01

Although several instruments have been developed to identify patients with an at-risk mental state (ARMS) for psychosis and first episode of psychosis (FEP), up to now there were no instruments for a detailed assessment of risk factors and indicators of emerging psychosis and the temporal development of psychiatric symptoms over the whole life span in these patients. We therefore developed the Basle Interview for Psychosis (BIP). The aim of this study is to describe the development of the BIP and to report about its psychometric properties. The BIP is a comprehensive semi-structured interview that was developed for the Basel early detection of psychoses (FePsy) study. Its items were derived from the most important risk factors and indicators of psychosis described in the literature and from several existing instruments. It contains the following six sections: 1) social and physical development and family, 2) signs and symptoms, 3) vulnerability, 4) help-seeking behavior, 5) illness insight, 6) evaluation of the interview. To estimate the inter-rater reliabilities of the items of sections 2 and 3, 20 interviews were conducted and rated by 8 well-trained raters. The factorial structure of the BIP section "signs and symptoms" was explored in a sample of 120 ARMS and 77 FEP patients. On the basis of the discovered factorial structure, we created new subscales and assessed their reliabilities and validities. Of the 153 studied items of sections 2 and 3, 150 (98 %) were rated with sufficiently high agreement (inter-rater reliability > 0.4). The items of section "signs and symptoms" could be grouped into 5 subscales with predominantly good to very good internal consistencies, homogeneities, and discriminant and convergent validities. Predictive validities could be demonstrated for the subscales "Positive Psychotic Symptoms", "Disturbance of Thinking" and the total score. The BIP is the first interview for comprehensively assessing risk factors and indicators of emerging psychosis and the temporal development of psychiatric symptoms over the whole life span, which has been validated in ARMS and FEP patients. We could show that the BIP has excellent psychometric properties. © Georg Thieme Verlag KG Stuttgart · New York.
Measuring quality of life in cleft lip and palate patients: currently available patient-reported outcomes measures.

PubMed

Eckstein, Donna A; Wu, Rebecca L; Akinbiyi, Takintope; Silver, Lester; Taub, Peter J

2011-11-01

Patient-reported outcomes in cleft lip and palate treatment are critical for patient care. Traditional surgical outcomes focused on objective measures, such as photographs, anatomic measurements, morbidity, and mortality. Although these remain important, they leave many questions unanswered. Surveys that include aesthetics, speech, functionality, self-image, and quality of life provide more thorough outcomes assessment. It is vital that reliable, valid, and comprehensive questionnaires are available to craniofacial surgeons. The authors performed a literature review to identify questionnaires validated in cleft lip and palate patients. Qualifying instruments were assessed for adherence to guidelines for development and validation by the scientific advisory committee and for content. The authors identified 44 measures used in cleft lip and palate studies. After 15 ad hoc questionnaires, eight generic instruments, 11 psychiatric instruments, and one non-English language questionnaire were excluded, nine measures remained. Of these, four were never validated in the cleft population. Analysis revealed one craniofacial-specific measure (Youth Quality of Life-Facial Differences), two voice-related measures (Patient Voice-Related Quality of Life and Cleft Audit Protocol for Speech-Augmented), and two oral health-related measures (Child Oral Health Impact Profile and Child Oral Health Quality of Life). The Youth Quality of Life-Facial Differences, Child Oral Health Impact Profile, and Child Oral Health Quality of Life questionnaires were sufficiently validated. None was created specifically for clefts, resulting in content limitations. There is a lack of comprehensive, valid, and reliable questionnaires for cleft lip and palate surgery. For thorough assessment of satisfaction, further research to develop and validate cleft lip and palate surgery-specific instruments is needed.
Comprehensive Deployment Method for Technical Characteristics Base on Multi-failure Modes Correlation Analysis

NASA Astrophysics Data System (ADS)

Zheng, W.; Gao, J. M.; Wang, R. X.; Chen, K.; Jiang, Y.

2017-12-01

This paper put forward a new method of technical characteristics deployment based on Reliability Function Deployment (RFD) by analysing the advantages and shortages of related research works on mechanical reliability design. The matrix decomposition structure of RFD was used to describe the correlative relation between failure mechanisms, soft failures and hard failures. By considering the correlation of multiple failure modes, the reliability loss of one failure mode to the whole part was defined, and a calculation and analysis model for reliability loss was presented. According to the reliability loss, the reliability index value of the whole part was allocated to each failure mode. On the basis of the deployment of reliability index value, the inverse reliability method was employed to acquire the values of technology characteristics. The feasibility and validity of proposed method were illustrated by a development case of machining centre’s transmission system.
Assessing therapy-relevant cognitive capacities in young people: development and psychometric evaluation of the self-reflection and insight scale for youth.

PubMed

Sauter, Floor M; Heyne, David; Blöte, Anke W; van Widenfelt, Brigit M; Westenberg, P Michiel

2010-05-01

The effectiveness of cognitive-behaviour therapy with young people may be influenced by a young person's capacity for self-reflection and insight. Clinicians who assess clients' proficiencies in these cognitive capacities can better tailor cognitive and behavioural techniques to the client, facilitating engagement and enhancing treatment outcome. It is therefore important that sound instruments for assessing self-reflection and insight in young people are available. The aim of the current study was to translate and adapt the Self-Reflection and Insight Scale (SRIS) for use with a child and adolescent population (Study 1), and to evaluate the psychometric properties of the resulting measure, the Self-Reflection and Insight Scale for Youth (SRIS-Y; Study 2). In Study 1 (n=145), the comprehensibility of the SRIS-Y was assessed in a community sample of children and adolescents. Study 2 (n=215) then explored the reliability and structural, convergent, and divergent validity of the SRIS-Y. The SRIS-Y was found to be comprehensible to young people, and had good reliability and structural validity. It appears that the SRIS-Y is a sound instrument for assessing therapy-relevant cognitive capacities in young people, of potential benefit in both research and clinical contexts. Future research foci include the predictive validity of the instrument.
Development and Validity Testing of the Worksite Health Index: An Assessment Tool to Help and Improve Korean Employees' Health-Related Outcome.

PubMed

Yun, Young Ho; Sim, Jin Ah; Lim, Ye Jin; Lim, Cheol Il; Kang, Sung-Choon; Kang, Joon-Ho; Park, Jun Dong; Noh, Dong Young

2016-06-01

The objective of this study was to develop the Worksite Health Index (WHI) and validate its psychometric properties. The development of the WHI questionnaire included item generation, item construction, and field testing. To assess the instrument's reliability and validity, we recruited 30 different Korean worksites. We developed the WHI questionnaire of 136 items categorized into five domains, namely Governance and Infrastructure, Need Assessment and Planning, Health Prevention and Promotion Program, Occupational Safety, and Monitoring and Feedback. All WHI domains demonstrated a high reliability with good internal consistency. The total WHI scores differentiated worksite groups effectively according to firm size. Each domain was associated significantly with employees' health status, absence, and financial outcome. The WHI can assess comprehensive worksite health programs. This tool is publicly available for addressing the growing need for worksite health programs.
A Measure of Perceived Argument Strength: Reliability and Validity

PubMed Central

Zhao, Xiaoquan; Strasser, Andrew; Cappella, Joseph N.; Lerman, Caryn; Fishbein, Martin

2014-01-01

Studies of the content of persuasive messages in which the central arguments of the message are scrutinized have traditionally relied on the technique of thought-listing to assess argument strength. Although the validity of the thought-listing procedure is well documented, its utility can be limited in situations involving non-adult populations and sensitive topics. In this paper we present a self-reported scale that can be used to assess perceived argument strength in contexts where thought-listing may be less appropriate. This scale taps into perceived argument strength from multiple points of view, including but also extending beyond the potential of the argument to elicit positive and negative thoughts. Reliability and validity of this scale were assessed in health communication contexts involving anti-drug PSAs directed at adolescents and anti-smoking PSAs targeting adults. Evidence of convergence between this scale and the thought-listing technique was also obtained using the classical comprehensive exam arguments. PMID:25568663
Construct validity of the individual work performance questionnaire.

PubMed

Koopmans, Linda; Bernaards, Claire M; Hildebrandt, Vincent H; de Vet, Henrica C W; van der Beek, Allard J

2014-03-01

To examine the construct validity of the Individual Work Performance Questionnaire (IWPQ). A total of 1424 Dutch workers from three occupational sectors (blue, pink, and white collar) participated in the study. First, IWPQ scores were correlated with related constructs (convergent validity). Second, differences between known groups were tested (discriminative validity). First, IWPQ scores correlated weakly to moderately with absolute and relative presenteeism, and work engagement. Second, significant differences in IWPQ scores were observed for workers differing in job satisfaction, and workers differing in health. Overall, the results indicate acceptable construct validity of the IWPQ. Researchers are provided with a reliable and valid instrument to measure individual work performance comprehensively and generically, among workers from different occupational sectors, with and without health problems.
[Development of a questionnaire to measure family stress among married working women].

PubMed

Kim, Gwang Suk; Cho, Won Jung

2006-08-01

Even though a number of studies have suggested that appropriate measuring instruments of family stress for working women have to be developed, the validity and reliability of the instruments used have not been consistently examined. The purpose of the present study was to develop a sensitive instrument to measure family stress for married working women, and to test the validity and reliability of the instrument. The items generated for this instrument were drawn from a comprehensive literature review. Twenty four items were developed through evaluation by 10 experts and twenty one items were finally confirmed through item analysis. Psychometric testing was preformed and confirmed with a convenient sample of 240 women employed in the industrial sector. Four factors evolved by factor analysis, which explained 50.5% of the total variance. The first factor 'Cooperation' explained 28.1%, 2nd factor 'Satisfaction with relationships' 10.6%, 3rd factor 'Democratic and comfortable environment' 6.3%, and 4th factor 'Disturbance of own living' 5.5%. Cronbach's coefficient of this instrument was 0.86. The study supports the validity and reliability of the instrument.
Cross-cultural adaptation of the Portuguese version of the Patient-Generated Subjective Global Assessment.

PubMed

Duarte Bonini Campos, J A; Dias do Prado, C

2012-01-01

The cross-cultural adaptation of the Patient-Generated Subjective Global Assessment is important so it can be used with confidence in Portuguese language. To perform a cross-cultural adaptation of the Portuguese version of the Patient-Generated Subjective Global Assessment and to estimate its intrarater reliability. This is a validation study. Face Validity was classified by 17 health professionals and 10 Portuguese language specialists. Idiomatic, semantic, cultural and conceptual equivalences were analyzed. The questionnaire was completed by 20 patients of the Amaral Carvalho Hospital (Jaú, São Paulo, Brazil) in order to verify the Comprehension Index of each item. Therefore, 27 committee members classified each item into "essential", "useful, but not essential" and "not necessary", in order to calculate the Content Validity Ratio. After, this version of the questionnaire was applied twice to 62 patients of the hospital cited above. The intrarater reliability of the nutritional status analyzed by Patient-Generated Subjective Global Assessment was estimated by Kappa statistics. The Portuguese version of the Patient-Generated Subjective Global Assessment presented 10 incomprehensible expressions. The items "a year ago weight" and "dry mouth symptom" presented the lowest Content Validity Ratio. Substantial intrarater reliability (k = 0.78, p = 0.001) was observed. The cross-cultural adaptation of the Portuguese version of the Patient-Generated Subjective Global Assessment became simple and understandable for Brazilian patients. Thus, this version of the Patient-Generated Subjective Global Assessment was considered a valid and a reliable method.
Reliability, validity and minimal detectable change of computerized respiratory sounds in patients with chronic obstructive pulmonary disease.

PubMed

Oliveira, Ana; Lage, Susan; Rodrigues, João; Marques, Alda

2017-11-17

Computerized respiratory sounds (CRS) are closely related to the movement of air within the tracheobronchial tree and are promising outcome measures in patients with chronic obstructive pulmonary disease (COPD). However, CRS measurement properties have been poorly tested. The aim of this study was to assess the reliability, validity and the minimal detectable changes (MDC) of CRS in patients with stable COPD. Fifty patients (36♂, 67.26 ± 9.31y, FEV 1 49.52 ± 19.67%predicted) were enrolled. CRS were recorded simultaneously at seven anatomic locations (trachea; right and left anterior, lateral and posterior chest). The number of crackles, wheeze occupation rate, median frequency (F50) and maximum intensity (Imax) were processed using validated algorithms. Within-day and between-days reliability, criterion and construct validity, validity to predict exacerbations and MDC were established. CRS presented moderate-to-excellent within-day reliability (ICC 1,3 ≥ 0.51; P < .05) and moderate-to-good between-days reliability (ICC 1,2 ≥ 0.47; P < .05) for most locations. Negligible-to-moderate correlations with FEV 1 %predicted were found (-0.53 < r s < -0.28; P < .05), and the inspiratory number of crackles were the best discriminator between mild-to-moderate and severe-to-very severe airflow limitations (area under the curve >0.78). CRS correlated poorly with patient-reported outcomes (r s < 0.48; P < .05) and did not predict exacerbations. Inspiratory number of crackles at posterior right chest, inspiratory F50 at trachea and anterior left chest and expiratory Imax at anterior right chest were simultaneously reliable and valid, and their MDC were 2.41, 55.27, 29.55 and 3.98, respectively. CRS are reliable and valid. Their use, integrated with other clinical and patient-reported measures, may fill the gap of assessing small airways and contribute toward a patient's comprehensive evaluation. © 2017 John Wiley & Sons Ltd.
Development of the Patient Education Materials Assessment Tool (PEMAT): A new measure of understandability and actionability for print and audiovisual patient information

PubMed Central

Shoemaker, Sarah J.; Wolf, Michael S.; Brach, Cindy

2016-01-01

Objective To develop a reliable and valid instrument to assess the understandability and actionability of print and audiovisual materials. Methods We compiled items from existing instruments/guides that the expert panel assessed for face/content validity. We completed four rounds of reliability testing, and produced evidence of construct validity with consumers and readability assessments. Results The experts deemed the PEMAT items face/content valid. Four rounds of reliability testing and refinement were conducted using raters untrained on the PEMAT. Agreement improved across rounds. The final PEMAT showed moderate agreement per Kappa (Average K = 0.57) and strong agreement per Gwet’s AC1 (Average = 0.74). Internal consistency was strong (α = 0.71; Average Item-Total Correlation = 0.62). For construct validation with consumers (n = 47), we found significant differences between actionable and poorly-actionable materials in comprehension scores (76% vs. 63%, p < 0.05) and ratings (8.9 vs. 7.7, p < 0.05). For understandability, there was a significant difference for only one of two topics on consumer numeric scores. For actionability, there were significant positive correlations between PEMAT scores and consumer-testing results, but no relationship for understandability. There were, however, strong, negative correlations between grade-level and both consumer-testing results and PEMAT scores. Conclusions The PEMAT demonstrated strong internal consistency, reliability, and evidence of construct validity. Practice implications The PEMAT can help professionals judge the quality of materials (available at: http://www.ahrq.gov/pemat). PMID:24973195
Development of the Patient Education Materials Assessment Tool (PEMAT): a new measure of understandability and actionability for print and audiovisual patient information.

PubMed

Shoemaker, Sarah J; Wolf, Michael S; Brach, Cindy

2014-09-01

To develop a reliable and valid instrument to assess the understandability and actionability of print and audiovisual materials. We compiled items from existing instruments/guides that the expert panel assessed for face/content validity. We completed four rounds of reliability testing, and produced evidence of construct validity with consumers and readability assessments. The experts deemed the PEMAT items face/content valid. Four rounds of reliability testing and refinement were conducted using raters untrained on the PEMAT. Agreement improved across rounds. The final PEMAT showed moderate agreement per Kappa (Average K=0.57) and strong agreement per Gwet's AC1 (Average=0.74). Internal consistency was strong (α=0.71; Average Item-Total Correlation=0.62). For construct validation with consumers (n=47), we found significant differences between actionable and poorly-actionable materials in comprehension scores (76% vs. 63%, p<0.05) and ratings (8.9 vs. 7.7, p<0.05). For understandability, there was a significant difference for only one of two topics on consumer numeric scores. For actionability, there were significant positive correlations between PEMAT scores and consumer-testing results, but no relationship for understandability. There were, however, strong, negative correlations between grade-level and both consumer-testing results and PEMAT scores. The PEMAT demonstrated strong internal consistency, reliability, and evidence of construct validity. The PEMAT can help professionals judge the quality of materials (available at: http://www.ahrq.gov/pemat). Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

The development of a multimedia online language assessment tool for young children with autism.

PubMed

Lin, Chu-Sui; Chang, Shu-Hui; Liou, Wen-Ying; Tsai, Yu-Show

2013-10-01

This study aimed to provide early childhood special education professionals with a standardized and comprehensive language assessment tool for the early identification of language learning characteristics (e.g., hyperlexia) of young children with autism. In this study, we used computer technology to develop a multi-media online language assessment tool that presents auditory or visual stimuli. This online comprehensive language assessment consists of six subtests: decoding, homographs, auditory vocabulary comprehension, visual vocabulary comprehension, auditory sentence comprehension, and visual sentence comprehension. Three hundred typically developing children and 35 children with autism from Tao-Yuan County in Taiwan aged 4-6 participated in this study. The Cronbach α values of the six subtests ranged from .64 to .97. The variance explained by the six subtests ranged from 14% to 56%, the current validity of each subtest with the Peabody Picture Vocabulary Test-Revised ranged from .21 to .45, and the predictive validity of each subtest with WISC-III ranged from .47 to .75. This assessment tool was also found to be able to accurately differentiate children with autism up to 92%. These results indicate that this assessment tool has both adequate reliability and validity. Additionally, 35 children with autism have completed the entire assessment in this study without exhibiting any extremely troubling behaviors. However, future research is needed to increase the sample size of both typically developing children and young children with autism and to overcome the technical challenges associated with internet issues. Copyright © 2013 Elsevier Ltd. All rights reserved.
Comprehensive quantification of the spastic catch in children with cerebral palsy.

PubMed

Lynn, Bar-On; Erwin, Aertbeliën; Guy, Molenaers; Herman, Bruyninckx; Davide, Monari; Ellen, Jaspers; Anne, Cazaerck; Kaat, Desloovere

2013-01-01

In clinical settings, the spastic catch is judged subjectively. This study assessed the psychometric properties of objective parameters that define and quantify the severity of the spastic catch in children with cerebral palsy (CP). A convenience sample of children with spastic CP (N=46; age range: 4-16 years) underwent objective spasticity assessments. High velocity, passive stretches were applied to the gastrocnemius (GAS) and medial hamstrings (MEH). Muscle activity was measured with surface electromyography (sEMG), joint angle characteristics using inertial sensors and reactive torque using a force sensor. To test reliability, a group of 12 children were retested after an average of 13 ± 9 days. The angle of spastic catch (AOC) was estimated by three biomechanical definitions: joint angle at (1) maximum angular deceleration; (2) maximum change in torque; and (3) minimum power. Each definition was checked for reliability and validity. Construct and clinical validity were evaluated by correlating each AOC definition to the averaged root mean square envelope of EMG (RMS-EMG) and the Modified Tardieu Scale (MTS). Severity categories were created based on selected parameters to establish face validity. All definitions showed moderate to high reliability. Significant correlations were found between AOC3 and the MTS of both muscles and the RMS-EMG of the MEH, though coefficients were only weak. AOC3 further distinguished between mild, moderate and severe catches. Objective parameters can define and quantify the severity of the spastic catch in children with CP. However, a comprehensive understanding requires the integration of both biomechanical and RMS-EMG data. Copyright © 2012 Elsevier Ltd. All rights reserved.
Quality assessment of gasoline using comprehensive two-dimensional gas chromatography combined with unfolded partial least squares: A reliable approach for the detection of gasoline adulteration.

PubMed

Parastar, Hadi; Mostafapour, Sara; Azimi, Gholamhasan

2016-01-01

Comprehensive two-dimensional gas chromatography and flame ionization detection combined with unfolded-partial least squares is proposed as a simple, fast and reliable method to assess the quality of gasoline and to detect its potential adulterants. The data for the calibration set are first baseline corrected using a two-dimensional asymmetric least squares algorithm. The number of significant partial least squares components to build the model is determined using the minimum value of root-mean square error of leave-one out cross validation, which was 4. In this regard, blends of gasoline with kerosene, white spirit and paint thinner as frequently used adulterants are used to make calibration samples. Appropriate statistical parameters of regression coefficient of 0.996-0.998, root-mean square error of prediction of 0.005-0.010 and relative error of prediction of 1.54-3.82% for the calibration set show the reliability of the developed method. In addition, the developed method is externally validated with three samples in validation set (with a relative error of prediction below 10.0%). Finally, to test the applicability of the proposed strategy for the analysis of real samples, five real gasoline samples collected from gas stations are used for this purpose and the gasoline proportions were in range of 70-85%. Also, the relative standard deviations were below 8.5% for different samples in the prediction set. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A systematic review of reliable and valid tools for the measurement of patient participation in healthcare.

PubMed

Phillips, Nicole Margaret; Street, Maryann; Haesler, Emily

2016-02-01

Patient participation in healthcare is recognised internationally as essential for consumer-centric, high-quality healthcare delivery. Its measurement as part of continuous quality improvement requires development of agreed standards and measurable indicators. This systematic review sought to identify strategies to measure patient participation in healthcare and to report their reliability and validity. In the context of this review, patient participation was constructed as shared decision-making, acknowledging the patient as having critical knowledge regarding their own health and care needs and promoting self-care/autonomy. Following a comprehensive search, studies reporting reliability or validity of an instrument used in a healthcare setting to measure patient participation, published in English between January 2004 and March 2014 were eligible for inclusion. From an initial search, which identified 1582 studies, 156 studies were retrieved and screened against inclusion criteria. Thirty-three studies reporting 24 patient participation measurement tools met inclusion criteria, and were critically appraised. The majority of studies were descriptive psychometric studies using prospective, cross-sectional designs. Almost all the tools completed by patients, family caregivers, observers or more than one stakeholder focused on aspects of patient-professional communication. Few tools designed for completion by patients or family caregivers provided valid and reliable measures of patient participation. There was low correlation between many of the tools and other measures of patient satisfaction. Few reliable and valid tools for measurement of patient participation in healthcare have been recently developed. Of those reported in this review, the dyadic Observing Patient Involvement in Decision Making (dyadic-OPTION) tool presents the most promise for measuring core components of patient participation. There remains a need for further study into valid, reliable and feasible strategies for measuring patient participation as part of continuous quality improvement. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Assessment of NDE reliability data

NASA Technical Reports Server (NTRS)

Yee, B. G. W.; Couchman, J. C.; Chang, F. H.; Packman, D. F.

1975-01-01

Twenty sets of relevant nondestructive test (NDT) reliability data were identified, collected, compiled, and categorized. A criterion for the selection of data for statistical analysis considerations was formulated, and a model to grade the quality and validity of the data sets was developed. Data input formats, which record the pertinent parameters of the defect/specimen and inspection procedures, were formulated for each NDE method. A comprehensive computer program was written and debugged to calculate the probability of flaw detection at several confidence limits by the binomial distribution. This program also selects the desired data sets for pooling and tests the statistical pooling criteria before calculating the composite detection reliability. An example of the calculated reliability of crack detection in bolt holes by an automatic eddy current method is presented.
Validity and reliability of the Spanish version of the Organizational Readiness for Knowledge Translation (OR4KT) questionnaire.

PubMed

Grandes, Gonzalo; Bully, Paola; Martinez, Catalina; Gagnon, Marie-Pierre

2017-11-10

Organizational readiness to change healthcare practice is a major determinant of successful implementation of evidence-based interventions. However, we lack of comprehensive, valid, and reliable instruments to measure it. We assessed the validity and reliability of the Spanish version of the Organizational Readiness for Knowledge Translation (OR4KT) questionnaire in the context of the implementation of the Prescribe Vida Saludable III project, which seeks to strengthen health promotion and chronic disease prevention in primary healthcare organizations of the Osakidetza (Basque Health Service, Spain). A cross-sectional study was conducted including 127 professionals from 20 primary care centers within Osakidetza. They filled in the OR4KT questionnaire twice in a 15- to 30-day period to test repeatability. In addition, we used the Survey of Organizational Attributes for Primary Care (SOAPC) and we documented the number of healthcare professionals who formally engaged in the Prescribe Vida Saludable III project within each participating center to assess concurrent validity. Cronbach's alpha for the overall OR4KT was .95, and the overall repeatability coefficient was 6.95%, both excellent results. Confirmatory factor analysis supported the underlying theoretical structure of 6 dimensions and 23 sub-dimensions. There were positive moderate-to-high internal correlations between these six dimensions, and there was evidence of good concurrent validity (correlation coefficient of .76 with SOAPC, and .80 with the proportion of professionals engaged by center). A score higher than 64 (out of 100) would be indicative of an organization with high level of readiness to implement the intervention (sensitivity = .75, specificity = 1). The Spanish version of the OR4KT exhibits very strong reliability and good validity, although it needs to be validated in a larger sample and in different implementation contexts.
Development and validation of a pediatric sports activity rating scale: the Hospital for Special Surgery Pediatric Functional Activity Brief Scale (HSS Pedi-FABS).

PubMed

Fabricant, Peter D; Robles, Alex; Downey-Zayas, Timothy; Do, Huong T; Marx, Robert G; Widmann, Roger F; Green, Daniel W

2013-10-01

Having simple and reliable validated outcome measures is vital to conducting high-quality outcomes research in the field of orthopaedic surgery. Activity level is a key prognostic variable for patients with sports injuries. There is a paucity of such activity scales for children and adolescents who are otherwise healthy and athletically active. In addition to frequency and intensity of athletic activity, level of play and coach/trainer supervision are important variables unique to children and adolescents that are not captured in available adult scoring systems. To create and validate a concise and comprehensive activity rating scale for athletically active children and adolescents 10 to 18 years of age. Cohort study (diagnosis); Level of evidence, 2. Item generation was performed with a panel of orthopaedic surgeons and adolescent athletes. Item reduction, pilot testing and scale refinement resulted in a final 8-item instrument, the Hospital for Special Surgery Pediatric Functional Activity Brief Scale (HSS Pedi-FABS). Existing methods were used to determine reliability and validation. The Flesch-Kincaid score was calculated at a 6.6th-grade reading level (approximately 13 years old); therefore, although all subjects provided their own answers, parents were allowed to assist children younger than 13 years with reading the questionnaire. Scale reliability was excellent (test-retest reliability, intraclass correlation coefficient = 0.91; internal consistency, Cronbach alpha = .914), and there were no floor or ceiling effects. There was also robust construct validity: Convergent validity testing revealed positive correlations between the HSS Pedi-FABS and level of competition in athletic activity, number of reported hours of athletic activity per week, and existing comparable adult and pediatric scales. Discriminant validity was shown with age, body mass index, and type of sport as measured by the Daniel scale. The 8-item HSS Pedi-FABS can be used to reliably and accurately evaluate activity level as a prognostic variable for clinical research studies. It is a simple, reliable, and valid metric to assess activity in children and adolescents 10 to 18 years of age. This instrument will lead to better evaluation of posttreatment outcomes and patient-reported activity for child and adolescent athletes.
Evaluating Effective Teaching in College Level Economics Using Student Ratings of Instruction: A Factor Analytic Approach

ERIC Educational Resources Information Center

Agbetsiafa, Douglas

2010-01-01

This paper explores the factors that affect students' evaluation of economic instruction using a sample of 1300 completed rating instruments at a comprehensive four-year mid-western public university. The study uses factor analysis to determine the validity and reliability of the evaluation instrument in assessing instructor or course…
Initial Validity and Reliability of the SCCAN: Using Tailored Testing to Assess Adult Cognition and Communication

ERIC Educational Resources Information Center

Milman, Lisa H.; Holland, Audrey; Kaszniak, Alfred W.; D'Agostino, Jerry; Garrett, Merrill; Rapcsak, Steve

2008-01-01

Purpose: The Scales of Cognitive and Communicative Ability for Neurorehabilitation (SCCAN; L. Milman & A. Holland, 2007) was developed in the hospital setting to address changes in assessment practice. The SCCAN was designed to provide an overview of impairment and activity limitations across 8 cognitive scales (Speech Comprehension, Oral…
Identifying Psychometric Properties of the Social-Emotional Learning Skills Scale

ERIC Educational Resources Information Center

Esen-Aygun, Hanife; Sahin-Taskin, Cigdem

2017-01-01

This study aims to develop a comprehensive scale of social-emotional learning. After constructing a wide range of item pool and expertise evaluation, validity and reliability studies were carried out through using the data-set of 439 primary school students at 3rd and 4th grade levels. Exploratory and confirmatory factor analysis results revealed…
The Complexity of School Racial Climate: Reliability and Validity of a New Measure for Secondary Students

ERIC Educational Resources Information Center

Byrd, Christy M.

2017-01-01

Background: The conceptualization of the role of race and culture in students' experience of school has been limited. This study presents a more comprehensive and multidimensional framework than previously conceptualized and includes the two domains of (1) intergroup interactions (frequency of interaction, quality of interaction, equal status, and…
Comprehensive Evaluation Criteria for English Learning Websites Using Expert Validity Surveys

ERIC Educational Resources Information Center

Yang, Ya-Ting C.; Chan, Chia-Ying

2008-01-01

This study aimed to develop a set of evaluation criteria for English learning websites. These criteria can assist English teachers/web designers in designing effective websites for their English courses and can also guide English learners in screening for appropriate and reliable websites to use in increasing their English ability. To fulfill our…
Language Assessment of a Farsi-Norwegian Bilingual Speaker with Aphasia

ERIC Educational Resources Information Center

Koumanidi Knoph, Monica I.

2011-01-01

The increased occurrence of strokes combined with the high incidence of bilingualism in many regions of the world has led to an increasing number of bilingual adults with aphasia. The literature on bilingual aphasia shows the need for valid, comprehensive and reliable assessment tools for diagnostic and treatment purposes. In spite of a growing…
Reliability and Validity of the CTOPP Elision and Blending Words Subtests for Struggling Adult Readers

ERIC Educational Resources Information Center

Nanda, Alice O.; Greenberg, Daphne; Morris, Robin D.

2014-01-01

Almost half of American adults struggle with reading but there is a dearth of reading-related assessments for these adults. In turn, researchers and practitioners use assessments designed for children with these adults. This study examined the psychometric and descriptive attributes of the Comprehensive Test of Phonological Processing (CTOPP)…
Development and Validity Testing of an Arthritis Self-Management Assessment Tool.

PubMed

Oh, HyunSoo; Han, SunYoung; Kim, SooHyun; Seo, WhaSook

Because of the chronic, progressive nature of arthritis and the substantial effects it has on quality of life, patients may benefit from self-management. However, no valid, reliable self-management assessment tool has been devised for patients with arthritis. This study was conducted to develop a comprehensive self-management assessment tool for patients with arthritis, that is, the Arthritis Self-Management Assessment Tool (ASMAT). To develop a list of qualified items corresponding to the conceptual definitions and attributes of arthritis self-management, a measurement model was established on the basis of theoretical and empirical foundations. Content validity testing was conducted to evaluate whether listed items were suitable for assessing arthritis self-management. Construct validity and reliability of the ASMAT were tested. Construct validity was examined using confirmatory factor analysis and nomological validity. The 32-item ASMAT was developed with a sample composed of patients in a clinic in South Korea. Content validity testing validated the 32 items, which comprised medical (10 items), behavioral (13 items), and psychoemotional (9 items) management subscales. Construct validity testing of the ASMAT showed that the 32 items properly corresponded with conceptual constructs of arthritis self-management, and were suitable for assessing self-management ability in patients with arthritis. Reliability was also well supported. The ASMAT devised in the present study may aid the evaluation of patient self-management ability and the effectiveness of self-management interventions. The authors believe the developed tool may also aid the identification of problems associated with the adoption of self-management practice, and thus improve symptom management, independence, and quality of life of patients with arthritis.
Questionnaire to assess patient satisfaction with pharmaceutical care in Spanish language.

PubMed

Traverso, María Luz; Salamano, Mercedes; Botta, Carina; Colautti, Marisel; Palchik, Valeria; Pérez, Beatriz

2007-08-01

To develop and validate a questionnaire, in Spanish, for assessing patient satisfaction with pharmaceutical care received in community pharmacies. Selection and translation of questionnaire's items; definition of response scale and demographic questions. Evaluation of face and content validity, feasibility, factor structure, reliability and construct validity. Forty-one community pharmacies of the province of Santa Fe. Argentina. Questionnaire administered to patients receiving pharmaceutical care or traditional pharmacy services. Pilot test to assess feasibility. Factor analysis used principal components and varimax rotation. Reliability established using internal consistency with Cronbach's alpha. Construct validity determined with extreme group method. A self-administered questionnaire with 27 items, 5-point Likert response scale and demographic questions was designed considering multidimensional structure of patient satisfaction. Questionnaire evaluates cumulative experience of patients with comprehensive pharmaceutical care practice in community pharmacies. Two hundred and seventy-four complete questionnaires were obtained. Factor analysis resulted in three factors: Managing therapy, Interpersonal relationship and General satisfaction, with a cumulative variance of 62.51%. Cronbach's alpha for the whole questionnaire was 0.96, and 0.95, 0.88 and 0.76 for the three factors, respectively. Mann-Whitney test for construct validity did not showed significant differences between pharmacies that provide pharmaceutical care and those that do not, however, 23 items showed significant differences between the two groups of pharmacies. The questionnaire developed can be a reliable and valid instrument to assess patient satisfaction with pharmaceutical care in community pharmacies in Spanish. Further research is needed to deepen the validation process.
An evaluation of Wikipedia as a resource for patient education in nephrology.

PubMed

Thomas, Garry R; Eng, Lawson; de Wolff, Jacob F; Grover, Samir C

2013-01-01

Wikipedia, a multilingual online encyclopedia, is a common starting point for patient medical searches. As its articles can be authored and edited by anyone worldwide, the credibility of the medical content of Wikipedia has been openly questioned. Wikipedia medical articles have also been criticized as too advanced for the general public. This study assesses the comprehensiveness, reliability, and readability of nephrology articles on Wikipedia. The International Statistical Classification of Diseases and Related problems, 10th Edition (ICD-10) diagnostic codes for nephrology (N00-N29.8) were used as a topic list to investigate the English Wikipedia database. Comprehensiveness was assessed by the proportion of ICD-10 codes that had corresponding articles. Reliability was measured by both the number of references per article and proportion of references from substantiated sources. Finally, readability was assessed using three validated indices (Flesch-Kincaid grade level, Automated readability index, and Flesch reading ease). Nephrology articles on Wikipedia were relatively comprehensive, with 70.5% of ICD-10 codes being represented. The articles were fairly reliable, with 7.1 ± 9.8 (mean ± SD) references per article, of which 59.7 ± 35.0% were substantiated references. Finally, all three readability indices determined that nephrology articles are written at a college level. Wikipedia is a comprehensive and fairly reliable medical resource for nephrology patients that is written at a college reading level. Accessibility of this information for the general public may be improved by hosting it at alternative Wikipedias targeted at a lower reading level, such as the Simple English Wikipedia. © 2013 Wiley Periodicals, Inc.
TENI: A comprehensive battery for cognitive assessment based on games and technology.

PubMed

Delgado, Marcela Tenorio; Uribe, Paulina Arango; Alonso, Andrés Aparicio; Díaz, Ricardo Rosas

2016-01-01

TENI (Test de Evaluación Neuropsicológica Infantil) is an instrument developed to assess cognitive abilities in children between 3 and 9 years of age. It is based on a model that incorporates games and technology as tools to improve the assessment of children's capacities. The test was standardized with two Chilean samples of 524 and 82 children living in urban zones. Evidence of reliability and validity based on current standards is presented. Data show good levels of reliability for all subtests. Some evidence of validity in terms of content, test structure, and association with other variables is presented. This instrument represents a novel approach and a new frontier in cognitive assessment. Further studies with clinical, rural, and cross-cultural populations are required.
Cultural Adaptation and Reliability of the Compliance with Standard Precautions Scale (CSPS) for Nurses in Brazil 1

PubMed Central

Pereira, Fernanda Maria Vieira; Lam, Simon Ching; Gir, Elucir

2017-01-01

ABSTRACT Objective: this study aimed to carry of the cultural adaptation and to evaluate the reliability of the Compliance with Standard Precautions Scale (CSPS) for nurses in Brazil. Method: the adaptation process entailed translation, consensus among judges, back-translation, semantic validation and pretest. The reliability was evaluated by internal consistency (Cronbach alpha) and stability (test-retest). The instrument was administered to a sample group of 300 nurses who worked in a large hospital located in the city of São Paulo/SP, Brazil. Results: through the semantic validation, the items from the scale were considered understandable and deemed important for the nurse´s clinical practice. The CSPS Brazilian Portuguese version (CSPS-PB) revealed excellent interpretability. The Cronbach`s alpha was 0.61 and the intraclass correlation coefficient was 0.85. Conclusion: the initial study showed that CSPS-PB is appropriate to assess compliance with standard precautions among nurses in Brazil. The reliability was considered acceptable. Furhter study is necessary to evaluate its comprehensive psychometric properties. PMID:28301030
[Reliability and validity of Meaningful Life Measure-Chinese Revised in Chinese college students].

PubMed

Xiao, Rong; Lai, Qiao-Zhen; Yang, Jia-Ping

2016-04-20

To test the reliability and validity of Meaningful Life Measure-Chinese Revised (MLM-CR) in Chinese college students. A total of 1035 college students were evaluated with MLM-CR, Satisfaction with Life Scale (SWLS), Purpose in Life (PIL) and Patient Health Questionnaire-2 (PHQ-2), and 120 of the students were examined with PIL-SF twice. All the items in MLM-CR had good discrimination indexes (r=0.753-0.838, P<0.001). Confirmatory factor analysis confirmed the hypothesized five-factor model of MLM-CR (Χ 2 /df=3.4, GFI=0.946, AGFI=0.924, RMR=0.069, NFI=0.953, CFI=0.966, RMSEA=0.048). The total internal consistency reliability of MLM-CR was 0.942, and the alpha coefficients of the 5 dimensions ranged from 0.782 to 0.877; the total split-half reliability was 0.920, and the split-half reliability of the 5 dimensions ranged from 0.752 to 0.830; the total test-retest reliability was 0.871, and the test-retest reliability of the 5 dimensions ranged from 0.783 to 0.805. The criterion validity of MLM-CR in correlation with SWLS, PIL and PHQ-2 was 0.66, 0.755 and -0.388, respectively (P<0.01). The Average score of MLM-CR of the college students was 5.20∓0.90, and the scores were significantly higher in female students than in the male students (P<0.001). MLM-CR has good psychometric properties for application in comprehensive evaluation of personal meaning in life.

Italian version of the organic brain syndrome and the depression scales from the CARE: evaluation of their performance in geriatric institutions.

PubMed

Spagnoli, A; Foresti, G; MacDonald, A; Williams, P

1987-05-01

The Organic Brain Syndrome (OBS) and the Depression (D) scales derived from the Comprehensive Assessment and Referral Evaluation (CARE) were translated into Italian and used in a survey of geriatric institutions in Milan. During the survey validity and reliability tests of the scales were conducted. Inter-rater reliability (total score weighted kappa) was highly satisfactory for both scales (0.96 for OBS and 0.83 for D scale). Reliability was assessed three times during the survey and showed good stability for both scales, with a slight but significant trend towards reduction over time for the D scale. Reliability of the D scale was significantly lower when the subjects interviewed scored highly on the OBS scale (severe cognitive impairment). Criterion validity was highly satisfactory both for the OBS scale (cut-off point 4/5: sensitivity 77%, specificity 96%, positive predictive value 91%) and the D scale (cut-off point 10/11: sensitivity 95%, specificity 92%, positive predictive value 84%). Results are discussed with special reference to longitudinal assessment of reliability, the choice of the cut-off point, and the context-dependent properties of questionnaires.
Development of an opioid-related Overdose Risk Behavior Scale (ORBS).

PubMed

Pouget, Enrique R; Bennett, Alex S; Elliott, Luther; Wolfson-Stofko, Brett; Almeñana, Ramona; Britton, Peter C; Rosenblum, Andrew

2017-01-01

Drug overdose has emerged as the leading cause of injury-related death in the United States, driven by prescription opioid (PO) misuse, polysubstance use, and use of heroin. To better understand opioid-related overdose risks that may change over time and across populations, there is a need for a more comprehensive assessment of related risk behaviors. Drawing on existing research, formative interviews, and discussions with community and scientific advisors an opioid-related Overdose Risk Behavior Scale (ORBS) was developed. Military veterans reporting any use of heroin or POs in the past month were enrolled using venue-based and chain referral recruitment. The final scale consisted of 25 items grouped into 5 subscales eliciting the number of days in the past 30 during which the participant engaged in each behavior. Internal reliability, test-retest reliability and criterion validity were assessed using Cronbach's alpha, intraclass correlations (ICC) and Pearson's correlations with indicators of having overdosed during the past 30 days, respectivelyInternal reliability, test-retest reliability and criterion validity were assessed using Cronbach's alpha, intraclass correlations (ICC) and Pearson's correlations with indicators of having overdosed during the past 30 days, respectively. Data for 220 veterans were analyzed. The 5 subscales-(A) Adherence to Opioid Dosage and Therapeutic Purposes; (B) Alternative Methods of Opioid Administration; (C) Solitary Opioid Use; (D) Use of Nonprescribed Overdose-associated Drugs; and (E) Concurrent Use of POs, Other Psychoactive Drugs and Alcohol-generally showed good internal reliability (alpha range = 0.61 to 0.88), test-retest reliability (ICC range = 0.81 to 0.90), and criterion validity (r range = 0.22 to 0.66). The subscales were internally consistent with each other (alpha = 0.84). The scale mean had an ICC value of 0.99, and correlations with validators ranged from 0.44 to 0.56. These results constitute preliminary evidence for the reliability and validity of the new scale. If further validated, it could help improve overdose prevention and response research and could help improve the precision of overdose education and prevention efforts.
CHAT: development and validation of a computer-delivered, self-report, substance use assessment for adolescents.

PubMed

Lord, Sarah E; Trudeau, Kimberlee J; Black, Ryan A; Lorin, Lucy; Cooney, Elizabeth; Villapiano, Albert; Butler, Stephen F

2011-01-01

The current study was conducted to construct and validate a computer-delivered, multimedia, substance use self-assessment for adolescents. Reliability and validity of six problem dimensions were evaluated in two studies, conducted from 2003 to 2008. Study 1 included 192 adolescents from five treatment settings throughout the United States (N = 142) and two high schools from Greater Boston, Massachusetts (N = 50). Study 2 included 356 adolescents (treatment: N = 260; school: N = 94). The final version of Comprehensive Health Assessment for Teens (CHAT) demonstrated relatively strong psychometric properties. The limitations and implications of this study are noted. This study was supported by an SBIR grant.
Translation, Cross-cultural Adaptation and Validation of the Farsi Version of NIH Task Force's Recommended Multidimensional Minimal Dataset for Research on Chronic Low Back Pain.

PubMed

Noormohammadpour, Pardis; Tavana, Bahareh; Mansournia, Mohammad Ali; Zeinalizadeh, Mehdi; Mirzashahi, Babak; Rostami, Mohsen; Kordi, Ramin

2018-05-01

Translation and cultural adaptation of the National Institutes of Health (NIH) Task Force's minimal dataset. The purpose of this study was to evaluate validity and reliability of the Farsi version of NIH Task Force's recommended multidimensional minimal dataset for research on chronic low back pain (CLBP). Considering the high treatment cost of CLBP and its increasing prevalence, NIH Pain Consortium developed research standards (including recommendations for definitions, a minimum dataset, and outcomes' report) for studies regarding CLBP. Application of these recommendations could standardize research and improve comparability of different studies in CLBP. This study has three phases: translation of dataset into Farsi and its cultural adaptation, assessment of pre-final version of dataset's comprehensibility via a pilot study, and investigation of the reliability and validity of final version of translated dataset. Subjects were 250 patients with CLBP. Test-retest reliability, content validity, and convergent validity (correlations among different dimensions of dataset and Farsi versions of Oswestry Disability Index, Roland Morris Disability Questionnaire, Fear-Avoidance Belief Questionnaire, and Beck Depression Inventory-II) were assessed. The Farsi version demonstrated good/excellent convergent validity (the correlation coefficient between impact dimension and ODI was r = 0.75 [P < 0.001], between impact dimension and Roland-Morris Disability Questionnaire was r = 0.80 [P < 0.001], and between psychological dimension and BDI was r = 0.62 [P < 0.001]). The test-retest reliability was also strong (intraclass correlation coefficient value ranged between 0.70 and 0.95) and the internal consistency was good/excellent (Chronbach's alpha coefficients' value for two main dimensions including impact dimension and psychological dimension were 0.91 and 0.82 [P < 0.001], respectively). In addition, its face validity and content validity were acceptable. The Farsi version of minimal dataset for research on CLBP is a reliable and valid instrument for data gathering in patients with CLBP. This minimum dataset can be a step toward standardization of research regarding CLBP. 3.
Reynolds Adolescent Depression Scale - Second Edition: initial validation of the Korean version.

PubMed

Hyun, Myung-Sun; Nam, Kyoung-A; Kang, Hee Sun; Reynolds, William M

2009-03-01

This paper is a report of a study conducted to test the validity and reliability of the Reynolds Adolescent Depression Scale - Second Edition in Korean culture. Depression is a significant mental health problem in adolescents. The Reynolds Adolescent Depression Scale - Second Edition has been shown to be a useful tool to assess depression in adolescents, with extensive research on this measure having been conducted in western cultures. Measures developed in western cultures need to be tested and validated before being used in Asian cultures. The participants were a convenience sample of 440 Korean adolescents with a mean age of 13.78 years (sd = 0.95) from grades 7 to 9 in three public middle schools in South Korea. A cross-sectional design was used. Back-translation was used to create the Korean version, with additional testing for cultural meaning and comprehension. The data were collected at the end of 2004. Internal consistency reliability for the Korean version of the Reynolds Adolescent Depression Scale - Second Edition was 0.89, with subscale reliability ranging from 0.66 to 0.81. Evidence for criterion-related, convergent and discriminant validity for the Korean version of the Reynolds Adolescent Depression Scale - Second Edition was found. Confirmatory factor analysis supported the 4-factor structure of Reynolds Adolescent Depression Scale - Second Edition. Our results support the validity and reliability for the Korean version of the Reynolds Adolescent Depression Scale - Second Edition as a measure of depression and suggest that it can be used to screen students and to evaluate the effectiveness of preventive interventions in school settings.
Designing and psychoanalysis: A comprehensive questionnaire on coping with domestic violence against women in Iranian society.

PubMed

Mohhamadian, Zeinab; Mohtashami, Jamileh; Rohani, Camelia; Jamshidi, Tayebeh

2018-01-01

Domestic violence is the third sociopathology after addiction and child abuse in Iran. Fifty-six percent of Iranian women in the range of 17-32 years old are exposed to the highest domestic violence. Objective: The aim of this study was to design and psychoanalyze a comprehensive questionnaire on coping with domestic violence against women in Iranian society. This study was carried out on a random sample of women exposed to domestic violence and referred to the health and care center of Shahid Beheshti University of Medical Sciences in Tehran, and Forensic Medical Centers in Urmia city (Iran), in 2017. Two hundred questionnaires were distributed among the participants. One hundred sixty-eight questionnaires were returned to the researchers for data analysis. Eight of those were excluded from the analysis because of incompleteness. Finally, exploratory factor analysis was performed. After reviewing the literature, a questionnaire with 32 items was developed. Content validity ratio (0.95) and content validity index (0.97) were obtained. The results of exploratory factor analysis indicated that the questionnaire explained 69.34% of the data variance. Cronbach's alpha coefficient, and test-retest methods were used for determining the reliability and the obtained value, which were 0.82 and 0.81, respectively. Validity and reliability of the questionnaire with 32 items were confirmed. The tool can be utilized to measure how women cope with domestic violence.
Caregiving demands in parents of children with cancer: psychometric validation of the Care of My Child with Cancer questionnaire.

PubMed

Klassen, Anne; Klaassen, Robert J; Dix, David; Pritchard, Sheila; Yanofsky, Rochelle; Sung, Lillian

2010-08-01

A comprehensive evaluation of the psychometric properties of Care of My Child With Cancer (CMCC) was performed in a sample of 411 parents of children undergoing treatment of cancer at five Canadian pediatric oncology centers. Psychometric tests used to assess data quality, targeting, reliability, and construct validity demonstrated that the CMCC is a scientific sound measure. The CMCC will be helpful for assessing increasing parental responsibility for caregiving tasks associated with cancer care. Copyright 2010 Elsevier Inc. All rights reserved.
Reliability and validity of a Chinese version of the Multidimensional Fatigue Symptom Inventory-Short Form (MFSI-SF-C).

PubMed

Pien, Li-Chung; Chu, Hsin; Chen, Wen-Chun; Chang, Yu-Shiun; Liao, Yuan-Mei; Chen, Chiung-Hua; Chou, Kuei-Ru

2011-08-01

To examine the psychometric properties of the Chinese version of the Multidimensional Fatigue Symptom Inventory-Short Form (MFSI-SF-C) for use in Chinese-speaking countries. The assessment of fatigue is a challenging task for most researchers because culture may influence perceptions of meaning of fatigue. The lack of examination of the psychometric properties of the fatigue measures across studies limits the scientific rigour for generating additional research on the concept of 'fatigue.' A cross-sectional study. The study recruited 107 cancer inpatients from two medical centres in Taiwan. The MFSI-SF-C was examined using a two step process: (1) Translation and back-translation of the instrument; and (2) Examination of internal consistency reliability, test-retest reliability, content validity and construct validity. The results showed that the Cronbach's α of MFSI-SF-C total scale and subscales ranged between 0·83-0·92. The content validity index was 0·93. The difference between the fatigue of cancer patients and the comparison group of healthy people in the community was significant. The results demonstrated good convergent validity when comparing fatigue with depression and quality of life. Factor analysis confirmed the four dimensions of fatigue: physical, emotional, mental and vigour. It showed moderate intercorrelation between subscales and high factor loadings also helped to clarify the psychometric meaning. The reliability and validity information presented in this article support the use of the Chinese version of the MFSI-SF as a research instrument for measuring fatigue in Chinese populations. This study also provides evidence that the MFSI-SF possesses robust psychometric properties. The MFSI-SF-C is an effective and comprehensive tool for measuring fatigue in Chinese patients with cancer. © 2011 Blackwell Publishing Ltd.
Validation of a Comprehensive Early Childhood Allergy Questionnaire.

PubMed

Minasyan, Anna; Babajanyan, Arman; Campbell, Dianne E; Nanan, Ralph

2015-09-01

Parental questionnaires to assess incidence of pediatric allergic disease have been validated for use in school-aged children. Currently, there is no validated questionnaire-based assessment of food allergy, atopic dermatitis (AD), and asthma for infants and young children. The Comprehensive Early Childhood Allergy Questionnaire was designed for detecting AD, asthma, and IgE-mediated food allergies in children aged 1-5 years. A nested case-control design was applied. Parents of 150 children attending pediatric outpatient clinics completed the questionnaire before being clinically assessed by a pediatrician for allergies. Sensitivity, specificity, and reproducibility of the questionnaire were assessed. Seventy-seven children were diagnosed with one or more current allergic diseases. The questionnaire demonstrated high overall sensitivity of 0.93 (95% CI 0.86-0.98) with a specificity of 0.79 (95% CI 0.68-0.88). Questionnaire reproducibility was good with a kappa agreement rate for symptom-related questions of 0.45-0.90. Comprehensive Early Childhood Allergy Questionnaire accurately and reliably reflects the presence of allergies in children aged 1-5 years. Its use is warranted as a tool for determining prevalence of allergies in this pediatric age group. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
The Dutch Review Process for Evaluating the Quality of Psychological Tests: History, Procedure, and Results

ERIC Educational Resources Information Center

Evers, Arne; Sijtsma, Klaas; Lucassen, Wouter; Meijer, Rob R.

2010-01-01

This article describes the 2009 revision of the Dutch Rating System for Test Quality and presents the results of test ratings from almost 30 years. The rating system evaluates the quality of a test on seven criteria: theoretical basis, quality of the testing materials, comprehensiveness of the manual, norms, reliability, construct validity, and…
Reliability and Validity of the Matson Evaluation of Social Skills with Youngsters

ERIC Educational Resources Information Center

Matson, Johnny L.; Neal, Daniene; Fodstad, Jill C.; Hess, Julie A.; Mahan, Sara; Rivet, Tessa T.

2010-01-01

Social skills are an important part of development, and deficits in this area have long-term impacts on a child. As a result, clinicians should include a measure of social skills as part of a comprehensive assessment. There are a few well-researched measures of social skills that are currently used, including the Matson Evaluation of Social Skills…
Psychometric Properties of the Five Facet Mindfulness Questionnaire in Depressed Adults and Development of a Short Form

ERIC Educational Resources Information Center

Bohlmeijer, Ernst; ten Klooster, Peter M.; Fledderus, Martine; Veehof, Martine; Baer, Ruth

2011-01-01

In recent years, there has been a growing interest in therapies that include the learning of mindfulness skills. The 39-item Five Facet Mindfulness Questionnaire (FFMQ) has been developed as a reliable and valid comprehensive instrument for assessing different aspects of mindfulness in community and student samples. In this study, the psychometric…
Development and Psychometric Properties of an Assessment for Persons with Intellectual Disability--The InterRAI ID

ERIC Educational Resources Information Center

Martin, Lynn; Hirdes, John P.; Fries, Brant E.; Smith, Trevor F.

2007-01-01

This paper describes the development of the interRAI-Intellectual Disability (interRAI ID), a comprehensive instrument that assesses all key domains of interest to service providers relative to a person with an intellectual disability (ID). The authors report on the reliability and validity of embedded scales for cognition, self-care, aggression,…
Accelerometer-based measures in physical activity surveillance: current practices and issues.

PubMed

Pedišić, Željko; Bauman, Adrian

2015-02-01

Self-reports of physical activity (PA) have been the mainstay of measurement in most non-communicable disease (NCD) surveillance systems. To these, other measures are added to summate to a comprehensive PA surveillance system. Recently, some national NCD surveillance systems have started using accelerometers as a measure of PA. The purpose of this paper was specifically to appraise the suitability and role of accelerometers for population-level PA surveillance. A thorough literature search was conducted to examine aspects of the generalisability, reliability, validity, comprehensiveness and between-study comparability of accelerometer estimates, and to gauge the simplicity, cost-effectiveness, adaptability and sustainability of their use in NCD surveillance. Accelerometer data collected in PA surveillance systems may not provide estimates that are generalisable to the target population. Accelerometer-based estimates have adequate reliability for PA surveillance, but there are still several issues associated with their validity. Accelerometer-based prevalence estimates are largely dependent on the investigators' choice of intensity cut-off points. Maintaining standardised accelerometer data collections in long-term PA surveillance systems is difficult, which may cause discontinuity in time-trend data. The use of accelerometers does not necessarily produce useful between-study and international comparisons due to lack of standardisation of data collection and processing methods. To conclude, it appears that accelerometers still have limitations regarding generalisability, validity, comprehensiveness, simplicity, affordability, adaptability, between-study comparability and sustainability. Therefore, given the current evidence, it seems that the widespread adoption of accelerometers specifically for large-scale PA surveillance systems may be premature. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Individualized quality of life in patients with low back pain: reliability and validity of the Patient Generated Index.

PubMed

Løchting, Ida; Grotle, Margreth; Storheim, Kjersti; Werner, Erik L; Garratt, Andrew M

2014-09-01

To evaluate the reliability and validity of the improved version of the Patient Generated Index (PGI) in patients with low back pain. The PGI was administered to 90 patients attending care in 1 of 6 institutions in Norway and evaluated for reliability and validity. The questionnaire was given out to 61 patients for re-test purposes. The PGI was completed correctly by 80 (88.9%) patients and, of the 61 patients responding to the re-test, 50 (82.0%) completed both surveys correctly. PGI scores were approximately normally distributed, with a median of 40 (range 80), where 100 is the best possible quality of life. There were no floor or ceiling effects. The 5 most frequently listed areas affecting quality of life were pain, sleep, stiffness, socializing and housework. The test-retest intraclass correlation coefficient was 0.73. The smallest detectable changes for individual and group purposes were 32.8 and 4.6, respectively. The correlations between PGI scores and other instrument scores followed a priori hypotheses of low to moderate correlations. The PGI has evidence for reliability and validity in Norwegian patients with low back pain at the group level and may be considered for application in intervention studies when a comprehensive evaluation of quality of life is important. However, the smallest detectable change, of approximately 30 points, may be considered too large for individual purposes in clinical applications.
Advances in Sprint Acceleration Profiling for Field-Based Team-Sport Athletes: Utility, Reliability, Validity and Limitations.

PubMed

Simperingham, Kim D; Cronin, John B; Ross, Angus

2016-11-01

Advanced testing technologies enable insight into the kinematic and kinetic determinants of sprint acceleration performance, which is particularly important for field-based team-sport athletes. Establishing the reliability and validity of the data, particularly from the acceleration phase, is important for determining the utility of the respective technologies. The aim of this systematic review was to explain the utility, reliability, validity and limitations of (1) radar and laser technology, and (2) non-motorised treadmill (NMT) and torque treadmill (TT) technology for providing kinematic and kinetic measures of sprint acceleration performance. A comprehensive search of the CINAHL Plus, MEDLINE (EBSCO), PubMed, SPORTDiscus, and Web of Science databases was conducted using search terms that included radar, laser, non-motorised treadmill, torque treadmill, sprint, acceleration, kinetic, kinematic, force, and power. Studies examining the kinematics or kinetics of short (≤10 s), maximal-effort sprint acceleration in adults or children, which included an assessment of reliability or validity of the advanced technologies of interest, were included in this systematic review. Absolute reliability, relative reliability and validity data were extracted from the selected articles and tabulated. The level of acceptance of reliability was a coefficient of variation (CV) ≤10 % and an intraclass correlation coefficient (ICC) or correlation coefficient (r) ≥0.70. A total of 34 studies met the inclusion criteria and were included in the qualitative analysis. Generally acceptable validity (r = 0.87-0.99; absolute bias 3-7 %), intraday reliability (CV ≤9.5 %; ICC/r ≥0.84) and interday reliability (ICC ≥0.72) were reported for data from radar and laser. However, low intraday reliability was reported for the theoretical maximum horizontal force (ICC 0.64) within adolescent athletes, and low validity was reported for velocity during the initial 5 m of a sprint acceleration (bias up to 0.41 m/s) measured with a laser device. Acceptable reliability of results from NMT and TT was only ensured when testing protocols involved sufficient familiarisation, a high sampling rate (≥200 Hz), a 'blocked' start position, and the analysis of discrete steps rather than arbitrary time periods. Sprinting times and speeds were 20-28 % slower on a TT, 28-67 % slower on an NMT, and only 9-64 % of the variance in overground measurements of speed and time (≤30 m) was explained by results from an NMT. There have been no reports to date of criterion validity of kinetic measures of sprint acceleration performance on NMT andTT, and only limited results regarding acceptable concurrent validity of radar-derived kinetic data. Radar, laser, NMT and TT technologies can be used to reliably measure sprint acceleration performance and to provide insight into the determinants of sprinting speed. However, further research is required to establish the validity of the kinetic measurements made with NMT and TT. Radar and laser technology may not be suitable for measuring the first few steps of a sprint acceleration.
Assessing the reading comprehension of adults with learning disabilities.

PubMed

Jones, F W; Long, K; Finlay, W M L

2006-06-01

This study's aim was to begin the process of measuring the reading comprehension of adults with mild and borderline learning disabilities, in order to generate information to help clinicians and other professionals to make written material for adults with learning disabilities more comprehensible. The Test for the Reception of Grammar (TROG), with items presented visually rather than orally, and the Reading Comprehension sub-test of the Wechsler Objective Reading Dimensions (WORD) battery were given to 24 service-users of a metropolitan community learning disability team who had an estimated IQ in the range 50-79. These tests were demonstrated to have satisfactory split-half reliability and convergent validity with this population, supporting both their use in this study and in clinical work. Data are presented concerning the distribution across the sample of reading-ages and the comprehension of written grammatical constructions. These data should be useful to those who are preparing written material for adults with learning disabilities.
The Spiritual Needs Assessment for Patients (SNAP): development and validation of a comprehensive instrument to assess unmet spiritual needs.

PubMed

Sharma, Rashmi K; Astrow, Alan B; Texeira, Kenneth; Sulmasy, Daniel P

2012-07-01

Unmet spiritual needs have been associated with decreased patient ratings of quality of care, satisfaction, and quality of life. There is a need for a well-validated, psychometrically sound instrument to describe and measure spiritual needs. To develop a valid and reliable instrument to assess patients' spiritual needs. Instrument development was based on a literature review, clinical and pastoral evaluation, and cognitive pretesting (n=15 ambulatory cancer patients). Forty-seven ambulatory cancer patients completed cross-sectional and longitudinal surveys to test instrument validity and reliability. Internal reliability was assessed by Cronbach's α, test-retest reliability by Spearman's correlation coefficients, and construct validity by comparing instrument scores to a previously used single-item spiritual needs question. The Spiritual Needs Assessment for Patients (SNAP) comprises a total of 23 items in three domains: psychosocial (n=5), spiritual (n=13), and religious (n=5). Sixty percent of participants were white, 21% black, 13% Hispanic, and 6% Asian or other. Fifty-eight percent were Catholic, 13% Jewish, 11% Protestant, 2% Buddhist, 2% Muslim, and 2% Hindu. Sixty-eight percent described themselves as spiritual but not religious; 15% reported unmet spiritual needs; 19% wanted help meeting their spiritual needs. Cronbach's α for the total SNAP was 0.95, and for the subscales was psychosocial=0.74, spiritual=0.93, and religious needs=0.86. Test-retest correlation coefficients were total SNAP=0.69, psychosocial needs=0.51, spiritual needs=0.70, and religious needs=0.65. Participants reporting unmet spiritual needs had significantly higher mean scores on the total SNAP (66.3 vs. 49.4, P=0.03) and on the spiritual needs subscale (39.0 vs. 28.3, P=0.02). The results provide preliminary evidence that the SNAP is a valid and reliable instrument for measuring spiritual needs in a diverse patient population. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
Measuring leader perceptions of school readiness for reforms: use of an iterative model combining classical and Rasch methods.

PubMed

Chatterji, Madhabi

2002-01-01

This study examines validity of data generated by the School Readiness for Reforms: Leader Questionnaire (SRR-LQ) using an iterative procedure that combines classical and Rasch rating scale analysis. Following content-validation and pilot-testing, principal axis factor extraction and promax rotation of factors yielded a five factor structure consistent with the content-validated subscales of the original instrument. Factors were identified based on inspection of pattern and structure coefficients. The rotated factor pattern, inter-factor correlations, convergent validity coefficients, and Cronbach's alpha reliability estimates supported the hypothesized construct properties. To further examine unidimensionality and efficacy of the rating scale structures, item-level data from each factor-defined subscale were subjected to analysis with the Rasch rating scale model. Data-to-model fit statistics and separation reliability for items and persons met acceptable criteria. Rating scale results suggested consistency of expected and observed step difficulties in rating categories, and correspondence of step calibrations with increases in the underlying variables. The combined approach yielded more comprehensive diagnostic information on the quality of the five SRR-LQ subscales; further research is continuing.
Validity and reliability of a new ankle dorsiflexion measurement device.

PubMed

Gatt, Alfred; Chockalingam, Nachiappan

2013-08-01

The assessment of the maximum ankle dorsiflexion angle is an important clinical examination procedure. Evidence shows that the traditional goniometer is highly unreliable, and various designs of goniometers to measure the maximum ankle dorsiflexion angle rely on the application of a known force to obtain reliable results. Hence, an innovative ankle dorsiflexion measurement device was designed to make this measurement more reliable by holding the foot in a selected posture without the application of a known moment. To report on the comprehensive validity and reliability testing carried out on the new device. Following validity testing, four different trials to test reliability of the ankle dorsiflexion measurement device were performed. These trials included inter-rater and intra-rater testings with a controlled moment, intra-rater reliability testing with knees flexed and extended without a controlled moment, intra-rater testing with a patient population, and inter-rater reliability testing between four raters of varying experience without controlling moment. All raters were blinded. A series of trials to test intra-rater and inter-rater reliabilities. Intra-rater reliability intraclass correlation coefficient was 0.98 and inter-rater reliability intraclass correlation coefficient (2,1) was 0.953 with a controlled moment. With uncontrolled moment, very high reliability for intra-tester was also achieved (intraclass correlation coefficient = 0.94 with knees extended and intraclass correlation coefficient = 0.95 with knees flexed). For the trial investigating test-retest reliability with actual patients, intraclass correlation coefficient of 0.99 was obtained. In the trial investigating four different raters with uncontrolled moment, intraclass correlation coefficient of 0.91 was achieved. The new ankle dorsiflexion measurement device is a valid and reliable device for measuring ankle dorsiflexion in both healthy subjects and patients, with both controlled and uncontrolled moments, even by multiple raters of varying experience when the foot is dorsiflexed to its end of range of motion. An ankle dorsiflexion measuring device has been designed to increase the reliability of ankle dorsiflexion measurement and replace the traditional goniometer. While the majority of similar devices rely on application of a known moment to perform this measurement, it has been shown that this is not required with the new ankle dorsiflexion measurement device and, rather, foot posture should be taken into consideration as this affects the maximum ankle dorsiflexion angle.

Effectiveness of ethics education as perceived by nursing students: development and testing of a novel assessment instrument.

PubMed

Vynckier, Tine; Gastmans, Chris; Cannaerts, Nancy; de Casterlé, Bernadette Dierckx

2015-05-01

The effectiveness of ethics education continues to be disputed. No studies exist on how nursing students perceive the effectiveness of nursing ethics education in Flanders, Belgium. To develop a valid and reliable instrument, named the 'Students' Perceived Effectiveness of Ethics Education Scale' (SPEEES), to measure students' perceptions of the effectiveness of ethics education, and to conduct a pilot study in Flemish nursing students to investigate the perceived efficacy of nursing ethics education in Flanders. Content validity, comprehensibility and usability of the SPEEES were assessed. Reliability was assessed by means of a quantitative descriptive non-experimental pilot study. 86 third-year baccalaureate nursing students of two purposefully selected university colleges answered the SPEEES. Formal approval was given by the ethics committee. Informed consent was obtained and anonymity was ensured for both colleges and their participating students. The scale content validity index/Ave scores for the subscales were 1.00, 1.00 and 0.86. The comprehensibility and user-friendliness were favourable. Cronbach's alpha was 0.94 for general effectiveness, 0.89 for teaching methods and 0.85 for ethical content. Students perceived 'case study', 'lecture' and 'instructional dialogue' to be effective teaching methods and 'general ethical concepts' to contain effective content. 'Reflecting critically on their own values' was mentioned as the only ethical competence that, was promoted by the ethics courses. The study revealed rather large differences between both schools in students' perceptions of the contribution of ethics education to other ethical competences. The study revealed that according to the students, ethics courses failed to meet some basic objectives of ethics education. Although the SPEEES proved to be a valid and reliable measure, the pilot study suggests that there is still space for improvement and a need for larger scale research. Additional insights will enable educators to improve current nursing ethics education. © The Author(s) 2014.
Advancing implementation science through measure development and evaluation: a study protocol.

PubMed

Lewis, Cara C; Weiner, Bryan J; Stanick, Cameo; Fischer, Sarah M

2015-07-22

Significant gaps related to measurement issues are among the most critical barriers to advancing implementation science. Three issues motivated the study aims: (a) the lack of stakeholder involvement in defining pragmatic measure qualities; (b) the dearth of measures, particularly for implementation outcomes; and (c) unknown psychometric and pragmatic strength of existing measures. Aim 1: Establish a stakeholder-driven operationalization of pragmatic measures and develop reliable, valid rating criteria for assessing the construct. Aim 2: Develop reliable, valid, and pragmatic measures of three critical implementation outcomes, acceptability, appropriateness, and feasibility. Aim 3: Identify Consolidated Framework for Implementation Research and Implementation Outcome Framework-linked measures that demonstrate both psychometric and pragmatic strength. For Aim 1, we will conduct (a) interviews with stakeholder panelists (N = 7) and complete a literature review to populate pragmatic measure construct criteria, (b) Q-sort activities (N = 20) to clarify the internal structure of the definition, (c) Delphi activities (N = 20) to achieve consensus on the dimension priorities, (d) test-retest and inter-rater reliability assessments of the emergent rating system, and (e) known-groups validity testing of the top three prioritized pragmatic criteria. For Aim 2, our systematic development process involves domain delineation, item generation, substantive validity assessment, structural validity assessment, reliability assessment, and predictive validity assessment. We will also assess discriminant validity, known-groups validity, structural invariance, sensitivity to change, and other pragmatic features. For Aim 3, we will refine our established evidence-based assessment (EBA) criteria, extract the relevant data from the literature, rate each measure using the EBA criteria, and summarize the data. The study outputs of each aim are expected to have a positive impact as they will establish and guide a comprehensive measurement-focused research agenda for implementation science and provide empirically supported measures, tools, and methods for accomplishing this work.
[Some critical remarks on standardised assessment instruments in nursing].

PubMed

Bartholomeyczik, Sabine

2007-08-01

The use of standardised instruments in nursing has rapidly grown and can be seen as a symptom of the necessary comprehensive nursing diagnostics. However, these instruments comprise the risk of misuse, if they are not critically evaluated. Published statements about tests of reliability and validity of an instrument are insufficient. First, the critical evaluation has to ask for the instrument's theoretical and content base: Is the instrument relevant for nursing, suitable for practice and leading to nursing actions? Two examples of well known instruments and different kinds of their utilization in nursing are discussed. Next, the instruments have to be questioned as "bodies with numbers". Studies on reliability and validity have to be as carefully evaluated as other empirical research. The sample, the suitability of agreement indicators (interraterreliability), kind and reason of tests have to be questioned. The same has to be done with tests of validity which comprise an even greater challenge. Methodological studies about these questions are missing; guidelines for test user qualifications need to be developed.
Validation of the Parental Facilitation of Mastery Scale-II.

PubMed

Zalta, Alyson K; Allred, Kelly M; Jayawickreme, Eranda; Blackie, Laura E R; Chambless, Dianne L

2017-10-01

To develop a more reliable and comprehensive version of the Parental Facilitation of Mastery Scale (PFMS) METHOD: In Study 1, 387 undergraduates completed an expanded PFMS (PFMS-II) and measures of parenting, perceived control, responses to early life challenges, and psychopathology. In Study 2, 182 trauma-exposed community participants completed the PFMS-II and measures of perceived control, psychopathology, and well-being RESULTS: In Study 1, exploratory factor analysis of the PFMS-II revealed two factors. These factors replicated in Study 2; one item was removed to achieve measurement invariance across race. The final PFMS-II comprised a 10-item overprotection scale and a 7-item challenge scale. In both samples, this measure demonstrated good convergent and discriminant validity and was more reliable than the original PFMS. Parental challenge was a unique predictor of perceived control in both samples CONCLUSION: The PFMS-II is a valid measure of important parenting behaviors not fully captured in other measures. © 2016 Wiley Periodicals, Inc.
The military social health index: a partial multicultural validation.

PubMed

Van Breda, Adrian D

2008-05-01

Routine military deployments place great stress on military families. Before South African soldiers can be deployed, they undergo a comprehensive health assessment, which includes a social work assessment. The assessment focuses on the resilience of the family system to estimate how well the family will cope when exposed to the stress of deployments. This article reports on the development and validation of a new measuring tool, the Military Social Health Index, or MSHI. The MSHI is made up of four scales, each comprising 14 items, viz social support, problem solving, stressor appraisal, and generalized resistance resources. An initial, large-scale, multicultural validation of the MSHI revealed strong levels of reliability (Cronbach a and standard error of measurement) and validity (factorial, construct, convergent, and discriminant).
Measuring Graph Comprehension, Critique, and Construction in Science

NASA Astrophysics Data System (ADS)

Lai, Kevin; Cabrera, Julio; Vitale, Jonathan M.; Madhok, Jacquie; Tinker, Robert; Linn, Marcia C.

2016-08-01

Interpreting and creating graphs plays a critical role in scientific practice. The K-12 Next Generation Science Standards call for students to use graphs for scientific modeling, reasoning, and communication. To measure progress on this dimension, we need valid and reliable measures of graph understanding in science. In this research, we designed items to measure graph comprehension, critique, and construction and developed scoring rubrics based on the knowledge integration (KI) framework. We administered the items to over 460 middle school students. We found that the items formed a coherent scale and had good reliability using both item response theory and classical test theory. The KI scoring rubric showed that most students had difficulty linking graphs features to science concepts, especially when asked to critique or construct graphs. In addition, students with limited access to computers as well as those who speak a language other than English at home have less integrated understanding than others. These findings point to the need to increase the integration of graphing into science instruction. The results suggest directions for further research leading to comprehensive assessments of graph understanding.
The Validity and Reliability Work of the Scale That Determines the Level of the Trauma after the Earthquake

ERIC Educational Resources Information Center

Tanhan, Fuat; Kayri, Murat

2013-01-01

In this study, it was aimed to develop a short, comprehensible, easy, applicable, and appropriate for cultural characteristics scale that can be evaluated in mental traumas concerning earthquake. The universe of the research consisted of all individuals living under the effects of the earthquakes which occurred in Tabanli Village on 23.10.2011 and…
Reliability and validity of the faculty evaluation instrument used at King Saud bin Abdulaziz University for Health Sciences: Results from the Haematology Course.

PubMed

Al-Eidan, Fahad; Baig, Lubna Ansari; Magzoub, Mohi-Eldin; Omair, Aamir

2016-04-01

To assess reliability and validity of evaluation tool using Haematology course as an example. The cross-sectional study was conducted at King Saud Bin Abdul Aziz University of Health Sciences, Riyadh, Saudi Arabia, in 2012, while data analysis was completed in 2013. The 27-item block evaluation instrument was developed by a multidisciplinary faculty after a comprehensive literature review. Validity of the questionnaire was confirmed using principal component analysis with varimax rotation and Kaiser normalisation. Identified factors were combined to get the internal consistency reliability of each factor. Student's t-test was used to compare mean ratings between male and female students for the faculty and block evaluation. Of the 116 subjects in the study, 80(69%) were males and 36(31%) were females. Reliability of the questionnaire was Cronbach's alpha 0.91. Factor analysis yielded a logically coherent 7 factor solution that explained 75% of the variation in the data. The factors were group dynamics in problem-based learning (alpha0.92), block administration (alpha 0.89), quality of objective structured clinical examination (alpha 0.86), block coordination (alpha 0.81), structure of problem-based learning (alpha 0.84), quality of written exam (alpha 0.91), and difficulty of exams (alpha0.41). Female students' opinion on depth of analysis and critical thinking was significantly higher than that of the males (p=0.03). The faculty evaluation tool used was found to be reliable, but its validity, as assessed through factor analysis, has to be interpreted with caution as the responders were less than the minimum required for factor analysis.
The 11-item Medication Adherence Reasons Scale: reliability and factorial validity among patients with hypertension in Malaysian primary healthcare settings

PubMed Central

Shima, Razatul; Farizah, Hairi; Majid, Hazreen Abdul

2015-01-01

INTRODUCTION The aim of this study was to assess the reliability and validity of a modified Malaysian version of the Medication Adherence Reasons Scale (MAR-Scale). METHODS In this cross-sectional study, the 15-item MAR-Scale was administered to 665 patients with hypertension who attended one of the four government primary healthcare clinics in the Hulu Langat and Klang districts of Selangor, Malaysia, between early December 2012 and end-March 2013. The construct validity was examined in two phases. Phase I consisted of translation of the MAR-Scale from English to Malay, a content validity check by an expert panel, a face validity check via a small preliminary test among patients with hypertension, and exploratory factor analysis (EFA). Phase II involved internal consistency reliability calculations and confirmatory factor analysis (CFA). RESULTS EFA verified five existing factors that were previously identified (i.e. issues with medication management, multiple medications, belief in medication, medication availability, and the patient’s forgetfulness and convenience), while CFA extracted four factors (medication availability issues were not extracted). The final modified MAR-Scale model, which had 11 items and a four-factor structure, provided good evidence of convergent and discriminant validities. Cronbach’s alpha coefficient was > 0.7, indicating good internal consistency of the items in the construct. The results suggest that the modified MAR-Scale has good internal consistencies and construct validity. CONCLUSION The validated modified MAR-Scale (Malaysian version) was found to be suitable for use among patients with hypertension receiving treatment in primary healthcare settings. However, the comprehensive measurement of other factors that can also lead to non-adherence requires further exploration. PMID:25902719
The 11-item Medication Adherence Reasons Scale: reliability and factorial validity among patients with hypertension in Malaysian primary healthcare settings.

PubMed

Shima, Razatul; Farizah, Hairi; Majid, Hazreen Abdul

2015-08-01

The aim of this study was to assess the reliability and validity of a modified Malaysian version of the Medication Adherence Reasons Scale (MAR-Scale). In this cross-sectional study, the 15-item MAR-Scale was administered to 665 patients with hypertension who attended one of the four government primary healthcare clinics in the Hulu Langat and Klang districts of Selangor, Malaysia, between early December 2012 and end-March 2013. The construct validity was examined in two phases. Phase I consisted of translation of the MAR-Scale from English to Malay, a content validity check by an expert panel, a face validity check via a small preliminary test among patients with hypertension, and exploratory factor analysis (EFA). Phase II involved internal consistency reliability calculations and confirmatory factor analysis (CFA). EFA verified five existing factors that were previously identified (i.e. issues with medication management, multiple medications, belief in medication, medication availability, and the patient's forgetfulness and convenience), while CFA extracted four factors (medication availability issues were not extracted). The final modified MAR-Scale model, which had 11 items and a four-factor structure, provided good evidence of convergent and discriminant validities. Cronbach's alpha coefficient was > 0.7, indicating good internal consistency of the items in the construct. The results suggest that the modified MAR-Scale has good internal consistencies and construct validity. The validated modified MAR-Scale (Malaysian version) was found to be suitable for use among patients with hypertension receiving treatment in primary healthcare settings. However, the comprehensive measurement of other factors that can also lead to non-adherence requires further exploration.
Cross-cultural adaptation, content validation, and reliability of the Nigerian Composite Lifestyle CVD Risk Factors Questionnaire for adolescents among Yoruba rural adolescents in Nigeria.

PubMed

Odunaiya, Nse A; Louw, Quinette A; Grimmer, Karen

2017-06-01

Assessment of lifestyle risk factors must be culturally- and contextually relevant and available in local languages. This paper reports on a study which aimed to cross culturally adapt a composite lifestyle cardiovascular disease (CVD) risk factors questionnaire into an African language (Yoruba) and testing some of its psychometric properties such as content validity and test retest reliability in comparison to the original English version. This study utilized a cross sectional design. Translation of the English version of the questionnaire into Yoruba was undertaken using the guideline by Beaton et al. The translated instrument was presented to 21 rural adolescents to assess comprehensibility and clarity using a sample of convenience. A test retest reliability was conducted among 150 rural adolescents using a purposive sampling. Data was analyzed using intraclass correlation (ICC ) model 3, Cohen kappa statistics and prevalence rates. ICC ranged between 0.4-0.8. The Yoruba version was completed 15-20 minutes and was reported to be culturally appropriate and acceptable for rural Nigerian adolescents. The Yoruba translation of the Nigerian composite lifestyle risk factors questionnaire performs at least as well as the original English version in terms of content validity and reliability. It took a shorter time to complete therefore may be more relevant to rural adolescents.
Validity and reliability of the Japanese version of the Newest Vital Sign: a preliminary study.

PubMed

Kogure, Takamichi; Sumitani, Masahiko; Suka, Machi; Ishikawa, Hirono; Odajima, Takeshi; Igarashi, Ataru; Kusama, Makiko; Okamoto, Masako; Sugimori, Hiroki; Kawahara, Kazuo

2014-01-01

Health literacy (HL) refers to the ability to obtain, process, and understand basic health information and services, and is thus needed to make appropriate health decisions. The Newest Vital Sign (NVS) is comprised of 6 questions about an ice cream nutrition label and assesses HL numeracy skills. We developed a Japanese version of the NVS (NVS-J) and evaluated the validity and reliability of the NVS-J in patients with chronic pain. The translation of the original NVS into Japanese was achieved as per the published guidelines. An observational study was subsequently performed to evaluate the validity and reliability of the NVS-J in 43 Japanese patients suffering from chronic pain. Factor analysis with promax rotation, using the Kaiser criterion (eigenvalues ≥1.0), and a scree plot revealed that the main component of the NVS-J consists of three determinative factors, and each factor consists of two NVS-J items. The criterion-related validity of the total NVS-J score was significantly correlated with the total score of Ishikawa et al.'s self-rated HL Questionnaire, the clinical global assessment of comprehensive HL level, cognitive function, and the Brinkman index. In addition, Cronbach's coefficient for the total score of the NVS-J was adequate (alpha = 0.72). This study demonstrated that the NVS-J has good validity and reliability. Further, the NVS-J consists of three determinative factors: "basic numeracy ability," "complex numeracy ability," and "serious-minded ability." These three HL abilities comprise a 3-step hierarchical structure. Adequate HL should be promoted in chronic pain patients to enable coping, improve functioning, and increase activities of daily living (ADLs) and quality of life (QOL).
Cross-cultural adaptation and validation of Systemic Lupus Erythematosus Quality of Life questionnaire into Arabic.

PubMed

Aziz, M M; Galal, M A A; Elzohri, M H; El-Nouby, F; Leong, K P

2018-04-01

Systemic lupus erythematosus (SLE) is a chronic autoimmune disease which affects all aspects of quality of life (QoL) of the patients. Comprehensive patient assessment should include QoL measures in addition to the objective clinical measures of the disease. There is no specific Arabic instrument for assessment of QoL of SLE patients. The objective of this study was to translate and cross culturally adapt the SLEQOL questionnaire into Arabic and test its reliability and validity. The SLEQOL questionnaire was translated into Arabic based on the Guidelines for Translation and Cross-cultural Adaptation into other languages. Reliability was assessed by interviewing patients three times: two interviews on the same day by different interviewers and the third interview 14 days later by one of the first interviewers. Validity was assessed by correlating SLEQOL scores of 91 patients with 36-item Short Form Health Survey (SF-36) scores and clinical parameters of the patients. We found that the Arabic version of SLEQOL has a Cronbach's alpha of 0.936, interobserver and intraobserver correlation coefficients of 0.809 and 0.886 respectively. Strong correlations were also found between SLEQOL scores and SF-36 Physical and Mental Component summaries. In conclusion, the Arabic version of SLEQOL is a reliable and valid instrument for measuring QoL of Egyptian SLE patients.
Designing and psychoanalysis: A comprehensive questionnaire on coping with domestic violence against women in Iranian society

PubMed Central

Mohhamadian, Zeinab; Rohani, Camelia; Jamshidi, Tayebeh

2018-01-01

Background Domestic violence is the third sociopathology after addiction and child abuse in Iran. Fifty-six percent of Iranian women in the range of 17–32 years old are exposed to the highest domestic violence. Objective: The aim of this study was to design and psychoanalyze a comprehensive questionnaire on coping with domestic violence against women in Iranian society. Methods This study was carried out on a random sample of women exposed to domestic violence and referred to the health and care center of Shahid Beheshti University of Medical Sciences in Tehran, and Forensic Medical Centers in Urmia city (Iran), in 2017. Two hundred questionnaires were distributed among the participants. One hundred sixty-eight questionnaires were returned to the researchers for data analysis. Eight of those were excluded from the analysis because of incompleteness. Finally, exploratory factor analysis was performed. Results After reviewing the literature, a questionnaire with 32 items was developed. Content validity ratio (0.95) and content validity index (0.97) were obtained. The results of exploratory factor analysis indicated that the questionnaire explained 69.34% of the data variance. Cronbach’s alpha coefficient, and test-retest methods were used for determining the reliability and the obtained value, which were 0.82 and 0.81, respectively. Conclusion Validity and reliability of the questionnaire with 32 items were confirmed. The tool can be utilized to measure how women cope with domestic violence. PMID:29588816
Measurement of sedentary behaviour in population health surveys: a review and recommendations

PubMed Central

LeBlanc, Allana G.; Colley, Rachel C.; Saunders, Travis J.

2017-01-01

Background The purpose of this review was to determine the most valid and reliable questions for targeting key modes of sedentary behaviour (SB) in a broad range of national and international health surveillance surveys. This was done by reviewing the SB modules currently used in population health surveys, as well as examining SB questionnaires that have performed well in psychometric testing. Methods Health surveillance surveys were identified via scoping review and contact with experts in the field. Previous systematic reviews provided psychometric information on pediatric questionnaires. A comprehensive search of four bibliographic databases was used to identify studies reporting psychometric information for adult questionnaires. Only surveys/studies published/used in English or French were included. Results The review identified a total of 16 pediatric and 18 adult national/international surveys assessing SB, few of which have undergone psychometric testing. Fourteen pediatric and 35 adult questionnaires with psychometric information were included. While reliability was generally good to excellent for questions targeting key modes of SB, validity was poor to moderate, and reported much less frequently. The most valid and reliable questions targeting specific modes of SB were combined to create a single questionnaire targeting key modes of SB. Discussion Our results highlight the importance of including SB questions in survey modules that are adaptable, able to assess various modes of SB, and that exhibit adequate reliability and validity. Future research could investigate the psychometric properties of the module we have proposed in this paper, as well as other questionnaires currently used in national and international population health surveys. PMID:29250468
Measurement of sedentary behaviour in population health surveys: a review and recommendations.

PubMed

Prince, Stephanie A; LeBlanc, Allana G; Colley, Rachel C; Saunders, Travis J

2017-01-01

The purpose of this review was to determine the most valid and reliable questions for targeting key modes of sedentary behaviour (SB) in a broad range of national and international health surveillance surveys. This was done by reviewing the SB modules currently used in population health surveys, as well as examining SB questionnaires that have performed well in psychometric testing. Health surveillance surveys were identified via scoping review and contact with experts in the field. Previous systematic reviews provided psychometric information on pediatric questionnaires. A comprehensive search of four bibliographic databases was used to identify studies reporting psychometric information for adult questionnaires. Only surveys/studies published/used in English or French were included. The review identified a total of 16 pediatric and 18 adult national/international surveys assessing SB, few of which have undergone psychometric testing. Fourteen pediatric and 35 adult questionnaires with psychometric information were included. While reliability was generally good to excellent for questions targeting key modes of SB, validity was poor to moderate, and reported much less frequently. The most valid and reliable questions targeting specific modes of SB were combined to create a single questionnaire targeting key modes of SB. Our results highlight the importance of including SB questions in survey modules that are adaptable, able to assess various modes of SB, and that exhibit adequate reliability and validity. Future research could investigate the psychometric properties of the module we have proposed in this paper, as well as other questionnaires currently used in national and international population health surveys.
The validity and reliability of the ADL-Glittre test for children.

PubMed

Martins, Renata; Assumpção, Maíra S de; Bobbio, Tatiana G; Mayer, Anamaria F; Schivinski, Camila

2018-04-16

The ADL-Glittre was created to assess more comprehensively the essential activities of daily living in adults with chronic obstructive pulmonary disease. The aim of this study was to validate the ADL-Glittre test adapted for children (TGlittre-P) and verify its reliability. This is a cross-sectional study with 87 healthy children aged 6 to 14 years (mean 10.36 ± 2.32 years). Biometric and spirometry data were collected from all participants. On the same day, part of the sample (36 children included in the validation process) performed two 6MWT and two TGlittre-P (30-minute interval between them). The other part of the sample just performed two TGlittre-P for the reliability process. Pearson and Spearman correlation tests were used to verify the correlation between the time spent on the TGlittre-P and the distance walked in the 6MWT. The intraclass correlation coefficient (ICC) was also used to assess the reproducibility of the TGlittre-P. The TGlittre-P showed a moderate negative correlation with the 6MWT (r = -0.490; p = 0.002; 95%CI -0.712 to -0.233). However, the behavior of the physiological variables that were monitored during the tests was similar and showed to be reproducible (ICC = 0.843; p = 0.000; 95%CI 0.695 to 0.911). The TGlittre-P proved to be a valid and reliable assessment of the functional capacity of healthy children aged 6 to 14 years.
Reliability of the Dutch-language version of the Communication Function Classification System and its association with language comprehension and method of communication.

PubMed

Vander Zwart, Karlijn E; Geytenbeek, Joke J; de Kleijn, Maaike; Oostrom, Kim J; Gorter, Jan Willem; Hidecker, Mary Jo Cooley; Vermeulen, R Jeroen

2016-02-01

The aims of this study were to determine the intra- and interrater reliability of the Dutch-language version of the Communication Function Classification System (CFCS-NL) and to investigate the association between the CFCS level and (1) spoken language comprehension and (2) preferred method of communication in children with cerebral palsy (CP). Participants were 93 children with CP (50 males, 43 females; mean age 7y, SD 2y 6mo, range 2y 9mo-12y 10mo; unilateral spastic [n=22], bilateral spastic [n=51], dyskinetic [n=15], ataxic [n=3], not specified [n=2]; Gross Motor Function Classification System level I [n=16], II [n=14], III, [n=7], IV [n=24], V [n=31], unknown [n=1]), recruited from rehabilitation centres throughout the Netherlands. Because some centres only contributed to part of the study, different numbers of participants are presented for different aspects of the study. Parents and speech and language therapists (SLTs) classified the communication level using the CFCS. Kappa was used to determine the intra- and interrater reliability. Spearman's correlation coefficient was used to determine the association between CFCS level and spoken language comprehension, and Fisher's exact test was used to examine the association between the CFCS level and method of communication. Interrater reliability of the CFCS-NL between parents and SLTs was fair (r=0.54), between SLTs good (r=0.78), and the intrarater (SLT) reliability very good (r=0.85). The association between the CFCS and spoken language comprehension was strong for SLTs (r=0.63) and moderate for parents (r=0.51). There was a statistically significant difference between the CFCS level and the preferred method of communication of the child (p<0.01). Also, CFCS level classification showed a statistically significant difference between parents and SLTs (p<0.01). These data suggest that the CFCS-NL is a valid and reliable clinical tool to classify everyday communication in children with CP. Preferably, professionals should classify the child's CFCS level in collaboration with the parents to acquire the most comprehensive information about the everyday communication of the child in various situations both with familiar and with unfamiliar partners. © 2015 Mac Keith Press.
Development of a quality instrument for assessing the spontaneous reports of ADR/ADE using Delphi method in China.

PubMed

Chen, Lixun; Jiang, Ling; Shen, Aizong; Wei, Wei

2016-09-01

The frequently low quality of submitted spontaneous reports is of an increasing concern; to our knowledge, no validated instrument exists for assessing case reports' quality comprehensively enough. This work was conducted to develop such a quality instrument for assessing the spontaneous reports of adverse drug reaction (ADR)/adverse drug event (ADE) in China. Initial evaluation indicators were generated using systematic and literature data analysis. Final indicators and their weights were identified using Delphi method. The final quality instrument was developed by adopting the synthetic scoring method. A consensus was reached after four rounds of Delphi survey. The developed quality instrument consisted of 6 first-rank indicators, 18 second-rank indicators, and 115 third-rank indicators, and each rank indicator has been weighted. It evaluates the quality of spontaneous reports of ADR/ADE comprehensively and quantitatively on six parameters: authenticity, duplication, regulatory, completeness, vigilance level, and reporting time frame. The developed instrument was tested with good reliability and validity, which can be used to comprehensively and quantitatively assess the submitted spontaneous reports of ADR/ADE in China.
Reviews of the Comprehensive Nuclear-Test-Ban Treaty and U.S. security

NASA Astrophysics Data System (ADS)

Jeanloz, Raymond

2017-11-01

Reviews of the Comprehensive Nuclear-Test-Ban Treaty (CTBT) by the National Academy of Sciences concluded that the United States has the technical expertise and physical means to i) maintain a safe, secure and reliable nuclear-weapons stockpile without nuclear-explosion testing, and ii) effectively monitor global compliance once the Treaty enters into force. Moreover, the CTBT is judged to help constrain proliferation of nuclear-weapons technology, so it is considered favorable to U.S. security. Review of developments since the studies were published, in 2002 and 2012, show that the study conclusions remain valid and that technical capabilities are better than anticipated.

A highly reliable, high performance open avionics architecture for real time Nap-of-the-Earth operations

NASA Technical Reports Server (NTRS)

Harper, Richard E.; Elks, Carl

1995-01-01

An Army Fault Tolerant Architecture (AFTA) has been developed to meet real-time fault tolerant processing requirements of future Army applications. AFTA is the enabling technology that will allow the Army to configure existing processors and other hardware to provide high throughput and ultrahigh reliability necessary for TF/TA/NOE flight control and other advanced Army applications. A comprehensive conceptual study of AFTA has been completed that addresses a wide range of issues including requirements, architecture, hardware, software, testability, producibility, analytical models, validation and verification, common mode faults, VHDL, and a fault tolerant data bus. A Brassboard AFTA for demonstration and validation has been fabricated, and two operating systems and a flight-critical Army application have been ported to it. Detailed performance measurements have been made of fault tolerance and operating system overheads while AFTA was executing the flight application in the presence of faults.
Development and testing of the Test of Functional Health Literacy in Dentistry (TOFHLiD).

PubMed

Gong, Debra A; Lee, Jessica Y; Rozier, R Gary; Pahel, Bhavna T; Richman, Julia A; Vann, William F

2007-01-01

This study aims to evaluate the reliability and validity of the Test of Functional Health Literacy in Dentistry (TOFHLiD), a new instrument to measure functional oral health literacy. TOFHLiD uses text passages and prompts related to fluoride use and access to care to assess reading comprehension and numerical ability. Parents of pediatric dental patients (n = 102) were administered TOFHLiD, a medical literacy comprehension test (TOFHLA), and two word recognition tests [Rapid Estimate of Adult Literacy in Dentistry (REALD), Rapid Estimate of Adult Literacy in Medicine (REALM)]. This design provided assessments of dental and medical health literacy by all subjects, both measured with two different methods (reading/numeracy ability and word recognition). Construct validity of TOFHLiD was assessed by entering the correlation coefficients for all pairwise comparisons of literacy instruments into a multitrait-multimethod matrix. Internal reliability of TOFHLiD was assessed with Cronbach's alpha. Criterion-related predictive validity was tested by associations between the TOFHLiD scores and the three measures of oral health in multivariate regression analyses. The correlation coefficient for TOFHLiD and REALD-99 scores (monotrait-heteromethod) was high (r = 0.82, P < 0.05). Coefficients between TOFHLiD and TOFHLA (heterotrait-monomethod: r = 0.52) and REALM (heterotrait-heteromethod: r = 0.53) were smaller than coefficients for convergent validity Cronbach's alpha for TOFHLiD was 0.63. TOFHLiD was positively correlated with OHIP-14 (P < 0.05), but not with parent or child oral health. TOFHLA was not related to dental outcomes. TOFHLiD demonstrates good convergent validity but only moderate ability to discriminate between dental and medical health literacy. Its predictive validity is only partially established, and internal consistency just meets the threshold for acceptability. Results provide solid support for more research, but not widespread use in clinical or public health practice.
Beyond reading level: a systematic review of the suitability of cancer education print and Web-based materials.

PubMed

Finnie, Ramona K C; Felder, Tisha M; Linder, Suzanne Kneuper; Mullen, Patricia Dolan

2010-12-01

Consideration of categories related to reading comprehension--beyond reading level--is imperative to reach low literacy populations effectively. "Suitability" has been proposed as a term to encompass six categories of such factors: content, literacy demand graphics, layout/typography, learning stimulation, and cultural appropriateness. Our purpose was to describe instruments used to evaluate categories of suitability in cancer education materials in published reports and their findings. We searched databases and reference lists for evaluations of print and Web-based cancer education materials to identify and describe measures of these categories. Studies had to evaluate reading level and at least one category of suitability. Eleven studies met our criteria. Seven studies reported inter-rater reliability. Cultural appropriateness was most often assessed; four instruments assessed only surface aspects of cultural appropriateness. Only two of seven instruments used, the suitability assessment of materials (SAM) and the comprehensibility assessment of materials (SAM + CAM), were described as having any evidence of validity. Studies using Simplified Measure of Goobledygook (SMOG) and Fry reported higher average reading level scores than those using Flesh-Kincaid. Most materials failed criteria for reading level and cultural appropriateness. We recommend more emphasis on the categories of suitability for those developing cancer education materials and more study of these categories and reliability and validity testing of instruments.
Validation of Malayalam Version of National Comprehensive Cancer Network Distress Thermometer and its Feasibility in Oncology Patients.

PubMed

Biji, M S; Dessai, Sampada; Sindhu, N; Aravind, Sithara; Satheesan, B

2018-01-01

This study was designed to translate and validate the National Comprehensive Cancer Network (NCCN) distress thermometer (DT) in regional language " Malayalam" and to see the feasibility of using it in our patients. (1) To translate and validate the NCCN DT. (2) To study the feasibility of using validated Malayalam translated DT in Malabar Cancer center. This is a single-arm prospective observational study. The study was conducted at author's institution between December 8, 2015, and January 20, 2016 in the Department of Cancer Palliative Medicine. This was a prospective observational study carried out in two phases. In Phase 1, the linguistic validation of the NCCN DT was done. In Phase 2, the feasibility, face validity, and utility of the translated of NCCN DT in accordance with QQ-10 too was done. SPSS version 16 (SPSS Inc. Released 2007. SPSS for Windows, Version 16.0. Chicago, SPSS Inc.) was used for analysis. Ten patients were enrolled in Phase 2. The median age was 51.5 years and 40% of patients were male. All patients had completed at least basic education up to the primary level. The primary site of cancer was heterogeneous. The NCCN DT completion rate was 100%. The face validity, utility, reliability, and feasibility were 100%, 100%, 100%, and 90%, respectively. It can be concluded that the Malayalam validated DT has high face validity, utility, and it is feasible for its use.
[Design and validation of scales to measure adolescent attitude toward eating and toward physical activity].

PubMed

Lima-Serrano, Marta; Lima-Rodríguez, Joaquín Salvador; Sáez-Bueno, Africa

2012-01-01

Different authors suggest that attitude is a mediator in behavior change, so it is a predictor of behavior practice. The main of this study was to design and to validate two scales for measure adolescent attitude toward healthy eating and adolescent attitude toward healthy physical activity. Scales were design based on a literature review. After, they were validated using an on-line Delphi Panel with eighteen experts, a pretest, and a pilot test with a sample of 188 high school students. Comprehensibility, content validity, adequacy, as well as the reliability (alpha of Cronbach test), and construct validity (exploratory factor analysis) of scales were tested. Scales validated by experts were considered appropriate in the pretest. In the pilot test, the ten-item Attitude to Eating Scale obtained α=0.72. The eight-item Attitude to Physical Activity Scale obtained α=0.86. They showed evidence of one-dimensional interpretation after factor analysis, a) all items got weights r>0.30 in first factor before rotations, b) the first factor explained a significant proportion of variance before rotations, and c) the total variance explained by the main factors extracted was greater than 50%. The Scales showed their reliability and validity. They could be employed to assess attitude to these priority intervention areas in Spanish adolescents, and to evaluate this intermediate result of health interventions and health programs.
Teaching Quality in Math Class: The Development of a Scale and the Analysis of Its Relationship with Engagement and Achievement

PubMed Central

Leon, Jaime; Medina-Garrido, Elena; Núñez, Juan L.

2017-01-01

Math achievement and engagement declines in secondary education; therefore, educators are faced with the challenge of engaging students to avoid school failure. Within self-determination theory, we address the need to assess comprehensively student perceptions of teaching quality that predict engagement and achievement. In study one we tested, in a sample of 548 high school students, a preliminary version of a scale to assess nine factors: teaching for relevance, acknowledge negative feelings, participation encouragement, controlling language, optimal challenge, focus on the process, class structure, positive feedback, and caring. In the second study, we analyzed the scale’s reliability and validity in a sample of 1555 high school students. The scale showed evidence of reliability, and with regard to criterion validity, at the classroom level, teaching quality was a predictor of behavioral engagement, and higher grades were observed in classes where students, as a whole, displayed more behavioral engagement. At the within level, behavioral engagement was associated with achievement. We not only provide a reliable and valid method to assess teaching quality, but also a method to design interventions, these could be designed based on the scale items to encourage students to persist and display more engagement on school duties, which in turn bolsters student achievement. PMID:28701964
A measure of early physical functioning (EPF) post-stroke.

PubMed

Finch, Lois E; Higgins, Johanne; Wood-Dauphinee, Sharon; Mayo, Nancy E

2008-07-01

To develop a comprehensive measure of Early Physical Functioning (EPF) post-stroke quantified through Rasch analysis and conceptualized using the International Classification of Functioning Disability and Health (ICF). An observational cohort study. A cohort of 262 subjects (mean age 71.6 (standard deviation 12.5) years) hospitalized post-acute stroke. Functional assessments were made within 3 days of stroke with items from valid and reliable indices commonly utilized to evaluate stroke survivors. Information on important variables was also collected. Principal component and Rasch analysis confirmed the factor structure, and dimensionality of the measure. Rasch analysis combined items across ICF components to develop the measure. Items were deleted iteratively, those retained fit the model and were related to the construct; reliability and validity were assessed. A 38-item unidimensional measure of the EPF met all Rasch model requirements. The item difficulty matched the person ability (mean person measure: -0.31; standard error 0.37 logits), reliability of the person-item-hierarchy was excellent at 0.97. Initial validity was adequate. The 38-item EPF measure was developed. It expands the range of assessment post acute stroke; it covers a broad spectrum of difficulty with good initial psychometric properties that, once revalidated, can assist in planning and evaluating early interventions.
Teaching Quality in Math Class: The Development of a Scale and the Analysis of Its Relationship with Engagement and Achievement.

PubMed

Leon, Jaime; Medina-Garrido, Elena; Núñez, Juan L

2017-01-01

Math achievement and engagement declines in secondary education; therefore, educators are faced with the challenge of engaging students to avoid school failure. Within self-determination theory, we address the need to assess comprehensively student perceptions of teaching quality that predict engagement and achievement. In study one we tested, in a sample of 548 high school students, a preliminary version of a scale to assess nine factors: teaching for relevance, acknowledge negative feelings, participation encouragement, controlling language, optimal challenge, focus on the process, class structure, positive feedback, and caring. In the second study, we analyzed the scale's reliability and validity in a sample of 1555 high school students. The scale showed evidence of reliability, and with regard to criterion validity, at the classroom level, teaching quality was a predictor of behavioral engagement, and higher grades were observed in classes where students, as a whole, displayed more behavioral engagement. At the within level, behavioral engagement was associated with achievement. We not only provide a reliable and valid method to assess teaching quality, but also a method to design interventions, these could be designed based on the scale items to encourage students to persist and display more engagement on school duties, which in turn bolsters student achievement.
Reliability and Validity of the Turkish Version of Abdel-Khalek's Death Anxiety Scale among College Students.

PubMed

Sariçiçek Aydoğan, Aybala; Gülseren, Şeref; Öztürk Sarikaya, Özyıl; Özen, Çiğdem

2015-12-01

Although death anxiety is considered a universal phenomenon, attitudes toward death may vary across populations that differ in terms of religion and culture. Abdel-Khalek's Death Anxiety Scale (ASDA) was developed on the basis of the rationale that there are specific concepts related to death and after death in Muslim populations. This study aims to translate and adapt ASDA in the Turkish population, examine its validity and reliability, and to compare its psychometric properties with the widely used Templer's Death Anxiety Scale (DAS). A total of 220 medical students were included in the study. The Turkish version of ASDA, DAS, and Hospital Anxiety and Depression Scale were used for data collection. Cronbach's alpha coefficients were .86 for ASDA and .66 for DAS. Analysis by principal components with varimax rotation produced five factors for ASDA that explained 65.6% of total variance. ASDA and DAS were highly correlated with each other (r=.68, p<.001). The results of this study indicate that the Turkish version of Abdel-Khalek's Death Anxiety Scale is a reliable and valid instrument. The Turkish version of ASDA revealed better psychometric properties than DAS. This finding may reflect specific cultural and religious attitudes toward death or may result from more comprehensible language use in ASDA.
The psychometric testing of the diabetes health promotion self-care scale.

PubMed

Wang, Ruey-Hsia; Lin, Li-Ying; Cheng, Chung-Ping; Hsu, Min-Tao; Kao, Chia-Chan

2012-06-01

Health-promoting behavior is an important strategy to maintain and enhance health of patients with Type 2 diabetes. Few instruments have been developed to measure health promotion self-care behavior of patients with Type 2 diabetes. Developing and psychometric testing of the Chinese version of the Diabetes Health Promotion Self-Care Scale (DHPSC) for patients with Type 2 diabetes. Four hundred and eighty-nine patients with Type 2 diabetes were recruited from endocrine clinics in four hospitals in Kaohsiung City in southern Taiwan. Exploratory and confirmatory factor analyses were used to assess the construct validity of the scale. Correlations between the DHPSC and the satisfaction subscale of Diabetes Quality of Life, Diabetes Empowerment Scale, and HbA1c were calculated to evaluate concurrent validity. Internal consistency and test-retest reliability were used to assess the reliability of the scale. The study was conducted in 2007 and 2008. A proposed second-order factor model with seven subscales and 26 items fit the data well. The seven subscales were interpersonal relationships, diet, blood glucose self-monitoring, personal health responsibility, exercise, adherence to the recommended regimens, and foot care. The DHPSC statistically significantly correlated with the satisfaction subscale of Diabetes Quality of Life and the Diabetes Empowerment Scale. HbA1c only statistically significantly correlated with the subscale of health responsibility. Reliability was supported by acceptable Cronbach's alpha (range, .78-.94) and test-retest reliability (range, .76-.95). The DHPSC has satisfactory reliability and validity. Healthcare providers can use the DHPSC to comprehensively assess the health promotion self-care behaviors of patients with Type 2 diabetes.
Development and validation of oral health-related early childhood quality of life tool for North Indian preschool children.

PubMed

Mathur, Vijay Prakash; Dhillon, Jatinder Kaur; Logani, Ajay; Agarwal, Ramesh

2014-01-01

The purpose of this study was to develop a reliable instrument [Oral Health related Early Childhood Quality of Life (OH- ECQOL) scale] for measuring oral health related quality of life (OHrQoL) in preschool children in North Indian population. Four pediatric dentists evaluated a pool of 65 items from various QoL questionnaires to assess their relevance to Indian population. These items were discussed with eight independent pediatric dentists and two community dentists who were not a part of this study to assess relevance of these items to preschool age children based on their comprehensiveness and clarity. Based on their responses and feedback a modified pool of items was developed and administered to a convenience sample of 20 parents who rated these items according to their relevance. The test retest reliability was evaluated on another sample of 20 parents of 2-5 year old children. The final questionnaire comprised of 16 items (12 child and 4 family). This was administered to 300 parents of 24-71 months old children divided on the basis of early childhood caries to assess its reliability and validity. OH-ECQOL scores were significantly associated with parental ratings of their child's general and oral health, and the presence of dental disease in the child. Cronbach's alpha was 0.862, and the ICC for test-retest reliability was 0.94. The OH-ECQOL proved reliable and valid tool for assessing the impact of oral disorders on the quality of life of preschool children in Northern India.
A Comprehensive Critique and Review of Published Measures of Acne Severity

PubMed Central

Furber, Gareth; Leach, Matthew; Segal, Leonie

2016-01-01

Objective: Acne vulgaris is a dynamic, complex condition that is notoriously difficult to evaluate. The authors set out to critically evaluate currently available measures of acne severity, particularly in terms of suitability for use in clinical trials. Design: A systematic review was conducted to identify methods used to measure acne severity, using MEDLINE, CINAHL, Scopus, and Wiley Online. Each method was critically reviewed and given a score out of 13 based on eight quality criteria under two broad groupings of psychometric testing and suitability for research and evaluation. Results: Twenty-four methods for assessing acne severity were identified. Four scales received a quality score of zero, and 11 scored ≤3. The highest rated scales achieved a total score of 6. Six scales reported strong inter-rater reliability (ICC>0.75), and four reported strong intra-rater reliability (ICC>0.75). The poor overall performance of most scales, largely characterized by the absence of reliability testing or evidence for independent assessment and validation indicates that generally, their application in clinical trials is not supported. Conclusion: This review and appraisal of instruments for measuring acne severity supports previously identified concerns regarding the quality of published measures. It highlights the need for a valid and reliable acne severity scale, especially for use in research and evaluation. The ideal scale would demonstrate adequate validation and reliability and be easily implemented for third-party analysis. The development of such a scale is critical to interpreting results of trials and facilitating the pooling of results for systematic reviews and meta-analyses. PMID:27672410
Development of an interprofessional lean facilitator assessment scale.

PubMed

Bravo-Sanchez, Cindy; Dorazio, Vincent; Denmark, Robert; Heuer, Albert J; Parrott, J Scott

2018-05-01

High reliability is important for optimising quality and safety in healthcare organisations. Reliability efforts include interprofessional collaborative practice (IPCP) and Lean quality/process improvement strategies, which require skilful facilitation. Currently, no validated Lean facilitator assessment tool for interprofessional collaboration exists. This article describes the development and pilot evaluation of such a tool; the Interprofessional Lean Facilitator Assessment Scale (ILFAS), which measures both technical and 'soft' skills, which have not been measured in other instruments. The ILFAS was developed using methodologies and principles from Lean/Shingo, IPCP, metacognition research and Bloom's Taxonomy of Learning Domains. A panel of experts confirmed the initial face validity of the instrument. Researchers independently assessed five facilitators, during six Lean sessions. Analysis included quantitative evaluation of rater agreement. Overall inter-rater agreement of the assessment of facilitator performance was high (92%), and discrepancies in the agreement statistics were analysed. Face and content validity were further established, and usability was evaluated, through primary stakeholder post-pilot feedback, uncovering minor concerns, leading to tool revision. The ILFAS appears comprehensive in the assessment of facilitator knowledge, skills, abilities, and may be useful in the discrimination between facilitators of different skill levels. Further study is needed to explore instrument performance and validity.
Measuring Primary Teachers' Attitudes Toward Teaching Science: Development of the Dimensions of Attitude Toward Science (DAS) Instrument

NASA Astrophysics Data System (ADS)

van Aalderen-Smeets, Sandra; Walma van der Molen, Juliette

2013-03-01

In this article, we present a valid and reliable instrument which measures the attitude of in-service and pre-service primary teachers toward teaching science, called the Dimensions of Attitude Toward Science (DAS) Instrument. Attention to the attitudes of primary teachers toward teaching science is of fundamental importance to the professionalization of these teachers in the field of primary science education. With the development of this instrument, we sought to fulfill the need for a statistically and theoretically valid and reliable instrument to measure pre-service and in-service teachers' attitudes. The DAS Instrument is based on a comprehensive theoretical framework for attitude toward (teaching) science. After pilot testing, the DAS was revised and subsequently validated using a large group of respondents (pre-service and in-service primary teachers) (N = 556). The theoretical underpinning of the DAS combined with the statistical data indicate that the DAS possesses good construct validity and that it proves to be a promising instrument that can be utilized for research purposes, and also as a teacher training and coaching tool. This instrument can therefore make a valuable contribution to progress within the field of science education.
Validity and reliability of a modified english version of the physical activity questionnaire for adolescents.

PubMed

Aggio, Daniel; Fairclough, Stuart; Knowles, Zoe; Graves, Lee

2016-01-01

Adaptation of physical activity self-report questionnaires is sometimes required to reflect the activity behaviours of diverse populations. The processes used to modify self-report questionnaires though are typically underreported. This two-phased study used a formative approach to investigate the validity and reliability of the Physical Activity Questionnaire for Adolescents (PAQ-A) in English youth. Phase one examined test content and response process validity and subsequently informed a modified version of the PAQ-A. Phase two assessed the validity and reliability of the modified PAQ-A. In phase one, focus groups (n = 5) were conducted with adolescents (n = 20) to investigate test content and response processes of the original PAQ-A. Based on evidence gathered in phase one, a modified version of the questionnaire was administered to participants (n = 169, 14.5 ± 1.7 years) in phase two. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and intra-class correlations, respectively. Spearman correlations were used to assess associations between modified PAQ-A scores and accelerometer-derived physical activity, self-reported fitness and physical activity self-efficacy. Phase one revealed that the original PAQ-A was unrepresentative for English youth and that item comprehension varied. Contextual and population/cultural-specific modifications were made to the PAQ-A for use in the subsequent phase. In phase two, modified PAQ-A scores had acceptable internal consistency (α = 0.72) and test-retest reliability (ICC = 0.78). Modified PAQ-A scores were significantly associated with objectively assessed moderate-to-vigorous physical activity (r = 0.39), total physical activity (r = 0.42), self-reported fitness (r = 0.35), and physical activity self-efficacy (r = 0.32) (p ≤ 0.01). The modified PAQ-A had acceptable internal consistency and test-retest reliability. Modified PAQ-A scores displayed weak-to-moderate correlations with objectively measured physical activity, self-reported fitness, and self-efficacy providing evidence of satisfactory criterion and construct validity, respectively. Further testing with more diverse English samples is recommended to provide a more complete assessment of the tool.
Development and validation of the Comprehensive Indoor Tanning Expectations Scale.

PubMed

Noar, Seth M; Myrick, Jessica Gall; Morales-Pico, Brenda; Thomas, Nancy E

2014-05-01

Strong links between indoor tanning behavior and skin cancer have been demonstrated across several studies. Understanding the complex belief systems that underlie indoor tanning in young women is a crucial first step in developing interventions to deter this behavior. To develop and validate a comprehensive, multidimensional, theory-based outcome expectations measure to advance an understanding of the sets of beliefs that underlie indoor tanning behavior among young women. Cross-sectional study comprising a web-based survey of 11 sororities at a large university in the southeastern United States. Study participants (n = 706) were aged 18 to 25 years; 45.3% had tanned indoors in their lifetime and 30.3% in the past year. Intention to tan indoors, frequency of indoor tanning behavior in the past year, and indoor tanner type (nontanner, former tanner, or current tanner). A comprehensive scale assessing indoor tanning outcome expectations was developed. In total, 6 positive outcome expectations factors and 5 negative outcome expectations factors were identified. These subscales were reliable (coefficient α range, 0.86-0.95) and were significantly (mostly at P < .001) correlated with a set of established measures, including appearance motivation, indoor tanning attitudes and norms, and intention to tan indoors. Examination of subscales across the 3 indoor tanning groups also revealed significant (P < .001) differences on all 11 subscales. Current tanners had the most positive and least negative perceptions about indoor tanning, while nontanners had the most negative and least positive perceptions. Former tanners tended to fall in between these 2 groups. The 2 subscales with the largest differences across the groups were mood enhancement (positive outcome expectation) and psychological/physical discomfort (negative outcome expectation). Multiple linear regression analyses demonstrated several outcome expectations subscales to be significantly associated with intention to tan indoors and frequency of indoor tanning behavior. Results suggest that the Comprehensive Indoor Tanning Expectations (CITE) Scale provides a reliable and valid assessment of the complex sets of beliefs that underlie indoor tanning, including positive (motivational) and negative (deterrent) beliefs. This new scale may further advance research on indoor tanning beliefs and can guide health communications to prevent and deter indoor tanning behavior.
Adolescents' perception of parental feeding practices: Adaptation and validation of the Comprehensive Feeding Practices Questionnaire for Brazilian adolescents—The CFPQ-Teen

PubMed Central

Piccoli, Ângela Bein; Neiva-Silva, Lucas; Mosmann, Clarisse Pereira; Musher-Eizenman, Dara; Pellanda, Lucia C.

2017-01-01

Background Parental feeding practices may play a key role in dietary habits and nutritional status of adolescents, but research from adolescents’ point of view on this topic is scarce. Objective To adapt and validate an instrument of parental feeding practices as perceived by adolescents in a Brazilian setting. Methods The Comprehensive Feeding Practices Questionnaire was translated into Portuguese and adapted to be answered by adolescents (ages 12 to 18). Content analysis and FACE validity to assess cultural equivalence was undertaken by experts in the adolescent nutritional and psychological fields. Pilot study was evaluated in 23 adolescents. The final version was administered to 41 students to assess instrument reproducibility (Intraclass Correlation Coefficient). Internal consistency (Cronbach's Alpha) and construct validity (Confirmatory Factor Analysis) were assessed in a third sample of 307 adolescents. Results Experts and adolescents considered content validity as appropriate. In reproducibility analysis (Intraclass Correlation Coefficient), 10 of the 12 factors were above 0.7. The factors “teaching about nutrition” and “food as reward” obtained values of 0.60 and 0.68, respectively. The Cronbach's Alpha of the whole scale was 0.83 and alphas for subscales ranged from 0.52 to 0.85; the factors “teaching about nutrition” and “food as a reward” had the lowest values (0.52). After removing these two factors, the Confirmatory Factor Analysis indicated that the structural model was appropriate. The final scale was made up of 10 factors with 43 questions. Conclusions The Comprehensive Feeding Practices Questionnaire-Teen demonstrates validity and reliability, and is a suitable tool to evaluate the perceptions of adolescents regarding parental feeding practices. PMID:29145485
Development and validation of a ten-item questionnaire with explanatory illustrations to assess upper extremity disorders: favorable effect of illustrations in the item reduction process.

PubMed

Kurimoto, Shigeru; Suzuki, Mikako; Yamamoto, Michiro; Okui, Nobuyuki; Imaeda, Toshihiko; Hirata, Hitoshi

2011-11-01

The purpose of this study is to develop a short and valid measure for upper extremity disorders and to assess the effect of attached illustrations in item reduction of a self-administered disability questionnaire while retaining psychometric properties. A validated questionnaire used to assess upper extremity disorders, the Hand20, was reduced to ten items using two item-reduction techniques. The psychometric properties of the abbreviated form, the Hand10, were evaluated on an independent sample that was used for the shortening process. Validity, reliability, and responsiveness of the Hand10 were retained in the item reduction process. It was possible that the use of explanatory illustrations attached to the Hand10 helped with its reproducibility. The illustrations for the Hand10 promoted text comprehension and motivation to answer the items. These changes resulted in high acceptability; more than 99.3% of patients, including 98.5% of elderly patients, could complete the Hand10 properly. The illustrations had favorable effects on the item reduction process and made it possible to retain precision of the instrument. The Hand10 is a reliable and valid instrument for individual-level applications with the advantage of being compact and broadly applicable, even in elderly individuals.
Validity and reliability of the Traditional Chinese version of the Multidimensional Fatigue Inventory in general population.

PubMed

Chuang, Li-Ling; Chuang, Yu-Fen; Hsu, Miao-Ju; Huang, Ying-Zu; Wong, Alice M K; Chang, Ya-Ju

2018-01-01

Fatigue is a common symptom in the general population and has a substantial effect on individuals' quality of life. The Multidimensional Fatigue Inventory (MFI) has been widely used to quantify the impact of fatigue, but no Traditional Chinese translation has yet been validated. The goal of this study was to translate the MFI from English into Traditional Chinese ('the MFI-TC') and subsequently to examine its validity and reliability. The study recruited a convenience sample of 123 people from various age groups in Taiwan. The MFI was examined using a two-step process: (1) translation and back-translation of the instrument; and (2) examination of construct validity, convergent validity, internal consistency, test-retest reliability, and measurement error. The validity and reliability of the MFI-TC were assessed by factor analysis, Spearman rho correlation coefficient, Cronbach's alpha coefficient, intraclass correlation coefficient (ICC), minimal detectable change (MDC), and Bland-Altman analysis. All participants completed the Short-Form-36 Health Survey Taiwan Form (SF-36-T) and the Chinese version of the Pittsburgh Sleep Quality Index (PSQI) concurrently to test the convergent validity of the MFI-TC. Test-retest reliability was assessed by readministration of the MFI-TC after a 1-week interval. Factor analysis confirmed the four dimensions of fatigue: general/physical fatigue, reduced activity, reduced motivation, and mental fatigue. A four-factor model was extracted, combining general fatigue and physical fatigue as one factor. The results demonstrated moderate convergent validity when correlating fatigue (MFI-TC) with quality of life (SF-36-T) and sleep disturbances (PSQI) (Spearman's rho = 0.68 and 0.47, respectively). Cronbach's alpha for the MFI-TC total scale and subscales ranged from 0.73 (mental fatigue subscale) to 0.92 (MFI-TC total scale). ICCs ranged from 0.85 (reduced motivation) to 0.94 (MFI-TC total scale), and the MDC ranged from 2.33 points (mental fatigue) to 9.5 points (MFI-TC total scale). The Bland-Altman analyses showed no significant systematic bias between the repeated assessments. The results support the use of the Traditional Chinese version of the MFI as a comprehensive instrument for measuring specific aspects of fatigue. Clinicians and researchers should consider interpreting general fatigue and physical fatigue as one subscale when measuring fatigue in Traditional Chinese-speaking populations.
Validity and reliability of the Traditional Chinese version of the Multidimensional Fatigue Inventory in general population

PubMed Central

Chuang, Li-Ling; Chuang, Yu-Fen; Hsu, Miao-Ju; Huang, Ying-Zu; Wong, Alice M. K.

2018-01-01

Background Fatigue is a common symptom in the general population and has a substantial effect on individuals’ quality of life. The Multidimensional Fatigue Inventory (MFI) has been widely used to quantify the impact of fatigue, but no Traditional Chinese translation has yet been validated. The goal of this study was to translate the MFI from English into Traditional Chinese (‘the MFI-TC’) and subsequently to examine its validity and reliability. Methods The study recruited a convenience sample of 123 people from various age groups in Taiwan. The MFI was examined using a two-step process: (1) translation and back-translation of the instrument; and (2) examination of construct validity, convergent validity, internal consistency, test-retest reliability, and measurement error. The validity and reliability of the MFI-TC were assessed by factor analysis, Spearman rho correlation coefficient, Cronbach’s alpha coefficient, intraclass correlation coefficient (ICC), minimal detectable change (MDC), and Bland-Altman analysis. All participants completed the Short-Form-36 Health Survey Taiwan Form (SF-36-T) and the Chinese version of the Pittsburgh Sleep Quality Index (PSQI) concurrently to test the convergent validity of the MFI-TC. Test-retest reliability was assessed by readministration of the MFI-TC after a 1-week interval. Results Factor analysis confirmed the four dimensions of fatigue: general/physical fatigue, reduced activity, reduced motivation, and mental fatigue. A four-factor model was extracted, combining general fatigue and physical fatigue as one factor. The results demonstrated moderate convergent validity when correlating fatigue (MFI-TC) with quality of life (SF-36-T) and sleep disturbances (PSQI) (Spearman's rho = 0.68 and 0.47, respectively). Cronbach’s alpha for the MFI-TC total scale and subscales ranged from 0.73 (mental fatigue subscale) to 0.92 (MFI-TC total scale). ICCs ranged from 0.85 (reduced motivation) to 0.94 (MFI-TC total scale), and the MDC ranged from 2.33 points (mental fatigue) to 9.5 points (MFI-TC total scale). The Bland-Altman analyses showed no significant systematic bias between the repeated assessments. Conclusions The results support the use of the Traditional Chinese version of the MFI as a comprehensive instrument for measuring specific aspects of fatigue. Clinicians and researchers should consider interpreting general fatigue and physical fatigue as one subscale when measuring fatigue in Traditional Chinese-speaking populations. PMID:29746466

[Validation of the German version of the Vertigo Handicap Questionnaire (VHQ) in patients with vestibular vertigo syndromes or somatoform vertigo and dizziness].

PubMed

Tschan, Regine; Wiltink, Jörg; Best, Christoph; Beutel, Manfred; Dieterich, Marianne; Eckhardt-Henn, Annegret

2010-01-01

The Vertigo Handicap Questionnaire (VHQ) by Yardley (1992) assesses physical and psychosocial impairments of vertigo or dizziness. Our study examines the structure, reliability, and aspects of validity of the German version of the VHQ. 98 vestibular vertigo syndromes vs. 90 patients with somatoform vertigo and dizziness were evaluated with the VHQ, symptom severity (VSS), distress (GSI), anxiety and depression (HADS), catastrophizing beliefs (ACQ), fear of body sensations (BSQ), and quality of life (SF-36). For diagnostic classification detailed clinical neurological, neuro-otological and psychosomatic testing were conducted. Principal components analysis identified two factors, which could be confirmed by confirmatory factor analyses: 'handicapped activity'(VHQ-ACT) and 'anxiety' (VHQ-ANX). The VHQ had good internal consistency (Cronbach's alpha: 0.92). Test-retest reliability was r = 0.80. We noted close relations between the VHQ, the VSS and measures of emotional distress as aspects of good construct validity. Together with the VSS, the VHQ completes a comprehensive diagnostic screening tool for vertigo or dizziness. © Georg Thieme Verlag KG Stuttgart · New York.
Screening for learning disabilities in young adult career counseling.

PubMed

Kasler, Jon; Fawcett, Angela

2009-01-01

The Strengths and Weaknesses Academic Profile (SWAP) was constructed in Israel in response to the local need of career counselors for a valid, reliable, comprehensive, parsimonious, and computerized screening device for identifying those likely to be at risk of learning disabilities (LD). The method chosen was self-report. A set of cognitive items was written and divided into seven scales: reading, writing, attention and memory, computation, English as a foreign language (EFL), study skills, and self-image. The screening tool was validated on a research sample in Sheffield, UK, based on comparison of the results obtained from the screening with the results of standardized diagnosis of learning disabilities administered to the respondents. The questionnaire was administered to 39 students, half of them diagnosed for dyslexia and half tested and found to be free of dyslexia. Results indicate that SWAP is a reliable and valid questionnaire, with a classification power of approximately 90%. The questionnaire is now widely used in Israel, where an Internet site has been constructed to administer the questionnaire and provide immediate and direct results.
MASH Suite: a user-friendly and versatile software interface for high-resolution mass spectrometry data interpretation and visualization.

PubMed

Guner, Huseyin; Close, Patrick L; Cai, Wenxuan; Zhang, Han; Peng, Ying; Gregorich, Zachery R; Ge, Ying

2014-03-01

The rapid advancements in mass spectrometry (MS) instrumentation, particularly in Fourier transform (FT) MS, have made the acquisition of high-resolution and high-accuracy mass measurements routine. However, the software tools for the interpretation of high-resolution MS data are underdeveloped. Although several algorithms for the automatic processing of high-resolution MS data are available, there is still an urgent need for a user-friendly interface with functions that allow users to visualize and validate the computational output. Therefore, we have developed MASH Suite, a user-friendly and versatile software interface for processing high-resolution MS data. MASH Suite contains a wide range of features that allow users to easily navigate through data analysis, visualize complex high-resolution MS data, and manually validate automatically processed results. Furthermore, it provides easy, fast, and reliable interpretation of top-down, middle-down, and bottom-up MS data. MASH Suite is convenient, easily operated, and freely available. It can greatly facilitate the comprehensive interpretation and validation of high-resolution MS data with high accuracy and reliability.
Comprehensive validation scheme for in situ fiber optics dissolution method for pharmaceutical drug product testing.

PubMed

Mirza, Tahseen; Liu, Qian Julie; Vivilecchia, Richard; Joshi, Yatindra

2009-03-01

There has been a growing interest during the past decade in the use of fiber optics dissolution testing. Use of this novel technology is mainly confined to research and development laboratories. It has not yet emerged as a tool for end product release testing despite its ability to generate in situ results and efficiency improvement. One potential reason may be the lack of clear validation guidelines that can be applied for the assessment of suitability of fiber optics. This article describes a comprehensive validation scheme and development of a reliable, robust, reproducible and cost-effective dissolution test using fiber optics technology. The test was successfully applied for characterizing the dissolution behavior of a 40-mg immediate-release tablet dosage form that is under development at Novartis Pharmaceuticals, East Hanover, New Jersey. The method was validated for the following parameters: linearity, precision, accuracy, specificity, and robustness. In particular, robustness was evaluated in terms of probe sampling depth and probe orientation. The in situ fiber optic method was found to be comparable to the existing manual sampling dissolution method. Finally, the fiber optic dissolution test was successfully performed by different operators on different days, to further enhance the validity of the method. The results demonstrate that the fiber optics technology can be successfully validated for end product dissolution/release testing. (c) 2008 Wiley-Liss, Inc. and the American Pharmacists Association
Older adult mistreatment risk screening: contribution to the validation of a screening tool in a domestic setting.

PubMed

Lindenbach, Jeannette M; Larocque, Sylvie; Lavoie, Anne-Marise; Garceau, Marie-Luce

2012-06-01

ABSTRACTThe hidden nature of older adult mistreatment renders its detection in the domestic setting particularly challenging. A validated screening instrument that can provide a systematic assessment of risk factors can facilitate this detection. One such instrument, the "expanded Indicators of Abuse" tool, has been previously validated in the Hebrew language in a hospital setting. The present study has contributed to the validation of the "e-IOA" in an English-speaking community setting in Ontario, Canada. It consisted of two phases: (a) a content validity review and adaptation of the instrument by experts throughout Ontario, and (b) an inter-rater reliability assessment by home visiting nurses. The adaptation, the "Mistreatment of Older Adult Risk Factors" tool, offers a comprehensive tool for screening in the home setting. This instrument is significant to professional practice as practitioners working with older adults will be better equipped to assess for risk of mistreatment.
The Japanese version of the overall assessment of the speaker's experience of stuttering for adults (OASES-A-J): Translation and psychometric evaluation.

PubMed

Sakai, Naomi; Chu, Shin Ying; Mori, Koichi; Yaruss, J Scott

2017-03-01

This study evaluates the psychometric performance of the Japanese version of the Overall Assessment of the Speaker's Experience of Stuttering for Adults (OASES-A), a comprehensive assessment tool of individuals who stutter. The OASES-A-J was administered to 200 adults who stutter in Japan. All respondents also evaluated their own speech (SA scale), satisfaction of their own speech (SS scale) and the Japanese translation version of the Modified Erickson Communication Attitude scale (S-24). The test-retest reliability and internal consistency of the OASES-A-J were assessed. To examine the concurrent validity of the questionnaire, Pearson correlation was conducted between the OASES-A-J Impact score and the S-24 scale, SA scale and SS scale. In addition, Pearson correlation among the impact scores of each section and total were calculated to examine the construct validity. The OASES-A-J showed a good test-retest reliability (r=0.81-0.95) and high internal consistency (α>0.80). Concurrent validity was moderate to high (0.55-0.75). Construct validity was confirmed by the relation between internal consistency in each section and correlation among sections' impact scores. Japanese adults showed higher negative impact for 'General Information', 'Reactions to Stuttering' and 'Quality of Life' sections. These results suggest that the OASES-A-J is a reliable and valid instrument to measure the impact of stuttering on Japanese adults who stutter. The OASES-A-J could be used as a clinical tool in Japanese stuttering field. Copyright Â© 2016 Elsevier Inc. All rights reserved.
Validation of the French translation-adaptation of the impact of cancer questionnaire version 2 (IOCv2) in a breast cancer survivor population.

PubMed

Blanchin, Myriam; Dauchy, Sarah; Cano, Alejandra; Brédart, Anne; Aaronson, Neil K; Hardouin, Jean-Benoit

2015-07-29

The Impact of Cancer version 2 (IOCv2) was designed to assess the physical and psychosocial health experience of cancer survivors through its positive and negative impacts. Although the IOCv2 is available in English and Dutch, it has not yet been validated for use in French-speaking populations. The current study was undertaken to provide a comprehensive assessment of the reliability and validity of the French language version of the IOCv2 in a sample of breast cancer survivors. An adapted French version of the IOCv2 as well as demographic and medical information were completed by 243 women to validate the factor structure divergent/divergent validities and reliability. Concurrent validity was assessed by correlating the IOCv2 scales with measures from the SF-12, PostTraumatic Growth Inventory and Fear of Cancer Recurrence Inventory. The French version of the IOCv2 supports the structure of the original version, with four positive impact dimensions and four negative impact dimensions. This result was suggested by the good fit of the confirmatory factor analysis and the adequate reliability revealed by Cronbach's alpha coefficients and other psychometric indices. The concurrent validity analysis revealed patterns of association between IOCv2 scale scores and other measures. Unlike the original version, a structure with a Positive Impact domain consisting in the IOCv2 positive dimensions and a Negative Impact domain consisting in the negative ones has not been clearly evidenced in this study. The limited practical use of the conditional dimensions Employment Concerns and Relationship Concerns, whether the patient is partnered or not, did not make possible to provide evidence of validity and reliability of these dimensions as the subsets of sample to work with were not large enough. The scores of these conditional dimensions have to be used with full knowledge of the facts of this limitation of the study. Integrating IOCv2 into studies will contribute to evaluate the psychosocial health experience of the growing population of cancer survivors, enabling better understanding of the multi-dimensional impact of cancer.
Measuring stress in medical education: validation of the Korean version of the higher education stress inventory with medical students.

PubMed

Shim, Eun-Jung; Jeon, Hong Jin; Kim, Hana; Lee, Kwang-Min; Jung, Dooyoung; Noh, Hae-Lim; Roh, Myoung-Sun; Hahm, Bong-Jin

2016-11-24

Medical students face a variety of stressors associated with their education; if not promptly identified and adequately dealt with, it may bring about several negative consequences in terms of mental health and academic performance. This study examined psychometric properties of the Korean version of the Higher Education Stress Inventory (K-HESI). The reliability and validity of the K-HESI were examined in a large scale multi-site survey involving 7110 medical students. The K-HESI, Beck Depression Inventory (BDI) and questions regarding quality of life (QOL) and self-rated physical health (SPH) were administered. Exploratory factor analysis of the K-HESI identified seven factors: Low commitment; financial concerns; teacher-student relationship; worries about future profession; non-supportive climate; workload; and dissatisfaction with education. A subsequent confirmatory factor analysis supported the 7-factor model. Internal consistency of the K-HESI was satisfactory (Cronbach's α = .78). Convergent validity was demonstrated by its positive association with the BDI. Known group validity was supported by the K-HESI's ability to detect significant differences on the overall and subscale scores of K-HESI according to different levels of QOL and SPH. The K-HESI is a psychometrically valid tool that comprehensively assesses various relevant stressors related to medical education. Evidence-based stress management in medical education empirically guided by the regular assessment of stress using reliable and valid measure is warranted.
A critique of Lilienfeld et al.'s (2000) "The scientific status of projective techniques".

PubMed

Hibbard, Stephen

2003-06-01

Lilienfeld, Wood, and Garb (2000) published a largely negative critique of the validity and reliability of projective methods, concentrating on the Comprehensive System for the Rorschach (Exner, 1993), 3 systems for coding the Thematic Apperception Test (TAT; Murray, 1943) cards, and human figure drawings. This article is an effort to document and correct what I perceive as errors of omission and commission in the Lilienfeld et al. article. When projective measures are viewed in the light of these corrections, the evidence for the validity and clinical usefulness of the Rorschach and TAT methods is more robust than Lilienfeld et al. represented.
Quantitative micro-CT based coronary artery profiling using interactive local thresholding and cylindrical coordinates.

PubMed

Panetta, Daniele; Pelosi, Gualtiero; Viglione, Federica; Kusmic, Claudia; Terreni, Marianna; Belcari, Nicola; Guerra, Alberto Del; Athanasiou, Lambros; Exarchos, Themistoklis; Fotiadis, Dimitrios I; Filipovic, Nenad; Trivella, Maria Giovanna; Salvadori, Piero A; Parodi, Oberdan

2015-01-01

Micro-CT is an established imaging technique for high-resolution non-destructive assessment of vascular samples, which is gaining growing interest for investigations of atherosclerotic arteries both in humans and in animal models. However, there is still a lack in the definition of micro-CT image metrics suitable for comprehensive evaluation and quantification of features of interest in the field of experimental atherosclerosis (ATS). A novel approach to micro-CT image processing for profiling of coronary ATS is described, providing comprehensive visualization and quantification of contrast agent-free 3D high-resolution reconstruction of full-length artery walls. Accelerated coronary ATS has been induced by high fat cholesterol-enriched diet in swine and left coronary artery (LCA) harvested en bloc for micro-CT scanning and histologic processing. A cylindrical coordinate system has been defined on the image space after curved multiplanar reformation of the coronary vessel for the comprehensive visualization of the main vessel features such as wall thickening and calcium content. A novel semi-automatic segmentation procedure based on 2D histograms has been implemented and the quantitative results validated by histology. The potentiality of attenuation-based micro-CT at low kV to reliably separate arterial wall layers from adjacent tissue as well as identify wall and plaque contours and major tissue components has been validated by histology. Morphometric indexes from histological data corresponding to several micro-CT slices have been derived (double observer evaluation at different coronary ATS stages) and highly significant correlations (R2 > 0.90) evidenced. Semi-automatic morphometry has been validated by double observer manual morphometry of micro-CT slices and highly significant correlations were found (R2 > 0.92). The micro-CT methodology described represents a handy and reliable tool for quantitative high resolution and contrast agent free full length coronary wall profiling, able to assist atherosclerotic vessels morphometry in a preclinical experimental model of coronary ATS and providing a link between in vivo imaging and histology.
Measuring the impact of diagnostic decision support on the quality of clinical decision making: development of a reliable and valid composite score.

PubMed

Ramnarayan, Padmanabhan; Kapoor, Ritika R; Coren, Michael; Nanduri, Vasantha; Tomlinson, Amanda L; Taylor, Paul M; Wyatt, Jeremy C; Britto, Joseph F

2003-01-01

Few previous studies evaluating the benefits of diagnostic decision support systems have simultaneously measured changes in diagnostic quality and clinical management prompted by use of the system. This report describes a reliable and valid scoring technique to measure the quality of clinical decision plans in an acute medical setting, where diagnostic decision support tools might prove most useful. Sets of differential diagnoses and clinical management plans generated by 71 clinicians for six simulated cases, before and after decision support from a Web-based pediatric differential diagnostic tool (ISABEL), were used. A composite quality score was calculated separately for each diagnostic and management plan by considering the appropriateness value of each component diagnostic or management suggestion, a weighted sum of individual suggestion ratings, relevance of the entire plan, and its comprehensiveness. The reliability and validity (face, concurrent, construct, and content) of these two final scores were examined. Two hundred fifty-two diagnostic and 350 management suggestions were included in the interrater reliability analysis. There was good agreement between raters (intraclass correlation coefficient, 0.79 for diagnoses, and 0.72 for management). No counterintuitive scores were demonstrated on visual inspection of the sets. Content validity was verified by a consultation process with pediatricians. Both scores discriminated adequately between the plans of consultants and medical students and correlated well with clinicians' subjective opinions of overall plan quality (Spearman rho 0.65, p < 0.01). The diagnostic and management scores for each episode showed moderate correlation (r = 0.51). The scores described can be used as key outcome measures in a larger study to fully assess the value of diagnostic decision aids, such as the ISABEL system.
[Validity 'and Utilities' clinic of a grid observation (PACSLAC-F) to evaluate the pain in seniors with dementia's living in the Long-Term Care ].

PubMed

Aubin, Michèle; Verreault, René; Savoie, Maryse; LeMay, Sylvie; Hadjistavropoulos, Thomas; Fillion, Lise; Beaulieu, Marie; Viens, Chantal; Bergeron, Rénald; Vézina, Lucie; Misson, Lucie; Fuchs-Lacelle, Shannon

2008-01-01

This study presents the validation of the French Canadian version (PACLSAC-F) of the Pain Assessment Checklist for Seniors with Limited Ability to Communicate (PACSLAC). Unlike the published validation of the English version of the PACSLAC, which was validated retrospectively, the French version was validated prospectively. The PACSLAC-F was completed by nurses working in long-term care facilities after observing 86 seniors, with severe cognitive impairment, in calm, painful or distressing but non-painful situations. The test-retest and inter-observer reliability, the internal consistency, and the discriminent validity were found to be satisfactory. To evaluate the convergent validity with the DOLOPLUS-2 and the clinical relevance of the PACSLAC, it was also completed by nurses during their work shift, with 26 additional patients, for three days per week during a period of four weeks. These results encourage us to test the PACSLAC in a comprehensive program of pain management targeting this population.
Measuring financial toxicity as a clinically relevant patient-reported outcome: The validation of the COmprehensive Score for financial Toxicity (COST).

PubMed

de Souza, Jonas A; Yap, Bonnie J; Wroblewski, Kristen; Blinder, Victoria; Araújo, Fabiana S; Hlubocky, Fay J; Nicholas, Lauren H; O'Connor, Jeremy M; Brockstein, Bruce; Ratain, Mark J; Daugherty, Christopher K; Cella, David

2017-02-01

Cancer and its treatment lead to increased financial distress for patients. To the authors' knowledge, to date, no standardized patient-reported outcome measure has been validated to assess this distress. Patients with AJCC Stage IV solid tumors receiving chemotherapy for at least 2 months were recruited. Financial toxicity was measured by the COmprehensive Score for financial Toxicity (COST) measure. The authors collected data regarding patient characteristics, clinical trial participation, health care use, willingness to discuss costs, psychological distress (Brief Profile of Mood States [POMS]), and health-related quality of life (HRQOL) as measured by the Functional Assessment of Cancer Therapy: General (FACT-G) and the European Organization for Research and Treatment of Cancer (EORTC) QOL questionnaires. Test-retest reliability, internal consistency, and validity of the COST measure were assessed using standard-scale construction techniques. Associations between the resulting factors and other variables were assessed using multivariable analyses. A total of 375 patients with advanced cancer were approached, 233 of whom (62.1%) agreed to participate. The COST measure demonstrated high internal consistency and test-retest reliability. Factor analyses revealed a coherent, single, latent variable (financial toxicity). COST values were found to be correlated with income (correlation coefficient [r] = 0.28; P<.001), psychosocial distress (r = -0.26; P<.001), and HRQOL, as measured by the FACT-G (r = 0.42; P<.001) and by the EORTC QOL instruments (r = 0.33; P<.001). Independent factors found to be associated with financial toxicity were race (P = .04), employment status (P<.001), income (P = .003), number of inpatient admissions (P = .01), and psychological distress (P = .003). Willingness to discuss costs was not found to be associated with the degree of financial distress (P = .49). The COST measure demonstrated reliability and validity in measuring financial toxicity. Its correlation with HRQOL indicates that financial toxicity is a clinically relevant patient-centered outcome. Cancer 2017;123:476-484. © 2016 American Cancer Society. © 2016 The Authors. Cancer published by Wiley Periodicals, Inc. on behalf of American Cancer Society.
Retell as an Indicator of Reading Comprehension

PubMed Central

Reed, Deborah K.; Vaughn, Sharon

2011-01-01

The purpose of this narrative synthesis is to determine the reliability and validity of retell protocols for assessing reading comprehension of students in grades K–12. Fifty-four studies were systematically coded for data related to the administration protocol, scoring procedures, and technical adequacy of the retell component. Retell was moderately correlated with standardized measures of reading comprehension and, with older students, had a lower correlation with decoding and fluency. Literal information was retold more frequently than inferential, and students with learning disabilities or reading difficulties needed more supports to demonstrate adequate recall. Great variability was shown in the prompting procedures, but scoring methods were more consistent across studies. The influences of genre, background knowledge, and organizational features were often specific to particular content, texts, or students. Overall, retell has not yet demonstrated adequacy as a progress monitoring instrument. PMID:23125521
Evaluation on Cost Overrun Risks of Long-distance Water Diversion Project Based on SPA-IAHP Method

NASA Astrophysics Data System (ADS)

Yuanyue, Yang; Huimin, Li

2018-02-01

Large investment, long route, many change orders and etc. are main causes for costs overrun of long-distance water diversion project. This paper, based on existing research, builds a full-process cost overrun risk evaluation index system for water diversion project, apply SPA-IAHP method to set up cost overrun risk evaluation mode, calculate and rank weight of every risk evaluation indexes. Finally, the cost overrun risks are comprehensively evaluated by calculating linkage measure, and comprehensive risk level is acquired. SPA-IAHP method can accurately evaluate risks, and the reliability is high. By case calculation and verification, it can provide valid cost overrun decision making information to construction companies.
A critical enquiry into the psychometric properties of the professional quality of life scale (ProQol-5) instrument.

PubMed

Hemsworth, David; Baregheh, Anahita; Aoun, Samar; Kazanjian, Arminee

2018-02-01

This study had conducted a comprehensive analysis of the psychometric properties of Proqol 5, professional quality of work instrument among nurses and palliative care-workers on the basis of three independent datasets. The goal is to see the general applicability of this instrument across multiple populations. Although the Proqol scale has been widely adopted, there are few attempts that have thoroughly analyzed this instrument across multiple datasets using multiple populations. A questionnaire was developed and distributed to palliative care-workers in Canada and Nurses at two hospitals in Australia and Canada, this resulted in 273 datasets from the Australian and 303 datasets from the Canadian nurses and 503 datasets from the Canadian palliative care-workers. A comprehensive psychometric property analysis was conducted including inter-item correlations, tests of reliability, and both convergent and discriminant validity as well as construct validity analyses. In addition, to test for the reverse coding artifacts in the BO scale, exploratory factor analysis was adopted. The psychometric property analysis of Proqol 5 was satisfactory for the compassion satisfaction construct. However, there are concerns with respect to the burnout and secondary trauma stress scales and recommendations are made regarding the coding and specific items which should improve the reliability and validity of these scales. This research establishes the strengths and weaknesses of the Proqol instrument and demonstrates how it can be improved. Through specific recommendations, the academic community is invited to revise the burnout and secondary traumatic stress scales in an effort to improve Proqol 5 measures. Copyright © 2017. Published by Elsevier Inc.
Validation of the German patient-reported outcomes version of the common terminology criteria for adverse events (PRO-CTCAE™).

PubMed

Hagelstein, V; Ortland, I; Wilmer, A; Mitchell, S A; Jaehde, U

2016-12-01

Integrating the patient's perspective has become an increasingly important component of adverse event reporting. The National Cancer Institute has developed a Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE™). This instrument has been translated into German and linguistically validated; however, its quantitative measurement properties have not been evaluated. A German language survey that included 31 PRO-CTCAE items, as well as the EORTC QLQ-C30 and the Oral Mucositis Daily Questionnaire (OMDQ), was distributed at 10 cancer treatment settings in Germany and Austria. Item quality was assessed by analysis of acceptability and comprehensibility. Reliability was evaluated by using Cronbach's' alpha and validity by principal components analysis (PCA), multitrait-multimethod matrix (MTMM) and known groups validity techniques. Of 660 surveys distributed to the study centres, 271 were returned (return rate 41%), and data from 262 were available for analysis. Participants' median age was 59.7 years, and 69.5% of the patients were female. Analysis of item quality supported the comprehensibility of the 31 PRO-CTCAE items. Reliability was very good; Cronbach's' alpha correlation coefficients were >0.9 for almost all item clusters. Construct validity of the PRO-CTCAE core item set was shown by identifying 10 conceptually meaningful item clusters via PCA. Moreover, construct validity was confirmed by the MTMM: monotrait-heteromethod comparison showed 100% high correlation, whereas heterotrait-monomethod comparison indicated 0% high correlation. Known groups validity was supported; PRO-CTCAE scores were significantly lower for those with impaired versus preserved health-related quality of life. A set of 31 items drawn from the German PRO-CTCAE item library demonstrated favourable measurement properties. These findings add to the body of evidence that PRO-CTCAE provides a rigorous method to capture patient self-reports of symptomatic toxicity for use in cancer clinical trials. © The Author 2016. Published by Oxford University Press on behalf of the European Society for Medical Oncology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Assessment of NDE Reliability Data

NASA Technical Reports Server (NTRS)

Yee, B. G. W.; Chang, F. H.; Couchman, J. C.; Lemon, G. H.; Packman, P. F.

1976-01-01

Twenty sets of relevant Nondestructive Evaluation (NDE) reliability data have been identified, collected, compiled, and categorized. A criterion for the selection of data for statistical analysis considerations has been formulated. A model to grade the quality and validity of the data sets has been developed. Data input formats, which record the pertinent parameters of the defect/specimen and inspection procedures, have been formulated for each NDE method. A comprehensive computer program has been written to calculate the probability of flaw detection at several confidence levels by the binomial distribution. This program also selects the desired data sets for pooling and tests the statistical pooling criteria before calculating the composite detection reliability. Probability of detection curves at 95 and 50 percent confidence levels have been plotted for individual sets of relevant data as well as for several sets of merged data with common sets of NDE parameters.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran.

PubMed

Pourmarzi, Davoud; Khoramirad, Ashraf; Ahmari Tehran, Hoda; Abedini, Zahra

2015-11-01

To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran. Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA). Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test-retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC). Cronbach's alpha coefficient for overall scale was 0.85. Also Cronbach's alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84), negative self-worth (4 items, α = 0.70), perceived interpersonal insecurity (2 items, α = 0.57), financial insecurity (3 items, α = 0.70), discretionary disclosure (2 items, α = 0.83). Test-retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales. This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran

PubMed Central

Pourmarzi, Davoud; Khoramirad, Ashraf; Ahmari Tehran, Hoda; Abedini, Zahra

2015-01-01

Objective: To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran. Materials and methods: Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA). Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test–retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC). Results: Cronbach’s alpha coefficient for overall scale was 0.85. Also Cronbach’s alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84), negative self-worth (4 items, α = 0.70), perceived interpersonal insecurity (2 items, α = 0.57), financial insecurity (3 items, α = 0.70), discretionary disclosure (2 items, α = 0.83). Test–retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales. Conclusion: This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran. PMID:27047562

The validity of upper-limb neurodynamic tests for detecting peripheral neuropathic pain.

PubMed

Nee, Robert J; Jull, Gwendolen A; Vicenzino, Bill; Coppieters, Michel W

2012-05-01

The validity of upper-limb neurodynamic tests (ULNTs) for detecting peripheral neuropathic pain (PNP) was assessed by reviewing the evidence on plausibility, the definition of a positive test, reliability, and concurrent validity. Evidence was identified by a structured search for peer-reviewed articles published in English before May 2011. The quality of concurrent validity studies was assessed with the Quality Assessment of Diagnostic Accuracy Studies tool, where appropriate. Biomechanical and experimental pain data support the plausibility of ULNTs. Evidence suggests that a positive ULNT should at least partially reproduce the patient's symptoms and that structural differentiation should change these symptoms. Data indicate that this definition of a positive ULNT is reliable when used clinically. Limited evidence suggests that the median nerve test, but not the radial nerve test, helps determine whether a patient has cervical radiculopathy. The median nerve test does not help diagnose carpal tunnel syndrome. These findings should be interpreted cautiously, because diagnostic accuracy might have been distorted by the investigators' definitions of a positive ULNT. Furthermore, patients with PNP who presented with increased nerve mechanosensitivity rather than conduction loss might have been incorrectly classified by electrophysiological reference standards as not having PNP. The only evidence for concurrent validity of the ulnar nerve test was a case study on cubital tunnel syndrome. We recommend that researchers develop more comprehensive reference standards for PNP to accurately assess the concurrent validity of ULNTs and continue investigating the predictive validity of ULNTs for prognosis or treatment response.
Development of a Self-Report Measure of Reward Sensitivity:A Test in Current and Former Smokers.

PubMed

Hughes, John R; Callas, Peter W; Priest, Jeff S; Etter, Jean-Francois; Budney, Alan J; Sigmon, Stacey C

2017-06-01

Tobacco use or abstinence may increase or decrease reward sensitivity. Most existing measures of reward sensitivity were developed decades ago, and few have undergone extensive psychometric testing. We developed a 58-item survey of the anticipated enjoyment from, wanting for, and frequency of common rewards (the Rewarding Events Inventory-REI). The current analysis focuses on ratings of anticipated enjoyment. The first validation study recruited current and former smokers from Internet sites. The second study recruited smokers who wished to quit and monetarily reinforced them to stay abstinent in a laboratory study and a comparison group of former smokers. In both studies, participants completed the inventory on two occasions, 3-7 days apart. They also completed four anhedonia scales and a behavioral test of reduced reward sensitivity. Half of the enjoyment ratings loaded on four factors: socializing, active hobbies, passive hobbies, and sex/drug use. Cronbach's alpha coefficients were all ≥0.73 for overall mean and factor scores. Test-retest correlations were all ≥0.83. Correlations of the overall and factor scores with frequency of rewards and anhedonia scales were 0.19-0.53, except for the sex/drugs factor. The scores did not correlate with behavioral tests of reward and did not differ between current and former smokers. Lower overall mean enjoyment score predicted a shorter time to relapse. Internal reliability and test-retest reliability of the enjoyment outcomes of the REI are excellent, and construct and predictive validity are modest but promising. The REI is comprehensive and up-to-date, yet is short enough to use on repeated occasions. Replication tests, especially predictive validity tests, are needed. Both use of and abstinence from nicotine appear to increase or decrease how rewarding nondrug rewards are; however, self-report scales to test this have limitations. Our inventory of enjoyment from 58 rewards appears to be reliable and valid as well as comprehensive and up-to-date, yet is short enough to use on repeated occasions. Replication tests, especially of the predictive validity of our scale, are needed. © The Author 2017. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development and evaluation of the "BRISK Scale," a brief observational measure of risk communication competence.

PubMed

Han, Paul K J; Joekes, Katherine; Mills, Greg; Gutheil, Caitlin; Smith, Kahsi; Cochran, Nancy E; Elwyn, Glyn

2016-12-01

To develop and evaluate a brief observational measure of clinical risk communication competence. A 4-item checklist-type measure, the BRISK (Brief Risk Information Skill) Scale, was developed by selecting and refining items from a more comprehensive measure of clinical risk communication competence. Six volunteer raters received brief training on the measure and then used the BRISK Scale to evaluate 52 video-recorded encounters between 2nd-year medical students and standardized patients conducted as part of an Observed Structured Clinical Examination (OSCE) involving a risk communication task. Internal consistency reliability, inter-rater reliability, and criterion validity were assessed. Raters reported no difficulties using the BRISK Scale; scores across all raters and subjects ranged from 0 to 16 with a mean score of 6.49 (SD=3.17). The BRISK Scale showed good internal consistency reliability (α=0.64), and inter-rater reliability at the scale level (Intraclass Correlation Coefficient (ICC)=0.79 for consistency, and 0.75 for absolute agreement) and individual-item level (ICC range: 0.62-.91). Novice raters' BRISK Scale scores were highly correlated (r=0.84, p<0.01) with expert raters' scores on the Risk Communication Content measure, a more comprehensive measure of risk communication competence. The BRISK Scale is a promising new brief observational measure of clinical risk communication competence. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
American Sign Language Comprehension Test: A Tool for Sign Language Researchers.

PubMed

Hauser, Peter C; Paludneviciene, Raylene; Riddle, Wanda; Kurz, Kim B; Emmorey, Karen; Contreras, Jessica

2016-01-01

The American Sign Language Comprehension Test (ASL-CT) is a 30-item multiple-choice test that measures ASL receptive skills and is administered through a website. This article describes the development and psychometric properties of the test based on a sample of 80 college students including deaf native signers, hearing native signers, deaf non-native signers, and hearing ASL students. The results revealed that the ASL-CT has good internal reliability (α = 0.834). Discriminant validity was established by demonstrating that deaf native signers performed significantly better than deaf non-native signers and hearing native signers. Concurrent validity was established by demonstrating that test results positively correlated with another measure of ASL ability (r = .715) and that hearing ASL students' performance positively correlated with the level of ASL courses they were taking (r = .726). Researchers can use the ASL-CT to characterize an individual's ASL comprehension skills, to establish a minimal skill level as an inclusion criterion for a study, to group study participants by ASL skill (e.g., proficient vs. nonproficient), or to provide a measure of ASL skill as a dependent variable. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Beyond Reliability

PubMed Central

2008-01-01

The validity of psychiatric diagnosis rests in part on a demonstration that identifiable biomarkers exist for major psychiatric illnesses. Recent evidence supports the existence of several biomarkers or endophenotypes for both schizophrenia and bipolar disorder. As we learn more about how these biomarkers relate to the symptoms, course, and treatment response of major psychiatric disorders, the “objectivity” of psychiatric diagnosis will increase. However, psychiatry is and will remain a clinically based discipline, aimed at comprehensively understanding and relieving human suffering. PMID:19727304
Questionnaires used to assess barriers of clinical guideline use among physicians are not comprehensive, reliable, or valid: a scoping review.

PubMed

Willson, Melina L; Vernooij, Robin W M; Gagliardi, Anna R

2017-06-01

This study described the number and characteristics of questionnaires used to assess barriers of guideline use among physicians. A scoping review was conducted. MEDLINE and EMBASE were searched from 2005 to June 2016. English-language studies that administered a questionnaire to assess barriers of guideline use among practicing physicians were eligible. Summary statistics were used to report study and questionnaire characteristics. Questionnaire content was assessed with a checklist of 57 known barriers. Each of the 178 included studies administered a unique questionnaire. The number of questionnaires increased yearly from 2005 to 2015. Few were pilot-tested (50, 28.1%) or tested for psychometric properties (3, 1.7%). Two were based on theory. None probed for the full range of known barriers. Ten included a free-text option. The majority assessed professional barriers (177, 99.4%) but few of the 14 factors within this domain. Questionnaire characteristics did not change over time. Organizations administered questionnaires that were not reliable or valid and did not comprehensively assess barriers and may have selected interventions unlikely to promote guideline use. Research is needed to construct a questionnaire that is practical, adaptable, and robust and leads to the selection of interventions that support guideline use. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
[Design and validation of a brief questionnaire to assess young´s sexual knowledge].

PubMed

Leon-Larios, Fátima; Gómez-Baya, Diego

2018-06-01

Only very few instruments have been developed to assess sexual knowledge and practices. Most of the research to date has been carried out with adolescent samples, but not with university students, who are also at a particularly risky stage. The aim of this study was to design and validate a brief questionnaire to assess young´s sexual knowledge, practices and behaviors to design health education programs in the university context. We created a specific questionnaire about sexual pattern in university adolescents and a brief questionnaire consisted of 9 items (true/false) about contraception, sexuality and sexual transmission diseases. We carried out a pilot study, reliability (KR-20) and validity analyses using factorial analysis and examining the association with other variables. 566 students from University of Seville participated during 2015/16. One item was eliminated because of comprehension (only 13.9% of correct answers) and weak or non significant associations (p more than 0.05). Finally, the scale was formed by 8 items and had good internal consistency reliability (KR-20 = 0.57), and both factorial and external validity reliability. A three-factor model showed good data fit, χ2 (14, N=566)=17.48, p= 0.232, Comparative Fit Index CFI = 0.97, root mean squared error of prediction RMSEA = 0.02. Participants with less knowledge about sexuality were whose did not receive any information (M=6.82, SD=1.41), without partner (M=6.87, SD=1.35), had an abortion (M=6.43, SD=1.95) and did not use any contraceptive method (M=6.66, SD=0.58) or coitus interruptus (M=6.55, SD=1.39), and had less sexual relationships, e.g., once or twice a year (M=6.49, SD=1.70). This questionnaire is a short instrument to assess students´ practices and knowledge about sexuality and contraception. The analyses of reliability and validity have shown the good psychometric properties of this instrument.
Translation and validation of Moroccan Western Ontario and McMaster Universities (WOMAC) osteoarthritis index in knee osteoarthritis.

PubMed

Faik, A; Benbouazza, K; Amine, B; Maaroufi, H; Bahiri, R; Lazrak, N; Aboukal, R; Hajjaj-Hassouni, N

2008-05-01

The aim of this study is to assess the reliability and validity of the Western Ontario and McMaster University Osteoarthritis Index (WOMAC) in Moroccan patients with knee osteoarthritis. The WOMAC was translated and back translated to and from dialectal Arabic, pre-tested and reviewed by a committee following the Guillemin criteria. The Moroccan version of the WOMAC was administered twice during a 24-48 h interval to 71 Moroccan patients with symptomatic knee osteoarthritis, fulfilling the revised criteria of the American College of Rheumatology. The test-retest reliability was assessed using intra-class correlation coefficient, and the Bland and Altman method. Internal consistency was assessed by Cronbach's alpha coefficient. Construct validity was tested by correlating the WOMAC subscales with visual analogic scale (VAS) of pain, VAS of handicap, maximum distance walked and clinical characteristics. The Moroccan version of the WOMAC showed good reliability, with ICC values of the three dimensions: pain, stiffness and physical function being 0.80, 0.77 and 0.89, respectively. Bland and Altman analysis showed that means of differences did not differ significantly from 0 and that no systematic trend was observed. Internal consistency with Cronbach's alpha for pain was found to be 0.76, and its equivalents for stiffness and physical function subscales were evaluated at 0.76, 0.90, respectively. Construct validity showed statistically significant correlation with all WOMAC subscales and VAS of pain (rho=0.38, 0.42, 0.63 respectively, P<0.01). Correlation between VAS handicap (rho=0.38 P<0.001) and maximum distance walked (rho=-0.40, P<0.01) was observed with physical function subscale. There was no correlation between age, duration of disease, BMI and severity of pain and physical function in knee OA. The Moroccan version of the WOMAC is a comprehensible, reliable, and valid instrument to measure outcome in patients with knee OA.
Development and validation of The Personal Diabetes Questionnaire (PDQ): a measure of diabetes self-care behaviors, perceptions and barriers.

PubMed

Stetson, Barbara; Schlundt, David; Rothschild, Chelsea; Floyd, Jennifer E; Rogers, Whitney; Mokshagundam, Sri Prakash

2011-03-01

To develop and evaluate the validity and reliability of The Personal Diabetes Questionnaire (PDQ), a brief, yet comprehensive measure of diabetes self-care behaviors, perceptions and barriers. To examine individual items to provide descriptive and normative information and provide data on scale reliability and associations between PDQ scales and concurrently assessed HBA(1c) and BMI. Items were written to address nutritional management, medication utilization, blood glucose monitoring, and physical activity. The initial instrument was reviewed by multidisciplinary diabetes care providers and items subsequently revised until the measure provided complete coverage of the diabetes care domains using as few items as possible. The scoring scheme was generated rationally. Subjects were 790 adults (205 with type 1 and 585 with type 2 diabetes) who completed the PDQ while waiting for clinic appointments. Item completion rates were high, with few items skipped by participants. Subscales demonstrated good internal consistency (Cronbach α=.650-.834) and demonstrated significant associations with BMI (p ≤.001) and HbA(1c) (p ≤.001). The PDQ is a useful measure of diabetes self-care behaviors and related perceptions and barriers that is reliable and valid and feasible to administer in a clinic setting. This measure may be used to obtain data for assessing diabetes self-management and barriers and to guide patient care. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Reliability and Validity of the Turkish Version of Abdel-Khalek’s Death Anxiety Scale among College Students

PubMed Central

SARIÇİÇEK AYDOĞAN, Aybala; GÜLSEREN, Şeref; ÖZTÜRK SARIKAYA, Özyıl; ÖZEN, Çiğdem

2015-01-01

Introduction Although death anxiety is considered a universal phenomenon, attitudes toward death may vary across populations that differ in terms of religion and culture. Abdel-Khalek’s Death Anxiety Scale (ASDA) was developed on the basis of the rationale that there are specific concepts related to death and after death in Muslim populations. This study aims to translate and adapt ASDA in the Turkish population, examine its validity and reliability, and to compare its psychometric properties with the widely used Templer’s Death Anxiety Scale (DAS). Methods A total of 220 medical students were included in the study. The Turkish version of ASDA, DAS, and Hospital Anxiety and Depression Scale were used for data collection. Results Cronbach’s alpha coefficients were .86 for ASDA and .66 for DAS. Analysis by principal components with varimax rotation produced five factors for ASDA that explained 65.6% of total variance. ASDA and DAS were highly correlated with each other (r=.68, p<.001) Conclusion The results of this study indicate that the Turkish version of Abdel-Khalek’s Death Anxiety Scale is a reliable and valid instrument. The Turkish version of ASDA revealed better psychometric properties than DAS. This finding may reflect specific cultural and religious attitudes toward death or may result from more comprehensible language use in ASDA. PMID:28360742
The development and validation of a carer questionnaire to assess cognitive function in neuropsychiatric patients.

PubMed

Randhawa, Sharan; Walterfang, Mark; Miller, Kathryn; Scholes, Amelia; Mocellin, Ramon; Velakoulis, Dennis

2007-07-01

The carer history is an integral part of the assessment of patients with cognitive impairment. We aimed to develop a comprehensive yet concise carer questionnaire, the CogRisk, which captures actuarial risk variables for cognitive impairment in addition to key symptoms suggestive of cognitive decline in a number of cognitive domains, and to then assess its validity and reliability in a neuropsychiatric population. Carers of patients assessed for cognitive impairment completed the CogRisk, and patients were clinically assessed using the Mini-Mental State Examination (MMSE) and Neuropsychiatry Unit COGnitive assessment tool (NUCOG). Reliability was assessed using test-retest and interrater measures and measures of internal consistency. Construct and concurrent validity was assessed using correlation between total and subscale scores on the CogRisk, total scores on the NUCOG and MMSE, and subscale scores on the NUCOG. Predictive validity was determined using measures of sensitivity and specificity and using receiver operating characteristic (ROC) methods. The CogRisk was completed by all carers in less than 10 min. The total CogRisk score correlated significantly with total MMSE and NUCOG scores (r=-0.511 and -0.563, respectively) and remained highly significant when age and education were controlled for. Internal consistency of CogRisk items was high (alpha=0.943). Intrarater reliability of the CogRisk was high with an intraclass correlation coefficient of .978 (P<.001), and interrater reliability between carers was also high at 0.868 (P<.05). Sensitivity and specificity for the detection of dementia were .70 and .73, respectively, with area under the ROC curve not significantly different from that of the MMSE or NUCOG. The CogRisk is a brief carer-rated tool of a patient's cognitive functioning developed for use within a neuropsychiatric setting. It exhibited good concurrent validity, internal consistency, and interrater and intrarater reliability. The CogRisk also demonstrated good sensitivity and specificity for dementia. The CogRisk provides carer information, which complements the clinical assessment and can be used to focus on direct carer interview.
Assessing physiotherapists' communication skills for promoting patient autonomy for self-management: reliability and validity of the communication evaluation in rehabilitation tool.

PubMed

Murray, Aileen; Hall, Amanda; Williams, Geoffrey C; McDonough, Suzanne M; Ntoumanis, Nikos; Taylor, Ian; Jackson, Ben; Copsey, Bethan; Hurley, Deirdre A; Matthews, James

2018-02-27

To assess the inter-rater reliability and concurrent validity of the Communication Evaluation in Rehabilitation Tool, which aims to externally assess physiotherapists competency in using Self-Determination Theory-based communication strategies in practice. Audio recordings of initial consultations between 24 physiotherapists and 24 patients with chronic low back pain in four hospitals in Ireland were obtained as part of a larger randomised controlled trial. Three raters, all of whom had Ph.Ds in psychology and expertise in motivation and physical activity, independently listened to the 24 audio recordings and completed the 18-item Communication Evaluation in Rehabilitation Tool. Inter-rater reliability between all three raters was assessed using intraclass correlation coefficients. Concurrent validity was assessed using Pearson's r correlations with a reference standard, the Health Care Climate Questionnaire. The total score for the Communication Evaluation in Rehabilitation Tool is an average of all 18 items. Total scores demonstrated good inter-rater reliability (Intraclass Correlation Coefficient (ICC) = 0.8) and concurrent validity with the Health Care Climate Questionnaire total score (range: r = 0.7-0.88). Item-level scores of the Communication Evaluation in Rehabilitation Tool identified five items that need improvement. Results provide preliminary evidence to support future use and testing of the Communication Evaluation in Rehabilitation Tool. Implications for Rehabilitation Promoting patient autonomy is a learned skill and while interventions exist to train clinicians in these skills there are no tools to assess how well clinicians use these skills when interacting with a patient. The lack of robust assessment has severe implications regarding both the fidelity of clinician training packages and resulting outcomes for promoting patient autonomy. This study has developed a novel measurement tool Communication Evaluation in Rehabilitation Tool and a comprehensive user manual to assess how well health care providers use autonomy-supportive communication strategies in real world-clinical settings. This tool has demonstrated good inter-rater reliability and concurrent validity in its initial testing phase. The Communication Evaluation in Rehabilitation Tool can be used in future studies to assess autonomy-supportive communication and undergo further measurement property testing as per our recommendations.
Third Molars on the Internet: A Guide for Assessing Information Quality and Readability.

PubMed

Hanna, Kamal; Brennan, David; Sambrook, Paul; Armfield, Jason

2015-10-06

Directing patients suffering from third molars (TMs) problems to high-quality online information is not only medically important, but also could enable better engagement in shared decision making. This study aimed to develop a scale that measures the scientific information quality (SIQ) for online information concerning wisdom tooth problems and to conduct a quality evaluation for online TMs resources. In addition, the study evaluated whether a specific piece of readability software (Readability Studio Professional 2012) might be reliable in measuring information comprehension, and explored predictors for the SIQ Scale. A cross-sectional sample of websites was retrieved using certain keywords and phrases such as "impacted wisdom tooth problems" using 3 popular search engines. The retrieved websites (n=150) were filtered. The retained 50 websites were evaluated to assess their characteristics, usability, accessibility, trust, readability, SIQ, and their credibility using DISCERN and Health on the Net Code (HoNCode). Websites' mean scale scores varied significantly across website affiliation groups such as governmental, commercial, and treatment provider bodies. The SIQ Scale had a good internal consistency (alpha=.85) and was significantly correlated with DISCERN (r=.82, P<.01) and HoNCode (r=.38, P<.01). Less than 25% of websites had SIQ scores above 75%. The mean readability grade (10.3, SD 1.9) was above the recommended level, and was significantly correlated with the Scientific Information Comprehension Scale (r=.45. P<.01), which provides evidence for convergent validity. Website affiliation and DISCERN were significantly associated with SIQ (P<.01) and explained 76% of the SIQ variance. The developed SIQ Scale was found to demonstrate reliability and initial validity. Website affiliation, DISCERN, and HoNCode were significant predictors for the quality of scientific information. The Readability Studio software estimates were associated with scientific information comprehensiveness measures.
Reliability and Validity of Three Instruments (DSM-IV, CPGI, and PPGM) in the Assessment of Problem Gambling in South Korea.

PubMed

Back, Ki-Joon; Williams, Robert J; Lee, Choong-Ki

2015-09-01

Most research on the assessment, epidemiology, and treatment of problem gambling has occurred in Western jurisdictions. This potentially limits the cross-cultural validity of problem gambling assessment instruments as well as etiological models of problem gambling. The primary objective of the present research was to investigate the reliability and validity of three problem gambling assessment instruments within a South Korean context. A total of 4,330 South Korean adults participated in a comprehensive assessment of their gambling behavior that included the administration of the DSM-IV criteria for pathological gambling (NODS), the Canadian Problem Gambling Index (CPGI), and the Problem and Pathological Gambling Measure (PPGM). Cronbach alpha showed that all three instruments had good internal consistency. Concurrent validity was established by the significant associations observed between scores on the instruments and measures of gambling involvement (number of gambling formats engaged in; frequency of gambling; and gambling expenditure). Most importantly, kappa statistics showed that all instruments have satisfactory classification accuracy against clinical assessment of problem gambling conducted by South Korean clinicians (NODS κ = .66; PPGM κ = .62; CPGI κ = .51). These results confirm that Western-derived operationalizations of problem gambling have applicability in a South Korean setting.
The merits and problems of Neuropsychiatric Inventory as an assessment tool in people with dementia and other neurological disorders.

PubMed

Lai, Claudia K Y

2014-01-01

The Neuropsychiatric Inventory (NPI) is one of the most commonly used assessment scales for assessing symptoms in people with dementia and other neurological disorders. This paper analyzes its conceptual framework, measurement mode, psychometric properties, and merits and problems. All articles discussing the psychometric properties and factor structure of the NPI were searched for in Medline via Ovid. The abstracts of these papers were read to determine their relevance to the purpose of this paper. If deemed appropriate, a full paper was then obtained and read. The NPI has reasonably good content validity and internal consistency, and good test-retest and interrater reliability. There is limited information about its sensitivity, specificity, positive and negative predictive values, and, in particular, responsiveness. Merits of the NPI include being comprehensive, avoiding symptom overlap, ease of use, and flexibility. It has problems in scoring (no multiples of 5, 7, and 11) and, therefore, analysis using parametric tests may not be appropriate. The use of individual subscales also warrants further investigation. In terms of its content and concurrent validity, intra- and interrater reliability, test-retest reliability, and internal consistency, the NPI can be considered as valid and reliable, and can be used across different ethnic groups. The tool is most likely unable to deliver as good a performance in terms of discriminating between different disorders. More studies are required to further evaluate its psychometric properties, particularly in the areas of factor structure and responsiveness. The clinical utility of the NPI also needs to be further explored.
FIMA, the questionnaire for health-related resource use in the elderly population: validity, reliability, and usage of the Polish version in clinical practice.

PubMed

Mazurek, Justyna; Sutkowska, Edyta; Szcześniak, Dorota; Urbańska, Katarzyna Małgorzata; Rymaszewska, Joanna

2018-01-01

The purpose of this study was to determine the validity and reliability of the Polish version of the Questionnaire for Health-Related Resource Use in an Elderly Population [Fragebogen zur Inanspruchnahme medizinischer und nicht-medizinischer Versorgungsleistungen im Alter (FIMA)]. This was a cross-sectional study conducted in a rehabilitation care unit in Poland between January and June of 2017. Sixty-one patients aged ≥65 years who had been admitted to the unit were enrolled into the study. Each participant was evaluated twice: once within 48 hours of admission (T1) and once after 2 weeks (T2). The translated instrument was understood by most respondents in a selected population and it maintained a reading and comprehension level that was accessible by most respondents, even of a low education level. With the aid of the prevalence-adjusted bias-adjusted kappa (PABAK) and intraclass correlation coefficient (ICC), 100% test-retest reliability for 10 out of the 12 questions that were subjected to analysis was indicated. The most frequent health-related resource uses were appointments at the general practitioner (90.2%) and orthopedist (54.1%), medication (93.4%), and the necessity to have glasses as supportive equipment (70.5%). The Polish FIMA demonstrated very good test-retest reliability, good validity, and ease of use for elderly people. Further investigation is required. In the future, the routine use of this instrument could be encouraged to assess the use and demand for medical and nonmedical services among the elderly.
Validity and reliability of the TED-QOL: a new three-item questionnaire to assess quality of life in thyroid eye disease.

PubMed

Fayers, Tessa; Dolman, Peter J

2011-12-01

To develop and test a user-friendly questionnaire for rapidly assessing quality of life (QOL) in thyroid eye disease (TED). A three-item questionnaire, the TED-QOL, was designed and compared to the 16-item Graves Ophthalmopathy (GO)-QOL and the nine-item GO-Quality of Life Scale (QLS). 100 patients with TED were administered all three questionnaires on two occasions. Results were compared to clinical severity scores (Vision, Inflammation, Strabismus, Appearance (VISA) classification). Main outcomes were construct and criterion validity, test-retest reliability, duration, comprehension and completion rates. TED-QOL correlated strongly with the other questionnaires for corresponding items (Pearson correlation: appearance 0.71, 0.62; functioning 0.69, 0.66; overall QOL 0.53). Test-retest analysis demonstrated good reliability for all three questionnaires (intraclass correlations: TED-QOL 0.81, 0.74, 0.87; GO-QOL 0.81, 0.82; GO-QLS 0.74, 0.86, 0.67). TED-QOL was significantly faster to complete (1.6 min vs GO-QOL 3.1 min, GO-QLS 2.7 min, p<0.0001) and had a higher completion rate (100% vs GO-QOL 78%, GO-QLS 94%). There was only moderate correlation between items on all three questionnaires and VISA scores. The TED-QOL is rapid and easy to complete and analyse and has similar validity and reliability to longer questionnaires. All questionnaires showed only moderate correlation with disease severity, emphasising the discrepancy between objective and subjective assessments and the importance of measuring both.
Predicting implementation from organizational readiness for change: a study protocol

PubMed Central

2011-01-01

Background There is widespread interest in measuring organizational readiness to implement evidence-based practices in clinical care. However, there are a number of challenges to validating organizational measures, including inferential bias arising from the halo effect and method bias - two threats to validity that, while well-documented by organizational scholars, are often ignored in health services research. We describe a protocol to comprehensively assess the psychometric properties of a previously developed survey, the Organizational Readiness to Change Assessment. Objectives Our objective is to conduct a comprehensive assessment of the psychometric properties of the Organizational Readiness to Change Assessment incorporating methods specifically to address threats from halo effect and method bias. Methods and Design We will conduct three sets of analyses using longitudinal, secondary data from four partner projects, each testing interventions to improve the implementation of an evidence-based clinical practice. Partner projects field the Organizational Readiness to Change Assessment at baseline (n = 208 respondents; 53 facilities), and prospectively assesses the degree to which the evidence-based practice is implemented. We will conduct predictive and concurrent validities using hierarchical linear modeling and multivariate regression, respectively. For predictive validity, the outcome is the change from baseline to follow-up in the use of the evidence-based practice. We will use intra-class correlations derived from hierarchical linear models to assess inter-rater reliability. Two partner projects will also field measures of job satisfaction for convergent and discriminant validity analyses, and will field Organizational Readiness to Change Assessment measures at follow-up for concurrent validity (n = 158 respondents; 33 facilities). Convergent and discriminant validities will test associations between organizational readiness and different aspects of job satisfaction: satisfaction with leadership, which should be highly correlated with readiness, versus satisfaction with salary, which should be less correlated with readiness. Content validity will be assessed using an expert panel and modified Delphi technique. Discussion We propose a comprehensive protocol for validating a survey instrument for assessing organizational readiness to change that specifically addresses key threats of bias related to halo effect, method bias and questions of construct validity that often go unexplored in research using measures of organizational constructs. PMID:21777479
Initial Psychometric Validation of the Non-Suicidal Self-Injury Scar Cognition Scale.

PubMed

Burke, Taylor A; Olino, Thomas M; Alloy, Lauren B

2017-09-01

Given the growing literature on the detrimental psychological consequences of NSSI, it is surprising that scarce research has focused on the permanent physical consequences of NSSI, scarring to one's tissue (Burke et al. 2015; Lewis 2016). Indeed, with recent research suggesting that upwards of half of those with a history of NSSI bear scarring as a result of the behavior (Burke et al. 2016), the psychological implications of scarring are important to understand. Given preliminary literature suggesting that the vast majority of individuals who bear NSSI scars ascribe a great deal of meaning to their scarring, and that this meaning varies widely, a psychometrically sound scale is needed to comprehensively and systematically assess NSSI scar-related cognitions. The present study examined the psychometric properties of the Non-Suicidal Self-Injury Scar Cognition Scale (NSSI-SCS). A sample of 110 undergraduates with at least one scar from NSSI completed the NSSI-SCS as well as measures of concurrent and divergent validity. Exploratory Factor Analysis was conducted to determine the factor structure of the NSSI-SCS. Results indicated that a five-factor solution offered the best fit for the data. Psychometric analyses support the validity of the NSSI-SCS given evidence of concurrent validity, divergent validity, and reliability. Future research should examine the test-retest reliability of the NSSI-SCS, as well as its sensitivity to change, particularly in the context of treatment research.
Comprehensive knowledge of HIV among women in rural Mozambique: development and validation of the HIV knowledge 27 scale.

PubMed

Ciampa, Philip J; Skinner, Shannon L; Patricio, Sérgio R; Rothman, Russell L; Vermund, Sten H; Audet, Carolyn M

2012-01-01

The relationship between HIV knowledge and HIV-related behaviors in settings like Mozambique has been limited by a lack of rigorously validated measures. A convenience sample of women seeking prenatal care at two clinics were administered an adapted, orally-administered, 27 item HIV-knowledge scale, the HK-27. Validation analyses were stratified by survey language (Portuguese and Echuabo). Kuder-Richardson (KR-20) coefficients estimated internal reliability. Construct validity was assessed with bivariate associations between HK-27 scores (% correct) and selected participant characteristics. The association between knowledge, self-reported HIV testing, and HIV infection were evaluated with multivariable logistic regression. Participants (N = 348) had a median age of 24; 188 spoke Portuguese, and 160 spoke Echuabo. Mean HK-27 scores were higher for Portuguese-speaking participants than Echuabo-speaking participants (68% correct vs. 42%, p<0.001). Internal reliability was strong (KR-20>0.8) for scales in both languages. Higher HK-27 scores were significantly (p≤0.05) correlated with more education, more media items in the home, a history of HIV testing, and participant work outside of the home for women of both languages. HK-27 scores were independently associated with completion of HIV testing in multivariable analysis (per 1% correct: aOR:1.02, 95%CI:0.01-0.03, p = 0.01), but not with HIV infection. HK-27 is a reliable and valid measure of HIV knowledge among Portuguese and Echuabo-speaking Mozambican women. The HK-27 demonstrated significant knowledge deficits among women in the study, and higher scores were associated with higher HIV testing probability. Future studies should evaluate the role of the HK-27 in longitudinal studies and in other populations.

Design, testing and validation of an innovative web-based instrument to evaluate school meal quality.

PubMed

Patterson, Emma; Quetel, Anna-Karin; Lilja, Karin; Simma, Marit; Olsson, Linnea; Elinder, Liselotte Schäfer

2013-06-01

To develop a feasible, valid, reliable web-based instrument to objectively evaluate school meal quality in Swedish primary schools. The construct 'school meal quality' was operationalized by an expert panel into six domains, one of which was nutritional quality. An instrument was drafted and pilot-tested. Face validity was evaluated by the panel. Feasibility was established via a large national study. Food-based criteria to predict the nutritional adequacy of school meals in terms of fat quality, iron, vitamin D and fibre content were developed. Predictive validity was evaluated by comparing the nutritional adequacy of school menus based on these criteria with the results from a nutritional analysis. Inter-rater reliability was also assessed. The instrument was developed between 2010 and 2012. It is designed for use in all primary schools by school catering and/or management representatives. A pilot-test of eighty schools in Stockholm (autumn 2010) and a further test of feasibility in 191 schools nationally (spring 2011). The four nutrient-specific food-based criteria predicted nutritional adequacy with sensitivity ranging from 0.85 to 1.0, specificity from 0.45 to 1.0 and accuracy from 0.67 to 1.0. The sample in the national study was statistically representative and the majority of users rated the questionnaire positively, suggesting the instrument is feasible. The inter-rater reliability was fair to almost perfect for continuous variables and agreement was ≥ 67 % for categorical variables. An innovative web-based system to comprehensively monitor school meal quality across several domains, with validated questions in the nutritional domain, is available in Sweden for the first time.
Evaluation of a respiratory symptom diary for clinical studies of idiopathic pulmonary fibrosis.

PubMed

Bacci, Elizabeth Dansie; O'Quinn, Sean; Leidy, Nancy Kline; Murray, Lindsey; Vernon, Margaret

2018-01-01

There are no validated patient diaries for evaluating respiratory symptoms in idiopathic pulmonary fibrosis (IPF). To evaluate the performance properties of the chronic obstructive pulmonary disease (COPD) Evaluating Respiratory Symptoms™ (E-RS™: COPD) measure in patients with IPF. Concept elicitation and cognitive interviews were conducted with IPF patients to evaluate content validity, including comprehensiveness, relevance, and interpretability of E-RS™ items in this patient population. Secondary analyses of IPF clinical study data were performed to evaluate the scoring structure of the tool. With modifications, reliability, validity, and responsiveness of the instrument (E-RS™: IPF) were evaluated. Qualitative interviews (n = 30) were conducted. During the elicitation interviews (n = 20), concept saturation for IPF respiratory symptoms was achieved; all respiratory symptoms covered by the E-RS™ were endorsed by ≥ 30% of the sample. During cognitive interviews (n = 10), all participants found the items interpretable and relevant. Factor analyses conducted via secondary analysis of IPF clinical study data identified no total score and four symptom scales: Chest, Breathlessness, Cough, and Sputum. Reliability of each scale was high (internal consistency [α] >0.85); 2-day reproducibility (ICC >0.88). Validity was supported through significant (P < 0.0001) relationships with the St. George's Respiratory Questionnaire (SGRQ), the University of California, San Diego Shortness of Breath Questionnaire (UCSD-SOBQ), and other variables. The scales were responsive to change when evaluated using SGRQ Symptoms, UCSD-SOBQ, and Patient Global Impression of Change as anchors (P < 0.01 to P < 0.0001). The E-RS™: IPF is a valid, reliable, and responsive tool for evaluating respiratory symptoms in patients with IPF. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Development of the NIH PROMIS ® Sexual Function and Satisfaction measures in patients with cancer.

PubMed

Flynn, Kathryn E; Lin, Li; Cyranowski, Jill M; Reeve, Bryce B; Reese, Jennifer Barsky; Jeffery, Diana D; Smith, Ashley Wilder; Porter, Laura S; Dombeck, Carrie B; Bruner, Deborah Watkins; Keefe, Francis J; Weinfurt, Kevin P

2013-02-01

We describe the development and validation of the Patient-Reported Outcomes Measurement Information System(®) Sexual Function and Satisfaction (PROMIS(®) SexFS; National Institutes of Health) measures, version 1.0, for cancer populations. To develop a customizable self-report measure of sexual function and satisfaction as part of the U.S. National Institutes of Health PROMIS Network. Our multidisciplinary working group followed a comprehensive protocol for developing psychometrically robust patient-reported outcome measures including qualitative (scale development) and quantitative (psychometric evaluation) development. We performed an extensive literature review, conducted 16 focus groups with cancer patients and multiple discussions with clinicians, and evaluated candidate items in cognitive testing with patients. We administered items to 819 cancer patients. Items were calibrated using item-response theory and evaluated for reliability and validity. The PROMIS SexFS measures, version 1.0, include 81 items in 11 domains: Interest in Sexual Activity, Lubrication, Vaginal Discomfort, Erectile Function, Global Satisfaction with Sex Life, Orgasm, Anal Discomfort, Therapeutic Aids, Sexual Activities, Interfering Factors, and Screener Questions. In addition to content validity (patients indicate that items cover important aspects of their experiences) and face validity (patients indicate that items measure sexual function and satisfaction), the measure shows evidence for discriminant validity (domains discriminate between groups expected to be different) and convergent validity (strong correlations between scores on PROMIS and scores on conceptually similar older measures of sexual function), as well as favorable test-retest reliability among people not expected to change (interclass correlations from two administrations of the instrument, 1 month apart). The PROMIS SexFS offers researchers a reliable and valid set of tools to measure self-reported sexual function and satisfaction among diverse men and women. The measures are customizable; researchers can select the relevant domains and items comprising those domains for their study. © 2013 International Society for Sexual Medicine.
Development of the NIH PROMIS® Sexual Function and Satisfaction Measures in Patients with Cancer

PubMed Central

Flynn, Kathryn E.; Lin, Li; Cyranowski, Jill M.; Reeve, Bryce B.; Reese, Jennifer Barsky; Jeffery, Diana D.; Smith, Ashley Wilder; Porter, Laura S.; Dombeck, Carrie B.; Bruner, Deborah Watkins; Keefe, Francis J.; Weinfurt, Kevin P.

2013-01-01

Introduction We describe the development and validation of the PROMIS Sexual Function and Satisfaction (PROMIS SexFS) measures version 1.0 for cancer populations. Aim To develop a customizable self-report measure of sexual function and satisfaction as part of the U.S. National Institutes of Health PROMIS® Network. Methods Our multidisciplinary working group followed a comprehensive protocol for developing psychometrically robust patient reported outcome (PRO) measures including qualitative (scale development) and quantitative (psychometric evaluation) development. We performed an extensive literature review, conducted 16 focus groups with cancer patients and multiple discussions with clinicians, and evaluated candidate items in cognitive testing with patients. We administered items to 819 cancer patients. Items were calibrated using item response theory and evaluated for reliability and validity. Main Outcome Measures The PROMIS Sexual Function and Satisfaction (PROMIS SexFS) measures version 1.0 include 79 items in 11 domains: interest in sexual activity, lubrication, vaginal discomfort, erectile function, global satisfaction with sex life, orgasm, anal discomfort, therapeutic aids, sexual activities, interfering factors, and screener questions. Results In addition to content validity (patients indicate that items cover important aspects of their experiences) and face validity (patients indicate that items measure sexual function and satisfaction), the measure shows evidence for discriminant validity (domains discriminate between groups expected to be different), convergent validity (strong correlations between scores on PROMIS and scores on conceptually-similar older measures of sexual function), as well as favorable test-retest reliability among people not expected to change (inter-class correlations from 2 administrations of the instrument, 1 month apart). Conclusions The PROMIS SexFS offers researchers a reliable and valid set of tools to measure self-reported sexual function and satisfaction among diverse men and women. The measures are customizable; researchers can select the relevant domains and items comprising those domains for their study. PMID:23387911
International Psychometric Validation of an EORTC Quality of Life Module Measuring Cancer Related Fatigue (EORTC QLQ-FA12).

PubMed

Weis, Joachim; Tomaszewski, Krzysztof A; Hammerlid, Eva; Ignacio Arraras, Juan; Conroy, Thierry; Lanceley, Anne; Schmidt, Heike; Wirtz, Markus; Singer, Susanne; Pinto, Monica; Alm El-Din, Mohamed; Compter, Inge; Holzner, Bernhard; Hofmeister, Dirk; Chie, Wei-Chu; Czeladzki, Marek; Harle, Amelie; Jones, Louise; Ritter, Sabrina; Flechtner, Hans-Henning; Bottomley, Andrew

2017-05-01

The European Organisation for Research and Treatment of Cancer (EORTC) Group has developed a new multidimensional instrument measuring cancer-related fatigue to be used in conjunction with the quality of life core questionnaire (EORTC QLQ-C30). The module EORTC QLQ-FA13 assesses physical, cognitive, and emotional aspects of cancer-related fatigue. The methodology follows the EORTC guidelines for phase IV validation of modules. This paper focuses on the results of the psychometric validation of the factorial structure of the module. For validation and cross-validation confirmatory factor analysis (maximum likelihood estimation), intraclass correlation and Cronbach alpha for internal consistency were employed. The study involved an international multicenter collaboration of 11 European and non-European countries. A total of 946 patients with various tumor diagnoses were enrolled. Based on the confirmatory factor analysis, we could approve the three-dimensional structure of the module. Removing one item and reassigning the factorial mapping of another item resulted in the EORTC QLQ-FA12. For the revised scale, we found evidence supporting good local (indicator reliability ≥ 0.60, factor reliability ≥ 0.82) and global model fit (GFI t1|t2 = 0.965/0.957, CFI t1|t2 = 0.976/0.972, RMSEA t1|t2 = 0.060/0.069) for both measurement points. For each scale, test-retest reliability proved to be very good (intraclass correlation: R t1-t2 = 0.905-0.921) and internal consistency proved to be good to high (Cronbach alpha = .79-.90). Based on the former phase III module, the multidimensional structure was revised as a phase IV module (EORTC FA12) with an improved scale structure. For a comprehensive validation of the EORTC FA12, further aspects of convergent and divergent validity as well as sensitivity to change should be determined. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Measuring patient activation in The Netherlands: translation and validation of the American short form Patient Activation Measure (PAM13).

PubMed

Rademakers, Jany; Nijman, Jessica; van der Hoek, Lucas; Heijmans, Monique; Rijken, Mieke

2012-07-31

The American short form Patient Activation Measure (PAM) is a 13-item instrument which assesses patient (or consumer) self-reported knowledge, skills and confidence for self-management of one's health or chronic condition. In this study the PAM was translated into a Dutch version; psychometric properties of the Dutch version were established and the instrument was validated in a panel of chronically ill patients. The translation was done according to WHO guidelines. The PAM 13-Dutch was sent to 4178 members of the Dutch National Panel of people with Chronic illness or Disability (NPCD) in April 2010 (study A) and again to a sub sample of this group (N = 973) in June 2010 (study B). Internal consistency, test-retest reliability and cross-validation with the SBSQ-D (a measure for Health literacy) were computed. The Dutch results were compared to similar Danish and American data. The psychometric properties of the PAM 13-Dutch were generally good. The level of internal consistency is good (α = 0.88) and item-rest correlations are moderate to strong. The Dutch mean PAM score (61.3) is comparable to the American (61.9) and lower than the Danish (64.2). The test-retest reliability was moderate. The association with Health literacy was weak to moderate. The PAM-13 Dutch is a reliable instrument to measure patient activation. More research is needed into the validity of the Patient Activation Measure, especially with respect to a more comprehensive measure of Health literacy.
[Attempt for development of rapid word reading test for children--evaluation of reliability and validity].

PubMed

Hashimoto, Ryusaku; Kashiwagi, Mitsuru; Suzuki, Shuhei

2008-09-01

We developed a rapid word reading test for examining the phonological processing ability of Japanese children. We prepared two versions of the test, version A and B. Each test has word and non-word tasks. Twenty-two healthy boys of third grade in primary schools participated in this validation study. For criterion related validity, we performed the serial Hiragana reading test, the sentence reading test, Raven's coloured progressive matrices (RCPM), the Token test for children, the Kana word dictation test, the standardized comprehension test of abstract words (SCTAW), and Trail Circle test. The reading times of the newly developed test correlated moderately or highly with those of the serial Hiragana reading test and the sentence reading test. However, the scores of the other tests (RCPM, Token test for children, Kana word dictation test, SCTAW, Trail Circle test) did not correlated with the reading time of the rapid word reading test. Test-retest reliabilities in the word tasks were more than moderate: 0.52 and 0.76 in versions A and B, while those in the non-word tasks were high: 0.91 and 0.88 in versions A and B. The correlation coefficient between versions A and B was 0.7 for the word tasks and 0.92 for the non-word tasks. This study showed that the rapid word reading test has substantial validity and reliability for testing the phonological processing ability of Japanese children. In addition, the non-word tasks were more suitable for selectively examining the speed of the grapheme to phoneme conversion process.
Development and initial cohort validation of the Arthritis Research UK Musculoskeletal Health Questionnaire (MSK-HQ) for use across musculoskeletal care pathways

PubMed Central

Hill, Jonathan C; Kang, Sujin; Benedetto, Elena; Myers, Helen; Blackburn, Steven; Smith, Stephanie; Hay, Elaine; Rees, Jonathan; Beard, David; Glyn-Jones, Sion; Barker, Karen; Ellis, Benjamin; Fitzpatrick, Ray; Price, Andrew

2016-01-01

Objectives Current musculoskeletal outcome tools are fragmented across different healthcare settings and conditions. Our objectives were to develop and validate a single musculoskeletal outcome measure for use throughout the pathway and patients with different musculoskeletal conditions: the Arthritis Research UK Musculoskeletal Health Questionnaire (MSK-HQ). Setting A consensus workshop with stakeholders from across the musculoskeletal community, workshops and individual interviews with a broad mix of musculoskeletal patients identified and prioritised outcomes for MSK-HQ inclusion. Initial psychometric validation was conducted in four cohorts from community physiotherapy, and secondary care orthopaedic hip, knee and shoulder clinics. Participants Stakeholders (n=29) included primary care, physiotherapy, orthopaedic and rheumatology patients (n=8); general practitioners, physiotherapists, orthopaedists, rheumatologists and pain specialists (n=7), patient and professional national body representatives (n=10), and researchers (n=4). The four validation cohorts included 570 participants (n=210 physiotherapy, n=150 hip, n=150 knee, n=60 shoulder patients). Outcome measures Outcomes included the MSK-HQ's acceptability, feasibility, comprehension, readability and responder burden. The validation cohort outcomes were the MSK-HQ's completion rate, test–retest reliability and convergent validity with reference standards (EQ-5D-5L, Oxford Hip, Knee, Shoulder Scores, and the Keele MSK-PROM). Results Musculoskeletal domains prioritised were pain severity, physical function, work interference, social interference, sleep, fatigue, emotional health, physical activity, independence, understanding, confidence to self-manage and overall impact. Patients reported MSK-HQ items to be ‘highly relevant’ and ‘easy to understand’. Completion rates were high (94.2%), with scores normally distributed, and no floor/ceiling effects. Test–retest reliability was excellent, and convergent validity was strong (correlations 0.81–0.88). Conclusions A new musculoskeletal outcome measure has been developed through a coproduction process with patients to capture prioritised outcomes for use throughout the pathway and with different musculoskeletal conditions. Four validation cohorts found that the MSK-HQ had high completion rates, excellent test–retest reliability and strong convergent validity with reference standards. Further validation studies are ongoing, including a cohort with rheumatoid/inflammatory arthritis. PMID:27496243
Development and validation of a fast and simple multi-analyte procedure for quantification of 40 drugs relevant to emergency toxicology using GC-MS and one-point calibration.

PubMed

Meyer, Golo M J; Weber, Armin A; Maurer, Hans H

2014-05-01

Diagnosis and prognosis of poisonings should be confirmed by comprehensive screening and reliable quantification of xenobiotics, for example by gas chromatography-mass spectrometry (GC-MS) or liquid chromatography-mass spectrometry (LC-MS). The turnaround time should be short enough to have an impact on clinical decisions. In emergency toxicology, quantification using full-scan acquisition is preferable because this allows screening and quantification of expected and unexpected drugs in one run. Therefore, a multi-analyte full-scan GC-MS approach was developed and validated with liquid-liquid extraction and one-point calibration for quantification of 40 drugs relevant to emergency toxicology. Validation showed that 36 drugs could be determined quickly, accurately, and reliably in the range of upper therapeutic to toxic concentrations. Daily one-point calibration with calibrators stored for up to four weeks reduced workload and turn-around time to less than 1 h. In summary, the multi-analyte approach with simple liquid-liquid extraction, GC-MS identification, and quantification over fast one-point calibration could successfully be applied to proficiency tests and real case samples. Copyright © 2013 John Wiley & Sons, Ltd.
Otoplasty Online Information: A Comprehensive Analysis of the Websites and Videos that Patients View Regarding Cosmetic Ear Surgery.

PubMed

Nissan, Michael E; Gupta, Amar; Rayess, Hani; Black, Kevin Z; Carron, Michael

2018-02-01

Physicians should be aware of both websites and videos available online regarding the otoplasty procedure to provide quality care. This study systematically analyzes the authorships, reliability, quality, and readability of the websites, as well as the authorships and primary objectives of the videos regarding otoplasty. Validated instruments were used to analyze the reliability, quality, and readability of websites, and videos were systematically categorized and analyzed. A Google search was conducted, and the first five pages of results were included in this study. After excluding unrelated websites, the remaining 44 websites were categorized by authorship (physician, patient, academic, or unaffiliated) and were analyzed using the validated DISCERN instrument for reliability and quality, as well as various other validated instruments to measure readability. A YouTube search was also conducted, and the first 50 relevant videos were included in the study. These videos were categorized by authorship and their primary objective. Website authorships were physician-dominated. Reliability, quality, and overall DISCERN score differ between the four authorship groups by a statistically significant margin (Kruskall-Wallis test, p < 0.05). Unaffiliated websites were the most reliable, and physician websites were the least reliable. Academic websites were of the highest quality, and patient websites were of the lowest quality. Readability did not differ significantly between the groups, though the readability measurements made showed a general lack of material easily readable by the general public. YouTube was likewise dominated by physician-authored videos. While the physician-authored videos sought mainly to inform and to advertise, patient-authored videos sought mainly to provide the patient's perspective. Academic organizations showed very little representation on YouTube, and the YouTube views on otoplasty videos were dominated by the top 20 videos, which represented over 93% of the total views of videos included in this study. Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.
Adapting the SERVQUAL scale to hospital services: an empirical investigation.

PubMed Central

Babakus, E; Mangold, W G

1992-01-01

Defining and measuring the quality of service has been a major challenge for health care marketers. A comprehensive service quality measurement scale (SERVQUAL) is empirically evaluated for its potential usefulness in a hospital service environment. Active participation by hospital management helped to address practical and user-related aspects of the assessment. The completed expectations and perceptions scales met various criteria for reliability and validity. Suggestions are provided for the managerial use of the scale, and a number of future research issues are identified. PMID:1737708
Recognizing emotional speech in Persian: a validated database of Persian emotional speech (Persian ESD).

PubMed

Keshtiari, Niloofar; Kuhlmann, Michael; Eslami, Moharram; Klann-Delius, Gisela

2015-03-01

Research on emotional speech often requires valid stimuli for assessing perceived emotion through prosody and lexical content. To date, no comprehensive emotional speech database for Persian is officially available. The present article reports the process of designing, compiling, and evaluating a comprehensive emotional speech database for colloquial Persian. The database contains a set of 90 validated novel Persian sentences classified in five basic emotional categories (anger, disgust, fear, happiness, and sadness), as well as a neutral category. These sentences were validated in two experiments by a group of 1,126 native Persian speakers. The sentences were articulated by two native Persian speakers (one male, one female) in three conditions: (1) congruent (emotional lexical content articulated in a congruent emotional voice), (2) incongruent (neutral sentences articulated in an emotional voice), and (3) baseline (all emotional and neutral sentences articulated in neutral voice). The speech materials comprise about 470 sentences. The validity of the database was evaluated by a group of 34 native speakers in a perception test. Utterances recognized better than five times chance performance (71.4 %) were regarded as valid portrayals of the target emotions. Acoustic analysis of the valid emotional utterances revealed differences in pitch, intensity, and duration, attributes that may help listeners to correctly classify the intended emotion. The database is designed to be used as a reliable material source (for both text and speech) in future cross-cultural or cross-linguistic studies of emotional speech, and it is available for academic research purposes free of charge. To access the database, please contact the first author.
Health literacy demands of written health information materials: an assessment of cervical cancer prevention materials.

PubMed

Helitzer, Deborah; Hollis, Christine; Cotner, Jane; Oestreicher, Nancy

2009-01-01

Health literacy requires reading and writing skills as well as knowledge of health topics and health systems. Materials written at high reading levels with ambiguous, technical, or dense text, often place great comprehension demands on consumers with lower literacy skills. This study developed and used an instrument to analyze cervical cancer prevention materials for readability, comprehensibility, suitability, and message design. The Suitability Assessment of Materials (SAM) was amended for ease of use, inclusivity, and objectivity with the encouragement of the original developers. Other novel contributions were specifically related to "comprehensibility" (CAM). The resulting SAM + CAM was used to score 69 materials for content, literacy demand, numeric literacy, graphics, layout/typography, and learning stimulation variables. Expert reviewers provided content validation. Inter-rater reliability was "substantial" (kappa = .77). The mean reading level of materials was 11th grade. Most materials (68%) scored as "adequate" for comprehensibility, suitability, and message design; health education brochures scored better than other materials. Only one-fifth were ranked "superior" for ease of use and comprehensibility. Most written materials have a readability level that is too high and require improvement in ease of use and comprehensibility for the majority of readers.
Translation, cross-cultural adaptation and reliability of the German version of the migraine disability assessment (MIDAS) questionnaire.

PubMed

Benz, Thomas; Lehmann, Susanne; Gantenbein, Andreas R; Sandor, Peter S; Stewart, Walter F; Elfering, Achim; Aeschlimann, André G; Angst, Felix

2018-03-09

The Migraine Disability Assessment (MIDAS) is a brief questionnaire and measures headache-related disability. This study aimed to translate and cross-culturally adapt the original English version of the MIDAS to German and to test its reliability. The standardized translation process followed international guidelines. The pre-final version was tested for clarity and comprehensibility by 34 headache sufferers. Test-retest reliability of the final version was quantified by 36 headache patients completing the MIDAS twice with an interval of 48 h. Reliability was determined by intraclass correlation coefficients and internal consistency by Cronbach's α. All steps of the translation process were followed, documented and approved by the developer of the MIDAS. The expert committee discussed in detail the complex phrasing of the questions that refer to one to another, especially exclusion of headache-days from one item to the next. The German version contains more active verb sentences and prefers the perfect to the imperfect tense. The MIDAS scales intraclass correlation coefficients ranged from 0.884 to 0.994 and was 0.991 (95% CI: 0.982-0.995) for the MIDAS total score. Cronbach's α for the MIDAS as a whole was 0.69 at test and 0.67 at retest. The translation process was challenged by the comprehensibility of the questionnaire. The German version of the MIDAS is a highly reliable instrument for assessing headache related disability with moderate internal consistency. Provided validity testing of the German MIDAS is successful, it can be recommended for use in clinical practice as well as in research.
A Comprehensive Snow Density Model for Integrating Lidar-Derived Snow Depth Data into Spatial Snow Modeling

NASA Astrophysics Data System (ADS)

Marks, D. G.; Kormos, P.; Johnson, M.; Bormann, K. J.; Hedrick, A. R.; Havens, S.; Robertson, M.; Painter, T. H.

2017-12-01

Lidar-derived snow depths when combined with modeled or estimated snow density can provide reliable estimates of the distribution of SWE over large mountain areas. Application of this approach is transforming western snow hydrology. We present a comprehensive approach toward modeling bulk snow density that is reliable over a vast range of weather and snow conditions. The method is applied and evaluated over mountainous regions of California, Idaho, Oregon and Colorado in the western US. Simulated and measured snow density are compared at fourteen validation sites across the western US where measurements of snow mass (SWE) and depth are co-located. Fitting statistics for ten sites from three mountain catchments (two in Idaho, one in California) show an average Nash-Sutcliff model efficiency coefficient of 0.83, and mean bias of 4 kg m-3. Results illustrate issues associated with monitoring snow depth and SWE and show the effectiveness of the model, with a small mean bias across a range of snow and climate conditions in the west.
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 5. Technical Report #1204

ERIC Educational Resources Information Center

Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 2. Technical Report #1201

ERIC Educational Resources Information Center

Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4. Technical Report #1203

ERIC Educational Resources Information Center

Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 6. Technical Report #1205

ERIC Educational Resources Information Center

Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 7. Technical Report #1206

ERIC Educational Resources Information Center

Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 3. Technical Report #1202

ERIC Educational Resources Information Center

Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Development and initial validation of the NCCN/FACT symptom index for advanced kidney cancer.

PubMed

Rothrock, Nan E; Jensen, Sally E; Beaumont, Jennifer L; Abernethy, Amy P; Jacobsen, Paul B; Syrjala, Karen; Cella, David

2013-01-01

There is a need for a brief symptom index for advanced kidney cancer that includes perspectives of both patients and clinicians and is consistent with the Food and Drug Administration's guidance for patient-reported outcome measures. This study developed and examined the preliminary reliability and validity of the new National Comprehensive Cancer Network/Functional Assessment of Cancer Therapy (FACT)-Kidney Symptom Index 19. Fifty patients with advanced kidney cancer provided open-ended and survey responses ranking their most important symptoms. Responses were reconciled with published clinician reports of the most important symptoms. Ten experienced oncologists rated symptoms as disease- or treatment-related. Patients completed quality-of-life and performance status measures. A 19-item index was produced from symptoms that were rated as most important by patients or clinicians. It includes three subscales: disease-related symptoms (DRS), treatment side effects (TSE), and general function and well-being (FWB). Internal consistency was good for the full instrument (α = 0.83), the DRS subscale (α = 0.76), and the FWB subscale (α = 0.78) but lower for the TSE subscale (α = 0.59). Convergent validity was demonstrated through correlations with the FACT-General. Patients with differing performance status were distinguished by the total score (F2,47 = 17.37; P < .0001), the DRS subscale (F2,47 = 14.22; P < .0001), and the FWB subscale (F2,47 = 13.40; P < .0001) but not the TSE subscale (F2,47 =1.48; P = 0.2380). The National Comprehensive Cancer Network/FACT-Kidney Symptom Index 19 combines symptoms deemed most important by patients and clinicians. Preliminary evidence suggests that the total score and DRS and FWB subscales are reliable and valid as summary indexes. The TSE subscale may be least relevant given the advent of newer therapies. Copyright © 2013 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
A Comprehensive, Multi-modal Evaluation of the Assessment System of an Undergraduate Research Methodology Course: Translating Theory into Practice.

PubMed

Mohammad Abdulghani, Hamza; G Ponnamperuma, Gominda; Ahmad, Farah; Amin, Zubair

2014-03-01

To evaluate assessment system of the 'Research Methodology Course' using utility criteria (i.e. validity, reliability, acceptability, educational impact, and cost-effectiveness). This study demonstrates comprehensive evaluation of assessment system and suggests a framework for similar courses. Qualitative and quantitative methods used for evaluation of the course assessment components (50 MCQ, 3 Short Answer Questions (SAQ) and research project) using the utility criteria. RESULTS of multiple evaluation methods for all the assessment components were collected and interpreted together to arrive at holistic judgments, rather than judgments based on individual methods or individual assessment. Face validity, evaluated using a self-administered questionnaire (response rate-88.7%) disclosed that the students perceived that there was an imbalance in the contents covered by the assessment. This was confirmed by the assessment blueprint. Construct validity was affected by the low correlation between MCQ and SAQ scores (r=0.326). There was a higher correlation between the project and MCQ (r=0.466)/SAQ (r=0.463) scores. Construct validity was also affected by the presence of recall type of MCQs (70%; 35/50), item construction flaws and non-functioning distractors. High discriminating indices (>0.35) were found in MCQs with moderate difficulty indices (0.3-0.7). Reliability of the MCQs was 0.75 which could be improved up to 0.8 by increasing the number of MCQs to at least 70. A positive educational impact was found in the form of the research project assessment driving students to present/publish their work in conferences/peer reviewed journals. Cost per student to complete the course was US$164.50. The multi-modal evaluation of an assessment system is feasible and provides thorough and diagnostic information. Utility of the assessment system could be further improved by modifying the psychometrically inappropriate assessment items.
A Comprehensive, Multi-modal Evaluation of the Assessment System of an Undergraduate Research Methodology Course: Translating Theory into Practice

PubMed Central

Mohammad Abdulghani, Hamza; G. Ponnamperuma, Gominda; Ahmad, Farah; Amin, Zubair

2014-01-01

Objective: To evaluate assessment system of the 'Research Methodology Course' using utility criteria (i.e. validity, reliability, acceptability, educational impact, and cost-effectiveness). This study demonstrates comprehensive evaluation of assessment system and suggests a framework for similar courses. Methods: Qualitative and quantitative methods used for evaluation of the course assessment components (50 MCQ, 3 Short Answer Questions (SAQ) and research project) using the utility criteria. Results of multiple evaluation methods for all the assessment components were collected and interpreted together to arrive at holistic judgments, rather than judgments based on individual methods or individual assessment. Results: Face validity, evaluated using a self-administered questionnaire (response rate-88.7%) disclosed that the students perceived that there was an imbalance in the contents covered by the assessment. This was confirmed by the assessment blueprint. Construct validity was affected by the low correlation between MCQ and SAQ scores (r=0.326). There was a higher correlation between the project and MCQ (r=0.466)/SAQ (r=0.463) scores. Construct validity was also affected by the presence of recall type of MCQs (70%; 35/50), item construction flaws and non-functioning distractors. High discriminating indices (>0.35) were found in MCQs with moderate difficulty indices (0.3-0.7). Reliability of the MCQs was 0.75 which could be improved up to 0.8 by increasing the number of MCQs to at least 70. A positive educational impact was found in the form of the research project assessment driving students to present/publish their work in conferences/peer reviewed journals. Cost per student to complete the course was US$164.50. Conclusions: The multi-modal evaluation of an assessment system is feasible and provides thorough and diagnostic information. Utility of the assessment system could be further improved by modifying the psychometrically inappropriate assessment items. PMID:24772117
Translation, cross-cultural adaptation and psychometric properties of the Back Beliefs Questionnaire in Modern Standard Arabic.

PubMed

Maki, Dana; Rajab, Ebrahim; Watson, Paul J; Critchley, Duncan J

2017-02-01

Purpose To translate and cross-culturally adapt the Back Beliefs Questionnaire (BBQ) into modern standard Arabic and examine its validity, acceptability and reliability in Arabic-speaking patients with low back pain (LBP). Method The BBQ was forward, back-translated and reviewed by an expert committee. Seventeen bilingual patients completed Arabic and English BBQs. LBP patients (n = 199) completed the Arabic BBQ. Sixty-four repeated it a week later, and 151 completed the Arabic Fear-avoidance Beliefs Questionnaire (FABQ). Results The expert committee followed advice from the developers to maintain Arabic equivalence of "back trouble(s)". Patients found the questionnaire comprehensible and acceptable. Agreement between the English and Arabic versions of the BBQ was acceptable, ICC = 0.65 (0.25-0.86). Most item-by-item agreement ranged from fair to moderate (K = 0.12-0.54). Mean (SD) of BBQ, FABQ total, work and physical activity subscales were 25.31(6.13), 44.76(19.49), 21.17(10.10) and 13.95(6.65). The BBQ correlated with the FABQ at r = -0.33, work subscale r = -0.29 and physical activity r = -0.30 (all p < 0.01). Cronbach's α = 0.73 indicated high internal consistency. Test-retest reliability was high, ICC = 0.80 (0.68-0.87). Item-by-item agreement ranged from fair to acceptable (K = 0.31-0.66). Conclusions The Arabic BBQ has good comprehensibility and acceptability, acceptable agreement with the English BBQ, high internal consistency and test-retest reliability. We recommend its use with Arabic-speaking LBP patient to determine their beliefs and attitudes about their back pain, as they have been shown to be important predictors of persistent LBP disability. Implications for Rehabilitation There are limited valid and reliable outcome measures for back pain in Arabic. The Back Beliefs Questionnaire (BBQ) is a tool that measures attitudes and beliefs about back pain. We recommend the use of our valid and reliable, translated and cross-culturally adapted tool with Arabic-speaking patients. The tool can measure attitudes and beliefs concerning the future consequences of LBP, with regards to recovery and return to work in this sample. Findings will improve back pain management options aimed at reducing back pain disability though challenging and modifying beliefs in the Middle East or with migrant populations in the West.
Validating the Patient Experience with Treatment and Self-Management (PETS), a patient-reported measure of treatment burden, in people with diabetes

PubMed Central

Rogers, Elizabeth A; Yost, Kathleen J; Rosedahl, Jordan K; Linzer, Mark; Boehm, Deborah H; Thakur, Azra; Poplau, Sara; Anderson, Roger T; Eton, David T

2017-01-01

Aims To validate a comprehensive general measure of treatment burden, the Patient Experience with Treatment and Self-Management (PETS), in people with diabetes. Methods We conducted a secondary analysis of a cross-sectional survey study with 120 people diagnosed with type 1 or type 2 diabetes and at least one additional chronic illness. Surveys included established patient-reported outcome measures and a 48-item version of the PETS, a new measure comprised of multi-item scales assessing the burden of chronic illness treatment and self-care as it relates to nine domains: medical information, medications, medical appointments, monitoring health, interpersonal challenges, health care expenses, difficulty with health care services, role activity limitations, and physical/mental exhaustion from self-management. Internal reliability of PETS scales was determined using Cronbach’s alpha. Construct validity was determined through correlation of PETS scores with established measures (measures of chronic condition distress, medication satisfaction, self-efficacy, and global well-being), and known-groups validity through comparisons of PETS scores across clinically distinct groups. In an exploratory test of predictive validity, step-wise regressions were used to determine which PETS scales were most associated with outcomes of chronic condition distress, overall physical and mental health, and medication adherence. Results Respondents were 37–88 years old, 59% female, 29% non-white, and 67% college-educated. PETS scales showed good reliability (Cronbach’s alphas ≥0.74). Higher PETS scale scores (greater treatment burden) were correlated with more chronic condition distress, less medication convenience, lower self-efficacy, and worse general physical and mental health. Participants less (versus more) adherent to medications and those with more (versus fewer) health care financial difficulties had higher mean PETS scores. Medication burden was the scale that was most consistently associated with well-being and patient-reported adherence. Conclusion The PETS is a reliable and valid measure for assessing perceived treatment burden in people coping with diabetes. PMID:29184456
Validating the Patient Experience with Treatment and Self-Management (PETS), a patient-reported measure of treatment burden, in people with diabetes.

PubMed

Rogers, Elizabeth A; Yost, Kathleen J; Rosedahl, Jordan K; Linzer, Mark; Boehm, Deborah H; Thakur, Azra; Poplau, Sara; Anderson, Roger T; Eton, David T

2017-01-01

To validate a comprehensive general measure of treatment burden, the Patient Experience with Treatment and Self-Management (PETS), in people with diabetes. We conducted a secondary analysis of a cross-sectional survey study with 120 people diagnosed with type 1 or type 2 diabetes and at least one additional chronic illness. Surveys included established patient-reported outcome measures and a 48-item version of the PETS, a new measure comprised of multi-item scales assessing the burden of chronic illness treatment and self-care as it relates to nine domains: medical information, medications, medical appointments, monitoring health, interpersonal challenges, health care expenses, difficulty with health care services, role activity limitations, and physical/mental exhaustion from self-management. Internal reliability of PETS scales was determined using Cronbach's alpha. Construct validity was determined through correlation of PETS scores with established measures (measures of chronic condition distress, medication satisfaction, self-efficacy, and global well-being), and known-groups validity through comparisons of PETS scores across clinically distinct groups. In an exploratory test of predictive validity, step-wise regressions were used to determine which PETS scales were most associated with outcomes of chronic condition distress, overall physical and mental health, and medication adherence. Respondents were 37-88 years old, 59% female, 29% non-white, and 67% college-educated. PETS scales showed good reliability (Cronbach's alphas ≥0.74). Higher PETS scale scores (greater treatment burden) were correlated with more chronic condition distress, less medication convenience, lower self-efficacy, and worse general physical and mental health. Participants less (versus more) adherent to medications and those with more (versus fewer) health care financial difficulties had higher mean PETS scores. Medication burden was the scale that was most consistently associated with well-being and patient-reported adherence. The PETS is a reliable and valid measure for assessing perceived treatment burden in people coping with diabetes.
Reliability and validity of a scale for health-promoting schools.

PubMed

Lee, Eun Young; Shin, Young-Jeon; Choi, Bo Youl; Cho, Ho Soon Michelle

2014-12-01

Despite a growing body of research regarding the health-promoting schools (HPS) concept from the World Health Organization (WHO), research on measuring of the HPS is limited. This study aims to develop a scale for assessing the status of the HPS based on the WHO guidelines and to evaluate the reliability and validity of the scale. After completing the translation and back-translation process, the content validity of the 50-item scale for HPS (SHPS) was assessed by an expert committee review and pretested with 17 teachers. A stratified, random sampling design was used. A total of 728 teachers from 94 schools completed a self-administered questionnaire. The total sample was randomly divided into three groups for exploratory factor analysis (EFA), confirmatory factor analysis (CFA) and cross-validation. The EFA suggested seven factors, including 37 items, and the CFA confirmed these factors. In a second-order factor analysis, the second-order seven-factor model had acceptable fit indices (root mean square error of approximation 0.07, comparative fit index 0.98) with stability over validation sample and whole sample. Thus, the first-order seven factors (school nutrition services [three-item, α = 0.87], healthy school policies [six-item, α = 0.87], school's physical environment [10-item, α = 0.91], school's social environment [four-item, α = 0.88], community links [six-item, α = 0.91], individual health skills and action competencies [three-item, α = 0.89], and health services [five-item, α = 0.86]) loaded significantly onto the second-order factor (HPS [37-item, α = 0.97]). In conclusion, the SHPS is a reliable and valid measurement tool for assessing the states of the HPS in the Korean school context. It will be useful for comprehensively assessing schools' needs and monitoring the progress of school health interventions. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Can we measure patients' perception during dental impressions? The Burdens in Dental Impression-Making Questionnaire - BiDIM-Q.

PubMed

Tsirogiannis, Panagiotis; Neophytou, Sophia; Reul, Anika; Heydecke, Guido; Reissmann, Daniel R

2017-01-01

To develop a reliable and valid instrument for the comprehensive assessment of patients' burdens during dental impression making, the Burdens in Dental Impression Making Questionnaire, BiDIM-Q. The item pool was generated in a convenience sample of 20 prosthodontic patients using semi-structured face-to-face interviews. The final instrument was tested in 145 consecutively recruited patients, and psychometric properties of the BiDIM-Q were determined. Four different impression materials were used according to the manufacturers' instructions and indications: alginate, c-silicone, polyvinylsiloxane, and polyether. The final BiDIM-Q consisting of 12 items showed sufficient reliability, indicated by Cronbach's alpha of .82 and an average inter-item correlation of .29. Validity was supported by Pearson correlation coefficients for the correlation between the instrument's total score with the patients' overall satisfaction rating (r=.63), and by the correlation matrix for the correlations of the patients' perceptions with the practitioners' satisfaction ratings. Overall, patient perceived burdens were low with highest burdens observed when using polyether in partially dentate patients for pick-up impressions, while lowest burdens were reported when using c-silicone for impressions of edentulous jaws. The BiDIM-Q is a reliable and valid tool for assessing patient-based process-related quality of care in dentistry allowing a deeper insight into patients' perspective during dental impression making. Copyright © 2016 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
Reliability and validity of the EQ-5D-3L for Kashin-Beck disease in China.

PubMed

Fang, Hua; Farooq, Umer; Wang, Dimiao; Yu, Fangfang; Younus, Mohammad Imran; Guo, Xiong

2016-01-01

Kashin-Beck Disease (KBD) is an endemic osteoarthropathy in areas which extend from the North-East to the South-West of China. Most of the patients with KBD suffer multiple dysfunctions in major joints causing decreased health status. However because of their low education level and unique living habits, it is hard to find tools to measure the health-related quality of life (HRQOL). European quality of life (EQ-5D-3L) patient-reported instrument is widely used to measure HRQOL. This study aimed to establish the validity and reliability of the Chinese version of the EQ-5D-3L for evaluating HRQOL of KBD individuals in rural area. 368 individuals who were suffering from KBD were recruited through stratified multistage random sampling from Shaanxi province, China. The EQ-5D-3L and the WHOQOL-BREF were administrated in each individual by face to face interview. Test-retest reliability was assessed at 10-14 days intervals. The test-retest reliability was measured by calculating the Kappa coefficients for EQ-5D-3L five dimensions. For the EQ VAS, the intraclass correlation coefficient (ICC) was computed. Convergent and divergent analysis, construct validity was established using Spearman's rank correlation between the EQ-5D-3L and the WHOQOL-BREF. Known groups' validity was examined by comparing groups with a priori expected differences in health-related quality of life (HRQOL). For 362 individuals (98%), comprehensive data of all the EQ-5D-3L dimensions were available. Kappa values of the EQ-5D-3L five items ranged from 0.324 to 0.554. ICC of the EQ VAS was 0.497. For convergent validity, the three items (self-care, usual activity, and mobility) of EQ-5D-3L, index scores, and VAS showed moderate correlations with the physical health domain of the WHOQOL-BREF (r absolute value ranged from 0.339 to 0.475). For divergent validity, the 5 items of EQ-5D-3L showed weak or no correlations with environment and social relationship domains of WHOQOL-BREF. The Chinese EQ-5D-3L clearly demarcated between groups which were reporting severe disease degree, poorer general health, more number of painful joints with worse HRQOL. The EQ-5D-3L Chinese Version demonstrated fair to moderate levels of test-retest reliability and adequate construct validity in KBD individuals in China.
Design and Testing of a Tool for Evaluating the Quality of Diabetes Consumer-Information Web Sites

PubMed Central

Steinwachs, Donald; Rubin, Haya R

2003-01-01

Background Most existing tools for measuring the quality of Internet health information focus almost exclusively on structural criteria or other proxies for quality information rather than evaluating actual accuracy and comprehensiveness. Objective This research sought to develop a new performance-measurement tool for evaluating the quality of Internet health information, test the validity and reliability of the tool, and assess the variability in diabetes Web site quality. Methods An objective, systematic tool was developed to evaluate Internet diabetes information based on a quality-of-care measurement framework. The principal investigator developed an abstraction tool and trained an external reviewer on its use. The tool included 7 structural measures and 34 performance measures created by using evidence-based practice guidelines and experts' judgments of accuracy and comprehensiveness. Results Substantial variation existed in all categories, with overall scores following a normal distribution and ranging from 15% to 95% (mean was 50% and median was 51%). Lin's concordance correlation coefficient to assess agreement between raters produced a rho of 0.761 (Pearson's r of 0.769), suggesting moderate to high agreement. The average agreement between raters for the performance measures was 0.80. Conclusions Diabetes Web site quality varies widely. Alpha testing of this new tool suggests that it could become a reliable and valid method for evaluating the quality of Internet health sites. Such an instrument could help lay people distinguish between beneficial and misleading information. PMID:14713658
Vending machine assessment methodology. A systematic review.

PubMed

Matthews, Melissa A; Horacek, Tanya M

2015-07-01

The nutritional quality of food and beverage products sold in vending machines has been implicated as a contributing factor to the development of an obesogenic food environment. How comprehensive, reliable, and valid are the current assessment tools for vending machines to support or refute these claims? A systematic review was conducted to summarize, compare, and evaluate the current methodologies and available tools for vending machine assessment. A total of 24 relevant research studies published between 1981 and 2013 met inclusion criteria for this review. The methodological variables reviewed in this study include assessment tool type, study location, machine accessibility, product availability, healthfulness criteria, portion size, price, product promotion, and quality of scientific practice. There were wide variations in the depth of the assessment methodologies and product healthfulness criteria utilized among the reviewed studies. Of the reviewed studies, 39% evaluated machine accessibility, 91% evaluated product availability, 96% established healthfulness criteria, 70% evaluated portion size, 48% evaluated price, 52% evaluated product promotion, and 22% evaluated the quality of scientific practice. Of all reviewed articles, 87% reached conclusions that provided insight into the healthfulness of vended products and/or vending environment. Product healthfulness criteria and complexity for snack and beverage products was also found to be variable between the reviewed studies. These findings make it difficult to compare results between studies. A universal, valid, and reliable vending machine assessment tool that is comprehensive yet user-friendly is recommended. Copyright © 2015 Elsevier Ltd. All rights reserved.
Measurement of Function Post Hip Fracture: Testing a Comprehensive Measurement Model of Physical Function

PubMed Central

Gruber-Baldini, Ann L.; Hicks, Gregory; Ostir, Glen; Klinedinst, N. Jennifer; Orwig, Denise; Magaziner, Jay

2015-01-01

Background Measurement of physical function post hip fracture has been conceptualized using multiple different measures. Purpose This study tested a comprehensive measurement model of physical function. Design This was a descriptive secondary data analysis including 168 men and 171 women post hip fracture. Methods Using structural equation modeling, a measurement model of physical function which included grip strength, activities of daily living, instrumental activities of daily living and performance was tested for fit at 2 and 12 months post hip fracture and among male and female participants and validity of the measurement model of physical function was evaluated based on how well the model explained physical activity, exercise and social activities post hip fracture. Findings The measurement model of physical function fit the data. The amount of variance the model or individual factors of the model explained varied depending on the activity. Conclusion Decisions about the ideal way in which to measure physical function should be based on outcomes considered and participant Clinical Implications The measurement model of physical function is a reliable and valid method to comprehensively measure physical function across the hip fracture recovery trajectory. Practical but useful assessment of function should be considered and monitored over the recovery trajectory post hip fracture. PMID:26492866
Documentation of pharmaceutical care: Validation of an intervention oriented classification system.

PubMed

Maes, Karen A; Studer, Helene; Berger, Jérôme; Hersberger, Kurt E; Lampert, Markus L

2017-12-01

During the dispensing process, pharmacists may come across technical and clinical issues requiring a pharmaceutical intervention (PI). An intervention-oriented classification system is a helpful tool to document these PIs in a structured manner. Therefore, we developed the PharmDISC classification system (Pharmacists' Documentation of Interventions in Seamless Care). The aim of this study was to evaluate the PharmDISC system in the daily practice environment (in terms of interrater reliability, appropriateness, interpretability, acceptability, feasibility, and validity); to assess its user satisfaction, the descriptive manual, and the online training; and to explore first implementation aspects. Twenty-one pharmacists from different community pharmacies each classified 30 prescriptions requiring a PI with the PharmDISC system on 5 selected days within 5 weeks. Interrater reliability was determined using model PIs and Fleiss's kappa coefficients (κ) were calculated. User satisfaction was assessed by questionnaire with a 4-point Likert scale. The main outcome measures were interrater reliability (κ); appropriateness, interpretability, validity (ratio of completely classified PIs/all PIs); feasibility, and acceptability (user satisfaction and suggestions). The PharmDISC system reached an average substantial agreement (κ = 0.66). Of documented 519 PIs, 430 (82.9%) were completely classified. Most users found the system comprehensive (median user agreement 3 [2/3.25 quartiles]) and practical (3[2.75/3]). The PharmDISC system raised the awareness regarding drug-related problems for most users (n = 16). To facilitate its implementation, an electronic version that automatically connects to the prescription together with a task manager for PIs needing follow-up was suggested. Barriers could be time expenditure and lack of understanding the benefits. Substantial interrater reliability and acceptable user satisfaction indicate that the PharmDISC system is a valid system to document PIs in daily community pharmacy practice. © 2017 John Wiley & Sons, Ltd.
Measuring Nurses' Value, Implementation, and Knowledge of Evidence-Based Practice: Further Psychometric Testing of the Quick-EBP-VIK Survey.

PubMed

Connor, Linda; Paul, Fiona; McCabe, Margaret; Ziniel, Sonja

2017-02-01

The Quick-EBP-VIK is a new instrument for measuring nurses' value, implementation, and knowledge of EBP. Psychometric testing was conducted in two parts. Part 1 describes the tool development and validity testing which resulted in the development of a 25-item survey after receiving ≥0.80 Item-Level Content Validity Index for both clarity and relevance. Part 2 describes psychometric testing was necessary to assess additional types of validity and reliability. The purpose of this paper is to further describe the psychometric testing of the Quick-EBP-VIK survey instrument. This descriptive study was designed to assess test-retest reliability, internal consistency and construct validity via a web-based survey. The survey instrument was e-mailed to all nurses at the study hospital. Nurses who responded to the first survey (Wave 1) received another e-mail invitation to complete the survey instrument again (Wave 2) for the purpose of assessing the test-retest reliability of the instrument. A total of 1,177 deliverable e-mails were sent to all nursing staff at one free standing pediatric hospital with Magnet ® designation in the northeast. A total of 382 nurses returned completed surveys, indicating a 32.5% response rate for Wave 1. A total of 131 nurses responded to Wave 2 indicating a response rate of 34.3%. The intraclass correlation coefficients for the items included in the final instrument ranged from 0.43 to 0.80 and were deemed sufficient. These represent a sufficient intraclass correlation coefficient. The Cronbach's Alpha values for each of the three domains are all higher than 0.7 indicating that the items of each of the measurement dimension are internally consistent. However, the composite reliability of the third domain was slightly lower than 0.7 when using Raykov's Rho. The Quick-EBP-VIK instrument has gone through rigorous comprehensive testing and has demonstrated good psychometric properties. © 2016 Sigma Theta Tau International.
Bioinformatics approach for choosing the correct reference genes when studying gene expression in human keratinocytes.

PubMed

Beer, Lucian; Mlitz, Veronika; Gschwandtner, Maria; Berger, Tanja; Narzt, Marie-Sophie; Gruber, Florian; Brunner, Patrick M; Tschachler, Erwin; Mildner, Michael

2015-10-01

Reverse transcription polymerase chain reaction (qRT-PCR) has become a mainstay in many areas of skin research. To enable quantitative analysis, it is necessary to analyse expression of reference genes (RGs) for normalization of target gene expression. The selection of reliable RGs therefore has an important impact on the experimental outcome. In this study, we aimed to identify and validate the best suited RGs for qRT-PCR in human primary keratinocytes (KCs) over a broad range of experimental conditions using the novel bioinformatics tool 'RefGenes', which is based on a manually curated database of published microarray data. Expression of 6 RGs identified by RefGenes software and 12 commonly used RGs were validated by qRT-PCR. We assessed whether these 18 markers fulfilled the requirements for a valid RG by the comprehensive ranking of four bioinformatics tools and the coefficient of variation (CV). In an overall ranking, we found GUSB to be the most stably expressed RG, whereas the expression values of the commonly used RGs, GAPDH and B2M were significantly affected by varying experimental conditions. Our results identify RefGenes as a powerful tool for the identification of valid RGs and suggest GUSB as the most reliable RG for KCs. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Validity and reliability of instruments aimed at measuring Evidence-Based Practice in Physical Therapy: a systematic review of the literature.

PubMed

Fernández-Domínguez, Juan Carlos; Sesé-Abad, Albert; Morales-Asencio, Jose Miguel; Oliva-Pascual-Vaca, Angel; Salinas-Bueno, Iosune; de Pedro-Gómez, Joan Ernest

2014-12-01

Our goal is to compile and analyse the characteristics - especially validity and reliability - of all the existing international tools that have been used to measure evidence-based clinical practice in physiotherapy. A systematic review conducted with data from exclusively quantitative-type studies synthesized in narrative format. An in-depth search of the literature was conducted in two phases: initial, structured, electronic search of databases and also journals with summarized evidence; followed by a residual-directed search in the bibliographical references of the main articles found in the primary search procedure. The studies included were assigned to members of the research team who acted as peer reviewers. Relevant information was extracted from each of the selected articles using a template that included the general characteristics of the instrument as well as an analysis of the quality of the validation processes carried out, by following the criteria of Terwee. Twenty-four instruments were found to comply with the review screening criteria; however, in all cases, they were found to be limited as regards the 'constructs' included. Besides, they can all be seen to be lacking as regards comprehensiveness associated to the validation process of the psychometric tests used. It seems that what constitutes a rigorously developed assessment instrument for EBP in physical therapy continues to be a challenge. © 2014 John Wiley & Sons, Ltd.
The Anaclitic-Introjective Depression Assessment: Development and preliminary validity of an observer-rated measure.

PubMed

Rost, Felicitas; Luyten, Patrick; Fonagy, Peter

2018-03-01

The two-configurations model developed by Blatt and colleagues offers a comprehensive conceptual and empirical framework for understanding depression. This model suggests that depressed patients struggle, at different developmental levels, with issues related to dependency (anaclitic issues) or self-definition (introjective issues), or a combination of both. This paper reports three studies on the development and preliminary validation of the Anaclitic-Introjective Depression Assessment, an observer-rated assessment tool of impairments in relatedness and self-definition in clinical depression based on the item pool of the Shedler-Westen Assessment Procedure. Study 1 describes the development of the measure using expert consensus rating and Q-methodology. Studies 2 and 3 report the assessment of its psychometric properties, preliminary reliability, and validity in a sample of 128 patients diagnosed with treatment-resistant depression. Four naturally occurring clusters of depressed patients were identified using Q-factor analysis, which, overall, showed meaningful and theoretically expected relationships with anaclitic/introjective prototypes as formulated by experts, as well as with clinical, social, occupational, global, and relational functioning. Taken together, findings reported in this paper provide preliminary evidence for the reliability and validity of the Anaclitic-Introjective Depression Assessment, an observer-rated measure that allows the detection of important nuanced differentiations between and within anaclitic and introjective depression. Copyright © 2017 John Wiley & Sons, Ltd.
Assessing practice-based influences on adolescent psychosocial development in sport: the activity context in youth sport questionnaire.

PubMed

García Bengoechea, Enrique; Sabiston, Catherine M; Wilson, Philip M

2017-01-01

The aim of this study was to provide initial evidence of validity and reliability of scores derived from the Activity Context in Youth Sport Questionnaire (ACYSQ), an instrument designed to offer a comprehensive assessment of the activities adolescents take part in during sport practices. Two studies were designed for the purposes of item development and selection, and to provide evidence of structural and criterion validity of ACYSQ scores, respectively (N = 334; M age = 14.93, SD = 1.76 years). Confirmatory factor analysis (CFA) supported the adequacy of a 20-item ACYSQ measurement model, which was invariant across gender, and comprised the following dimensions: (1) stimulation; (2) usefulness-value; (3) authenticity; (4) repetition-boredom; and (5) ineffectiveness. Internal consistency reliability estimates and composite reliability estimates for ACYSQ subscale scores ranged from 0.72 to 0.91. In regression analyses, stimulation predicted enjoyment and perceived competence, ineffectiveness was significantly associated with perceived competence and authenticity emerged as a predictor of commitment in sport. These findings indicate that the ACYSQ displays adequate psychometric properties and the use of the instrument may be useful for studying selected activity-based features of the practice environment and their motivational consequences in youth sport.
The Swedish translation and cross-cultural adaptation of the Functional Assessment of Chronic Illness Therapy - Cervical Dysplasia (FACIT-CD): linguistic validity and reliability of the Swedish version.

PubMed

Rask, Marie; Oscarsson, Marie; Ludwig, Neil; Swahnberg, Katarina

2017-04-04

Cervical dysplasia is a precancerous condition, which has been shown to create anxiety in women. To be able to investigate these women's health-related quality of life, a disease-specific instrument is required. There does not seem to be a Swedish version of an instrument to screen for this specific disease. Therefore, this study aims to translate and cross-culturally adapt the Functional Assessment of Chronic Illness Therapy - Cervical Dysplasia (FACIT-CD) into a Swedish context and evaluate its linguistic validity and reliability. The Functional Assessment of Chronic Illness Therapy (FACIT) translation methodology was used, which consists of several steps including pilot testing of the FACIT-CD instrument through cognitive debriefing interviews. Ten women diagnosed with cervical dysplasia participated in the cognitive debriefing interviews. The internal consistency reliability of the Swedish FACIT-CD was estimated by Cronbach's alpha coefficient. Homogeneity of the items was evaluated by corrected item-total correlations. The sample consists of 34 women who were diagnosed with cervical dysplasia. The translation and cross-cultural adaptation went smoothly without any problems for the majority of the items. The cognitive debriefing interviews indicated that the Swedish FACIT-CD consists of relevant items, is easy to understand and complete, and has unambiguous and comprehensive response categories. The translation and cross-cultural adaptation resulted in a Swedish FACIT-CD, which is conceptually and semantically equivalent to the English version and linguistically valid. The total scale of the Swedish FACIT-CD exhibited good internal consistency reliability with a Cronbach's alpha coefficient of 0.84, and all of the subscales exhibited acceptable value between 0.71 and 0.81 except the Relationships subscale, which had a value of 0.67. Finally, all but four items exceeded the acceptable level for the corrected item-total correlations of ≥ 0.20. The Swedish FACIT-CD is conceptually and semantically equivalent to the English version and linguistically valid; further, it exhibits good internal consistency reliability.

Translation and Validation of the Persian Version the Boston Carpal Tunnel Syndrome Questionnaire.

PubMed

Hassankhani, Golnaz Ghayyem; Moradi, Ali; Birjandinejad, Ali; Vahedi, Ehsan; Kachooei, Amir R; Ebrahimzadeh, Mohammad H

2018-01-01

Carpal tunnel syndrome (CTS) is recognized as the most common type of neuropathies. Questionnaires are the method of choice for evaluating patients with CTS. Boston Carpal Tunnel Syndrome (BCTS) is one of the most famous questionnaires that evaluate the functional and symptomatic aspects of CTS. This study was performed to evaluate the validity and reliability of the Persian version of BCTS questionnaire. First, both parts of the original questionnaire (Symptom Severity Scale and Functional Status Scale) were translated into Persian by two expert translators. The translated questionnaire was revised after merging and confirmed by an orthopedic hand surgeon. The confirmed questionnaire was interpreted back into the original language (English) to check for any possible content inequality between the original questionnaire and its final translated version. The final Persian questionnaire was answered by 10 patients suffering from CTS to elucidate its comprehensibility; afterwards, it was filled by 142 participants along with the Persian version of the Quick-DASH questionnaire. After 2 to 6 days, the translated questionnaire was refilled by some of the previous patients who had not received any substantial medical treatment during that period. Among all 142 patients, 13.4 % were male and 86.6 % were female. The reliability of the questionnaire was tested using Cronbach's alpha and Intraclass correlation coefficient (ICC). Cronbach's alpha was 0.859 for symptom severity scale (SSS) and 0.878 for functional status scale (FSS). Also, ICCs were calculated as 0.538 for SSS and 0.773 for FSS. In addition, construct validity of SSS and FSS against QuickDASH were 0.641 and 0.701, respectively. Based on our results, the Persian version of the BCTQ is valid and reliable. Level of evidence: II.
Development of a Korean Version of the Perceived Deficits Questionnaire-Depression for Patients with Major Depressive Disorder

PubMed Central

Kim, Jae-Min; Hong, Jin-Pyo; Kim, Sang-Dae; Kang, Hee-Ju; Lee, Yong-Sung

2016-01-01

Objective Cognitive symptoms are an important component of depression and the Perceived Deficits Questionnaire-Depression is one of only a few instruments available for the subjective assessment of cognitive dysfunction in depression. Thus, the present study aimed to validate a Korean version of the PDQ-D (K-PDQ-D) using patients with major depressive disorder (MDD). Methods This study included 128 MDD patients who were assessed at study entry and 86 of these patients were then completed 12 weeks of antidepressant monotherapy. All subjects were assessed with the K-PDQ-D, the Montgomery-Asberg Depression Rating Scale (MADRS), the Sheehan Disability Scale (SDS), the EuroQol-5 dimensions questionnaire (EQ-5D), and the number of sick leave days taken in the previous week. The internal consistency, Guttman’s split-half and test-retest reliabilities, factorial analyses, and concurrent and predictive validities of the K-PDQ-D were investigated. Results The K-PDQ-D exhibited excellent internal consistency and reliabilities, and was composed of four factors with high coefficients of determination. The concurrent validity analyses revealed that the K-PDQ-D scores were significantly correlated with the MADRS, SDS, and EQ-5D scores and the number of sick leave days taken. The K-PDQ-D scores at study entry significantly predicted changes in sick leave days and EQ-5D score from study entry to the 12-week endpoint. Conclusion The newly developed K-PDQ-D is a reliable and valid instrument for the evaluation of subjective cognitive symptoms in MDD patients. The K-PDQ-D may assist in the gathering of unique information regarding subjective cognitive complaints, which is important for the comprehensive evaluation of patients with MDD. PMID:26792037
Adaptation and Validation of the Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR) for Use in Spain.

PubMed

Aguirre-Camacho, Aldo; Stepanous, Jessica; Blanco-Donoso, Luis M; Moreno-Jiménez, Bernardo; Wilburn, Jeanette; González-Saiz, Laura; McKenna, Stephen P

2017-06-01

The Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR) is a patient-reported outcome measure of health-related quality of life and quality of life specific to individuals with pulmonary hypertension (PH). This questionnaire has demonstrated superiority over other instruments assessing similar domains. The objective of the present study was to adapt and validate the Spanish version of the questionnaire. The adaptation consisted of 3 stages: translation from English to Spanish using bilingual and lay panels, cognitive debriefing interviews with patients, and assessment of psychometric properties by means of a postal validation survey. The translation panels produced a version of the CAMPHOR that was considered suitable for use by Spanish PH patients. The relevance, comprehensiveness, and acceptability of this version were confirmed in interviews with PH patients. Finally, the validation survey (n = 70) revealed that the 3 CAMPHOR scales (Symptoms, Activities, and Quality of life) showed strong psychometric properties. The internal consistency (Cronbach α) coefficients of the scales were above 0.89, and the test-retest reliability was above 0.87. The convergent and known group validity of the CAMPHOR scales was also demonstrated. The Spanish version of the CAMPHOR is a valid and reliable instrument for the assessment of health-related quality of life and quality of life in Spanish PH patients. Therefore, it is recommended for use in future research and clinical practice in the Spanish population of PH patients. Copyright © 2016 Sociedad Española de Cardiología. Published by Elsevier España, S.L.U. All rights reserved.
Development and validation of a disease-specific scale to assess psychosocial well-being of patients living with unruptured intracranial aneurysm.

PubMed

Fujishima-Hachiya, Asami; Inoue, Tomoko

2012-12-01

Although the detection rate for unruptured intracranial aneurysm (UIA) has improved since the 1990s, the quality of life and psychosocial status of patients living with UIA have been negatively affected. However, a comprehensive assessment tool for UIA patients is still awaited. This study aimed to develop and validate a disease-specific scale to assess UIA patients' psychosocial well-being in their daily lives. On the basis of previous qualitative research, 52 items on a six-dimension scale were generated. After a pilot study, statistical analysis was conducted to examine construct validity-including convergent validity, discriminant and known-group validity, and internal reliability. Between 2010 and 2011, 124 patients across three hospitals in Japan were tested using a tentative scale. As a result of exploratory factor analysis, we identified 25 items based on five conceptually derived dimensions (psychological stability, trust in healthcare resources, satisfaction with the decision-making process, positive perception of self-management, and confidence in UIA knowledge) as a final psychosocial well-being scale for UIA patients (UIA-PW scale). Cronbach's alpha coefficients for each subscale ranged between .76 and .90, with .83 for the total score, which indicated satisfactory internal consistency. The total score for the UIA-PW scale correlated significantly with the existing quality of life and mental health scales, but it is important to note that psychological stability and positive perception of self-management were negatively correlated. Although additional investigation is needed, the UIA-PW scale shows reasonable validity and reliability in assessing psychosocial well-being of patients living with UIA.
The Impact of Gender, Socioeconomic Status and Home Language on Primary School Children’s Reading Comprehension in KwaZulu-Natal

PubMed Central

Völkel, Gabriela; Seabi, Joseph; Cockcroft, Kate; Goldschagg, Paul

2016-01-01

The current study constituted part of a larger, longitudinal, South African-based study, namely, The Road and Aircraft Noise Exposure on Children’s Cognition and Health (RANCH—South Africa). In the context of a multicultural South Africa and varying demographic variables thereof, this study sought to investigate and describe the effects of gender, socioeconomic status and home language on primary school children’s reading comprehension in KwaZulu-Natal. In total, 834 learners across 5 public schools in the KwaZulu-Natal province participated in the study. A biographical questionnaire was used to obtain biographical data relevant to this study, and the Suffolk Reading Scale 2 (SRS2) was used to obtain reading comprehension scores. The findings revealed that there was no statistical difference between males and females on reading comprehension scores. In terms of socioeconomic status (SES), learners from a low socioeconomic background performed significantly better than those from a high socioeconomic background. English as a First Language (EL1) speakers had a higher mean reading comprehension score than speakers who spoke English as an Additional Language (EAL). Reading comprehension is indeed affected by a variety of variables, most notably that of language proficiency. The tool to measure reading comprehension needs to be standardized and administered in more than one language, which will ensure increased reliability and validity of reading comprehension scores. PMID:26999169
The Impact of Gender, Socioeconomic Status and Home Language on Primary School Children's Reading Comprehension in KwaZulu-Natal.

PubMed

Völkel, Gabriela; Seabi, Joseph; Cockcroft, Kate; Goldschagg, Paul

2016-03-15

The current study constituted part of a larger, longitudinal, South African-based study, namely, The Road and Aircraft Noise Exposure on Children's Cognition and Health (RANCH-South Africa). In the context of a multicultural South Africa and varying demographic variables thereof, this study sought to investigate and describe the effects of gender, socioeconomic status and home language on primary school children's reading comprehension in KwaZulu-Natal. In total, 834 learners across 5 public schools in the KwaZulu-Natal province participated in the study. A biographical questionnaire was used to obtain biographical data relevant to this study, and the Suffolk Reading Scale 2 (SRS2) was used to obtain reading comprehension scores. The findings revealed that there was no statistical difference between males and females on reading comprehension scores. In terms of socioeconomic status (SES), learners from a low socioeconomic background performed significantly better than those from a high socioeconomic background. English as a First Language (EL1) speakers had a higher mean reading comprehension score than speakers who spoke English as an Additional Language (EAL). Reading comprehension is indeed affected by a variety of variables, most notably that of language proficiency. The tool to measure reading comprehension needs to be standardized and administered in more than one language, which will ensure increased reliability and validity of reading comprehension scores.
Clinimetrics of ultrasound pathologies in osteoarthritis: systematic literature review and meta-analysis.

PubMed

Oo, W M; Linklater, J M; Daniel, M; Saarakkala, S; Samuels, J; Conaghan, P G; Keen, H I; Deveza, L A; Hunter, D J

2018-05-01

The aims of this study were to systematically review clinimetrics of commonly assessed ultrasound pathologies in knee, hip and hand osteoarthritis (OA), and to conduct a meta-analysis for each clinimetric. Medline, Embase, and Cochrane Library databases were searched from their inceptions to September 2016. According to the Outcome Measures in Rheumatology (OMERACT) Instrument Selection Algorithm, data extraction focused on ultrasound technical features and performance metrics. Methodological quality was assessed with modified 19-item Downs and Black score and 11-item Quality Appraisal of Diagnostic Reliability (QAREL) score. Separate meta-analyses were performed for clinimetrics: (1) inter-rater/intra-rater reliability; (2) construct validity; (3) criteria validity; and (4) internal/external responsiveness. Statistical Package for the Social Sciences (SPSS), Excel and Comprehensive Meta-analysis were used. Our search identified 1126 records; of these, 100 were eligible, including a total of 8542 patients and 32,373 joints. The average Downs and Black score was 13.01, and average QAREL was 5.93. The stratified meta-analysis was performed only for knee OA, which demonstrated moderate to substantial reliability [minimum kappa > 0.44(0.15,0.74), minimum intraclass correlation coefficient (ICC) > 0.82(0.73-0.89)], weak construct validity against pain (r = 0.12 to 0.27), function (r = 0.15 to 0.23), and blood biomarkers (r = 0.01 to 0.21), but weak to strong correlation with plain radiography (r = 0.13 to 0.60), strong association with Magnetic Resonance Imaging (MRI) [minimum r = 0.60(0.52,0.67)] and strong discrimination against symptomatic patients (OR = 3.08 to 7.46). There was strong criterion validity against cartilage histology [r = 0.66(-0.05,0.93)], and small to moderate internal [standardized mean difference(SMD) = 0.20 to 0.58] and external (r = 0.35 to 0.43) responsiveness to interventions. Ultrasound demonstrated strong criterion validity with cartilage histology, poor to strong correlation with patient findings and MRI, moderate reliability, and low responsiveness to interventions. CRD42016039954. Copyright © 2018 Osteoarthritis Research Society International. All rights reserved.
CANFOR Portuguese version: validation study.

PubMed

Talina, Miguel; Thomas, Stuart; Cardoso, Ana; Aguiar, Pedro; Caldas de Almeida, Jose M; Xavier, Miguel

2013-05-30

The increase in prisoner population is a troublesome reality in several regions of the world. Along with this growth there is increasing evidence that prisoners have a higher proportion of mental illnesses and suicide than the general population. In order to implement strategies that address criminal recidivism and the health and social status of prisoners, particularly in mental disordered offenders, it is necessary to assess their care needs in a comprehensive, but individual perspective. This assessment must include potential harmful areas like comorbid personality disorder, substance misuse and offending behaviours. The Camberwell Assessment of Need - Forensic Version (CANFOR) has proved to be a reliable tool designed to accomplish such aims. The present study aimed to validate the CANFOR Portuguese version. The translation, adaptation to the Portuguese context, back-translation and revision followed the usual procedures. The sample comprised all detainees receiving psychiatric care in four forensic facilities, over a one year period. A total of 143 subjects, and respective case manager, were selected. The forensic facilities were chosen by convenience: one prison hospital psychiatric ward (n=68; 47.6%), one male (n=24; 16.8%) and one female (n=22; 15.4%) psychiatric clinic and one civil security ward (n=29; 20.3%), all located nearby Lisbon. Basic descriptive statistics and Kappa weighted coefficients were calculated for the inter-rater and the test-retest reliability studies. The convergent validity was evaluated using the Global Assessment of Functioning and the Brief Psychiatric Rating Scale scores. The majority of the participants were male and single, with short school attendance, and accused of a crime involving violence against persons. The most frequent diagnosis was major depression (56.1%) and almost half presented positive suicide risk. The reliability study showed average Kappa weighted coefficients of 0.884 and 0.445 for inter-rater and test-retest agreement, respectively. The convergent validity study presented highly significant correlations between unmet needs scores, GAF and BPRS scores. The CANFOR Portuguese version revealed similar psychometric properties to the original English version. Moreover, the results of the reliability and validity studies indicate that the tool is appropriate for individual care needs assessment and as a guide for the mental health and social interventions in forensic psychiatric services.
Construct validity of the Iowa Gambling Task.

PubMed

Buelow, Melissa T; Suhr, Julie A

2009-03-01

The Iowa Gambling Task (IGT) was created to assess real-world decision making in a laboratory setting and has been applied to various clinical populations (i.e., substance abuse, schizophrenia, pathological gamblers) outside those with orbitofrontal cortex damage, for whom it was originally developed. The current review provides a critical examination of lesion, functional neuroimaging, developmental, and clinical studies in order to examine the construct validity of the IGT. The preponderance of evidence provides support for the use of the IGT to detect decision making deficits in clinical populations, in the context of a more comprehensive evaluation. The review includes a discussion of three critical issues affecting the validity of the IGT, as it has recently become available as a clinical instrument: the lack of a concise definition as to what aspect of decision making the IGT measures, the lack of data regarding reliability of the IGT, and the influence of personality and state mood on IGT performance.
[The Maugeri Stress Index: a questionnaire to assess work-related psychological stress].

PubMed

Giorgi, Ines; Baiardi, Paola; Tringali, Salvatore; Candura, Stefano Massimo; Gardinali, Francesco; Grignani, Elena; Bertolotti, Giorgio; Imbriani, Marcello

2011-01-01

The European directives concerning the evaluation of work-related stress were absorbed into Italian law by means of Legislative Decree No. 81 of 9 April 2008. To develop a new questionnaire to assess the impact of work-related psychological distress and to validate it by testing its factorial structure, its content, its construct and discriminant validity. After critically reviewing the literature, we generated an initial item set to identify the items to be used in a preliminary version of the questionnaire, and then used a focus group to test the comprehensibility of the items. The questionnaire was administered to 329 subjects working in state and private organisation and a small sample of 29 subjects complaining of vexation at work. The Maugeri Stress Index (MSI) is reliable (Cronbach alpha: 0.93). Factorial analysis indicated five factors: Well-being, Adaptation, Support, Irritability and Avoidance. The total and subscale scores were significantly different when comparing subjects with and without vexation at work. The MSI has a multi-factorial structure, good internal reliability and sufficient discriminant power.
Developing a Tool for Measuring the Decision-Making Competence of Older Adults

PubMed Central

Finucane, Melissa L.; Gullion, Christina M.

2010-01-01

The authors evaluated the reliability and validity of a tool for measuring older adults’ decision-making competence (DMC). Two-hundred-five younger adults (25-45 years), 208 young-older adults (65-74 years), and 198 old-older adults (75-97 years) made judgments and decisions related to health, finance, and nutrition. Reliable indices of comprehension, dimension weighting, and cognitive reflection were developed. Unlike previous research, the authors were able to compare old-older with young-older adults’ performance. As hypothesized, old-older adults performed more poorly than young-older adults; both groups of older adults performed more poorly than younger adults. Hierarchical regression analyses showed that a large amount of variance in decision performance across age groups (including mean trends) could be accounted for by social variables, health measures, basic cognitive skills, attitudinal measures, and numeracy. Structural equation modeling revealed significant pathways from three exogenous latent factors (crystallized intelligence, other cognitive abilities, and age) to the endogenous DMC latent factor. Further research is needed to validate the meaning of performance on these tasks for real-life decision making. PMID:20545413
Radiation Measurements Performed with Active Detectors Relevant for Human Space Exploration

PubMed Central

Narici, Livio; Berger, Thomas; Matthiä, Daniel; Reitz, Günther

2015-01-01

A reliable radiation risk assessment in space is a mandatory step for the development of countermeasures and long-duration mission planning in human spaceflight. Research in radiobiology provides information about possible risks linked to radiation. In addition, for a meaningful risk evaluation, the radiation exposure has to be assessed to a sufficient level of accuracy. Consequently, both the radiation models predicting the risks and the measurements used to validate such models must have an equivalent precision. Corresponding measurements can be performed both with passive and active devices. The former is easier to handle, cheaper, lighter, and smaller but they measure neither the time dependence of the radiation environment nor some of the details useful for a comprehensive radiation risk assessment. Active detectors provide most of these details and have been extensively used in the International Space Station. To easily access such an amount of data, a single point access is becoming essential. This review presents an ongoing work on the development of a tool that allows obtaining information about all relevant measurements performed with active detectors providing reliable inputs for radiation model validation. PMID:26697408
Development and evaluation of a quality score for abstracts

PubMed Central

Timmer, Antje; Sutherland, Lloyd R; Hilsden, Robert J

2003-01-01

Background The evaluation of abstracts for scientific meetings has been shown to suffer from poor inter observer reliability. A measure was developed to assess the formal quality of abstract submissions in a standardized way. Methods Item selection was based on scoring systems for full reports, taking into account published guidelines for structured abstracts. Interrater agreement was examined using a random sample of submissions to the American Gastroenterological Association, stratified for research type (n = 100, 1992–1995). For construct validity, the association of formal quality with acceptance for presentation was examined. A questionnaire to expert reviewers evaluated sensibility items, such as ease of use and comprehensiveness. Results The index comprised 19 items. The summary quality scores showed good interrater agreement (intra class coefficient 0.60 – 0.81). Good abstract quality was associated with abstract acceptance for presentation at the meeting. The instrument was found to be acceptable by expert reviewers. Conclusion A quality index was developed for the evaluation of scientific meeting abstracts which was shown to be reliable, valid and useful. PMID:12581457
A comprehensive clinical assessment tool to inform policy and practice: applications of the minimum data set.

PubMed

Mor, Vincent

2004-04-01

The Minimum Data Set (MDS) for nursing home (NH) resident assessment, designed to assess elders functional status and care needs, exemplifies how the information needs of clinical practice are congruent with those of research. Building on a review of the published literature, this article describes the development of the MDS, its reliability and validity testing, as well as the variety of different policy and research uses to which it has been applied. Interrater reliability of items and internal consistency of MDS summary scales is generally good to excellent. Validation studies reveal good correspondence to research quality instruments for cognition, activities of daily living, and diagnoses with more variable results for vision, pain, mood, and behavior scales. To date, no consistent evidence suggests that applications of MDS data for case-mix reimbursement and quality indicator monitoring systematically bias the data. Although facility variation in data quality could compromise some applications, creation of the MDS as a clinical tool for care planning provides an example of how assessment tools with clinical use can be used in administrative databases for research and policy applications.
Radiation Measurements Performed with Active Detectors Relevant for Human Space Exploration.

PubMed

Narici, Livio; Berger, Thomas; Matthiä, Daniel; Reitz, Günther

2015-01-01

A reliable radiation risk assessment in space is a mandatory step for the development of countermeasures and long-duration mission planning in human spaceflight. Research in radiobiology provides information about possible risks linked to radiation. In addition, for a meaningful risk evaluation, the radiation exposure has to be assessed to a sufficient level of accuracy. Consequently, both the radiation models predicting the risks and the measurements used to validate such models must have an equivalent precision. Corresponding measurements can be performed both with passive and active devices. The former is easier to handle, cheaper, lighter, and smaller but they measure neither the time dependence of the radiation environment nor some of the details useful for a comprehensive radiation risk assessment. Active detectors provide most of these details and have been extensively used in the International Space Station. To easily access such an amount of data, a single point access is becoming essential. This review presents an ongoing work on the development of a tool that allows obtaining information about all relevant measurements performed with active detectors providing reliable inputs for radiation model validation.
Comprehensive neuromechanical assessment in stroke patients: reliability and responsiveness of a protocol to measure neural and non-neural wrist properties.

PubMed

van der Krogt, Hanneke; Klomp, Asbjørn; de Groot, Jurriaan H; de Vlugt, Erwin; van der Helm, Frans Ct; Meskers, Carel Gm; Arendzen, J Hans

2015-03-13

Understanding movement disorder after stroke and providing targeted treatment for post stroke patients requires valid and reliable identification of biomechanical (passive) and neural (active and reflexive) contributors. Aim of this study was to assess test-retest reliability of passive, active and reflexive parameters and to determine clinical responsiveness in a cohort of stroke patients with upper extremity impairments and healthy volunteers. Thirty-two community-residing chronic stroke patients with an impairment of an upper limb and fourteen healthy volunteers were assessed with a comprehensive neuromechanical assessment protocol consisting of active and passive tasks and different stretch reflex-eliciting measuring velocities, using a haptic manipulator and surface electromyography of wrist flexor and extensor muscles (Netherlands Trial Registry number NTR1424). Intraclass correlation coefficients (ICC) and Standard Error of Measurement were calculated to establish relative and absolute test-retest reliability of passive, active and reflexive parameters. Clinical responsiveness was tested with Kruskal Wallis test for differences between groups. ICC of passive parameters were fair to excellent (0.45 to 0.91). ICC of active parameters were excellent (0.88-0.99). ICC of reflexive parameters were fair to good (0.50-0.74). Only the reflexive loop time of the extensor muscles performed poor (ICC 0.18). Significant differences between chronic stroke patients and healthy volunteers were found in ten out of fourteen parameters. Passive, active and reflexive parameters can be assessed with high reliability in post-stroke patients. Parameters were responsive to clinical status. The next step is longitudinal measurement of passive, active and reflexive parameters to establish their predictive value for functional outcome after stroke.
Validation of heart and lung teleauscultation on an Internet-based system.

PubMed

Fragasso, Gabriele; De Benedictis, Marialuisa; Palloshi, Altin; Moltrasio, Marco; Cappelletti, Alberto; Carlino, Mauro; Marchisi, Angelo; Pala, Mariagrazia; Alfieri, Ottavio; Margonato, Alberto

2003-11-01

The feasibility and accuracy of an Internet-based system for teleauscultation was evaluated in 103 cardiac patients, who were auscultated by the same cardiologist with a conventional stethoscope and with an Internet-based method, using an electronic stethoscope and transmitting heart and lung sounds between computer work stations. In 92% of patients, the results of electronic and acoustic auscultation coincided, indicating that teleauscultation may be considered a reliable method for assessing cardiac patients and could, therefore, be adopted in the context of comprehensive telecare programs.
Development and Validation of the Faceted Inventory of the Five-Factor Model (FI-FFM).

PubMed

Watson, David; Nus, Ericka; Wu, Kevin D

2017-06-01

The Faceted Inventory of the Five-Factor Model (FI-FFM) is a comprehensive hierarchical measure of personality. The FI-FFM was created across five phases of scale development. It includes five facets apiece for neuroticism, extraversion, and conscientiousness; four facets within agreeableness; and three facets for openness. We present reliability and validity data obtained from three samples. The FI-FFM scales are internally consistent and highly stable over 2 weeks (retest rs ranged from .64 to .82, median r = .77). They show strong convergent and discriminant validity vis-à-vis the NEO, the Big Five Inventory, and the Personality Inventory for DSM-5. Moreover, self-ratings on the scales show moderate to strong agreement with corresponding ratings made by informants ( rs ranged from .26 to .66, median r = .42). Finally, in joint analyses with the NEO Personality Inventory-3, the FI-FFM neuroticism facet scales display significant incremental validity in predicting indicators of internalizing psychopathology.
Knee Injury and Osteoarthritis Outcome Score (KOOS): systematic review and meta-analysis of measurement properties.

PubMed

Collins, N J; Prinsen, C A C; Christensen, R; Bartels, E M; Terwee, C B; Roos, E M

2016-08-01

To conduct a systematic review and meta-analysis to synthesize evidence regarding measurement properties of the Knee injury and Osteoarthritis Outcome Score (KOOS). A comprehensive literature search identified 37 eligible papers evaluating KOOS measurement properties in participants with knee injuries and/or osteoarthritis (OA). Methodological quality was evaluated using the COSMIN checklist. Where possible, meta-analysis of extracted data was conducted for all studies and stratified by age and knee condition; otherwise narrative synthesis was performed. KOOS has adequate internal consistency, test-retest reliability and construct validity in young and old adults with knee injuries and/or OA. The ADL subscale has better content validity for older patients and Sport/Rec for younger patients with knee injuries, while the Pain subscale is more relevant for painful knee conditions. The five-factor structure of the original KOOS is unclear. There is some evidence that the KOOS subscales demonstrate sufficient unidimensionality, but this requires confirmation. Although measurement error requires further evaluation, the minimal detectable change for KOOS subscales ranges from 14.3 to 19.6 for younger individuals, and ≥20 for older individuals. Evidence of responsiveness comes from larger effect sizes following surgical (especially total knee replacement) than non-surgical interventions. KOOS demonstrates adequate content validity, internal consistency, test-retest reliability, construct validity and responsiveness for age- and condition-relevant subscales. Structural validity, cross-cultural validity and measurement error require further evaluation, as well as construct validity of KOOS Physical function Short form. Suggested order of subscales for different knee conditions can be applied in hierarchical testing of endpoints in clinical trials. PROSPERO (CRD42011001603). Copyright © 2016 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.
Validation of a self-reported HIV symptoms list: the ISS-HIV symptoms scale.

PubMed

Bucciardini, Raffaella; Pugliese, Katherina; Francisci, Daniela; Costantini, Andrea; Schiaroli, Elisabetta; Cognigni, Miriam; Tontini, Chiara; Lucattini, Stefano; Fucili, Luca; Di Gregorio, Massimiliano; Mirra, Marco; Fragola, Vincenzo; Pompili, Sara; Murri, Rita; Vella, Stefano

2016-01-01

To describe the development and the psychometric properties of the Istituto Superiore di Sanità-HIV symptoms scale (lSS-HIV symptoms scale). The ISS-HIV symptom scale was developed by an Italian working team including researchers, physicians and people living with HIV. The development process went through the following steps: (1) review of HIV/AIDS literature; (2) focus group; (3) pre-test analysis; (4) scale validation. The 22 symptoms of HIV-ISS symptoms scale were clustered in five factors: pain/general discomfort (7 items); depression/anxiety (4 items); emotional reaction/psychological distress (5 items); gastrointestinal discomfort (4 items); sexual discomfort (2 items). The internal consistence reliability was for all factors within the minimum accepted standard of 0.70. The results of this study provide a preliminary evidence of the reliability and validity of the ISS-HIV symptoms scale. In the new era where HIV infection has been transformed into a chronic diseases and patients are experiencing a complex range of symptoms, the ISS-HIV symptoms scale may represent an useful tool for a comprehensive symptom assessment with the advantage of being easy to fill out by patients and potentially attractive to physicians mainly because it is easy to understand and requires short time to interpret the results.

Assessing burden in families of critical care patients.

PubMed

Kentish-Barnes, Nancy; Lemiale, Virginie; Chaize, Marine; Pochard, Frédéric; Azoulay, Elie

2009-10-01

To provide critical care clinicians with information on validated instruments for assessing burden in families of critical care patients. PubMed (1979-2009). We included all quantitative studies that used a validated instrument to evaluate the prevalence of, and risk factors for, burden on families. We extracted the descriptions of the instruments used and the main results. Family burden after critical illness can be detected reliably and requires preventive strategies and specific treatments. Using simple face-to-face interviews, intensivists can learn to detect poor comprehension and its determinants. Instruments for detecting symptoms of anxiety, depression, or stress can be used reliably even by physicians with no psychiatric training. For some symptoms, the evaluation should take place at a distance from intensive care unit discharge or death. Experience with families of patients who died in the intensive care unit and data from the literature have prompted studies of bereaved family members and the development of interventions aimed at decreasing guilt and preventing complicated grief. We believe that burden on families should be assessed routinely. In clinical studies, using markers for burden measured by validated tools may provide further evidence that effective communication and efforts to detect and to prevent symptoms of stress, anxiety, or depression provide valuable benefits to families.
Basic psychometric properties of the transfer assessment instrument (version 3.0).

PubMed

Tsai, Chung-Ying; Rice, Laura A; Hoelmer, Claire; Boninger, Michael L; Koontz, Alicia M

2013-12-01

To refine the Transfer Assessment Instrument (TAI 2.0), develop a training program for the TAI, and analyze the basic psychometric properties of the TAI 3.0, including reliability, standard error of measurement (SEM), minimal detectable change (MDC), and construct validity. Repeated measures. A winter sports clinic for disabled veterans. Wheelchair users (N=41) who perform sitting-pivot or standing-pivot transfers. Not applicable. TAI version 3.0, intraclass correlation coefficients, SEMs, and MDCs for reliable measurement of raters' responses. Spearman correlation coefficient, 1-way analysis of variance, and independent t tests to evaluate construct validity. TAI 3.0 had acceptable to high levels of reliability (range, .74-.88). The SEMs for part 1, part 2, and final scores ranged from .45 to .75. The MDC was 1.5 points on the 10-point scale for the final score. There were weak correlations (ρ range, -.13 to .25; P>.11) between TAI final scores and subjects' characteristics (eg, sex, body mass index, age, type of disability, length of wheelchair use, grip and elbow strength, sitting balance). With comprehensive training, the refined TAI 3.0 yields high reliability among raters of different clinical backgrounds and experience. TAI 3.0 was unbiased toward certain physical characteristics that may influence transfer. TAI fills a void in the field by providing a quantitative measurement of transfers and a tool that can be used to detect problems and guide transfer training. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
[Process and key points of clinical literature evaluation of post-marketing traditional Chinese medicine].

PubMed

Liu, Huan; Xie, Yanming

2011-10-01

The clinical literature evaluation of the post-marketing traditional Chinese medicine is a comprehensive evaluation by the comprehensive gain, analysis of the drug, literature of drug efficacy, safety, economy, based on the literature evidence and is part of the evaluation of evidence-based medicine. The literature evaluation in the post-marketing Chinese medicine clinical evaluation is in the foundation and the key position. Through the literature evaluation, it can fully grasp the information, grasp listed drug variety of traditional Chinese medicines second development orientation, make clear further clinical indications, perfect the medicines, etc. This paper discusses the main steps and emphasis of the clinical literature evaluation. Emphasizing security literature evaluation should attach importance to the security of a comprehensive collection drug information. Safety assessment should notice traditional Chinese medicine validity evaluation in improving syndrome, improveing the living quality of patients with special advantage. The economics literature evaluation should pay attention to reliability, sensitivity and practicability of the conclusion.
Development and initial cohort validation of the Arthritis Research UK Musculoskeletal Health Questionnaire (MSK-HQ) for use across musculoskeletal care pathways.

PubMed

Hill, Jonathan C; Kang, Sujin; Benedetto, Elena; Myers, Helen; Blackburn, Steven; Smith, Stephanie; Dunn, Kate M; Hay, Elaine; Rees, Jonathan; Beard, David; Glyn-Jones, Sion; Barker, Karen; Ellis, Benjamin; Fitzpatrick, Ray; Price, Andrew

2016-08-05

Current musculoskeletal outcome tools are fragmented across different healthcare settings and conditions. Our objectives were to develop and validate a single musculoskeletal outcome measure for use throughout the pathway and patients with different musculoskeletal conditions: the Arthritis Research UK Musculoskeletal Health Questionnaire (MSK-HQ). A consensus workshop with stakeholders from across the musculoskeletal community, workshops and individual interviews with a broad mix of musculoskeletal patients identified and prioritised outcomes for MSK-HQ inclusion. Initial psychometric validation was conducted in four cohorts from community physiotherapy, and secondary care orthopaedic hip, knee and shoulder clinics. Stakeholders (n=29) included primary care, physiotherapy, orthopaedic and rheumatology patients (n=8); general practitioners, physiotherapists, orthopaedists, rheumatologists and pain specialists (n=7), patient and professional national body representatives (n=10), and researchers (n=4). The four validation cohorts included 570 participants (n=210 physiotherapy, n=150 hip, n=150 knee, n=60 shoulder patients). Outcomes included the MSK-HQ's acceptability, feasibility, comprehension, readability and responder burden. The validation cohort outcomes were the MSK-HQ's completion rate, test-retest reliability and convergent validity with reference standards (EQ-5D-5L, Oxford Hip, Knee, Shoulder Scores, and the Keele MSK-PROM). Musculoskeletal domains prioritised were pain severity, physical function, work interference, social interference, sleep, fatigue, emotional health, physical activity, independence, understanding, confidence to self-manage and overall impact. Patients reported MSK-HQ items to be 'highly relevant' and 'easy to understand'. Completion rates were high (94.2%), with scores normally distributed, and no floor/ceiling effects. Test-retest reliability was excellent, and convergent validity was strong (correlations 0.81-0.88). A new musculoskeletal outcome measure has been developed through a coproduction process with patients to capture prioritised outcomes for use throughout the pathway and with different musculoskeletal conditions. Four validation cohorts found that the MSK-HQ had high completion rates, excellent test-retest reliability and strong convergent validity with reference standards. Further validation studies are ongoing, including a cohort with rheumatoid/inflammatory arthritis. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Initial development and psychometric testing of an instrument to measure the quality of children's end-of-life care.

PubMed

Widger, Kimberley; Tourangeau, Ann E; Steele, Rose; Streiner, David L

2015-01-01

The field of pediatric palliative care is hindered by the lack of a well-defined, reliable, and valid method for measuring the quality of end-of-life care. The study purpose was to develop and test an instrument to measure mothers' perspectives on the quality of care received before, at the time of, and following a child's death. In Phase 1, key components of quality end-of-life care for children were synthesized through a comprehensive review of research literature. These key components were validated in Phase 2 and then extended through focus groups with bereaved parents. In Phase 3, items were developed to assess structures, processes, and outcomes of quality end-of-life care then tested for content and face validity with health professionals. Cognitive testing was conducted through interviews with bereaved parents. In Phase 4, bereaved mothers were recruited through 10 children's hospitals/hospices in Canada to complete the instrument, and psychometric testing was conducted. Following review of 67 manuscripts and 3 focus groups with 10 parents, 141 items were initially developed. The overall content validity index for these items was 0.84 as rated by 7 health professionals. Based on feedback from health professionals and cognitive testing with 6 parents, a 144-item instrument was finalized for further testing. In Phase 4, 128 mothers completed the instrument, 31 of whom completed it twice. Test-retest reliability, internal consistency, and construct validity were demonstrated for six subscales: Connect With Families, Involve Parents, Share Information With Parents, Share Information Among Health Professionals, Support Parents, and Provide Care at Death. Additional items with content validity were grouped in four domains: Support the Child, Support Siblings, Provide Bereavement Follow-up, and Structures of Care. Forty-eight items were deleted through psychometric testing, leaving a 95-item instrument. There is good initial evidence for the reliability and validity of this new quality of end-of-life care instrument as a mechanism for evaluative feedback to health professionals, health systems, and policy makers to improve children's end-of-life care.
Factor analysis methods and validity evidence: A systematic review of instrument development across the continuum of medical education

NASA Astrophysics Data System (ADS)

Wetzel, Angela Payne

Previous systematic reviews indicate a lack of reporting of reliability and validity evidence in subsets of the medical education literature. Psychology and general education reviews of factor analysis also indicate gaps between current and best practices; yet, a comprehensive review of exploratory factor analysis in instrument development across the continuum of medical education had not been previously identified. Therefore, the purpose for this study was critical review of instrument development articles employing exploratory factor or principal component analysis published in medical education (2006--2010) to describe and assess the reporting of methods and validity evidence based on the Standards for Educational and Psychological Testing and factor analysis best practices. Data extraction of 64 articles measuring a variety of constructs that have been published throughout the peer-reviewed medical education literature indicate significant errors in the translation of exploratory factor analysis best practices to current practice. Further, techniques for establishing validity evidence tend to derive from a limited scope of methods including reliability statistics to support internal structure and support for test content. Instruments reviewed for this study lacked supporting evidence based on relationships with other variables and response process, and evidence based on consequences of testing was not evident. Findings suggest a need for further professional development within the medical education researcher community related to (1) appropriate factor analysis methodology and reporting and (2) the importance of pursuing multiple sources of reliability and validity evidence to construct a well-supported argument for the inferences made from the instrument. Medical education researchers and educators should be cautious in adopting instruments from the literature and carefully review available evidence. Finally, editors and reviewers are encouraged to recognize this gap in best practices and subsequently to promote instrument development research that is more consistent through the peer-review process.
Are Validity and Reliability "Relevant" in Qualitative Evaluation Research?

ERIC Educational Resources Information Center

Goodwin, Laura D.; Goodwin, William L.

1984-01-01

The views of prominant qualitative methodologists on the appropriateness of validity and reliability estimation for the measurement strategies employed in qualitative evaluations are summarized. A case is made for the relevance of validity and reliability estimation. Definitions of validity and reliability for qualitative measurement are presented…
Quality of care assessment in geriatric evaluation and management units: construction of a chart review tool for a tracer condition.

PubMed

Kergoat, Marie-Jeanne; Leclerc, Bernard-Simon; Leduc, Nicole; Latour, Judith; Berg, Katherine; Bolduc, Aline

2009-07-29

The number of elderly people requiring hospital care is growing, so, quality and assessment of care for elders are emerging and complex areas of research. Very few validated and reliable instruments exist for the assessment of quality of acute care in this field. This study's objective was to create such a tool for Geriatric Evaluation and Management Units (GEMUs). The methodology involved a reliability and feasibility study of a retrospective chart review on 934 older inpatients admitted in 49 GEMUs during the year 2002-2003 for fall-related trauma as a tracer condition. Pertinent indicators for a chart abstraction tool, the Geriatric Care Tool (GCT), were developed and validated according to five dimensions: access to care, comprehensiveness, continuity of care, patient-centred care and appropriateness. Consensus methods were used to develop the content. Participants were experts representing eight main health care professions involved in GEMUs from 19 different sites. Items associated with high quality of care at each step of the multidisciplinary management of patients admitted due to falls were identified. The GCT was tested for intra- and inter-rater reliability using 30 medical charts reviewed by each of three independent and blinded trained nurses. Kappa and agreement measures between pairs of chart reviewers were computed on an item-by-item basis. Three quarters of 169 items identifying the process of care, from the case history to discharge planning, demonstrated good agreement (kappa greater than 0.40 and agreement over 70%). Indicators for the appropriateness of care showed less reliability. Content validity and reliability results, as well as the feasibility of the process, suggest that the chart abstraction tool can gather standardized and pertinent clinical information for further evaluating quality of care in GEMU using admission due to falls as a tracer condition. However, the GCT should be evaluated in other models of acute geriatric units and new strategies should be developed to improve reliability of peer assessments in characterizing the quality of care for elderly patients with complex conditions.
Reliability and validity in a nutshell.

PubMed

Bannigan, Katrina; Watson, Roger

2009-12-01

To explore and explain the different concepts of reliability and validity as they are related to measurement instruments in social science and health care. There are different concepts contained in the terms reliability and validity and these are often explained poorly and there is often confusion between them. To develop some clarity about reliability and validity a conceptual framework was built based on the existing literature. The concepts of reliability, validity and utility are explored and explained. Reliability contains the concepts of internal consistency and stability and equivalence. Validity contains the concepts of content, face, criterion, concurrent, predictive, construct, convergent (and divergent), factorial and discriminant. In addition, for clinical practice and research, it is essential to establish the utility of a measurement instrument. To use measurement instruments appropriately in clinical practice, the extent to which they are reliable, valid and usable must be established.
Third Molars on the Internet: A Guide for Assessing Information Quality and Readability

PubMed Central

Brennan, David; Sambrook, Paul; Armfield, Jason

2015-01-01

Background Directing patients suffering from third molars (TMs) problems to high-quality online information is not only medically important, but also could enable better engagement in shared decision making. Objectives This study aimed to develop a scale that measures the scientific information quality (SIQ) for online information concerning wisdom tooth problems and to conduct a quality evaluation for online TMs resources. In addition, the study evaluated whether a specific piece of readability software (Readability Studio Professional 2012) might be reliable in measuring information comprehension, and explored predictors for the SIQ Scale. Methods A cross-sectional sample of websites was retrieved using certain keywords and phrases such as “impacted wisdom tooth problems” using 3 popular search engines. The retrieved websites (n=150) were filtered. The retained 50 websites were evaluated to assess their characteristics, usability, accessibility, trust, readability, SIQ, and their credibility using DISCERN and Health on the Net Code (HoNCode). Results Websites’ mean scale scores varied significantly across website affiliation groups such as governmental, commercial, and treatment provider bodies. The SIQ Scale had a good internal consistency (alpha=.85) and was significantly correlated with DISCERN (r=.82, P<.01) and HoNCode (r=.38, P<.01). Less than 25% of websites had SIQ scores above 75%. The mean readability grade (10.3, SD 1.9) was above the recommended level, and was significantly correlated with the Scientific Information Comprehension Scale (r=.45. P<.01), which provides evidence for convergent validity. Website affiliation and DISCERN were significantly associated with SIQ (P<.01) and explained 76% of the SIQ variance. Conclusion The developed SIQ Scale was found to demonstrate reliability and initial validity. Website affiliation, DISCERN, and HoNCode were significant predictors for the quality of scientific information. The Readability Studio software estimates were associated with scientific information comprehensiveness measures. PMID:26443470
Viscosity and diffusivity in melts: from unary to multicomponent systems

NASA Astrophysics Data System (ADS)

Chen, Weimin; Zhang, Lijun; Du, Yong; Huang, Baiyun

2014-05-01

Viscosity and diffusivity, two important transport coefficients, are systematically investigated from unary melt to binary to multicomponent melts in the present work. By coupling with Kaptay's viscosity equation of pure liquid metals and effective radii of diffusion species, the Sutherland equation is modified by taking the size effect into account, and further derived into an Arrhenius formula for the convenient usage. Its reliability for predicting self-diffusivity and impurity diffusivity in unary liquids is then validated by comparing the calculated self-diffusivities and impurity diffusivities in liquid Al- and Fe-based alloys with the experimental and the assessed data. Moreover, the Kozlov model was chosen among various viscosity models as the most reliable one to reproduce the experimental viscosities in binary and multicomponent melts. Based on the reliable viscosities calculated from the Kozlov model, the modified Sutherland equation is utilized to predict the tracer diffusivities in binary and multicomponent melts, and validated in Al-Cu, Al-Ni and Al-Ce-Ni melts. Comprehensive comparisons between the calculated results and the literature data indicate that the experimental tracer diffusivities and the theoretical ones can be well reproduced by the present calculations. In addition, the vacancy-wind factor in binary liquid Al-Ni alloys with the increasing temperature is also discussed. What's more, the calculated inter-diffusivities in liquid Al-Cu, Al-Ni and Al-Ag-Cu alloys are also in excellent agreement with the measured and theoretical data. Comparisons between the simulated concentration profiles and the measured ones in Al-Cu, Al-Ce-Ni and Al-Ag-Cu melts are further used to validate the present calculation method.
Development of an easy-to-use Spanish Health Literacy test.

PubMed

Lee, Shoou-Yih D; Bender, Deborah E; Ruiz, Rafael E; Cho, Young Ik

2006-08-01

The study was intended to develop and validate a health literacy test, termed the Short Assessment of Health Literacy for Spanish-speaking Adults (SAHLSA), for the Spanish-speaking population. The design of SAHLSA was based on the Rapid Estimate of Adult Literacy in Medicine (REALM), known as the most easily administered tool for assessing health literacy in English. In addition to the word recognition test in REALM, SAHLSA incorporates a comprehension test using multiple-choice questions designed by an expert panel. Validation of SAHLSA involved testing and comparing the tool with other health literacy instruments in a sample of 201 Spanish-speaking and 202 English-speaking subjects recruited from the Ambulatory Care Center at UNC Health Care. With only the word recognition test, REALM could not differentiate the level of health literacy in Spanish. The SAHLSA significantly improved the differentiation. Item response theory analysis was performed to calibrate the SAHLSA and reduce the instrument to 50 items. The resulting instrument, SAHLSA-50, was correlated with the Test of Functional Health Literacy in Adults, another health literacy instrument, at r=0.65. The SAHLSA-50 score was significantly and positively associated with the physical health status of Spanish-speaking subjects (p<.05), holding constant age and years of education. The instrument displayed good internal reliability (Cronbach's alpha=0.92) and test-retest reliability (Pearson's r=0.86). The new instrument, SAHLSA-50, has good reliability and validity. It could be used in the clinical or community setting to screen for low health literacy among Spanish speakers.
Development of an Easy-to-Use Spanish Health Literacy Test

PubMed Central

Lee, Shoou-Yih D; Bender, Deborah E; Ruiz, Rafael E; Cho, Young Ik

2006-01-01

Objective The study was intended to develop and validate a health literacy test, termed the Short Assessment of Health Literacy for Spanish-speaking Adults (SAHLSA), for the Spanish-speaking population. Study Design The design of SAHLSA was based on the Rapid Estimate of Adult Literacy in Medicine (REALM), known as the most easily administered tool for assessing health literacy in English. In addition to the word recognition test in REALM, SAHLSA incorporates a comprehension test using multiple-choice questions designed by an expert panel. Data Collection Validation of SAHLSA involved testing and comparing the tool with other health literacy instruments in a sample of 201 Spanish-speaking and 202 English-speaking subjects recruited from the Ambulatory Care Center at UNC Health Care. Principal Findings With only the word recognition test, REALM could not differentiate the level of health literacy in Spanish. The SAHLSA significantly improved the differentiation. Item response theory analysis was performed to calibrate the SAHLSA and reduce the instrument to 50 items. The resulting instrument, SAHLSA-50, was correlated with the Test of Functional Health Literacy in Adults, another health literacy instrument, at r = 0.65. The SAHLSA-50 score was significantly and positively associated with the physical health status of Spanish-speaking subjects (p < .05), holding constant age and years of education. The instrument displayed good internal reliability (Cronbach's α = 0.92) and test–retest reliability (Pearson's r = 0.86). Conclusions The new instrument, SAHLSA-50, has good reliability and validity. It could be used in the clinical or community setting to screen for low health literacy among Spanish speakers. PMID:16899014
Prognostics-based qualification of high-power white LEDs using Lévy process approach

NASA Astrophysics Data System (ADS)

Yung, Kam-Chuen; Sun, Bo; Jiang, Xiaopeng

2017-01-01

Due to their versatility in a variety of applications and the growing market demand, high-power white light-emitting diodes (LEDs) have attracted considerable attention. Reliability qualification testing is an essential part of the product development process to ensure the reliability of a new LED product before its release. However, the widely used IES-TM-21 method does not provide comprehensive reliability information. For more accurate and effective qualification, this paper presents a novel method based on prognostics techniques. Prognostics is an engineering technology predicting the future reliability or determining the remaining useful lifetime (RUL) of a product by assessing the extent of deviation or degradation from its expected normal operating conditions. A Lévy subordinator of a mixed Gamma and compound Poisson process is used to describe the actual degradation process of LEDs characterized by random sporadic small jumps of degradation degree, and the reliability function is derived for qualification with different distribution forms of jump sizes. The IES LM-80 test results reported by different LED vendors are used to develop and validate the qualification methodology. This study will be helpful for LED manufacturers to reduce the total test time and cost required to qualify the reliability of an LED product.
Development and validation of an assessment of adult educators' reading instructional knowledge.

PubMed

Bell, Sherry Mee; McCallum, R Steve; Ziegler, Mary; Davis, C A; Coleman, Maribeth

2013-10-01

The purpose of this paper is to describe briefly the development and utility of the Assessment of Reading Instructional Knowledge-Adults (ARIK-A), the only nationally normed (n = 468) measure of adult reading instructional knowledge, created to facilitate professional development of adult educators. Developmental data reveal reliabilities ranging from 0.73 to 0.85 for five ARIK-A scales (alphabetics, fluency, vocabulary, comprehension, and assessment) and 0.91 for the composite score; factor analytic data and expert review provide support for construct validity as well. Information on how to use the ARIK-A to determine mastery and relative standing is presented. With two alternate forms, the ARIK-A is a promising and needed tool for adult education practitioners within continuing education and professional development contexts.
A DYNAMIC VALGUS INDEX THAT COMBINES HIP AND KNEE ANGLES: ASSESSMENT OF UTILITY IN FEMALES WITH PATELLOFEMORAL PAIN.

PubMed

Scholtes, Sara A; Salsich, Gretchen B

2017-06-01

Two=dimensional motion analysis of lower=extremity movement typically focuses on the knee frontal plane projection angle, which considers the position of the femur and the tibia. A measure that includes the pelvis may provide a more comprehensive and accurate indicator of lower=extremity movement. Hypothesis/Purpose: The purpose of the study was to describe the utility of a two=dimensional dynamic valgus index (DVI) in females with patellofemoral pain. The hypothesis was that the DVI would be more reliable and valid than the knee frontal plane projection angle, be greater in females with patellofemoral pain during a single=limb squat than in females without patellofemoral pain, and decrease in females with patellofemoral pain following instruction. Study Design: Controlled Laboratory Study. Data were captured while participants performed single limb squats under two conditions: usual and corrected. Two=dimensional hip and knee angles and a DVI that combined the hip and knee angles were calculated. Three=dimensional sagittal, frontal, and transverse plane angles of the hip and knee and a DVI combining the frontal and transverse plane angles were calculated. The two=dimensional DVI demonstrated moderate reliability (ICC=0.74). The correlation between the two=dimensional and three=dimensional DVI's was 0.635 (p<0001). Females with patellofemoral pain demonstrated a greater two=dimensional DVI (31.14 °±13.36 °) than females without patellofemoral pain (18.30 °±14.97 °; p=0.010). Females with patellofemoral pain demonstrated a decreased DVI in the corrected (19.04 °±13.70 °) versus usual (31.14 °±13.36 °) condition (p=0.001). The DVI is a reliable and valid measure that may provide a more comprehensive assessment of lower=extremity movement patterns than the knee frontal plane projection angle in individuals with lower=extremity musculoskeletal pain problems. 2b.
Development of and Field-Test Results for the CAHPS PCMH Survey

PubMed Central

Scholle, Sarah Hudson; Vuong, Oanh; Ding, Lin; Fry, Stephanie; Gallagher, Patricia; Brown, Julie A.; Hays, Ron D.; Cleary, Paul D.

2017-01-01

Objective To develop and evaluate survey questions that assess processes of care relevant to Patient-Centered Medical Homes (PCMHs). Research Design We convened expert panels, reviewed evidence on effective care practices and existing surveys, elicited broad public input, and conducted cognitive interviews and a field test to develop items relevant to PCMHs that could be added to the CAHPS® Clinician & Group (CG-CAHPS) 1.0 Survey. Surveys were tested using a two-contact mail protocol in 10 adult and 33 pediatric practices (both private and community health centers) in Massachusetts. A total of 4,875 completed surveys were received (overall response rate of 25%). Analyses We calculated the rate of valid responses for each item. We conducted exploratory factor analyses and estimated item-to-total correlations, individual and site level reliability, and correlations among proposed multi-item composites. Results Ten items in four new domains (Comprehensiveness, Information, Self-Management Support, and Shared Decision-Making) and four items in two existing domains (Access and Coordination of Care) were selected to be supplemental items to be used in conjunction with the adult CG-CAHPS 1.0 survey. For the child version, four items in each of two new domains (Information and Self-Management Support) and five items in existing domains (Access, Comprehensiveness-Prevention, Coordination of Care) were selected. Conclusions This study provides support for the reliability and validity of new items to supplement the CG-CAHPS 1.0 survey to assess aspects of primary care that are important attributes of Patient-Centered Medical Homes. PMID:23064272
Haptic-2D: A new haptic test battery assessing the tactual abilities of sighted and visually impaired children and adolescents with two-dimensional raised materials.

PubMed

Mazella, Anaïs; Albaret, Jean-Michel; Picard, Delphine

2016-01-01

To fill an important gap in the psychometric assessment of children and adolescents with impaired vision, we designed a new battery of haptic tests, called Haptic-2D, for visually impaired and sighted individuals aged five to 18 years. Unlike existing batteries, ours uses only two-dimensional raised materials that participants explore using active touch. It is composed of 11 haptic tests, measuring scanning skills, tactile discrimination skills, spatial comprehension skills, short-term tactile memory, and comprehension of tactile pictures. We administered this battery to 138 participants, half of whom were sighted (n=69), and half visually impaired (blind, n=16; low vision, n=53). Results indicated a significant main effect of age on haptic scores, but no main effect of vision or Age × Vision interaction effect. Reliability of test items was satisfactory (Cronbach's alpha, α=0.51-0.84). Convergent validity was good, as shown by a significant correlation (age partialled out) between total haptic scores and scores on the B101 test (rp=0.51, n=47). Discriminant validity was also satisfactory, as attested by a lower but still significant partial correlation between total haptic scores and the raw score on the verbal WISC (rp=0.43, n=62). Finally, test-retest reliability was good (rs=0.93, n=12; interval of one to two months). This new psychometric tool should prove useful to practitioners working with young people with impaired vision. Copyright © 2015 Elsevier Ltd. All rights reserved.
[Spanish validation of the Boston Carpal Tunnel Questionnaire].

PubMed

Oteo-Álvaro, Ángel; Marín, María T; Matas, José A; Vaquero, Javier

2016-03-18

To describe the process of cultural adaptation and validation of the Boston Carpal Tunnel Questionnaire (BCTQ) measuring symptom intensity, functional status and quality of life in carpal tunnel syndrome patients and to report the psychometric properties of this version. A 3 expert panel supervised the adaptation process. After translation, review and back-translation of the original instrument, a new Spanish version was obtained, which was administered to 2 patient samples: a pilot sample of 20 patients for assessing comprehension, and a 90 patient sample for assessing structural validity (factor analysis and reliability), construct validity and sensitivity to change. A re-test measurement was carried out in 21 patients. Follow-up was accomplished in 40 patients. The questionnaire was well accepted by all participants. Celling effect was observed for 3 items. Reliability was very good, internal consistency: αS=0.91 y αF=0.87; test-retest stability: rS=0.939 and rF=0.986. Both subscales fitted to a general dimension. Subscales correlated with dynamometer measurements (rS=0.77 and rF=0.75) and showed to be related to abnormal 2-point discrimination, muscle atrophy and electromyography deterioration level. Scores properly correlated with other validated instruments: Douleur Neuropatique 4 questions and Brief Pain Inventory. BCTQ demonstrated to be sensitive to clinical changes, with large effect sizes (dS=-3.3 and dF=-1.9). The Spanish version of the BCTQ shows good psychometric properties warranting its use in clinical settings. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Comprehensive proficiency-based inanimate training for robotic surgery: reliability, feasibility, and educational benefit.

PubMed

Arain, Nabeel A; Dulan, Genevieve; Hogg, Deborah C; Rege, Robert V; Powers, Cathryn E; Tesfay, Seifu T; Hynan, Linda S; Scott, Daniel J

2012-10-01

We previously developed a comprehensive proficiency-based robotic training curriculum demonstrating construct, content, and face validity. This study aimed to assess reliability, feasibility, and educational benefit associated with curricular implementation. Over an 11-month period, 55 residents, fellows, and faculty (robotic novices) from general surgery, urology, and gynecology were enrolled in a 2-month curriculum: online didactics, half-day hands-on tutorial, and self-practice using nine inanimate exercises. Each trainee completed a questionnaire and performed a single proctored repetition of each task before (pretest) and after (post-test) training. Tasks were scored for time and errors using modified FLS metrics. For inter-rater reliability (IRR), three trainees were scored by two raters and analyzed using intraclass correlation coefficients (ICC). Data from eight experts were analyzed using ICC and Cronbach's α to determine test-retest reliability and internal consistency, respectively. Educational benefit was assessed by comparing baseline (pretest) and final (post-test) trainee performance; comparisons used Wilcoxon signed-rank test. Of the 55 trainees that pretested, 53 (96 %) completed all curricular components in 9-17 h and reached proficiency after completing an average of 72 ± 28 repetitions over 5 ± 1 h. Trainees indicated minimal prior robotic experience and "poor comfort" with robotic skills at baseline (1.8 ± 0.9) compared to final testing (3.1 ± 0.8, p < 0.001). IRR data for the composite score revealed an ICC of 0.96 (p < 0.001). Test-retest reliability was 0.91 (p < 0.001) and internal consistency was 0.81. Performance improved significantly after training for all nine tasks and according to composite scores (548 ± 176 vs. 914 ± 81, p < 0.001), demonstrating educational benefit. This curriculum is associated with high reliability measures, demonstrated feasibility for a large cohort of trainees, and yielded significant educational benefit. Further studies and adoption of this curriculum are encouraged.

Ethical Implications of Validity-vs.-Reliability Trade-Offs in Educational Research

ERIC Educational Resources Information Center

Fendler, Lynn

2016-01-01

In educational research that calls itself empirical, the relationship between validity and reliability is that of trade-off: the stronger the bases for validity, the weaker the bases for reliability (and vice versa). Validity and reliability are widely regarded as basic criteria for evaluating research; however, there are ethical implications of…
Clinical assessment of adventitious movements.

PubMed

Brasić, J R; Barnett, J Y; Sheitman, B B; Lafargue, R T; Ahn, S C

1998-12-01

Many procedures with variable validity and reliability have been developed in research settings to evaluate adventitious movements and related phenomena in specific populations, e.g., people with schizophrenia treated with dopamine antagonists, but these only provide global assessments or rate specific movements. A battery for rating individuals with possible movements disorders in a comprehensive way in clinical settings is needed so a protocol to assess briefly and thoroughly potential movement disorders was videotaped for five prepubertal boys with autistic disorder and severe mental retardation in a clinical trial. Utilizing a Movement Assessment Battery, four raters independently scored videotapes of 10-16 movements assessments of each of the five subjects. Experienced raters attained agreement of 59% to 100% on ratings of tardive dyskinesia and 48% to 100% on tics. Hindrances to reliability included poor quality of some tapes, high activity of subjects, and fatigue of raters.
What to Do With "Moderate" Reliability and Validity Coefficients?

PubMed

Post, Marcel W

2016-07-01

Clinimetric studies may use criteria for test-retest reliability and convergent validity such that correlation coefficients as low as .40 are supportive of reliability and validity. It can be argued that moderate (.40-.60) correlations should not be interpreted in this way and that reliability coefficients <.70 should be considered as indicative of unreliability. Convergent validity coefficients in the .40 to .60 or .40 to .70 range should be considered as indications of validity problems, or as inconclusive at best. Studies on reliability and convergent should be designed in such a way that it is realistic to expect high reliability and validity coefficients. Multitrait multimethod approaches are preferred to study construct (convergent-divergent) validity. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Dutch translation and validation of the readiness for interprofessional learning scale (RIPLS) in a primary healthcare context.

PubMed

Pype, Peter; Deveugele, Myriam

2016-12-01

Interprofessional education and collaborative practice are gradually gaining importance in the context of growing healthcare complexity. The readiness for interprofessional learning scale (RIPLS) is a well-known scale that can identify attitudinal barriers and variance across professions, which may affect educational interventions. This study aims to translate the English RIPLS into Dutch and to test its reliability and validity. The scale was translated and back-translated by two pairs of people independently and tested for feasibility and comprehensibility. The translated scale was used with 219 general practitioners, 238 community nurses, and 53 palliative home-care nurses. Exploratory factor analysis was used to assess construct validity. Confirmatory factor analysis was done to generate a fit model. Cronbach's alpha was computed to evaluate internal consistency. Regression analysis was used to evaluate the effect of the RIPLS score on the level of learning through collaboration and to gauge the influence of the participants' gender, age, previous palliative care education, type of practice and years in practice. Confirmatory and exploratory factor analysis confirms the factor structure of the original version. The Dutch version shows good reliability (overall Cronbach's alpha: 0.88; intraclass correlation coefficient after test-retest: 0.718 (95%CI: 0.499-0.852). The RIPLS score correlates with the amount of workplace learning during collaboration (discriminant validity: P < 0.001). The Dutch translation of the RIPLS is now ready for comparative studies.
Psychometric properties of the Chinese version of the Difficulties in Emotion Regulation Scale (DERS): Factor structure, reliability, and validity.

PubMed

Li, Jian; Han, Zhuo Rachel; Gao, Mengyu M; Sun, Xin; Ahemaitijiang, Nigela

2018-05-01

Numerous studies have identified the significant role of emotion regulation in an individual's psychological and social functioning. Ever since its development, the Difficulties in Emotion Regulation Scale (DERS) has been widely adopted as a comprehensive measure to assess emotion regulation problems among English-speaking adults. To assess emotion regulation in adults from Chinese-speaking societies and to promote future cross-cultural examination of the emotion regulation processes, the authors aimed to develop a Chinese version of the DERS and provide an initial validation of this instrument. For the purpose of the current study, we recruited 862 Chinese adults from universities and local companies. The results indicated a similar six-factor solution in the Chinese version to the original version. Internal consistency and test-retest reliability were good. Concurrent validity was assessed by examining the correlations of the DERS and its subscales with measures of psychopathological symptoms and self-regulation of negative mood. The results demonstrated strong correlations of the DERS subscales with the Symptom Checklist-90 (SCL-90) and the Generalized Expectancy for Negative Mood Regulation Scale, except for that between the awareness subscale and the SCL-90. For the convergent validity, most DERS subscales were significantly correlated with personality traits, emotional intelligence, and self-control ability, with several exceptions. These findings are discussed within the context of the relevant literature. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Validation of the Minority Stress Scale Among Italian Gay and Bisexual Men.

PubMed

Pala, Andrea Norcini; Dell'Amore, Francesca; Steca, Patrizia; Clinton, Lauren; Sandfort, Theodorus; Rael, Christine

2017-12-01

The experience of sexual orientation stigma (e.g., homophobic discrimination and physical aggression) generates minority stress, a chronic form of psychosocial stress. Minority stress has been shown to have a negative effect on gay and bisexual men's (GBM's) mental and physical health, increasing the rates of depression, suicidal ideation, and HIV risk behaviors. In conservative religious settings, such as Italy, sexual orientation stigma can be more frequently and/or more intensively experienced. However, minority stress among Italian GBM remains understudied. The aim of this study was to explore the dimensionality, internal reliability, and convergent validity of the Minority Stress Scale (MSS), a comprehensive instrument designed to assess the manifestations of sexual orientation stigma. The MSS consists of 50 items assessing (a) Structural Stigma, (b) Enacted Stigma, (c) Expectations of Discrimination, (d) Sexual Orientation Concealment, (e) Internalized Homophobia Toward Others, (f) Internalized Homophobia toward Oneself, and (g) Stigma Awareness. We recruited an online sample of 451 Italian GBM to take the MSS. We tested convergent validity using the Perceived Stress Questionnaire. Through exploratory factor analysis, we extracted the 7 theoretical factors and an additional 3-item factor assessing Expectations of Discrimination From Family Members. The MSS factors showed good internal reliability (ordinal α > .81) and good convergent validity. Our scale can be suitable for applications in research settings, psychosocial interventions, and, potentially, in clinical practice. Future studies will be conducted to further investigate the properties of the MSS, exploring the association with additional health-related measures (e.g., depressive symptoms and anxiety).
The Complementary Health Approaches for Pain Survey (CHAPS): Validity testing and characteristics of a rural population with pain

PubMed Central

2018-01-01

Objectives Little is known about patterns and correlates of Complementary Health Approaches (CHAs) in chronic pain populations, particularly in rural, underserved communities. This article details the development and implementation of a new survey instrument designed to address this gap, the Complementary Health Approaches for Pain Survey (CHAPS). Design Following pilot-testing using pre-specified criteria to assess quality and comprehension in our target population, and after feedback regarding face-validity from content experts and stakeholders, the final cross-sectional self-report survey required 10–12 minutes to complete. It contained 69 demographic, lifestyle and health-related factors, and utilized a Transtheoretical Model (TTM) underpinning to assess short- and long-term use of 12 CHAs for pain management. Twenty additional items on pain severity, feelings, clinical outcomes, and activities were assessed using the Short-Form Global Pain Scale (SF-GPS); Internal reliability was assessed using Cronbach’s alpha. Settings/location Investigators conducted consecutive sampling in four West Virginia pain management and rheumatology practices. Participants 301 Appalachian adult patients seeking conventional care for pain management. Results Response rates were high (88% ± 4.1%). High quality and comprehension deemed the CHAPS an appropriate measurement tool in a rural population with pain. Missing data were unrelated to patient characteristics. Participants predominantly experienced chronic pain (93%), had five or more health conditions (56%, Mean = 5.4±3.1), were white (92%), female (57%), and middle-aged (Mean = 55.6 (SD = 13.6) years). Over 40% were disabled (43%) and/or obese (44%, Mean BMI = 33.4±31.5). Additionally, 44% used opioids, 31% used other prescription medications, and 66% used at least one CHA for pain, with 48% using CHAs for greater than 6 months. There was high internal reliability of the SF-GPS (alpha = .93) and satisfactory internal reliability for each of the five TTM stages across (all) twelve CHAs: precontemplation (0.89), contemplation (0.72), preparation (0.75), action (0.70), and maintenance (0.70). Conclusions The CHAPS is the first comprehensive measurement tool to assess CHA use specifically for pain management. Ease of administration in a population with pain support further use in population- and clinic-based studies in similar populations. PMID:29718951
Clinimetric Testing of the Comprehensive Cervical Dystonia Rating Scale

PubMed Central

Comella, C. L.; Perlmutter, J.S.; Jinnah, H. A.; Waliczek, T. A.; Rosen, A. R.; Galpern, W. R.; Adler, C. H.; Barbano, R. L.; Factor, S. A.; Goetz, C.G.; Jankovic, J.; Reich, S. G.; Rodriguez, R. L.; Severt, W. L.; Zurowski, M.; Fox, S. H.; Stebbins, G.T.

2016-01-01

Objective To test the clinimetric properties of the Comprehensive Cervical Dystonia Rating Scale. Background This is a modular scale with modifications of the Toronto Western Spasmodic Torticollis Rating Scale (composed of three subscales assessing motor severity, disability and pain) now referred to as the revised Toronto Western Spasmodic Torticollis Scale-2.; a newly developed psychiatric screening instrument; and the Cervical Dystonia Impact Profile-58 as a quality of life measure. Methods Ten dystonia experts rated subjects with cervical dystonia using the comprehensive scale. Clinimetric techniques assessed each module of the scale for reliability, item correlation and factor structure. Results There were 208 cervical dystonia patients (73% women, age 59±10 years, duration 15±12 years). The internal consistency of the motor severity subscale was acceptable (Cronbach’s alpha = 0.57). Item to total correlations showed that elimination of items with low correlations (<0.20) increased alpha to 0.71. Internal consistency estimates for the subscales for disability and pain were 0.88 and 0.95 respectively. The psychiatric screening scale had a Cronbach’s alpha of 0.84 and satisfactory item to total correlations. When the subscales of the Toronto Western Spasmodic Torticollis scale -2 were combined with the psychiatric screening scale, Cronbach's alpha was 0.88, and construct validity assessment demonstrated four rational factors: motor, disability, pain and psychiatric disorders. The Cervical Dystonia Impact Profile-58 had an alpha of 0.98 and its construction was validated through a confirmatory factor analysis. Conclusions The modules of the Comprehensive Cervical Dystonia Rating Scale are internally consistent with a logical factor structure. PMID:26971359
Assessment of sedentary behaviors and transport-related activities by questionnaire: a validation study.

PubMed

Mensah, Keitly; Maire, Aurélia; Oppert, Jean-Michel; Dugas, Julien; Charreire, Hélène; Weber, Christiane; Simon, Chantal; Nazare, Julie-Anne

2016-08-09

Comprehensive assessment of sedentary behavior (SB) and physical activity (PA), including transport-related activities (TRA), is required to design innovative PA promotion strategies. There are few validated instruments that simultaneously assess the different components of human movement according to their context of practice (e.g. work, transport, leisure). We examined test-retest reliability and validity of the Sedentary, Transportation and Activity Questionnaire (STAQ), a newly developed questionnaire dedicated to assessing context-specific SB, TRA and PA. Ninety six subjects (51 women) kept a contextualized activity-logbook and wore a hip accelerometer (Actigraph GT3X + (TM)) for a 7-day or 14-day period, at the end of which they completed the STAQ. Activity-energy expenditure was measured in a subgroup of 45 subjects using the double labeled water (DLW) method. Test-retest reliability was assessed using intra-class-coefficients (ICC) in a subgroup of 32 subjects who filled the questionnaire twice one month apart. Accelerometry was annotated using the logbook to obtain total and context-specific objective estimates of SB. Spearman correlations, Bland-Altman plots and ICC were used to analyze validity with logbook, accelerometry and DLW data validity criteria. Test-retest reliability was fair for total sitting time (ICC = 0.52), good to excellent for work sitting time (ICC = 0.71), transport-related walking (ICC = 0.61) and car use (ICC = 0.67), and leisure screen-related SB (ICC = 0.64-0.79), but poor for total sitting time during leisure and transport-related contexts. For validity, compared to accelerometry, significant correlations were found for STAQ estimates of total (r = 0.54) and context-specific sitting times with stronger correlations for work sitting time (r = 0.88), and screen times (TV/DVD viewing: r = 0.46; other screens: r = 0.42) than for transport (r = 0.35) or leisure-related sitting-times (r = 0.19). Compared to contextualized logbook, STAQ estimates of TRA was higher for car (r = 0.65) than for active transport (r = 0.41). The questionnaire generally overestimated work- and leisure-related SB and sitting times, while it underestimated total and transport-related sitting times. The STAQ showed acceptable reliability and a good ranking validity for assessment of context-specific SB and TRA. This instrument appears as a useful tool to study SB, TRA and PA in context in adults.
Performance evaluation of non-targeted peak-based cross-sample analysis for comprehensive two-dimensional gas chromatography-mass spectrometry data and application to processed hazelnut profiling.

PubMed

Kiefl, Johannes; Cordero, Chiara; Nicolotti, Luca; Schieberle, Peter; Reichenbach, Stephen E; Bicchi, Carlo

2012-06-22

The continuous interest in non-targeted profiling induced the development of tools for automated cross-sample analysis. Such tools were found to be selective or not comprehensive thus delivering a biased view on the qualitative/quantitative peak distribution across 2D sample chromatograms. Therefore, the performance of non-targeted approaches needs to be critically evaluated. This study focused on the development of a validation procedure for non-targeted, peak-based, GC×GC-MS data profiling. The procedure introduced performance parameters such as specificity, precision, accuracy, and uncertainty for a profiling method known as Comprehensive Template Matching. The performance was assessed by applying a three-week validation protocol based on CITAC/EURACHEM guidelines. Optimized ¹D and ²D retention times search windows, MS match factor threshold, detection threshold, and template threshold were evolved from two training sets by a semi-automated learning process. The effectiveness of proposed settings to consistently match 2D peak patterns was established by evaluating the rate of mismatched peaks and was expressed in terms of results accuracy. The study utilized 23 different 2D peak patterns providing the chemical fingerprints of raw and roasted hazelnuts (Corylus avellana L.) from different geographical origins, of diverse varieties and different roasting degrees. The validation results show that non-targeted peak-based profiling can be reliable with error rates lower than 10% independent of the degree of analytical variance. The optimized Comprehensive Template Matching procedure was employed to study hazelnut roasting profiles and in particular to find marker compounds strongly dependent on the thermal treatment, and to establish the correlation of potential marker compounds to geographical origin and variety/cultivar and finally to reveal the characteristic release of aroma active compounds. Copyright © 2012 Elsevier B.V. All rights reserved.
Measurement properties of patient-reported outcome measures (PROMs) used in adult patients with chronic kidney disease: A systematic review

PubMed Central

Kyte, Derek; Cockwell, Paul; Marshall, Tom; Gheorghe, Adrian; Keeley, Thomas; Slade, Anita; Calvert, Melanie

2017-01-01

Background Patient-reported outcome measures (PROMs) can provide valuable information which may assist with the care of patients with chronic kidney disease (CKD). However, given the large number of measures available, it is unclear which PROMs are suitable for use in research or clinical practice. To address this we comprehensively evaluated studies that assessed the measurement properties of PROMs in adults with CKD. Methods Four databases were searched; reference list and citation searching of included studies was also conducted. The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist was used to appraise the methodological quality of the included studies and to inform a best evidence synthesis for each PROM. Results The search strategy retrieved 3,702 titles/abstracts. After 288 duplicates were removed, 3,414 abstracts were screened and 71 full-text articles were retrieved for further review. Of these, 24 full-text articles were excluded as they did not meet the eligibility criteria. Following reference list and citation searching, 19 articles were retrieved bringing the total number of papers included in the final analysis to 66. There was strong evidence supporting internal consistency and moderate evidence supporting construct validity for the Kidney Disease Quality of Life-36 (KDQOL-36) in pre-dialysis patients. In the dialysis population, the KDQOL-Short Form (KDQOL-SF) had strong evidence for internal consistency and structural validity and moderate evidence for test-retest reliability and construct validity while the KDQOL-36 had moderate evidence of internal consistency, test-retest reliability and construct validity. The End Stage Renal Disease-Symptom Checklist Transplantation Module (ESRD-SCLTM) demonstrated strong evidence for internal consistency and moderate evidence for test-retest reliability, structural and construct validity in renal transplant recipients. Conclusions We suggest considering the KDQOL-36 for use in pre-dialysis patients; the KDQOL-SF or KDQOL-36 for dialysis patients and the ESRD-SCLTM for use in transplant recipients. However, further research is required to evaluate the measurement error, structural validity, responsiveness and patient acceptability of PROMs used in CKD. PMID:28636678
Initial validation of the Argentinean Spanish version of the PedsQL™ 4.0 Generic Core Scales in children and adolescents with chronic diseases: acceptability and comprehensibility in low-income settings

PubMed Central

Roizen, Mariana; Rodríguez, Susana; Bauer, Gabriela; Medin, Gabriela; Bevilacqua, Silvina; Varni, James W; Dussel, Veronica

2008-01-01

Background To validate the Argentinean Spanish version of the PedsQL™ 4.0 Generic Core Scales in Argentinean children and adolescents with chronic conditions and to assess the impact of socio-demographic characteristics on the instrument's comprehensibility and acceptability. Reliability, and known-groups, and convergent validity were tested. Methods Consecutive sample of 287 children with chronic conditions and 105 healthy children, ages 2–18, and their parents. Chronically ill children were: (1) attending outpatient clinics and (2) had one of the following diagnoses: stem cell transplant, chronic obstructive pulmonary disease, HIV/AIDS, cancer, end stage renal disease, complex congenital cardiopathy. Patients and adult proxies completed the PedsQL™ 4.0 and an overall health status assessment. Physicians were asked to rate degree of health status impairment. Results The PedsQL™ 4.0 was feasible (only 9 children, all 5 to 7 year-olds, could not complete the instrument), easy to administer, completed without, or with minimal, help by most children and parents, and required a brief administration time (average 5–6 minutes). People living below the poverty line and/or low literacy needed more help to complete the instrument. Cronbach Alpha's internal consistency values for the total and subscale scores exceeded 0.70 for self-reports of children over 8 years-old and parent-reports of children over 5 years of age. Reliability of proxy-reports of 2–4 year-olds was low but improved when school items were excluded. Internal consistency for 5–7 year-olds was low (α range = 0.28–0.76). Construct validity was good. Child self-report and parent proxy-report PedsQL™ 4.0 scores were moderately but significantly correlated (ρ = 0.39, p < 0.0001) and both significantly correlated with physician's assessment of health impairment and with child self-reported overall health status. The PedsQL™ 4.0 discriminated between healthy and chronically ill children (72.72 and 66.87, for healthy and ill children, respectively, p = 0.01), between different chronic health conditions, and children from lower socioeconomic status. Conclusion Results suggest that the Argentinean Spanish PedsQL™ 4.0 is suitable for research purposes in the public health setting for children over 8 years old and parents of children over 5 years old. People with low income and low literacy need help to complete the instrument. Steps to expand the use of the Argentinean Spanish PedsQL™ 4.0 include an alternative approach to scoring for the 2–4 year-olds, further understanding of how to increase reliability for the 5–7 year-olds self-report, and confirmation of other aspects of validity. PMID:18687134
Initial validation of the Argentinean Spanish version of the PedsQL 4.0 Generic Core Scales in children and adolescents with chronic diseases: acceptability and comprehensibility in low-income settings.

PubMed

Roizen, Mariana; Rodríguez, Susana; Bauer, Gabriela; Medin, Gabriela; Bevilacqua, Silvina; Varni, James W; Dussel, Veronica

2008-08-07

To validate the Argentinean Spanish version of the PedsQL 4.0 Generic Core Scales in Argentinean children and adolescents with chronic conditions and to assess the impact of socio-demographic characteristics on the instrument's comprehensibility and acceptability. Reliability, and known-groups, and convergent validity were tested. Consecutive sample of 287 children with chronic conditions and 105 healthy children, ages 2-18, and their parents. Chronically ill children were: (1) attending outpatient clinics and (2) had one of the following diagnoses: stem cell transplant, chronic obstructive pulmonary disease, HIV/AIDS, cancer, end stage renal disease, complex congenital cardiopathy. Patients and adult proxies completed the PedsQL 4.0 and an overall health status assessment. Physicians were asked to rate degree of health status impairment. The PedsQL 4.0 was feasible (only 9 children, all 5 to 7 year-olds, could not complete the instrument), easy to administer, completed without, or with minimal, help by most children and parents, and required a brief administration time (average 5-6 minutes). People living below the poverty line and/or low literacy needed more help to complete the instrument. Cronbach Alpha's internal consistency values for the total and subscale scores exceeded 0.70 for self-reports of children over 8 years-old and parent-reports of children over 5 years of age. Reliability of proxy-reports of 2-4 year-olds was low but improved when school items were excluded. Internal consistency for 5-7 year-olds was low (alpha range = 0.28-0.76). Construct validity was good. Child self-report and parent proxy-report PedsQL 4.0 scores were moderately but significantly correlated (rho = 0.39, p < 0.0001) and both significantly correlated with physician's assessment of health impairment and with child self-reported overall health status. The PedsQL 4.0 discriminated between healthy and chronically ill children (72.72 and 66.87, for healthy and ill children, respectively, p = 0.01), between different chronic health conditions, and children from lower socioeconomic status. Results suggest that the Argentinean Spanish PedsQL 4.0 is suitable for research purposes in the public health setting for children over 8 years old and parents of children over 5 years old. People with low income and low literacy need help to complete the instrument. Steps to expand the use of the Argentinean Spanish PedsQL 4.0 include an alternative approach to scoring for the 2-4 year-olds, further understanding of how to increase reliability for the 5-7 year-olds self-report, and confirmation of other aspects of validity.
Evaluating Management Information Systems, A Protocol for Automated Peer Review Systems

PubMed Central

Black, Gordon C.

1980-01-01

This paper discusses key issues in evaluating an automated Peer Review System. Included are the conceptual base, design, steps in planning structural components, operation parameters, criteria, costs and a detailed outline or protocol for use in the evaluation. At the heart of the Peer Review System is the criteria utilized for measuring quality. Criteria evaluation should embrace, as a minimum, appropriateness, validity and reliability, and completemess or comprehensiveness of content. Such an evaluation is not complete without determining the impact (clinical outcome) of the service system or the patient and the population served.
Ball Bearing Analysis with the ORBIS Tool

NASA Technical Reports Server (NTRS)

Halpin, Jacob D.

2016-01-01

Ball bearing design is critical to the success of aerospace mechanisms. Key bearing performance parameters, such as load capability, stiffness, torque, and life all depend on accurate determination of the internal load distribution. Hence, a good analytical bearing tool that provides both comprehensive capabilities and reliable results becomes a significant asset to the engineer. This paper introduces the ORBIS bearing tool. A discussion of key modeling assumptions and a technical overview is provided. Numerous validation studies and case studies using the ORBIS tool are presented. All results suggest the ORBIS code closely correlates to predictions on bearing internal load distributions, stiffness, deflection and stresses.
Multi-viewpoint clustering analysis

NASA Technical Reports Server (NTRS)

Mehrotra, Mala; Wild, Chris

1993-01-01

In this paper, we address the feasibility of partitioning rule-based systems into a number of meaningful units to enhance the comprehensibility, maintainability and reliability of expert systems software. Preliminary results have shown that no single structuring principle or abstraction hierarchy is sufficient to understand complex knowledge bases. We therefore propose the Multi View Point - Clustering Analysis (MVP-CA) methodology to provide multiple views of the same expert system. We present the results of using this approach to partition a deployed knowledge-based system that navigates the Space Shuttle's entry. We also discuss the impact of this approach on verification and validation of knowledge-based systems.
The role of observational reference data for climate downscaling: Insights from the VALUE COST Action

NASA Astrophysics Data System (ADS)

Kotlarski, Sven; Gutiérrez, José M.; Boberg, Fredrik; Bosshard, Thomas; Cardoso, Rita M.; Herrera, Sixto; Maraun, Douglas; Mezghani, Abdelkader; Pagé, Christian; Räty, Olle; Stepanek, Petr; Soares, Pedro M. M.; Szabo, Peter

2016-04-01

VALUE is an open European network to validate and compare downscaling methods for climate change research (http://www.value-cost.eu). A key deliverable of VALUE is the development of a systematic validation framework to enable the assessment and comparison of downscaling methods. Such assessments can be expected to crucially depend on the existence of accurate and reliable observational reference data. In dynamical downscaling, observational data can influence model development itself and, later on, model evaluation, parameter calibration and added value assessment. In empirical-statistical downscaling, observations serve as predictand data and directly influence model calibration with corresponding effects on downscaled climate change projections. We here present a comprehensive assessment of the influence of uncertainties in observational reference data and of scale-related issues on several of the above-mentioned aspects. First, temperature and precipitation characteristics as simulated by a set of reanalysis-driven EURO-CORDEX RCM experiments are validated against three different gridded reference data products, namely (1) the EOBS dataset (2) the recently developed EURO4M-MESAN regional re-analysis, and (3) several national high-resolution and quality-controlled gridded datasets that recently became available. The analysis reveals a considerable influence of the choice of the reference data on the evaluation results, especially for precipitation. It is also illustrated how differences between the reference data sets influence the ranking of RCMs according to a comprehensive set of performance measures.
A comprehensive approach to psychometric assessment of instruments used in dementia educational interventions for health professionals: a cross-sectional study.

PubMed

Wang, Yao; Xiao, Lily Dongxia; He, Guo-Ping

2015-02-01

Suboptimal care for people with dementia in hospital settings has been reported and is attributed to the lack of knowledge and inadequate attitudes in dementia care among health professionals. Educational interventions have been widely used to improve care outcomes; however, Chinese-language instruments used in dementia educational interventions for health professionals are lacking. The aims of this study were to select, translate and evaluate instruments used in dementia educational interventions for Chinese health professionals in acute-care hospitals. A cross-sectional study design was used. A modified stratified random sampling was used to recruit 442 participants from different levels of hospitals in Changsha, China. Dementia care competence was used as a framework for the selection and evaluation of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale for health professionals in the study. These two scales were translated into Chinese using forward and back translation method. Content validity, test-retest reliability and internal consistency were assessed. Construct validity was tested using exploratory factor analysis. Known-group validity was established by comparing scores of Alzheimer's Disease Knowledge Scale and Dementia Care Attitudes Scale in two sub-groups. A person-centred care scale was utilised as a gold standard to establish concurrent validity of these two scales. Results demonstrated acceptable content validity, internal consistency, test-retest reliability and concurrent validity. Exploratory factor analysis presented a single-factor structure of the Chinese Alzheimer's Disease Knowledge Scale and a two-factor structure of the Chinese Dementia Care Attitudes Scale, supporting the conceptual dimensions of the original scales. The Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale demonstrated known-group validity evidenced by significantly higher scores identified from the sub-group with a longer work experience compared to those in the sub-group with less work experience. The use of dementia care competence as a framework to inform the selection and evaluation of instruments used in dementia educational interventions for health professionals has wide applicability in other areas. The results support that Chinese Alzheimer's Disease Knowledge Scale and Chinese Dementia Care Attitudes Scale are reliable and valid instruments for health professionals to use in acute-care settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Development and validation of the assessment of health literacy in breast and cervical cancer screening.

PubMed

Han, Hae-Ra; Huh, Boyun; Kim, Miyong T; Kim, Jiyun; Nguyen, Tam

2014-01-01

For many people limited health literacy is a major barrier to effective preventive health behavior such as cancer screening, yet a comprehensive health literacy measure that is specific to breast and cervical cancer screening is not readily available. The purpose of this article is to describe the development and testing of a new instrument to measure health literacy in the context of breast and cervical cancer screening, the Assessment of Health Literacy in Cancer Screening (AHL-C). The AHL-C is based on Baker's conceptualization of health literacy and modeled from the two most popular health literacy tests, the Rapid Estimate of Adult Literacy in Medicine and the Test of Functional Health Literacy in Adults. The AHL-C consists of four subscales; print literacy, numeracy, comprehension, and familiarity. We used baseline data from 560 Korean American immigrant women who participated in a community-based randomized trial designed to test the effect of a health literacy-focused intervention to promote breast and cervical cancer screening. Rigorous psychometric testing supports that the AHL-C is reliable, valid, and significantly correlated with theoretically selected variables. Future research is needed to test the utility of the AHL-C in predicting cancer screening outcomes.
Comprehensive Modeling of Temperature-Dependent Degradation Mechanisms in Lithium Iron Phosphate Batteries

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schimpe, Michael; von Kuepach, M. E.; Naumann, M.

For reliable lifetime predictions of lithium-ion batteries, models for cell degradation are required. A comprehensive semi-empirical model based on a reduced set of internal cell parameters and physically justified degradation functions for the capacity loss is developed and presented for a commercial lithium iron phosphate/graphite cell. One calendar and several cycle aging effects are modeled separately. Emphasis is placed on the varying degradation at different temperatures. Degradation mechanisms for cycle aging at high and low temperatures as well as the increased cycling degradation at high state of charge are calculated separately. For parameterization, a lifetime test study is conducted includingmore » storage and cycle tests. Additionally, the model is validated through a dynamic current profile based on real-world application in a stationary energy storage system revealing the accuracy. Tests for validation are continued for up to 114 days after the longest parametrization tests. In conclusion, the model error for the cell capacity loss in the application-based tests is at the end of testing below 1% of the original cell capacity and the maximum relative model error is below 21%.« less

Comprehensive Modeling of Temperature-Dependent Degradation Mechanisms in Lithium Iron Phosphate Batteries

DOE PAGES

Schimpe, Michael; von Kuepach, M. E.; Naumann, M.; ...

2018-01-12

For reliable lifetime predictions of lithium-ion batteries, models for cell degradation are required. A comprehensive semi-empirical model based on a reduced set of internal cell parameters and physically justified degradation functions for the capacity loss is developed and presented for a commercial lithium iron phosphate/graphite cell. One calendar and several cycle aging effects are modeled separately. Emphasis is placed on the varying degradation at different temperatures. Degradation mechanisms for cycle aging at high and low temperatures as well as the increased cycling degradation at high state of charge are calculated separately. For parameterization, a lifetime test study is conducted includingmore » storage and cycle tests. Additionally, the model is validated through a dynamic current profile based on real-world application in a stationary energy storage system revealing the accuracy. Tests for validation are continued for up to 114 days after the longest parametrization tests. In conclusion, the model error for the cell capacity loss in the application-based tests is at the end of testing below 1% of the original cell capacity and the maximum relative model error is below 21%.« less
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan.

PubMed

Moore, Amy Lawson; Miller, Terissa M

2018-01-01

The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
Evaluation of tools used to measure calcium and/or dairy consumption in adults.

PubMed

Magarey, Anthea; Baulderstone, Lauren; Yaxley, Alison; Markow, Kylie; Miller, Michelle

2015-05-01

To identify and critique tools for the assessment of Ca and/or dairy intake in adults, in order to ascertain the most accurate and reliable tools available. A systematic review of the literature was conducted using defined inclusion and exclusion criteria. Articles reporting on originally developed tools or testing the reliability or validity of existing tools that measure Ca and/or dairy intake in adults were included. Author-defined criteria for reporting reliability and validity properties were applied. Studies conducted in Western countries. Adults. Thirty papers, utilising thirty-six tools assessing intake of dairy, Ca or both, were identified. Reliability testing was conducted on only two dairy and five Ca tools, with results indicating that only one dairy and two Ca tools were reliable. Validity testing was conducted for all but four Ca-only tools. There was high reliance in validity testing on lower-order tests such as correlation and failure to differentiate between statistical and clinically meaningful differences. Results of the validity testing suggest one dairy and five Ca tools are valid. Thus one tool was considered both reliable and valid for the assessment of dairy intake and only two tools proved reliable and valid for the assessment of Ca intake. While several tools are reliable and valid, their application across adult populations is limited by the populations in which they were tested. These results indicate a need for tools that assess Ca and/or dairy intake in adults to be rigorously tested for reliability and validity.
[The Spanish adapted version of the Children's Communication Checklist identifies disorders of pragmatic use of language and differentiates between clinical subtypes].

PubMed

Crespo-Eguilaz, N; Magallon, S; Sanchez-Carpintero, R; Narbona, J

2016-01-01

The Children's Communication Checklist (CCC) by Bishop is a useful scale for evaluation of pragmatic verbal abilities in school children. The aim of the study is to ascertain the validity and reliability of the CCC in Spanish. Answers to the CCC items by parents of 360 children with normal intelligence were analyzed. There were five groups: 160 control children; 68 children with attention deficit hyperactivity disorder, 77 with procedural non-verbal disorder, 25 children with social communication disorder and 30 with autism spectrum disorder. Investigations included: factorial analysis in order to cluster checklist items, reliability analyses of the proposed scales and discriminant analysis to check whether the scale correctly classifies children with pragmatic verbal abilities. Seven factors were obtained (Kaiser-Meyer-Olkin: 0.852) with moderate similarity with those of the original scale: social relationships, interests, and five more that can be grouped into pragmatic verbal ability (conversational abilities, coherence-comprehension, empathy nonverbal communication and appropriateness). All factors are significantly correlated with each other in the control group, and the five that compose pragmatic verbal ability correlate with each other in the clinical groups (Pearson r). The scales have good reliability (Cronbach's alpha: 0.914). The questionnaire correctly classifies 98.9% of grouped cases with and without pragmatic disorder and 78% of subjects in their appropriate clinical group. Besides, the questionnaire allows to differentiate the pathologies according to the presence and intensity of the symptoms. This Spanish version of the CCC is highly valid and reliable. The proposed statistics can be used as normative-reference values.
Instrumented Static and Dynamic Balance Assessment after Stroke Using Wii Balance Boards: Reliability and Association with Clinical Tests

PubMed Central

Bower, Kelly J.; McGinley, Jennifer L.; Miller, Kimberly J.; Clark, Ross A.

2014-01-01

Background and Objectives The Wii Balance Board (WBB) is a globally accessible device that shows promise as a clinically useful balance assessment tool. Although the WBB has been found to be comparable to a laboratory-grade force platform for obtaining centre of pressure data, it has not been comprehensively studied in clinical populations. The aim of this study was to investigate the measurement properties of tests utilising the WBB in people after stroke. Methods Thirty individuals who were more than three months post-stroke and able to stand unsupported were recruited from a single outpatient rehabilitation facility. Participants performed standardised assessments incorporating the WBB and customised software (static stance with eyes open and closed, static weight-bearing asymmetry, dynamic mediolateral weight shifting and dynamic sit-to-stand) in addition to commonly employed clinical tests (10 Metre Walk Test, Timed Up and Go, Step Test and Functional Reach) on two testing occasions one week apart. Test-retest reliability and construct validity of the WBB tests were investigated. Results All WBB-based outcomes were found to be highly reliable between testing occasions (ICC = 0.82 to 0.98). Correlations were poor to moderate between WBB variables and clinical tests, with the strongest associations observed between task-related activities, such as WBB mediolateral weight shifting and the Step Test. Conclusions The WBB, used with customised software, is a reliable and potentially useful tool for the assessment of balance and weight-bearing asymmetry following stroke. Future research is recommended to further investigate validity and responsiveness. PMID:25541939
Instrumented static and dynamic balance assessment after stroke using Wii Balance Boards: reliability and association with clinical tests.

PubMed

Bower, Kelly J; McGinley, Jennifer L; Miller, Kimberly J; Clark, Ross A

2014-01-01

The Wii Balance Board (WBB) is a globally accessible device that shows promise as a clinically useful balance assessment tool. Although the WBB has been found to be comparable to a laboratory-grade force platform for obtaining centre of pressure data, it has not been comprehensively studied in clinical populations. The aim of this study was to investigate the measurement properties of tests utilising the WBB in people after stroke. Thirty individuals who were more than three months post-stroke and able to stand unsupported were recruited from a single outpatient rehabilitation facility. Participants performed standardised assessments incorporating the WBB and customised software (static stance with eyes open and closed, static weight-bearing asymmetry, dynamic mediolateral weight shifting and dynamic sit-to-stand) in addition to commonly employed clinical tests (10 Metre Walk Test, Timed Up and Go, Step Test and Functional Reach) on two testing occasions one week apart. Test-retest reliability and construct validity of the WBB tests were investigated. All WBB-based outcomes were found to be highly reliable between testing occasions (ICC = 0.82 to 0.98). Correlations were poor to moderate between WBB variables and clinical tests, with the strongest associations observed between task-related activities, such as WBB mediolateral weight shifting and the Step Test. The WBB, used with customised software, is a reliable and potentially useful tool for the assessment of balance and weight-bearing asymmetry following stroke. Future research is recommended to further investigate validity and responsiveness.
Measuring theory of mind across middle childhood: Reliability and validity of the Silent Films and Strange Stories tasks.

PubMed

Devine, Rory T; Hughes, Claire

2016-09-01

Recent years have seen a growth of research on the development of children's ability to reason about others' mental states (or "theory of mind") beyond the narrow confines of the preschool period. The overall aim of this study was to investigate the psychometric properties of a task battery composed of items from Happé's Strange Stories task and Devine and Hughes' Silent Film task. A sample of 460 ethnically and socially diverse children (211 boys) between 7 and 13years of age completed the task battery at two time points separated by 1month. The Strange Stories and Silent Film tasks were strongly correlated even when verbal ability and narrative comprehension were taken into account, and all items loaded onto a single theory-of-mind latent factor. The theory-of-mind latent factor provided reliable estimates of performance across a wide range of theory-of-mind ability and showed no evidence of differential item functioning across gender, ethnicity, or socioeconomic status. The theory-of-mind latent factor also exhibited strong 1-month test-retest reliability, and this stability did not vary as a function of child characteristics. Taken together, these findings provide evidence for the validity and reliability of the Strange Stories and Silent Film task battery as a measure of individual differences in theory of mind suitable for use across middle childhood. We consider the methodological and conceptual implications of these findings for research on theory of mind beyond the preschool years. Copyright © 2015 Elsevier Inc. All rights reserved.
Dichotic listening performance predicts language comprehension.

PubMed

Asbjørnsen, Arve E; Helland, Turid

2006-05-01

Dichotic listening performance is considered a reliable and valid procedure for the assessment of language lateralisation in the brain. However, the documentation of a relationship between language functions and dichotic listening performance is sparse, although it is accepted that dichotic listening measures language perception. In particular, language comprehension should show close correspondence to perception of language stimuli. In the present study, we tested samples of reading-impaired and normally achieving children between 10 and 13 years of age with tests of reading skills, language comprehension, and dichotic listening to consonant-vowel (CV) syllables. A high correlation between the language scores and the dichotic listening performance was expected. However, since the left ear score is believed to be an error when assessing language laterality, covariation was expected for the right ear scores only. In addition, directing attention to one ear input was believed to reduce the influence of random factors, and thus show a more concise estimate of left hemisphere language capacity. Thus, a stronger correlation between language comprehension skills and the dichotic listening performance when attending to the right ear was expected. The analyses yielded a positive correlation between the right ear score in DL and language comprehension, an effect that was stronger when attending to the right ear. The present results confirm the assumption that dichotic listening with CV syllables measures an aspect of language perception and language skills that is related to general language comprehension.
Preliminary development and psychometric evaluation of an unmet needs measure for adolescents and young adults with cancer: the Cancer Needs Questionnaire - Young People (CNQ-YP).

PubMed

Clinton-McHarg, Tara; Carey, Mariko; Sanson-Fisher, Rob; D'Este, Catherine; Shakeshaft, Anthony

2012-01-30

Adolescents and young adult (AYA) cancer survivors may have unique physical, psychological and social needs due to their cancer occurring at a critical phase of development. The aim of this study was to develop a psychometrically rigorous measure of unmet need to capture the specific needs of this group. Items were developed following a comprehensive literature review, focus groups with AYAs, and feedback from health care providers, researchers and other professionals. The measure was pilot tested with 32 AYA cancer survivors recruited through a state-based cancer registry to establish face and content validity. A main sample of 139 AYA cancer patients and survivors were recruited through seven treatment centres and invited to complete the questionnaire. To establish test-retest reliability, a sub-sample of 34 participants completed the measure a second time. Exploratory factor analysis was performed and the measure was assessed for internal consistency, discriminative validity, potential responsiveness and acceptability. The Cancer Needs Questionnaire - Young People (CNQ-YP) has established face and content validity, and acceptability. The final measure has 70 items and six factors: Treatment Environment and Care (33 items); Feelings and Relationships (14 items); Daily Life (12 items); Information and Activities (5 items); Education (3 items); and Work (3 items). All domains achieved Cronbach's alpha values greater than 0.80. Item-to-item test-retest reliability was also high, with all but four items reaching weighted kappa values above 0.60. The CNQ-YP is the first multi-dimensional measure of unmet need which has been developed specifically for AYA cancer patients and survivors. The measure displays a strong factor structure, and excellent internal consistency and test-retest reliability. However, the small sample size has implications for the reliability of the statistical analyses undertaken, particularly the exploratory factor analysis. Future studies with a larger sample are recommended to confirm the factor structure of the measure. Longitudinal studies to establish responsiveness and predictive validity should also be undertaken.
Preliminary development and psychometric evaluation of an unmet needs measure for adolescents and young adults with cancer: the Cancer Needs Questionnaire - Young People (CNQ-YP)

PubMed Central

2012-01-01

Background Adolescents and young adult (AYA) cancer survivors may have unique physical, psychological and social needs due to their cancer occurring at a critical phase of development. The aim of this study was to develop a psychometrically rigorous measure of unmet need to capture the specific needs of this group. Methods Items were developed following a comprehensive literature review, focus groups with AYAs, and feedback from health care providers, researchers and other professionals. The measure was pilot tested with 32 AYA cancer survivors recruited through a state-based cancer registry to establish face and content validity. A main sample of 139 AYA cancer patients and survivors were recruited through seven treatment centres and invited to complete the questionnaire. To establish test-retest reliability, a sub-sample of 34 participants completed the measure a second time. Exploratory factor analysis was performed and the measure was assessed for internal consistency, discriminative validity, potential responsiveness and acceptability. Results The Cancer Needs Questionnaire - Young People (CNQ-YP) has established face and content validity, and acceptability. The final measure has 70 items and six factors: Treatment Environment and Care (33 items); Feelings and Relationships (14 items); Daily Life (12 items); Information and Activities (5 items); Education (3 items); and Work (3 items). All domains achieved Cronbach's alpha values greater than 0.80. Item-to-item test-retest reliability was also high, with all but four items reaching weighted kappa values above 0.60. Conclusions The CNQ-YP is the first multi-dimensional measure of unmet need which has been developed specifically for AYA cancer patients and survivors. The measure displays a strong factor structure, and excellent internal consistency and test-retest reliability. However, the small sample size has implications for the reliability of the statistical analyses undertaken, particularly the exploratory factor analysis. Future studies with a larger sample are recommended to confirm the factor structure of the measure. Longitudinal studies to establish responsiveness and predictive validity should also be undertaken. PMID:22284545
Measurement of availability and accessibility of food among youth: a systematic review of methodological studies.

PubMed

Gebremariam, Mekdes K; Vaqué-Crusellas, Cristina; Andersen, Lene F; Stok, F Marijn; Stelmach-Mardas, Marta; Brug, Johannes; Lien, Nanna

2017-02-14

Comprehensive and psychometrically tested measures of availability and accessibility of food are needed in order to explore availability and accessibility as determinants and predictors of dietary behaviors. The main aim of this systematic review was to update the evidence regarding the psychometric properties of measures of food availability and accessibility among youth. A secondary objective was to assess how availability and accessibility were conceptualized in the included studies. A systematic literature search was conducted using Medline, Embase, PsycINFO and Web of Science. Methodological studies published between January 2010 and March 2016 and reporting on at least one psychometric property of a measure of availability and/or accessibility of food among youth were included. Two reviewers independently extracted data and assessed study quality. Existing criteria were used to interpret reliability and validity parameters. A total of 20 studies were included. While 16 studies included measures of food availability, three included measures of both availability and accessibility; one study included a measure of accessibility only. Different conceptualizations of availability and accessibility were used across the studies. The measures aimed at assessing availability and/or accessibility in the home environment (n = 11), the school (n = 4), stores (n = 3), childcare/early care and education services (n = 2) and restaurants (n = 1). Most studies followed systematic steps in the development of the measures. The most common psychometrics tested for these measures were test-retest reliability and criterion validity. The majority of the measures had satisfactory evidence of reliability and/or validity. None of the included studies assessed the responsiveness of the measures. The review identified several measures of food availability or accessibility among youth with satisfactory evidence of reliability and/or validity. Findings indicate a need for more studies including measures of accessibility and addressing its conceptualization. More testing of some of the identified measures in different population groups is also warranted, as is the development of more measures of food availability and accessibility in the broader environment such as the neighborhood food environment.
Benchmarking Treatment Response in Tourette's Disorder: A Psychometric Evaluation and Signal Detection Analysis of the Parent Tic Questionnaire.

PubMed

Ricketts, Emily J; McGuire, Joseph F; Chang, Susanna; Bose, Deepika; Rasch, Madeline M; Woods, Douglas W; Specht, Matthew W; Walkup, John T; Scahill, Lawrence; Wilhelm, Sabine; Peterson, Alan L; Piacentini, John

2018-01-01

This study assessed the psychometric properties of a parent-reported tic severity measure, the Parent Tic Questionnaire (PTQ), and used the scale to establish guidelines for delineating clinically significant tic treatment response. Participants were 126 children ages 9 to 17 who participated in a randomized controlled trial of Comprehensive Behavioral Intervention for Tics (CBIT). Tic severity was assessed using the Yale Global Tic Severity Scale (YGTSS), Hopkins Motor/Vocal Tic Scale (HMVTS) and PTQ; positive treatment response was defined by a score of 1 (very much improved) or 2 (much improved) on the Clinical Global Impressions - Improvement (CGI-I) scale. Cronbach's alpha and intraclass correlations (ICC) assessed internal consistency and test-retest reliability, with correlations evaluating validity. Receiver- and Quality-Receiver Operating Characteristic analyses assessed the efficiency of percent and raw-reduction cutoffs associated with positive treatment response. The PTQ demonstrated good internal consistency (α = 0.80 to 0.86), excellent test-retest reliability (ICC = .84 to .89), good convergent validity with the YGTSS and HM/VTS, and good discriminant validity from hyperactive, obsessive-compulsive, and externalizing (i.e., aggression and rule-breaking) symptoms. A 55% reduction and 10-point decrease in PTQ Total score were optimal for defining positive treatment response. Findings help standardize tic assessment and provide clinicians with greater clarity in determining clinically meaningful tic symptom change during treatment. Copyright © 2017. Published by Elsevier Ltd.
Development and validation of a five-factor sexual satisfaction and distress scale for women: the Sexual Satisfaction Scale for Women (SSS-W).

PubMed

Meston, Cindy; Trapnell, Paul

2005-01-01

This article presents data based on the responses of over 800 women who contributed to the development of the Sexual Satisfaction Scale for Women (SSS-W). The aim of this study was to develop a comprehensive, multifaceted, valid, and reliable self-report measure of women's sexual satisfaction and distress. Phase I involved the initial selection of items based on past literature and on interviews of women diagnosed with sexual dysfunction and an exploratory factor analysis. Phase II involved an additional administration of the questionnaire, factor analyses, and refinement of the questionnaire items. Phase III involved administration of the final questionnaire to a sample of women with clinically diagnosed sexual dysfunction and controls. Psychometric evaluation of the SSS-W conducted in a sample of women meeting DSM-IV-TR criteria for female sexual dysfunction and in a control sample provided preliminary evidence of reliability and validity. The ability of the SSS-W to discriminate between sexually functional and dysfunctional women was demonstrated for each of the SSS-W domain scores and total score. The SSS-W is a brief, 30-item measure of sexual satisfaction and sexual distress, composed of five domains supported by factor analyses: contentment, communication, compatibility, relational concern, and personal concern. It exhibits sound psychometric properties and has a demonstrated ability to discriminate between clinical and nonclinical samples.
Improving cultural diversity awareness of physical therapy educators.

PubMed

Lazaro, Rolando T; Umphred, Darcy A

2007-01-01

In a climate of increasing diversity in the population of patients requiring physical therapy (PT) services, PT educators must prepare students and future clinicians to work competently in culturally diverse environments. To be able to achieve this goal, PT educators must be culturally competent as well. The purposes of the study were to develop a valid and reliable instrument to assess cultural diversity awareness and to develop an educational workshop to improve cultural diversity awareness of PT academic and clinical educators. Phase 1 of the study involved the development of an instrument to assess cultural diversity awareness. The Cultural Diversity Awareness Questionnaire (CDAQ) was developed, validated for content, analyzed for reliability, and field and pilot tested. Results indicated that the CDAQ has favorable psychometric properties. Phase 2 of the study involved the development and implementation of the Cultural Diversity Workshop (CDW). The seminar contents and class materials were developed, validated, and implemented as a one-day cultural diversity awareness seminar. A one-group, pretest-posttest experimental design was used, with participants who completed the CDAQ before and after the workshop. Results indicated that the workshop was effective in improving cultural diversity awareness of the participants. Results of the workshop evaluation affirmed the achievement of objectives and effectiveness of the facilitator. This study provided a solid initial foundation upon which a comprehensive cultural competence program can be developed.
[Evaluation of Suicide Risk Levels in Hospitals: Validity and Reliability Tests].

PubMed

Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen

2018-05-01

Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m = .54, p < .01). Within the scope of construct validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Psychometric Properties of the Modified Personal Diabetes Questionnaire Among Chinese Patients With Type 2 Diabetes.

PubMed

Cheng, Li; Leung, Doris Y P; Wu, Yu-Ning; Sit, Janet W H; Yang, Miao-Yan; Li, Xiao-Mei

2018-03-01

This study examined the psychometric properties of the Chinese version of the Personal Diabetes Questionnaire (C-PDQ). The PDQ was translated into Chinese using a forward and backward translation approach. After being reviewed by an expert panel, the C-PDQ was administered to a convenience sample of 346 adults with Type 2 diabetes. The Chinese version of the Summary of Diabetes Self-Care Activities (C-SDSCA) was also administered. The results of the exploratory factor analysis revealed a one-factor structure for the Diet Knowledge, Decision-Making, and Eating Problems subscales and a two-factor structure for the barriers-related subscales. The criterion and convergent validity were supported by significant correlations of the subscales of the C-PDQ with the glycated hemoglobin values and the parallel subscales in the C-SDSCA, respectively. The C-PDQ subscales also showed acceptable internal consistency (α = .61-.89) and excellent test-retest reliability (intraclass correlation coefficients: .73-.96). The results provide preliminary support for the reliability and validity of the C-PDQ. This comprehensive, patient-centered instrument could be useful to identify the needs, concerns, and priorities of Chinese patients with type 2 diabetes.
Urban traffic-related determinants of health questionnaire (UTDHQ): an instrument developed for health impact assessments.

PubMed

Nadrian, Haidar; Nedjat, Saharnaz; Taghdisi, Mohammad Hossein; Shojaeizadeh, Davoud

2014-01-01

Traffic and transport is a substantial part of a range of economic, social and environmental factors distinguished to have impact on human health. This paper is a report on a preliminary section of a Health Impact Assessment (HIA) on urban traffic and transport initiatives, being conducted in Sanandaj, Iran. In this preliminary study, the psychometric properties of Urban Traffic related Determinants of Health Questionnaire (UTDHQ) were investigated. Multistage cluster sampling was employed to recruit 476 key informants in Sanandaj from April to June 2013 to participate in the study. The development of UTDHQ began with a comprehensive review of the literature. Then face, content and construct validity as well as reliability were determined. Exploratory Factor Analysis showed optimal reduced solution including 40 items and 8 factors. Three of the factors identified were Physical Environment, Social Environment, Public Services Delivery and Accessibility. UTDHQ demonstrated an appropriate validity, reliability, functionality and simplicity. Despite the need for further studies on UTDHQ, this study showed that it can be a practical and useful tool for conducting HIAs in order to inform decision makers and stakeholders about the health influences of their decisions and measures.
Structured assessment of microsurgery skills in the clinical setting.

PubMed

Chan, WoanYi; Niranjan, Niri; Ramakrishnan, Venkat

2010-08-01

Microsurgery is an essential component in plastic surgery training. Competence has become an important issue in current surgical practice and training. The complexity of microsurgery requires detailed assessment and feedback on skills components. This article proposes a method of Structured Assessment of Microsurgery Skills (SAMS) in a clinical setting. Three types of assessment (i.e., modified Global Rating Score, errors list and summative rating) were incorporated to develop the SAMS method. Clinical anastomoses were recorded on videos using a digital microscope system and were rated by three consultants independently and in a blinded fashion. Fifteen clinical cases of microvascular anastomoses performed by trainees and a consultant microsurgeon were assessed using SAMS. The consultant had consistently the highest scores. Construct validity was also demonstrated by improvement of SAMS scores of microsurgery trainees. The overall inter-rater reliability was strong (alpha=0.78). The SAMS method provides both formative and summative assessment of microsurgery skills. It is demonstrated to be a valid, reliable and feasible assessment tool of operating room performance to provide systematic and comprehensive feedback as part of the learning cycle. Copyright 2009 British Association of Plastic, Reconstructive and Aesthetic Surgeons. Published by Elsevier Ltd. All rights reserved.
Multidisciplinary assessment measure for individuals with disorders of consciousness.

PubMed

Gollega, Ana; Meghji, Chamine; Renton, Sharon; Lazoruk, Arlene; Haynes, Elizabeth; Lawson, Denise; Ostapovitch, MaryAnne

2015-01-01

This study introduces the Comprehensive Assessment Measure for the Minimally Responsive Individual (CAMMRI) and reports on its development, inter-rater reliability, construct validity and clinical value. A multidisciplinary team of therapists developed this measure, which comprises 12 sub-tests that examine three main areas: Response to the Environment, Motor Control and Communication and Swallowing. The sub-tests are scored using a 7-point scale; sub-tests can also be administered individually. The measure was administered during a pilot project and then 1 year later to 12 adult clients with severe acquired brain injury at a long-term rehabilitation programme. The age range of the participants was 18-65 years; individuals were 1.5-10 years post-injury. Comparison measures included the Western Neuro Sensory Stimulation Profile (WNSSP), the Coma Recovery Scale-Revised (CRS-R) and the Chedoke McMaster Impairment Inventory (CMII). Inter-rater reliability of each sub-test ranged from 0.87-1.0, with an average of 0.90 in the first year of the assessments. Validity data supported the use of the CAMMRI for minimally conscious adults with ABI to measure behavioural changes and plan treatment for this population. Future research should focus on using this measure with other neurological populations.
A comprehensive approach to identify reliable reference gene candidates to investigate the link between alcoholism and endocrinology in Sprague-Dawley rats.

PubMed

Taki, Faten A; Abdel-Rahman, Abdel A; Zhang, Baohong

2014-01-01

Gender and hormonal differences are often correlated with alcohol dependence and related complications like addiction and breast cancer. Estrogen (E2) is an important sex hormone because it serves as a key protein involved in organism level signaling pathways. Alcoholism has been reported to affect estrogen receptor signaling; however, identifying the players involved in such multi-faceted syndrome is complex and requires an interdisciplinary approach. In many situations, preliminary investigations included a straight forward, yet informative biotechniques such as gene expression analyses using quantitative real time PCR (qRT-PCR). The validity of qRT-PCR-based conclusions is affected by the choice of reliable internal controls. With this in mind, we compiled a list of 15 commonly used housekeeping genes (HKGs) as potential reference gene candidates in rat biological models. A comprehensive comparison among 5 statistical approaches (geNorm, dCt method, NormFinder, BestKeeper, and RefFinder) was performed to identify the minimal number as well the most stable reference genes required for reliable normalization in experimental rat groups that comprised sham operated (SO), ovariectomized rats in the absence (OVX) or presence of E2 (OVXE2). These rat groups were subdivided into subgroups that received alcohol in liquid diet or isocalroic control liquid diet for 12 weeks. Our results showed that U87, 5S rRNA, GAPDH, and U5a were the most reliable gene candidates for reference genes in heart and brain tissue. However, different gene stability ranking was specific for each tissue input combination. The present preliminary findings highlight the variability in reference gene rankings across different experimental conditions and analytic methods and constitute a fundamental step for gene expression assays.

Educational testing validity and reliability in pharmacy and medical education literature.

PubMed

Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J

2013-12-16

To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; p<0.001). While there were more scholarship of teaching and learning (SoTL) articles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.
[Design and validation of the scale for the detection of violence in courtship in young people in the Sevilla University (Spain)].

PubMed

García-Carpintero, María Ángeles; Rodríguez-Santero, Javier; Porcel-Gálvez, Ana María

To design and validate a specific instrument to detect exercised and suffered in the relations of young couples in violence. Descriptive study of validation clinimetric. Stratified by sex and area of knowledge, which was adopted as inclusion criteria have or have had any relationship. The sample consisted of 447 subjects. We obtained the Multidimensional Scale Dating Violence (EMVN), 32 items with three dimensions: physical and sexual assault, behavior control (cyberbullying, surveillance and harassment) and abuse psicoemocional (disparagement and domination), as a victim or as aggressor. No statistically significant differences were found between the violence exerted and the violence suffered, but it was based on sex. The EMVN is a valid and reliable scale that measures the different elements of violence in couples of young people and you can suppose a resource for the comprehensive detection of violent behaviors in dating relationships that are established among young people. Copyright © 2017 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
Adapting the academic motivation scale for use in pre-tertiary mathematics classrooms

NASA Astrophysics Data System (ADS)

Lim, Siew Yee; Chapman, Elaine

2015-09-01

The Academic Motivation Scale ( ams) is a comprehensive and widely used instrument for assessing motivation based on the self-determination theory. Currently, no such comprehensive instrument exists to assess the different domains of motivation (stipulated by the self-determination theory) in mathematics education at the pre-tertiary level (grades 11 and 12) in Asia. This study adapted the ams for this use and assessed the properties of the adapted instrument with 1610 students from Singapore. Exploratory and confirmatory factor analyses indicated a five-factor structure for the modified instrument (the three original ams intrinsic subscales collapsed into a single factor). Additionally, the modified instrument exhibited good internal consistency (mean α = .88), and satisfactory test-retest reliability over a 1-month interval (mean r xx = .73). The validity of the modified ams was further demonstrated through correlational analyses among scores on its subscales, and with scores on other instruments measuring mathematics attitudes, anxiety and achievement.
Development of HomeSTEAD's physical activity and screen time physical environment inventory.

PubMed

Hales, Derek; Vaughn, Amber E; Mazzucca, Stephanie; Bryant, Maria J; Tabak, Rachel G; McWilliams, Christina; Stevens, June; Ward, Dianne S

2013-12-05

The home environment has a significant influence on children's physical activity, sedentary behavior, dietary intake, and risk for obesity and chronic disease. Our understanding of the most influential factors and how they interact and impact child behavior is limited by current measurement tools, specifically the lack of a comprehensive instrument. HomeSTEAD (the Home Self-administered Tool for Environmental assessment of Activity and Diet) was designed to address this gap. This new tool contains four sections: home physical activity and media equipment inventory, family physical activity and screen time practices, home food inventory, and family food practices. This paper will describe HomeSTEAD's development and present reliability and validity evidence for the first section. The ANGELO framework guided instrument development, and systematic literature reviews helped identify existing items or scales for possible inclusion. Refinement of items was based on expert review and cognitive interviews. Parents of children ages 3-12 years (n = 125) completed the HomeSTEAD survey on three separate occasions over 12-18 days (Time 1, 2, and 3). The Time 1 survey also collected demographic information and parent report of child behaviors. Between Time 1 and 2, staff conducted an in-home observation and measured parent and child BMI. Kappa and intra-class correlations were used to examine reliability (test-retest) and validity (criterion and construct). Reliability and validity was strong for most items (97% having ICC > 0.60 and 72% having r > 0.50, respectively). Items with lower reliability generally had low variation between people. Lower validity estimates (r < 0.30) were more common for items that assessed usability and accessibility, with observers generally rating usability and accessibility lower than parents. Small to moderate, but meaningful, correlations between physical environment factors and BMI, outside time, and screen time were observed (e.g., amount of child portable play equipment in good condition and easy to access was significantly associated with child BMI: r = -0.23), providing evidence of construct validity. The HomeSTEAD instrument represents a clear advancement in the measurement of factors in the home environment related to child weight and weight-related behaviors. HomeSTEAD, in its entirety, represents a useful tool for researchers from which they can draw particular scales of greatest interest and highest relevance to their research questions.
Measuring relational and intrapersonal empowerment: testing instrument validity in a former soviet country with a secular muslim culture.

PubMed

Cheryomukhin, Alexander; Peterson, N Andrew

2014-06-01

Research and evaluation studies measuring the construct of empowerment within international community development and human rights initiatives are rare due to a lack of validated measures appropriate for the cultural context. This study represents an initial effort to develop and test the Brief Azerbaijani Empowerment Scale (BAES), an instrument designed to assess relational and intrapersonal components of psychological empowerment among adult community residents (n = 350) in Azerbaijan, a former Soviet country with a predominantly Muslim culture. Exploratory factor analysis was used to examine the underlying dimensionality of the BAES, and path analysis was used to examine relationships between subscales of the BAES and a set of conceptually relevant variables (i.e., alienation, sense of community, and involvement in community organizations). Findings supported the reliability and validity of the BAES, which may be useful to future efforts to develop more comprehensive measures of intrapersonal and relational empowerment. Implications for future research and practice are discussed.
Development and validation of the Body, Eating, and Exercise Comparison Orientation Measure (BEECOM) among college women.

PubMed

Fitzsimmons-Craft, Ellen E; Bardone-Cone, Anna M; Harney, Megan B

2012-09-01

We constructed and validated a measure of comparison dimensions associated with eating pathology, namely, the body, eating, and exercise comparison orientation measure (BEECOM). Participants were 441 undergraduate women. In Study 1, items were generated and refined via exploratory factor analysis, yielding three interpretable factors (i.e., body, eating, and exercise comparison orientation). Confirmatory factor analysis was then used to confirm the three-factor structure of the BEECOM and to investigate the potential presence of a higher-order factor. Given that the lower-order factors loaded strongly onto a higher-order factor, it is appropriate to use a total BEECOM score, in addition to subscale scores. Further, the BEECOM's scores yielded evidence of internal consistency and construct validity in this sample. Study 2 demonstrated two-week test-retest reliability of the BEECOM among college women. Overall, the BEECOM demonstrated good psychometric properties and may be useful for more comprehensively assessing eating disorder-related social comparison behavior. Copyright © 2012 Elsevier Ltd. All rights reserved.
Father for the first time - development and validation of a questionnaire to assess fathers’ experiences of first childbirth (FTFQ)

PubMed Central

2012-01-01

Background A father’s experience of the birth of his first child is important not only for his birth-giving partner but also for the father himself, his relationship with the mother and the newborn. No validated questionnaire assessing first-time fathers' experiences during childbirth is currently available. Hence, the aim of this study was to develop and validate an instrument to assess first-time fathers’ experiences of childbirth. Method Domains and items were initially derived from interviews with first-time fathers, and supplemented by a literature search and a focus group interview with midwives. The comprehensibility, comprehension and relevance of the items were evaluated by four paternity research experts and a preliminary questionnaire was pilot tested in eight first-time fathers. A revised questionnaire was completed by 200 first-time fathers (response rate = 81%) Exploratory factor analysis using principal component analysis with varimax rotation was performed and multitrait scaling analysis was used to test scaling assumptions. External validity was assessed by means of known-groups analysis. Results Factor analysis yielded four factors comprising 22 items and accounting 48% of the variance. The domains found were Worry, Information, Emotional support and Acceptance. Multitrait analysis confirmed the convergent and discriminant validity of the domains; however, Cronbach’s alpha did not meet conventional reliability standards in two domains. The questionnaire was sensitive to differences between groups of fathers hypothesized to differ on important socio demographic or clinical variables. Conclusions The questionnaire adequately measures important dimensions of first-time fathers’ childbirth experience and may be used to assess aspects of fathers’ experiences during childbirth. To obtain the FTFQ and permission for its use, please contact the corresponding author. PMID:22594834
Father for the first time--development and validation of a questionnaire to assess fathers' experiences of first childbirth (FTFQ).

PubMed

Premberg, Åsa; Taft, Charles; Hellström, Anna-Lena; Berg, Marie

2012-05-17

A father's experience of the birth of his first child is important not only for his birth-giving partner but also for the father himself, his relationship with the mother and the newborn. No validated questionnaire assessing first-time fathers' experiences during childbirth is currently available. Hence, the aim of this study was to develop and validate an instrument to assess first-time fathers' experiences of childbirth. Domains and items were initially derived from interviews with first-time fathers, and supplemented by a literature search and a focus group interview with midwives. The comprehensibility, comprehension and relevance of the items were evaluated by four paternity research experts and a preliminary questionnaire was pilot tested in eight first-time fathers. A revised questionnaire was completed by 200 first-time fathers (response rate = 81%) Exploratory factor analysis using principal component analysis with varimax rotation was performed and multitrait scaling analysis was used to test scaling assumptions. External validity was assessed by means of known-groups analysis. Factor analysis yielded four factors comprising 22 items and accounting 48% of the variance. The domains found were Worry, Information, Emotional support and Acceptance. Multitrait analysis confirmed the convergent and discriminant validity of the domains; however, Cronbach's alpha did not meet conventional reliability standards in two domains. The questionnaire was sensitive to differences between groups of fathers hypothesized to differ on important socio demographic or clinical variables. The questionnaire adequately measures important dimensions of first-time fathers' childbirth experience and may be used to assess aspects of fathers' experiences during childbirth. To obtain the FTFQ and permission for its use, please contact the corresponding author.
Using Ryff's scales of psychological well-being in adolescents in mainland China.

PubMed

Gao, Jie; McLellan, Ros

2018-04-20

Psychological well-being in adolescence has always been a focus of public attention and academic research. Ryff's six-factor model of psychological well-being potentially provides a comprehensive theoretical framework for investigating positive functioning of adolescents. However, previous studies reported inconsistent findings of the reliability and validity of Ryff's Scales of Psychological Well-being (SPWB). The present study aimed to explore whether Ryff's six-factor model of psychological well-being could be applied in Chinese adolescents. The Scales of Psychological Well-being (SPWB) were adapted for assessing the psychological well-being of adolescents in mainland China. 772 adolescents (365 boys to 401 girls, 6 missing gender data, mean age = 13.65) completed the adapted 33-item SPWB. The data was used to examine the reliability and construct validity of the adapted SPWB. Results showed that five of the six sub-scales had acceptable internal consistency of items, except the sub-scale of autonomy. The factorial structure of the SPWB was not as clear-cut as the theoretical framework suggested. Among the models under examination, the six-factor model had better model fit than the hierarchical model and the one-factor model. However, the goodness-of-fit of the six-factor model was hardly acceptable. High factor correlations were identified between the sub-scales of environmental mastery, purpose in life and personal growth. Findings of the present study echoed a number of previous studies which reported inadequate reliability and validity of Ryff's scales. Given the evidence, it was suggested that future adolescent studies should seek to develop more age-specific and context-appropriate items for a better operationalisation of Ryff's theoretical model of psychological well-being.
Reliability and validity of non-radiographic methods of thoracic kyphosis measurement: a systematic review.

PubMed

Barrett, Eva; McCreesh, Karen; Lewis, Jeremy

2014-02-01

A wide array of instruments are available for non-invasive thoracic kyphosis measurement. Guidelines for selecting outcome measures for use in clinical and research practice recommend that properties such as validity and reliability are considered. This systematic review reports on the reliability and validity of non-invasive methods for measuring thoracic kyphosis. A systematic search of 11 electronic databases located studies assessing reliability and/or validity of non-invasive thoracic kyphosis measurement techniques. Two independent reviewers used a critical appraisal tool to assess the quality of retrieved studies. Data was extracted by the primary reviewer. The results were synthesized qualitatively using a level of evidence approach. 27 studies satisfied the eligibility criteria and were included in the review. The reliability, validity and both reliability and validity were investigated by sixteen, two and nine studies respectively. 17/27 studies were deemed to be of high quality. In total, 15 methods of thoracic kyphosis were evaluated in retrieved studies. All investigated methods showed high (ICC ≥ .7) to very high (ICC ≥ .9) levels of reliability. The validity of the methods ranged from low to very high. The strongest levels of evidence for reliability exists in support of the Debrunner kyphometer, Spinal Mouse and Flexicurve index, and for validity supports the arcometer and Flexicurve index. Further reliability and validity studies are required to strengthen the level of evidence for the remaining methods of measurement. This should be addressed by future research. Copyright © 2013 Elsevier Ltd. All rights reserved.
Reliability of an e-PRO Tool of EORTC QLQ-C30 for Measurement of Health-Related Quality of Life in Patients With Breast Cancer: Prospective Randomized Trial.

PubMed

Wallwiener, Markus; Matthies, Lina; Simoes, Elisabeth; Keilmann, Lucia; Hartkopf, Andreas D; Sokolov, Alexander N; Walter, Christina B; Sickenberger, Nina; Wallwiener, Stephanie; Feisst, Manuel; Gass, Paul; Fasching, Peter A; Lux, Michael P; Wallwiener, Diethelm; Taran, Florin-Andrei; Rom, Joachim; Schneeweiss, Andreas; Graf, Joachim; Brucker, Sara Y

2017-09-14

Breast cancer represents the most common malignant disease in women worldwide. As currently systematic palliative treatment only has a limited effect on survival rates, the concept of health-related quality of life (HRQoL) is gaining more and more importance in the therapy setting of metastatic breast cancer. One of the major patient-reported outcomes (PROs) for measuring HRQoL in patients with breast cancer is provided by the European Organization for Research and Treatment of Cancer (EORTC). Currently, paper-based surveys still predominate, as only a few reliable and validated electronic-based questionnaires are available. Facing the possibilities associated with evolving digitalization in medicine, validation of electronic versions of well-established PRO is essential in order to contribute to comprehensive and holistic oncological care and to ensure high quality in cancer research. The aim of this study was to analyze the reliability of a tablet-based measuring application for EORTC QLQ-C30 in German language in patients with adjuvant and (curative) metastatic breast cancer. Paper- and tablet-based questionnaires were completed by a total of 106 female patients with adjuvant and metastatic breast cancer recruited as part of the e-PROCOM study. All patients were required to complete the electronic- (e-PRO) and paper-based versions of the HRQoL EORTC QLQ-C30 questionnaire. A frequency analysis was performed to determine descriptive sociodemographic characteristics. Both dimensions of reliability (parallel forms reliability [Wilcoxon test] and test of internal consistency [Spearman rho and agreement rates for single items, Pearson correlation and Kendall tau for each scale]) were analyzed. High correlations were shown for both dimensions of reliability (parallel forms reliability and internal consistency) in the patient's response behavior between paper- and electronic-based questionnaires. Regarding the test of parallel forms reliability, no significant differences were found in 27 of 30 single items and in 14 of 15 scales, whereas a statistically significant correlation in the test of consistency was found in all 30 single items and all 15 scales. The evaluated e-PRO version of the EORTC QLQ-C30 is reliable for patients with both adjuvant and metastatic breast cancer, showing a high correlation in almost all questions (and in many scales). Thus, we conclude that the validated paper-based PRO assessment and the e-PRO tool are equally valid. However, the reliability should also be analyzed in other prospective trials to ensure that usability is reliable in all patient groups. ClinicalTrials.gov NCT03132506; https://clinicaltrials.gov/ct2/show/NCT03132506 (Archived by WebCite at http://www.webcitation.org/6tRcgQuou). ©Markus Wallwiener, Lina Matthies, Elisabeth Simoes, Lucia Keilmann, Andreas D Hartkopf, Alexander N Sokolov, Christina B Walter, Nina Sickenberger, Stephanie Wallwiener, Manuel Feisst, Paul Gass, Peter A Fasching, Michael P Lux, Diethelm Wallwiener, Florin-Andrei Taran, Joachim Rom, Andreas Schneeweiss, Joachim Graf, Sara Y Brucker. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 14.09.2017.
Assessment of technical and nontechnical skills in surgical residents.

PubMed

Ponton-Carss, Alicia; Kortbeek, John B; Ma, Irene W Y

2016-11-01

Surgical competence encompasses both technical and nontechnical skills. This study seeks to evaluate the validity evidence for a comprehensive surgical skills examination and to examine the relationship between technical and nontechnical skills. Six examination stations assessing both technical and nontechnical skills, conducted yearly for surgical trainees (n = 120) between 2010 and 2014 are included. The assessment tools demonstrated acceptable internal consistency. Interstation reliability for technical skills was low (alpha = .39). Interstation reliability for the nontechnical skills was lower (alpha range -.05 to .31). Nontechnical skills domains were strongly correlated, ranging from r = .65, P < .001 to .86, P < .001. The associations between nontechnical and technical skills were inconsistent, ranging from poor (r = -.06; P = .54) to moderate (r = .45; P < .001). Multiple samplings of integrated technical and nontechnical skills are necessary to assess overall surgical competency. Copyright © 2016 Elsevier Inc. All rights reserved.
The French version of the Nottingham Health Profile. A comparison of items weights with those of the source version.

PubMed

Bucquet, D; Condon, S; Ritchie, K

1990-01-01

The efficient and reliable assessment of general community health requires the development of comprehensive and parsimonious measures of proven validity. The Nottingham Health Profile (NHP) has been demonstrated to be a reliable indicator of common expressions of discomfort and stress in the general population. The present paper describes its linguistic adaptation into French, the derivation of item weights by Thurstone's method of paired comparisons and the comparison of item weights across various sociodemographic groups. There is more similarity than variation on the valuation of the state of health explored by the NHP between the French and the British population as little inter-cultural or inter-linguistic variations were found. The differences in judgement of severity elicited across sociodemographic groups in the French sample cast some doubts on the relevance of general weights for use in population surveys.
Improving the quality of discrete-choice experiments in health: how can we assess validity and reliability?

PubMed

Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P

2017-12-01

The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.
Short Assessment of Health Literacy—Spanish and English: A Comparable Test of Health Literacy for Spanish and English Speakers

PubMed Central

Lee, Shoou-Yih Daniel; Stucky, Brian D; Lee, Jessica Y; Rozier, R Gary; Bender, Deborah E

2010-01-01

Objective The intent of the study was to develop and validate a comparable health literacy test for Spanish-speaking and English-speaking populations. Study Design The design of the instrument, named the Short Assessment of Health Literacy—Spanish and English (SAHL-S&E), combined a word recognition test, as appearing in the Rapid Estimate of Adult Literacy in Medicine (REALM), and a comprehension test using multiple-choice questions designed by an expert panel. We used the item response theory (IRT) in developing and validating the instrument. Data Collection Validation of SAHL-S&E involved testing and comparing the instrument with other health literacy instruments in a sample of 201 Spanish-speaking and 202 English-speaking subjects recruited from the Ambulatory Care Center at the University of North Carolina Healthcare System. Principal Findings Based on IRT analysis, 18 items were retained in the comparable test. The Spanish version of the test, SAHL-S, was highly correlated with other Spanish health literacy instruments, Short Assessment of Health Literacy for Spanish-Speaking Adults (r=0.88, p<.05) and the Spanish Test of Functional Health Literacy in Adults (TOFHLA) (r=0.62, p<.05). The English version, SAHL-E, had high correlations with REALM (r=0.94, p<.05) and the English TOFHLA (r=0.68, p<.05). Significant correlations were found between SAHL-S&E and years of schooling in both Spanish- and English-speaking samples (r=0.15 and 0.39, respectively). SAHL-S&E displayed satisfactory reliability of 0.80 and 0.89 in the Spanish- and English-speaking samples, respectively. IRT analysis indicated that the SAHL-S&E score was highly reliable for individuals with a low level of health literacy. Conclusions The new instrument, SAHL-S&E, has good reliability and validity. It is particularly useful for identifying individuals with low health literacy and could be used to screen for low health literacy among Spanish and English speakers. PMID:20500222
Development and Psychometric Evaluation of an Instrument to Assess Cross-Cultural Competence of Healthcare Professionals (CCCHP)

PubMed Central

Bernhard, Gerda; Knibbe, Ronald A.; von Wolff, Alessa; Dingoyan, Demet; Schulz, Holger; Mösko, Mike

2015-01-01

Background Cultural competence of healthcare professionals (HCPs) is recognized as a strategy to reduce cultural disparities in healthcare. However, standardised, valid and reliable instruments to assess HCPs’ cultural competence are notably lacking. The present study aims to 1) identify the core components of cultural competence from a healthcare perspective, 2) to develop a self-report instrument to assess cultural competence of HCPs and 3) to evaluate the psychometric properties of the new instrument. Methods The conceptual model and initial item pool, which were applied to the cross-cultural competence instrument for the healthcare profession (CCCHP), were derived from an expert survey (n = 23), interviews with HCPs (n = 12), and a broad narrative review on assessment instruments and conceptual models of cultural competence. The item pool was reduced systematically, which resulted in a 59-item instrument. A sample of 336 psychologists, in advanced psychotherapeutic training, and 409 medical students participated, in order to evaluate the construct validity and reliability of the CCCHP. Results Construct validity was supported by principal component analysis, which led to a 32-item six-component solution with 50% of the total variance explained. The different dimensions of HCPs’ cultural competence are: Cross-Cultural Motivation/Curiosity, Cross-Cultural Attitudes, Cross-Cultural Skills, Cross-Cultural Knowledge/Awareness and Cross-Cultural Emotions/Empathy. For the total instrument, the internal consistency reliability was .87 and the dimension’s Cronbach’s α ranged from .54 to .84. The discriminating power of the CCCHP was indicated by statistically significant mean differences in CCCHP subscale scores between predefined groups. Conclusions The 32-item CCCHP exhibits acceptable psychometric properties, particularly content and construct validity to examine HCPs’ cultural competence. The CCCHP with its five dimensions offers a comprehensive assessment of HCPs’ cultural competence, and has the ability to distinguish between groups that are expected to differ in cultural competence. This instrument can foster professional development through systematic self-assessment and thus contributes to improve the quality of patient care. PMID:26641876
Development and Psychometric Evaluation of an Instrument to Assess Cross-Cultural Competence of Healthcare Professionals (CCCHP).

PubMed

Bernhard, Gerda; Knibbe, Ronald A; von Wolff, Alessa; Dingoyan, Demet; Schulz, Holger; Mösko, Mike

2015-01-01

Cultural competence of healthcare professionals (HCPs) is recognized as a strategy to reduce cultural disparities in healthcare. However, standardised, valid and reliable instruments to assess HCPs' cultural competence are notably lacking. The present study aims to 1) identify the core components of cultural competence from a healthcare perspective, 2) to develop a self-report instrument to assess cultural competence of HCPs and 3) to evaluate the psychometric properties of the new instrument. The conceptual model and initial item pool, which were applied to the cross-cultural competence instrument for the healthcare profession (CCCHP), were derived from an expert survey (n = 23), interviews with HCPs (n = 12), and a broad narrative review on assessment instruments and conceptual models of cultural competence. The item pool was reduced systematically, which resulted in a 59-item instrument. A sample of 336 psychologists, in advanced psychotherapeutic training, and 409 medical students participated, in order to evaluate the construct validity and reliability of the CCCHP. Construct validity was supported by principal component analysis, which led to a 32-item six-component solution with 50% of the total variance explained. The different dimensions of HCPs' cultural competence are: Cross-Cultural Motivation/Curiosity, Cross-Cultural Attitudes, Cross-Cultural Skills, Cross-Cultural Knowledge/Awareness and Cross-Cultural Emotions/Empathy. For the total instrument, the internal consistency reliability was .87 and the dimension's Cronbach's α ranged from .54 to .84. The discriminating power of the CCCHP was indicated by statistically significant mean differences in CCCHP subscale scores between predefined groups. The 32-item CCCHP exhibits acceptable psychometric properties, particularly content and construct validity to examine HCPs' cultural competence. The CCCHP with its five dimensions offers a comprehensive assessment of HCPs' cultural competence, and has the ability to distinguish between groups that are expected to differ in cultural competence. This instrument can foster professional development through systematic self-assessment and thus contributes to improve the quality of patient care.
Preliminary validation study of the Russian Birmingham Cognitive Screen.

PubMed

Kuzmina, E; Humphreys, G W; Riddoch, M J; Skvortsov, A A; Weekes, B S

2018-02-01

The Birmingham Cognitive Screen (BCoS) is designed for use with individuals who have acquired language impairment following stroke. Our goal was to develop a Russian version of the BCoS (Rus-BCoS) by translating the battery following cultural and linguistic adaptations and establishing preliminary data on its psychometric properties. Fifty patients with left-hemisphere stroke were recruited, of whom 98% were diagnosed with mild to moderate aphasia. To check whether the Rus-BCoS provides stable and consistent scores, internal consistency, test-retest, and interrater types of reliability were determined. Eight participants with stroke and 20 neurologically intact participants were assessed twice. To inspect the discriminative power of the battery, 63 participants without brain impairment were tested with the Rus-BCoS. Additionally, the Russian version of the Montreal Cognitive Assessment (MoCA), Quantitative Assessment of Speech in Aphasia, and Luria's Neuropsychological Assessment Battery were used to examine convergent validity, sensitivity, and specificity of the Rus-BCoS. The internal consistency as well as test-retest and interrater reliability of the Rus-BCoS satisfied criteria for the research use. Performance on a majority of tasks in the battery correlated significantly with independently validated tests that putatively measure similar cognitive processes. Critically, all patients with aphasia returned nonzero scores in at least one task in all the Rus-BCoS sections, with the exception of the Controlled Attention section where two patients with severe executive control deficits could not perform. The Rus-BCoS shows promise as a comprehensive cognitive screening tool that can be used by clinicians working with Russian-speaking persons experiencing poststroke aphasia after much further validation and development of reliable normative standards. Given a lack of quantitative neuropsychological assessment tools in Russia, however, we contend the Rus-BCoS offers potential benefits to clinicians and patients. However, data from research studies with a broader sample of Russian speakers are needed.
Evaluation of Animal-Based Indicators to Be Used in a Welfare Assessment Protocol for Sheep.

PubMed

Richmond, Susan E; Wemelsfelder, Francoise; de Heredia, Ina Beltran; Ruiz, Roberto; Canali, Elisabetta; Dwyer, Cathy M

2017-01-01

Sheep are managed under a variety of different environments (continually outdoors, partially outdoors with seasonal or diurnal variation, continuously indoors) and for different purposes, which makes assessing welfare challenging. This diversity means that resource-based indicators are not particularly useful and, thus, a welfare assessment scheme for sheep, focusing on animal-based indicators, was developed. We focus specifically on ewes, as the most numerous group of sheep present on farm, although many of the indicators may also have relevance to adult male sheep. Using the Welfare Quality ® framework of four Principles and 12 Criteria, we considered the validity, reliability, and feasibility of 46 putative animal-based indicators derived from the literature for these criteria. Where animal-based indicators were potentially unreliably or were not considered feasible, we also considered the resource-based indicators of access to water, stocking density, and floor slipperiness. With the exception of the criteria "Absence of prolonged thirst," we suggest at least one animal-based indicator for each welfare criterion. As a minimum, face validity was available for all indicators; however, for many, we found evidence of convergent validity and discriminant validity (e.g., lameness as measured by gait score, body condition score). The reliability of most of the physical and health measures has been tested in the field and found to be appropriate for use in welfare assessment. However, for the majority of the proposed behavioral indicators (lying synchrony, social withdrawal, postures associated with pain, vocalizations, stereotypy, vigilance, response to surprise, and human approach test), this still needs to be tested. In conclusion, the comprehensive assessment of sheep welfare through largely animal-based measures is supported by the literature through the use of indicators focusing on specific aspects of sheep biology. Further work is required for some indicators to ensure that measures are reliable when used in commercial settings.
Evaluation of Animal-Based Indicators to Be Used in a Welfare Assessment Protocol for Sheep

PubMed Central

Richmond, Susan E.; Wemelsfelder, Francoise; de Heredia, Ina Beltran; Ruiz, Roberto; Canali, Elisabetta; Dwyer, Cathy M.

2017-01-01

Sheep are managed under a variety of different environments (continually outdoors, partially outdoors with seasonal or diurnal variation, continuously indoors) and for different purposes, which makes assessing welfare challenging. This diversity means that resource-based indicators are not particularly useful and, thus, a welfare assessment scheme for sheep, focusing on animal-based indicators, was developed. We focus specifically on ewes, as the most numerous group of sheep present on farm, although many of the indicators may also have relevance to adult male sheep. Using the Welfare Quality® framework of four Principles and 12 Criteria, we considered the validity, reliability, and feasibility of 46 putative animal-based indicators derived from the literature for these criteria. Where animal-based indicators were potentially unreliably or were not considered feasible, we also considered the resource-based indicators of access to water, stocking density, and floor slipperiness. With the exception of the criteria “Absence of prolonged thirst,” we suggest at least one animal-based indicator for each welfare criterion. As a minimum, face validity was available for all indicators; however, for many, we found evidence of convergent validity and discriminant validity (e.g., lameness as measured by gait score, body condition score). The reliability of most of the physical and health measures has been tested in the field and found to be appropriate for use in welfare assessment. However, for the majority of the proposed behavioral indicators (lying synchrony, social withdrawal, postures associated with pain, vocalizations, stereotypy, vigilance, response to surprise, and human approach test), this still needs to be tested. In conclusion, the comprehensive assessment of sheep welfare through largely animal-based measures is supported by the literature through the use of indicators focusing on specific aspects of sheep biology. Further work is required for some indicators to ensure that measures are reliable when used in commercial settings. PMID:29322048

Clinimetric properties of the Nepali version of the Pain Catastrophizing Scale in individuals with chronic pain

PubMed Central

Thibault, Pascal; Abbott, J Haxby; Jensen, Mark P

2018-01-01

Background Pain catastrophizing is an exaggerated negative cognitive response related to pain. It is commonly assessed using the Pain Catastrophizing Scale (PCS). Translation and validation of the scale in a new language would facilitate cross-cultural comparisons of the role that pain catastrophizing plays in patient function. Purpose The aim of this study was to translate and culturally adapt the PCS into Nepali (Nepali version of PCS [PCS-NP]) and evaluate its clinimetric properties. Methods We translated, cross-culturally adapted, and performed an exploratory factor analysis (EFA) of the PCS-NP in a sample of adults with chronic pain (N=143). We then confirmed the resulting factor model in a separate sample (N=272) and compared this model with 1-, 2-, and 3-factor models previously identified using confirmatory factor analyses (CFAs). We also computed internal consistencies, test–retest reliabilities, standard error of measurement (SEM), minimal detectable change (MDC), and limits of agreement with 95% confidence interval (LOA95%) of the PCS-NP scales. Concurrent validity with measures of depression, anxiety, and pain intensity was assessed by computing Pearson’s correlation coefficients. Results The PCS-NP was comprehensible and culturally acceptable. We extracted a two-factor solution using EFA and confirmed this model using CFAs in the second sample. Adequate fit was also found for a one-factor model and different two- and three-factor models based on prior studies. The PCS-NP scores evidenced excellent reliability and temporal stability, and demonstrated validity via moderate-to-strong associations with measures of depression, anxiety, and pain intensity. The SEM and MDC for the PCS-NP total score were 2.52 and 7.86, respectively (range of PCS scores 0–52). LOA95% was between −15.17 and +16.02 for the total PCS-NP scores. Conclusion The PCS-NP is a valid and reliable instrument to assess pain catastrophizing in Nepalese individuals with chronic pain. PMID:29430196
Measures of frailty in population-based studies: an overview

PubMed Central

2013-01-01

Background Although research productivity in the field of frailty has risen exponentially in recent years, there remains a lack of consensus regarding the measurement of this syndrome. This overview offers three services: first, we provide a comprehensive catalogue of current frailty measures; second, we evaluate their reliability and validity; third, we report on their popularity of use. Methods In order to identify relevant publications, we searched MEDLINE (from its inception in 1948 to May 2011); scrutinized the reference sections of the retrieved articles; and consulted our own files. An indicator of the frequency of use of each frailty instrument was based on the number of times it had been utilized by investigators other than the originators. Results Of the initially retrieved 2,166 papers, 27 original articles described separate frailty scales. The number (range: 1 to 38) and type of items (range of domains: physical functioning, disability, disease, sensory impairment, cognition, nutrition, mood, and social support) included in the frailty instruments varied widely. Reliability and validity had been examined in only 26% (7/27) of the instruments. The predictive validity of these scales for mortality varied: for instance, hazard ratios/odds ratios (95% confidence interval) for mortality risk for frail relative to non-frail people ranged from 1.21 (0.78; 1.87) to 6.03 (3.00; 12.08) for the Phenotype of Frailty and 1.57 (1.41; 1.74) to 10.53 (7.06; 15.70) for the Frailty Index. Among the 150 papers which we found to have used at least one of the 27 frailty instruments, 69% (n = 104) reported on the Phenotype of Frailty, 12% (n = 18) on the Frailty Index, and 19% (n = 28) on one of the remaining 25 instruments. Conclusions Although there are numerous frailty scales currently in use, reliability and validity have rarely been examined. The most evaluated and frequently used measure is the Phenotype of Frailty. PMID:23786540
Development of a Quantitative Measure of Holistic Nursing Care.

PubMed

Kinchen, Elizabeth

2015-09-01

Holistic care has long been a defining attribute of nursing practice. From the earliest years of its formal history, nursing has favored a holistic approach in the care of patients, and such an approach has become more important over time. The expansion of nursing's responsibility in delivering comprehensive primary care, the recognition of the importance of relationship-centered care, and the need for evidence-based legitimation of holistic nursing care and practices to insurance companies, policy-makers, health care providers, and patients highlight the need to examine the holistic properties of nursing care. The Holistic Caring Inventory is a theoretically sound, valid, and reliable tool; however, it does not comprehensively address attributes that have come to define holistic nursing care, necessitating the development of a more current instrument to measure the elements of a holistic perspective in nursing care. The development of a current and more comprehensive measure of holistic nursing care may be critical in demonstrating the importance of a holistic approach to patient care that reflects the principles of relationship-based care, shared decision-making, authentic presence, and pattern recognition. © The Author(s) 2014.
77 FR 56650 - Food and Drug Administration/American Glaucoma Society Workshop on the Validity, Reliability, and...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-09-13

...] Food and Drug Administration/American Glaucoma Society Workshop on the Validity, Reliability, and... entitled ``FDA/American Glaucoma Society (AGS) Workshop on the Validity, Reliability, and Usability of... research. The purpose of this public workshop is to provide a forum for discussing the validity...
Sexual Dysfunction in Breast Cancer Survivors: Cross-Cultural Adaptation of the Sexual Activity Questionnaire for Use in Portugal.

PubMed

da Costa, Filipa Alves; Ribeiro, Manuel Castro; Braga, Sofia; Carvalho, Elisabete; Francisco, Fátima; Miranda, Ana Costa; Moreira, António; Fallowfield, Lesley

2016-09-01

The increasing survivor population of breast cancer has shifted research and practice interests into the impacts of the disease and treatment in quality of life aspects. The lack of tools available in Portuguese to objectively evaluate sexual function led to the development of this study, which aimed to cross-culturally adapt and validate the Sexual Activity Questionnaire for use in Portugal. The questionnaire was translated and back-translated, refined following face-to-face interviews with seven breast cancer survivors, and then self-administered by a larger sample at baseline and a fortnight later to test validity and reliability. Following cognitive debriefing (n = 7), minor changes were made and the Sexual Activity Questionnaire was then tested with 134 breast cancer survivors. A 3-factor structure explained 75.5% of the variance, comprising the Pleasure, Habit and Discomfort scales, all yielding good internal consistency (Cronbach's α > 0.70). Concurrent validity with the FACt-An and the BCPT checklist was good (Spearman's r > 0.65; p-value < 0.001) and reliability acceptable (Cohen's k > 0.444). The Sexual Activity Questionnaire allowed the identification of 23.9% of sexually inactive women, for whom the main reasons were lack of interest or motivation and not having a partner. Patient-reported outcomes led to a more comprehensive and improved approach to cancer, tackling areas previously abandoned. Future research should focus on the validation of this scale in samples with different characteristics and even in the overall population to enable generalizability of the findings. The adapted Sexual Activity Questionnaire is a valid tool for assessing sexual function in breast cancer survivors in Portugal.
[Development and validation of the Inventory of Needs in Memory Impairment (BIG-65): illness-related needs in people with cognitive impairment and dementia].

PubMed

Schmid, R; Eschen, A; Rüegger-Frey, B; Martin, M

2013-06-01

There is growing evidence that individuals with cognitive impairment and dementia require systematic assessment of needs for the selection of optimal treatments. Currently no valid instrument is applicable for illness-related need assessment in this growing population. The purpose of this study was to develop and validate a new instrument ("Bedürfnisinventar bei Gedächtnisstörungen", BIG-65) that systematically assesses illness-related needs. The development was based on an adequate theoretical framework and standardised procedural guidelines and validated to an appropriate sample of individuals attending a Swiss memory clinic (n = 83). The BIG-65 provides a comprehensive range of biopsychosocial and environmental needs items and offers a dementia-friendly structure for the assessment of illness-related needs. The BIG-65 has high face validity and very high test-retest reliability (rtt = 0,916). On average 3.5 (SD = 3.7) unmet needs were assessed. Most frequently mentioned needs were: "forget less" (50%), "better concentration" (23.2%), "information on illness" (20.7%), "information on treatments" (17.1%), "less worry", "less irritable", "improve mood", "improve orientation" (13.4% each). Needs profiles differed between patients with preclinical (subjective cognitive impairment, mild cognitive impairment) and clinical (dementia) diagnosis. The BIG-65 reliably assesses illness-related needs in individuals with moderate dementia. With decreasing cognitive functions or an MMSE <20 points, additional methods such as observation of the emotional expression may be applied. According to our results, individuals with cognitive impairment and dementia pursue individual strategies to stabilize their quality of life level. In addition to the assessment of objective illness symptoms the selection of optimal treatments may profit from a systematic needs assessment to optimally support patients in their individual quality of life strategies.
Validity and Reliability of Field-Based Measures for Assessing Movement Skill Competency in Lifelong Physical Activities: A Systematic Review.

PubMed

Hulteen, Ryan M; Lander, Natalie J; Morgan, Philip J; Barnett, Lisa M; Robertson, Samuel J; Lubans, David R

2015-10-01

It has been suggested that young people should develop competence in a variety of 'lifelong physical activities' to ensure that they can be active across the lifespan. The primary aim of this systematic review is to report the methodological properties, validity, reliability, and test duration of field-based measures that assess movement skill competency in lifelong physical activities. A secondary aim was to clearly define those characteristics unique to lifelong physical activities. A search of four electronic databases (Scopus, SPORTDiscus, ProQuest, and PubMed) was conducted between June 2014 and April 2015 with no date restrictions. Studies addressing the validity and/or reliability of lifelong physical activity tests were reviewed. Included articles were required to assess lifelong physical activities using process-oriented measures, as well as report either one type of validity or reliability. Assessment criteria for methodological quality were adapted from a checklist used in a previous review of sport skill outcome assessments. Movement skill assessments for eight different lifelong physical activities (badminton, cycling, dance, golf, racquetball, resistance training, swimming, and tennis) in 17 studies were identified for inclusion. Methodological quality, validity, reliability, and test duration (time to assess a single participant), for each article were assessed. Moderate to excellent reliability results were found in 16 of 17 studies, with 71% reporting inter-rater reliability and 41% reporting intra-rater reliability. Only four studies in this review reported test-retest reliability. Ten studies reported validity results; content validity was cited in 41% of these studies. Construct validity was reported in 24% of studies, while criterion validity was only reported in 12% of studies. Numerous assessments for lifelong physical activities may exist, yet only assessments for eight lifelong physical activities were included in this review. Generalizability of results may be more applicable if more heterogeneous samples are used in future research. Moderate to excellent levels of inter- and intra-rater reliability were reported in the majority of studies. However, future work should look to establish test-retest reliability. Validity was less commonly reported than reliability, and further types of validity other than content validity need to be established in future research. Specifically, predictive validity of 'lifelong physical activity' movement skill competency is needed to support the assertion that such activities provide the foundation for a lifetime of activity.
Development and initial validation of a brief self-report measure of cognitive dysfunction in fibromyalgia.

PubMed

Kratz, Anna L; Schilling, Stephen G; Goesling, Jenna; Williams, David A

2015-06-01

Pain is often the focus of research and clinical care in fibromyalgia (FM); however, cognitive dysfunction is also a common, distressing, and disabling symptom in FM. Current efforts to address this problem are limited by the lack of a comprehensive, valid measure of subjective cognitive dysfunction in FM that is easily interpretable, accessible, and brief. The purpose of this study was to leverage cognitive functioning item banks that were developed as part of the Patient Reported Outcomes Measurement Information System (PROMIS) to devise a 10-item short form measure of cognitive functioning for use in FM. In study 1, a nationwide (U.S.) sample of 1,035 adults with FM (age range = 18-82, 95.2% female) completed 2 cognitive item pools. Factor analyses and item response theory analyses were used to identify dimensionality and optimally performing items. A recommended 10-item measure, called the Multidimensional Inventory of Subjective Cognitive Impairment (MISCI) was created. In study 2, 232 adults with FM completed the MISCI and a legacy measure of cognitive functioning that is used in FM clinical trials, the Multiple Ability Self-Report Questionnaire (MASQ). The MISCI showed excellent internal reliability, low ceiling/floor effects, and good convergent validity with the MASQ (r = -.82). This paper presents the MISCI, a 10-item measure of cognitive dysfunction in FM, developed through classical test theory and item response theory. This brief but comprehensive measure shows evidence of excellent construct validity through large correlations with a lengthy legacy measure of cognitive functioning. Copyright © 2015 American Pain Society. Published by Elsevier Inc. All rights reserved.
Validity and Reliability of Turkish Male Breast Self-Examination Instrument.

PubMed

Erkin, Özüm; Göl, İlknur

2018-04-01

This study aims to measure the validity and reliability of Turkish male breast self-examination (MBSE) instrument. The methodological study was performed in 2016 at Ege University, Faculty of Nursing, İzmir, Turkey. The MBSE includes ten steps. For validity studies, face validity, content validity, and construct validity (exploratory factor analysis) were done. For reliability study, Kuder Richardson was calculated. The content validity index was found to be 0.94. Kendall W coefficient was 0.80 (p=0.551). The total variance explained by the two factors was found to be 63.24%. Kuder Richardson 21 was done for reliability study and found to be 0.97 for the instrument. The final instrument included 10 steps and two stages. The Turkish version of MBSE is a valid and reliable instrument for early diagnose. The MBSE can be used in Turkish speaking countries and cultures with two stages and 10 steps.
Hopes and Cautions for Instrument-Based Evaluation of Consent Capacity: Results of a Construct Validity Study of Three Instruments

PubMed Central

Moye, Jennifer; Azar, Annin R.; Karel, Michele J.; Gurrera, Ronald J.

2016-01-01

Does instrument based evaluation of consent capacity increase the precision and validity of competency assessment or does ostensible precision provide a false sense of confidence without in fact improving validity? In this paper we critically examine the evidence for construct validity of three instruments for measuring four functional abilities important in consent capacity: understanding, appreciation, reasoning, and expressing a choice. Instrument based assessment of these abilities is compared through investigation of a multi-trait multi-method matrix in 88 older adults with mild to moderate dementia. Results find variable support for validity. There appears to be strong evidence for good hetero-method validity for the measurement of understanding, mixed evidence for validity in the measurement of reasoning, and strong evidence for poor hetero-method validity for the concepts of appreciation and expressing a choice, although the latter is likely due to extreme range restrictions. The development of empirically based tools for use in capacity evaluation should ultimately enhance the reliability and validity of assessment, yet clearly more research is needed to define and measure the constructs of decisional capacity. We would also emphasize that instrument based assessment of capacity is only one part of a comprehensive evaluation of competency which includes consideration of diagnosis, psychiatric and/or cognitive symptomatology, risk involved in the situation, and individual and cultural differences. PMID:27330455
Psychometric instrumentation: reliability and validity of instruments used for clinical practice, evidence-based practice projects and research studies.

PubMed

Mayo, Ann M

2015-01-01

It is important for CNSs and other APNs to consider the reliability and validity of instruments chosen for clinical practice, evidence-based practice projects, or research studies. Psychometric testing uses specific research methods to evaluate the amount of error associated with any particular instrument. Reliability estimates explain more about how well the instrument is designed, whereas validity estimates explain more about scores that are produced by the instrument. An instrument may be architecturally sound overall (reliable), but the same instrument may not be valid. For example, if a specific group does not understand certain well-constructed items, then the instrument does not produce valid scores when used with that group. Many instrument developers may conduct reliability testing only once, yet continue validity testing in different populations over many years. All CNSs should be advocating for the use of reliable instruments that produce valid results. Clinical nurse specialists may find themselves in situations where reliability and validity estimates for some instruments that are being utilized are unknown. In such cases, CNSs should engage key stakeholders to sponsor nursing researchers to pursue this most important work.
Validation and Comprehension of Text Information: Two Sides of the Same Coin

ERIC Educational Resources Information Center

Richter, Tobias

2015-01-01

In psychological research, the comprehension of linguistic information and the knowledge-based assessment of its validity are often regarded as two separate stages of information processing. Recent findings in psycholinguistics and text comprehension research call this two-stage model into question. In particular, validation can affect…
Reliability and validity of generalizable skills instruments for students who are deaf, blind, or visually impaired.

PubMed

Loeding, B L; Greenan, J P

1998-12-01

The study examined the validity and reliability of four assessments, with three instruments per domain. Domains included generalizable mathematics, communication, interpersonal relations, and reasoning skills. Participants were deaf, legally blind, or visually impaired students enrolled in vocational classes at residential secondary schools. The researchers estimated the internal consistency reliability, test-retest reliability, and construct validity correlations of three subinstruments: student self-ratings, teacher ratings, and performance assessments. The data suggest that these instruments are highly internally consistent measures of generalizable vocational skills. Four performance assessments have high-to-moderate test-retest reliability estimates, and were generally considered to possess acceptable validity and reliability.
Internal Consistency, Retest Reliability, and their Implications For Personality Scale Validity

PubMed Central

McCrae, Robert R.; Kurtz, John E.; Yamagata, Shinji; Terracciano, Antonio

2010-01-01

We examined data (N = 34,108) on the differential reliability and validity of facet scales from the NEO Inventories. We evaluated the extent to which (a) psychometric properties of facet scales are generalizable across ages, cultures, and methods of measurement; and (b) validity criteria are associated with different forms of reliability. Composite estimates of facet scale stability, heritability, and cross-observer validity were broadly generalizable. Two estimates of retest reliability were independent predictors of the three validity criteria; none of three estimates of internal consistency was. Available evidence suggests the same pattern of results for other personality inventories. Internal consistency of scales can be useful as a check on data quality, but appears to be of limited utility for evaluating the potential validity of developed scales, and it should not be used as a substitute for retest reliability. Further research on the nature and determinants of retest reliability is needed. PMID:20435807
Methods to Develop the Eye-tem Bank to Measure Ophthalmic Quality of Life.

PubMed

Khadka, Jyoti; Fenwick, Eva; Lamoureux, Ecosse; Pesudovs, Konrad

2016-12-01

There is an increasing demand for high-standard, comprehensive, and reliable patient-reported outcome (PRO) instruments in all the disciplines of health care including in ophthalmology and optometry. Over the past two decades, a plethora of PRO instruments have been developed to assess the impact of eye diseases and their treatments. Despite this large number of instruments, significant shortcomings exist for the measurement of ophthalmic quality of life (QoL). Most PRO instruments are short-form instruments designed for clinical use, but this limits their content coverage often poorly targeting any study population other than that which they were developed for. Also, existing instruments are static paper and pencil based and unable to be updated easily leading to outdated and irrelevant item content. Scores obtained from different PRO instruments may not be directly comparable. These shortcomings can be addressed using item banking implemented with computer-adaptive testing (CAT). Therefore, we designed a multicenter project (The Eye-tem Bank project) to develop and validate such PROs to enable comprehensive measurement of ophthalmic QoL in eye diseases. Development of the Eye-tem Bank follows four phases: Phase I, Content Development; Phase II, Pilot Testing and Item Calibration; Phase III, Validation; and Phase IV, Evaluation. This project will deliver technologically advanced comprehensive QoL PROs in the form of item banking implemented via a CAT system in eye diseases. Here, we present a detailed methodological framework of this project.
Psychometric Properties of Language Assessments for Children Aged 4–12 Years: A Systematic Review

PubMed Central

Denman, Deborah; Speyer, Renée; Munro, Natalie; Pearce, Wendy M.; Chen, Yu-Wei; Cordier, Reinie

2017-01-01

Introduction: Standardized assessments are widely used by speech pathologists in clinical and research settings to evaluate the language abilities of school-aged children and inform decisions about diagnosis, eligibility for services and intervention. Given the significance of these decisions, it is important that assessments have sound psychometric properties. Objective: The aim of this systematic review was to examine the psychometric quality of currently available comprehensive language assessments for school-aged children and identify assessments with the best evidence for use. Methods: Using the PRISMA framework as a guideline, a search of five databases and a review of websites and textbooks was undertaken to identify language assessments and published material on the reliability and validity of these assessments. The methodological quality of selected studies was evaluated using the COSMIN taxonomy and checklist. Results: Fifteen assessments were evaluated. For most assessments evidence of hypothesis testing (convergent and discriminant validity) was identified; with a smaller number of assessments having some evidence of reliability and content validity. No assessments presented with evidence of structural validity, internal consistency or error measurement. Overall, all assessments were identified as having limitations with regards to evidence of psychometric quality. Conclusions: Further research is required to provide good evidence of psychometric quality for currently available language assessments. Of the assessments evaluated, the Assessment of Literacy and Language, the Clinical Evaluation of Language Fundamentals-5th Edition, the Clinical Evaluation of Language Fundamentals-Preschool: 2nd Edition and the Preschool Language Scales-5th Edition presented with most evidence and are thus recommended for use. PMID:28936189
Validation of the Minority Stress Scale Among Italian Gay and Bisexual Men

PubMed Central

Pala, Andrea Norcini; Dell’Amore, Francesca; Steca, Patrizia; Clinton, Lauren; Sandfort, Theodorus; Rael, Christine

2017-01-01

The experience of sexual orientation stigma (e.g., homophobic discrimination and physical aggression) generates minority stress, a chronic form of psychosocial stress. Minority stress has been shown to have a negative effect on gay and bisexual men’s (GBM’s) mental and physical health, increasing the rates of depression, suicidal ideation, and HIV risk behaviors. In conservative religious settings, such as Italy, sexual orientation stigma can be more frequently and/or more intensively experienced. However, minority stress among Italian GBM remains understudied. The aim of this study was to explore the dimensionality, internal reliability, and convergent validity of the Minority Stress Scale (MSS), a comprehensive instrument designed to assess the manifestations of sexual orientation stigma. The MSS consists of 50 items assessing (a) Structural Stigma, (b) Enacted Stigma, (c) Expectations of Discrimination, (d) Sexual Orientation Concealment, (e) Internalized Homophobia Toward Others, (f) Internalized Homophobia toward Oneself, and (g) Stigma Awareness. We recruited an online sample of 451 Italian GBM to take the MSS. We tested convergent validity using the Perceived Stress Questionnaire. Through exploratory factor analysis, we extracted the 7 theoretical factors and an additional 3-item factor assessing Expectations of Discrimination From Family Members. The MSS factors showed good internal reliability (ordinal α > .81) and good convergent validity. Our scale can be suitable for applications in research settings, psychosocial interventions, and, potentially, in clinical practice. Future studies will be conducted to further investigate the properties of the MSS, exploring the association with additional health-related measures (e.g., depressive symptoms and anxiety). PMID:29479555
Study on the Validity and Reliability of Melbourne Decision Making Scale in Turkey

ERIC Educational Resources Information Center

Çolakkadioglu, Oguzhan; Deniz, M. Engin

2015-01-01

This study is to analyze the validity and reliability of Melbourne Decision Making Questionnaire (MDMQ). The sample consisted of 650 university students. The structural validity of the MDMQ, as well as correlations among its sub-scales, measure-bound validity, internal consistency, item total correlations and test-retest reliability coefficients…
A Model for Estimating the Reliability and Validity of Criterion-Referenced Measures.

ERIC Educational Resources Information Center

Edmonston, Leon P.; Randall, Robert S.

A decision model designed to determine the reliability and validity of criterion referenced measures (CRMs) is presented. General procedures which pertain to the model are discussed as to: Measures of relationship, Reliability, Validity (content, criterion-oriented, and construct validation), and Item Analysis. The decision model is presented in…
A transversal multicenter study assessing functioning, disability and environmental factors with the comprehensive ICF core set for low back pain in Brazil.

PubMed

Riberto, M; Chiappetta, L M; Lopes, K A; Chiappetta, L R

2014-04-01

Low back pain is a leading cause of disability in Brazil. The multiple aspects of disability in these patients require comprehensive tools for their assessment. The International Classification of Functioning, Disability, and Health (ICF) core set for low back pain is designed to comprehensively describe the experience of such patients with their functioning. This study aimed to describe functioning and contextual factors and to empirically validate the ICF core set for low back pain. Cross sectional study. Three outpatient clinics in Manaus, Maceio and São Paulo, Brazil. Population. 135 low back pain outpatients under rehabilitation. Data concerning diagnosis, personal features, and the 78 ICF core set categories for low back pain were collected from clinical charts, physical examinations, tests, and interviews with patients from rehabilitation services in three parts of Brazil. 7.7% of the categories (6 body functions and 10 activity and participation) were affected in less than 20% of the sample, and were thus considered not validated. Pain and other sensations related to the musculoskeletal system were the body most frequently impaired functions. Mobility and domestic life were the chapters of activity and limitation most often described as limited. All environmental factors were qualified as either facilitators or barriers and acted as modulators of disability. The comprehensive ICF core sets for low back pain can be used to describe the living experience of such individuals, although efforts to make it operational and enhance the reproducibility of the results are needed to warrant its reliable routine use. This study highlights the importance of a complete assessment of chronic low back pain and demonstrate the need for multidisciplinary approach.

Development of family and dietary habits questionnaires: the assessment of family processes, dietary habits and adolescents' impulsiveness in Norwegian adolescents and their parents.

PubMed

Bjelland, Mona; Hausken, Solveig E S; Sleddens, Ester F C; Andersen, Lene F; Lie, Hanne C; Finset, Arnstein; Maes, Lea; Melbye, Elisabeth L; Glavin, Kari; Hanssen-Bauer, Merete W; Lien, Nanna

2014-10-15

There is a need for valid and comprehensive measures of parental influence on children's energy balance-related behaviours (EBRB). Such measures should be based on a theoretical framework, acknowledging the dynamic and complex nature of interactions occurring within a family. The aim of the Family & Dietary habits (F&D) project was to develop a conceptual framework identifying important and changeable family processes influencing dietary behaviours of 13-15 year olds. A second aim was to develop valid and reliable questionnaires for adolescents and their parents (both mothers and fathers) measuring these processes. A stepwise approach was used; (1) preparation of scope and structure, (2) development of the F&D questionnaires, (3) the conducting of pilot studies and (4) the conducting of validation studies (assessing internal reliability, test-retest reliability and confirmatory factor analysis) using data from a cross-sectional study. The conceptual framework includes psychosocial concepts such as family functioning, cohesion, conflicts, communication, work-family stress, parental practices and parental style. The physical characteristics of the home environment include accessibility and availability of different food items, while family meals are the sociocultural setting included. Individual characteristics measured are dietary intake (vegetables and sugar-sweetened beverages) and adolescents' impulsivity. The F&D questionnaires developed were tested in a test-retest (54 adolescents and 44 of their parents) and in a cross-sectional survey including 440 adolescents (13-15 year olds), 242 mothers and 155 fathers. The samples appear to be relatively representative for Norwegian adolescents and parents. For adolescents, mothers and fathers, the test-retest reliability of the dietary intake, frequencies of (family) meals, work-family stress and communication variables was satisfactory (ICC: 0.53-0.99). Barratt Impulsiveness Scale-Brief (BIS-Brief) was included, assessing adolescent's impulsivity. The internal reliability (Cronbach's alphas: 0.77/0.82) and test-retest reliability values (ICC: 0.74/0.77) of BIS-Brief were good. The conceptual framework developed may be a useful tool in guiding measurement and assessment of the home food environment and family processes related to adolescents' dietary habits, in particular and for EBRBs more generally. The results support the use of the F&D questionnaires as psychometrically sound tools to assess family characteristics and adolescent's impulsivity.
Task-oriented evaluation of electronic medical records systems: development and validation of a questionnaire for physicians

PubMed Central

2004-01-01

Background Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. Methods We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). Results In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. Conclusion This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system. PMID:15018620
Task-oriented evaluation of electronic medical records systems: development and validation of a questionnaire for physicians.

PubMed

Laerum, Hallvard; Faxvaag, Arild

2004-02-09

Evaluation is a challenging but necessary part of the development cycle of clinical information systems like the electronic medical records (EMR) system. It is believed that such evaluations should include multiple perspectives, be comparative and employ both qualitative and quantitative methods. Self-administered questionnaires are frequently used as a quantitative evaluation method in medical informatics, but very few validated questionnaires address clinical use of EMR systems. We have developed a task-oriented questionnaire for evaluating EMR systems from the clinician's perspective. The key feature of the questionnaire is a list of 24 general clinical tasks. It is applicable to physicians of most specialties and covers essential parts of their information-oriented work. The task list appears in two separate sections, about EMR use and task performance using the EMR, respectively. By combining these sections, the evaluator may estimate the potential impact of the EMR system on health care delivery. The results may also be compared across time, site or vendor. This paper describes the development, performance and validation of the questionnaire. Its performance is shown in two demonstration studies (n = 219 and 80). Its content is validated in an interview study (n = 10), and its reliability is investigated in a test-retest study (n = 37) and a scaling study (n = 31). In the interviews, the physicians found the general clinical tasks in the questionnaire relevant and comprehensible. The tasks were interpreted concordant to their definitions. However, the physicians found questions about tasks not explicitly or only partially supported by the EMR systems difficult to answer. The two demonstration studies provided unambiguous results and low percentages of missing responses. In addition, criterion validity was demonstrated for a majority of task-oriented questions. Their test-retest reliability was generally high, and the non-standard scale was found symmetric and ordinal. This questionnaire is relevant for clinical work and EMR systems, provides reliable and interpretable results, and may be used as part of any evaluation effort involving the clinician's perspective of an EMR system.
Development, reliability, and validity of the My Child's Play (MCP) questionnaire.

PubMed

Schneider, Eleanor; Rosenblum, Sara

2014-01-01

This article describes the development, reliability, and validity of My Child's Play (MCP), a parent questionnaire designed to evaluate the play of children ages 3-9 yr. The first phase of the study determined the questionnaire's content and face validity. Subsequently, the internal reliability consistency and construct and concurrent validity were demonstrated using 334 completed questionnaires. The MCP showed good internal consistency (α = .86). The factor analysis revealed four distinct factors with acceptable levels of internal reliability (Cronbach's αs = .63-.81) and gender- and age-related differences in play characteristics; both findings attest to the tool's construct validity. Significant correlations (r = .33, p < .0001) with the Parent as a Teacher Inventory demonstrate the MCP's concurrent validity. The MCP demonstrated acceptable reliability and validity. It appears to be a promising standardized assessment tool for use in research and practice to promote understanding of a child's play. Copyright © 2014 by the American Occupational Therapy Association, Inc.
Classical test theory and Rasch analysis validation of the Upper Limb Functional Index in subjects with upper limb musculoskeletal disorders.

PubMed

Bravini, Elisabetta; Franchignoni, Franco; Giordano, Andrea; Sartorio, Francesco; Ferriero, Giorgio; Vercelli, Stefano; Foti, Calogero

2015-01-01

To perform a comprehensive analysis of the psychometric properties and dimensionality of the Upper Limb Functional Index (ULFI) using both classical test theory and Rasch analysis (RA). Prospective, single-group observational design. Freestanding rehabilitation center. Convenience sample of Italian-speaking subjects with upper limb musculoskeletal disorders (N=174). Not applicable. The Italian version of the ULFI. Data were analyzed using parallel analysis, exploratory factor analysis, and RA for evaluating dimensionality, functioning of rating scale categories, item fit, hierarchy of item difficulties, and reliability indices. Parallel analysis revealed 2 factors explaining 32.5% and 10.7% of the response variance. RA confirmed the failure of the unidimensionality assumption, and 6 items out of the 25 misfitted the Rasch model. When the analysis was rerun excluding the misfitting items, the scale showed acceptable fit values, loading meaningfully to a single factor. Item separation reliability and person separation reliability were .98 and .89, respectively. Cronbach alpha was .92. RA revealed weakness of the scale concerning dimensionality and internal construct validity. However, a set of 19 ULFI items defined through the statistical process demonstrated a unidimensional structure, good psychometric properties, and clinical meaningfulness. These findings represent a useful starting point for further analyses of the tool (based on modern psychometric approaches and confirmatory factor analysis) in larger samples, including different patient populations and nationalities. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Development of a patient-administered self-assessment tool (SATp) for follow-up of colorectal cancer patients in general practice.

PubMed

Ngune, Irene; Jiwa, Moyez; McManus, Alexandra; Hughes, Jeff; Parsons, Richard; Hodder, Rupert; Entriken, Fiona

2014-01-01

Treatment for colorectal cancer (CRC) may result in physical, social, and psychological needs that affect patients' quality of life post-treatment. A comprehensive assessment should be conducted to identify these needs in CRC patients post treatment, however, there is a lack of tools and processes available in general practice. This study aimed to develop a patient-completed needs screening tool that identifies potentially unmet physical, psychological, and social needs in CRC and facilitates consultation with a general practitioner (GP) to address these needs. The development of the self-assessment tool for patients (SATp) included a review of the literature; face and content validity with reference to an expert panel; psychometric testing including readability, internal consistency, and test-retest reliability; and usability in clinical practice. The SATp contains 25 questions. The tool had internal consistency (Cronbach's alpha 0.70-0.97), readability (reading ease 82.5%), and test-retest reliability (kappa 0.689-1.000). A total of 66 patients piloted the SATp. Participants were on average 69.2 (SD 9.9) years old and had a median follow-up period of 26.7 months. The SATp identified a total of 547 needs (median 7 needs/per patient; IQR [3-12.25]). Needs were categorised into social (175[32%]), psychological (175[32%]), and physical (197[36%]) domains. SATp is a reliable self-assessment tool useful for identifying CRC patient needs. Further testing of this tool for validity and usability is underway.
Psychometric properties of the parent́s perception uncertainty in illness scale, spanish version.

PubMed

Suarez-Acuña, C E; Carvajal-Carrascal, G; Serrano-Gómez, M E

2018-03-27

To analyze the psychometric properties of the Parents' Perception of Uncertainty in Illness Scale, parents/children, adapted to Spanish. A descriptive methodological study involving the translation into Spanish of the Parents' Perception of Uncertainty in Illness Scale, parents/children, and analysis of their face validity, content validity, construct validity and internal consistency. The original version of the scale in English was translated into Spanish, and approved by its author. Six face validity items with comprehension difficulty were reported; which were reviewed and adapted, keeping its structure. The global content validity index with expert appraisal was 0.94. In the exploratory analysis of factors, 3 dimensions were identified: ambiguity and lack of information, unpredictability and lack of clarity, with a KMO=0.846, which accumulated 91.5% of the explained variance. The internal consistency of the scale yielded a Cronbach alpha of 0.86 demonstrating a good level of correlation between items. The Spanish version of "Parent's Perception of Uncertainty in Illness Scale" is a valid and reliable tool that can be used to determine the level of uncertainty of parents facing the illness of their children. Copyright © 2018 Sociedad Española de Enfermería Intensiva y Unidades Coronarias (SEEIUC). Publicado por Elsevier España, S.L.U. All rights reserved.
Soldier Dimensions in Combat Models

DTIC Science & Technology

1990-05-07

and performance. Questionnaires, SQTs, and ARTEPs were often used. Many scales had estimates of reliability but few had validity data. Most studies...pending its validation . Research plans were provided for applications in simulated combat and with simulation devices, for data previously gathered...regarding reliability and validity . Lack of information following an instrument indicates neither reliability nor validity information was provided by the
Methodology Series Module 9: Designing Questionnaires and Clinical Record Forms - Part II.

PubMed

Setia, Maninder Singh

2017-01-01

This article is a continuation of the previous module on designing questionnaires and clinical record form in which we have discussed some basic points about designing the questionnaire and clinical record forms. In this section, we will discuss the reliability and validity of questionnaires. The different types of validity are face validity, content validity, criterion validity, and construct validity. The different types of reliability are test-retest reliability, inter-rater reliability, and intra-rater reliability. Some of these parameters are assessed by subject area experts. However, statistical tests should be used for evaluation of other parameters. Once the questionnaire has been designed, the researcher should pilot test the questionnaire. The items in the questionnaire should be changed based on the feedback from the pilot study participants and the researcher's experience. After the basic structure of the questionnaire has been finalized, the researcher should assess the validity and reliability of the questionnaire or the scale. If an existing standard questionnaire is translated in the local language, the researcher should assess the reliability and validity of the translated questionnaire, and these values should be presented in the manuscript. The decision to use a self- or interviewer-administered, paper- or computer-based questionnaire depends on the nature of the questions, literacy levels of the target population, and resources.
Methodology Series Module 9: Designing Questionnaires and Clinical Record Forms – Part II

PubMed Central

Setia, Maninder Singh

2017-01-01

This article is a continuation of the previous module on designing questionnaires and clinical record form in which we have discussed some basic points about designing the questionnaire and clinical record forms. In this section, we will discuss the reliability and validity of questionnaires. The different types of validity are face validity, content validity, criterion validity, and construct validity. The different types of reliability are test-retest reliability, inter-rater reliability, and intra-rater reliability. Some of these parameters are assessed by subject area experts. However, statistical tests should be used for evaluation of other parameters. Once the questionnaire has been designed, the researcher should pilot test the questionnaire. The items in the questionnaire should be changed based on the feedback from the pilot study participants and the researcher's experience. After the basic structure of the questionnaire has been finalized, the researcher should assess the validity and reliability of the questionnaire or the scale. If an existing standard questionnaire is translated in the local language, the researcher should assess the reliability and validity of the translated questionnaire, and these values should be presented in the manuscript. The decision to use a self- or interviewer-administered, paper- or computer-based questionnaire depends on the nature of the questions, literacy levels of the target population, and resources. PMID:28584367
Reliability and Validity of the Chinese Version of FACIT-AI, a New Tool for Assessing Quality of Life in Patients with Malignant Ascites.

PubMed

Lou, Yanni; Lu, Linghui; Li, Yuan; Liu, Meng; Bredle, Jason M; Jia, Liqun

2015-10-01

The study objective was to determine the reliability and validity of the Chinese version of the Functional Assessment of Chronic Illness Therapy - Ascites Index (FACIT-AI). A forward-backward translation procedure was adopted to develop the Chinese version of the FACIT-AI, which was tested in 69 patients with malignant ascites. Cronbach's α, split-half reliability, and test-retest reliability were used to assess the reliability of the scale. The content validity index was used to assess the content validity, while factor analysis was used for construct validity and correlation analysis was used for criterion validity. The Cronbach's α was 0.772 for the total scale, and the split-half reliability was 0.693. The test-retest correlation was 0.972. The content validity index for the scale was 0.8-1.0. Four factors were extracted by factor analysis, and these contributed 63.51% of the total variance. Item-total correlations ranged from 0.591 to 0.897, and these were correlated with visual analog scale scores (correlation coefficient, 0.889; P<0.01). The Chinese version of the FACIT-AI has good reliability and validity and can be used as a tool to measure quality of life in Chinese patients with malignant ascites.
Global Education Implications of the Foreign Pharmacy Graduate Equivalency Examination

PubMed Central

Clauson, Kevin A.; Latif, David A.; Al-Rousan, Rabaa M.

2010-01-01

Although the Foreign Pharmacy Graduate Equivalency Examination (FPGEE) is not intended to measure educational outcomes or institutional effectiveness, it may be a reliable and valid criterion to assess the quality or success of international pharmacy programs. This comprehensive review describes the evolution and historical milestones of the FPGEE, along with trends in structure, administration, and passing rates, and the impact of country of origin on participant performance. Similarities between the FPGEE and the Pharmacy Curriculum Outcomes Assessment (PCOA) are also explored. This paper aims to provide a global prospective and insight for foreign academic institutions into parameters for evaluating their students' educational capabilities. PMID:20798798
Patient's Perspective on Quality of Teleconsultation Services.

PubMed

Thijssing, Leonie; Tensen, Esmée; Jaspers, Monique

2016-01-01

Patient satisfaction with teleconsultation services can increase their acceptance. Validated and standardized questionnaires to measure the quality aspects of teleconsultation relevant from the patients' perspective are not available yet. We aim to develop such a questionnaire. First, a systematic literature search was performed and focus groups were held to acquire quality aspects of teleconsultations patients perceive as important. Thirty-seven unique quality aspects distilled from these activities, were used for questionnaire development based on the framework of the Consumer Quality Index. In future research, the comprehensiveness, relevance and unambiguousness of the concept questionnaire need to be tested and the reliability and internal cohesion of the questionnaire assessed.
A recursive Bayesian approach for fatigue damage prognosis: An experimental validation at the reliability component level

NASA Astrophysics Data System (ADS)

Gobbato, Maurizio; Kosmatka, John B.; Conte, Joel P.

2014-04-01

Fatigue-induced damage is one of the most uncertain and highly unpredictable failure mechanisms for a large variety of mechanical and structural systems subjected to cyclic and random loads during their service life. A health monitoring system capable of (i) monitoring the critical components of these systems through non-destructive evaluation (NDE) techniques, (ii) assessing their structural integrity, (iii) recursively predicting their remaining fatigue life (RFL), and (iv) providing a cost-efficient reliability-based inspection and maintenance plan (RBIM) is therefore ultimately needed. In contribution to these objectives, the first part of the paper provides an overview and extension of a comprehensive reliability-based fatigue damage prognosis methodology — previously developed by the authors — for recursively predicting and updating the RFL of critical structural components and/or sub-components in aerospace structures. In the second part of the paper, a set of experimental fatigue test data, available in the literature, is used to provide a numerical verification and an experimental validation of the proposed framework at the reliability component level (i.e., single damage mechanism evolving at a single damage location). The results obtained from this study demonstrate (i) the importance and the benefits of a nearly continuous NDE monitoring system, (ii) the efficiency of the recursive Bayesian updating scheme, and (iii) the robustness of the proposed framework in recursively updating and improving the RFL estimations. This study also demonstrates that the proposed methodology can lead to either an extent of the RFL (with a consequent economical gain without compromising the minimum safety requirements) or an increase of safety by detecting a premature fault and therefore avoiding a very costly catastrophic failure.
Development, sensibility, and reliability of the Toronto Axial Spondyloarthritis Questionnaire in inflammatory bowel disease.

PubMed

Alnaqbi, Khalid A; Touma, Zahi; Passalent, Laura; Johnson, Sindhu R; Tomlinson, George A; Carty, Adele; Inman, Robert D

2013-10-01

There is an unacceptable delay in the diagnosis of axial spondyloarthritis (axSpA) in its early stages among patients at high risk, in particular those with inflammatory bowel disease (IBD). Our objectives were to develop a sensible and reliable questionnaire to identify undetected axSpA among patients with IBD. Literature was reviewed for item generation in the Toronto axSpA Questionnaire on IBD (TASQ-IBD). Sensibility of the questionnaire was assessed among healthcare professionals and patients. This assessment was related to purpose and framework (clinical function, clinical justification, and clinical applicability), face validity, comprehensiveness [oligo-variability (limiting the questionnaire to important items) and transparency], replicability, content validity, and feasibility. The test-retest reliability study was administered to 77 patients with established IBD and axSpA. Kappa agreement coefficients and absolute agreement were calculated for items. Three domains included IBD, inflammatory back symptoms, and extraaxial features. The entry criterion required a patient to have IBD and back pain or stiffness that ever persisted for ≥ 3 months. Iterative sensibility assessment involved 16 items and a diagram of the back. Kappa coefficients ranged from 0.81-1.00 for each item. Absolute agreement across all items ranged from 91% to 100%. TASQ-IBD is a newly developed, sensible, and reliable case-finding questionnaire to be administered to patients with IBD who have ever had chronic back pain or stiffness persisting for ≥ 3 months. It should facilitate identification and timely referral of patients with IBD to rheumatologists and minimize the delay in diagnosis of axSpA. Consequently, it should assess the prevalence of axSpA in IBD.
Language Sampling for Preschoolers With Severe Speech Impairments

PubMed Central

Ragsdale, Jamie; Bustos, Aimee

2016-01-01

Purpose The purposes of this investigation were to determine if measures such as mean length of utterance (MLU) and percentage of comprehensible words can be derived reliably from language samples of children with severe speech impairments and if such measures correlate with tools that measure constructs assumed to be related. Method Language samples of 15 preschoolers with severe speech impairments (but receptive language within normal limits) were transcribed independently by 2 transcribers. Nonparametric statistics were used to determine which measures, if any, could be transcribed reliably and to determine if correlations existed between language sample measures and standardized measures of speech, language, and cognition. Results Reliable measures were extracted from the majority of the language samples, including MLU in words, mean number of syllables per utterance, and percentage of comprehensible words. Language sample comprehensibility measures were correlated with a single word comprehensibility task. Also, language sample MLUs and mean length of the participants' 3 longest sentences from the MacArthur–Bates Communicative Development Inventory (Fenson et al., 2006) were correlated. Conclusion Language sampling, given certain modifications, may be used for some 3-to 5-year-old children with normal receptive language who have severe speech impairments to provide reliable expressive language and comprehensibility information. PMID:27552110
Language Sampling for Preschoolers With Severe Speech Impairments.

PubMed

Binger, Cathy; Ragsdale, Jamie; Bustos, Aimee

2016-11-01

The purposes of this investigation were to determine if measures such as mean length of utterance (MLU) and percentage of comprehensible words can be derived reliably from language samples of children with severe speech impairments and if such measures correlate with tools that measure constructs assumed to be related. Language samples of 15 preschoolers with severe speech impairments (but receptive language within normal limits) were transcribed independently by 2 transcribers. Nonparametric statistics were used to determine which measures, if any, could be transcribed reliably and to determine if correlations existed between language sample measures and standardized measures of speech, language, and cognition. Reliable measures were extracted from the majority of the language samples, including MLU in words, mean number of syllables per utterance, and percentage of comprehensible words. Language sample comprehensibility measures were correlated with a single word comprehensibility task. Also, language sample MLUs and mean length of the participants' 3 longest sentences from the MacArthur-Bates Communicative Development Inventory (Fenson et al., 2006) were correlated. Language sampling, given certain modifications, may be used for some 3-to 5-year-old children with normal receptive language who have severe speech impairments to provide reliable expressive language and comprehensibility information.
Diagnosis of schizophrenia: a critical review of current diagnostic systems.

PubMed

Fenton, W S; Mosher, L R; Matthews, S M

1981-01-01

The data relevant to the evaluation of six systems for diagnosing schizophrenia are reviewed. They are summarized in terms of the reliability, predictive validity, specificity, and comprehensiveness of each system. Unfortunately, none, of these systems (Schneider's First-rank Symptoms, New Haven Schizophrenia Index, Flexible System, Feighner Criteria, Research Diagnostic Criteria, and DSM-III) have established construct validity. It is noted therefore that they are all, in a sense, arbitrary. Choosing one over another cannot be data-based. Because the elevation of any one diagnostic system to an official status is thought to be premature, clinicians and researchers alike are advised to exercise caution and openmindedness in their use of DSM-III. There is as yet no evidence that its criteria for schizophrenia are either less arbitrary or better (in identifying a group of "true" schizophrenics) than those of other systems or DSM-II.
Individual safety performance in the construction industry: development and validation of two short scales.

PubMed

DeArmond, Sarah; Smith, April E; Wilson, Christina L; Chen, Peter Y; Cigularov, Konstantin P

2011-05-01

In the current research a short measure of safety performance is developed for use in the construction industry and the relationships between different components of safety performance and safety outcomes (e.g., occupational injuries and work-related pain) are explored within the construction context. This research consists of two field studies. In the first, comprehensive measures of safety compliance and safety participation were shortened and modified to be appropriate for use in construction. Evidence of reliability and validity is provided. Both safety compliance and safety participation were negatively related to occupational injuries, yet these two correlations were not statistically different. In the second study, we investigated the relationships between these two components of safety performance and work-related pain frequency, in addition to replicating Study 1. Safety compliance had a stronger negative relationship with pain than safety participation. Implications for research are discussed. Copyright © 2010 Elsevier Ltd. All rights reserved.
Development and initial validation of a computer-administered health literacy assessment in Spanish and English: FLIGHT/VIDAS.

PubMed

Ownby, Raymond L; Acevedo, Amarilis; Waldrop-Valverde, Drenna; Jacobs, Robin J; Caballero, Joshua; Davenport, Rosemary; Homs, Ana-Maria; Czaja, Sara J; Loewenstein, David

2013-01-01

Current measures of health literacy have been criticized on a number of grounds, including use of a limited range of content, development on small and atypical patient groups, and poor psychometric characteristics. In this paper, we report the development and preliminary validation of a new computer-administered and -scored health literacy measure addressing these limitations. Items in the measure reflect a wide range of content related to health promotion and maintenance as well as care for diseases. The development process has focused on creating a measure that will be useful in both Spanish and English, while not requiring substantial time for clinician training and individual administration and scoring. The items incorporate several formats, including questions based on brief videos, which allow for the assessment of listening comprehension and the skills related to obtaining information on the Internet. In this paper, we report the interim analyses detailing the initial development and pilot testing of the items (phase 1 of the project) in groups of Spanish and English speakers. We then describe phase 2, which included a second round of testing of the items, in new groups of Spanish and English speakers, and evaluation of the new measure's reliability and validity in relation to other measures. Data are presented that show that four scales (general health literacy, numeracy, conceptual knowledge, and listening comprehension), developed through a process of item and factor analyses, have significant relations to existing measures of health literacy.

Assessing reliability and validity measures in managed care studies.

PubMed

Montoya, Isaac D

2003-01-01

To review the reliability and validity literature and develop an understanding of these concepts as applied to managed care studies. Reliability is a test of how well an instrument measures the same input at varying times and under varying conditions. Validity is a test of how accurately an instrument measures what one believes is being measured. A review of reliability and validity instructional material was conducted. Studies of managed care practices and programs abound. However, many of these studies utilize measurement instruments that were developed for other purposes or for a population other than the one being sampled. In other cases, instruments have been developed without any testing of the instrument's performance. The lack of reliability and validity information may limit the value of these studies. This is particularly true when data are collected for one purpose and used for another. The usefulness of certain studies without reliability and validity measures is questionable, especially in cases where the literature contradicts itself
Dynamic MRI to quantify musculoskeletal motion: A systematic review of concurrent validity and reliability, and perspectives for evaluation of musculoskeletal disorders.

PubMed

Borotikar, Bhushan; Lempereur, Mathieu; Lelievre, Mathieu; Burdin, Valérie; Ben Salem, Douraied; Brochard, Sylvain

2017-01-01

To report evidence for the concurrent validity and reliability of dynamic MRI techniques to evaluate in vivo joint and muscle mechanics, and to propose recommendations for their use in the assessment of normal and impaired musculoskeletal function. The search was conducted on articles published in Web of science, PubMed, Scopus, Academic search Premier, and Cochrane Library between 1990 and August 2017. Studies that reported the concurrent validity and/or reliability of dynamic MRI techniques for in vivo evaluation of joint or muscle mechanics were included after assessment by two independent reviewers. Selected articles were assessed using an adapted quality assessment tool and a data extraction process. Results for concurrent validity and reliability were categorized as poor, moderate, or excellent. Twenty articles fulfilled the inclusion criteria with a mean quality assessment score of 66% (±10.4%). Concurrent validity and/or reliability of eight dynamic MRI techniques were reported, with the knee being the most evaluated joint (seven studies). Moderate to excellent concurrent validity and reliability were reported for seven out of eight dynamic MRI techniques. Cine phase contrast and real-time MRI appeared to be the most valid and reliable techniques to evaluate joint motion, and spin tag for muscle motion. Dynamic MRI techniques are promising for the in vivo evaluation of musculoskeletal mechanics; however results should be evaluated with caution since validity and reliability have not been determined for all joints and muscles, nor for many pathological conditions.
A Turkish version of myocardial infarction dimensional assessment scale (TR-MIDAS): reliability-validity assesment.

PubMed

Uysal, Hilal; Ozcan, Şeyda

2011-06-01

Many new measuring devices have been developed so that broader psychometric measurements in the coronary artery disease, disease-specific health status measurements, and identification of the broader quality of life can be performed in the recent years. The study was intended to determine whether, and to what extent, MIDAS is a valid and reliable measurement to the patients suffering from myocardial infarction for the first time in Turkey. The research was conducted with the patients hospitalized and treated with myocardial infarction in the cardiology departments of 2 hospitals in Istanbul, Turkey, between 2007 and 2008. Psychometric evaluations of TR-MIDAS were used for validity studies; language validity, content validity, construct validity were examined. For reliability studies; the tool's internal consistency reliability, Cronbach's alpha reliability coefficient, and test-retest reliability were completed. The instrument's content validity index was determined to be "0.95". Principal component analysis revealed six factors with an eigenvalue >1.5. Cronbach's alpha was found to be 0.89 for total scale which was an acceptable value. The total's test-retest reliability was 0.51 (p<0.01). Data obtained at the end of the study supports that Turkish Myocardial Infarction Dimensional Assessment Scale is a valid and reliable instrument as a disease-specific scale to assess the patients' quality of life suffering from myocardial infarction in Turkey. Copyright © 2010 European Society of Cardiology. Published by Elsevier B.V. All rights reserved.
Development of a Comprehensive Assessment of Food Parenting Practices: The Home Self-Administered Tool for Environmental Assessment of Activity and Diet Family Food Practices Survey.

PubMed

Vaughn, Amber E; Dearth-Wesley, Tracy; Tabak, Rachel G; Bryant, Maria; Ward, Dianne S

2017-02-01

Parents' food parenting practices influence children's dietary intake and risk for obesity and chronic disease. Understanding the influence and interactions between parents' practices and children's behavior is limited by a lack of development and psychometric testing and/or limited scope of current measures. The Home Self-Administered Tool for Environmental Assessment of Activity and Diet (HomeSTEAD) was created to address this gap. This article describes development and psychometric testing of the HomeSTEAD family food practices survey. Between August 2010 and May 2011, a convenience sample of 129 parents of children aged 3 to 12 years were recruited from central North Carolina and completed the self-administered HomeSTEAD survey on three occasions during a 12- to 18-day window. Demographic characteristics and child diet were assessed at Time 1. Child height and weight were measured during the in-home observations (following Time 1 survey). Exploratory factor analysis with Time 1 data was used to identify potential scales. Scales with more than three items were examined for scale reduction. Following this, mean scores were calculated at each time point. Construct validity was assessed by examining Spearman rank correlations between mean scores (Time 1) and children's diet (fruits and vegetables, sugar-sweetened beverages, snacks, sweets) and body mass index (BMI) z scores. Repeated measures analysis of variance was used to examine differences in mean scores between time points, and single-measure intraclass correlations were calculated to examine test-retest reliability between time points. Exploratory factor analysis identified 24 factors and retained 124 items; however, scale reduction narrowed items to 86. The final instrument captures five coercive control practices (16 items), seven autonomy support practices (24 items), and 12 structure practices (46 items). All scales demonstrated good internal reliability (α>.62), 18 factors demonstrated construct validity (significant association with child diet, P<0.05), and 22 demonstrated good reliability (intraclass correlation coefficient>0.61). The HomeSTEAD family food practices survey provides a brief, yet comprehensive and psychometrically sound assessment of food parenting practices. Copyright © 2017 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
Cortical neuroanatomic correlates of symptom severity in primary progressive aphasia

PubMed Central

Sapolsky, D.; Bakkour, A.; Negreira, A.; Nalipinski, P.; Weintraub, S.; Mesulam, M.-M.; Caplan, D.; Dickerson, B.C.

2010-01-01

Objective: To test the validity and reliability of a new measure of clinical impairment in primary progressive aphasia (PPA), the Progressive Aphasia Severity Scale (PASS), and to investigate relationships with MRI-based cortical thickness biomarkers for localizing and quantifying the severity of anatomic abnormalities. Methods: Patients with PPA were rated using the PASS and underwent performance-based language testing and MRI scans that were processed for cortical thickness measures. Results: The level of impairment in PASS fluency, syntax/grammar, and word comprehension showed strong specific correlations with performance-based measures of these domains of language, and demonstrated high interrater reliability. Left inferior frontal thinning correlated with impairment in fluency and grammar/syntax, while left temporopolar thinning correlated with impairment in word comprehension. Discriminant function analysis demonstrated that a combination of left inferior frontal, left temporopolar, and left superior temporal sulcal thickness separated the 3 PPA subtypes from each other with 100% accuracy (87% accuracy in a leave-one-out analysis). Conclusions: The PASS, a novel measure of the severity of clinical impairment within domains of language typically affected in PPA, demonstrates reliable and valid clinical-behavioral properties. Furthermore, the presence of impairment in individual PASS domains demonstrates specific relationships with focal abnormalities in particular brain regions and the severity of impairment is strongly related to the severity of anatomic abnormality within the relevant brain region. These anatomic imaging biomarkers perform well in classifying PPA subtypes. These data provide robust support for the value of this novel clinical measure and the new imaging measure as markers for potential use in clinical research and trials in PPA. GLOSSARY AD = Alzheimer disease; BDAE = Boston Diagnostic Aphasia Examination; CDR = Clinical Dementia Rating; CSB = Cambridge Semantic Battery; ICC = intraclass correlation coefficient; NACC UDS = National Alzheimer's Coordinating Center Uniform Data Set; OC = older control participants; PASS = Progressive Aphasia Severity Scale; PPA = primary progressive aphasia; PPA-G = agrammatic primary progressive aphasia; PPA-L = logopenic primary progressive aphasia; PPA-S = semantic primary progressive aphasia; ROI = region of interest; WAB = Western Aphasia Battery. PMID:20660866
The Korean version of the Sniffin' stick (KVSS) test and its validity in comparison with the cross-cultural smell identification test (CC-SIT).

PubMed

Cho, Jae Hoon; Jeong, Yong Soo; Lee, Yeo Jin; Hong, Seok-Chan; Yoon, Joo-Heon; Kim, Jin Kook

2009-06-01

The Korean Version of the Sniffin' stick (KVSS) is the first olfactory test for Koreans. Although we adopted the Sniffin' Stick, we modified it to make it more suitable for Koreans. KVSS I is a screening test, and KVSS II a more comprehensive test. The aims of this study were to apply the KVSS test and assess its clinical validity and reliability in comparison to CC-SIT. One hundred and seventy-four healthy volunteers and 206 patients with subjective decreased olfaction participated. Each participant was tested with both the CC-SIT and KVSS tests and then the correlation between these two tests was analyzed. The correlation between CC-SIT and KVSS I was 0.720 (p<0.01) and 0.714 between the CC-SIT and KVSS II total scores (p<0.01). When the degree of olfaction based on the KVSS I was used, the mean CC-SIT score was 8.6+/-1.8 for normosmia, 7.3+/-2.2 for hyposmia, and 4.2+/-2.3 for anosmia. When the KVSS II total was applied, the mean CC-SIT score was 8.4+/-1.8 for normosmia, 7.3+/-2.0 for hyposmia, and 3.7+/-2.0 for anosmia. The means of the three group differed significantly in both cases (p<0.01). Thus, the KVSS test demonstrates validity and reliability for Korean in comparison with CC-SIT.
A self-efficacy questionnaire regarding leisure time physical activity: Psychometric properties among Iranian male adolescents.

PubMed

Abasi, Mohammad Hadi; Eslami, Ahmad Ali; Rakhshani, Fatemeh; Shiri, Mansoor

2016-01-01

Attention to different aspects of self-efficacy leads to actual evaluation of self-efficacy about physical activity. This study was carried out in order to design and determine psychometric characteristics of a questionnaire for evaluation of self-efficacy about leisure time physical activity (SELPA) among Iranian adolescent boys, with an emphasis on regulatory self-efficacy. This descriptive-analytic study was conducted in 734 male adolescents aged 15-19 years in Isfahan. After item generation and item selection based on review of literature and other questionnaires, content validity index (CVI) and content validity ratio (CVR) were determined and items were modified employing the opinions of expert panel (N = 10). Comprehensibility of the questionnaire was determined by members of target group (N = 35). Exploratory factors analysis (EFA) was operated on sample 1 (N 1 = 325) and confirmatory factors analysis (CFA) on sample 2 (N 2 = 347). Reliability of SELPA was estimated via internal consistency method. According to EFA, barrier self-efficacy and scheduling self-efficacy are the two main aspects of SELPA with the total variance of 65%. The suggested model was confirmed by CFA and all fitness indices of the corrected model were good. Cronbach's alpha was totally estimated as 0.89 and for barrier and scheduling self-efficacy, it was 0.86 and 0.81, respectively. The results provide some evidence for acceptable validity and reliability of SELPA in Iranian adolescent boys. However, further investigations, especially for evaluation of predictive power of the questionnaire, are necessary.
Longitudinal Models of Reliability and Validity: A Latent Curve Approach.

ERIC Educational Resources Information Center

Tisak, John; Tisak, Marie S.

1996-01-01

Dynamic generalizations of reliability and validity that will incorporate longitudinal or developmental models, using latent curve analysis, are discussed. A latent curve model formulated to depict change is incorporated into the classical definitions of reliability and validity. The approach is illustrated with sociological and psychological…
Scoring Rubric Development: Validity and Reliability.

ERIC Educational Resources Information Center

Moskal, Barbara M.; Leydens, Jon A.

2000-01-01

Provides clear definitions of the terms "validity" and "reliability" in the context of developing scoring rubrics and illustrates these definitions through examples. Also clarifies how validity and reliability may be addressed in the development of scoring rubrics, defined as descriptive scoring schemes developed to guide the analysis of the…
Validity of Highlighting on Text Comprehension

NASA Astrophysics Data System (ADS)

So, Joey C. Y.; Chan, Alan H. S.

2009-10-01

In this study, 38 university students were tested with a Chinese reading task on an LED display under different task conditions for determining the effects of the highlighting and its validity on comprehension performance on light-emitting diodes (LED) display for Chinese reading. Four levels of validity (0%, 33%, 67% and 100%) and a control condition with no highlighting were tested. Each subject was required to perform the five experimental conditions in which different passages were read and comprehended. The results showed that the condition with 100% validity of highlighting was found to have better comprehension performance than other validity levels and conditions with no highlighting. The comprehension score of the condition without highlighting effect was comparatively lower than those highlighting conditions with distracters, though not significant.
Next generation diagnostic molecular pathology: critical appraisal of quality assurance in Europe.

PubMed

Dubbink, Hendrikus J; Deans, Zandra C; Tops, Bastiaan B J; van Kemenade, Folkert J; Koljenović, S; van Krieken, Han J M; Blokx, Willeke A M; Dinjens, Winand N M; Groenen, Patricia J T A

2014-06-01

Tumor evaluation in pathology is more and more based on a combination of traditional histopathology and molecular analysis. Due to the rapid development of new cancer treatments that specifically target aberrant proteins present in tumor cells, treatment decisions are increasingly based on the molecular features of the tumor. Not only the number of patients eligible for targeted precision medicine, but also the number of molecular targets per patient and tumor type is rising. Diagnostic molecular pathology, the discipline that determines the molecular aberrations present in tumors for diagnostic, prognostic or predictive purposes, is faced with true challenges. The laboratories have to meet the need of comprehensive molecular testing using only limited amount of tumor tissue, mostly fixed in formalin and embedded in paraffin (FFPE), in short turnaround time. Choices must be made for analytical methods that provide accurate, reliable and cost-effective results. Validation of the test procedures and results is essential. In addition, participation and good performance in internal (IQA) and external quality assurance (EQA) schemes is mandatory. In this review, we critically evaluate the validation procedure for comprehensive molecular tests as well as the organization of quality assurance and assessment of competence of diagnostic molecular pathology laboratories within Europe. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Different Dimensions of Cognitive Style in Typical and Atypical Cognition: New Evidence and a New Measurement Tool.

PubMed

Mealor, Andy D; Simner, Julia; Rothen, Nicolas; Carmichael, Duncan A; Ward, Jamie

2016-01-01

We developed the Sussex Cognitive Styles Questionnaire (SCSQ) to investigate visual and verbal processing preferences and incorporate global/local processing orientations and systemising into a single, comprehensive measure. In Study 1 (N = 1542), factor analysis revealed six reliable subscales to the final 60 item questionnaire: Imagery Ability (relating to the use of visual mental imagery in everyday life); Technical/Spatial (relating to spatial mental imagery, and numerical and technical cognition); Language & Word Forms; Need for Organisation; Global Bias; and Systemising Tendency. Thus, we replicate previous findings that visual and verbal styles are separable, and that types of imagery can be subdivided. We extend previous research by showing that spatial imagery clusters with other abstract cognitive skills, and demonstrate that global/local bias can be separated from systemising. Study 2 validated the Technical/Spatial and Language & Word Forms factors by showing that they affect performance on memory tasks. In Study 3, we validated Imagery Ability, Technical/Spatial, Language & Word Forms, Global Bias, and Systemising Tendency by issuing the SCSQ to a sample of synaesthetes (N = 121) who report atypical cognitive profiles on these subscales. Thus, the SCSQ consolidates research from traditionally disparate areas of cognitive science into a comprehensive cognitive style measure, which can be used in the general population, and special populations.
Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset

PubMed Central

2012-01-01

Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA) with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO) correctly identified (p < 0.05) microarray data in which genes annotated to differentially expressed GO terms are upregulated. We found that GSEA + MIMGO was slightly less effective than, or comparable to, GSEA (Pearson), a method that uses Pearson’s correlation as a metric, at detecting true differentially expressed GO terms. However, unlike other methods including GSEA (Pearson), GSEA + MIMGO can comprehensively identify the microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively. PMID:23232071
Different Dimensions of Cognitive Style in Typical and Atypical Cognition: New Evidence and a New Measurement Tool

PubMed Central

Mealor, Andy D.; Simner, Julia; Rothen, Nicolas; Carmichael, Duncan A.; Ward, Jamie

2016-01-01

We developed the Sussex Cognitive Styles Questionnaire (SCSQ) to investigate visual and verbal processing preferences and incorporate global/local processing orientations and systemising into a single, comprehensive measure. In Study 1 (N = 1542), factor analysis revealed six reliable subscales to the final 60 item questionnaire: Imagery Ability (relating to the use of visual mental imagery in everyday life); Technical/Spatial (relating to spatial mental imagery, and numerical and technical cognition); Language & Word Forms; Need for Organisation; Global Bias; and Systemising Tendency. Thus, we replicate previous findings that visual and verbal styles are separable, and that types of imagery can be subdivided. We extend previous research by showing that spatial imagery clusters with other abstract cognitive skills, and demonstrate that global/local bias can be separated from systemising. Study 2 validated the Technical/Spatial and Language & Word Forms factors by showing that they affect performance on memory tasks. In Study 3, we validated Imagery Ability, Technical/Spatial, Language & Word Forms, Global Bias, and Systemising Tendency by issuing the SCSQ to a sample of synaesthetes (N = 121) who report atypical cognitive profiles on these subscales. Thus, the SCSQ consolidates research from traditionally disparate areas of cognitive science into a comprehensive cognitive style measure, which can be used in the general population, and special populations. PMID:27191169
Subjective Cognitive Complaints and Objective Cognitive Function in Aging: A Systematic Review and Meta-Analysis of Recent Cross-Sectional Findings.

PubMed

Burmester, Bridget; Leathem, Janet; Merrick, Paul

2016-12-01

Research investigating how subjective cognitive complaints (SCCs) might reliably indicate impairments in objective cognitive functioning has produced highly varied findings, and despite attempts to synthesise this literature (e.g., Jonker et al. International Journal of Geriatric Psychiatry, 15, 983-991, 2000; Reid and MacLullich Dementia and Geriatric Cognitive Disorders, 22(5-6), 471-485, 2006; Crumley et al. Psychology and Aging, 29(2), 250-263, 2014), recent work continues to offer little resolution. This review provides both quantitative and qualitative synthesis of research conducted since the last comprehensive review in 2006, with the aim of identifying reasons for these discrepancies that might provide fruitful avenues for future exploration. Meta-analysis found a small but significant association between SCCs and objective cognitive function, although it was limited by large heterogeneity between studies and evidence of potential publication bias. Often, assessments of SCCs and objective cognitive function were brief or not formally validated. However, studies that employed more comprehensive SCC measures tended to find that SCCs were associated independently with both objective cognitive function and depressive symptoms. Further explicit investigation of how assessment measures relate to reports of SCCs, and the validity of the proposed 'compensation theory' of SCC aetiology, is recommended.
An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

PubMed

Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

2014-05-01

Recent guidelines advocate sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores were determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.
Validation of behaviour measurement instrument of patients with diabetes mellitus and hypertension

NASA Astrophysics Data System (ADS)

Saputri, G. Z.; Akrom; Dini, S. M.

2017-11-01

Non-adherence to the treatment of chronic diseases such as hypertension and Diabetes Mellitus (DM) is a major obstacle in achieving patient therapy targets and quality of life of patients. A comprehensive approach involving pharmacists counselling has shown influences on changes in health behaviour and patient compliance. Behaviour changes in patients are one of the parameters to assess the effectiveness of counselling and education by pharmacists. Therefore, it is necessary to develop questionnaires of behaviour change measurement in DM-hypertension patients. This study aims to develop a measurement instrument in the form of questionnaires in assessing the behaviour change of DM-hypertension patients. Preparation of question items from the questionnaire research instrument refers to some guidelines and previous research references. Test of questionnaire instrument valid was done with expert validation, followed by pilot testing on 10 healthy respondents, and 10 DM-hypertension patients included in the inclusion criteria. Furthermore, field validation test was conducted on 37 patients who had undergone outpatient care at the PKU Muhammadiyah Yogyakarta City Hospital and The Gading Clinic in Yogyakarta. The inclusion criteria were male and female patients, aged 18-65, diagnosed with type 2 diabetes with hypertension who received oral antidiabetic drugs and antihypertensives, and who were not illiterate and co-operative. The data were collected by questionnaire interviews by a standardized pharmacist. The result of validation test using Person correlation shows the value of 0.33. The results of the questionnaire validation test on 37 patients showed 5 items of invalid questions with the value of r <0.33, e: questions 2, 3, 6, 10 and 11, while the other 10 questions show the value of Pearson correlation > 0.33. The reliability value is shown from the Cronbach's alpha value of 0.722 (> 0.6), implying that the questionnaire is reliable for DM-hypertension patients. This Behavioural change questionnaire can be used on DM-hypertension patients, and an FGD approach is required for the development of factors affecting this questionnaire.
The cross-cultural adaptation, reliability, and validity of the Copenhagen Neck Functional Disability Scale in patients with chronic neck pain: Turkish version study.

PubMed

Yapali, Gökmen; Günel, Mintaze Kerem; Karahan, Sevilay

2012-05-15

The study design was cross-cultural adaptation and investigation of reliability and validity of the Copenhagen Neck Functional Disability Scale (CNFDS). The aim of this study was to translate the CNFDS into Turkish language and assess its reliability and validity among patients with neck pain in Turkish population. The CNFDS is a reliable and valid evaluation instrument for disability, but there is no published the Turkish version of the CNFDS. One hundred one subjects who had chronic neck pain were included in this study. The CNFDS, Neck Pain and Disability Scale, and visual analogue scale were administered to all subjects. For investigating test-retest reliability, correlation between CNFDS scores, applied at 1-week interval, intraclass correlation coefficient score for test-retest reliability was 0.86 (95% confidence interval = 0.679-0.935). There was no difference between test-retest scores (P < 0.001). For investigating concurrent validity, correlation between total score of the CNFDS and the mean visual analogue scale was r = 0.73 (P < 0.001). Concurrent validity of the CNFDS was very good. For investigating construct validity, correlation between total score of the CNFDS and the Neck Pain and Disability Scale was r = 0.78 (P < 0.001). Construct validity of the CNFDS was also very good. Our results suggest that the Turkish version of the CNFDS is a reliable and valid instrument for Turkish people.
Development of a Conservative Model Validation Approach for Reliable Analysis

DTIC Science & Technology

2015-01-01

CIE 2015 August 2-5, 2015, Boston, Massachusetts, USA [DRAFT] DETC2015-46982 DEVELOPMENT OF A CONSERVATIVE MODEL VALIDATION APPROACH FOR RELIABLE...obtain a conservative simulation model for reliable design even with limited experimental data. Very little research has taken into account the...3, the proposed conservative model validation is briefly compared to the conventional model validation approach. Section 4 describes how to account
Evaluation of Validity and Reliability for Hierarchical Scales Using Latent Variable Modeling

ERIC Educational Resources Information Center

Raykov, Tenko; Marcoulides, George A.

2012-01-01

A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…

The Reliability and Validity of a Scale to Measure Teachers' Attitudes toward Integration in an Australian Context.

ERIC Educational Resources Information Center

Roberts, Clare; Pratt, Chris

1988-01-01

The study evaluated the psychometric properties of reliability and construct validity of the Attitude Toward Mainstreaming Scale (ATMS) in an Australian context. It was concluded that the scale is both reliable and factorially valid in an Australian context. (Author/DB)
Self-esteem among nursing assistants: reliability and validity of the Rosenberg Self-Esteem Scale.

PubMed

McMullen, Tara; Resnick, Barbara

2013-01-01

To establish the reliability and validity of the Rosenberg Self-Esteem Scale (RSES) when used with nursing assistants (NAs). Testing the RSES used baseline data from a randomized controlled trial testing the Res-Care Intervention. Female NAs were recruited from nursing homes (n = 508). Validity testing for the positive and negative subscales of the RSES was based on confirmatory factor analysis (CFA) using structural equation modeling and Rasch analysis. Estimates of reliability were based on Rasch analysis and the person separation index. Evidence supports the reliability and validity of the RSES in NAs although we recommend minor revisions to the measure for subsequent use. Establishing reliable and valid measures of self-esteem in NAs will facilitate testing of interventions to strengthen workplace self-esteem, job satisfaction, and retention.
Construct Validity and Reliability of the Questionnaire on the Quality of Physician-Patient Interaction in Adults With Hypertension.

PubMed

Hickman, Ronald L; Clochesy, John M; Hetland, Breanna; Alaamri, Marym

2017-04-01

There are limited reliable and valid measures of the patient- provider interaction among adults with hypertension. Therefore, the purpose of this report is to describe the construct validity and reliability of the Questionnaire on the Quality of Physician-Patient Interaction (QQPPI), in community-dwelling adults with hypertension. A convenience sample of 109 participants with hypertension was recruited and administered the QQPPI at baseline and 8 weeks later. The exploratory factor analysis established a 12-item, 2-factor structure for the QQPPI was valid in this sample. The modified QQPPI proved to have sufficient internal consistency and test- retest reliability. The modified QQPPI is a valid and reliable measure of the provider-patient interaction, a construct posited to impact self-management, in adults with hypertension.
Psychometrics of the Home Safety Self-Assessment Tool (HSSAT) to prevent falls in community-dwelling older adults.

PubMed

Tomita, Machiko R; Saharan, Sumandeep; Rajendran, Sheela; Nochajski, Susan M; Schweitzer, Jo A

2014-01-01

OBJECTIVE. To identify psychometric properties of the Home Safety Self-Assessment Tool (HSSAT) to prevent falls in community-dwelling older adults. METHOD. We tested content validity, test-retest reliability, interrater reliability, construct validity, convergent and discriminant validity, and responsiveness to change. RESULTS. The content validity index was .98, the intraclass correlation coefficient for test-retest reliability was .97, and the interrater reliability was .89. The difference on identified risk factors between the use and nonuse of the HSSAT was significant (p = .005). Convergent validity with the Centers for Disease Control and Prevention Home Safety Checklist was high (r = .65), and discriminant validity with fear of falling was very low (r = .10). The responsiveness to change was moderate (standardized response mean = 0.57). CONCLUSION. The HSSAT is a reliable and valid instrument to identify fall risks in a home environment, and the HSSAT booklet is effective as educational material leading to improvement in home safety. Copyright © 2014 by the American Occupational Therapy Association, Inc.
Cross-Cultural Adaptation, Reliability and Validity Study of the Persian Version of the Clinical COPD Questionnaire.

PubMed

Hasanpour, Neda; Attarbashi Moghadam, Behrouz; Sami, Ramin; Tavakol, Kamran

2016-08-01

The clinical COPD questionnaire (CCQ) has been developed to measure the health status of COPD patients. The aim of this study was to translate CCQ into the Persian language and assess the validity and reliability of the translated version. We used a forward-backward procedure to translate the questionnaire. In a cross-sectional study 100 COPD patients and 50 healthy subjects over 40 years old were selected to assess the reliability and construct validity of the instrument. The face and content validity were used for the questionnaire validity. Validity was examined in a population of patients with COPD, using the Persian validated version of the St George's Respiratory Questionnaire (PSGRQ). In order to assess the questionnaire's reliability, the Intraclass correlation coefficient (ICC) and Cronbach's alpha were calculated. Test-retest reliability was tested by re-administering the Persian version of the CCQ (PCCQ) after 1 week. Test-retest carry out of data demonstrates that the PCCQ has excellent reliability (ICC for all 3 domains were higher than 0.9). Internal consistency was found by Cronbach's alpha to be 0.96, 0.94, 0.97, and 0.98 for the symptom, mental state, functional state and total scores respectively. In addition, the correlation between the components of PCCQ and PSGRQ showed satisfactory construct validity. Analyzing the data from healthy subjects and patients divulged that the PCCQ has acceptable discriminant validity. In general, the PCCQ had satisfactory reliability and validity for assessing health-related quality of life status of Iranian COPD patients.
Reliability of conditioned pain modulation: a systematic review

PubMed Central

Kennedy, Donna L.; Kemp, Harriet I.; Ridout, Deborah; Yarnitsky, David; Rice, Andrew S.C.

2016-01-01

Abstract A systematic literature review was undertaken to determine if conditioned pain modulation (CPM) is reliable. Longitudinal, English language observational studies of the repeatability of a CPM test paradigm in adult humans were included. Two independent reviewers assessed the risk of bias in 6 domains; study participation; study attrition; prognostic factor measurement; outcome measurement; confounding and analysis using the Quality in Prognosis Studies (QUIPS) critical assessment tool. Intraclass correlation coefficients (ICCs) less than 0.4 were considered to be poor; 0.4 and 0.59 to be fair; 0.6 and 0.75 good and greater than 0.75 excellent. Ten studies were included in the final review. Meta-analysis was not appropriate because of differences between studies. The intersession reliability of the CPM effect was investigated in 8 studies and reported as good (ICC = 0.6-0.75) in 3 studies and excellent (ICC > 0.75) in subgroups in 2 of those 3. The assessment of risk of bias demonstrated that reporting is not comprehensive for the description of sample demographics, recruitment strategy, and study attrition. The absence of blinding, a lack of control for confounding factors, and lack of standardisation in statistical analysis are common. Conditioned pain modulation is a reliable measure; however, the degree of reliability is heavily dependent on stimulation parameters and study methodology and this warrants consideration for investigators. The validation of CPM as a robust prognostic factor in experimental and clinical pain studies may be facilitated by improvements in the reporting of CPM reliability studies. PMID:27559835
An initial evaluation of the Social Communication Questionnaire for the assessment of autism spectrum disorders in children with Down syndrome.

PubMed

Magyar, Caroline I; Pandolfi, Vincent; Dill, Charles A

2012-02-01

This study investigated the psychometric properties of the Social Communication Questionnaire (SCQ) in a sample of children with Down syndrome (DS), many of whom had a co-occurring autism spectrum disorder (ASD). The SCQ is a widely used ASD screening measure; however, its measurement properties have not been comprehensively evaluated specifically in children with DS, a group that seems to be at higher risk for an ASD. Exploratory and confirmatory factor analyses, scale reliability, convergent and discriminant correlations, significance tests between groups of children with DS and DS + ASD, and diagnostic accuracy analyses were conducted. Factor analyses identified 2 reliable factors that we labeled Social-Communication and Stereotyped Behavior and Unusual Interests. Pearson correlations with Autism Diagnostic Interview-Revised subscales indicated support for the SCQ's convergent validity and some support for the discriminant validity of the factor-based scales. Significance tests and receiver operating characteristic analyses indicated that children with DS + ASD obtained significantly higher SCQ factor-based and total scores than children with DS alone, and that the SCQ Total Score evidenced good sensitivity and adequate specificity. Results indicated initial psychometric support for the SCQ as an ASD screening measure in children with DS. The SCQ should be considered as part of a multimethod evaluation when screening children with DS.
MMPI-2 Symptom Validity (FBS) Scale: psychometric characteristics and limitations in a Veterans Affairs neuropsychological setting.

PubMed

Gass, Carlton S; Odland, Anthony P

2014-01-01

The Minnesota Multiphasic Personality Inventory-2 (MMPI-2) Symptom Validity (Fake Bad Scale [FBS]) Scale is widely used to assist in determining noncredible symptom reporting, despite a paucity of detailed research regarding its itemmetric characteristics. Originally designed for use in civil litigation, the FBS is often used in a variety of clinical settings. The present study explored its fundamental psychometric characteristics in a sample of 303 patients who were consecutively referred for a comprehensive examination in a Veterans Affairs (VA) neuropsychology clinic. FBS internal consistency (reliability) was .77. Its underlying factor structure consisted of three unitary dimensions (Tiredness/Distractibility, Stomach/Head Discomfort, and Claimed Virtue of Self/Others) accounting for 28.5% of the total variance. The FBS's internal structure showed factoral discordance, as Claimed Virtue was negatively related to most of the FBS and to its somatic complaint components. Scores on this 12-item FBS component reflected a denial of socially undesirable attitudes and behaviors (Antisocial Practices Scale) that is commonly expressed by the 1,138 males in the MMPI-2 normative sample. These 12 items significantly reduced FBS reliability, introducing systematic error variance. In this VA neuropsychological referral setting, scores on the FBS have ambiguous meaning because of its structural discordance.
Development of the family symptom inventory: a psychosocial screener for children with hematology/oncology conditions.

PubMed

Karlson, Cynthia W; Haynes, Stacey; Faith, Melissa A; Elkin, Thomas D; Smith, Maria L; Megason, Gail

2015-03-01

A growing body of literature has begun to underscore the importance of integrating family-based comprehensive psychological screening into standard medical care for children with oncology and hematology conditions. There are no known family-based measures designed to screen for clinically significant emotional and behavioral concerns in pediatric oncology and hematology patients. The aim of this study was to develop and evaluate the Family Symptom Inventory (FSI), a brief screener of patient and family member psychological symptoms. The FSI also screens for common comorbid physical symptoms (pain and sleep disturbance) and is designed for use at any point during treatment and follow-up. A total of 488 caregivers completed the FSI during regular hematology/oncology visits for 193 cancer, 219 sickle cell disease, and 76 hematology pediatric patients. Exploratory factor analysis, confirmatory factor analysis, and tests of reliability and preliminary validity were conducted. Exploratory factor analysis suggested a 34-item, 4-factor solution, which was confirmed in an independent sample using confirmatory factor analysis (factor loadings=0.49 to 0.88). The FSI demonstrated good internal reliability (α's=0.86 to 0.92) and good preliminary validity. Regular psychosocial screening throughout the course of treatment and follow-up may lead to improved quality of care for children with oncology and hematology conditions.
Psychometric properties of the Beck Depression Inventory-II: a comprehensive review.

PubMed

Wang, Yuan-Pang; Gorenstein, Clarice

2013-01-01

To review the psychometric properties of the Beck Depression Inventory-II (BDI-II) as a self-report measure of depression in a variety of settings and populations. Relevant studies of the BDI-II were retrieved through a search of electronic databases, a hand search, and contact with authors. Retained studies (k = 118) were allocated into three groups: non-clinical, psychiatric/institutionalized, and medical samples. The internal consistency was described as around 0.9 and the retest reliability ranged from 0.73 to 0.96. The correlation between BDI-II and the Beck Depression Inventory (BDI-I) was high and substantial overlap with measures of depression and anxiety was reported. The criterion-based validity showed good sensitivity and specificity for detecting depression in comparison to the adopted gold standard. However, the cutoff score to screen for depression varied according to the type of sample. Factor analysis showed a robust dimension of general depression composed by two constructs: cognitive-affective and somatic-vegetative. The BDI-II is a relevant psychometric instrument, showing high reliability, capacity to discriminate between depressed and non-depressed subjects, and improved concurrent, content, and structural validity. Based on available psychometric evidence, the BDI-II can be viewed as a cost-effective questionnaire for measuring the severity of depression, with broad applicability for research and clinical practice worldwide.
Revision and psychometric testing of the City of Hope Quality of Life-Ostomy Questionnaire.

PubMed

Grant, Marcia; Ferrell, Betty; Dean, Grace; Uman, Gwen; Chu, David; Krouse, Robert

2004-10-01

Ostomies may be performed for bowel or urinary diversion, and occur in both cancer and non-cancer patients. Impact on physical, psychological, social and spiritual well-being is not unexpected, but has been minimally described in the literature. The City of Hope Quality of Life (COH-QOL)-Ostomy Questionnaire is an adult patient self-report instrument designed to assess quality of life. This report focuses on the revision and psychometric testing of this questionnaire. The revised COH-QOL-Ostomy Questionnaire involved in-depth patient interviews and expert panel review. The format consisted of a 13-item disease and demographic section, a 34-item forced-choice section, and a 41-item linear analogue scaled section. A mailed survey to California members of the United Ostomy Association resulted in a 62% response rate (n = 1513). Factor analysis was conducted to refine the instrument. Construct validity involved testing a number of hypotheses identifying contrasting groups. Factor analysis confirmed the conceptual framework. Reliability of subscales ranged from 0.77 to 0.90. The questionnaire discriminated between subpopulations with specific concerns. Overall, the analyses provide evidence for the validity and reliability of the COH-QOL-Ostomy Questionnaire as a comprehensive, multidimensional self-report questionnaire for measuring quality of life in patients with intestinal ostomies.
A review of the validity and reliability of alcohol retail sales data for monitoring population levels of alcohol consumption: a Scottish perspective.

PubMed

Robinson, Mark; Thorpe, Rachel; Beeston, Clare; McCartney, Gerry

2013-01-01

To assess the validity and reliability of using alcohol retail sales data to measure and monitor population levels of alcohol consumption. Potential sources of bias that could lead to under- or overestimation of population alcohol consumption based on alcohol retail sales data were identified and, where possible, quantified. This enabled an assessment of the potential impact of each bias on alcohol consumption estimates in Scotland. Overall, considering all the possible sources of overestimation and underestimation, and taking into account the potential for sampling variability to impact on the results, the range of uncertainty of consumption during 2010 was from an overestimate of 0.3 l to an underestimate of 2.4 l of pure alcohol per adult. This excludes the impacts of alcohol stockpiling and alcohol sold through outlets not included in the sampling frame. On balance, there is therefore far greater scope for alcohol retail sales data to be underestimating per adult alcohol consumption in Scotland than there is for overestimation. Alcohol retail sales data offer a robust source of data for monitoring per adult alcohol consumption in Scotland. Consideration of the sources of bias and a comprehensive understanding of data collection methods are essential for using sales data to monitor trends in alcohol consumption.
Young Adults’ Belief in Genetic Determinism, and Knowledge and Attitudes towards Modern Genetics and Genomics: The PUGGS Questionnaire

PubMed Central

Carver, Rebecca Bruu; Castéra, Jérémy; Gericke, Niklas; Evangelista, Neima Alice Menezes

2017-01-01

In this paper we present the development and validation a comprehensive questionnaire to assess college students’ knowledge about modern genetics and genomics, their belief in genetic determinism, and their attitudes towards applications of modern genetics and genomic-based technologies. Written in everyday language with minimal jargon, the Public Understanding and Attitudes towards Genetics and Genomics (PUGGS) questionnaire is intended for use in research on science education and public understanding of science, as a means to investigate relationships between knowledge, determinism and attitudes about modern genetics, which are to date little understood. We developed a set of core ideas and initial items from reviewing the scientific literature on genetics and previous studies on public and student knowledge and attitudes about genetics. Seventeen international experts from different fields (e.g., genetics, education, philosophy of science) reviewed the initial items and their feedback was used to revise the questionnaire. We validated the questionnaire in two pilot tests with samples of university freshmen students. The final questionnaire contains 45 items, including both multiple choice and Likert scale response formats. Cronbach alpha showed good reliability for each section of the questionnaire. In conclusion, the PUGGS questionnaire is a reliable tool for investigating public understanding and attitudes towards modern genetics and genomic-based technologies. PMID:28114357
Young Adults' Belief in Genetic Determinism, and Knowledge and Attitudes towards Modern Genetics and Genomics: The PUGGS Questionnaire.

PubMed

Carver, Rebecca Bruu; Castéra, Jérémy; Gericke, Niklas; Evangelista, Neima Alice Menezes; El-Hani, Charbel N

2017-01-01

In this paper we present the development and validation a comprehensive questionnaire to assess college students' knowledge about modern genetics and genomics, their belief in genetic determinism, and their attitudes towards applications of modern genetics and genomic-based technologies. Written in everyday language with minimal jargon, the Public Understanding and Attitudes towards Genetics and Genomics (PUGGS) questionnaire is intended for use in research on science education and public understanding of science, as a means to investigate relationships between knowledge, determinism and attitudes about modern genetics, which are to date little understood. We developed a set of core ideas and initial items from reviewing the scientific literature on genetics and previous studies on public and student knowledge and attitudes about genetics. Seventeen international experts from different fields (e.g., genetics, education, philosophy of science) reviewed the initial items and their feedback was used to revise the questionnaire. We validated the questionnaire in two pilot tests with samples of university freshmen students. The final questionnaire contains 45 items, including both multiple choice and Likert scale response formats. Cronbach alpha showed good reliability for each section of the questionnaire. In conclusion, the PUGGS questionnaire is a reliable tool for investigating public understanding and attitudes towards modern genetics and genomic-based technologies.
Initial development and preliminary validation of a new negative symptom measure: the Clinical Assessment Interview for Negative Symptoms (CAINS).

PubMed

Forbes, Courtney; Blanchard, Jack J; Bennett, Melanie; Horan, William P; Kring, Ann; Gur, Raquel

2010-12-01

As part of an ongoing scale development process, this study provides an initial examination of the psychometric properties and validity of a new interview-based negative symptom instrument, the Clinical Assessment Interview for Negative Symptoms (CAINS), in outpatients with schizophrenia or schizoaffective disorder (N = 37). The scale was designed to address limitations of existing measures and to comprehensively assess five consensus-based negative symptoms: asociality, avolition, anhedonia (consummatory and anticipatory), affective flattening, and alogia. Results indicated satisfactory internal consistency reliability for the total CAINS scale score and promising inter-rater agreement, with clear areas identified in need of improvement. Convergent validity was evident in general agreement between the CAINS and alternative negative symptom measures. Further, CAINS subscales significantly correlated with relevant self-report emotional experience measures as well as with social functioning. Discriminant validity of the CAINS was strongly supported by its small, non-significant relations with positive symptoms, general psychiatric symptoms, and depression. These preliminary data on an early beta-version of the CAINS provide initial support for this new assessment approach to negative symptoms and suggest directions for further scale development. Copyright © 2010 Elsevier B.V. All rights reserved.
Development and validation of the Body and Appearance Self-Conscious Emotions Scale (BASES).

PubMed

Castonguay, Andrée L; Sabiston, Catherine M; Crocker, Peter R E; Mack, Diane E

2014-03-01

The purpose of these studies was to develop a psychometrically sound measure of shame, guilt, authentic pride, and hubristic pride for use in body and appearance contexts. In Study 1, 41 potential items were developed and assessed for item quality and comprehension. In Study 2, a panel of experts (N=8; M=11, SD=6.5 years of experience) reviewed the scale and items for evidence of content validity. Participants in Study 3 (n=135 males, n=300 females) completed the BASES and various body image, personality, and emotion scales. A separate sample (n=155; 35.5% male) in Study 3 completed the BASES twice using a two-week time interval. The BASES subscale scores demonstrated evidence for internal consistency, item-total correlations, concurrent, convergent, incremental, and discriminant validity, and 2-week test-retest reliability. The 4-factor solution was a good fit in confirmatory factor analysis, reflecting body-related shame, guilt, authentic and hubristic pride subscales of the BASES. The development and validation of the BASES may help advance body image and self-conscious emotion research by providing a foundation to examine the unique antecedents and outcomes of these specific emotional experiences. Copyright © 2014 Elsevier Ltd. All rights reserved.
The Cognition Battery of the NIH Toolbox for Assessment of Neurological and Behavioral Function: Validation in an Adult Sample

PubMed Central

Weintraub, Sandra; Dikmen, Sureyya S.; Heaton, Robert K.; Tulsky, David S.; Zelazo, Philip David; Slotkin, Jerry; Carlozzi, Noelle E.; Bauer, Patricia J.; Wallner-Allen, Kathleen; Fox, Nathan; Havlik, Richard; Beaumont, Jennifer L.; Mungas, Dan; Manly, Jennifer J.; Moy, Claudia; Conway, Kevin; Edwards, Emmeline; Nowinski, Cindy J.; Gershon, Richard

2014-01-01

This paper introduces a special series on validity studies of the Cognition Battery (CB) from the U.S. National Institutes of Health Toolbox for the Assessment of Neurological and Behavioral Function (NIHTB) (R. C. Gershon et al., 2013) in an adult sample. This first paper in the series describes the sample, each of the seven instruments in the NIHTB-CB briefly, and the general approach to data analysis. Data are provided on test-retest reliability and practice effects, and raw scores (mean, standard deviation, range) are presented for each instrument and the gold standard instruments used to measure construct validity. Accompanying papers provide details on each instrument, including information about instrument development, psychometric properties, age and education effects on performance, and convergent and discriminant construct validity. One paper in the series is devoted to a factor analysis of the NIHTB-CB in adults and another describes the psychometric properties of three composite scores derived from the individual measures representing fluid and crystallized abilities and their combination. The NIHTB-CB is designed to provide a brief, comprehensive, common set of measures to allow comparisons among disparate studies and to improve scientific communication. PMID:24959840
The cognition battery of the NIH toolbox for assessment of neurological and behavioral function: validation in an adult sample.

PubMed

Weintraub, Sandra; Dikmen, Sureyya S; Heaton, Robert K; Tulsky, David S; Zelazo, Philip David; Slotkin, Jerry; Carlozzi, Noelle E; Bauer, Patricia J; Wallner-Allen, Kathleen; Fox, Nathan; Havlik, Richard; Beaumont, Jennifer L; Mungas, Dan; Manly, Jennifer J; Moy, Claudia; Conway, Kevin; Edwards, Emmeline; Nowinski, Cindy J; Gershon, Richard

2014-07-01

This study introduces a special series on validity studies of the Cognition Battery (CB) from the U.S. National Institutes of Health Toolbox for the Assessment of Neurological and Behavioral Function (NIHTB) (Gershon, Wagster et al., 2013) in an adult sample. This first study in the series describes the sample, each of the seven instruments in the NIHTB-CB briefly, and the general approach to data analysis. Data are provided on test-retest reliability and practice effects, and raw scores (mean, standard deviation, range) are presented for each instrument and the gold standard instruments used to measure construct validity. Accompanying papers provide details on each instrument, including information about instrument development, psychometric properties, age and education effects on performance, and convergent and discriminant construct validity. One study in the series is devoted to a factor analysis of the NIHTB-CB in adults and another describes the psychometric properties of three composite scores derived from the individual measures representing fluid and crystallized abilities and their combination. The NIHTB-CB is designed to provide a brief, comprehensive, common set of measures to allow comparisons among disparate studies and to improve scientific communication.
Evaluation of passenger health risk assessment of sustainable indoor air quality monitoring in metro systems based on a non-Gaussian dynamic sensor validation method.

PubMed

Kim, MinJeong; Liu, Hongbin; Kim, Jeong Tai; Yoo, ChangKyoo

2014-08-15

Sensor faults in metro systems provide incorrect information to indoor air quality (IAQ) ventilation systems, resulting in the miss-operation of ventilation systems and adverse effects on passenger health. In this study, a new sensor validation method is proposed to (1) detect, identify and repair sensor faults and (2) evaluate the influence of sensor reliability on passenger health risk. To address the dynamic non-Gaussianity problem of IAQ data, dynamic independent component analysis (DICA) is used. To detect and identify sensor faults, the DICA-based squared prediction error and sensor validity index are used, respectively. To restore the faults to normal measurements, a DICA-based iterative reconstruction algorithm is proposed. The comprehensive indoor air-quality index (CIAI) that evaluates the influence of the current IAQ on passenger health is then compared using the faulty and reconstructed IAQ data sets. Experimental results from a metro station showed that the DICA-based method can produce an improved IAQ level in the metro station and reduce passenger health risk since it more accurately validates sensor faults than do conventional methods. Copyright © 2014 Elsevier B.V. All rights reserved.
Validity, Reliability, and the Questionable Role of Psychometrics in Plastic Surgery

PubMed Central

2014-01-01

Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354

Some links on this page may take you to non-federal websites. Their policies may differ from this site.