Sample records for validity criterion-related validity

  1. Criterion-Related Validity: Assessing the Value of Subscores

    ERIC Educational Resources Information Center

    Davison, Mark L.; Davenport, Ernest C., Jr.; Chang, Yu-Feng; Vue, Kory; Su, Shiyang

    2015-01-01

    Criterion-related profile analysis (CPA) can be used to assess whether subscores of a test or test battery account for more criterion variance than does a single total score. Application of CPA to subscore evaluation is described, compared to alternative procedures, and illustrated using SAT data. Considerations other than validity and reliability…
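
The core question in this record, whether subscores explain more criterion variance than a single total score, can be illustrated with a nested regression comparison. The sketch below is not the CPA procedure itself; it only compares R² for a total-score model against a subscore model on synthetic data. The function name `r_squared`, the two subscores, and their weights are assumptions made for illustration.

```python
import numpy as np

def r_squared(predictors, criterion):
    """Proportion of criterion variance explained by an OLS fit with intercept."""
    X = np.column_stack([np.ones(len(criterion)), predictors])
    coef, *_ = np.linalg.lstsq(X, criterion, rcond=None)
    residuals = criterion - X @ coef
    centered = criterion - criterion.mean()
    return 1.0 - (residuals @ residuals) / (centered @ centered)

# Synthetic data (hypothetical): two subscores with unequal criterion weights.
rng = np.random.default_rng(0)
subscores = rng.normal(size=(500, 2))
criterion = 0.8 * subscores[:, 0] + 0.2 * subscores[:, 1] + rng.normal(size=500)

# Model 1: the total score alone. Model 2: the subscores entered separately.
r2_total = r_squared(subscores.sum(axis=1), criterion)
r2_subscores = r_squared(subscores, criterion)
gain = r2_subscores - r2_total

print(f"total: {r2_total:.3f}  subscores: {r2_subscores:.3f}  gain: {gain:.3f}")
```

Because the total score is a linear combination of the subscores, the subscore model can never explain less in-sample variance than the total-score model; the practical question is whether the gain is large enough to matter.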

  2. Ethical leadership: meta-analytic evidence of criterion-related and incremental validity.

    PubMed

    Ng, Thomas W H; Feldman, Daniel C

    2015-05-01

    This study examines the criterion-related and incremental validity of ethical leadership (EL) with meta-analytic data. Across 101 samples published over the last 15 years (N = 29,620), we observed that EL demonstrated acceptable criterion-related validity with variables that tap followers' job attitudes, job performance, and evaluations of their leaders. Further, followers' trust in the leader mediated the relationships of EL with job attitudes and performance. In terms of incremental validity, we found that EL significantly, albeit weakly in some cases, predicted task performance, citizenship behavior, and counterproductive work behavior, even after controlling for the effects of such variables as transformational leadership, use of contingent rewards, management by exception, interactional fairness, and destructive leadership. The article concludes with a discussion of ways to strengthen the incremental validity of EL. (PsycINFO Database Record (c) 2015 APA, all rights reserved).

  3. The Cognitive Abilities Scale--Second Edition Preschool Form: Studies of Concurrent Criterion-Related, Construct, and Predictive Criterion-Related Validity

    ERIC Educational Resources Information Center

    Swanson, Jennifer R.; Bradley-Johnson, Sharon; Johnson, C. Merle; O'Dell, Anna Rubenaker

    2009-01-01

    Three studies examine the validity of the Preschool Form of the Cognitive Abilities Scale--Second Edition (CAS-2). Significant high concurrent criterion-related validity correlations, corrected for restricted range, are found between the CAS-2 and the Detroit Test of Learning Ability--Primary: Third Edition for 26 three-year-olds (r[subscript c] =…
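
The abstract mentions correlations "corrected for restricted range" without saying which correction was applied. As a hedged illustration, the sketch below uses the widely taught Thorndike Case 2 formula for direct range restriction; the function name and the example numbers are hypothetical and are not taken from the study.

```python
import math

def correct_range_restriction(r_restricted, sd_unrestricted, sd_restricted):
    """Thorndike Case 2 correction for direct range restriction.

    r_c = r * U / sqrt(1 + r^2 * (U^2 - 1)), where U is the ratio of the
    unrestricted to the restricted standard deviation of the predictor.
    """
    u = sd_unrestricted / sd_restricted
    return r_restricted * u / math.sqrt(1.0 + r_restricted**2 * (u**2 - 1.0))

# Hypothetical values: observed r = 0.50 in a sample whose predictor SD
# is half the SD of the reference population.
corrected = correct_range_restriction(0.50, sd_unrestricted=2.0, sd_restricted=1.0)
print(round(corrected, 3))
```

When the two standard deviations are equal, the ratio U is 1 and the formula returns the observed correlation unchanged.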

  4. An Application of Practical Strategies in Assessing the Criterion-Related Validity of Credentialing Examinations.

    ERIC Educational Resources Information Center

    Fidler, James R.

    1993-01-01

    Criterion-related validities of 2 laboratory practitioner certification examinations for medical technologists (MTs) and medical laboratory technicians (MLTs) were assessed for 81 MT and 70 MLT examinees. Validity coefficients are presented for both measures. Overall, summative ratings yielded stronger validity coefficients than ratings based on…

  5. Criterion Related Validity of Karate Specific Aerobic Test (KSAT).

    PubMed

    Chaabene, Helmi; Hachana, Younes; Franchini, Emerson; Tabben, Montassar; Mkaouer, Bessem; Negra, Yassine; Hammami, Mehrez; Chamari, Karim

    2015-09-01

    Karate is one of the most popular combat sports in the world. Physical fitness assessment on a regular basis is important for monitoring the effectiveness of the training program and the readiness of karatekas to compete. The aim of this research was to examine the criterion-related validity of the karate specific aerobic test (KSAT) as an indicator of the aerobic level of karate practitioners. Cardiorespiratory responses, aerobic performance level through both treadmill laboratory test and YoYo intermittent recovery test level 1 (YoYoIRTL1) as well as time to exhaustion in the KSAT test (TE'KSAT) were determined in a total of fifteen healthy international karatekas (i.e. karate practitioners) (means ± SD: age: 22.2 ± 4.3 years; height: 176.4 ± 7.5 cm; body mass: 70.3 ± 9.7 kg and body fat: 13.2 ± 6%). Peak heart rate obtained from KSAT represented ~99% of maximal heart rate registered during the treadmill test, showing that KSAT imposes high physiological demands. There was no significant correlation between KSAT's TE and relative (mL/min kg) treadmill maximal oxygen uptake (r = 0.14; P = 0.69; [small]). On the other hand, there was a significant relationship between KSAT's TE and the velocity associated with VO2max (vVO2max) (r = 0.67; P = 0.03; [large]) as well as the velocity at VO2 corresponding to the second ventilatory threshold (vVO2 VAT) (r = 0.64; P = 0.04; [large]). Moreover, a significant relationship was found between KSAT's TE and both the total distance covered and parameters of intermittent endurance measured through YoYoIRTL1. The KSAT has not proved to have indirect criterion-related validity, as no significant correlations were found between KSAT's TE and treadmill VO2max. Nevertheless, as it correlated with other aerobic fitness variables, KSAT can be considered an indicator of karate specific endurance. The establishment of the criterion-related validity of the KSAT requires further investigation.

  6. Brief report: The Brief Alcohol Social Density Assessment (BASDA): convergent, criterion-related, and incremental validity.

    PubMed

    MacKillop, James; Acker, John D; Bollinger, Jared; Clifton, Allan; Miller, Joshua D; Campbell, W Keith; Goodie, Adam S

    2013-09-01

    Alcohol misuse is substantially influenced by social factors, but systematic assessments of social network drinking are typically lengthy. The goal of the present study was to provide further validation of a brief measure of social network alcohol use, the Brief Alcohol Social Density Assessment (BASDA), in a sample of emerging adults. Specifically, the study sought to examine the BASDA's convergent, criterion, and incremental validity in relation to well-established measures of drinking motives and problematic drinking. Participants were 354 undergraduates who were assessed using the BASDA, the Alcohol Use Disorders Identification Test (AUDIT), and the Drinking Motives Questionnaire. Significant associations were observed between the BASDA index of alcohol-related social density and alcohol misuse, social motives, and conformity motives, supporting convergent validity. Criterion-related validity was supported by evidence that significantly greater alcohol involvement was present in the social networks of individuals scoring at or above an AUDIT score of 8, a validated criterion for hazardous drinking. Finally, the BASDA index was significantly associated with alcohol misuse above and beyond drinking motives in relation to AUDIT scores, supporting incremental validity. Taken together, these findings provide further support for the BASDA as an efficient measure of drinking in an individual's social network. Methodological considerations as well as recommendations for future investigations in this area are discussed.

  7. Discriminant Validity Assessment: Use of Fornell & Larcker criterion versus HTMT Criterion

    NASA Astrophysics Data System (ADS)

    Hamid, M. R. Ab; Sami, W.; Mohmad Sidek, M. H.

    2017-09-01

    Assessment of discriminant validity is a must in any research that involves latent variables for the prevention of multicollinearity issues. The Fornell and Larcker criterion is the most widely used method for this purpose. However, a new method has emerged for assessing discriminant validity: the heterotrait-monotrait (HTMT) ratio of correlations. Therefore, this article presents the results of discriminant validity assessment using these methods. Data from a previous study involving 429 respondents were used for empirical validation of a value-based excellence model in higher education institutions (HEI) in Malaysia. From the analysis, the convergent, divergent and discriminant validity were established and admissible using the Fornell and Larcker criterion. However, discriminant validity is an issue when employing the HTMT criterion. This shows that the latent variables under study faced the issue of multicollinearity and should be examined in further detail. This also implies that the HTMT criterion is a stringent measure that can detect a possible lack of discrimination among the latent variables. In conclusion, the instrument, which consisted of six latent variables, was still lacking in terms of discriminant validity and should be explored further.
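
To make the HTMT criterion concrete, the sketch below computes the ratio for two constructs from an item correlation matrix: the mean between-construct item correlation divided by the geometric mean of the average within-construct item correlations. The matrix, the item groupings, and the function name `htmt` are hypothetical, and the code assumes all items are keyed in the same direction.

```python
import numpy as np

def htmt(corr, items_a, items_b):
    """Heterotrait-monotrait ratio for two constructs from item correlations."""
    corr = np.asarray(corr, dtype=float)

    # Heterotrait-heteromethod: correlations between items of A and items of B.
    hetero = corr[np.ix_(items_a, items_b)].mean()

    def mean_within(items):
        # Monotrait-heteromethod: off-diagonal correlations within one construct.
        block = corr[np.ix_(items, items)]
        return block[np.triu_indices(len(items), k=1)].mean()

    return hetero / np.sqrt(mean_within(items_a) * mean_within(items_b))

# Hypothetical 4-item correlation matrix: items 0-1 measure A, items 2-3 measure B.
corr = np.array([
    [1.0, 0.8, 0.4, 0.4],
    [0.8, 1.0, 0.4, 0.4],
    [0.4, 0.4, 1.0, 0.5],
    [0.4, 0.4, 0.5, 1.0],
])
ratio = htmt(corr, [0, 1], [2, 3])
print(round(ratio, 3))
```

Cutoffs of 0.85 or 0.90 are commonly cited in the literature as upper bounds for acceptable discriminant validity; which threshold the authors applied is not stated in the abstract.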

  8. Criterion Related Validity of Karate Specific Aerobic Test (KSAT)

    PubMed Central

    Chaabene, Helmi; Hachana, Younes; Franchini, Emerson; Tabben, Montassar; Mkaouer, Bessem; Negra, Yassine; Hammami, Mehrez; Chamari, Karim

    2015-01-01

    Background: Karate is one of the most popular combat sports in the world. Physical fitness assessment on a regular basis is important for monitoring the effectiveness of the training program and the readiness of karatekas to compete. Objectives: The aim of this research was to examine the criterion-related validity of the karate specific aerobic test (KSAT) as an indicator of the aerobic level of karate practitioners. Patients and Methods: Cardiorespiratory responses, aerobic performance level through both treadmill laboratory test and YoYo intermittent recovery test level 1 (YoYoIRTL1) as well as time to exhaustion in the KSAT test (TE’KSAT) were determined in a total of fifteen healthy international karatekas (i.e. karate practitioners) (means ± SD: age: 22.2 ± 4.3 years; height: 176.4 ± 7.5 cm; body mass: 70.3 ± 9.7 kg and body fat: 13.2 ± 6%). Results: Peak heart rate obtained from KSAT represented ~99% of maximal heart rate registered during the treadmill test, showing that KSAT imposes high physiological demands. There was no significant correlation between KSAT’s TE and relative (mL/min kg) treadmill maximal oxygen uptake (r = 0.14; P = 0.69; [small]). On the other hand, there was a significant relationship between KSAT’s TE and the velocity associated with VO2max (vVO2max) (r = 0.67; P = 0.03; [large]) as well as the velocity at VO2 corresponding to the second ventilatory threshold (vVO2 VAT) (r = 0.64; P = 0.04; [large]). Moreover, a significant relationship was found between KSAT’s TE and both the total distance covered and parameters of intermittent endurance measured through YoYoIRTL1. Conclusions: The KSAT has not proved to have indirect criterion-related validity, as no significant correlations were found between KSAT’s TE and treadmill VO2max. Nevertheless, as it correlated with other aerobic fitness variables, KSAT can be considered an indicator of karate specific endurance. The establishment of the criterion related validity of the KSAT

  9. Reliability and criterion-related validity of a new repeated agility test

    PubMed Central

    Makni, E; Jemni, M; Elloumi, M; Chamari, K; Nabli, MA; Padulo, J; Moalla, W

    2016-01-01

    The study aimed to assess the reliability and the criterion-related validity of a new repeated sprint T-test (RSTT) that includes intense multidirectional intermittent efforts. The RSTT consisted of 7 maximal repeated executions of the agility T-test with 25 s of passive recovery rest in between. Forty-five team sports players performed two RSTTs separated by 3 days to assess the reliability of best time (BT) and total time (TT) of the RSTT. The intra-class correlation coefficient analysis revealed a high relative reliability between test and retest for BT and TT (>0.90). The standard error of measurement (<0.50) showed that the RSTT has a good absolute reliability. The minimal detectable change values for BT and TT related to the RSTT were 0.09 s and 0.58 s, respectively. To check the criterion-related validity of the RSTT, players performed a repeated linear sprint (RLS) and a repeated sprint with changes of direction (RSCD). Significant correlations between the BT and TT of the RLS, RSCD and RSTT were observed (p<0.001). The RSTT is, therefore, a reliable and valid measure of the intermittent repeated sprint agility performance. As this ability is required in all team sports, it is suggested that team sports coaches, fitness coaches and sports scientists consider this test in their training follow-up. PMID:27274109
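
The absolute reliability indices named in this record (standard error of measurement and minimal detectable change) are usually derived from the intra-class correlation. The sketch below uses the common textbook formulas SEM = SD × √(1 − ICC) and MDC95 = 1.96 × √2 × SEM. The abstract does not state which formulas the authors used, and the input values here are hypothetical.

```python
import math

def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and a reliability coefficient (ICC)."""
    return sd * math.sqrt(1.0 - icc)

def minimal_detectable_change(sem, z=1.96):
    """Smallest change exceeding measurement error at the confidence level implied by z."""
    return z * math.sqrt(2.0) * sem

# Hypothetical inputs: between-subject SD of 1.0 s and ICC of 0.91.
sem = standard_error_of_measurement(sd=1.0, icc=0.91)
mdc95 = minimal_detectable_change(sem)
print(f"SEM = {sem:.2f} s, MDC95 = {mdc95:.2f} s")
```

The √2 factor reflects that a change score combines the measurement error of two test occasions.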

  10. Criterion-Related Validity of Sit-and-Reach Tests for Estimating Hamstring and Lumbar Extensibility: a Meta-Analysis

    PubMed Central

    Mayorga-Vega, Daniel; Merino-Marban, Rafael; Viciana, Jesús

    2014-01-01

    The main purpose of the present meta-analysis was to examine the scientific literature on the criterion-related validity of sit-and-reach tests for estimating hamstring and lumbar extensibility. For this purpose relevant studies were searched from seven electronic databases dated up through December 2012. Primary outcomes of criterion-related validity were Pearson's zero-order correlation coefficients (r) between sit-and-reach tests and hamstrings and/or lumbar extensibility criterion measures. Then, from the included studies, the Hunter-Schmidt psychometric meta-analysis approach was applied to estimate the population criterion-related validity of sit-and-reach tests. Firstly, the corrected correlation mean (rp), unaffected by statistical artefacts (i.e., sampling error and measurement error), was calculated separately for each sit-and-reach test. Subsequently, the three potential moderator variables (sex of participants, age of participants, and level of hamstring extensibility) were examined by a partially hierarchical analysis. Of the 34 studies included in the present meta-analysis, 99 correlation values across eight sit-and-reach tests and 51 across seven sit-and-reach tests were retrieved for hamstring and lumbar extensibility, respectively. The overall results showed that all sit-and-reach tests had a moderate mean criterion-related validity for estimating hamstring extensibility (rp = 0.46-0.67), but they had a low mean for estimating lumbar extensibility (rp = 0.16-0.35). Generally, females, adults and participants with high levels of hamstring extensibility tended to have greater mean values of criterion-related validity for estimating hamstring extensibility. When the use of angular tests is limited, such as in a school setting or in large-scale studies, scientists and practitioners could use the sit-and-reach tests as a useful alternative for hamstring extensibility estimation, but not for estimating lumbar extensibility.
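
The two artefact corrections named in this record, sampling error and measurement error, can be sketched in a few lines. The code below is a simplified "bare-bones" version of the Hunter-Schmidt approach plus the classical correction for attenuation; it is not the authors' full procedure, and the correlations, sample sizes, and reliabilities are invented for illustration.

```python
import math

def bare_bones_meta(correlations, sample_sizes):
    """Sample-size-weighted mean r and variance decomposition (bare-bones)."""
    total_n = sum(sample_sizes)
    pairs = list(zip(correlations, sample_sizes))
    mean_r = sum(n * r for r, n in pairs) / total_n
    var_observed = sum(n * (r - mean_r) ** 2 for r, n in pairs) / total_n
    # Expected sampling-error variance, using the average sample size.
    mean_n = total_n / len(sample_sizes)
    var_error = (1.0 - mean_r**2) ** 2 / (mean_n - 1.0)
    # Variance left after removing sampling error (floored at zero).
    var_residual = max(var_observed - var_error, 0.0)
    return mean_r, var_observed, var_error, var_residual

def disattenuate(r, reliability_x, reliability_y):
    """Correct a correlation for measurement error in both variables."""
    return r / math.sqrt(reliability_x * reliability_y)

# Hypothetical inputs: two studies and assumed reliabilities of 0.80.
mean_r, var_obs, var_err, var_res = bare_bones_meta([0.40, 0.60], [100, 300])
corrected_mean = disattenuate(mean_r, 0.80, 0.80)
print(round(mean_r, 3), round(corrected_mean, 3))
```

Weighting by sample size gives larger studies more influence on the mean, and the residual variance indicates how much between-study variation remains once sampling error is accounted for, which is what motivates a moderator analysis.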

  11. Evaluation of Criterion Validity for Scales with Congeneric Measures

    ERIC Educational Resources Information Center

    Raykov, Tenko

    2007-01-01

    A method for estimating criterion validity of scales with homogeneous components is outlined. It accomplishes point and interval estimation of interrelationship indices between composite scores and criterion variables and is useful for testing hypotheses about criterion validity of measurement instruments. The method can also be used with missing…

  12. easyCBM® Reading Criterion Related Validity Evidence: Grades K-1. Technical Report #1309

    ERIC Educational Resources Information Center

    Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

    2013-01-01

    In this technical report, we present the results of a study to gather criterion-related evidence for Grade K-1 easyCBM® reading measures. We used correlations to examine the relation between the easyCBM® measures and other published measures with known reliability and validity evidence, including the Dynamic Indicators of Basic Early Literacy…

  13. easyCBM® Reading Criterion Related Validity Evidence: Grades 2-5. Technical Report #1310

    ERIC Educational Resources Information Center

    Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

    2013-01-01

    In this technical report, we present the results of a study to gather criterion-related evidence for Grade 2-5 easyCBM® reading measures. We used correlations to examine the relation between the easyCBM® measures and other published measures with known reliability and validity evidence, including the Gates-MacGinitie Reading Tests and the Dynamic…

  14. Reliability and criterion-related validity testing (construct) of the Endotracheal Suction Assessment Tool (ESAT©).

    PubMed

    Davies, Kylie; Bulsara, Max K; Ramelet, Anne-Sylvie; Monterosso, Leanne

    2018-05-01

    To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide "inexperienced" paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse "experts" developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. "Experienced" (n = 12) and "inexperienced" (n = 14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the "experts" (n = 9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with "experienced" and "inexperienced" paediatric intensive care nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by "experts" (n = 9), "inexperienced" (n = 14) and "experienced" (n = 12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the "inexperienced" nurse to guide decision-making related to endotracheal tube

  15. Comparing the Construct and Criterion-Related Validity of Ability-Based and Mixed-Model Measures of Emotional Intelligence

    ERIC Educational Resources Information Center

    Livingstone, Holly A.; Day, Arla L.

    2005-01-01

    Despite the popularity of the concept of emotional intelligence (EI), there is much controversy around its definition, measurement, and validity. Therefore, the authors examined the construct and criterion-related validity of an ability-based EI measure (Mayer Salovey Caruso Emotional Intelligence Test [MSCEIT]) and a mixed-model EI measure…

  16. Updating the Trainability Tests Literature on Black-White Subgroup Differences and Reconsidering Criterion-Related Validity

    ERIC Educational Resources Information Center

    Roth, Philip L.; Buster, Maury A.; Bobko, Philip

    2011-01-01

    A number of applied psychologists have suggested that trainability test Black-White ethnic group differences are low or relatively low (e.g., Siegel & Bergman, 1975), though data are scarce. Likewise, there are relatively few estimates of criterion-related validity for trainability tests predicting job performance (cf. Robertson & Downs,…

  17. Criterion-Related Validity of the TOEFL iBT Listening Section. TOEFL iBT Research Report. RR-09-02

    ERIC Educational Resources Information Center

    Sawaki, Yasuyo; Nissan, Susan

    2009-01-01

    The study investigated the criterion-related validity of the "Test of English as a Foreign Language"™ Internet-based test (TOEFL® iBT) Listening section by examining its relationship to a criterion measure designed to reflect language-use tasks that university students encounter in everyday academic life: listening to academic…

  18. A Model for Estimating the Reliability and Validity of Criterion-Referenced Measures.

    ERIC Educational Resources Information Center

    Edmonston, Leon P.; Randall, Robert S.

    A decision model designed to determine the reliability and validity of criterion referenced measures (CRMs) is presented. General procedures which pertain to the model are discussed as to: Measures of relationship, Reliability, Validity (content, criterion-oriented, and construct validation), and Item Analysis. The decision model is presented in…

  19. Evaluation of the Gratitude Questionnaire in a Chinese Sample of Adults: Factorial Validity, Criterion-Related Validity, and Measurement Invariance Across Sex

    PubMed Central

    Kong, Feng; You, Xuqun; Zhao, Jingjing

    2017-01-01

    The Gratitude Questionnaire (GQ; McCullough et al., 2002) is one of the most widely used instruments to assess dispositional gratitude. The purpose of this study was to validate a Chinese version of the GQ by examining internal consistency, factor structure, convergent validity, and measurement invariance across sex. A total of 1151 Chinese adults were recruited to complete the GQ, Positive Affect and Negative Affect Scales, and Satisfaction with Life Scale. Confirmatory factor analysis indicated that the original unidimensional model fitted well, which is in accordance with the findings in Western populations. Furthermore, the GQ had satisfactory composite reliability and criterion-related validity with measures of life satisfaction and affective well-being. Evidence of configural, metric and scalar invariance across sex was obtained. Tests of the latent mean differences found females had higher latent mean scores than males. These findings suggest that the Chinese version of GQ is a reliable and valid tool for measuring dispositional gratitude and can generally be utilized across sex in the Chinese context. PMID:28919873
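
Composite reliability, mentioned in this record, is typically computed from the standardized factor loadings of a confirmatory factor model. The sketch below uses the standard formula CR = (Σλ)² / ((Σλ)² + Σ(1 − λ²)); the loadings are hypothetical and are not the GQ estimates from the study.

```python
def composite_reliability(loadings):
    """Composite reliability from standardized loadings of a one-factor model."""
    loading_sum_sq = sum(loadings) ** 2
    # Error variance of each standardized indicator is 1 - loading^2.
    error_variance = sum(1.0 - value**2 for value in loadings)
    return loading_sum_sq / (loading_sum_sq + error_variance)

# Hypothetical standardized loadings for a three-item scale.
cr = composite_reliability([0.7, 0.7, 0.7])
print(round(cr, 3))
```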

  20. Evaluation of the Gratitude Questionnaire in a Chinese Sample of Adults: Factorial Validity, Criterion-Related Validity, and Measurement Invariance Across Sex.

    PubMed

    Kong, Feng; You, Xuqun; Zhao, Jingjing

    2017-01-01

    The Gratitude Questionnaire (GQ; McCullough et al., 2002) is one of the most widely used instruments to assess dispositional gratitude. The purpose of this study was to validate a Chinese version of the GQ by examining internal consistency, factor structure, convergent validity, and measurement invariance across sex. A total of 1151 Chinese adults were recruited to complete the GQ, Positive Affect and Negative Affect Scales, and Satisfaction with Life Scale. Confirmatory factor analysis indicated that the original unidimensional model fitted well, which is in accordance with the findings in Western populations. Furthermore, the GQ had satisfactory composite reliability and criterion-related validity with measures of life satisfaction and affective well-being. Evidence of configural, metric and scalar invariance across sex was obtained. Tests of the latent mean differences found females had higher latent mean scores than males. These findings suggest that the Chinese version of GQ is a reliable and valid tool for measuring dispositional gratitude and can generally be utilized across sex in the Chinese context.

  1. A systematic review of reliability and objective criterion-related validity of physical activity questionnaires.

    PubMed

    Helmerhorst, Hendrik J F; Brage, Søren; Warren, Janet; Besson, Herve; Ekelund, Ulf

    2012-08-31

    Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA) and in particular by physical activity questionnaires (PAQs) remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62-0.71 for existing, and 0.74-0.76 for new PAQs. Median validity coefficients ranged from 0.30-0.39 for existing, and from 0.25-0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument.

  2. A systematic review of reliability and objective criterion-related validity of physical activity questionnaires

    PubMed Central

    2012-01-01

    Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA) and in particular by physical activity questionnaires (PAQs) remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument. PMID:22938557

  3. Convergent, discriminant, and criterion validity of DSM-5 traits.

    PubMed

    Yalch, Matthew M; Hopwood, Christopher J

    2016-10-01

    Section III of the Diagnostic and Statistical Manual of Mental Disorders (5th ed.; DSM-5; American Psychiatric Association, 2013) contains a system for diagnosing personality disorder based in part on assessing 25 maladaptive traits. Initial research suggests that this aspect of the system improves the validity and clinical utility of the Section II Model. The Computer Adaptive Test of Personality Disorder (CAT-PD; Simms et al., 2011) contains many traits similar to those in the DSM-5, as well as several additional traits seemingly not covered in the DSM-5. In this study we evaluate the convergent and discriminant validity between the DSM-5 traits, as assessed by the Personality Inventory for DSM-5 (PID-5; Krueger et al., 2012), and CAT-PD in an undergraduate sample, and test whether traits included in the CAT-PD but not the DSM-5 provide incremental validity in association with clinically relevant criterion variables. Results supported the convergent and discriminant validity of the PID-5 and CAT-PD scales in their assessment of 23 out of 25 DSM-5 traits. DSM-5 traits were consistently associated with 11 criterion variables, despite our having intentionally selected clinically relevant criterion constructs not directly assessed by DSM-5 traits. However, the additional CAT-PD traits provided incremental information above and beyond the DSM-5 traits for all criterion variables examined. These findings support the validity of pathological trait models in general and the DSM-5 and CAT-PD models in particular, while also suggesting that the CAT-PD may include additional traits for consideration in future iterations of the DSM-5 system. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  4. Criterion Validity of Measures of Perceived Relative Harm of E-Cigarettes and Smokeless Tobacco Compared to Cigarettes

    PubMed Central

    Persoskie, Alexander; Nguyen, Anh B.; Kaufman, Annette R.; Tworek, Cindy

    2017-01-01

    Beliefs about the relative harmfulness of one product compared to another (perceived relative harm) are central to research and regulation concerning tobacco and nicotine-containing products, but techniques for measuring such beliefs vary widely. We compared the validity of direct and indirect measures of perceived harm of e-cigarettes and smokeless tobacco (SLT) compared to cigarettes. On direct measures, participants explicitly compare the harmfulness of each product. On indirect measures, participants rate the harmfulness of each product separately, and ratings are compared. The U.S. Health Information National Trends Survey (HINTS-FDA-2015; N=3738) included direct measures of perceived harm of e-cigarettes and SLT compared to cigarettes. Indirect measures were created by comparing ratings of harm from e-cigarettes, SLT, and cigarettes on 3-point scales. Logistic regressions tested validity by assessing whether direct and indirect measures were associated with criterion variables including: ever-trying e-cigarettes, ever-trying snus, and SLT use status. Compared to the indirect measures, the direct measures of harm were more consistently associated with criterion variables. On direct measures, 26% of adults rated e-cigarettes as less harmful than cigarettes, and 11% rated SLT as less harmful than cigarettes. Direct measures appear to provide valid information about individuals’ harm beliefs, which may be used to inform research and tobacco control policy. Further validation research is encouraged. PMID:28073035
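
The distinction between direct and indirect measures can be shown with a small sketch of how an indirect measure might be derived from two separate harm ratings. The 3-point coding (1 = least harmful, 3 = most harmful), the function name, and the category labels are assumptions for illustration; the survey's actual coding is not given in the abstract.

```python
def indirect_relative_harm(product_rating, cigarette_rating):
    """Classify perceived relative harm by comparing two separate ratings.

    Ratings are assumed to lie on a 3-point scale where higher means more harmful.
    """
    if product_rating < cigarette_rating:
        return "less harmful"
    if product_rating > cigarette_rating:
        return "more harmful"
    return "equally harmful"

# Hypothetical respondent: rates e-cigarettes 2 and cigarettes 3.
print(indirect_relative_harm(2, 3))
```

A direct measure, by contrast, asks the respondent for this comparison in a single item, so no derivation step is needed.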

  5. Criterion-Related Validity of the Distance- and Time-Based Walk/Run Field Tests for Estimating Cardiorespiratory Fitness: A Systematic Review and Meta-Analysis

    PubMed Central

    Mayorga-Vega, Daniel; Bocanegra-Parrilla, Raúl; Ornelas, Martha; Viciana, Jesús

    2016-01-01

    Objectives: The main purpose of the present meta-analysis was to examine the criterion-related validity of the distance- and time-based walk/run tests for estimating cardiorespiratory fitness among apparently healthy children and adults. Materials and Methods: Relevant studies were searched from seven electronic bibliographic databases up to August 2015 and through other sources. The Hunter-Schmidt’s psychometric meta-analysis approach was conducted to estimate the population criterion-related validity of the following walk/run tests: 5,000 m, 3 miles, 2 miles, 3,000 m, 1.5 miles, 1 mile, 1,000 m, ½ mile, 600 m, 600 yd, ¼ mile, 15 min, 12 min, 9 min, and 6 min. Results: From the 123 included studies, a total of 200 correlation values were analyzed. The overall results showed that the criterion-related validity of the walk/run tests for estimating maximum oxygen uptake ranged from low to moderate (rp = 0.42–0.79), with the 1.5 mile (rp = 0.79, 0.73–0.85) and 12 min walk/run tests (rp = 0.78, 0.72–0.83) having the higher criterion-related validity for distance- and time-based field tests, respectively. The present meta-analysis also showed that sex, age and maximum oxygen uptake level do not seem to affect the criterion-related validity of the walk/run tests. Conclusions: When the evaluation of an individual’s maximum oxygen uptake attained during a laboratory test is not feasible, the 1.5 mile and 12 min walk/run tests represent useful alternatives for estimating cardiorespiratory fitness. As in the assessment with any physical fitness field test, evaluators must be aware that the performance score of the walk/run field tests is simply an estimation and not a direct measure of cardiorespiratory fitness. PMID:26987118

  6. Criterion-Related Validity of the Distance- and Time-Based Walk/Run Field Tests for Estimating Cardiorespiratory Fitness: A Systematic Review and Meta-Analysis.

    PubMed

    Mayorga-Vega, Daniel; Bocanegra-Parrilla, Raúl; Ornelas, Martha; Viciana, Jesús

    2016-01-01

    The main purpose of the present meta-analysis was to examine the criterion-related validity of the distance- and time-based walk/run tests for estimating cardiorespiratory fitness among apparently healthy children and adults. Relevant studies were searched from seven electronic bibliographic databases up to August 2015 and through other sources. The Hunter-Schmidt's psychometric meta-analysis approach was conducted to estimate the population criterion-related validity of the following walk/run tests: 5,000 m, 3 miles, 2 miles, 3,000 m, 1.5 miles, 1 mile, 1,000 m, ½ mile, 600 m, 600 yd, ¼ mile, 15 min, 12 min, 9 min, and 6 min. From the 123 included studies, a total of 200 correlation values were analyzed. The overall results showed that the criterion-related validity of the walk/run tests for estimating maximum oxygen uptake ranged from low to moderate (rp = 0.42-0.79), with the 1.5 mile (rp = 0.79, 0.73-0.85) and 12 min walk/run tests (rp = 0.78, 0.72-0.83) having the higher criterion-related validity for distance- and time-based field tests, respectively. The present meta-analysis also showed that sex, age and maximum oxygen uptake level do not seem to affect the criterion-related validity of the walk/run tests. When the evaluation of an individual's maximum oxygen uptake attained during a laboratory test is not feasible, the 1.5 mile and 12 min walk/run tests represent useful alternatives for estimating cardiorespiratory fitness. As in the assessment with any physical fitness field test, evaluators must be aware that the performance score of the walk/run field tests is simply an estimation and not a direct measure of cardiorespiratory fitness.

  7. Criterion-related validity of perceived exertion scales in healthy children: a systematic review and meta-analysis.

    PubMed

    Rodríguez, Iván; Zambrano, Lysien; Manterola, Carlos

    2016-04-01

    Physiological parameters used to measure exercise intensity are oxygen uptake and heart rate. However, perceived exertion (PE) is a scale that has also been frequently applied. The objective of this study is to establish the criterion-related validity of PE scales in children during an incremental exercise test. Seven electronic databases were used. Studies aimed at assessing criterion-related validity of PE scales in healthy children during an incremental exercise test were included. Correlation coefficients were transformed into z-values and assessed in a meta-analysis by means of a fixed effects model if I² was below 50% or a random effects model if it was above 50%. Twenty-five articles that studied 1418 children (boys: 49.2%) met the inclusion criteria. Children's average age was 10.5 years old. Exercise modalities included bike, running and stepping exercises. The weighted correlation coefficient was 0.835 (95% confidence interval: 0.762-0.887) and 0.874 (95% confidence interval: 0.794-0.924) for heart rate and oxygen uptake as reference criteria. The production paradigm and scales that had not been adapted to children showed the lowest measurement performance (p < 0.05). Measuring PE could be valid in healthy children during an incremental exercise test. Child-specific rating scales showed a better performance than those that had not been adapted to this population. Further studies with better methodological quality should be conducted in order to confirm these results. Sociedad Argentina de Pediatría.
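
    The pooling step described in this abstract (correlations converted to z-values, with the I² statistic deciding between a fixed and a random effects model) can be sketched as follows. This is a minimal illustration, not the authors' code: the function names, the 1.96 normal quantile for the 95% interval, and the sample inputs are assumptions, and the random effects estimate itself is not implemented.

```python
import math

def _z_and_weights(rs, ns):
    """Fisher z-transform each correlation; weight by n - 3 (inverse variance of z)."""
    zs = [math.atanh(r) for r in rs]
    ws = [n - 3 for n in ns]
    return zs, ws

def pool_fixed(rs, ns):
    """Fixed-effect pooled correlation with an approximate 95% confidence interval."""
    zs, ws = _z_and_weights(rs, ns)
    z_bar = sum(w * z for w, z in zip(ws, zs)) / sum(ws)
    se = 1.0 / math.sqrt(sum(ws))
    # Back-transform the pooled z and its bounds to the correlation scale.
    return (math.tanh(z_bar),
            math.tanh(z_bar - 1.96 * se),
            math.tanh(z_bar + 1.96 * se))

def i_squared(rs, ns):
    """Heterogeneity statistic I^2 in percent, based on Cochran's Q."""
    zs, ws = _z_and_weights(rs, ns)
    z_bar = sum(w * z for w, z in zip(ws, zs)) / sum(ws)
    q = sum(w * (z - z_bar) ** 2 for w, z in zip(ws, zs))
    df = len(rs) - 1
    if q <= 0:
        return 0.0
    return max(0.0, (q - df) / q) * 100.0

def choose_model(rs, ns, cutoff=50.0):
    """Apply the abstract's rule: fixed effects below the I^2 cutoff, else random."""
    return "fixed" if i_squared(rs, ns) < cutoff else "random"

# Hypothetical study-level correlations and sample sizes (not from the review).
rs = [0.80, 0.85, 0.90]
ns = [40, 60, 55]
print(choose_model(rs, ns), pool_fixed(rs, ns))
```

    Weighting by n - 3 follows from the sampling variance of Fisher's z, 1/(n - 3), so larger studies pull the pooled estimate more strongly.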

  8. Evaluation of Measurement Instrument Criterion Validity in Finite Mixture Settings

    ERIC Educational Resources Information Center

    Raykov, Tenko; Marcoulides, George A.; Li, Tenglong

    2016-01-01

    A method for evaluating the validity of multicomponent measurement instruments in heterogeneous populations is discussed. The procedure can be used for point and interval estimation of criterion validity of linear composites in populations representing mixtures of an unknown number of latent classes. The approach permits also the evaluation of…

  9. Evidence for the Criterion Validity and Clinical Utility of the Pathological Narcissism Inventory

    ERIC Educational Resources Information Center

    Thomas, Katherine M.; Wright, Aidan G. C.; Lukowitsky, Mark R.; Donnellan, M. Brent; Hopwood, Christopher J.

    2012-01-01

    In this study, the authors evaluated aspects of criterion validity and clinical utility of the grandiosity and vulnerability components of the Pathological Narcissism Inventory (PNI) using two undergraduate samples (N = 299 and 500). Criterion validity was assessed by evaluating the correlations of narcissistic grandiosity and narcissistic…

  10. Statistical Validation of Surrogate Endpoints: Another Look at the Prentice Criterion and Other Criteria.

    PubMed

    Saraf, Sanatan; Mathew, Thomas; Roy, Anindya

    2015-01-01

    For the statistical validation of surrogate endpoints, an alternative formulation is proposed for testing Prentice's fourth criterion, under a bivariate normal model. In such a setup, the criterion involves inference concerning an appropriate regression parameter, and the criterion holds if the regression parameter is zero. Testing such a null hypothesis has been criticized in the literature since it can only be used to reject a poor surrogate, and not to validate a good surrogate. In order to circumvent this, an equivalence hypothesis is formulated for the regression parameter, namely the hypothesis that the parameter is equivalent to zero. Such an equivalence hypothesis is formulated as an alternative hypothesis, so that the surrogate endpoint is statistically validated when the null hypothesis is rejected. Confidence intervals for the regression parameter and tests for the equivalence hypothesis are proposed using bootstrap methods and small sample asymptotics, and their performances are numerically evaluated and recommendations are made. The choice of the equivalence margin is a regulatory issue that needs to be addressed. The proposed equivalence testing formulation is also adopted for other parameters that have been proposed in the literature on surrogate endpoint validation, namely, the relative effect and proportion explained.

  11. Determination of the criterion-related validity of hip joint angle test for estimating hamstring flexibility using a contemporary statistical approach.

    PubMed

    Sainz de Baranda, Pilar; Rodríguez-Iniesta, María; Ayala, Francisco; Santonja, Fernando; Cejudo, Antonio

    2014-07-01

    To examine the criterion-related validity of the horizontal hip joint angle (H-HJA) test and vertical hip joint angle (V-HJA) test for estimating hamstring flexibility measured through the passive straight-leg raise (PSLR) test using contemporary statistical measures. Validity study. Controlled laboratory environment. One hundred thirty-eight professional trampoline gymnasts (61 women and 77 men). Hamstring flexibility. Each participant performed 2 trials of H-HJA, V-HJA, and PSLR tests in a randomized order. The criterion-related validity of H-HJA and V-HJA tests was measured through the estimation equation, typical error of the estimate (TEEST), validity correlation (β), and their respective confidence limits. The findings from this study suggest that although H-HJA and V-HJA tests showed moderate to high validity scores for estimating hamstring flexibility (standardized TEEST = 0.63; β = 0.80), the TEEST statistic reported for both tests was not narrow enough for clinical purposes (H-HJA = 10.3 degrees; V-HJA = 9.5 degrees). Subsequently, the predicted likely thresholds for the true values that were generated were too wide (H-HJA = predicted value ± 13.2 degrees; V-HJA = predicted value ± 12.2 degrees). The results suggest that although the HJA test showed moderate to high validity scores for estimating hamstring flexibility, the prediction intervals between the HJA and PSLR tests are not strong enough to suggest that clinicians and sport medicine practitioners should use the HJA and PSLR tests interchangeably as gold standard measurement tools to evaluate and detect short hamstring muscle flexibility.

  12. Effect of Items Direction (Positive or Negative) on the Factorial Construction and Criterion Related Validity in Likert Scale

    ERIC Educational Resources Information Center

    Naji Qasem, Mamun Ali; Ahmad Gul, Showkeen Bilal

    2014-01-01

    The study was conducted to determine the effect of item direction (positive or negative) on the factorial construction and criterion-related validity of a Likert scale. The descriptive survey research method was used for the study, and the sample consisted of 510 undergraduate students selected using a random sampling technique. A scale developed by…

  13. Is comorbidity in the eating disorders related to perceptions of parenting? Criterion validity of the revised Young Parenting Inventory.

    PubMed

    Sheffield, Alexandra; Waller, Glenn; Emanuelli, Francesca; Murray, James

    2006-01-01

    Recent studies support the reliability and validity of the Young Parenting Inventory-Revised (YPI-R) and its use in investigating the role of parenting in the aetiology and maintenance of eating pathology. However, criterion validity has yet to be fully established. To investigate one aspect of criterion validity, this study examines the association between parenting and comorbid problems in the eating disorders (including general psychopathology and impulsivity). The participants were 124 women with eating disorders. They completed the YPI-R and the Brief Symptom Inventory (BSI; a measure of general psychopathology). They were also interviewed about their use of a number of impulsive behaviours. YPI-R scales were significant predictors of one of the nine BSI scales, and distinguished those patients who did or did not use specific impulsive behaviours. The criterion validity of the YPI-R is partially supported with regards to general psychopathology and impulsivity. The findings highlight the specificity of the parenting styles measured by the YPI-R, and the need for further research using this tool.

  14. Montreal-Toulouse Language Assessment Battery: evidence of criterion validity from patients with aphasia.

    PubMed

    Pagliarin, Karina Carlesso; Ortiz, Karin Zazo; Barreto, Simone dos Santos; Pimenta Parente, Maria Alice de Mattos; Nespoulous, Jean-Luc; Joanette, Yves; Fonseca, Rochele Paz

    2015-10-15

    The Montreal-Toulouse Language Assessment Battery - Brazilian version (MTL-BR) provides a general description of language processing and related components in adults with brain injury. The present study aimed at verifying the criterion-related validity of the Montreal-Toulouse Language Assessment Battery - Brazilian version (MTL-BR) by assessing its ability to discriminate between individuals with unilateral brain damage with and without aphasia. The investigation was carried out in a Brazilian community-based sample of 104 adults, divided into four groups: 26 participants with left hemisphere damage (LHD) with aphasia, 25 participants with right hemisphere damage (RHD), 28 with LHD non-aphasic, and 25 healthy adults. There were significant differences between patients with aphasia and the other groups on most total and subtotal scores on MTL-BR tasks. The results showed strong criterion-related validity evidence for the MTL-BR Battery, and provided important information regarding hemispheric specialization and interhemispheric cooperation. Future research is required to search for additional evidence of sensitivity, specificity and validity of the MTL-BR in samples with different types of aphasia and degrees of language impairment. Copyright © 2015 Elsevier B.V. All rights reserved.

  15. Discriminative and Criterion Validity of the Autism Spectrum Identity Scale (ASIS)

    ERIC Educational Resources Information Center

    McDonald, T. A. M.

    2017-01-01

    Individuals on the autism spectrum face stigma that can influence identity development. Previous research on the 22-item Autism Spectrum Identity Scale (ASIS) reported a four-factor structure with strong split-sample cross-validation and good internal consistency. This study reports the discriminative and criterion validity of the ASIS with other…

  16. Criterion-related validity of the Test of Children's Speech sentence intelligibility measure for children with cerebral palsy and dysarthria.

    PubMed

    Hodge, Megan; Gotzke, Carrie Lynne

    2014-08-01

    To evaluate the criterion-related validity of the TOCS+ sentence measure (TOCS+, Hodge, Daniels & Gotzke, 2009) for children with dysarthria and CP by comparing intelligibility and rate scores obtained concurrently from the TOCS+ and from a conversational sample. Twenty children (3 to 10 years old) diagnosed with spastic cerebral palsy (CP) participated. Nineteen children also had a confirmed diagnosis of dysarthria. Children's intelligibility and speaking rate scores obtained from the TOCS+, which uses imitation of sets of randomly selected items ranging from 2 to 7 words (80 words in total), and from a contiguous 100-word conversational speech sample were compared. Mean intelligibility scores were 46.5% (SD = 26.4%) and 50.9% (SD = 19.1%) and mean rates in words per minute (WPM) were 90.2 (SD = 22.3) and 94.1 (SD = 25.6), respectively, for the TOCS+ and conversational samples. No significant differences were found between the two conditions for intelligibility or rate scores. Strong correlations were found between the TOCS+ and conversational samples for intelligibility (r = 0.86; p < 0.001) and WPM (r = 0.77; p < 0.001). The results support the criterion validity of the TOCS+ sentence task as a time efficient procedure for measuring intelligibility and rate in children with CP, with and without confirmed dysarthria. Children varied in their relative performance on the two speaking tasks, reflecting the complexity of factors that influence intelligibility and rate scores.

  17. Concurrent criterion validity of the safe driving behavior measure: a predictor of on-road driving outcomes.

    PubMed

    Classen, Sherrilene; Wang, Yanning; Winter, Sandra M; Velozo, Craig A; Lanford, Desiree N; Bédard, Michel

    2013-01-01

    We determined the concurrent criterion validity of the Safe Driving Behavior Measure (SDBM) for on-road outcomes (passing or failing the on-road test as determined by a certified driving rehabilitation specialist) among older drivers and their family members-caregivers. On the basis of ratings from 168 older drivers and 168 family members-caregivers, we calculated receiver operating characteristic curves. The drivers' area under the curve (AUC) was .620 (95% confidence interval [CI] = .514-.725, p = .043). The family members-caregivers' AUC was .726 (95% CI = .622-.829, p ≤ .01). Older drivers' ratings showed statistically significant yet poor concurrent criterion validity, but family members-caregivers' ratings showed good concurrent criterion validity for the criterion on-road driving test. Continuing research with a more representative sample is being pursued to confirm the SDBM's concurrent criterion validity. This screening tool may be useful for generalist practitioners to use in making decisions regarding driving. Copyright © 2013 by the American Occupational Therapy Association, Inc.
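
    The area under the ROC curve reported here can be read as the probability that a randomly chosen case from one outcome group receives a higher rating than a randomly chosen case from the other group. A minimal sketch of that pairwise computation follows; the function name and the ratings are hypothetical, and it is not the procedure or data used in the study.

```python
def auc_pairwise(scores_positive, scores_negative):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Ties count as half a correct ranking (Mann-Whitney formulation).
    """
    if not scores_positive or not scores_negative:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for p in scores_positive:
        for n in scores_negative:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_positive) * len(scores_negative))

# Hypothetical ratings: higher values are assumed to indicate safer driving.
passed_road_test = [72, 65, 80, 58]
failed_road_test = [60, 55, 70]
print(auc_pairwise(passed_road_test, failed_road_test))
```

    An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups.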

  18. Concurrent Criterion Validity of the Safe Driving Behavior Measure: A Predictor of On-Road Driving Outcomes

    PubMed Central

    Wang, Yanning; Winter, Sandra M.; Velozo, Craig A.; Lanford, Desiree N.; Bédard, Michel

    2013-01-01

    We determined the concurrent criterion validity of the Safe Driving Behavior Measure (SDBM) for on-road outcomes (passing or failing the on-road test as determined by a certified driving rehabilitation specialist) among older drivers and their family members–caregivers. On the basis of ratings from 168 older drivers and 168 family members–caregivers, we calculated receiver operating characteristic curves. The drivers’ area under the curve (AUC) was .620 (95% confidence interval [CI] = .514–.725, p = .043). The family members–caregivers’ AUC was .726 (95% CI = .622–.829, p ≤ .01). Older drivers’ ratings showed statistically significant yet poor concurrent criterion validity, but family members–caregivers’ ratings showed good concurrent criterion validity for the criterion on-road driving test. Continuing research with a more representative sample is being pursued to confirm the SDBM’s concurrent criterion validity. This screening tool may be useful for generalist practitioners to use in making decisions regarding driving. PMID:23245789

  19. Mixed group validation: a method to address the limitations of criterion group validation in research on malingering detection.

    PubMed

    Frederick, R I

    2000-01-01

    Mixed group validation (MGV) is offered as an alternative to criterion group validation (CGV) to estimate the true positive and false positive rates of tests and other diagnostic signs. CGV requires perfect confidence about each research participant's status with respect to the presence or absence of pathology. MGV determines diagnostic efficiencies based on group data; knowing an individual's status with respect to pathology is not required. MGV can use relatively weak indicators to validate better diagnostic signs, whereas CGV requires perfect diagnostic signs to avoid error in computing true positive and false positive rates. The process of MGV is explained, and a computer simulation demonstrates the soundness of the procedure. MGV of the Rey 15-Item Memory Test (Rey, 1958) for 723 pre-trial criminal defendants resulted in higher estimates of true positive rates and lower estimates of false positive rates as compared with prior research conducted with CGV. The author demonstrates how MGV addresses all the criticisms Rogers (1997b) outlined for differential prevalence designs in malingering detection research. Copyright 2000 John Wiley & Sons, Ltd.

  20. The Counselor Evaluation Rating Scale: A Valid Criterion of Counselor Effectiveness?

    ERIC Educational Resources Information Center

    Jones, Lawrence K.

    1974-01-01

    The validity of recent recommendations regarding the use of certain factors of the 16 Personality Factor Questionnaire (16PF) to select persons for counselor training programs, where the CERS was the criterion measure, is challenged. (Author)

  1. Criterion and incremental validity of the emotion regulation questionnaire

    PubMed Central

    Ioannidis, Christos A.; Siegling, A. B.

    2015-01-01

    Although research on emotion regulation (ER) is developing, little attention has been paid to the predictive power of ER strategies beyond established constructs. The present study examined the incremental validity of the Emotion Regulation Questionnaire (ERQ; Gross and John, 2003), which measures cognitive reappraisal and expressive suppression, over and above the Big Five personality factors. It also extended the evidence for the measure's criterion validity to yet unexamined criteria. A university student sample (N = 203) completed the ERQ, a measure of the Big Five, and relevant cognitive and emotion-laden criteria. Cognitive reappraisal predicted positive affect beyond personality, as well as experiential flexibility and constructive self-assertion beyond personality and affect. Expressive suppression explained incremental variance in negative affect beyond personality and in experiential flexibility beyond personality and general affect. No incremental effects were found for worry, social anxiety, rumination, reflection, and preventing negative emotions. Implications for the construct validity and utility of the ERQ are discussed. PMID:25814967

  2. Self-Reported Physical Activity within and outside the Neighborhood: Criterion-Related Validity of the Neighborhood Physical Activity Questionnaire in German Older Adults

    ERIC Educational Resources Information Center

    Bödeker, Malte; Bucksch, Jens; Wallmann-Sperlich, Birgit

    2018-01-01

    The Neighborhood Physical Activity Questionnaire allows the assessment of physical activity within and outside the neighborhood. Study objectives were to examine the criterion-related validity and health/functioning associations of Neighborhood Physical Activity Questionnaire-derived physical activity in German older adults. A total of 107 adults aged…

  3. Numerical and Experimental Validation of a New Damage Initiation Criterion

    NASA Astrophysics Data System (ADS)

    Sadhinoch, M.; Atzema, E. H.; Perdahcioglu, E. S.; van den Boogaard, A. H.

    2017-09-01

    Most commercial finite element software packages, like Abaqus, have a built-in coupled damage model where a damage evolution needs to be defined in terms of a single fracture energy value for all stress states. The Johnson-Cook criterion has been modified to be Lode parameter dependent and this Modified Johnson-Cook (MJC) criterion is used as a Damage Initiation Surface (DIS) in combination with the built-in Abaqus ductile damage model. An exponential damage evolution law has been used with a single fracture energy value. Ultimately, the simulated force-displacement curves are compared with experiments to validate the MJC criterion. Seven of the nine fracture experiments were predicted accurately. The limitations and accuracy of the failure predictions of the newly developed damage initiation criterion will be discussed briefly.

  4. Validation of the peak bilirubin criterion for outcome after partial hepatectomy.

    PubMed

    van Mierlo, Kim M C; Lodewick, Toine M; Dhar, Dipok K; van Woerden, Victor; Kurstjens, Ralph; Schaap, Frank G; van Dam, Ronald M; Vyas, Soumil; Malagó, Massimo; Dejong, Cornelis H C; Olde Damink, Steven W M

    2016-10-01

    Postoperative liver failure (PLF) is a dreaded complication after partial hepatectomy. The peak bilirubin criterion (>7.0 mg/dL or ≥120 μmol/L) is used to define PLF. This study aimed to validate the peak bilirubin criterion as postoperative risk indicator for 90-day liver-related mortality. Characteristics of 956 consecutive patients who underwent partial hepatectomy at the Maastricht University Medical Centre or Royal Free London between 2005 and 2012 were analyzed by uni- and multivariable analyses with odds ratios (OR) and 95% confidence intervals (95%CI). Thirty-five patients (3.7%) met the postoperative peak bilirubin criterion at median day 19 with a median bilirubin level of 183 [121-588] μmol/L. Sensitivity and specificity for liver-related mortality after major hepatectomy were 41.2% and 94.6%, respectively. The positive predictive value was 22.6%. Predictors of liver-related mortality were the peak bilirubin criterion (p < 0.001, OR = 15.9 [95%CI 5.2-48.7]), moderate-severe steatosis and fibrosis (p = 0.013, OR = 8.5 [95%CI 1.6-46.6]), ASA 3-4 (p = 0.047, OR = 3.0 [95%CI 1.0-8.8]) and age (p = 0.044, OR = 1.1 [95%CI 1.0-1.1]). The peak bilirubin criterion has a low sensitivity and positive predictive value for 90-day liver-related mortality after major hepatectomy. Copyright © 2016 International Hepato-Pancreato-Biliary Association Inc. Published by Elsevier Ltd. All rights reserved.
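
    Sensitivity, specificity and positive predictive value, as reported in this abstract, all derive from a 2x2 table of criterion result against outcome. The sketch below shows the arithmetic. The counts are hypothetical: they were picked only because they round to the percentages quoted above, and they are not taken from the study.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity and positive predictive value from 2x2 counts."""
    return {
        "sensitivity": tp / (tp + fn),   # share of true events that were flagged
        "specificity": tn / (tn + fp),   # share of non-events that were not flagged
        "ppv": tp / (tp + fp),           # share of flagged cases that were events
    }

# Hypothetical counts (assumption, not study data).
metrics = diagnostic_metrics(tp=7, fp=24, fn=10, tn=420)
for name, value in metrics.items():
    print(f"{name}: {value * 100:.1f}%")
```

    A low positive predictive value alongside high specificity is typical when the outcome is rare, because even a small false positive rate produces many false alarms relative to true events.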

  5. A new responder criterion (relative effect per patient (REPP) > 0.2) externally validated in a large total hip replacement multicenter cohort (EUROHIP).

    PubMed

    Huber, J; Hüsler, J; Dieppe, P; Günther, K P; Dreinhöfer, K; Judge, A

    2016-03-01

    To validate a new method to identify responders (relative effect per patient (REPP) >0.2) using the OMERACT-OARSI criteria as gold standard in a large multicentre sample. The REPP ([score before - after treatment]/score before treatment) was calculated for 845 patients of a large multicenter European cohort study for THR. The patients with a REPP >0.2 were defined as responders. The responder rate was compared to the gold standard (OMERACT-OARSI criteria) using receiver operator characteristic (ROC) curve analysis for sensitivity, specificity and percentage of appropriately classified patients. With the criterion REPP >0.2, 85.4% of the patients were classified as responders; applying the OARSI-OMERACT criteria, 85.7%. The new method had 98.8% sensitivity, 94.2% specificity and 98.1% of the patients were correctly classified compared to the gold standard. The external validation showed a high sensitivity and also specificity of a new criterion to identify a responder compared to the gold standard method. It is simple and has no uncertainties due to a single classification criterion. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.
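
    The REPP formula given in this abstract, (score before - score after) / score before, with responders defined by REPP > 0.2, can be written out directly. The function names and example scores below are illustrative assumptions, and the sketch assumes a scale on which lower scores after treatment mean improvement.

```python
def repp(score_before, score_after):
    """Relative effect per patient: (before - after) / before."""
    if score_before == 0:
        raise ValueError("REPP is undefined when the baseline score is zero")
    return (score_before - score_after) / score_before

def is_responder(score_before, score_after, threshold=0.2):
    """A patient counts as a responder when REPP exceeds the threshold."""
    return repp(score_before, score_after) > threshold

# Hypothetical symptom scores before and after treatment.
print(repp(50, 30), is_responder(50, 30))
print(repp(50, 45), is_responder(50, 45))
```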

  6. Reliability and criterion validity of an observation protocol for working technique assessments in cash register work.

    PubMed

    Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina

    2016-06-01

    We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.

  7. Validation of a Criterion Referenced Test for Young Handicapped Children: PIPER.

    ERIC Educational Resources Information Center

    Strum, Irene; Shapiro, Madelaine

    The purpose of this study was to validate the Prescriptive Instructional Program for Educational Readiness (PIPER) for utilization as a criterion referenced test (CRT) among learning disabled children. The program consisted of behavioral objectives and diagnostic and/or mastery tasks and activities for each objective in the area of gross motor…

  8. A Case for Transforming the Criterion of a Predictive Validity Study

    ERIC Educational Resources Information Center

    Patterson, Brian F.; Kobrin, Jennifer L.

    2011-01-01

    This study presents a case for applying a transformation (Box and Cox, 1964) of the criterion used in predictive validity studies. The goals of the transformation were to better meet the assumptions of the linear regression model and to reduce the residual variance of fitted (i.e., predicted) values. Using data for the 2008 cohort of first-time,…
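
    For reference, the one-parameter Box-Cox transformation cited here maps a positive value y to (y^λ - 1)/λ, or to log y when λ = 0. A minimal sketch follows; the function name and the example values are assumptions, and how the study chose λ is not stated in the visible part of the abstract.

```python
import math

def box_cox(y, lam):
    """One-parameter Box-Cox transform of a single positive value."""
    if y <= 0:
        raise ValueError("Box-Cox requires strictly positive values")
    if lam == 0:
        return math.log(y)          # limiting case as lambda approaches 0
    return (y ** lam - 1.0) / lam

# Hypothetical criterion values transformed with an arbitrary lambda.
values = [1.5, 2.0, 3.2, 4.0]
print([box_cox(v, 0.5) for v in values])
```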

  9. Criterion Validity of the Child's Challenging Behavior Scale, Version 2 (CCBS-2).

    PubMed

    Bourke-Taylor, Helen M; Cordier, Reinie; Pallant, Julie F

    The Child's Challenging Behavior Scale, Version 2 (CCBS-2), measures maternal rating of a child's challenging behaviors that compromise maternal mental health. The CCBS-2, the Child Behavior Checklist (CBCL), and the Strengths and Difficulties Questionnaire (SDQ) were compared in a sample of typically developing young Australian children. Criterion validity was investigated by correlating the CCBS-2 with "gold standard" measures (CBCL and SDQ subscales). Data were collected in a cross-sectional survey of mothers (N = 336) of children ages 3-9 yr. Correlations with the CBCL externalizing subscales demonstrated moderate (ρ = .46) to strong (ρ = .66) correlations. Correlations with the SDQ externalizing behaviors subscales were moderate (ρ = .35) to strong (ρ = .60). The criterion validity established in this study strengthens the psychometric properties that support ongoing development of the CCBS-2 as an efficient tool that may identify children in need of further evaluation. Copyright © 2018 by the American Occupational Therapy Association, Inc.

  10. An Evaluation of Available Models for Estimating the Reliability and Validity of Criterion Referenced Measures.

    ERIC Educational Resources Information Center

    Oakland, Thomas

    New strategies for evaluating criterion-referenced measures (CRM) are discussed. These strategies examine the following issues: (1) the use of norm-referenced measures (NRM) as CRM and then estimating the reliability and validity of such measures in terms of variance from an arbitrarily specified criterion score, (2) estimation of the…

  11. Meta-Analysis of Criterion Validity for Curriculum-Based Measurement in Written Language

    ERIC Educational Resources Information Center

    Romig, John Elwood; Therrien, William J.; Lloyd, John W.

    2017-01-01

    We used meta-analysis to examine the criterion validity of four scoring procedures used in curriculum-based measurement of written language. A total of 22 articles representing 21 studies (N = 21) met the inclusion criteria. Results indicated that two scoring procedures, correct word sequences and correct minus incorrect sequences, have acceptable…

  12. Further Validation of the IDAS: Evidence of Convergent, Discriminant, Criterion, and Incremental Validity

    ERIC Educational Resources Information Center

    Watson, David; O'Hara, Michael W.; Chmielewski, Michael; McDade-Montez, Elizabeth A.; Koffel, Erin; Naragon, Kristin; Stuart, Scott

    2008-01-01

    The authors explicated the validity of the Inventory of Depression and Anxiety Symptoms (IDAS; D. Watson et al., 2007) in 2 samples (306 college students and 605 psychiatric patients). The IDAS scales showed strong convergent validity in relation to parallel interview-based scores on the Clinician Rating version of the IDAS; the mean convergent…

  13. Development and Criterion Validity of Differentiated and Elevated Vocational Interests in Adolescence

    ERIC Educational Resources Information Center

    Hirschi, Andreas

    2009-01-01

    Interest differentiation and elevation are supposed to provide important information about a person's state of interest development, yet little is known about their development and criterion validity. The present study explored these constructs among a group of Swiss adolescents. Study 1 applied a cross-sectional design with 210 students in 11th…

  14. Development and validation of criterion-referenced clinically relevant fitness standards for maintaining physical independence in later years.

    PubMed

    Rikli, Roberta E; Jones, C Jessie

    2013-04-01

    To develop and validate criterion-referenced fitness standards for older adults that predict the level of capacity needed for maintaining physical independence into later life. The proposed standards were developed for use with a previously validated test battery for older adults, the Senior Fitness Test (Rikli, R. E., & Jones, C. J. (2001). Development and validation of a functional fitness test for community-residing older adults. Journal of Aging and Physical Activity, 6, 127-159; Rikli, R. E., & Jones, C. J. (1999a). Senior fitness test manual. Champaign, IL: Human Kinetics.). A criterion measure to assess physical independence was identified. Next, scores from a subset of 2,140 "moderate-functioning" older adults from a larger cross-sectional database, together with findings from longitudinal research on physical capacity and aging, were used as the basis for proposing fitness standards (performance cut points) associated with having the ability to function independently. Validity and reliability analyses were conducted to test the standards for their accuracy and consistency as predictors of physical independence. Performance standards are presented for men and women ages 60-94 indicating the level of fitness associated with remaining physically independent until late in life. Reliability and validity indicators for the standards ranged between .79 and .97. The proposed standards provide easy-to-use, previously unavailable methods for evaluating physical capacity in older adults relative to that associated with physical independence. Most importantly, the standards can be used in planning interventions that target specific areas of weakness, thus reducing risk for premature loss of mobility and independence.

  15. Construct and Criterion Validity of the PedsQL™ 4.0 Instrument (Pediatric Quality of Life Inventory) in Colombia.

    PubMed

    Amaya-Arias, Ana Carolina; Alzate, Juan Pablo; Eslava-Schmalbach, Javier H

    2017-01-01

    This study aimed at determining the validity of the Pediatric Quality of Life Inventory 4.0 (PedsQL™ 4.0) for the measurement of health-related quality of life (HRQOL) in Colombian children. Validation study of measurement instruments. The PedsQL™ 4.0 was applied by convenience sampling to 375 pairs of children and adolescents between the ages of 5 and 17 and to their parents-caregivers, as well as to 125 parents-caregivers of children between the ages of 2 and 4 in five cities of Colombia (Bogota, Medellin, Cali, Barranquilla and Bucaramanga). Construct validity was assessed through the use of exploratory and confirmatory factor analysis, and criterion validity was assessed by correlations between the PedsQL™ 4.0 and the KIDSCREEN-27. The instrument was applied to 375 children (ages 5-18) and 125 parents of children between the ages of 2 and 4. Factor analysis revealed four factors considered suitable for the sample in both the child and parent reports, whereas Bartlett's test of sphericity showed inter-correlation between variables. Scale and subscales showed proper indicators of internal consistency. It is recommended not to include or review some of the items in the Colombian version of the scale. The Spanish version for Colombia of the PedsQL™ 4.0 displays suitable indicators of criterion and construct validity, therefore becoming a valuable tool for measuring HRQOL in children in our country. Some modifications are recommended for the Colombian version of the scale.

  16. Evaluation of Weighted Scale Reliability and Criterion Validity: A Latent Variable Modeling Approach

    ERIC Educational Resources Information Center

    Raykov, Tenko

    2007-01-01

    A method is outlined for evaluating the reliability and criterion validity of weighted scales based on sets of unidimensional measures. The approach is developed within the framework of latent variable modeling methodology and is useful for point and interval estimation of these measurement quality coefficients in counseling and education…

  17. Criterion and content validity of a novel structured haggling contingent valuation question format versus the bidding game and binary with follow-up format.

    PubMed

    Onwujekwe, Obinna

    2004-02-01

    Contingent valuation question formats that will be used to elicit willingness to pay for goods and services need to be relevant to the area they will be used in order for responses to be valid. A novel contingent valuation question format called the "structured haggling technique" (SH) that resembles the bargaining system in Nigerian markets was designed and its criterion and content validity compared with those of the bidding game (BG) and binary-with-follow-up (BWFU) technique. This was achieved by determining the willingness to pay (WTP) for insecticide-treated nets (ITNs) in Southeast Nigeria. Content validity was determined through observation of actual trading of untreated nets together with interviews with sellers and consumers. Criterion validity was determined by comparing stated and actual WTP. Stated WTP was determined using a questionnaire administered to 810 household heads and actual WTP was determined by offering the nets for sale to all respondents one month later. The phi (correlation) coefficient was used to compare criterion validity across question formats. The phi coefficients were SH (0.60: 95% C.I. 0.50-0.71), BG (0.42: 95% C.I. 0.29-0.54) and the BWFU (0.32: 95% C.I. 0.20-0.44), implying that the BG and SH had similar levels of criterion-validity while the BWFU was the least criterion-valid. However, the SH was the most content-valid. It is necessary to validate the findings in other areas where haggling is common. Future studies should establish the content validity of question formats in the contexts in which they will be used before administering questionnaires.
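
The phi coefficient used above to compare stated and actual willingness to pay can be computed from a 2x2 table of counts. The sketch below is a minimal illustration, assuming both stated and actual WTP are coded as yes/no; the abstract does not say how the responses were dichotomized, and the counts in the example are hypothetical, not data from the study.

```python
import math


def phi_coefficient(a: int, b: int, c: int, d: int) -> float:
    """Phi coefficient for a 2x2 table of counts.

                    bought   did not buy
    stated yes        a          b
    stated no         c          d
    """
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        raise ValueError("a row or column total is zero; phi is undefined")
    return (a * d - b * c) / denom


# Hypothetical counts for illustration only (not taken from the study).
print(round(phi_coefficient(a=40, b=10, c=10, d=40), 2))  # prints 0.6
```

A phi of 1 would mean stated and actual behaviour agree perfectly, and 0 would mean no association.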

  18. Validity and Reliability of Criterion-Referenced Measures: Issues and Procedures for Special Educators.

    ERIC Educational Resources Information Center

    Harris, Larry P.; Wolf, Steven R.

    1979-01-01

    The article focuses on the controversy over norm-referenced vs. criterion-referenced measures (CRM) in assessment of learning disorders. The authors contend that while the reliability of CRMs is generally indisputable, the validity of measures designed from local curricula is still dependent on the intuitive judgments of teachers. (Author/SBH)

  19. Criterion Validity of the Mood and Feelings Questionnaire for Depressive Episodes in Clinic and Non-Clinic Subjects

    ERIC Educational Resources Information Center

    Daviss, W. Burleson; Birmaher, Boris; Melhem, Nadine A.; Axelson, David A.; Michaels, Shana M.; Brent, David A.

    2006-01-01

    Background: Previous measures of pediatric depression have shown inconsistent validity in groups with differing demographics, comorbid diagnoses, and clinic or non-clinic origins. The current study re-examines the criterion validity of child- and parent-versions of the Mood and Feelings Questionnaire (MFQ-C, MFQ-P) in a heterogeneous sample of…

  20. Current Concerns in Validity Theory.

    ERIC Educational Resources Information Center

    Kane, Michael

    Validity is concerned with the clarification and justification of the intended interpretations and uses of observed scores. It has not been easy to formulate a general methodology set of principles for validation, but progress has been made, especially as the field has moved from relatively limited criterion-related models to sophisticated…

  1. Development and Validation of Criterion-Referenced Clinically Relevant Fitness Standards for Maintaining Physical Independence in Later Years

    ERIC Educational Resources Information Center

    Rikli, Roberta E.; Jones, C. Jessie

    2013-01-01

    Purpose: To develop and validate criterion-referenced fitness standards for older adults that predict the level of capacity needed for maintaining physical independence into later life. The proposed standards were developed for use with a previously validated test battery for older adults--the Senior Fitness Test (Rikli, R. E., & Jones, C. J.…

  2. Construct and Criterion Validity of the PedsQL™ 4.0 Instrument (Pediatric Quality of Life Inventory) in Colombia

    PubMed Central

    Amaya-Arias, Ana Carolina; Alzate, Juan Pablo; Eslava-Schmalbach, Javier H

    2017-01-01

    Background: This study aimed at determining the validity of the Pediatric Quality of Life Inventory 4.0 (PedsQL™ 4.0) for the measurement of health-related quality of life (HRQOL) in Colombian children. Methods: Validation study of measurement instruments. The PedsQL™ 4.0 was applied by convenience sampling to 375 pairs of children and adolescents between the ages of 5 and 17 and to their parents-caregivers, as well as to 125 parents-caregivers of children between the ages of 2 and 4 in five cities of Colombia (Bogota, Medellin, Cali, Barranquilla and Bucaramanga). Construct validity was assessed through the use of exploratory and confirmatory factor analysis, and criterion validity was assessed by correlations between the PedsQL™ 4.0 and the KIDSCREEN-27. Results: The instrument was applied to 375 children (ages 5–18) and 125 parents of children between the ages of 2 and 4. Factor analysis revealed four factors considered suitable for the sample in both the child and parent reports, whereas Bartlett's test of sphericity showed inter-correlation between variables. Scale and subscales showed proper indicators of internal consistency. It is recommended not to include or review some of the items in the Colombian version of the scale. Conclusions: The Spanish version for Colombia of the PedsQL™ 4.0 displays suitable indicators of criterion and construct validity, therefore becoming a valuable tool for measuring HRQOL in children in our country. Some modifications are recommended for the Colombian version of the scale. PMID:28900536

  3. A new self-report inventory of dyslexia for students: criterion and construct validity.

    PubMed

    Tamboer, Peter; Vorst, Harrie C M

    2015-02-01

    The validity of a Dutch self-report inventory of dyslexia was ascertained in two samples of students. Six biographical questions, 20 general language statements and 56 specific language statements were based on dyslexia as a multi-dimensional deficit. Dyslexia and non-dyslexia were assessed with two criteria: identification with test results (Sample 1) and classification using biographical information (both samples). Using discriminant analyses, these criteria were predicted with various groups of statements. All together, 11 discriminant functions were used to estimate classification accuracy of the inventory. In Sample 1, 15 statements predicted the test criterion with classification accuracy of 98%, and 18 statements predicted the biographical criterion with classification accuracy of 97%. In Sample 2, 16 statements predicted the biographical criterion with classification accuracy of 94%. Estimations of positive and negative predictive value were 89% and 99%. Items of various discriminant functions were factor analysed to find characteristic difficulties of students with dyslexia, resulting in a five-factor structure in Sample 1 and a four-factor structure in Sample 2. Answer bias was investigated with measures of internal consistency reliability. Less than 20 self-report items are sufficient to accurately classify students with and without dyslexia. This supports the usefulness of self-assessment of dyslexia as a valid alternative to diagnostic test batteries. Copyright © 2015 John Wiley & Sons, Ltd.

  4. Occupation-specific screening for future sickness absence: criterion validity of the trucker strain monitor (TSM).

    PubMed

    De Croon, Einar M; Blonk, Roland W B; Sluiter, Judith K; Frings-Dresen, Monique H W

    2005-02-01

    Monitoring psychological job strain may help occupational physicians to take preventive action at the appropriate time. For this purpose, the 10-item trucker strain monitor (TSM) assessing work-related fatigue and sleeping problems in truck drivers was developed. This study examined (1) test-retest reliability, (2) criterion validity of the TSM with respect to future sickness absence due to psychological health complaints and (3) usefulness of the TSM two-scale structure. The TSM and self-administered questionnaires, providing information about stressful working conditions (job control and job demands) and sickness absence, were sent to a random sample of 2000 drivers in 1998. Of the 1123 responders, 820 returned a completed questionnaire 2 years later (response: 72%). The TSM work-related fatigue scale, the TSM sleeping problems scale and the TSM composite scale showed satisfactory 2-year test-retest reliability (coefficient r=0.62, 0.66 and 0.67, respectively). The work-related fatigue scale, sleeping problems scale and composite scale had sensitivities of 61, 65 and 61%, respectively, in identifying drivers with future sickness absence due to psychological health complaints. The specificity and positive predictive value of the TSM composite scale were 77 and 11%, respectively. The work-related fatigue scale and the sleeping problems scale were moderately strongly correlated (r=0.62). However, stressful working conditions were differentially associated with the two scales. The results support the test-retest reliability, criterion validity and two-factor structure of the TSM. In general, the results suggest that the use of occupation-specific psychological job strain questionnaires is fruitful.
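
A positive predictive value of 11% alongside a sensitivity of 61% and a specificity of 77% is what Bayes' rule gives when the outcome is rare. The sketch below is an illustration only: it assumes the three reported composite-scale figures describe the same sample and outcome, and the function names are hypothetical. The prevalence it derives is implied by the rounded figures; it is not a value reported in the abstract.

```python
def ppv(sensitivity: float, specificity: float, prevalence: float) -> float:
    """Positive predictive value from sensitivity, specificity and prevalence."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)


def implied_prevalence(sensitivity: float, specificity: float, ppv_value: float) -> float:
    """Solve the PPV formula for prevalence."""
    fpr = 1 - specificity
    return ppv_value * fpr / (sensitivity * (1 - ppv_value) + ppv_value * fpr)


# Rounded figures quoted in the abstract for the TSM composite scale.
prev = implied_prevalence(sensitivity=0.61, specificity=0.77, ppv_value=0.11)
print(f"implied prevalence: {prev:.3f}")
print(f"PPV check: {ppv(0.61, 0.77, prev):.2f}")
```

Under these assumptions the implied prevalence is between 4% and 5%, which is why most positive screens are false positives even though the scale is moderately sensitive and specific.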

  5. 29 CFR 1607.5 - General standards for validity studies.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... General standards for validity studies. A. Acceptable types of validity studies. For the purposes of satisfying these guidelines, users may rely upon criterion-related validity studies, content validity studies or construct validity...

  6. A Criterion-Related Validation Study of the Army Core Leader Competency Model

    DTIC Science & Technology

    2007-04-01

    2004). Transformational and transactional leadership: A meta-analytic test of their relative validity. Journal of Applied Psychology, 89, 755-768 ... performance criteria in an attempt to adjust ratings for this influence. Leader survey materials were developed and pilot tested at Ft. Drum and Ft... psychological constructs in the behavioral science realm. Numerous theories, popular literature, websites, assessments, and competency models are

  7. A Note on Economic Content and Test Validity.

    ERIC Educational Resources Information Center

    Soper, John C.; Brenneke, Judith Staley

    1987-01-01

    Offers practical tips on how teachers can determine whether classroom tests are actually measuring what they are designed to measure. Discusses criterion-related validity, construct validity, and content validity. Demonstrates how to determine the degree of content validity a particular test may have for a particular course or unit. (Author/DH)

  8. Assessment of performance validity in the Stroop Color and Word Test in mild traumatic brain injury patients: a criterion-groups validation design.

    PubMed

    Guise, Brian J; Thompson, Matthew D; Greve, Kevin W; Bianchini, Kevin J; West, Laura

    2014-03-01

    The current study assessed performance validity on the Stroop Color and Word Test (Stroop) in mild traumatic brain injury (TBI) using criterion-groups validation. The sample consisted of 77 patients with a reported history of mild TBI. Data from 42 moderate-severe TBI and 75 non-head-injured patients with other clinical diagnoses were also examined. TBI patients were categorized on the basis of Slick, Sherman, and Iverson (1999) criteria for malingered neurocognitive dysfunction (MND). Classification accuracy is reported for three indicators (Word, Color, and Color-Word residual raw scores) from the Stroop across a range of injury severities. With false-positive rates set at approximately 5%, sensitivity was as high as 29%. The clinical implications of these findings are discussed. © 2012 The British Psychological Society.

  9. Empirical Validation of Reading Proficiency Guidelines

    ERIC Educational Resources Information Center

    Clifford, Ray; Cox, Troy L.

    2013-01-01

    The validation of ability scales describing multidimensional skills is always challenging, but not impossible. This study applies a multistage, criterion-referenced approach that uses a framework of aligned texts and reading tasks to explore the validity of the ACTFL and related reading proficiency guidelines. Rasch measurement and statistical…

  10. Concurrent Criterion Validity of the Ausburg Multidimensional Personality Instrument (AMPI) Clinical Scales among College Students

    ERIC Educational Resources Information Center

    Kelly, William E.; Lutz, Daniel

    2014-01-01

    The concurrent criterion validity of the Ausburg Multidimensional Personality Instrument (AMPI) clinical scales was examined. The AMPI and several scales purportedly measuring the same or similar constructs as those of the AMPI clinical scales were administered to two samples of college students (N = 134 and N = 118). The correlations between the…

  11. Experimentally Manipulating Items Informs on the (Limited) Construct and Criterion Validity of the Humor Styles Questionnaire

    PubMed Central

    Ruch, Willibald; Heintz, Sonja

    2017-01-01

    How strongly does humor (i.e., the construct-relevant content) in the Humor Styles Questionnaire (HSQ; Martin et al., 2003) determine the responses to this measure (i.e., construct validity)? Also, how much does humor influence the relationships of the four HSQ scales, namely affiliative, self-enhancing, aggressive, and self-defeating, with personality traits and subjective well-being (i.e., criterion validity)? The present paper answers these two questions by experimentally manipulating the 32 items of the HSQ to only (or mostly) contain humor (i.e., construct-relevant content) or to substitute the humor content with non-humorous alternatives (i.e., only assessing construct-irrelevant context). Study 1 (N = 187) showed that the HSQ affiliative scale was mainly determined by humor, self-enhancing and aggressive were determined by both humor and non-humorous context, and self-defeating was primarily determined by the context. This suggests that humor is not the primary source of the variance in three of the HSQ scales, thereby limiting their construct validity. Study 2 (N = 261) showed that the relationships of the HSQ scales to the Big Five personality traits and subjective well-being (positive affect, negative affect, and life satisfaction) were consistently reduced (personality) or vanished (subjective well-being) when the non-humorous contexts in the HSQ items were controlled for. For the HSQ self-defeating scale, the pattern of relationships to personality was also altered, supporting a positive rather than a negative view of the humor in this humor style. The present findings thus call for a reevaluation of the role that humor plays in the HSQ (construct validity) and in the relationships to personality and well-being (criterion validity). PMID:28473794

  12. Criterion validity study of the cervical range of motion (CROM) device for rotational range of motion on healthy adults.

    PubMed

    Tousignant, Michel; Smeesters, Cécil; Breton, Anne-Marie; Breton, Emilie; Corriveau, Hélène

    2006-04-01

    This study compared range of motion (ROM) measurements using a cervical range of motion device (CROM) and an optoelectronic system (OPTOTRAK). To examine the criterion validity of the CROM for the measurement of cervical ROM on healthy adults. Whereas measurements of cervical ROM are recognized as part of the assessment of patients with neck pain, few devices are available in clinical settings. Two papers published previously showed excellent criterion validity for measurements of cervical flexion/extension and lateral flexion using the CROM. Subjects performed neck rotation, flexion/extension, and lateral flexion while sitting on a wooden chair. The ROM values were measured by the CROM as well as the OPTOTRAK. The cervical rotational ROM values using the CROM demonstrated a good to excellent linear relationship with those using the OPTOTRAK: right rotation, r = 0.89 (95% confidence interval, 0.81-0.94), and left rotation, r = 0.94 (95% confidence interval, 0.90-0.97). Similar results were also obtained for flexion/extension and lateral flexion ROM values. The CROM showed excellent criterion validity for measurements of cervical rotation. We propose using ROM values measured by the CROM as outcome measures for patients with neck pain.

  13. Incremental validity of the episode size criterion in binge-eating definitions: An examination in women with purging syndromes.

    PubMed

    Forney, K Jean; Bodell, Lindsay P; Haedt-Matt, Alissa A; Keel, Pamela K

    2016-07-01

    Of the two primary features of binge eating, loss of control (LOC) eating is well validated while the role of eating episode size is less clear. Given the ICD-11 proposal to eliminate episode size from the binge-eating definition, the present study examined the incremental validity of the size criterion, controlling for LOC. Interview and questionnaire data come from four studies of 243 women with bulimia nervosa (n = 141) or purging disorder (n = 102). Hierarchical linear regression tested if the largest reported episode size, coded in kilocalories, explained additional variance in eating disorder features, psychopathology, personality traits, and impairment, holding constant LOC eating frequency, age, and body mass index (BMI). Analyses also tested if episode size moderated the association between LOC eating and these variables. Holding LOC constant, episode size explained significant variance in disinhibition, trait anxiety, and eating disorder-related impairment. Episode size moderated the association of LOC eating with purging frequency and depressive symptoms, such that in the presence of larger eating episodes, LOC eating was more closely associated with these features. Neither episode size nor its interaction with LOC explained additional variance in BMI, hunger, restraint, shape concerns, state anxiety, negative urgency, or global functioning. Taken together, results support the incremental validity of the size criterion, in addition to and in combination with LOC eating, for defining binge-eating episodes in purging syndromes. Future research should examine the predictive validity of episode size in both purging and nonpurging eating disorders (e.g., binge eating disorder) to inform nosological schemes. © 2016 Wiley Periodicals, Inc. (Int J Eat Disord 2016; 49:651-662).

  14. [Criterion Validity of the German Version of the CES-D in the General Population].

    PubMed

    Jahn, Rebecca; Baumgartner, Josef S; van den Nest, Miriam; Friedrich, Fabian; Alexandrowicz, Rainer W; Wancata, Johannes

    2018-04-17

    The "Center of Epidemiologic Studies - Depression scale" (CES-D) is a well-known screening tool for depression. Until now the criterion validity of the German version of the CES-D had not been investigated in a sample of the adult general population. 508 study participants of the Austrian general population completed the CES-D. ICD-10 diagnoses were established by using the Schedules for Clinical Assessment in Neuropsychiatry (SCAN). Receiver Operating Characteristics (ROC) analysis was conducted. Possible gender differences were explored. Overall discriminating performance of the CES-D was sufficient (ROC-AUC 0.836). Using the traditional cut-off values of 15/16 and 21/22, the sensitivity was 43.2 % and 32.4 %, respectively. The cut-off value developed on the basis of our sample was 9/10 with a sensitivity of 81.1 % and a specificity of 74.3 %. There were no significant gender differences. This is the first study investigating the criterion validity of the German version of the CES-D in the general population. The optimal cut-off values yielded sufficient sensitivity and specificity, comparable to the values of other screening tools. © Georg Thieme Verlag KG Stuttgart · New York.
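
The abstract does not state how the sample-based cut-off was chosen. One common approach, shown here purely as an assumption, is to pick the threshold that maximises Youden's J (sensitivity + specificity - 1). The function name and the toy data are hypothetical; a case screens positive when its score is at or above the cut-off, so a cut-off written as 9/10 in the abstract's notation corresponds to 10 here.

```python
def best_cutoff(scores, has_diagnosis):
    """Return (cutoff, sensitivity, specificity) maximising Youden's J.

    A case screens positive when score >= cutoff. Ties in J are resolved
    in favour of the lowest cutoff.
    """
    n_pos = sum(1 for d in has_diagnosis if d)
    n_neg = len(has_diagnosis) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one case with and one without a diagnosis")

    best = None
    for cutoff in sorted(set(scores)):
        true_pos = sum(1 for s, d in zip(scores, has_diagnosis) if d and s >= cutoff)
        true_neg = sum(1 for s, d in zip(scores, has_diagnosis) if not d and s < cutoff)
        sensitivity = true_pos / n_pos
        specificity = true_neg / n_neg
        youden_j = sensitivity + specificity - 1
        if best is None or youden_j > best[0]:
            best = (youden_j, cutoff, sensitivity, specificity)
    return best[1:]


# Toy data for illustration only (not CES-D data from the study).
scores = [3, 5, 8, 9, 12, 14, 18, 22, 25, 30]
diagnosis = [False, False, False, False, True, False, True, True, True, True]
print(best_cutoff(scores, diagnosis))
```

Other criteria, such as fixing a minimum sensitivity, would generally select a different cut-off from the same data.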

  15. Criterion validity and accuracy of global positioning satellite and data logging devices for wheelchair tennis court movement

    PubMed Central

    Sindall, Paul; Lenton, John P.; Whytock, Katie; Tolfrey, Keith; Oyster, Michelle L.; Cooper, Rory A.; Goosey-Tolfrey, Victoria L.

    2013-01-01

    Purpose: To compare the criterion validity and accuracy of a 1 Hz non-differential global positioning system (GPS) and data logger device (DL) for the measurement of wheelchair tennis court movement variables. Methods: Initial validation of the DL device was performed. GPS and DL were fitted to the wheelchair and used to record distance (m) and speed (m/second) during (a) tennis field, (b) linear track, and (c) match-play test scenarios. Fifteen participants were monitored at the Wheelchair British Tennis Open. Results: Data logging validation showed underestimations for distance in right (DLR) and left (DLL) logging devices at speeds >2.5 m/second. In tennis-field tests, GPS underestimated distance in five drills. DLL was lower than both (a) criterion and (b) DLR in drills moving forward. Reversing drill direction showed that DLR was lower than (a) criterion and (b) DLL. GPS values for distance and average speed for match play were significantly lower than equivalent values obtained by DL (distance: 2816 (844) vs. 3952 (1109) m, P = 0.0001; average speed: 0.7 (0.2) vs. 1.0 (0.2) m/second, P = 0.0001). Higher peak speeds were observed in DL (3.4 (0.4) vs. 3.1 (0.5) m/second, P = 0.004) during tennis match play. Conclusions: Sampling frequencies of 1 Hz are too low to accurately measure distance and speed during wheelchair tennis. GPS units with a higher sampling rate should be advocated in further studies. Modifications to existing DL devices may be required to increase measurement precision. Further research into the validity of movement devices during match play will further inform the demands and movement patterns associated with wheelchair tennis. PMID:23820154

  16. The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

    PubMed

    Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

    2018-04-12

    To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (<5%) with values ranging from 1.7 to 9.5% across measures. Total time (41.63±2.05 s) during the Net-Test possessed low CV and significant (p<0.05) correlations with 10 m sprint time (1.98±0.12 s; CV=4.4%, r=0.72), 20 m sprint time (3.38±0.19 s; CV=3.9%, r=0.79), 505 Change-of-Direction time (2.47±0.08 s; CV=2.0%, r=0.80), and maximum oxygen uptake (46.59±2.58 mL·kg⁻¹·min⁻¹; CV=4.5%, r=-0.66). The Net-Test possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  17. Reliability and criterion validity of two applications of the iPhone™ to measure cervical range of motion in healthy participants

    PubMed Central

    2013-01-01

    Summary of background data: Recent smartphones, such as the iPhone, are often equipped with an accelerometer and magnetometer, which, through software applications, can perform various inclinometric functions. Although these applications are intended for recreational use, they have the potential to measure and quantify range of motion. The purpose of this study was to estimate the intra and inter-rater reliability as well as the criterion validity of the clinometer and compass applications of the iPhone in the assessment of cervical range of motion in healthy participants. Methods: The sample consisted of 28 healthy participants. Two examiners measured cervical range of motion of each participant twice using the iPhone (for the estimation of intra and inter-reliability) and once with the CROM (for the estimation of criterion validity). Estimates of reliability and validity were then established using the intraclass correlation coefficient (ICC). Results: We observed a moderate intra-rater reliability for each movement (ICC = 0.65-0.85) but a poor inter-rater reliability (ICC < 0.60). For the criterion validity, the ICCs are moderate (>0.50) to good (>0.65) for movements of flexion, extension, lateral flexions and right rotation, but poor (<0.50) for the movement left rotation. Conclusion: We found good intra-rater reliability and lower inter-rater reliability. When compared to the gold standard, these applications showed moderate to good validity. However, before using the iPhone as an outcome measure in clinical settings, studies should be done on patients presenting with cervical problems. PMID:23829201

  18. Criterion-related validity of self-reported stair climbing in older adults.

    PubMed

    Higueras-Fresnillo, Sara; Esteban-Cornejo, Irene; Gasque, Pablo; Veiga, Oscar L; Martinez-Gomez, David

    2018-02-01

    Stair climbing is an activity of daily living that might contribute to increased levels of physical activity (PA). To date, there is no study examining the validity of climbing stairs assessed by self-report. The aim of this study was, therefore, to examine the validity of estimated stair climbing from one question included in a common questionnaire compared to a pattern-recognition activity monitor in older adults. A total of 138 older adults (94 women), aged 65-86 years (70.9 ± 4.7 years), from the IMPACT65+ study participated in this validity study. Estimates of stair climbing were obtained from the European Prospective Investigation into Cancer and Nutrition (EPIC) PA questionnaire. An objective assessment of stair climbing was obtained with the Intelligent Device for Energy Expenditure and Activity (IDEEA) monitor. The correlation between both methods to assess stair climbing was fair (ρ = 0.22, p = 0.008 for PA energy expenditure and ρ = 0.26, p = 0.002 for duration). Mean values for self-report and the IDEEA were 7.96 ± 10.52 vs. 9.88 ± 3.32 METs-min/day for PA energy expenditure (PAEE), and 0.99 ± 1.32 vs. 1.79 ± 2.02 min/day for duration (both Wilcoxon test p < 0.001). Results from the Bland-Altman analysis indicate that the biases between the two instruments were -1.91 ± 10.30 METs-min/day and -0.80 ± 1.99 min/day, and the corresponding limits of agreement were from 18.27 to -22.10 METs-min/day and from 3.09 to -4.70 min/day, respectively. Our results indicate that self-reported stair climbing has modest validity to accurately rank old age participants, and underestimates both PAEE and its duration, as compared with an objectively measured method.
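
The Bland-Altman limits of agreement quoted above follow from the reported bias and spread as bias ± 1.96 × SD of the paired differences. The sketch below is a minimal check, assuming the "±" values in the abstract are the standard deviations of the differences; the function name is hypothetical.

```python
def limits_of_agreement(bias: float, sd_diff: float, z: float = 1.96):
    """Return (lower, upper) Bland-Altman 95% limits of agreement."""
    half_width = z * sd_diff
    return bias - half_width, bias + half_width


# Bias and SD as quoted in the abstract (self-report minus IDEEA).
lower, upper = limits_of_agreement(bias=-1.91, sd_diff=10.30)
print(f"energy expenditure: {lower:.2f} to {upper:.2f} METs-min/day")

lower, upper = limits_of_agreement(bias=-0.80, sd_diff=1.99)
print(f"duration: {lower:.2f} to {upper:.2f} min/day")
```

Both intervals agree with the published limits to within 0.01, a gap that is consistent with the abstract's bias and SD having been rounded before this recomputation.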

  19. Reliability and Criterion Validity of a Novel Clinical Test of Simple and Complex Reaction Time in Athletes

    PubMed Central

    Eckner, James T.; Richardson, James K.; Kim, Hogene; Joshi, Monica S.; Oh, Youkeun K.; Ashton-Miller, James A.

    2015-01-01

    Summary: Slowed reaction time (RT) represents both a risk factor for and a consequence of sport concussion. The purpose of this study was to determine the reliability and criterion validity of a novel clinical test of simple and complex RT, called RTclin, in contact sport athletes. Both tasks were adapted from the well-known ruler drop test of RT and involve manually grasping a falling vertical shaft upon its release, with the complex task employing a go/no-go paradigm based on a slight cue. In 46 healthy contact sport athletes (24 males: M age = 16.3 yr., SD = 5.0; 22 women: M age = 15.0 yr., SD = 4.0) whose sports included soccer, ice hockey, American football, martial arts, wrestling, and lacrosse, the latency and accuracy of simple and complex RTclin had acceptable test-retest and inter-rater reliabilities and correlated with a computerized criterion standard, the Axon Computerized Cognitive Assessment Tool. Medium to large effect sizes were found. The novel RTclin tests have acceptable reliability and criterion validity for clinical use and hold promise as concussion assessment tools. PMID:26106803

  20. [Criterion and Construct Validity in Nursing Diagnosis "Sedentary Lifestyle" in People over 50 Years Old].

    PubMed

    Guirao-Goris, Silamani J; Ferrer Ferrandis, Esperanza; Montejano Lozoya, Raimunda

    2016-02-18

    The aim of the study is to identify the construct and criterion validity of the nursing diagnosis label Sedentary Lifestyle. A cross-sectional study in a nursing consultation in primary health care was conducted. Participants were all people over 50 who attended the consultation during one year and who voluntarily wished to participate in the study (n=85). Objective weekly physical activity was measured in METs with an accelerometer, and objective performance was measured by gait speed from the EPESE battery (both measures were used as the gold standard); a physical activity questionnaire (RAPA) and the COOP-WONCA physical fitness chart were also used. Spearman correlation coefficients, mean comparison tests and analysis of sensitivity and specificity were used for statistical analysis. The diagnosis "Sedentary Lifestyle" showed a positive correlation between its manifestations and physical activity measured in METs (r=0.39) and EPESE gait speed (r=0.35). The diagnosis showed a sensitivity of 85.1% and a specificity of 65.2% and was able to discriminate active people from inactive people when METs were used as the measure of physical activity (t=-4.4). The diagnosis "Sedentary Lifestyle" shows criterion and construct validity.

  1. Psychometric properties of the mobility inventory for agoraphobia: convergent, discriminant, and criterion-related validity.

    PubMed

    Chambless, Dianne L; Sharpless, Brian A; Rodriguez, Dianeth; McCarthy, Kevin S; Milrod, Barbara L; Khalsa, Shabad-Ratan; Barber, Jacques P

    2011-12-01

    Aims of this study were (a) to summarize the psychometric literature on the Mobility Inventory for Agoraphobia (MIA), (b) to examine the convergent and discriminant validity of the MIA's Avoidance Alone and Avoidance Accompanied rating scales relative to clinical severity ratings of anxiety disorders from the Anxiety Disorders Interview Schedule (ADIS), and (c) to establish a cutoff score indicative of interviewers' diagnosis of agoraphobia for the Avoidance Alone scale. A meta-analytic synthesis of 10 published studies yielded positive evidence for internal consistency and convergent and discriminant validity of the scales. Participants in the present study were 129 people with a diagnosis of panic disorder. Internal consistency was excellent for this sample, α=.95 for AAC and .96 for AAL. When the MIA scales were correlated with interviewer ratings, evidence for convergent and discriminant validity for AAL was strong (convergent r with agoraphobia severity ratings=.63 vs. discriminant rs of .10-.29 for other anxiety disorders) and more modest but still positive for AAC (.54 vs. .01-.37). Receiver operating curve analysis indicated that the optimal operating point for AAL as an indicator of ADIS agoraphobia diagnosis was 1.61, which yielded sensitivity of .87 and specificity of .73. Copyright © 2011. Published by Elsevier Ltd.

  2. Psychometric Properties of the Mobility Inventory for Agoraphobia: Convergent, Discriminant, and Criterion-Related Validity

    PubMed Central

    Chambless, Dianne L.; Sharpless, Brian A.; Rodriguez, Dianeth; McCarthy, Kevin S.; Milrod, Barbara L.; Khalsa, Shabad-Ratan; Barber, Jacques P.

    2012-01-01

    Aims of this study were (a) to summarize the psychometric literature on the Mobility Inventory for Agoraphobia (MIA), (b) to examine the convergent and discriminant validity of the MIA’s Avoidance Alone and Avoidance Accompanied rating scales relative to clinical severity ratings of anxiety disorders from the Anxiety Disorders Interview Schedule (ADIS), and (c) to establish a cutoff score indicative of interviewers’ diagnosis of agoraphobia for the Avoidance Alone scale. A meta-analytic synthesis of 10 published studies yielded positive evidence for internal consistency and convergent and discriminant validity of the scales. Participants in the present study were 129 people with a diagnosis of panic disorder. Internal consistency was excellent for this sample, α = .95 for AAC and .96 for AAL. When the MIA scales were correlated with interviewer ratings, evidence for convergent and discriminant validity for AAL was strong (convergent r with agoraphobia severity ratings = .63 vs. discriminant rs of .10-.29 for other anxiety disorders) and more modest but still positive for AAC (.54 vs. .01-.37). Receiver operating curve analysis indicated that the optimal operating point for AAL as an indicator of ADIS agoraphobia diagnosis was 1.61, which yielded sensitivity of .87 and specificity of .73. PMID:22035997

  3. Development, Validation, and Fairness of a Biographical Data Questionnaire for the Air Traffic Control Specialist Occupation

    DTIC Science & Technology

    2012-12-01

    Development and validation. ABA, BQ, and criterion data were extracted from AT-SAT concurrent, criterion-related validation database. Overall, 1,232...dependent on responses to the other instrument. 3 A subset of 260 controllers in the AT-SAT dataset had full and complete ABA, BQ, and criterion data (i.e... SAT cases with ABA, BQ, and criterion data (n=260) was very small, making fairness analyses with the validation sample impractical. However, the

  4. Criterion and Concurrent Validity of the activPAL™ Professional Physical Activity Monitor in Adolescent Females

    PubMed Central

    Dowd, Kieran P.; Harrington, Deirdre M.; Donnelly, Alan E.

    2012-01-01

    Background The activPAL has been identified as an accurate and reliable measure of sedentary behaviour. However, only limited information is available on the accuracy of the activPAL activity count function as a measure of physical activity, while no unit calibration of the activPAL has been completed to date. This study aimed to investigate the criterion validity of the activPAL, examine the concurrent validity of the activPAL, and perform and validate a value calibration of the activPAL in an adolescent female population. The performance of the activPAL in estimating posture was also compared with sedentary thresholds used with the ActiGraph accelerometer. Methodologies Thirty adolescent females (15 developmental; 15 cross-validation) aged 15–18 years performed 5 activities while wearing the activPAL, ActiGraph GT3X, and the Cosmed K4B2. A random coefficient statistics model examined the relationship between metabolic equivalent (MET) values and activPAL counts. Receiver operating characteristic analysis was used to determine activity thresholds and for cross-validation. The random coefficient statistics model showed a concordance correlation coefficient of 0.93 (standard error of the estimate = 1.13). An optimal moderate threshold of 2997 was determined using mixed regression, while an optimal vigorous threshold of 8229 was determined using receiver operating statistics. The activPAL count function demonstrated very high concurrent validity (r = 0.96, p<0.01) with the ActiGraph count function. Levels of agreement for sitting, standing, and stepping between direct observation and the activPAL and ActiGraph were 100%, 98.1%, 99.2% and 100%, 0%, 100%, respectively. Conclusions These findings suggest that the activPAL is a valid, objective measurement tool that can be used for both the measurement of physical activity and sedentary behaviours in an adolescent female population. PMID:23094069

  5. [Examination of the criterion validity of the MMPI-2 Depression, Anxiety, and Anger Content scales].

    PubMed

    Uluç, Sait

    2008-01-01

    Examination of the psychometric properties and content areas of the revised MMPI's (MMPI-2 [Minnesota Multiphasic Personality Inventory-2]) content scales is required. In this study the criterion-related validity of the MMPI-2 Depression, Anxiety, and Anger Content scales was examined using the following conceptually relevant scales: The Beck Depression Inventory (BDI), Beck Anxiety Inventory (BAI), and State-Trait Anger Scale (STAS). MMPI-2 Depression, Anxiety, and Anger Content scales, and BDI, BAI, and STAS were administered to a sample of 196 students at Middle East Technical University (n = 196; 122 female, 74 male). Regression analyses were performed to determine if these conceptually relevant scales contributed significantly beyond the content scales. The MMPI-2 Depression Content Scale was compared to BDI, the MMPI-2 Anxiety Content Scale was compared to BAI, and the MMPI-2 Anger Content Scale was compared to STAS. The internal consistency of the MMPI-2 Depression Content Scale (alpha = 0.82), the MMPI-2 Anxiety Content Scale (alpha = 0.73), and the MMPI-2 Anger Content Scale (alpha = 0.72) was obtained. Criterion validity of the 3 analyzed content scales was demonstrated for both males and females. The findings indicated that (1) the MMPI-2 Depression Content Scale provides information about the general level of depression, (2) the MMPI-2 Anxiety Content Scale assesses subjective anxiety rather than somatic anxiety, and (3) the MMPI-2 Anger Content Scale may provide information about the potential to act out. The findings also provide further evidence that the 3 conceptually relevant scales aid in the interpretation of MMPI-2 scores by contributing additional information beyond the clinical scales.

  6. Criterion and Construct Validity of an Isometric Midthigh-Pull Dynamometer for Assessing Whole-Body Strength in Professional Rugby League Players.

    PubMed

    Dobbin, Nick; Hunwicks, Richard; Jones, Ben; Till, Kevin; Highton, Jamie; Twist, Craig

    2018-02-01

    To examine the criterion and construct validity of an isometric midthigh-pull dynamometer to assess whole-body strength in professional rugby league players. Fifty-six male rugby league players (33 senior and 23 youth players) performed 4 isometric midthigh-pull efforts (ie, 2 on the dynamometer and 2 on the force platform) in a randomized and counterbalanced order. Isometric peak force was underestimated (P < .05) using the dynamometer compared with the force platform (95% LoA: -213.5 ± 342.6 N). Linear regression showed that peak force derived from the dynamometer explained 85% (adjusted R² = .85, SEE = 173 N) of the variance in the dependent variable, with the following prediction equation derived: predicted peak force = [1.046 × dynamometer peak force] + 117.594. Cross-validation revealed a nonsignificant bias (P > .05) between the predicted and peak force from the force platform and an adjusted R² (79.6%) that represented shrinkage of 0.4% relative to the cross-validation model (80%). Peak force was greater for the senior than the youth professionals using the dynamometer (2261.2 ± 222 cf 1725.1 ± 298.0 N, respectively; P < .05). The isometric midthigh pull assessed using a dynamometer underestimates criterion peak force but is capable of distinguishing muscle-function characteristics between professional rugby league players of different standards.
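
    The prediction equation quoted in the abstract above can be applied as in the following minimal sketch. The function name and the example input are hypothetical, and the sketch assumes that both forces are expressed in newtons, which the abstract implies but does not state for the equation itself.

```python
# Regression coefficients as quoted in the abstract.
SLOPE = 1.046
INTERCEPT = 117.594


def predict_force_platform_peak(dynamometer_peak_n: float) -> float:
    """Estimate force-platform peak force from a dynamometer reading.

    Assumes the reading is in newtons (an assumption, see lead-in).
    """
    return SLOPE * dynamometer_peak_n + INTERCEPT


if __name__ == "__main__":
    # Hypothetical dynamometer reading, not taken from the study data.
    reading = 2000.0
    print(f"predicted peak force: {predict_force_platform_peak(reading):.1f} N")
```

    Because the abstract reports SEE = 173 N, an individual prediction from such a function would carry an error of roughly that size.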

  7. The construct and criterion validity of the multi-source feedback process to assess physician performance: a meta-analysis

    PubMed Central

    Al Ansari, Ahmed; Donnon, Tyrone; Al Khalifa, Khalid; Darwish, Abdulla; Violato, Claudio

    2014-01-01

    Background The purpose of this study was to conduct a meta-analysis on the construct and criterion validity of multi-source feedback (MSF) to assess physicians and surgeons in practice. Methods In this study, we followed the guidelines for the reporting of observational studies included in a meta-analysis. In addition to PubMed and MEDLINE databases, the CINAHL, EMBASE, and PsycINFO databases were searched from January 1975 to November 2012. All articles listed in the references of the MSF studies were reviewed to ensure that all relevant publications were identified. All 35 articles were independently coded by two authors (AA, TD), and any discrepancies (eg, effect size calculations) were reviewed by the other authors (KA, AD, CV). Results Physician/surgeon performance measures from 35 studies were identified. A random-effects model of weighted mean effect size differences (d) resulted in: construct validity coefficients for the MSF system on physician/surgeon performance across different levels in practice ranged from d=0.14 (95% confidence interval [CI] 0.40–0.69) to d=1.78 (95% CI 1.20–2.30); construct validity coefficients for the MSF on physician/surgeon performance on two different occasions ranged from d=0.23 (95% CI 0.13–0.33) to d=0.90 (95% CI 0.74–1.10); concurrent validity coefficients for the MSF based on differences in assessor group ratings ranged from d=0.50 (95% CI 0.47–0.52) to d=0.57 (95% CI 0.55–0.60); and predictive validity coefficients for the MSF on physician/surgeon performance across different standardized measures ranged from d=1.28 (95% CI 1.16–1.41) to d=1.43 (95% CI 0.87–2.00). Conclusion The construct and criterion validity of the MSF system is supported by small to large effect size differences based on the MSF process and physician/surgeon performance across different clinical and nonclinical domain measures. PMID:24600300

  8. Validation of the Tuebingen CD-25 Inventory as a Measure of Postoperative Health-Related Quality of Life in Patients Treated for Cushing's Disease.

    PubMed

    Milian, Monika; Kreitschmann-Andermahr, Ilonka; Siegel, Sonja; Kleist, Bernadette; Führer-Sakel, Dagmar; Honegger, Juergen; Buchfelder, Michael; Psaras, Tsambika

    2015-01-01

    To evaluate the construct and criterion validity of the Tuebingen Cushing's disease quality of life inventory (Tuebingen CD-25) for application in patients treated for Cushing's disease (CD). A total of 176 patients with adrenocorticotropin hormone-dependent CD (144 of them female, overall mean age 46.1 ± 13.7 years) treated at 3 large tertiary referral centers in Germany were studied. Construct validity was assessed by hypothesis testing (self-perceived symptom reduction assessment) and contrasted groups (patients with vs. without hypercortisolism). For this purpose, already existing data from 55 CD patients were used, representing the hypercortisolemic group. Criterion validity (concurrent validity) was assessed in relation to the Cushing's quality of life questionnaire (CushingQoL), the Short Form 36 health survey (SF-36), and the body mass index (BMI). Patients with self-perceived remarkable symptom reduction had significantly lower Tuebingen CD-25 scores (i.e. better health-related quality of life) than patients with self-perceived insufficient symptom reduction (p < 0.05). Similarly, the mean scores of the Tuebingen CD-25 scales were lower in patients without hypercortisolism (total score 27.0 ± 17.2) compared to those with hypercortisolism (total score 45.3 ± 22.1; each p < 0.05), providing evidence for construct validity. Criterion validity was confirmed by the correlations between the Tuebingen CD-25 total score and the CushingQoL (Spearman's coefficient -0.733), as well as all scales of the SF-36 (Spearman's coefficient between -0.447 and -0.700). The analyses presented in this large-sample study provide robust evidence for the construct and criterion validity of the Tuebingen CD-25. © 2015 S. Karger AG, Basel.

  9. Establishing Reliability and Validity of the Criterion Referenced Exam of GeoloGy Standards EGGS

    NASA Astrophysics Data System (ADS)

    Guffey, S. K.; Slater, S. J.; Slater, T. F.; Schleigh, S.; Burrows, A. C.

    2016-12-01

    Discipline-based geoscience education researchers have considerable need for a criterion-referenced, easy-to-administer and -score conceptual diagnostic survey for undergraduates taking introductory science survey courses, so that faculty can better monitor the learning impacts of various interactive teaching approaches. To support ongoing education research across the geosciences, we are continuing to rigorously and systematically work to firmly establish the reliability and validity of the recently released Exam of GeoloGy Standards, EGGS. In educational testing, reliability refers to the consistency or stability of test scores whereas validity refers to the accuracy of the inferences or interpretations one makes from test scores. There are several types of reliability measures being applied to the iterative refinement of the EGGS survey, including test-retest, alternate form, split-half, internal consistency, and interrater reliability measures. EGGS rates strongly on most measures of reliability. For one, Cronbach's alpha measures inter-item correlations and provides a quantitative index of the extent to which students are answering items consistently throughout the test. Traditional item analysis methods, including item difficulty and item discrimination, further quantify the degree to which a particular item is reliably assessing students. Validity, on the other hand, is perhaps best described by the word accuracy. For example, content validity is the extent to which a measurement reflects the specific intended domain of the content, stemming from judgments of people who are either experts in the testing of that particular content area or are content experts. Perhaps more importantly, face validity is a judgement of how representative an instrument is of the science "at face value" and refers to the extent to which a test appears to measure the targeted scientific domain as viewed by laypersons
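
    As an illustration of the internal-consistency index mentioned in the abstract above, the following is a minimal sketch of the standard Cronbach's alpha formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total scores). The function name and the response matrix are hypothetical and are not EGGS data; the abstract does not describe how the authors computed alpha.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items matrix of item scores.

    scores: list of rows, one row per respondent, one column per item.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])  # number of items
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Hypothetical responses: 4 respondents, 3 dichotomously scored items.
    responses = [
        [1, 1, 1],
        [1, 1, 0],
        [0, 1, 0],
        [0, 0, 0],
    ]
    print(f"alpha = {cronbach_alpha(responses):.3f}")
```

    Alpha approaches 1 when respondents who score high on one item tend to score high on the others, which is the consistency the abstract refers to.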

  10. Neurological Outcome Scale for Traumatic Brain Injury: III. Criterion-Related Validity and Sensitivity to Change in the NABIS Hypothermia-II Clinical Trial

    PubMed Central

    Wilde, Elisabeth A.; Moretti, Paolo; MacLeod, Marianne C.; Pedroza, Claudia; Drever, Pamala; Fourwinds, Sierra; Frisby, Melisa L.; Beers, Sue R.; Scott, James N.; Hunter, Jill V.; Traipe, Elfrides; Valadka, Alex B.; Okonkwo, David O.; Zygun, David A.; Puccio, Ava M.; Clifton, Guy L.

    2013-01-01

    Abstract The Neurological Outcome Scale for Traumatic Brain Injury (NOS-TBI) is a measure assessing neurological functioning in patients with TBI. We hypothesized that the NOS-TBI would exhibit adequate concurrent and predictive validity and demonstrate more sensitivity to change, compared with other well-established outcome measures. We analyzed data from the National Acute Brain Injury Study: Hypothermia-II clinical trial. Participants were 16–45 years of age with severe TBI assessed at 1, 3, 6, and 12 months postinjury. For analysis of criterion-related validity (concurrent and predictive), Spearman's rank-order correlations were calculated between the NOS-TBI and the Glasgow Outcome Scale (GOS), GOS-Extended (GOS-E), Disability Rating Scale (DRS), and Neurobehavioral Rating Scale-Revised (NRS-R). Concurrent validity was demonstrated through significant correlations between the NOS-TBI and GOS, GOS-E, DRS, and NRS-R measured contemporaneously at 3, 6, and 12 months postinjury (all p<0.0013). For prediction analyses, the multiplicity-adjusted p value using the false discovery rate was <0.015. The 1-month NOS-TBI score was a significant predictor of outcome in the GOS, GOS-E, and DRS at 3 and 6 months postinjury (all p<0.015). The 3-month NOS-TBI significantly predicted GOS, GOS-E, DRS, and NRS-R outcomes at 6 and 12 months postinjury (all p<0.0015). Sensitivity to change was analyzed using Wilcoxon's signed rank-sum test of subsamples demonstrating no change in the GOS or GOS-E between 3 and 6 months. The NOS-TBI demonstrated higher sensitivity to change, compared with the GOS (p<0.038) and GOS-E (p<0.016). In summary, the NOS-TBI demonstrated adequate concurrent and predictive validity as well as sensitivity to change, compared with gold-standard outcome measures. The NOS-TBI may enhance prediction of outcome in clinical practice and measurement of outcome in TBI research. PMID:23617608

  11. POVERTY, INFANT MORTALITY, AND HOMICIDE RATES IN CROSS-NATIONAL PERSPECTIVE: ASSESSMENTS OF CRITERION AND CONSTRUCT VALIDITY

    PubMed Central

    Messner, Steven F.; Raffalovich, Lawrence E.; Sutton, Gretchen M.

    2011-01-01

    This paper assesses the extent to which the infant mortality rate might be treated as a “proxy” for poverty in research on cross-national variation in homicide rates. We have assembled a pooled, cross-sectional time-series dataset for 16 advanced nations over the 1993–2000 period that includes standard measures of infant mortality and homicide and also contains information on two commonly used “income-based” poverty measures: a measure intended to reflect “absolute” deprivation and a measure intended to reflect “relative” deprivation. With these data, we are able to assess the criterion validity of the infant mortality rate with reference to the two income-based poverty measures. We are also able to estimate the effects of the various indicators of disadvantage on homicide rates in regression models, thereby assessing construct validity. The results reveal that the infant mortality rate is more strongly correlated with “relative poverty” than with “absolute poverty,” although much unexplained variance remains. In the regression models, the measure of infant mortality and the relative poverty measure yield significant positive effects on homicide rates, while the absolute poverty measure does not exhibit any significant effects. Our analyses suggest that it would be premature to dismiss relative deprivation in cross-national research on homicide, and that disadvantage is best conceptualized and measured as a multidimensional construct. PMID:21643432

  12. Validity of the modified back-saver sit-and-reach test: a comparison with other protocols.

    PubMed

    Hui, S S; Yuen, P Y

    2000-09-01

    Studies have shown that the classical sit-and-reach (CSR) test, the modified sit-and-reach (MSR), and the newly developed back-saver sit-and-reach (BS) test have poor criterion-related validity in estimating low-back flexibility but yielded moderate criterion-related validity in hamstring flexibility. The V sit-and-reach (VSR) test was found to be practical but the validity has not been established. The purpose of this study was to propose a modified back-saver sit-and-reach (MBS) test, which incorporated all advantages of the various protocols, and to compare the criterion-related validity and reliability of all these tests. 158 college students (F = 96, and M = 62; age = 20.77 +/- 2.51) performed CSR, VSR, BS (left and right leg), and MBS (left and right leg) tests in a randomized order. Scores from each test were then correlated with the criterion measures. For all sit-and-reach tests, intraclass reliability (single trial) was very high (r = 0.89-0.98). MBS yielded significant and highest r with low-back and hamstring criterion for men (r = 0.47-0.67) and women (r = 0.23-0.54). The low-back and right hamstring validity of MBS for men were significantly (P < 0.01) higher than those from BS and CSR, whereas no differences in criterion-related validity were found between the MBS and other protocols in women. The ratings of perceived comfort among the sit-and-reach protocols were significantly different (P < 0.001) from each other. The MBS was rated the most comfortable test compared with the other protocols. The MBS test is not only a reliable test for hamstring and low-back flexibility, it is also more practical, with improved validity for hamstring and low-back flexibility in men compared with previous protocols.

  13. Empirical agreement in model validation.

    PubMed

    Jebeile, Julie; Barberousse, Anouk

    2016-04-01

    Empirical agreement is often used as an important criterion when assessing the validity of scientific models. However, it is by no means a sufficient criterion, as a model can be adjusted to fit available data even though it is based on hypotheses whose plausibility is known to be questionable. Our aim in this paper is to investigate the uses of empirical agreement within the process of model validation. Copyright © 2015 Elsevier Ltd. All rights reserved.

  14. Reliability and validity in a nutshell.

    PubMed

    Bannigan, Katrina; Watson, Roger

    2009-12-01

    To explore and explain the different concepts of reliability and validity as they are related to measurement instruments in social science and health care. There are different concepts contained in the terms reliability and validity and these are often explained poorly and there is often confusion between them. To develop some clarity about reliability and validity a conceptual framework was built based on the existing literature. The concepts of reliability, validity and utility are explored and explained. Reliability contains the concepts of internal consistency and stability and equivalence. Validity contains the concepts of content, face, criterion, concurrent, predictive, construct, convergent (and divergent), factorial and discriminant. In addition, for clinical practice and research, it is essential to establish the utility of a measurement instrument. To use measurement instruments appropriately in clinical practice, the extent to which they are reliable, valid and usable must be established.

  15. [Validity and Reliability of Korean Version of the Spiritual Care Competence Scale].

    PubMed

    Chung, Mi Ja; Park, Youngrye; Eun, Young

    2016-12-01

    The aim of this study was to examine the validity and reliability of the Korean Version of the Spiritual Care Competence Scale (K-SCCS). A cross-sectional study design was used. The K-SCCS consisted of 26 questions to measure spiritual care competence of nurses. Participants, 228 nurses who had more than 3 years' experience as a nurse, completed the survey. Confirmatory factor analysis was used to examine the construct validity, and correlations of K-SCCS and spiritual well-being (SWB) were used to examine the criterion validity of K-SCCS. Cronbach's alpha was used to test internal consistency. The construct and the criterion-related validity of K-SCCS were supported as measures of spiritual care competence. Cronbach's alpha was .95. Factor loadings of the 26 questions ranged from .60 to .96. Construct validity of K-SCCS was verified by confirmatory factor analysis (RMSEA=.08, CFI=.90, NFI=.85). Criterion validity compared to the SWB showed significant correlation (r=.44, p<.001). The findings suggest that K-SCCS serves as an appropriate measure of spiritual care competence with validity and reliability. However, further study is needed to retest the verification of the factor analysis related to factor 2 (professionalisation and improving the quality of spiritual care) and factor 3 (personal support and patient counseling). Therefore, we recommend using the total score without distinguishing subscales.

  16. Construction and Validation of the Perceived Opportunity to Craft Scale.

    PubMed

    van Wingerden, Jessica; Niks, Irene M W

    2017-01-01

    We developed and validated a scale to measure employees' perceived opportunity to craft (POC) in two separate studies conducted in the Netherlands (total N = 2329). POC is defined as employees' perception of their opportunity to craft their job. In Study 1, the perceived opportunity to craft scale (POCS) was developed and tested for its factor structure and reliability in an explorative way. Study 2 consisted of confirmatory analyses of the factor structure and reliability of the scale as well as examination of the discriminant and criterion-related validity of the POCS. The results indicated that the scale consists of one dimension and could be reliably measured with five items. Evidence was found for the discriminant validity of the POCS. The scale also showed criterion-related validity when correlated with job crafting (+), job resources (autonomy +; opportunities for professional development +), work engagement (+), and the inactive construct cynicism (-). We discuss the implications of these findings for theory and practice.

  17. Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

    PubMed

    Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

    2018-01-01

    The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.
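
    The ICC [3,1] statistic named in the abstract above can be sketched as follows, assuming the usual Shrout and Fleiss two-way, consistency, single-measurement form, ICC(3,1) = (MS_subjects − MS_error) / (MS_subjects + (k − 1) × MS_error). The function name and the angle values are hypothetical; the abstract does not state which software or exact computation the authors used.

```python
def icc_3_1(ratings):
    """ICC(3,1) for a subjects-by-measurements matrix.

    ratings: list of rows, one per subject, each with k measurements
    (for example, one column per device or per session).
    """
    n = len(ratings)     # subjects
    k = len(ratings[0])  # measurements per subject
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


if __name__ == "__main__":
    # Hypothetical pelvic rotation angles (degrees): column 0 from one
    # device, column 1 from another, for five subjects.
    angles = [
        [4.1, 4.3],
        [6.0, 5.8],
        [2.5, 2.9],
        [7.2, 7.0],
        [5.1, 5.4],
    ]
    print(f"ICC(3,1) = {icc_3_1(angles):.3f}")
```

    This form ignores any constant offset between the two columns, so a high value indicates consistent ranking and spacing of subjects rather than absolute agreement.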

  18. Design and validation of a comprehensive fecal incontinence questionnaire.

    PubMed

    Macmillan, Alexandra K; Merrie, Arend E H; Marshall, Roger J; Parry, Bryan R

    2008-10-01

    Fecal incontinence can have a profound effect on quality of life. Its prevalence remains uncertain because of stigma, lack of consistent definition, and dearth of validated measures. This study was designed to develop a valid clinical and epidemiologic questionnaire, building on current literature and expertise. Patients and experts undertook face validity testing. Construct validity, criterion validity, and test-retest reliability were evaluated. Construct validity comprised factor analysis and internal consistency of the quality of life scale. The validity of known groups was tested against 77 control subjects by using regression models. Questionnaire results were compared with a stool diary for criterion validity. Test-retest reliability was calculated from repeated questionnaire completion. The questionnaire achieved good face validity. It was completed by 104 patients. The quality of life scale had four underlying traits (factor analysis) and high internal consistency (overall Cronbach alpha = 0.97). Patients and control subjects answered the questionnaire significantly differently (P < 0.01) in known-groups validity testing. Criterion validity assessment found mean differences close to zero. Median reliability for the whole questionnaire was 0.79 (range, 0.35-1). This questionnaire compares favorably with other available instruments, although the interpretation of stool consistency requires further research. Its sensitivity to treatment still needs to be investigated.

  19. Spanish translation, cross-cultural adaptation, and validation of the Questionnaire for Diabetes-Related Foot Disease (Q-DFD)

    PubMed Central

    Castillo-Tandazo, Wilson; Flores-Fortty, Adolfo; Feraud, Lourdes; Tettamanti, Daniel

    2013-01-01

    Purpose To translate, cross-culturally adapt, and validate the Questionnaire for Diabetes-Related Foot Disease (Q-DFD), originally created and validated in Australia, for its use in Spanish-speaking patients with diabetes mellitus. Patients and methods The translation and cross-cultural adaptation were based on international guidelines. The Spanish version of the survey was applied to a community-based (sample A) and a hospital clinic-based sample (samples B and C). Samples A and B were used to determine criterion and construct validity comparing the survey findings with clinical evaluation and medical records, respectively; while sample C was used to determine intra- and inter-rater reliability. Results After completing the rigorous translation process, only four items were considered problematic and required a new translation. In total, 127 patients were included in the validation study: 76 to determine criterion and construct validity and 41 to establish intra- and inter-rater reliability. For an overall diagnosis of diabetes-related foot disease, a substantial level of agreement was obtained when we compared the Q-DFD with the clinical assessment (kappa 0.77, sensitivity 80.4%, specificity 91.5%, positive likelihood ratio [LR+] 9.46, negative likelihood ratio [LR−] 0.21); while an almost perfect level of agreement was obtained when it was compared with medical records (kappa 0.88, sensitivity 87%, specificity 97%, LR+ 29.0, LR− 0.13). Survey reliability showed substantial levels of agreement, with kappa scores of 0.63 and 0.73 for intra- and inter-rater reliability, respectively. Conclusion The translated and cross-culturally adapted Q-DFD showed good psychometric properties (validity, reproducibility, and reliability) that allow its use in Spanish-speaking diabetic populations. PMID:24039434
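
    The likelihood ratios reported in the abstract above follow from the reported sensitivity and specificity through the standard definitions LR+ = sensitivity / (1 − specificity) and LR− = (1 − sensitivity) / specificity. The sketch below reproduces that arithmetic; the function name is hypothetical, and the inputs are the rounded percentages from the abstract rather than the underlying patient counts.

```python
def likelihood_ratios(sensitivity: float, specificity: float):
    """Return (LR+, LR-) from sensitivity and specificity given as proportions."""
    lr_positive = sensitivity / (1.0 - specificity)
    lr_negative = (1.0 - sensitivity) / specificity
    return lr_positive, lr_negative


if __name__ == "__main__":
    # Q-DFD versus clinical assessment (values quoted in the abstract).
    lr_pos, lr_neg = likelihood_ratios(0.804, 0.915)
    print(f"clinical assessment: LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}")

    # Q-DFD versus medical records (values quoted in the abstract).
    lr_pos, lr_neg = likelihood_ratios(0.87, 0.97)
    print(f"medical records:     LR+ = {lr_pos:.1f}, LR- = {lr_neg:.2f}")
```

    With these inputs the computed ratios round to the figures given in the abstract (9.46 and 0.21; 29.0 and 0.13).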

  20. Criterion and concurrent validity of Conners Adult ADHD Diagnostic Interview for DSM-IV (CAADID) Spanish version.

    PubMed

    Ramos-Quiroga, Josep Antoni; Bosch, Rosa; Richarte, Vanesa; Valero, Sergi; Gómez-Barros, Nuria; Nogueira, Mariana; Palomar, Gloria; Corrales, Montse; Sáez-Francàs, Naia; Corominas, Margarida; Real, Alberto; Vidal, Raquel; Chalita, Pablo J; Casas, Miguel

    2012-01-01

    Attention deficit hyperactivity disorder (ADHD) is a common neuropsychiatric disorder in adulthood. Its diagnosis requires a retrospective evaluation of ADHD symptoms in childhood, the continuity of these symptoms in adulthood, and a differential diagnosis. For these reasons, diagnosis of ADHD in adults is a complex process which needs effective diagnostic tools. The aim was to analyse the criterion validity of the CAADID semi-structured interview, Spanish version, and its concurrent validity compared with other ADHD severity scales. An observational case-control study was conducted on 691 patients with ADHD. They were out-patients treated in a program for adults with ADHD in a hospital. A sensitivity of 98.86%, specificity of 67.68%, positive predictive value of 90.77% and negative predictive value of 94.87% were observed. Diagnostic precision was 91.46%. The kappa index of concordance between the clinical diagnostic interview and the CAADID was 0.88. Good concurrent validity was obtained: the CAADID correlated significantly with the WURS scale (r=0.522, P<.01), the ADHD Rating Scale (r=0.670, P<.01) and the CAARS (self-rating version: r=0.656, P<.01; observer-report version: r=0.514, P<.01). The CAADID is a valid and useful tool for the diagnosis of ADHD in adults for clinical, as well as for research purposes. Copyright © 2012 SEP y SEPB. Published by Elsevier España, S.L. All rights reserved.
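
    The indices listed in this abstract (sensitivity, specificity, predictive values, overall precision and kappa) can all be derived from a single 2x2 table of test result against reference diagnosis. The abstract does not report the underlying cell counts, so the sketch below uses hypothetical counts chosen purely for illustration; the function name `diagnostic_summary` is likewise an assumption.

```python
def diagnostic_summary(tp, fp, fn, tn):
    """Summarise a 2x2 table of test result vs. reference diagnosis.

    tp/fp/fn/tn are counts of true positives, false positives,
    false negatives and true negatives.
    """
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n  # observed agreement
    # Agreement expected by chance, from the marginal proportions.
    p_both_pos = ((tp + fp) / n) * ((tp + fn) / n)
    p_both_neg = ((fn + tn) / n) * ((fp + tn) / n)
    p_chance = p_both_pos + p_both_neg
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": accuracy,
        "kappa": (accuracy - p_chance) / (1 - p_chance),  # Cohen's kappa
    }


# Hypothetical counts, NOT the CAADID study data.
summary = diagnostic_summary(tp=40, fp=5, fn=10, tn=45)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

    Because the predictive values depend on how common the condition is in the sample, they would be expected to shift in a population with a different prevalence, whereas sensitivity and specificity are computed within the diseased and non-diseased groups separately.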

  1. Do candidate reactions relate to job performance or affect criterion-related validity? A multistudy investigation of relations among reactions, selection test scores, and job performance.

    PubMed

    McCarthy, Julie M; Van Iddekinge, Chad H; Lievens, Filip; Kung, Mei-Chuan; Sinar, Evan F; Campion, Michael A

    2013-09-01

    Considerable evidence suggests that how candidates react to selection procedures can affect their test performance and their attitudes toward the hiring organization (e.g., recommending the firm to others). However, very few studies of candidate reactions have examined one of the outcomes organizations care most about: job performance. We attempt to address this gap by developing and testing a conceptual framework that delineates whether and how candidate reactions might influence job performance. We accomplish this objective using data from 4 studies (total N = 6,480), 6 selection procedures (personality tests, job knowledge tests, cognitive ability tests, work samples, situational judgment tests, and a selection inventory), 5 key candidate reactions (anxiety, motivation, belief in tests, self-efficacy, and procedural justice), 2 contexts (industry and education), 3 continents (North America, South America, and Europe), 2 study designs (predictive and concurrent), and 4 occupational areas (medical, sales, customer service, and technological). Consistent with previous research, candidate reactions were related to test scores, and test scores were related to job performance. Further, there was some evidence that reactions affected performance indirectly through their influence on test scores. Finally, in no cases did candidate reactions affect the prediction of job performance by increasing or decreasing the criterion-related validity of test scores. Implications of these findings and avenues for future research are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved

  2. Changing abilities vs. changing tasks: Examining validity degradation with test scores and college performance criteria both assessed longitudinally.

    PubMed

    Dahlke, Jeffrey A; Kostal, Jack W; Sackett, Paul R; Kuncel, Nathan R

    2018-05-03

    We explore potential explanations for validity degradation using a unique predictive validation data set containing up to four consecutive years of high school students' cognitive test scores and four complete years of those students' college grades. This data set permits analyses that disentangle the effects of predictor-score age and timing of criterion measurements on validity degradation. We investigate the extent to which validity degradation is explained by criterion dynamism versus the limited shelf-life of ability scores. We also explore whether validity degradation is attributable to fluctuations in criterion variability over time and/or GPA contamination from individual differences in course-taking patterns. Analyses of multiyear predictor data suggest that changes to the determinants of performance over time have much stronger effects on validity degradation than does the shelf-life of cognitive test scores. The age of predictor scores had only a modest relationship with criterion-related validity when the criterion measurement occasion was held constant. Practical implications and recommendations for future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  3. Parents' and Adolescents' Perspectives on Parenting: Evaluating Conceptual Structure, Measurement Invariance, and Criterion Validity.

    PubMed

    Janssens, Annelies; Goossens, Luc; Van Den Noortgate, Wim; Colpin, Hilde; Verschueren, Karine; Van Leeuwen, Karla

    2015-08-01

    Uncertainty persists regarding adequate measurement of parenting behavior during early adolescence. The present study aimed to clarify the conceptual structure of parenting by evaluating three different models that include support, psychological control, and various types of behavioral control (i.e., proactive, punitive, and harsh punitive control). Furthermore, we examined measurement invariance of parenting ratings by 1,111 Flemish adolescents in Grades 7 through 9, their mothers, and their fathers. Finally, criterion validity of parenting ratings was estimated in relation to adolescent problem behavior. Results supported a five-factor parenting model indicating multiple aspects of behavioral control, with punitive and harsh punitive control as more intrusive forms and proactive control as a more supportive form. Similar constructs were measured for adolescents, mothers, and fathers (i.e., configural and metric invariance), but on a different scale (i.e., scalar noninvariance). Future research and clinical practice should acknowledge these findings in order to fully grasp the parenting process. © The Author(s) 2014.

  4. [Development and validity of workplace bullying in nursing-type inventory (WPBN-TI)].

    PubMed

    Lee, Younju; Lee, Mihyoung

    2014-04-01

    The purpose of this study was to develop an instrument to assess bullying of nurses, and to test the validity and reliability of the instrument. The initial thirty items of the WPBN-TI were identified through a review of the literature on types of bullying related to nursing and in-depth interviews with 14 nurses who had experienced bullying at work. Sixteen items were developed through 2 content validity tests by 9 experts and 10 nurses. The final WPBN-TI instrument was evaluated by 458 nurses from five general hospitals in the Incheon metropolitan area. The SPSS 18.0 program was used to assess the instrument based on internal consistency reliability, construct validity, and criterion validity. The WPBN-TI consisted of 16 items with three distinct factors (verbal and nonverbal bullying, work-related bullying, and external threats), which explained 60.3% of the total variance. The convergent validity and discriminant validity for the WPBN-TI were 100.0% and 89.7%, respectively. Known-groups validity of the WPBN-TI was supported by the mean difference in scores according to subjective perception of bullying. Criterion validity for the WPBN-TI was satisfactory, at more than .70. Reliability of the WPBN-TI, measured by Cronbach's α, was .91. The WPBN-TI, with high validity and reliability, is suitable for determining types of bullying in the nursing workplace.

  5. The EQ-5D-5L is a valid approach to measure health related quality of life in patients undergoing bariatric surgery.

    PubMed

    Fermont, Jilles M; Blazeby, Jane M; Rogers, Chris A; Wordsworth, Sarah

    2017-01-01

    Bariatric surgery is considered an effective treatment for individuals with severe and complex obesity. Besides reducing weight and improving obesity-related comorbidities such as diabetes, bariatric surgery could improve patients' health-related quality of life. However, the frequently used instrument to measure quality of life, the EQ-5D, has not been validated for use in bariatric surgery, which is a major limitation to its use in this clinical context. Our study undertook a psychometric validation of the 5-level EQ-5D (EQ-5D-5L) using clinical trial data to measure health-related quality of life in patients with severe and complex obesity undergoing bariatric surgery. Health-related quality of life was assessed at baseline (before randomisation) and six months later in 189 patients in a randomised controlled trial of bariatric surgery. Patients completed two generic health-related quality of life instruments, the EQ-5D-5L and SF-12, which were used together for the validation using data from all patients in the trial as the trial is ongoing. Psychometric analyses included construct and criterion validity and responsiveness to change. Of the 189 validation patients, 141 (75%) were female, the median age was 49 years (range 23-70 years) and body mass index ranged from 33 to 70 kg/m2. For construct validity, there were significant improvements in the distribution of responses in all EQ-5D dimensions between baseline and 6 months after randomisation. For criterion validity, the highest degree of correlation was between the EQ-5D pain/discomfort and SF-12 bodily pain domain. For responsiveness, the EQ-5D and SF-12 showed statistically significant improvements in health-related quality of life between baseline and 6 months after randomisation. The EQ-5D-5L is a valid generic measure for measuring health-related quality of life in bariatric surgery patients.

  6. Examining the validity of AHRQ's patient safety indicators (PSIs): is variation in PSI composite score related to hospital organizational factors?

    PubMed

    Shin, Marlena H; Sullivan, Jennifer L; Rosen, Amy K; Solomon, Jeffrey L; Dunn, Edward J; Shimada, Stephanie L; Hayes, Jennifer; Rivard, Peter E

    2014-12-01

    Increasing use of Agency for Healthcare Research and Quality's Patient Safety Indicators (PSIs) for hospital performance measurement intensifies the need to critically assess their validity. Our study examined the extent to which variation in PSI composite score is related to differences in hospital organizational structures or processes (i.e., criterion validity). In site visits to three Veterans Health Administration hospitals with high and three with low PSI composite scores ("low performers" and "high performers," respectively), we interviewed a cross-section of hospital staff. We then coded interview transcripts for evidence in 13 safety-related domains and assessed variation across high and low performers. Evidence of leadership and coordination of work/communication (organizational process domains) was predominantly favorable for high performers only. Evidence in the other domains was either mixed, or there were insufficient data to rate the domains. While we found some evidence of criterion validity, the extent to which variation in PSI rates is related to differences in hospitals' organizational structures/processes needs further study. © The Author(s) 2014.

  7. Criterion-related validity of the short form of the international physical activity questionnaire in adults who are Deaf.

    PubMed

    Menezes, Diogo; Laranjo, Luís; Marmeleira, José

    2017-01-01

    To implement appropriate programs for promoting physical activity (PA) in people who are Deaf, it is important to have valid instruments for assessing PA in this population. The main purpose of this study was to examine the criterion validity of the short form of the International Physical Activity Questionnaire (IPAQ-S) in Deaf adults. This study included 44 adults (18-65 years) of both genders (63.6% were females) who met the inclusion criteria. Objective measures of PA were collected using accelerometers, which were worn by each participant during one week. After using the accelerometer, the IPAQ-S was applied to assess participants' physical activity during the last 7 days. There was no significant correlation between the average time spent in moderate to vigorous physical activity (MVPA) as measured by the accelerometer (40.1 ± 24.5 min/day) and by the IPAQ-S (41.3 ± 57.5 min/day). The IPAQ-S significantly underestimated the time spent in sedentary behavior (7.6 ± 2.7 h/day vs. 10.1 ± 1.6 h/day). Sedentary behavior and MVPA as measured by the accelerometer and the IPAQ-S showed limited agreement. Our results show some limitations on the use of IPAQ-S for quantifying PA among adults who are Deaf. The IPAQ-S tends to overestimate the MVPA and to underestimate sedentary behavior in adults who are Deaf. Copyright © 2016 Elsevier Inc. All rights reserved.

  8. Validity, Responsiveness, Minimal Detectable Change, and Minimal Clinically Important Change of the Pediatric Motor Activity Log in Children with Cerebral Palsy

    ERIC Educational Resources Information Center

    Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling

    2012-01-01

    This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and…

  9. Age- and Sex-Specific Criterion Validity of the Health Survey for England Physical Activity and Sedentary Behavior Assessment Questionnaire as Compared With Accelerometry

    PubMed Central

    Scholes, Shaun; Coombs, Ngaire; Pedisic, Zeljko; Mindell, Jennifer S.; Bauman, Adrian; Rowlands, Alex V.; Stamatakis, Emmanuel

    2014-01-01

    The criterion validity of the 2008 Physical Activity and Sedentary Behavior Assessment Questionnaire (PASBAQ) was examined in a nationally representative sample of 2,175 persons aged ≥16 years in England using accelerometry. Using accelerometer minutes/day greater than or equal to 200 counts as a criterion, Spearman's correlation coefficient (ρ) for PASBAQ-assessed total activity was 0.30 (95% confidence interval (CI): 0.25, 0.35) in women and 0.20 (95% CI: 0.15, 0.26) in men. Correlations between accelerometer counts/minute of wear time and questionnaire-assessed relative energy expenditure (metabolic equivalent-minutes/day) were higher in women (ρ = 0.41, 95% CI: 0.36, 0.46) than in men (ρ = 0.32, 95% CI: 0.26, 0.38). Similar correlations were observed for minutes/day spent in vigorous activity (women: ρ = 0.39, 95% CI: 0.33, 0.46; men: ρ = 0.31, 95% CI: 0.26, 0.36) and moderate-to-vigorous activity (women: ρ = 0.42, 95% CI: 0.36, 0.48; men: ρ = 0.38, 95% CI: 0.32, 0.45). Correlations for time spent being sedentary (<100 counts/minute) were 0.30 (95% CI: 0.24, 0.35) and 0.25 (95% CI: 0.19, 0.30) in women and men, respectively. Sedentary behavior correlations showed no sex difference. The validity of sedentary behavior and total physical activity was higher in older age groups, but validity was higher in younger persons for vigorous-intensity activity. The PASBAQ is a useful and valid instrument for ranking individuals according to levels of physical activity and sedentary behavior. PMID:24863551

  10. Translation and validation of the Cancer-Related Fatigue Scale in Greek in a sample of patients with advanced prostate cancer

    PubMed Central

    Kaite, Charis; Constantinou, Marianna; Kouta, Christiana

    2016-01-01

    Objective To translate and validate the Cancer-Related Fatigue (CRF) Scale in the Greek language. Design A cross-sectional descriptive design was used in order to translate and validate the CRF Scale in Greek. Factor analyses were performed to understand the psychometric properties of the scale and to establish construct, criterion and convergent validity. Setting Outpatients' oncology clinics of two public hospitals in Cyprus. Participants 148 patients with advanced prostate cancer undergoing chemotherapy. Results The Cancer Fatigue Scale (CFS) had good stability (test–retest reliability r=0.79, p<0.001) and good internal consistency (Cronbach's α coefficient for all 15 items α=0.916). Furthermore, the Kaiser-Meyer-Olkin Measure of Sampling Adequacy (KMO value) was found to be 0.743 and considered to be satisfactory (>0.5). The correlations between the CFS physical scale (CFS-FS scale) and the European Organization for Research and Treatment of Cancer (EORTC) QLQ-C30 physical subscales were found to be significant (r=−0.715). The same was true of the correlation between the CFS cognitive and EORTC cognitive subscales (r=−0.579). Overall, the criterion validity was verified. The convergent validity of the CFS was likewise supported, since all correlations with the Global Health Status (q29–q30) were found to be significant. Conclusions This is the first validation study of the CRF Scale in Greek, and it warrants the scale's use in the assessment of cancer-related fatigue in patients with prostate cancer. However, further testing and validation are needed in the early stages of the disease and in patients in later chemotherapy cycles. PMID:27913557

  11. Proposed modification of the criterion for the region of validity of the inverse-power expansion in diatomic long-range potentials

    NASA Astrophysics Data System (ADS)

    Ji, Bing; Tsai, Chin-Chun; Stwalley, William C.

    1995-04-01

    A modified internuclear distance criterion, RLR-m, as the lower bound for the region of validity of the inverse-power expansion of the diatomic long-range potential is proposed. This new criterion takes into account the spatial orientation of the atomic orbitals while retaining the simplicity of the traditional Le Roy radius, RLR, for the interaction of S state atoms. Recent experimental and theoretical results for various excited states in Na2 suggest that this proposed RLR-m is an appropriate generalization of RLR.

  12. Revision, Criterion Validity, and Multi-group Assessment of the Reactions to Homosexuality Scale

    PubMed Central

    Smolenski, Derek J.; Diamond, Pamela M.; Ross, Michael W.; Simon Rosser, B. R.

    2010-01-01

    Internalized homonegativity encompasses negative attitudes toward one’s own sexual orientation, and is associated with negative mental and physical health outcomes. The Reactions to Homosexuality scale (Ross & Rosser, 1996), an instrument used to measure internalized homonegativity, has been criticized for including content irrelevant to the construct of internalized homonegativity. We revised the scale using exploratory and confirmatory factor analyses, and identified a seven-item, three-factor reduced version that demonstrated measurement invariance across racial/ethnic categorizations and between English and Spanish versions. We also investigated criterion validity by estimating correlations with hypothesized outcomes associated with outness, relationship status, sexual orientation, and gay community affiliation. The evidence of measurement invariance suggests that this scale is appropriate for pluralistic treatment or study groups. PMID:20954058

  13. Standards Performance Continuum: Development and Validation of a Measure of Effective Pedagogy.

    ERIC Educational Resources Information Center

    Doherty, R. William; Hilberg, R. Soleste; Epaloose, Georgia; Tharp, Roland G.

    2002-01-01

    Describes the development and validation of the Standards Performance Continuum (SPC) for assessing teacher performance of the Standards for Effective Pedagogy. Three studies involving Florida, California, and New Mexico public school teachers provided evidence of inter-rater reliability, concurrent validity, and criterion-related validity…

  14. Questioning the Validity of the 4+/5+ Binge or Heavy Drinking Criterion in College and Clinical Populations

    PubMed Central

    Pearson, Matthew R.; Kirouac, Megan; Witkiewitz, Katie

    2015-01-01

    Background and Aims The terms “binge drinking” and “heavy drinking” are both typically operationalized as 4+/5+ standard drinks per occasion for women/men and are commonly used as a proxy for non-problematic (<4/<5) versus problematic (4+/5+) drinking in multiple research contexts. The Food and Drug Administration in the United States (US) recently proposed the 4+/5+ criterion as a primary efficacy endpoint in their guidance for trials examining new medications for alcohol use disorders (AUDs). Internationally, similar cut-offs have been proposed, with the European Medicines Agency having identified reductions in the number of heavy drinking days (defined as 40/60g pure alcohol in women/men) as a primary endpoint for efficacy trials with a harm reduction goal. Analysis and Evidence We question the validity of the 4+/5+ cutoff (and other similar cutoffs) on multiple accounts. The 4+/5+ cutoff has not been shown to have unique predictive validity or clinical utility. The cutoff has been created based on retrospective self-reports and its use demonstrates ecological bias. Given strong evidence that the relationship between alcohol consumption and problems related to drinking is at least monotonic, if not linear, there is little existing evidence to support the 4+/5+ cutoff as a valid marker of problematic alcohol use. Conclusions There is little empirical evidence for the 4+/5+ units per occasion threshold for “binge” or “heavy” drinking in indexing treatment efficacy. Further consideration of an appropriate threshold seems to be warranted. PMID:27605077

  15. Age- and sex-specific criterion validity of the health survey for England Physical Activity and Sedentary Behavior Assessment Questionnaire as compared with accelerometry.

    PubMed

    Scholes, Shaun; Coombs, Ngaire; Pedisic, Zeljko; Mindell, Jennifer S; Bauman, Adrian; Rowlands, Alex V; Stamatakis, Emmanuel

    2014-06-15

    The criterion validity of the 2008 Physical Activity and Sedentary Behavior Assessment Questionnaire (PASBAQ) was examined in a nationally representative sample of 2,175 persons aged ≥16 years in England using accelerometry. Using accelerometer minutes/day greater than or equal to 200 counts as a criterion, Spearman's correlation coefficient (ρ) for PASBAQ-assessed total activity was 0.30 (95% confidence interval (CI): 0.25, 0.35) in women and 0.20 (95% CI: 0.15, 0.26) in men. Correlations between accelerometer counts/minute of wear time and questionnaire-assessed relative energy expenditure (metabolic equivalent-minutes/day) were higher in women (ρ = 0.41, 95% CI: 0.36, 0.46) than in men (ρ = 0.32, 95% CI: 0.26, 0.38). Similar correlations were observed for minutes/day spent in vigorous activity (women: ρ = 0.39, 95% CI: 0.33, 0.46; men: ρ = 0.31, 95% CI: 0.26, 0.36) and moderate-to-vigorous activity (women: ρ = 0.42, 95% CI: 0.36, 0.48; men: ρ = 0.38, 95% CI: 0.32, 0.45). Correlations for time spent being sedentary (<100 counts/minute) were 0.30 (95% CI: 0.24, 0.35) and 0.25 (95% CI: 0.19, 0.30) in women and men, respectively. Sedentary behavior correlations showed no sex difference. The validity of sedentary behavior and total physical activity was higher in older age groups, but validity was higher in younger persons for vigorous-intensity activity. The PASBAQ is a useful and valid instrument for ranking individuals according to levels of physical activity and sedentary behavior. © The Author 2014. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health.

  16. Development and validation of Triarchic construct scales from the psychopathic personality inventory.

    PubMed

    Hall, Jason R; Drislane, Laura E; Patrick, Christopher J; Morano, Mario; Lilienfeld, Scott O; Poythress, Norman G

    2014-06-01

    The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy-the Psychopathic Personality Inventory (PPI). A consensus-rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and nonoffender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy.

  17. Development and Validation of Triarchic Construct Scales from the Psychopathic Personality Inventory

    PubMed Central

    Hall, Jason R.; Drislane, Laura E.; Patrick, Christopher J.; Morano, Mario; Lilienfeld, Scott O.; Poythress, Norman G.

    2014-01-01

    The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy—the Psychopathic Personality Inventory (PPI). A consensus rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and non-offender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy. PMID:24447280

  18. Color Trails Test: normative data and criterion validity for the Greek adult population.

    PubMed

    Messinis, Lambros; Malegiannaki, Amaryllis-Chryssi; Christodoulou, Tessa; Panagiotopoulos, Vassillis; Papathanasopoulos, Panagiotis

    2011-06-01

    The Color Trails Test (CTT) was developed as a culturally fair analog of the Trail Making Test. In the present study, normative data for the CTT were developed for the Greek adult population and further the criterion validity of the CTT was examined in two clinical groups (29 Parkinson's disease [PD] and 25 acute stroke patients). The instrument was applied to 163 healthy participants, aged 19-75. Stepwise linear regression analyses revealed a significant influence of age and education level on completion time in both parts of the CTT (increased age and decreased educational level contributed to slower completion times for both parts), whereas gender did not influence time to completion of part B. Further, the CTT appears to discriminate adequately between the performance of PD and acute stroke patients and matched healthy controls.

  19. The Validation of a Case-Based, Cumulative Assessment and Progressions Examination

    PubMed Central

    Coker, Adeola O.; Copeland, Jeffrey T.; Gottlieb, Helmut B.; Horlen, Cheryl; Smith, Helen E.; Urteaga, Elizabeth M.; Ramsinghani, Sushma; Zertuche, Alejandra; Maize, David

    2016-01-01

    Objective. To assess content and criterion validity, as well as reliability, of an internally developed, case-based, cumulative, high-stakes third-year Annual Student Assessment and Progression Examination (P3 ASAP Exam). Methods. Content validity was assessed through the writing-reviewing process. Criterion validity was assessed by comparing student scores on the P3 ASAP Exam with the nationally validated Pharmacy Curriculum Outcomes Assessment (PCOA). Reliability was assessed with psychometric analysis comparing student performance over four years. Results. The P3 ASAP Exam showed content validity through representation of didactic courses and professional outcomes. Criterion validity was established by similar scores on the P3 ASAP Exam and the PCOA, evaluated with the Pearson correlation coefficient. Consistent student performance since 2012, evaluated with the Kuder-Richardson coefficient (KR-20), reflected the reliability of the examination. Conclusion. Pharmacy schools can implement internally developed, high-stakes, cumulative progression examinations that are valid and reliable using a robust writing-reviewing process and psychometric analyses. PMID:26941435

  20. Validity and Reliability of the Upper Extremity Work Demands Scale.

    PubMed

    Jacobs, Nora W; Berduszek, Redmar J; Dijkstra, Pieter U; van der Sluis, Corry K

    2017-12-01

    Purpose To evaluate the validity and reliability of the upper extremity work demands (UEWD) scale. Methods Participants from different levels of physical work demands, based on the Dictionary of Occupational Titles categories, were included. A historical database of 74 workers was added for factor analysis. Criterion validity was evaluated by comparing observed and self-reported UEWD scores. To assess structural validity, a factor analysis was executed. For reliability, the difference between two self-reported UEWD scores, the smallest detectable change (SDC), test-retest reliability and internal consistency were determined. Results Fifty-four participants were observed at work and 51 of them filled in the UEWD twice with a mean interval of 16.6 days (SD 3.3, range = 10-25 days). Criterion validity of the UEWD scale was moderate (r = .44, p = .001). Factor analysis revealed that 'force and posture' and 'repetition' subscales could be distinguished, with Cronbach's alpha of .79 and .84, respectively. Reliability was good; there was no significant difference between repeated measurements. An SDC of 5.0 was found. Test-retest reliability was good (intraclass correlation coefficient for agreement = .84) and all item-total correlations were >.30. There were two pairs of highly related items. Conclusion Reliability of the UEWD scale was good, but criterion validity was moderate. Based on current results, a modified UEWD scale (2 items removed, 1 item reworded, divided into 2 subscales) was proposed. Since observation appeared to be an inappropriate gold standard, we advise investigating other types of validity, such as construct validity, in further research.
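
    Internal consistency figures such as the Cronbach's alpha values quoted in this abstract are computed from the item variances and the variance of the total score. The sketch below shows the standard formula in Python; the function name `cronbach_alpha` and the example scores are hypothetical and are not drawn from the UEWD data.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    `items` is a list of per-item score lists; every list holds one
    score per respondent, in the same respondent order.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(items)
    # Total score of each respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(scores) for scores in items)
    return (k / (k - 1)) * (1 - item_variance_sum / variance(totals))


# Hypothetical scores: 3 items answered by 4 respondents.
consistent_items = [
    [1, 2, 3, 4],
    [2, 3, 4, 5],
    [1, 2, 3, 4],
]
alpha = cronbach_alpha(consistent_items)
print(f"alpha = {alpha:.2f}")
```

    In this contrived example the items rise and fall together perfectly, so alpha reaches its maximum of 1; real scales, such as the subscales described above, typically fall below that.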

  1. The Measurement of Executive Function at Age 3 Years: Psychometric Properties and Criterion Validity of a New Battery of Tasks

    ERIC Educational Resources Information Center

    Willoughby, Michael T.; Blair, Clancy B.; Wirth, R. J.; Greenberg, Mark

    2010-01-01

    In this study, the authors examined the psychometric properties and criterion validity of a newly developed battery of tasks that were designed to assess executive function (EF) abilities in early childhood. The battery was included in the 36-month assessment of the Family Life Project (FLP), a prospective longitudinal study of 1,292 children…

  2. The Validity of the Modified Sit-and-Reach Test in College-Age Students.

    ERIC Educational Resources Information Center

    Minkler, Sharin; Patterson, Patricia

    1994-01-01

    Reports a study that examined the criterion-related validity of the modified sit-and-reach test against criterion measures of hamstring and low back flexibility in college students. Results indicated the modified sit-and-reach test moderately related to hamstring flexibility, but its relation to low back flexibility was low. (SM)

  3. Examining the validity of self-reports on scales measuring students' strategic processing.

    PubMed

    Samuelstuen, Marit S; Bråten, Ivar

    2007-06-01

    Self-report inventories trying to measure strategic processing at a global level have been much used in both basic and applied research. However, the validity of global strategy scores is open to question because such inventories assess strategy perceptions outside the context of specific task performance. The primary aim was to examine the criterion-related and construct validity of the global strategy data obtained with the Cross-Curricular Competencies (CCC) scale. Additionally, we wanted to compare the validity of these data with the validity of data obtained with a task-specific self-report inventory focusing on the same types of strategies. The sample included 269 10th-grade students from 12 different junior high schools. Global strategy use as assessed with the CCC was compared with task-specific strategy use reported in three different reading situations. Moreover, relationships between scores on the CCC and scores on measures of text comprehension were examined and compared with relationships between scores on the task-specific strategy measure and the same comprehension measures. The comparison between the CCC strategy scores and the task-specific strategy scores suggested only modest criterion-related validity for the data obtained with the global strategy inventory. The CCC strategy scores were also not related to the text comprehension measures, indicating poor construct validity. In contrast, the task-specific strategy scores were positively related to the comprehension measures, indicating good construct validity. Attempts to measure strategic processing at a global level seem to have limited validity and utility.

  4. Five-level emergency triage systems: variation in assessment of validity.

    PubMed

    Kuriyama, Akira; Urushidani, Seigo; Nakayama, Takeo

    2017-11-01

    Triage systems are scales developed to rate the degree of urgency among patients who arrive at EDs. A number of different scales are in use; however, the way in which they have been validated is inconsistent. Also, it is difficult to define a surrogate that accurately predicts urgency. This systematic review described reference standards and measures used in previous validation studies of five-level triage systems. We searched PubMed, EMBASE and CINAHL to identify studies that had assessed the validity of five-level triage systems and described the reference standards and measures applied in these studies. Studies were divided into those using criterion validity (reference standards developed by expert panels or triage systems already in use) and those using construct validity (prognosis, costs and resource use). A total of 57 studies examined criterion and construct validity of 14 five-level triage systems. Criterion validity was examined by evaluating (1) agreement between the assigned degree of urgency with objective standard criteria (12 studies), (2) overtriage and undertriage (9 studies) and (3) sensitivity and specificity of triage systems (7 studies). Construct validity was examined by looking at (4) the associations between the assigned degree of urgency and measures gauged in EDs (48 studies) and (5) the associations between the assigned degree of urgency and measures gauged after hospitalisation (13 studies). Particularly, among 46 validation studies of the most commonly used triages (Canadian Triage and Acuity Scale, Emergency Severity Index and Manchester Triage System), 13 and 39 studies examined criterion and construct validity, respectively. Previous studies applied various reference standards and measures to validate five-level triage systems. They either created their own reference standard or used a combination of severity/resource measures.

  5. Development and validation of a reading-related assessment battery in Malay for the purpose of dyslexia assessment.

    PubMed

    Lee, Lay Wah

    2008-06-01

    Malay is an alphabetic language with transparent orthography. A Malay reading-related assessment battery which was conceptualised based on the International Dyslexia Association definition of dyslexia was developed and validated for the purpose of dyslexia assessment. The battery consisted of ten tests: Letter Naming, Word Reading, Non-word Reading, Spelling, Passage Reading, Reading Comprehension, Listening Comprehension, Elision, Rapid Letter Naming and Digit Span. Content validity was established by expert judgment. Concurrent validity was obtained using the schools' language tests as criterion. Evidence of predictive and construct validity was obtained through regression analyses and factor analyses. Phonological awareness was the most significant predictor of word-level literacy skills in Malay, with rapid naming making independent secondary contributions. Decoding and listening comprehension made separate contributions to reading comprehension, with decoding as the more prominent predictor. Factor analysis revealed four factors: phonological decoding, phonological naming, comprehension and verbal short-term memory. In conclusion, despite differences in orthography, there are striking similarities in the theoretical constructs of reading-related tasks in Malay and in English.

  6. Detecting Symptom Exaggeration in Combat Veterans Using the MMPI-2 Symptom Validity Scales: A Mixed Group Validation

    ERIC Educational Resources Information Center

    Tolin, David F.; Steenkamp, Maria M.; Marx, Brian P.; Litz, Brett T.

    2010-01-01

    Although validity scales of the Minnesota Multiphasic Personality Inventory-2 (MMPI-2; J. N. Butcher, W. G. Dahlstrom, J. R. Graham, A. Tellegen, & B. Kaemmer, 1989) have proven useful in the detection of symptom exaggeration in criterion-group validation (CGV) studies, usually comparing instructed feigners with known patient groups, the…

  7. Translation and validation of the Cancer-Related Fatigue Scale in Greek in a sample of patients with advanced prostate cancer.

    PubMed

    Charalambous, Andreas; Kaite, Charis; Constantinou, Marianna; Kouta, Christiana

    2016-12-02

    To translate and validate the Cancer-Related Fatigue (CRF) Scale in the Greek language. A cross-sectional descriptive design was used in order to translate and validate the CRF Scale in Greek. Factor analyses were performed to understand the psychometric properties of the scale and to establish construct, criterion and convergent validity. Outpatients' oncology clinics of two public hospitals in Cyprus. 148 patients with advanced prostate cancer undergoing chemotherapy. The Cancer Fatigue Scale (CFS) had good stability (test-retest reliability r=0.79, p<0.001) and good internal consistency (Cronbach's α coefficient for all 15 items α=0.916). Furthermore, the Kaiser-Meyer-Olkin Measure of Sampling Adequacy (KMO value) was found to be 0.743 and considered to be satisfactory (>0.5). The correlations between the CFS physical scale (CFS-FS scale) and the European Organization for Research and Treatment of Cancer (EORTC) QLQ-C30 physical subscales were found to be significant (r=-0.715). The same occurred between CFS cognitive and EORTC cognitive subscale (r=-0.579). Overall, the criterion validity was verified. The same holds for the convergent validity of the CFS, since all correlations with the Global Health Status (q29-q30) were found to be significant. This is the first validation study of the CRF Scale in Greek, and it warrants the scale's use in assessing cancer-related fatigue in patients with prostate cancer. However, further testing and validation is needed in the early stages of the disease and in patients in later chemotherapy cycles.

  8. Concurrent Validity of the TONI-3

    ERIC Educational Resources Information Center

    Banks, Sandra H.; Franzen, Michael D.

    2010-01-01

    The literature pertaining to intelligence assessment reveals an ongoing discussion about the areas of intelligence captured by nonverbal tests. To date, few studies have investigated the criterion validity of the Test of Nonverbal Intelligence, Third Edition (TONI-3). The present study investigates the concurrent validity of the TONI-3 in a sample…

  9. Criterion validity of the International Physical Activity Questionnaire Short Form (IPAQ-SF) for use in patients with rheumatoid arthritis: comparison with the SenseWear Armband.

    PubMed

    Tierney, M; Fraser, A; Kennedy, N

    2015-06-01

    The International Physical Activity Questionnaire Short Form (IPAQ-SF) is a self-report questionnaire commonly used in patients with rheumatoid arthritis (RA) to measure physical activity. However, despite its frequent use in patients with RA, its validity has not been ascertained in this population. The aim of this study was to examine the criterion validity of energy expenditure from physical activity recorded with the IPAQ-SF in patients with RA compared with the objective criterion measure, the SenseWear Armband (SWA) which has been validated previously in this population. Cross-sectional criterion validation study. Regional hospital outpatient setting. Twenty-two patients with RA attending outpatient rheumatology clinics. Subjects wore an SWA for 7 full consecutive days and completed the IPAQ-SF. Energy expenditure from physical activity recorded by the SWA and the IPAQ-SF. Energy expenditure from physical activity recorded by the IPAQ-SF and the SWA showed a small, non-significant correlation (r=0.407, P=0.60). The IPAQ-SF underestimated energy expenditure from physical activity by 41% compared with the SWA. This was corroborated using Bland and Altman plots, as the IPAQ-SF was found to overestimate energy expenditure from physical activity in nine of the 22 individuals, and underestimate energy expenditure from physical activity in the remaining 13 individuals. The IPAQ-SF has limited use as an accurate and absolute measure for estimating energy expenditure from physical activity in patients with RA.
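The Bland and Altman comparison mentioned in the abstract above rests on the mean difference between two methods (the bias) and the 95% limits of agreement around it. A minimal sketch of that calculation follows. The function name and the energy-expenditure values are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the paired differences (a - b); the limits of
    agreement are bias -/+ z times the sample SD of the differences.
    """
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need two equally long series with at least 2 pairs")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    spread = z * stdev(diffs)
    return bias, bias - spread, bias + spread


# Hypothetical energy expenditure values (kcal/day) for five people.
questionnaire = [300.0, 450.0, 520.0, 610.0, 280.0]
armband = [500.0, 430.0, 700.0, 900.0, 350.0]

bias, lower, upper = bland_altman(questionnaire, armband)
# Count how many individuals the questionnaire overestimates.
n_over = sum(q > a for q, a in zip(questionnaire, armband))
print(f"bias = {bias:.1f}, limits of agreement = ({lower:.1f}, {upper:.1f})")
print(f"overestimated in {n_over} of {len(armband)} individuals")
```

A negative bias here means the questionnaire reads lower than the armband on average, even though it can still overestimate for some individuals.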

  10. Achievement Emotions and Achievement Goals in Support of the Convergent, Divergent and Criterion Validity of the Spanish-Cognitive Test Anxiety Scale

    ERIC Educational Resources Information Center

    Sánchez-Rosas, Javier; Furlan, Luis Alberto

    2017-01-01

    Based on the control-value theory of achievement emotions and theory of achievement goals, this research provides evidence of convergent, divergent, and criterion validity of the Spanish Cognitive Test Anxiety Scale (S-CTAS). A sample of Argentinean undergraduates responded to several scales administered at three points. At time 1 and 3, the…

  11. Exercise barriers self-efficacy: development and validation of a subscale for individuals with cancer-related lymphedema.

    PubMed

    Buchan, Jena; Janda, Monika; Box, Robyn; Rogers, Laura; Hayes, Sandi

    2015-03-18

    No tool exists to measure self-efficacy for overcoming lymphedema-related exercise barriers in individuals with cancer-related lymphedema. However, an existing scale measures confidence to overcome general exercise barriers in cancer survivors. Therefore, the purpose of this study was to develop, validate and assess the reliability of a subscale, to be used in conjunction with the general barriers scale, for determining exercise barriers self-efficacy in individuals facing lymphedema-related exercise barriers. A lymphedema-specific exercise barriers self-efficacy subscale was developed and validated using a cohort of 106 cancer survivors with cancer-related lymphedema, from Brisbane, Australia. An initial ten-item lymphedema-specific barrier subscale was developed and tested, with participant feedback and principal components analysis results used to guide development of the final version. Validity and test-retest reliability analyses were conducted on the final subscale. The final lymphedema-specific subscale contained five items. Principal components analysis revealed these items loaded highly (>0.75) on a separate factor when tested with a well-established nine-item general barriers scale. The final five-item subscale demonstrated good construct and criterion validity, high internal consistency (Cronbach's alpha = 0.93) and test-retest reliability (ICC = 0.67, p < 0.01). A valid and reliable lymphedema-specific subscale has been developed to assess exercise barriers self-efficacy in individuals with cancer-related lymphedema. This scale can be used in conjunction with an existing general exercise barriers scale to enhance exercise adherence in this understudied patient group.

  12. Estimating activity energy expenditure: how valid are physical activity questionnaires?

    PubMed

    Neilson, Heather K; Robson, Paula J; Friedenreich, Christine M; Csizmadi, Ilona

    2008-02-01

    Activity energy expenditure (AEE) is the modifiable component of total energy expenditure (TEE) derived from all activities, both volitional and nonvolitional. Because AEE may affect health, there is interest in its estimation in free-living people. Physical activity questionnaires (PAQs) could be a feasible approach to AEE estimation in large populations, but it is unclear whether or not any PAQ is valid for this purpose. Our aim was to explore the validity of existing PAQs for estimating usual AEE in adults, using doubly labeled water (DLW) as a criterion measure. We reviewed 20 publications that described PAQ-to-DLW comparisons, summarized study design factors, and appraised criterion validity using mean differences (AEE(PAQ) - AEE(DLW), or TEE(PAQ) - TEE(DLW)), 95% limits of agreement, and correlation coefficients (AEE(PAQ) versus AEE(DLW) or TEE(PAQ) versus TEE(DLW)). Only 2 of 23 PAQs assessed most types of activity over the past year and indicated acceptable criterion validity, with mean differences (TEE(PAQ) - TEE(DLW)) of 10% and 2% and correlation coefficients of 0.62 and 0.63, respectively. At the group level, neither overreporting nor underreporting was more prevalent across studies. We speculate that, aside from reporting error, discrepancies between PAQ and DLW estimates may be partly attributable to 1) PAQs not including key activities related to AEE, 2) PAQs and DLW ascertaining different time periods, or 3) inaccurate assignment of metabolic equivalents to self-reported activities. Small sample sizes, use of correlation coefficients, and limited information on individual validity were problematic. Future research should address these issues to clarify the true validity of PAQs for estimating AEE.

  13. Considerations Underlying the Use of Mixed Group Validation

    ERIC Educational Resources Information Center

    Jewsbury, Paul A.; Bowden, Stephen C.

    2013-01-01

    Mixed Group Validation (MGV) is an approach for estimating the diagnostic accuracy of tests. MGV is a promising alternative to the more commonly used Known Groups Validation (KGV) approach for estimating diagnostic accuracy. The advantage of MGV lies in the fact that the approach does not require a perfect external validity criterion or gold…

  14. The Missing Middle in Validation Research

    ERIC Educational Resources Information Center

    Taylor, Erwin K.; Griess, Thomas

    1976-01-01

    In most selection validation research, only the upper and lower tails of the criterion distribution are used, often yielding misleading or incorrect results. Provides formulas and tables which enable the researcher to account more accurately for the distribution of criterion within the middle range of population. (Author/RW)

  15. easyCBM Beginning Reading Measures: Grades K-1 Alternate Form Reliability and Criterion Validity with the SAT-10. Technical Report #1403

    ERIC Educational Resources Information Center

    Wray, Kraig; Lai, Cheng-Fei; Sáez, Leilani; Alonzo, Julie; Tindal, Gerald

    2013-01-01

    We report the results of an alternate form reliability and criterion validity study of kindergarten and grade 1 (N = 84-199) reading measures from the easyCBM© assessment system and Stanford Early School Achievement Test/Stanford Achievement Test, 10th edition (SESAT/SAT-10) across 5 time points. The alternate form reliabilities ranged from…

  16. Screening for postdeployment conditions: development and cross-validation of an embedded validity scale in the neurobehavioral symptom inventory.

    PubMed

    Vanderploeg, Rodney D; Cooper, Douglas B; Belanger, Heather G; Donnell, Alison J; Kennedy, Jan E; Hopewell, Clifford A; Scott, Steven G

    2014-01-01

    To develop and cross-validate internal validity scales for the Neurobehavioral Symptom Inventory (NSI). Four existing data sets were used: (1) outpatient clinical traumatic brain injury (TBI)/neurorehabilitation database from a military site (n = 403), (2) National Department of Veterans Affairs TBI evaluation database (n = 48 175), (3) Florida National Guard nonclinical TBI survey database (n = 3098), and (4) a cross-validation outpatient clinical TBI/neurorehabilitation database combined across 2 military medical centers (n = 206). Secondary analysis of existing cohort data to develop (study 1) and cross-validate (study 2) internal validity scales for the NSI. The NSI, Mild Brain Injury Atypical Symptoms, and Personality Assessment Inventory scores. Study 1: Three NSI validity scales were developed, composed of 5 unusual items (Negative Impression Management [NIM5]), 6 low-frequency items (LOW6), and the combination of 10 nonoverlapping items (Validity-10). Cut scores maximizing sensitivity and specificity on these measures were determined, using a Mild Brain Injury Atypical Symptoms score of 8 or more as the criterion for invalidity. Study 2: The same validity scale cut scores again resulted in the highest classification accuracy and optimal balance between sensitivity and specificity in the cross-validation sample, using a Personality Assessment Inventory Negative Impression Management scale with a T score of 75 or higher as the criterion for invalidity. The NSI is widely used in the Department of Defense and Veterans Affairs as a symptom-severity assessment following TBI, but is subject to symptom overreporting or exaggeration. This study developed embedded NSI validity scales to facilitate the detection of invalid response styles. The NSI Validity-10 scale appears to hold considerable promise for validity assessment when the NSI is used as a population-screening tool.
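The abstract above describes choosing cut scores that maximize sensitivity and specificity against an external criterion for invalidity. One common way to operationalize this is Youden's J (sensitivity + specificity - 1). The sketch below assumes that approach; the abstract does not say which optimization rule the authors applied, and the scores and criterion flags are hypothetical.

```python
def sensitivity_specificity(scores, is_invalid, cut):
    """Classify score >= cut as 'invalid' and compare with the criterion."""
    tp = sum(1 for s, y in zip(scores, is_invalid) if y and s >= cut)
    fn = sum(1 for s, y in zip(scores, is_invalid) if y and s < cut)
    tn = sum(1 for s, y in zip(scores, is_invalid) if not y and s < cut)
    fp = sum(1 for s, y in zip(scores, is_invalid) if not y and s >= cut)
    return tp / (tp + fn), tn / (tn + fp)


def best_cut_score(scores, is_invalid):
    """Return (cut, J, sensitivity, specificity) maximizing Youden's J.

    Ties are resolved in favor of the lowest cut score.
    """
    best = None
    for cut in sorted(set(scores)):
        sens, spec = sensitivity_specificity(scores, is_invalid, cut)
        j = sens + spec - 1.0
        if best is None or j > best[1]:
            best = (cut, j, sens, spec)
    return best


# Hypothetical validity-scale scores and criterion classifications.
scores = [2, 5, 8, 12, 15, 21, 20, 22, 25, 14]
is_invalid = [False, False, False, False, False, False, True, True, True, True]

cut, j, sens, spec = best_cut_score(scores, is_invalid)
print(f"cut = {cut}, J = {j:.3f}, sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

A cut score derived this way should be checked in an independent sample, which is the role of the cross-validation step the abstract describes.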

  17. Validity of the Mayer-Salovey-Caruso Emotional Intelligence Test: Youth Version-Research Edition

    ERIC Educational Resources Information Center

    Peters, Christine; Kranzler, John H.; Rossen, Eric

    2009-01-01

    This study examines the criterion-related validity evidence of scores on the Mayer-Salovey-Caruso Emotional Intelligence Test: Youth Version-Research Version. The authors also investigate the relationship between scores on the MSCEIT-YV and chronological age. Results provide initial support for the construct validity of the MSCEIT-YV but also…

  18. Validation of the Intrinsic Spirituality Scale (ISS) with Muslims.

    PubMed

    Hodge, David R; Zidan, Tarek; Husain, Altaf

    2015-12-01

    This study validates an existing spirituality measure--the intrinsic spirituality scale (ISS)--for use with Muslims in the United States. A confirmatory factor analysis was conducted with a diverse sample of self-identified Muslims (N = 281). Validity and reliability were assessed along with criterion and concurrent validity. The measurement model fit the data well, normed χ2 = 2.50, CFI = 0.99, RMSEA = 0.07, and SRMR = 0.02. All 6 items that comprise the ISS demonstrated satisfactory levels of validity (λ > .70) and reliability (R2 > .50). The Cronbach's alpha obtained with the present sample was .93. Appropriate correlations with theoretically linked constructs demonstrated criterion and concurrent validity. The results suggest the ISS is a valid measure of spirituality in clinical settings with the rapidly growing Muslim population. The ISS may, for instance, provide an efficient screening tool to identify Muslims that are particularly likely to benefit from spiritually accommodative treatments.

  19. Validation of the Military Entrance Physical Strength Capacity Test. Technical Report 610.

    ERIC Educational Resources Information Center

    Myers, David C.; And Others

    A battery of physical ability tests was validated using a predictive, criterion-related strategy. The battery was given to 1,003 female soldiers and 980 male soldiers before they had begun Army Basic Training. Criterion measures which represented physical competency in Basic Training (physical proficiency tests, sick call, profiles, and separation…

  20. Reliability and Validity of the Professional Counseling Performance Evaluation

    ERIC Educational Resources Information Center

    Shepherd, J. Brad; Britton, Paula J.; Kress, Victoria E.

    2008-01-01

    The definition and measurement of counsellor trainee competency is an issue that has received increased attention yet lacks quantitative study. This research evaluates item responses, scale reliability and intercorrelations, interrater agreement, and criterion-related validity of the Professional Performance Fitness Evaluation/Professional…

  1. Supervisor Health and Safety Support: Scale Development and Validation

    PubMed Central

    Butts, Marcus M.; Hurst, Carrie S.; Eby, Lillian T.

    2013-01-01

    Executive Summary Two studies were conducted to develop a psychometrically sound measure of supervisor health and safety support (SHSS). We identified three dimensions of supervisor support (physical health, psychological health, safety) and used Study 1 to develop items and establish content validity. Study 2 was used to establish the dimensionality of the new measure and provide criterion-related and discriminant validity evidence of the measure using supervisor and subordinate data. The measure had incremental validity in predicting employee performance and psychological strain outcomes above and beyond general work support variables. Implications of these findings and for workplace support theory and practice are discussed. PMID:24771991

  2. Validity of the Eating Attitudes Test and the Eating Disorders Inventory in Bulimia Nervosa.

    ERIC Educational Resources Information Center

    Gross, Janet; And Others

    1986-01-01

    Assessed criterion and concurrent validity of the Eating Attitudes Test and the Eating Disorder Inventory in 82 women with bulimia nervosa. Both tests demonstrated criterion validity by discriminating bulimia nervosa subjects from normals. Only weak support was found for concurrent validity within bulimia subjects. Recommends combination of…

  3. [Development and Validation of the Academic Resilience Inventory for Nursing Students in Taiwan].

    PubMed

    Li, Cheng-Chieh; Wei, Chi-Fang; Tung, Yuk-Ying

    2017-10-01

    Failure to cope with learning pressures has been shown to influence the learning achievement and professional performance of nursing students. In order to enable nursing students to adapt successfully to their academic stress, it is essential to explore their academic resilience in the process of learning. To develop the Academic Resilience Inventory for Nursing Students (ARINS) and to test its reliability and validity. A total of 611 nursing students in central and southern Taiwan were recruited as participants. We divided the sample into two subsamples randomly using R software. The first sample was used to conduct item analysis and exploratory factor analysis. The other sample was used to conduct confirmatory factor analysis, cross validation, and criterion-related validity. There are 15 items in the ARINS, with cognitive maturity, emotional regulation, and help-seeking behavior used as the measurement indicators of academic resilience in nursing students. The assessed goodness-of-fit index indicates that the model fits the data well based upon the CFA and has good convergent validity and discriminant validity. Criterion-related validity was supported by the correlations among the ARINS, learning performance and attitude, hope and optimism, and depression. The ARINS has good reliability and validity and is a suitable measure of academic resilience in nursing students. It is helpful for nursing students to examine their academic stress and coping efficacy in the learning process.

  4. A treatment schedule of conventional physical therapy provided to enhance upper limb sensorimotor recovery after stroke: expert criterion validity and intra-rater reliability.

    PubMed

    Donaldson, Catherine; Tallis, Raymond C; Pomeroy, Valerie M

    2009-06-01

    Inadequate description of treatment hampers progress in stroke rehabilitation. To develop a valid, reliable, standardised treatment schedule of conventional physical therapy provided for the paretic upper limb after stroke. Eleven neurophysiotherapists participated in the established methodology: semi-structured interviews, focus groups and piloting a draft treatment schedule in clinical practice. Different physiotherapists (n=13) used the treatment schedule to record treatment given to stroke patients with mild, moderate and severe upper limb paresis. Rating of adequacy of the treatment schedule was made using a visual analogue scale (0 to 100mm). Mean (95% confidence interval) visual analogue scores were calculated (expert criterion validity). For intra-rater reliability, each physiotherapist observed a video tape of their treatment and immediately completed a treatment schedule recording form on two separate occasions, 4 to 6 weeks apart. The Kappa statistic was calculated for intra-rater reliability. The treatment schedule consists of a one-page A4 recording form and a user booklet, detailing 50 treatment activities. Expert criterion validity was 79 (95% confidence interval 74 to 84). Intra-rater Kappa was 0.81 (P<0.001). This treatment schedule can be used to document conventional physical therapy in subsequent clinical trials in the geographical area of its development. Further work is needed to investigate generalisability beyond this geographical area.
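The intra-rater reliability in the abstract above is summarized with the Kappa statistic, which corrects observed agreement for the agreement expected by chance. A minimal sketch of Cohen's kappa for two sets of categorical ratings follows. The activity codes are hypothetical, and the abstract does not specify which kappa variant was computed.

```python
from collections import Counter


def cohens_kappa(ratings_1, ratings_2):
    """Return Cohen's kappa for two equally long lists of category labels."""
    if len(ratings_1) != len(ratings_2) or not ratings_1:
        raise ValueError("need two non-empty, equally long rating lists")
    n = len(ratings_1)
    # Proportion of items on which the two occasions agree.
    observed = sum(a == b for a, b in zip(ratings_1, ratings_2)) / n
    # Chance agreement from the marginal category frequencies.
    c1, c2 = Counter(ratings_1), Counter(ratings_2)
    expected = sum(c1[k] * c2[k] for k in set(c1) | set(c2)) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical treatment-activity codes recorded on two occasions.
occasion_1 = ["A", "A", "B", "B", "A", "C", "C", "B"]
occasion_2 = ["A", "A", "B", "A", "A", "C", "B", "B"]

kappa = cohens_kappa(occasion_1, occasion_2)
print(f"kappa = {kappa:.3f}")
```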

  5. Evaluation of the Criterion and Convergent Validity of the Diagnostic Interview for Social and Communication Disorders in Young and Low-Functioning Children

    ERIC Educational Resources Information Center

    Maljaars, Jarymke; Noens, Ilse; Scholte, Evert; van Berckelaer-Onnes, Ina

    2012-01-01

    The Diagnostic Interview for Social and Communication Disorders (DISCO; Wing, 2006) is a standardized, semi-structured and interviewer-based schedule for diagnosis of autism spectrum disorder (ASD). The objective of this study was to evaluate the criterion and convergent validity of the DISCO-11 ICD-10 algorithm in young and low-functioning…

  6. Development and validation of a tool to evaluate the quality of medical education websites in pathology.

    PubMed

    Alyusuf, Raja H; Prasad, Kameshwar; Abdel Satir, Ali M; Abalkhail, Ali A; Arora, Roopa K

    2013-01-01

    The exponential growth in the use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. The aim of this study is to develop and to validate a tool, which evaluates the quality of undergraduate medical educational websites; and apply it to the field of pathology. A tool was devised through several steps of item generation, reduction, weightage, pilot testing, post-pilot modification of the tool and validating the tool. Tool validation included measurement of inter-observer reliability; and generation of criterion related, construct related and content related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Reliability testing showed a high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ =0.75. It showed high criterion related, construct related and content related validity. The tool showed moderately high concordance with the gold standard (κ =0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites.
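Sensitivity, specificity and the positive and negative predictive values quoted in the abstract above all derive from a 2x2 table of tool ratings against the gold standard. The sketch below shows the standard definitions. The cell counts are hypothetical and were not taken from the study, so the resulting percentages differ from those reported.

```python
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Return standard accuracy measures from the four cells of a 2x2 table."""
    return {
        "sensitivity": tp / (tp + fn),  # true positives among gold-standard positives
        "specificity": tn / (tn + fp),  # true negatives among gold-standard negatives
        "ppv": tp / (tp + fp),          # gold-standard positives among tool positives
        "npv": tn / (tn + fn),          # gold-standard negatives among tool negatives
    }


# Hypothetical counts: tool rating versus gold standard for 100 websites.
metrics = diagnostic_metrics(tp=45, fp=15, fn=5, tn=35)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

Unlike sensitivity and specificity, the predictive values depend on how common the positive class is in the sample, so they do not transfer directly to populations with a different prevalence.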

  7. Validity of Various Methods for Determining Velocity, Force, and Power in the Back Squat.

    PubMed

    Banyard, Harry G; Nosaka, Ken; Sato, Kimitake; Haff, G Gregory

    2017-10-01

    To examine the validity of 2 kinematic systems for assessing mean velocity (MV), peak velocity (PV), mean force (MF), peak force (PF), mean power (MP), and peak power (PP) during the full-depth free-weight back squat performed with maximal concentric effort. Ten strength-trained men (26.1 ± 3.0 y, 1.81 ± 0.07 m, 82.0 ± 10.6 kg) performed three 1-repetition-maximum (1RM) trials on 3 separate days, encompassing lifts performed at 6 relative intensities including 20%, 40%, 60%, 80%, 90%, and 100% of 1RM. Each repetition was simultaneously recorded by a PUSH band and commercial linear position transducer (LPT) (GymAware [GYM]) and compared with measurements collected by a laboratory-based testing device consisting of 4 LPTs and a force plate. Trials 2 and 3 were used for validity analyses. Combining all 120 repetitions indicated that the GYM was highly valid for assessing all criterion variables while the PUSH was only highly valid for estimations of PF (r = .94, CV = 5.4%, ES = 0.28, SEE = 135.5 N). At each relative intensity, the GYM was highly valid for assessing all criterion variables except for PP at 20% (ES = 0.81) and 40% (ES = 0.67) of 1RM. Moreover, the PUSH was only able to accurately estimate PF across all relative intensities (r = .92-.98, CV = 4.0-8.3%, ES = 0.04-0.26, SEE = 79.8-213.1 N). PUSH accuracy for determining MV, PV, MF, MP, and PP across all 6 relative intensities was questionable for the back squat, yet the GYM was highly valid at assessing all criterion variables, with some caution given to estimations of MP and PP performed at lighter loads.

  8. Comparative Analysis of the Relative Validity for Subjective Time Rating Scales. Final Report.

    ERIC Educational Resources Information Center

    Carpenter, James B.; And Others

    Since the accuracy and validity of occupational data may vary according to the rating scale format employed, the first phase of the research described in the report employed hypothetical job descriptions from which accurate criterion data could be generated. The second phase of the research required developing an occupational survey instrument…

  9. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  10. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  11. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  12. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  13. 29 CFR 1607.14 - Technical standards for validity studies.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... in the design of the study and their effects identified. (5) Statistical relationships. The degree of...; or such factors should be included in the design of the study and their effects identified. (f... arduous effort involving a series of research studies, which include criterion related validity studies...

  14. Evaluation of Validity and Reliability for Hierarchical Scales Using Latent Variable Modeling

    ERIC Educational Resources Information Center

    Raykov, Tenko; Marcoulides, George A.

    2012-01-01

    A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…

  15. Evaluation of objectivity, reliability and criterion validity of the key indicator method for manual handling operations (KIM-MHO), draft 2007.

    PubMed

    Klußmann, André; Gebhardt, Hansjürgen; Rieger, Monika; Liebers, Falk; Steinberg, Ulf

    2012-01-01

    Upper extremity musculoskeletal symptoms and disorders are common in the working population. The economic and social impact of such disorders is considerable. Long-time, dynamic repetitive exposure of the hand-arm system during manual handling operations (MHO) alone or in combination with static and postural effort are recognised as causes of musculoskeletal symptoms and disorders. The assessment of these manual work tasks is crucial to estimate health risks of exposed employees. For these work tasks, a new method for the assessment of the working conditions was developed and a validation study was performed. The results suggest satisfying criterion validity and moderate objectivity of the KIM-MHO draft 2007. The method was modified and evaluated again. It is planned to release a new version of KIM-MHO in spring 2012.

  16. Visual judgements of steadiness in one-legged stance: reliability and validity.

    PubMed

    Haupstein, T; Goldie, P

    2000-01-01

    There is a paucity of information about the validity and reliability of clinicians' visual judgements of steadiness in one-legged stance. Such judgements are used frequently in clinical practice to support decisions about treatment in the fields of neurology, sports medicine, paediatrics and orthopaedics. The aim of the present study was to address the validity and reliability of visual judgements of steadiness in one-legged stance in a group of physiotherapists. A videotape of 20 five-second performances was shown to 14 physiotherapists with median clinical experience of 6.75 years. Validity of visual judgement was established by correlating scores obtained from an 11-point rating scale with criterion scores obtained from a force platform. In addition, partial correlations were used to control for the potential influence of body weight on the relationship between the visual judgements and criterion scores. Inter-observer reliability was quantified between the physiotherapists; intra-observer reliability was quantified between two tests four weeks apart. Mean criterion-related validity was high, regardless of whether body weight was controlled for statistically (Pearson's r = 0.84, 0.83, respectively). The standard error of estimating the criterion score was 3.3 newtons. Inter-observer reliability was high (ICC (2,1) = 0.81 at Test 1 and 0.82 at Test 2). Intra-observer reliability was high (on average ICC (2,1) = 0.88; Pearson's r = 0.90). The standard error of measurement for the 11-point scale was one unit. The finding of higher accuracy of making visual judgements than previously reported may be due to several aspects of design: use of a criterion score derived from the variability of the force signal which is more discriminating than variability of centre of pressure; use of a discriminating visual rating scale; specificity and clear definition of the phenomenon to be rated.
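    The abstract above correlates visual ratings with force-platform criterion scores and then uses partial correlation to control for body weight. A minimal sketch of that calculation follows, assuming the standard first-order partial correlation formula. The function names and the sample numbers are hypothetical illustrations and are not taken from the study.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_r(x, y, z):
    """Correlation of x and y after removing the linear influence of z."""
    rxy = pearson_r(x, y)
    rxz = pearson_r(x, z)
    ryz = pearson_r(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


if __name__ == "__main__":
    # Hypothetical data: rating-scale scores, criterion scores, body weight (kg).
    ratings = [2, 4, 5, 7, 8, 9]
    criterion = [3.1, 4.0, 5.2, 6.8, 7.9, 9.5]
    weight = [61, 75, 68, 82, 70, 77]
    print("r =", round(pearson_r(ratings, criterion), 3))
    print("partial r =", round(partial_r(ratings, criterion, weight), 3))
```

    When the covariate is uncorrelated with both measures, the partial correlation reduces to the plain Pearson coefficient, which is one way to read the near-identical values reported with and without the body-weight control.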

  17. The Perceived Leadership Communication Questionnaire (PLCQ): Development and Validation.

    PubMed

    Schneider, Frank M; Maier, Michaela; Lovrekovic, Sara; Retzbach, Andrea

    2015-01-01

    The Perceived Leadership Communication Questionnaire (PLCQ) is a short, reliable, and valid instrument for measuring leadership communication from both perspectives of the leader and the follower. Drawing on a communication-based approach to leadership and following a theoretical framework of interpersonal communication processes in organizations, this article describes the development and validation of a one-dimensional 6-item scale in four studies (total N = 604). Results from Study 1 and 2 provide evidence for the internal consistency and factorial validity of the PLCQ's self-rating version (PLCQ-SR), a version for measuring how leaders perceive their own communication with their followers. Results from Study 3 and 4 show internal consistency, construct validity, and criterion validity of the PLCQ's other-rating version (PLCQ-OR), a version for measuring how followers perceive the communication of their leaders. Cronbach's α had an average of .80 over the four studies. All confirmatory factor analyses yielded good to excellent model fit indices. Convergent validity was established by average positive correlations of .69 with subdimensions of transformational leadership and leader-member exchange scales. Furthermore, nonsignificant correlations with socially desirable responding indicated discriminant validity. Last, criterion validity was supported by a moderately positive correlation with job satisfaction (r = .31).
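    Internal consistency in the record above is reported as Cronbach's α. As a rough sketch of how that coefficient is usually computed from item-level responses, the snippet below applies the standard formula α = k/(k − 1) · (1 − Σ item variances / variance of total score). The function name and the response matrix are hypothetical and are not the PLCQ data.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    items = list(zip(*rows))                      # one tuple of scores per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(r) for r in rows])  # variance of the summed scale
    return k / (k - 1) * (1 - item_var_sum / total_var)


if __name__ == "__main__":
    # Hypothetical responses: 5 respondents x 6 items on a 1-5 scale.
    responses = [
        [4, 5, 4, 4, 5, 4],
        [2, 3, 2, 3, 2, 2],
        [5, 5, 4, 5, 5, 5],
        [3, 3, 3, 2, 3, 4],
        [1, 2, 2, 1, 2, 1],
    ]
    print("alpha =", round(cronbach_alpha(responses), 3))
```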

  18. Construct validity of adolescents' self-reported big five personality traits: importance of conceptual breadth and initial validation of a short measure.

    PubMed

    Morizot, Julien

    2014-10-01

    While there are a number of short personality trait measures that have been validated for use with adults, few are specifically validated for use with adolescents. To trust such measures, it must be demonstrated that they have adequate construct validity. According to the view of construct validity as a unifying form of validity requiring the integration of different complementary sources of information, this article reports the evaluation of content, factor, convergent, and criterion validities as well as reliability of adolescents' self-reported personality traits. Moreover, this study sought to address an inherent potential limitation of short personality trait measures, namely their limited conceptual breadth. In this study, starting with items from a known measure, after the language-level was adjusted for use with adolescents, items tapping fundamental primary traits were added to determine the impact of added conceptual breadth on the psychometric properties of the scales. The resulting new measure was named the Big Five Personality Trait Short Questionnaire (BFPTSQ). A group of expert judges considered the items to have adequate content validity. Using data from a community sample of early adolescents, the results confirmed the factor validity of the Big Five structure in adolescence as well as its measurement invariance across genders. More important, the added items did improve the convergent and criterion validities of the scales, but did not negatively affect their reliability. This study supports the construct validity of adolescents' self-reported personality traits and points to the importance of conceptual breadth in short personality measures. © The Author(s) 2014.

  19. Adolescent Domain Screening Inventory-Short Form: Development and Initial Validation

    ERIC Educational Resources Information Center

    Corrigan, Matthew J.

    2017-01-01

    This study sought to develop a short version of the ADSI, and investigate its psychometric properties. Methods: This is a secondary analysis. Analysis to determine the Cronbach's Alpha, correlations to determine concurrent criterion validity and known instrument validity and a logistic regression to determine predictive validity were conducted.…

  20. Development and validation of a tool to evaluate the quality of medical education websites in pathology

    PubMed Central

    Alyusuf, Raja H.; Prasad, Kameshwar; Abdel Satir, Ali M.; Abalkhail, Ali A.; Arora, Roopa K.

    2013-01-01

    Background: The exponential use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. Aim: The aim of this study is to develop and to validate a tool, which evaluates the quality of undergraduate medical educational websites; and apply it to the field of pathology. Methods: A tool was devised through several steps of item generation, reduction, weightage, pilot testing, post-pilot modification of the tool and validating the tool. Tool validation included measurement of inter-observer reliability; and generation of criterion related, construct related and content related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Results and Discussion: Reliability testing showed a high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ = 0.75. It showed high criterion related, construct related and content related validity. The tool showed moderately high concordance with the gold standard (κ = 0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. Conclusion: A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites. PMID:24392243
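    The record above summarizes agreement with a gold standard through sensitivity, specificity, predictive values and Cohen's κ. The following sketch shows how these quantities are conventionally derived from a 2 x 2 table. The function name and the counts are hypothetical; the study's underlying table is not given in the abstract.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures for a binary tool versus a reference standard.

    tp/fp/fn/tn are true-positive, false-positive, false-negative and
    true-negative counts.
    """
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    # Agreement expected by chance, from the row and column marginals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "kappa": (observed - expected) / (1 - expected),
    }


if __name__ == "__main__":
    # Hypothetical counts chosen only to illustrate the arithmetic.
    for name, value in diagnostic_metrics(tp=40, fp=10, fn=10, tn=40).items():
        print(f"{name}: {value:.3f}")
```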

  1. Teachers' Grade Assignment and the Predictive Validity of Criterion-Referenced Grades

    ERIC Educational Resources Information Center

    Thorsen, Cecilia; Cliffordson, Christina

    2012-01-01

    Research has found that grades are the most valid instruments for predicting educational success. Why grades have better predictive validity than, for example, standardized tests is not yet fully understood. One possible explanation is that grades reflect not only subject-specific knowledge and skills but also individual differences in other…

  2. Assessment of the Validity of the Research Diagnostic Criteria for Temporomandibular Disorders: Overview and Methodology

    PubMed Central

    Schiffman, Eric L.; Truelove, Edmond L.; Ohrbach, Richard; Anderson, Gary C.; John, Mike T.; List, Thomas; Look, John O.

    2011-01-01

    AIMS The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. An overview is presented, including Axis I and II methodology and descriptive statistics for the study participant sample. This paper details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. Validity testing for the Axis II biobehavioral instruments was based on previously validated reference standards. METHODS The Axis I reference standards were based on the consensus of 2 criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion exam reliability was also assessed within study sites. RESULTS Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion exam agreement with reference standards was excellent (k ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). CONCLUSION The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods. PMID:20213028

  3. The Servant Leadership Survey: Development and Validation of a Multidimensional Measure.

    PubMed

    van Dierendonck, Dirk; Nuijten, Inge

    2011-09-01

    PURPOSE: The purpose of this paper is to describe the development and validation of a multi-dimensional instrument to measure servant leadership. DESIGN/METHODOLOGY/APPROACH: Based on an extensive literature review and expert judgment, 99 items were formulated. In three steps, using eight samples totaling 1571 persons from The Netherlands and the UK with a diverse occupational background, a combined exploratory and confirmatory factor analysis approach was used. This was followed by an analysis of the criterion-related validity. FINDINGS: The final result is an eight-dimensional measure of 30 items: the eight dimensions being: standing back, forgiveness, courage, empowerment, accountability, authenticity, humility, and stewardship. The internal consistency of the subscales is good. The results show that the Servant Leadership Survey (SLS) has convergent validity with other leadership measures, and also adds unique elements to the leadership field. Evidence for criterion-related validity came from studies relating the eight dimensions to well-being and performance. IMPLICATIONS: With this survey, a valid and reliable instrument to measure the essential elements of servant leadership has been introduced. ORIGINALITY/VALUE: The SLS is the first measure where the underlying factor structure was developed and confirmed across several field studies in two countries. It can be used in future studies to test the underlying premises of servant leadership theory. The SLS provides a clear picture of the key servant leadership qualities and shows where improvements can be made on the individual and organizational level; as such, it may also offer a valuable starting point for training and leadership development.

  4. Validity and reliability of criterion based clinical audit to assess obstetrical quality of care in West Africa.

    PubMed

    Pirkle, Catherine M; Dumont, Alexandre; Traore, Mamadou; Zunzunegui, Maria-Victoria

    2012-10-29

    In Mali and Senegal, over 1% of women die giving birth in hospital. At some hospitals, over a third of infants are stillborn. Many deaths are due to substandard medical practices. Criterion-based clinical audits (CBCA) are increasingly used to measure and improve obstetrical care in resource-limited settings, but their measurement properties have not been formally evaluated. In 2011, we published a systematic review of obstetrical CBCA highlighting insufficient considerations of validity and reliability. The objective of this study is to develop an obstetrical CBCA adapted to the West African context and assess its reliability and validity. This work was conducted as a sub-study within a cluster randomized trial known as QUARITE. Criteria were selected based on extensive literature review and expert opinion. Early 2010, two auditors applied the CBCA to identical samples at 8 sites in Mali and Senegal (n = 185) to evaluate inter-rater reliability. In 2010-11, we conducted CBCA at 32 hospitals to assess construct validity (n = 633 patients). We correlated hospital characteristics (resource availability, facility perinatal and maternal mortality) with mean hospital CBCA scores. We used generalized estimating equations to assess whether patient CBCA scores were associated with perinatal mortality. Results demonstrate substantial (ICC = 0.67, 95% CI 0.54; 0.76) to elevated inter-rater reliability (ICC = 0.84, 95% CI 0.77; 0.89) in Senegal and Mali, respectively. Resource availability positively correlated with mean hospital CBCA scores and maternal and perinatal mortality were inversely correlated with hospital CBCA scores. Poor CBCA scores, adjusted for hospital and patient characteristics, were significantly associated with perinatal mortality (OR 1.84, 95% CI 1.01-3.34). Our CBCA has substantial inter-rater reliability and there is compelling evidence of its validity as the tool performs according to theory. Current Controlled Trials ISRCTN46950658.

  5. Creation and Initial Validation of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale

    PubMed Central

    Steele, Catriona M.; Namasivayam-MacDonald, Ashwini M.; Guida, Brittany T.; Cichero, Julie A.; Duivestein, Janice; Hanson, Ben; Lam, Peter; Riquelme, Luis F.

    2018-01-01

    Objective: To assess consensual validity, interrater reliability, and criterion validity of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale, a new functional outcome scale intended to capture the severity of oropharyngeal dysphagia, as represented by the degree of diet texture restriction recommended for the patient. Design: Participants assigned International Dysphagia Diet Standardisation Initiative Functional Diet Scale scores to 16 clinical cases. Consensual validity was measured against reference scores determined by an author reference panel. Interrater reliability was measured overall and across quartile subsets of the dataset. Criterion validity was evaluated versus Functional Oral Intake Scale (FOIS) scores assigned by survey respondents to the same case scenarios. Feedback was requested regarding ease and likelihood of use. Setting: Web-based survey. Participants: Respondents (N = 170) from 29 countries. Interventions: Not applicable. Main Outcome Measures: Consensual validity (percent agreement and Kendall τ), criterion validity (Spearman rank correlation), and interrater reliability (Kendall concordance and intraclass coefficients). Results: The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed strong consensual validity, criterion validity, and interrater reliability. Scenarios involving liquid-only diets, transition from nonoral feeding, or trial diet advances in therapy showed the poorest consensus, indicating a need for clear instructions on how to score these situations. The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed greater sensitivity than the FOIS to specific changes in diet. Most (>70%) respondents indicated enthusiasm for implementing the International Dysphagia Diet Standardisation Initiative Functional Diet Scale. Conclusions: This initial validation study suggests that the International Dysphagia Diet Standardisation Initiative Functional

  6. Creation and Initial Validation of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale.

    PubMed

    Steele, Catriona M; Namasivayam-MacDonald, Ashwini M; Guida, Brittany T; Cichero, Julie A; Duivestein, Janice; Hanson, Ben; Lam, Peter; Riquelme, Luis F

    2018-05-01

    To assess consensual validity, interrater reliability, and criterion validity of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale, a new functional outcome scale intended to capture the severity of oropharyngeal dysphagia, as represented by the degree of diet texture restriction recommended for the patient. Participants assigned International Dysphagia Diet Standardisation Initiative Functional Diet Scale scores to 16 clinical cases. Consensual validity was measured against reference scores determined by an author reference panel. Interrater reliability was measured overall and across quartile subsets of the dataset. Criterion validity was evaluated versus Functional Oral Intake Scale (FOIS) scores assigned by survey respondents to the same case scenarios. Feedback was requested regarding ease and likelihood of use. Web-based survey. Respondents (N=170) from 29 countries. Not applicable. Consensual validity (percent agreement and Kendall τ), criterion validity (Spearman rank correlation), and interrater reliability (Kendall concordance and intraclass coefficients). The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed strong consensual validity, criterion validity, and interrater reliability. Scenarios involving liquid-only diets, transition from nonoral feeding, or trial diet advances in therapy showed the poorest consensus, indicating a need for clear instructions on how to score these situations. The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed greater sensitivity than the FOIS to specific changes in diet. Most (>70%) respondents indicated enthusiasm for implementing the International Dysphagia Diet Standardisation Initiative Functional Diet Scale. This initial validation study suggests that the International Dysphagia Diet Standardisation Initiative Functional Diet Scale has strong consensual and criterion validity and can be used reliably by clinicians

  7. Using the Multiple-Choice Procedure to Measure the Relative Reinforcing Efficacy of Gambling: Initial Validity Evidence Among College Students.

    PubMed

    Butler, Leon H; Irons, Jessica G; Bassett, Drew T; Correia, Christopher J

    2018-06-01

    The multiple choice procedure (MCP) is used to assess the relative reinforcing value of concurrently available stimuli. The MCP was originally developed to assess the reinforcing value of drugs; the current within-subjects study employed the MCP to assess the reinforcing value of gambling behavior. Participants (N = 323) completed six versions of the MCP that presented hypothetical choices between money to be used while gambling ($10 or $25) versus escalating amounts of guaranteed money available immediately or after delays of either 1 week or 1 month. Results suggest that choices on the MCP are correlated with other measures of gambling behavior, thus providing concurrent validity data for using the MCP to quantify the relative reinforcing value of gambling. The MCP for gambling also displayed sensitivity to reinforcer magnitude and delay effects, which provides evidence of criterion validity. The results are consistent with a behavioral economic model of addiction and suggest that the MCP could be a valid tool for future research on gambling behavior.

  8. The Investigation of ADHD Prevalence in Kindergarten Children in Northeast Iran and a Determination of the Criterion Validity of Conners' Questionnaire via Clinical Interview

    ERIC Educational Resources Information Center

    Abdekhodaie, Zahra; Tabatabaei, Seyed Mahmood; Gholizadeh, Mortaza

    2012-01-01

    In this study, the prevalence of attention-deficit hyperactivity disorder (ADHD) in kindergarten children in northeast Iran was investigated, and the criterion validity of Conners' parent-teacher questionnaire was evaluated through the use of clinical interviews. This study was a cross-sectional descriptive research project with children in…

  9. Assessing the criterion validity of four highly abbreviated measures from the Minimal Assessment of Cognitive Function in Multiple Sclerosis (MACFIMS).

    PubMed

    Gromisch, Elizabeth S; Zemon, Vance; Holtzer, Roee; Chiaravalloti, Nancy D; DeLuca, John; Beier, Meghan; Farrell, Eileen; Snyder, Stacey; Schairer, Laura C; Glukhovsky, Lisa; Botvinick, Jason; Sloan, Jessica; Picone, Mary Ann; Kim, Sonya; Foley, Frederick W

    2016-10-01

    Cognitive dysfunction is prevalent in multiple sclerosis. As self-reported cognitive functioning is unreliable, brief objective screening measures are needed. Utilizing widely used full-length neuropsychological tests, this study aimed to establish the criterion validity of highly abbreviated versions of the Brief Visuospatial Memory Test - Revised (BVMT-R), Symbol Digit Modalities Test (SDMT), Delis-Kaplan Executive Function System (D-KEFS) Sorting Test, and Controlled Oral Word Association Test (COWAT) in order to begin developing an MS-specific screening battery. Participants from Holy Name Medical Center and the Kessler Foundation were administered one or more of these four measures. Using test-specific criterion to identify impairment at both -1.5 and -2.0 SD, receiver-operating-characteristic (ROC) analyses of BVMT-R Trial 1, Trial 2, and Trial 1 + 2 raw data (N = 286) were run to calculate the classification accuracy of the abbreviated version, as well as the sensitivity and specificity. The same methods were used for SDMT 30-s and 60-s (N = 321), D-KEFS Sorting Free Card Sort 1 (N = 120), and COWAT letters F and A (N = 298). Using these definitions of impairment, each analysis yielded high classification accuracy (89.3 to 94.3%). BVMT-R Trial 1, SDMT 30-s, D-KEFS Free Card Sort 1, and COWAT F possess good criterion validity in detecting impairment on their respective overall measure, capturing much of the same information as the full version. Along with the first two trials of the California Verbal Learning Test - Second Edition (CVLT-II), these five highly abbreviated measures may be used to develop a brief screening battery.
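    The record above relies on receiver-operating-characteristic (ROC) analyses to judge how well an abbreviated score separates impaired from unimpaired cases. As an illustrative sketch only, the area under the ROC curve can be computed as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, with ties counted as one half. The function name and scores are hypothetical, and the code assumes higher scores indicate the positive class; for tests where low scores indicate impairment, the scores would be negated first.

```python
def roc_auc(pos_scores, neg_scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form)."""
    wins = 0.0
    for p in pos_scores:
        for q in neg_scores:
            if p > q:
                wins += 1.0      # positive case ranked above negative case
            elif p == q:
                wins += 0.5      # ties contribute half a win
    return wins / (len(pos_scores) * len(neg_scores))


if __name__ == "__main__":
    # Hypothetical raw scores where LOW values indicate impairment,
    # so scores are negated to make "impaired" the higher-scoring class.
    impaired = [3, 4, 4, 6]
    unimpaired = [5, 7, 8, 9, 9]
    auc = roc_auc([-s for s in impaired], [-s for s in unimpaired])
    print("AUC =", round(auc, 3))
```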

  10. Criterion-Related Validity of Two Curriculum-Based Measures of Mathematical Skill in Relation to Reading Comprehension in Secondary Students

    ERIC Educational Resources Information Center

    Anselmo, Giancarlo A.; Yarbrough, Jamie L.; Kovaleski, Joseph F.; Tran, Vi N.

    2017-01-01

    This study analyzed the relationship between benchmark scores from two curriculum-based measurement probes in mathematics (M-CBM) and student performance on a state-mandated high-stakes test. Participants were 298 students enrolled in grades 7 and 8 in a rural southeastern school. Specifically, we calculated the criterion-related and predictive…

  11. Validation of the organizational culture assessment instrument.

    PubMed

    Heritage, Brody; Pollock, Clare; Roberts, Lynne

    2014-01-01

    Organizational culture is a commonly studied area in industrial/organizational psychology due to its important role in workplace behaviour, cognitions, and outcomes. Jung et al.'s [1] review of the psychometric properties of organizational culture measurement instruments noted many instruments have limited validation data despite frequent use in both theoretical and applied situations. The Organizational Culture Assessment Instrument (OCAI) has had conflicting data regarding its psychometric properties, particularly regarding its factor structure. Our study examined the factor structure and criterion validity of the OCAI using robust analysis methods on data gathered from 328 (females = 226, males = 102) Australian employees. Confirmatory factor analysis supported a four factor structure of the OCAI for both ideal and current organizational culture perspectives. Current organizational culture data demonstrated expected reciprocally-opposed relationships between three of the four OCAI factors and the outcome variable of job satisfaction but ideal culture data did not, thus indicating possible weak criterion validity when the OCAI is used to assess ideal culture. Based on the mixed evidence regarding the measure's properties, further examination of the factor structure and broad validity of the measure is encouraged.

  12. Validation of the Organizational Culture Assessment Instrument

    PubMed Central

    Heritage, Brody; Pollock, Clare; Roberts, Lynne

    2014-01-01

    Organizational culture is a commonly studied area in industrial/organizational psychology due to its important role in workplace behaviour, cognitions, and outcomes. Jung et al.'s [1] review of the psychometric properties of organizational culture measurement instruments noted many instruments have limited validation data despite frequent use in both theoretical and applied situations. The Organizational Culture Assessment Instrument (OCAI) has had conflicting data regarding its psychometric properties, particularly regarding its factor structure. Our study examined the factor structure and criterion validity of the OCAI using robust analysis methods on data gathered from 328 (females = 226, males = 102) Australian employees. Confirmatory factor analysis supported a four factor structure of the OCAI for both ideal and current organizational culture perspectives. Current organizational culture data demonstrated expected reciprocally-opposed relationships between three of the four OCAI factors and the outcome variable of job satisfaction but ideal culture data did not, thus indicating possible weak criterion validity when the OCAI is used to assess ideal culture. Based on the mixed evidence regarding the measure's properties, further examination of the factor structure and broad validity of the measure is encouraged. PMID:24667839

  13. Long-Term Impact of Valid Case Criterion on Capturing Population-Level Growth under Item Response Theory Equating. Research Report. ETS RR-17-17

    ERIC Educational Resources Information Center

    Deng, Weiling; Monfils, Lora

    2017-01-01

    Using simulated data, this study examined the impact of different levels of stringency of the valid case inclusion criterion on item response theory (IRT)-based true score equating over 5 years in the context of K-12 assessment when growth in student achievement is expected. Findings indicate that the use of the most stringent inclusion criterion…

  14. Turkish Version of Kolcaba's Immobilization Comfort Questionnaire: A Validity and Reliability Study.

    PubMed

    Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra

    2015-12-01

    The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
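
    As an illustration of the internal-consistency statistic reported above, a minimal sketch of Cronbach's α in Python follows. The function name and the score data are hypothetical and are not taken from the study, which used SPSS; the formula is the standard one, α = k/(k − 1) × (1 − Σ item variances / variance of total scores).

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of per-item score lists.

    items[j][i] is the score of respondent i on item j (hypothetical layout).
    """
    k = len(items)                     # number of items
    n = len(items[0])                  # number of respondents
    # Total score per respondent across all items.
    totals = [sum(item[i] for item in items) for i in range(n)]
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance(totals)
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical scores: 3 items answered by 5 respondents.
scores = [
    [3, 4, 2, 5, 4],
    [2, 4, 3, 5, 3],
    [3, 5, 2, 4, 4],
]
print(round(cronbach_alpha(scores), 3))
```

    Sample variances are used throughout; because α depends only on the ratio of variances, using population variances instead would give the same result.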

  15. Development and preliminary validation of a behavioral task of negative reinforcement underlying risk taking and its relation to problem alcohol use in college freshmen

    PubMed Central

    MacPherson, Laura; Calvin, Nicholas T.; Richards, Jessica M.; Guller, Leila; Mayes, Linda C.; Crowley, Michael J.; Daughters, Stacey B.; Lejuez, C.W.

    2011-01-01

    Background A long line of theoretical and empirical evidence implicates negative reinforcement as a process underlying the etiology and maintenance of risky alcohol use behaviors from adolescence through emerging adulthood. However, the bulk of this literature has relied on self-report measures and there is a notable absence of behavioral modes of assessments of negative reinforcement-based alcohol-related risk-taking. To address this clear gap in the literature, the current study presents the first published data on the reliability and validity of the Maryland Resource for the Behavioral Utilization of the Reinforcement of Negative Stimuli (MRBURNS), which is a modified version of the positive reinforcement-based Balloon Analogue Risk Task (BART). Methods Participants included a convenience sample of 116 college freshmen ever regular drinkers (aged 18–19) who completed both behavioral tasks; self-report measures of negative reinforcement/avoidance constructs and of positive reinforcement/appetitive constructs to examine convergent validity and discriminant validity, respectively; and self-report measures of alcohol use, problems, and motives to examine criterion validity. Results The MRBURNS evidenced sound experimental properties and reliability across task trials. In support of convergent validity, risk taking on the MRBURNS correlated significantly with negative urgency, difficulties in emotion regulation and depressive and anxiety-related symptoms. In support of discriminant validity, performance on the MRBURNS was unrelated to risk taking on the BART, sensation seeking, and trait impulsivity. Finally, pertaining to criterion validity, risk taking on the MRBURNS was related to alcohol-related problems but not heavy episodic alcohol use. Notably, risk taking on the MRBURNS was associated with negative reinforcement-based but not with positive reinforcement-based drinking motives. Conclusions Data from this initial investigation suggest the utility of the

  16. Study to determine the criterion validity of the SenseWear Armband as a measure of physical activity in people with rheumatoid arthritis.

    PubMed

    Tierney, Marie; Fraser, Alexander; Purtill, Helen; Kennedy, Norelee

    2013-06-01

    Measuring physical activity in people with rheumatoid arthritis (RA) is of great importance in light of the increased mortality in this population due to cardiovascular disease. Validation of activity monitors in specific populations is recommended to ensure the accuracy of physical activity measurement. Thus, the purpose of this study was to determine the validity of the SenseWear Pro3 Armband (SWA) as a measure of physical activity during activities of daily living (ADL) in people with RA. Fourteen subjects (8 men and 6 women) with a diagnosis of RA were recruited from rheumatology clinics at the Mid-Western Regional Hospitals, Limerick, Ireland. Participants undertook a series of ADL of varying intensities. The SWA was compared to the criterion measures of the Oxycon Mobile indirect calorimetry system (energy expenditure in kJ) and of manual video observation (step count). Bland and Altman, intraclass correlation coefficient (ICC), and correlation analyses were done using SPSS, version 19.0. The SWA showed substantial agreement (ICC 0.717, P < 0.001) and a strong relationship (Pearson's correlation coefficient = 0.852) compared with the criterion measure when estimating energy expenditure during ADL. However, it was found that the SWA overestimated energy expenditure, particularly at higher intensity levels. The ability of the SWA to estimate step counts during ADL was poor (ICC 0.304, P = 0.038). The SWA can be considered a valid tool to estimate energy expenditure during ADL in the RA population; however, attention should be paid to its tendency to overestimate energy expenditure. Copyright © 2013 by the American College of Rheumatology.
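
    The Bland and Altman analysis mentioned above can be sketched as follows. This is only an illustrative sketch: the function name and the paired readings are hypothetical, and the conventional 1.96 × SD limits of agreement are assumed, since the abstract does not state the exact settings used.

```python
from statistics import mean, stdev


def bland_altman(device, criterion):
    """Return (bias, lower limit, upper limit) of agreement for paired readings."""
    diffs = [d - c for d, c in zip(device, criterion)]
    bias = mean(diffs)                 # mean difference; > 0 means the device reads high
    sd = stdev(diffs)                  # sample SD of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical energy-expenditure readings (kJ) for four activities.
device_kj = [10, 12, 14, 16]
criterion_kj = [9, 10, 13, 14]
bias, lower, upper = bland_altman(device_kj, criterion_kj)
print(bias, lower, upper)
```

    A positive bias in such an analysis corresponds to the kind of overestimation the abstract describes for the armband.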

  17. The Research Diagnostic Criteria for Temporomandibular Disorders. I: overview and methodology for assessment of validity.

    PubMed

    Schiffman, Eric L; Truelove, Edmond L; Ohrbach, Richard; Anderson, Gary C; John, Mike T; List, Thomas; Look, John O

    2010-01-01

    The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. The aim of this article is to provide an overview of the project's methodology, descriptive statistics, and data for the study participant sample. This article also details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. The Axis I reference standards were based on the consensus of two criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion examination reliability was also assessed within study sites. Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion examiner agreement with reference standards was excellent (k ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods.
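
    For readers unfamiliar with the kappa statistic quoted above, the following is a minimal sketch assuming Cohen's kappa for two raters; the abstract does not say which kappa variant was computed. The function name and the diagnoses are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters' categorical labels."""
    n = len(rater_a)
    # Proportion of cases where the two raters give the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    # Agreement expected by chance from each rater's marginal label frequencies.
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    return (observed - expected) / (1 - expected)


# Hypothetical diagnoses (present / absent) from two examiners.
examiner_1 = ["present", "present", "absent", "absent"]
examiner_2 = ["present", "present", "absent", "present"]
print(cohen_kappa(examiner_1, examiner_2))
```

    Kappa equals 1 when the raters agree on every case and 0 when agreement is no better than chance.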

  18. Development and Validation of the Five-by-Five Resilience Scale.

    PubMed

    DeSimone, Justin A; Harms, P D; Vanhove, Adam J; Herian, Mitchel N

    2017-09-01

    This article introduces a new measure of resilience and five related protective factors. The Five-by-Five Resilience Scale (5×5RS) is developed on the basis of theoretical and empirical considerations. Two samples (N = 475 and N = 613) are used to assess the factor structure, reliability, convergent validity, and criterion-related validity of the 5×5RS. Confirmatory factor analysis supports a bifactor model. The 5×5RS demonstrates adequate internal consistency as evidenced by Cronbach's alpha and empirical reliability estimates. The 5×5RS correlates positively with the Connor-Davidson Resilience Scale (CD-RISC), a commonly used measure of resilience. The 5×5RS exhibits similar criterion-related validity to the CD-RISC as evidenced by positive correlations with satisfaction with life, meaning in life, and secure attachment style as well as negative correlations with rumination and anxious or avoidant attachment styles. 5×5RS scores are positively correlated with healthy behaviors such as exercise and negatively correlated with sleep difficulty and symptomology of anxiety and depression. The 5×5RS incrementally explains variance in some criteria above and beyond the CD-RISC. Item responses are modeled using the graded response model. Information estimates demonstrate the ability of the 5×5RS to assess individuals within at least one standard deviation of the mean on relevant latent traits.

  19. A Model for Investigating Predictive Validity at Highly Selective Institutions.

    ERIC Educational Resources Information Center

    Gross, Alan L.; And Others

    A statistical model for investigating predictive validity at highly selective institutions is described. When the selection ratio is small, one must typically deal with a data set containing relatively large amounts of missing data on both criterion and predictor variables. Standard statistical approaches are based on the strong assumption that…

  20. A comparison of the criterion validity of popular measures of narcissism and narcissistic personality disorder via the use of expert ratings.

    PubMed

    Miller, Joshua D; McCain, Jessica; Lynam, Donald R; Few, Lauren R; Gentile, Brittany; MacKillop, James; Campbell, W Keith

    2014-09-01

    The growing interest in the study of narcissism has resulted in the development of a number of assessment instruments that manifest only modest to moderate convergence. The present studies adjudicate among these measures with regard to criterion validity. In the 1st study, we compared multiple narcissism measures to expert consensus ratings of the personality traits associated with narcissistic personality disorder (NPD; Study 1; N = 98 community participants receiving psychological/psychiatric treatment) according to the Diagnostic and Statistical Manual of Mental Disorders (4th ed., text rev.; DSM-IV-TR; American Psychiatric Association, 2000) using 5-factor model traits as well as the traits associated with the pathological trait model according to the Diagnostic and Statistical Manual of Mental Disorders (5th ed.; American Psychiatric Association, 2013). In Study 2 (N = 274 undergraduates), we tested the criterion validity of an even larger set of narcissism instruments by examining their relations with measures of general and pathological personality, as well as psychopathology, and compared the resultant correlations to the correlations expected by experts for measures of grandiose and vulnerable narcissism. Across studies, the grandiose dimensions from the Five-Factor Narcissism Inventory (FFNI; Glover, Miller, Lynam, Crego, & Widiger, 2012) and the Narcissistic Personality Inventory (Raskin & Terry, 1988) provided the strongest match to expert ratings of DSM-IV-TR NPD and grandiose narcissism, whereas the vulnerable dimensions of the FFNI and the Pathological Narcissism Inventory (Pincus et al., 2009), as well as the Hypersensitive Narcissism Scale (Hendin & Cheek, 1997), provided the best match to expert ratings of vulnerable narcissism. These results should help guide researchers toward the selection of narcissism instruments that are most well suited to capturing different aspects of narcissism. 
    PsycINFO Database Record (c) 2014 APA, all rights reserved.

  1. Validation and cross cultural adaptation of the Italian version of the Harris Hip Score.

    PubMed

    Dettoni, Federico; Pellegrino, Pietro; La Russa, Massimo R; Bonasia, Davide E; Blonna, Davide; Bruzzone, Matteo; Castoldi, Filippo; Rossi, Roberto

    2015-01-01

    The Harris Hip Score (HHS) is one of the most widely used health-related quality of life (HRQOL) measures for the assessment of hip pathology; in spite of this, neither a validation study nor an official Italian version has yet been provided. The aim of this study was to create a valid and reliable Italian version of the HHS. The score was translated into Italian and adapted; then 103 patients with different hip pathologies were evaluated using this HHS version and also with the WOMAC and the SF-12 questionnaires. Content, construct and criterion validity were tested, as were interobserver reliability, test-retest reliability and internal consistency. Cross-cultural adaptation was easy, and only minor adaptation was required in the translation process. Construct and criterion validity of the HHS Italian version were confirmed by satisfactory values of Spearman's rho for the correlation between specific domains of the HHS and the WOMAC and SF-12 scores. Interobserver and test-retest reliabilities obtained values of 0.996 and 0.975, respectively; Cronbach's alpha for internal consistency was 0.816. Statistical and clinical analysis showed that the HHS is highly valid and reliable in this new Italian version.
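
    Spearman's rho, used above to relate the translated score to established questionnaires, can be sketched in plain Python as the Pearson correlation of rank-transformed scores (tied values receive their average rank). All function names and the paired scores below are hypothetical and serve only as an illustration.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical paired scores on a new scale and a criterion measure.
new_scale = [55, 62, 70, 81, 90]
criterion = [40, 48, 47, 66, 75]
print(round(spearman_rho(new_scale, criterion), 3))
```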

  2. Validity of smartphone pedometer applications.

    PubMed

    Orr, Krystn; Howe, Holly S; Omran, Janine; Smith, Kristina A; Palmateer, Tess M; Ma, Alvin E; Faulkner, Guy

    2015-11-30

    Given the widespread use of smartphone pedometer applications and the relatively limited number of published validity tests, this study examined the validity of three popular commercial smartphone pedometer applications (i.e., Accupedo, Moves, and Runtastic Pedometer). Convenience samples of males and females were recruited for laboratory tests [n = 11; mean age: 24.18 years (±3.06)] and a free-living test [n = 18; mean age: 28.78 years (±9.52)]. Five conditions were assessed: (a) 20-step test, (b) 40-step stair climbing, (c) treadmill walking and running at different speeds, (d) driving, and (e) 3-day free-living. The Yamax SW-200 pedometer and observed step counts were used as criterion measures. Analyses identified an unacceptable error percentage in all of the applications compared to the pedometer. Given the inaccuracy of these applications, caution is required in their promotion to the public for self-monitoring physical activity and in their use as tools for assessing physical activity in research trials.

  3. Is Echinococcus intermedius a valid species?

    USDA-ARS's Scientific Manuscript database

    Medical and veterinary sciences require scientific names to discriminate pathogenic organisms in our living environment. Various species concepts have been proposed for metazoan animals. There are, however, constant controversies over their validity because of lack of a common criterion to define ...

  4. Math Placement Validation Study: A Summary of the Criterion-Related Validity Evidence and Multiple Measures Data for the San Diego Community College District.

    ERIC Educational Resources Information Center

    Armstrong, William B.

    In Fall 1994, the San Diego Community College District (SDCCD), in California, conducted a study to determine the validity of the Mathematics Diagnostic Testing Project (MDTP) placement test. The MDTP provides tests at four levels (i.e., algebra readiness, elementary algebra, intermediate algebra, and pre-calculus) and is used in the District for…

  5. Psychometric Validation of the Academic Motivation Scale in a Dental Student Sample.

    PubMed

    Orsini, Cesar; Binnie, Vivian; Evans, Phillip; Ledezma, Priscilla; Fuentes, Fernando; Villegas, Maria J

    2015-08-01

    The Academic Motivation Scale is one of the most frequently used instruments to assess academic motivation. It relies on the self-determination theory of human motivation. However, motivation has been understudied in dental education. Therefore, to address the lack of valid instruments to assess academic motivation in dental education and contribute to future research in the field, the aim of this study was to analyze the psychometric properties of this instrument in a sample of dental students. Participants were 989 Chilean undergraduate dental students (86% response rate) who completed a survey containing a Chilean face-valid version of the Spanish Academic Motivation Scale and three other motivation-related instruments to assess the survey's construct and criterion validity. Later, 76 of the students (out of 100 invited) took the survey again to assess its test-retest stability. The instrument's construct validity was supported by the superior goodness of fit of the seven-subscale Academic Motivation Scale over competing models through confirmatory factor analysis and by the expected correlations among its subscales. The concurrent criterion validity was supported by the confirmation of correlations between its subscales and external criteria. Adequate internal consistency and test-retest correlations were also found. The evidence from this study suggests that the Academic Motivation Scale is a preliminarily valid and reliable instrument to assess motivation in the predoctoral dental context. Future research in this area is needed to confirm or refute these results.

  6. Validation of the Rational and Experiential Multimodal Inventory in the Italian Context.

    PubMed

    Monacis, Lucia; de Palo, Valeria; Di Nuovo, Santo; Sinatra, Maria

    2016-08-01

    The unfavorable relations of the Rational and Experiential Inventory Experiential scale with objective criterion measures and its limited content validity led Norris and Epstein to propose a more content-valid measure of the experiential thinking style, the Rational and Experiential Multimodal Inventory (REIm), in order to assess the several facets of a broader experiential system consisting of interrelated components. This study aimed to provide the Italian validation of the inventory by examining its psychometric features, its factor structure (Study 1, N = 545), and its convergent and discriminant validity (Study 2, N = 257). Study 1 supported the 2- and 4-factor solutions, and multi-group analyses confirmed measurement invariance across age and gender for both models. Study 2 provided evidence for both the convergent validity by supporting the theoretical associations among Rational and Experiential Multimodal Inventory scores and similar and related measures, and the discriminant validity by showing associations between the two thinking styles and a different but conceptually related construct, i.e., identity formation. No associations between Rational and Experiential Multimodal Inventory scores and social desirability were found. The Italian version of the Rational and Experiential Multimodal Inventory showed satisfactory psychometric properties, thus confirming its validity. © The Author(s) 2016.

  7. Brazilian validation of the Alberta Infant Motor Scale.

    PubMed

    Valentini, Nadia Cristina; Saccani, Raquel

    2012-03-01

    The Alberta Infant Motor Scale (AIMS) is a well-known motor assessment tool used to identify potential delays in infants' motor development. Although Brazilian researchers and practitioners have used the AIMS in laboratories and clinical settings, its translation to Portuguese and validation for the Brazilian population is yet to be investigated. This study aimed to translate and validate all AIMS items with respect to internal consistency and content, criterion, and construct validity. A cross-sectional and longitudinal design was used. A cross-cultural translation was used to generate a Brazilian-Portuguese version of the AIMS. In addition, a validation process was conducted involving 22 professionals and 766 Brazilian infants (aged 0-18 months). The results demonstrated language clarity and internal consistency for the motor criteria (motor development score, α = .90; prone, α = .85; supine, α = .92; sitting, α = .84; and standing, α = .86). The analysis also revealed high discriminative power to identify typical and atypical development (motor development score, P < .001; percentile, P = .04; classification criterion, χ² = 6.03; P = .05). Temporal stability (P = .07) (rho = .85, P < .001) was observed, and predictive power (P < .001) was limited to the group of infants aged from 3 months to 9 months. Limited predictive validity was observed, which may have been due to the restricted time that the groups were followed longitudinally. In sum, the translated version of AIMS presented adequate validity and reliability.

  8. Assessing health-related quality of life in children with recurrent headache: reliability and validity of the PedsQL™ 4.0 in a pediatric headache sample.

    PubMed

    Connelly, Mark; Rapoff, Michael A

    2006-08-01

    To evaluate the reliability and validity of a commonly used measure of health-related quality of life (HRQOL), the Pediatric Quality of Life Inventory (PedsQL 4.0), in a sample of children with a recurrent headache syndrome. Participants were 40 children aged 7-12 who completed measures of HRQOL, headache-related disability, and headache activity during a baseline period and following a self-directed cognitive-behavioral intervention. The data are supportive of the reliability (internal consistency and test-retest) and validity (criterion related, convergent, known-groups, and responsiveness to intervention) of the PedsQL 4.0 within a pediatric headache sample. We conclude that the PedsQL 4.0 is a reliable and valid measure of HRQOL in children with recurrent headache and captures important information not routinely evaluated in chronic pain populations.

  9. Construction and Initial Validation of the Multiracial Experiences Measure (MEM)

    PubMed Central

    Yoo, Hyung Chol; Jackson, Kelly; Guevarra, Rudy P.; Miller, Matthew J.; Harrington, Blair

    2015-01-01

    This article describes the development and validation of the Multiracial Experiences Measure (MEM): a new measure that assesses uniquely racialized risks and resiliencies experienced by individuals of mixed racial heritage. Across two studies, there was evidence for the validation of the 25-item MEM with 5 subscales including Shifting Expressions, Perceived Racial Ambiguity, Creating Third Space, Multicultural Engagement, and Multiracial Discrimination. The 5-subscale structure of the MEM was supported by a combination of exploratory and confirmatory factor analyses. Evidence of criterion-related validity was partially supported with MEM subscales correlating with measures of racial diversity in one’s social network, color-blind racial attitude, psychological distress, and identity conflict. Evidence of discriminant validity was supported with MEM subscales not correlating with impression management. Implications for future research and suggestions for utilization of the MEM in clinical practice with multiracial adults are discussed. PMID:26460977

  10. Construction and initial validation of the Multiracial Experiences Measure (MEM).

    PubMed

    Yoo, Hyung Chol; Jackson, Kelly F; Guevarra, Rudy P; Miller, Matthew J; Harrington, Blair

    2016-03-01

    This article describes the development and validation of the Multiracial Experiences Measure (MEM): a new measure that assesses uniquely racialized risks and resiliencies experienced by individuals of mixed racial heritage. Across 2 studies, there was evidence for the validation of the 25-item MEM with 5 subscales including Shifting Expressions, Perceived Racial Ambiguity, Creating Third Space, Multicultural Engagement, and Multiracial Discrimination. The 5-subscale structure of the MEM was supported by a combination of exploratory and confirmatory factor analyses. Evidence of criterion-related validity was partially supported with MEM subscales correlating with measures of racial diversity in one's social network, color-blind racial attitude, psychological distress, and identity conflict. Evidence of discriminant validity was supported with MEM subscales not correlating with impression management. Implications for future research and suggestions for utilization of the MEM in clinical practice with multiracial adults are discussed. (c) 2016 APA, all rights reserved.

  11. Validation by simulation of a clinical trial model using the standardized mean and variance criteria.

    PubMed

    Abbas, Ismail; Rovira, Joan; Casanovas, Josep

    2006-12-01

    To develop and validate a model of a clinical trial that evaluates the changes in cholesterol level as a surrogate marker for lipodystrophy in HIV subjects under alternative antiretroviral regimes, i.e., treatment with Protease Inhibitors vs. a combination of nevirapine and other antiretroviral drugs. Five simulation models were developed based on different assumptions about treatment variability and the pattern of cholesterol reduction over time. The endpoints considered are the last recorded cholesterol level, the difference from the baseline, the average difference from the baseline, and level evolution. Specific validation criteria based on a ±10% standardized distance in means and variances were used to compare the real and the simulated data. The validity criterion was met by all models for considered endpoints. However, only two models met the validity criterion when all endpoints were considered. The model based on the assumption that within-subjects variability of cholesterol levels changes over time is the one that minimizes the validity criterion, with a standardized distance within ±1%. Simulation is a useful technique for calibration, estimation, and evaluation of models, which allows us to relax the often overly restrictive assumptions regarding parameters required by analytical approaches. The validity criterion can also be used to select the preferred model for design optimization, until additional data are obtained allowing an external validation of the model.

  12. [Evaluation of Suicide Risk Levels in Hospitals: Validity and Reliability Tests].

    PubMed

    Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen

    2018-05-01

    Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross-sectional study of 163 inpatients. A reliability check was performed by determining the interrater reliability among the senior physician, the therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample, an "acceptable" to "good" interrater reliability (Kendall's W = .77) of suicide risk levels was obtained. Schizophrenic disorders showed the lowest values; for personality disorders we found the highest level of interrater reliability. When examining the criterion validity, Item 9 of the BDI-II was substantially correlated with our suicide risk levels (ρ m = .54, p < .01). Within the scope of the construct validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed from schizophrenic disorders, which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.

  13. Validity, responsiveness, and minimal clinically important difference of EQ-5D-5L in stroke patients undergoing rehabilitation.

    PubMed

    Chen, Poyu; Lin, Keh-Chung; Liing, Rong-Jiuan; Wu, Ching-Yi; Chen, Chia-Ling; Chang, Ku-Chou

    2016-06-01

    To examine the criterion validity, responsiveness, and minimal clinically important difference (MCID) of the EuroQoL 5-Dimensions Questionnaire (EQ-5D-5L) and visual analog scale (EQ-VAS) in people receiving rehabilitation after stroke. The EQ-5D-5L, along with four criterion measures (the Medical Research Council scales for muscle strength, the Fugl-Meyer assessment, the functional independence measure, and the Stroke Impact Scale), was administered to 65 patients with stroke before and after 3- to 4-week therapy. Criterion validity was estimated using the Spearman correlation coefficient. Responsiveness was analyzed by the effect size, standardized response mean (SRM), and criterion responsiveness. The MCID was determined by anchor-based and distribution-based approaches. The percentage of patients exceeding the MCID was also reported. Concurrent validity of the EQ-Index was better compared with the EQ-VAS. The EQ-Index has better power for predicting the rehabilitation outcome in the activities of daily living than other motor-related outcome measures. The EQ-Index was moderately responsive to change (SRM = 0.63), whereas the EQ-VAS was only mildly responsive to change. The MCID estimates for the EQ-Index (with the percentage of patients exceeding the MCID) were 0.10 (33.8%) and 0.10 (33.8%) based on the anchor-based and distribution-based approaches, respectively, and the estimates for the EQ-VAS were 8.61 (41.5%) and 10.82 (32.3%). The EQ-Index has shown reasonable concurrent validity, limited predictive validity, and acceptable responsiveness for detecting the health-related quality of life in stroke patients undergoing rehabilitation, but the EQ-VAS has not. Future research considering different recovery stages after stroke is warranted to validate these estimations.
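
    The standardized response mean (SRM) reported above is conventionally the mean change score divided by the standard deviation of the change scores. A minimal sketch under that assumption follows; the function name and the index values are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def standardized_response_mean(before, after):
    """Mean of the paired change scores divided by their sample SD."""
    changes = [a - b for b, a in zip(before, after)]
    return mean(changes) / stdev(changes)


# Hypothetical index values for four patients before and after therapy.
pre = [0.5, 0.6, 0.4, 0.7]
post = [0.6, 0.8, 0.5, 0.9]
print(round(standardized_response_mean(pre, post), 2))
```

    Because the denominator is the variability of the change itself, a uniform improvement across patients yields a large SRM even when the absolute change is small.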

  14. Toward a Measure of Accountability in Nursing: A Three-Stage Validation Study.

    PubMed

    Drach-Zahavy, Anat; Leonenko, Marina; Srulovici, Einav

    2018-06-04

    To develop and psychometrically evaluate a three-dimensional questionnaire suitable for evaluating personal and organizational accountability in nurses. Accountability is defined as a three-dimensional value, directing professionals to take responsibility for their decisions and actions, to be willing to explain them (transparency) and to be judged according to society's accepted values (answerability). Despite the relatively clear definition, measurement of accountability lags well behind. Existing self-report questionnaires do not fully capture the complexity of the concept; nor do they capture the different sources of accountability (e.g., personal accountability, organizational accountability). A three-stage measure development. Data were collected during 2015-2016. In Phase 1, an initial database of items (N = 74) was developed, based on literature review and qualitative study, establishing face and content validity. In Phase 2, the face, content, construct and criterion-related validity of the initial questionnaires (19 items each for the personal and organizational accountability questionnaires) was established with a sample of 229 nurses. In Phase 3, the final questionnaires (19 items each) were validated with a new sample of 329 nurses, establishing construct validity. The final version of the instruments comprised 19 items, suitable for assessing personal and organizational accountability. The questionnaire referred to the dimensions of responsibility, transparency and answerability. The findings established the instrument's content, construct and criterion-related validity, as well as good internal reliability. The questionnaire portrays accountability in nursing by capturing nurses' subjective perceptions of accountability dimensions (responsibility, transparency, answerability), as demonstrated by personal and organizational values. This article is protected by copyright. All rights reserved.

  15. Development and initial validation of the appropriate antibiotic use self-efficacy scale.

    PubMed

    Hill, Erin M; Watkins, Kaitlin

    2018-06-04

    While there are various medication self-efficacy scales that exist, none assess self-efficacy for appropriate antibiotic use. The Appropriate Antibiotic Use Self-Efficacy Scale (AAUSES) was developed, pilot tested, and its psychometric properties were examined. Following pilot testing of the scale, a 28-item questionnaire was examined using a sample (n = 289) recruited through the Amazon Mechanical Turk platform. Participants also completed other scales and items, which were used in assessing discriminant, convergent, and criterion-related validity. Test-retest reliability was also examined. After examining the scale and removing items that did not assess appropriate antibiotic use, an exploratory factor analysis was conducted on 13 items from the original scale. Three factors were retained that explained 65.51% of the variance. The scale and its subscales had adequate internal consistency. The scale had excellent test-retest reliability, as well as demonstrated convergent, discriminant, and criterion-related validity. The AAUSES is a valid and reliable scale that assesses three domains of appropriate antibiotic use self-efficacy. The AAUSES may have utility in clinical and research settings in understanding individuals' beliefs about appropriate antibiotic use and related behavioral correlates. Future research is needed to examine the scale's utility in these settings. Copyright © 2018 Elsevier B.V. All rights reserved.

  16. Quantitative model validation of manipulative robot systems

    NASA Astrophysics Data System (ADS)

    Kartowisastro, Iman Herwidiana

    This thesis is concerned with applying the distortion quantitative validation technique to a robot manipulative system with revolute joints. When the distortion technique is used to validate a model quantitatively, the model parameter uncertainties are taken into account in assessing the faithfulness of the model, and this approach is relatively more objective than the commonly used visual comparison method. The industrial robot is represented by the TQ MA2000 robot arm. Details of the mathematical derivation of the distortion technique are given, explaining the required distortion of the constant parameters within the model and the assessment of model adequacy. Due to the complexity of a robot model, only the first three degrees of freedom are considered, and all links are assumed rigid. The modelling involves the Newton-Euler approach to obtain the dynamics model, and the Denavit-Hartenberg convention is used throughout the work. The conventional feedback control system is used in developing the model. The system's response to parameter changes is investigated, as some parameters are redundant. This work is important so that the most important parameters to be distorted can be selected, and this leads to a new term called the fundamental parameters. The transfer function approach has been chosen to validate an industrial robot quantitatively against the measured data due to its practicality. Initially, the assessment of the model fidelity criterion indicated that the model was not capable of explaining the transient record in terms of the model parameter uncertainties. Further investigations led to significant improvements of the model and better understanding of the model properties. After several improvements in the model, the fidelity criterion obtained was almost satisfied. Although the fidelity criterion is slightly less than unity, it has been shown that the distortion technique can be applied in a robot manipulative system. Using the validated model, the importance of…

  17. The Validity of the Musical Aptitude Profile for Predicting Grades in Freshman Music Theory.

    ERIC Educational Resources Information Center

    Harrison, Carole S.

    1987-01-01

    This study investigated the criterion-related validity of the Musical Aptitude Profile in relation to achievement in freshman music theory as determined by semester grades in the courses and by grades in three course components (paperwork, sight-singing and ear-training). (Author/BS)

  18. Validation of the Chinese Version of the Quality of Nursing Work Life Scale

    PubMed Central

    Fu, Xia; Xu, Jiajia; Song, Li; Li, Hua; Wang, Jing; Wu, Xiaohua; Hu, Yani; Wei, Lijun; Gao, Lingling; Wang, Qiyi; Lin, Zhanyi; Huang, Huigen

    2015-01-01

    Quality of Nursing Work Life (QNWL) serves as a predictor of a nurse’s intent to leave and hospital nurse turnover. However, QNWL measurement tools that have been validated for use in China are lacking. The present study evaluated the construct validity of the QNWL scale in China. A cross-sectional study using convenience sampling was conducted from June 2012 to January 2013 at five hospitals in Guangzhou, which employ 1938 nurses. The participants were asked to complete the QNWL scale and the World Health Organization Quality of Life abbreviated version (WHOQOL-BREF). A total of 1922 nurses provided the final data used for analyses. Sixty-five nurses from the first investigated division were re-measured two weeks later to assess the test-retest reliability of the scale. The internal consistency reliability of the QNWL scale was assessed using Cronbach’s α. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC). Criterion-related validity was assessed using the correlation of the total scores of the QNWL and the WHOQOL-BREF. Construct validity was assessed with the following indices: χ2 statistics and degrees of freedom; root mean square error of approximation (RMSEA); the Akaike information criterion (AIC); the consistent Akaike information criterion (CAIC); the goodness-of-fit index (GFI); the adjusted goodness of fit index; and the comparative fit index (CFI). The findings demonstrated high internal consistency (Cronbach’s α = 0.912) and test-retest reliability (intra-class correlation coefficient = 0.74) for the QNWL scale. The chi-square test (χ2 = 13879.60, df [degrees of freedom] = 813, P = 0.0001) was significant. The RMSEA value was 0.091, and AIC = 1806.00, CAIC = 7730.69, CFI = 0.93, and GFI = 0.74. The correlation coefficient between the QNWL total scores and the WHOQOL-BREF total scores was 0.605 (p<0.01). The QNWL scale was reliable and valid in Chinese-speaking nurses and could be used as a clinical and research…

  19. Criterion validity of the Wechsler Intelligence Scale for Children-Fourth Edition after pediatric traumatic brain injury.

    PubMed

    Donders, Jacobus; Janke, Kelly

    2008-07-01

    The performance of 40 children with complicated mild to severe traumatic brain injury on the Wechsler Intelligence Scale for Children-Fourth Edition (WISC-IV; Wechsler, 2003) was compared with that of 40 demographically matched healthy controls. Of the four WISC-IV factor index scores, only Processing Speed yielded a statistically significant group difference (p < .001) as well as a statistically significant negative correlation with length of coma (p < .01). Logistic regression, using Processing Speed to classify individual children, yielded a sensitivity of 72.50% and a specificity of 62.50%, with false positive and false negative rates both exceeding 30%. We conclude that Processing Speed has acceptable criterion validity in the evaluation of children with complicated mild to severe traumatic brain injury but that the WISC-IV should be supplemented with other measures to assure sufficient accuracy in the diagnostic process.
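
    The classification figures in this abstract can be unpacked with a short sketch. The counts below are not stated in the abstract; they are the whole numbers implied by the reported sensitivity (72.50%) and specificity (62.50%) with 40 children per group. Reading the "false positive and false negative rates" as the share of wrong calls among positive and among negative classifications is an assumption of this sketch, chosen because it is a reading under which both exceed 30%.

```python
def classification_summary(tp, fn, tn, fp):
    """Return common accuracy measures from a 2x2 classification table."""
    return {
        "sensitivity": tp / (tp + fn),                # injured children correctly flagged
        "specificity": tn / (tn + fp),                # controls correctly cleared
        "false_pos_among_pos_calls": fp / (tp + fp),  # 1 - positive predictive value
        "false_neg_among_neg_calls": fn / (tn + fn),  # 1 - negative predictive value
    }

# Counts implied by 72.5% of 40 injured children and 62.5% of 40 controls
# (derived for illustration, not reported in the abstract).
summary = classification_summary(tp=29, fn=11, tn=25, fp=15)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

    Under these assumed counts, roughly one third of positive calls and slightly under a third of negative calls would be wrong, which is in line with the authors' recommendation to supplement the WISC-IV with other measures.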

  20. Reliability and validity of the Bowel Function Index for evaluating opioid-induced constipation: translation, cultural adaptation and validation of the Portuguese version (BFI-P).

    PubMed

    Dueñas, María; Mendonça, Liliane; Sampaio, Rute; Gouvinhas, Cláudia; Oliveira, Daniela; Castro-Lopes, José Manuel; Azevedo, Luís Filipe

    2017-03-01

    The Bowel Function Index (BFI) is a simple and sound bowel function and opioid-induced constipation (OIC) screening tool. We aimed to develop the translation and cultural adaptation of this measure (BFI-P) and to assess its reliability and validity for the Portuguese language and a chronic pain population. The BFI-P was created after a process including translation, back translation and cultural adaptation. Participants (n = 226) were recruited in a chronic pain clinic and were assessed at baseline and after one week. Internal consistency, test-retest reliability, responsiveness, construct (convergent and known groups) and factorial validity were assessed. Test-retest reliability had an intra-class correlation of 0.605 for BFI mean score. Internal consistency of BFI had Cronbach's alpha of 0.865. The construct validity of BFI-P was shown to be excellent and the exploratory factor analysis confirmed its unidimensional structure. The responsiveness of BFI-P was excellent, with a suggested 17-19 point and 8-12 point change in score constituting a clinically relevant change in constipation for patients with and without previous constipation, respectively. This study had some limitations, namely, the criterion validity of BFI-P was not directly assessed; and the absence of a direct criterion for OIC precluded the assessment of the criterion based responsiveness of BFI-P. Nevertheless, BFI may importantly contribute to better OIC screening and its Portuguese version (BFI-P) has been shown to have excellent reliability, internal consistency, validity and responsiveness. Further suggestions regarding statistically and clinically important change cut-offs for this instrument are presented.
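
    Internal consistency in records like this one is summarized by Cronbach's alpha. The sketch below shows the standard formula on a small hypothetical response matrix; the data and the function name `cronbach_alpha` are illustrative assumptions and have nothing to do with the BFI-P sample.

```python
from statistics import pvariance

def cronbach_alpha(rows):
    """Cronbach's alpha; rows holds one list of item scores per respondent."""
    k = len(rows[0])                                   # number of items
    item_columns = list(zip(*rows))                    # transpose to per-item scores
    item_var_sum = sum(pvariance(col) for col in item_columns)
    total_var = pvariance([sum(row) for row in rows])  # variance of total scores
    return k / (k - 1) * (1 - item_var_sum / total_var)

# Hypothetical responses: five respondents, three items.
responses = [
    [1, 2, 2],
    [2, 3, 3],
    [3, 3, 4],
    [4, 5, 4],
    [5, 5, 5],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.3f}")
```

    The same variance estimator is used for the items and for the total score, so the choice between population and sample variance does not change the result.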

  1. Reliability and validity of a combat exposure index for Vietnam era veterans.

    PubMed

    Janes, G R; Goldberg, J; Eisen, S A; True, W R

    1991-01-01

    The reliability and validity of a self-report measure of combat exposure are examined in a cohort of male-male twin pairs who served in the military during the Vietnam era. Test-retest reliability for a five-level ordinal index of combat exposure is assessed by use of 192 duplicate sets of responses. The chance-corrected proportion in agreement (as measured by the kappa coefficient) is .84. As a measure of criterion-related validity, the combat index is correlated with the award of combat-related military medals ascertained from the military records. The probability of receiving a Purple Heart, Bronze Star, Commendation Medal and Combat Infantry Badge is associated strongly with the combat exposure index. These results show that this simple index is a reliable and valid measure of combat exposure.
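
    The chance-corrected agreement mentioned above can be illustrated with an unweighted Cohen's kappa. This is a minimal sketch on hypothetical duplicate responses to a five-level index; the abstract does not say whether a weighted variant was used for the ordinal scale, and the data here are invented for illustration.

```python
from collections import Counter

def cohen_kappa(first, second):
    """Unweighted Cohen's kappa for two sets of categorical responses."""
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    counts_first, counts_second = Counter(first), Counter(second)
    # Agreement expected by chance from the marginal category frequencies.
    expected = sum(counts_first[c] * counts_second[c] for c in counts_first) / (n * n)
    return (observed - expected) / (1 - expected)

# Hypothetical test and retest answers on a 0-4 combat exposure index.
test_answers   = [0, 0, 1, 1, 2, 2, 3, 3, 4, 4]
retest_answers = [0, 0, 1, 1, 2, 2, 3, 3, 4, 3]
kappa = cohen_kappa(test_answers, retest_answers)
print(f"kappa = {kappa:.3f}")
```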

  2. The Dula dangerous driving index in China: an investigation of reliability and validity.

    PubMed

    Qu, Weina; Ge, Yan; Jiang, Caihong; Du, Feng; Zhang, Kan

    2014-03-01

    The aim of this study was to translate the Dula Dangerous Driving Index (DDDI) into Chinese and to verify its reliability and validity. A total of 246 drivers completed the Chinese version of the DDDI and the Driver Behavior Questionnaire (DBQ). Specific sociodemographic variables and traffic violations were also measured. A confirmatory factor analysis confirmed the internal structure of the DDDI, and the four-factor model was supported in China. Measures of convergent and criterion validity demonstrated that the Chinese DDDI was valid. Its convergent validity was supported by its positive relationship with the DBQ, and its criterion validity was tested using its relationship with self-reported accident involvement and traffic violations. Finally, score comparisons between different demographic groups revealed significant differences, thereby linking age and driving years to dangerous driving. Copyright © 2013 Elsevier Ltd. All rights reserved.

  3. The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

    PubMed

    Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

    2017-10-23

    Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (p < 0.05), suggesting a good relationship between the two core stability measures. Test-retest reliability was (ICC3,3) = 0.953 (p < 0.05), indicating excellent consistency between the repeated DNS-HS measurements. Criterion validity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggest that the DNS-HS test was a reliable test of core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.

  4. Continual Response Measurement: Design and Validation.

    ERIC Educational Resources Information Center

    Baggaley, Jon

    1987-01-01

    Discusses reliability and validity of continual response measurement (CRM), a computer-based measurement technique, and its use in social science research. Highlights include the importance of criterion-referencing the data, guidelines for designing studies using CRM, examples typifying their deductive and inductive functions, and a discussion of…

  5. Criterion validity of a competency-based assessment center in medical education--a 4-year follow-up study.

    PubMed

    Rotthoff, Thomas; Ostapczuk, Martin S; Kröncke, Klaus D; Zimmerhofer, Alexander; Decking, Ulrich; Schneider, Matthias; Ritz-Timme, Stefanie

    2014-01-01

    Core competencies have progressively gained importance in medical education. In other contexts, especially personnel selection and development, assessment centers (ACs) are used to assess competencies, but there is only a limited number of studies on competency-based ACs in medical education. To the best of our knowledge, the present study provides the first data on the criterion-related validity of a competency-based AC in medical education. We developed an AC tailored to measure core competencies relevant to medical education (social-ethical, communicative, self, and teaching) and tested its validity in n=30 first-year medical students using 3- to 4-year follow-up measures such as (a) objective structured clinical examinations (OSCE) on basic clinical skills (n=26), (b) OSCE on communication skills (n=21), and (c) peer feedback (n=18). The AC contained three elements: interview, group discussion, and role play. Additionally, a self-report questionnaire was provided as a basis for the interview. Baseline AC average score and teaching competency correlated moderately with the communication OSCE average score (r=0.41, p=0.03, and r=0.38, p=0.04, respectively). Social-ethical competency in the AC showed a very strong convergent association with the communication OSCE average score (r=0.60, p<0.01). The AC total score also showed a moderate correlation with the overall peer feedback score provided in Year 4 (r=0.38, p=0.06). In addition, communicative competency correlated strongly with the overall peer feedback (r=0.50, p=0.02). We found predominantly low and insignificant correlations between the AC and the OSCE on basic clinical skills (r=-0.33 to 0.30, all p's>0.05). The results showed that competency-based ACs can be used at a very early stage of medical training to successfully predict future performance in core competencies.

  6. Criterion validity of a competency-based assessment center in medical education - a 4-year follow-up study.

    PubMed

    Rotthoff, Thomas; Ostapczuk, Martin S; Kröncke, Klaus D; Zimmerhofer, Alexander; Decking, Ulrich; Schneider, Matthias; Ritz-Timme, Stefanie

    2014-01-01

    Introduction: Core competencies have progressively gained importance in medical education. In other contexts, especially personnel selection and development, assessment centers (ACs) are used to assess competencies, but there is only a limited number of studies on competency-based ACs in medical education. To the best of our knowledge, the present study provides the first data on the criterion-related validity of a competency-based AC in medical education. Methods: We developed an AC tailored to measure core competencies relevant to medical education (social-ethical, communicative, self, and teaching) and tested its validity in n=30 first-year medical students using 3- to 4-year follow-up measures such as (a) objective structured clinical examinations (OSCE) on basic clinical skills (n=26), (b) OSCE on communication skills (n=21), and (c) peer feedback (n=18). The AC contained three elements: interview, group discussion, and role play. Additionally, a self-report questionnaire was provided as a basis for the interview. Results: Baseline AC average score and teaching competency correlated moderately with the communication OSCE average score (r=0.41, p=0.03, and r=0.38, p=0.04, respectively). Social-ethical competency in the AC showed a very strong convergent association with the communication OSCE average score (r=0.60, p<0.01). The AC total score also showed a moderate correlation with the overall peer feedback score provided in Year 4 (r=0.38, p=0.06). In addition, communicative competency correlated strongly with the overall peer feedback (r=0.50, p=0.02). We found predominantly low and insignificant correlations between the AC and the OSCE on basic clinical skills (r=-0.33 to 0.30, all p's>0.05). Conclusion: The results showed that competency-based ACs can be used at a very early stage of medical training to successfully predict future performance in core competencies.

  7. [Spanish validation of Game Addiction Scale for Adolescents (GASA)].

    PubMed

    Lloret Irles, Daniel; Morell Gomis, Ramon; Marzo Campos, Juan Carlos; Tirado González, Sonia

    The aim of this study is to adapt and validate the Game Addiction Scale for Adolescents (GASA) to the Spanish youth population. Cultural adaptation and validation study. Secondary Education centres. Two independent studies were conducted on a group of 466 young people with a mean age of 15.27 years (13-18, SD: 1.83) and 48.7% ♀ and on another group of 566, with a mean age of 21.24 years (19-26; SD: 1.86) 44.1% ♀. Addiction to video games (GASA); Game behavior (Game habits usage questionnaire), Impulsiveness (Plutchik Impulsiveness Scale) and Group Pressure (Ad hoc questionnaire). The Spanish version of GASA has shown good reliability and a factor structure true to the original scale. As regards criterion validity, GASA scores are significantly different according to four criteria related to problem gambling: Game intensity and frequency, impulsiveness, and peer pressure. The results show that the adapted version of GASA is an adequate and valid tool for assessing problematic gaming behaviour. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.

  8. Anxiety measures validated in perinatal populations: a systematic review.

    PubMed

    Meades, Rose; Ayers, Susan

    2011-09-01

    Research and screening of anxiety in the perinatal period is hampered by a lack of psychometric data on self-report anxiety measures used in perinatal populations. This paper aimed to review self-report measures that have been validated with perinatal women. A systematic search was carried out of four electronic databases. Additional papers were obtained through searching identified articles. Thirty studies were identified that reported validation of an anxiety measure with perinatal women. Most commonly validated self-report measures were the General Health Questionnaire (GHQ), State-Trait Anxiety Inventory (STAI), and Hospital Anxiety and Depression Scales (HADS). Of the 30 studies included, 11 used a clinical interview to provide criterion validity. Remaining studies reported one or more other forms of validity (factorial, discriminant, concurrent and predictive) or reliability. The STAI shows criterion, discriminant and predictive validity and may be most useful for research purposes as a specific measure of anxiety. The Kessler 10 (K-10) may be the best short screening measure due to its ability to differentiate anxiety disorders. The Depression Anxiety Stress Scales 21 (DASS-21) measures multiple types of distress, shows appropriate content, and remains to be validated against clinical interview in perinatal populations. Nineteen studies did not report sensitivity or specificity data. The early stages of research into perinatal anxiety, the multitude of measures in use, and methodological differences restrict comparison of measures across studies. There is a need for further validation of self-report measures of anxiety in the perinatal period to enable accurate screening and detection of anxiety symptoms and disorders. Copyright © 2010 Elsevier B.V. All rights reserved.

  9. An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

    PubMed

    Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

    2014-05-01

    Recent guidelines advocate sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores were determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.

  10. Eating Disorder Diagnostic Scale: Additional Evidence of Reliability and Validity

    ERIC Educational Resources Information Center

    Stice, Eric; Fisher, Melissa; Martinez, Erin

    2004-01-01

    The authors conducted 4 studies investigating the reliability and validity of the Eating Disorder Diagnostic Scale (EDDS; E. Stice, C. F. Telch, & S. L. Rizvi, 2000), a brief self-report measure for diagnosing anorexia nervosa, bulimia nervosa, and binge eating disorder. Study 1 found that the EDDS showed criterion validity with interview-based…

  11. Development and Initial Validation of the Multicultural Personality Inventory (MPI).

    PubMed

    Ponterotto, Joseph G; Fietzer, Alexander W; Fingerhut, Esther C; Woerner, Scott; Stack, Lauren; Magaldi-Dopman, Danielle; Rust, Jonathan; Nakao, Gen; Tsai, Yu-Ting; Black, Natasha; Alba, Renaldo; Desai, Miraj; Frazier, Chantel; LaRue, Alyse; Liao, Pei-Wen

    2014-01-01

    Two studies summarize the development and initial validation of the Multicultural Personality Inventory (MPI). In Study 1, the 115-item prototype MPI was administered to 415 university students where exploratory factor analysis resulted in a 70-item, 7-factor model. In Study 2, the 70-item MPI and theoretically related companion instruments were administered to a multisite sample of 576 university students. Confirmatory factor analysis found the 7-factor structure to be a relatively good fit to the data (Comparative Fit Index = .954; root mean square error of approximation = .057), and MPI factors predicted variance in criterion variables above and beyond the variance accounted for by broad personality traits (i.e., Big Five). Study limitations and directions for further validation research are specified.

  12. Validity of proposed DSM-5 diagnostic criteria for nicotine use disorder: results from 734 Israeli lifetime smokers

    PubMed Central

    Shmulewitz, D.; Wall, M.M.; Aharonovich, E.; Spivak, B.; Weizman, A.; Frisch, A.; Grant, B. F.; Hasin, D.

    2013-01-01

    Background: The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) proposes aligning nicotine use disorder (NUD) criteria with those for other substances, by including the current DSM fourth edition (DSM-IV) nicotine dependence (ND) criteria, three abuse criteria (neglect roles, hazardous use, interpersonal problems) and craving. Although NUD criteria indicate one latent trait, evidence is lacking on: (1) validity of each criterion; (2) validity of the criteria as a set; (3) comparative validity between DSM-5 NUD and DSM-IV ND criterion sets; and (4) NUD prevalence. Method: Nicotine criteria (DSM-IV ND, abuse and craving) and external validators (e.g. smoking soon after awakening, number of cigarettes per day) were assessed with a structured interview in 734 lifetime smokers from an Israeli household sample. Regression analysis evaluated the association between validators and each criterion. Receiver operating characteristic analysis assessed the association of the validators with the DSM-5 NUD set (number of criteria endorsed) and tested whether DSM-5 or DSM-IV provided the most discriminating criterion set. Changes in prevalence were examined. Results: Each DSM-5 NUD criterion was significantly associated with the validators, with strength of associations similar across the criteria. As a set, DSM-5 criteria were significantly associated with the validators, were significantly more discriminating than DSM-IV ND criteria, and led to increased prevalence of binary NUD (two or more criteria) over ND. Conclusions: All findings address previous concerns about the DSM-IV nicotine diagnosis and its criteria and support the proposed changes for DSM-5 NUD, which should result in improved diagnosis of nicotine disorders. PMID:23312475

  13. Test Anxiety and the Validity of Cognitive Tests: A Confirmatory Factor Analysis Perspective and Some Empirical Findings

    ERIC Educational Resources Information Center

    Wicherts, Jelte M.; Scholten, Annemarie Zand

    2010-01-01

    The validity of cognitive ability tests is often interpreted solely as a function of the cognitive abilities that these tests are supposed to measure, but other factors may be at play. The effects of test anxiety on the criterion related validity (CRV) of tests was the topic of a recent study by Reeve, Heggestad, and Lievens (2009) (Reeve, C. L.,…

  14. Criterion for evaluating the predictive ability of nonlinear regression models without cross-validation.

    PubMed

    Kaneko, Hiromasa; Funatsu, Kimito

    2013-09-23

    We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regression models are updated, whereas cross-validation cannot be performed in such a situation. The proposed method is effective and helpful in handling big data when cross-validation cannot be applied. By analyzing data from numerical simulations and quantitative structural relationships, we confirm that the proposed criteria enable the predictive ability of the nonlinear regression models to be appropriately quantified.
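
    The midpoint-based criteria described above can be sketched as follows. The abstract does not say how the reference response at a midpoint is obtained; this sketch assumes it is the mean of the two endpoint responses. The helper names (`knn_midpoints`, `r2_and_rmse`), the toy data, and the stand-in model are all hypothetical, so this is an illustration of the idea rather than the authors' implementation.

```python
import math

def knn_midpoints(X, y, k):
    """Midpoints between each sample and its k nearest neighbours.

    The reference response at a midpoint is assumed to be the mean of
    the two endpoint responses (an assumption of this sketch).
    """
    mids, targets, seen = [], [], set()
    for i, xi in enumerate(X):
        neighbours = sorted((j for j in range(len(X)) if j != i),
                            key=lambda j: math.dist(xi, X[j]))
        for j in neighbours[:k]:
            pair = (min(i, j), max(i, j))
            if pair in seen:          # each pair contributes one midpoint
                continue
            seen.add(pair)
            mids.append([(a + b) / 2 for a, b in zip(xi, X[j])])
            targets.append((y[i] + y[j]) / 2)
    return mids, targets

def r2_and_rmse(actual, predicted):
    """Determination coefficient and root-mean-square error."""
    mean = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean) ** 2 for a in actual)
    return 1 - ss_res / ss_tot, math.sqrt(ss_res / len(actual))

# Toy data; the lambda stands in for an already fitted nonlinear model.
X = [[0.0], [1.0], [2.0], [3.0], [4.0]]
y = [x[0] ** 2 for x in X]
model = lambda x: x[0] ** 2
mids, targets = knn_midpoints(X, y, k=2)
r2_mid, rmse_mid = r2_and_rmse(targets, [model(m) for m in mids])
print(f"r2 = {r2_mid:.3f}, rmse = {rmse_mid:.3f}")
```

    No model refitting is needed to compute these values, which is the property the abstract highlights for situations where cross-validation cannot be performed.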

  15. Student-to-Student Confirmation in the College Classroom: The Development and Validation of the Student-to-Student Confirmation Scale

    ERIC Educational Resources Information Center

    LaBelle, Sara; Johnson, Zac D.

    2018-01-01

    Three studies were conducted to generate a valid and reliable instrument to measure student-to-student confirmation. Study One (N = 396) sought to establish a factor structure based on previous research. Study Two (N = 396) sought to confirm this factor structure and assess criterion-related validity. Study Three (N = 283) sought to assess…

  16. Psychological Flexibility of Nurses in a Cancer Hospital: Preliminary Validation of a Chinese Version of the Work-related Acceptance and Action Questionnaire.

    PubMed

    Xu, Xianghua; Liu, Xiangyu; Ou, Meijun; Xie, Chanjuan; Chen, Yongyi

    2018-01-01

    To translate the English work-related acceptance and action questionnaire (WAAQ), make cross-cultural adaptations, and examine its psychometric properties when used by Chinese oncology nurses. After translation, the psychometric properties of the Chinese WAAQ were analyzed among 417 nurses, and content validity was determined by six experts. Item-level content validity index (CVI) values were between 0.83 and 1.00; scale-level CVI/universal agreement (S-CVI/UA) and S-CVI/average were 0.86 and 0.98, respectively, which indicated good content validity. The correlation of the Chinese WAAQ with AAQ-II (rs = -0.247, P < 0.001) suggested criterion validity, and those with General Health Questionnaire-12 (-0.250, <0.001) and general self-efficacy scale (0.491, <0.001) and Utrecht work engagement scale (UWES) (0.439, <0.001) suggested convergent validity. Exploratory factor analysis identified a seven-item, one-factor structure of WAAQ. The Chinese version of WAAQ had high internal consistency (Cronbach's α = 0.920), with an item-total correlation coefficient of 0.702-0.828 (P < 0.05), split-half reliability of 0.933, and test-retest reliability of 0.772. The Chinese WAAQ is a reliable and valid tool for assessing psychological flexibility in Chinese oncology nurses.
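
    The content validity indices reported above follow simple arithmetic, sketched here. The expert ratings are not given in the abstract, so the ratings below are hypothetical; they assume a 4-point relevance scale on which ratings of 3 or 4 count as "relevant" (a common convention, but an assumption here), with seven items and six experts as in the study.

```python
def content_validity_indices(ratings, relevant=(3, 4)):
    """ratings: one list per item with each expert's relevance rating (1-4)."""
    # Item-level CVI: share of experts rating the item as relevant.
    i_cvi = [sum(r in relevant for r in item) / len(item) for item in ratings]
    # Scale-level CVI, universal agreement: share of items all experts endorsed.
    s_cvi_ua = sum(v == 1.0 for v in i_cvi) / len(i_cvi)
    # Scale-level CVI, average: mean of the item-level values.
    s_cvi_ave = sum(i_cvi) / len(i_cvi)
    return i_cvi, s_cvi_ua, s_cvi_ave

# Hypothetical ratings: six items endorsed by all six experts,
# one item rated not relevant by a single expert.
ratings = [[4, 4, 3, 4, 4, 3]] * 6 + [[4, 4, 3, 4, 2, 3]]
i_cvi, s_cvi_ua, s_cvi_ave = content_validity_indices(ratings)
print(round(min(i_cvi), 2), round(s_cvi_ua, 2), round(s_cvi_ave, 2))
```

    With a single dissenting rating on one of seven items, the rounded values coincide with those reported in the abstract, although the actual rating pattern may have differed.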

  17. Development and validation of the Alcohol Myopia Scale.

    PubMed

    Lac, Andrew; Berger, Dale E

    2013-09-01

    Alcohol myopia theory conceptualizes the ability of alcohol to narrow attention and how this demand on mental resources produces the impairments of self-inflation, relief, and excess. The current research was designed to develop and validate a scale based on this framework. People who were alcohol users rated items representing myopic experiences arising from drinking episodes in the past month. In Study 1 (N = 260), the preliminary 3-factor structure was supported by exploratory factor analysis. In Study 2 (N = 289), the 3-factor structure was substantiated with confirmatory factor analysis, and it was superior in fit to an empirically indefensible 1-factor structure. The final 14-item scale was evaluated with internal consistency reliability, discriminant validity, convergent validity, criterion validity, and incremental validity. The alcohol myopia scale (AMS) illuminates conceptual underpinnings of this theory and yields insights for understanding the tunnel vision that arises from intoxication.

  18. Validation of the Spanish Addiction Severity Index Multimedia Version (S-ASI-MV).

    PubMed

    Butler, Stephen F; Redondo, José Pedro; Fernandez, Kathrine C; Villapiano, Albert

    2009-01-01

    This study aimed to develop and test the reliability and validity of a Spanish adaptation of the ASI-MV, a computer administered version of the Addiction Severity Index, called the S-ASI-MV. Participants were 185 native Spanish-speaking adult clients from substance abuse treatment facilities serving Spanish-speaking clients in Florida, New Mexico, California, and Puerto Rico. Participants were administered the S-ASI-MV as well as Spanish versions of the general health subscale of the SF-36, the work and family unit subscales of the Social Adjustment Scale Self-Report, the Michigan Alcohol Screening Test, the alcohol and drug subscales of the Personality Assessment Inventory, and the Hopkins Symptom Checklist-90. Three-to-five-day test-retest reliability was examined along with criterion validity, convergent/discriminant validity, and factorial validity. Measurement invariance between the English and Spanish versions of the ASI-MV was also examined. The S-ASI-MV demonstrated good test-retest reliability (ICCs for composite scores between .59 and .93), criterion validity (rs for composite scores between .66 and .87), and convergent/discriminant validity. Factorial validity and measurement invariance were demonstrated. These results compared favorably with those reported for the original interviewer version of the ASI and the English version of the ASI-MV.

  19. [CRITERION-RELATED VALIDITY OF SIT-AND-REACH TEST AS A MEASURE OF HAMSTRING EXTENSIBILITY IN OLDER WOMEN].

    PubMed

    López-Miñarro, Pedro Ángel; Vaquero-Cristóbal, Raquel; Muyor, José María; Espejo-Antúnez, Luis

    2015-07-01

    Lumbo-sacral posture and the sit-and-reach score have been proposed as measures of hamstring extensibility. However, their validity is influenced by sample characteristics. This study aimed to determine the validity of the lumbo-horizontal angle and the score in the sit-and-reach test as measures of hamstring extensibility in older women. One hundred and twenty older women performed the straight leg raise test with both legs and the sit-and-reach test (SR) in random order. For the sit-and-reach test, the score and the lumbo-sacral posture in bending (lumbo-horizontal angle, L-Hfx) were measured. The mean values of the straight leg raise in the left and right legs were 81.70 ± 13.83° and 82.10 ± 14.36°, respectively. The mean straight leg raise value across both legs was 81.90 ± 12.70°. The mean values of SR score and L-Hfx were -1.54 ± 8.09 cm and 91.08° ± 9.32°, respectively. The correlations of the mean straight leg raise with lumbo-sacral posture and SR score were moderate (L-Hfx: r = -0.72, p < 0.01; SR: r = 0.70, p < 0.01). Each variable independently explained about 50% of the variance (L-Hfx: R2 = 0.52, p < 0.001; SR: R2 = 0.49, p < 0.001). The validity of lumbo-sacral posture in bending as a measure of hamstring muscle extensibility in older women is moderate, with values similar to those of the SR score. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
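
    The link between the reported correlations and the explained variance can be checked with a short sketch. It assumes simple one-predictor regressions, in which R2 is the square of Pearson's r; the function name is illustrative.

```python
def variance_explained(r):
    """Share of criterion variance explained in a one-predictor linear regression."""
    return r ** 2

# Correlations reported in the abstract above (L-Hfx and SR score).
r_lhfx = -0.72
r_sr = 0.70

print(round(variance_explained(r_lhfx), 2), round(variance_explained(r_sr), 2))
```

    The sign of r drops out when squaring, so a negative and a positive correlation of similar size explain a similar share of variance.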

  20. State of the art in the validation of screening methods for the control of antibiotic residues: is there a need for further development?

    PubMed

    Gaudin, Valérie

    2017-09-01

    Screening methods are used as a first-line approach to detect the presence of antibiotic residues in food of animal origin. The validation process guarantees that the method is fit-for-purpose, suited to regulatory requirements, and provides evidence of its performance. This article is focused on intra-laboratory validation. The first step in validation is characterisation of performance, and the second step is the validation itself with regard to pre-established criteria. The validation approaches can be absolute (a single method) or relative (comparison of methods), overall (combination of several characteristics in one) or criterion-by-criterion. Various approaches to validation, in the form of regulations, guidelines or standards, are presented and discussed to draw conclusions on their potential application for different residue screening methods, and to determine whether or not they reach the same conclusions. The approach by comparison of methods is not suitable for screening methods for antibiotic residues. The overall approaches, such as probability of detection (POD) and accuracy profile, are increasingly used in other fields of application. They may be of interest for screening methods for antibiotic residues. Finally, the criterion-by-criterion approach (Decision 2002/657/EC and of European guideline for the validation of screening methods), usually applied to the screening methods for antibiotic residues, introduced a major characteristic and an improvement in the validation, i.e. the detection capability (CCβ). In conclusion, screening methods are constantly evolving, thanks to the development of new biosensors or liquid chromatography coupled to tandem-mass spectrometry (LC-MS/MS) methods. There have been clear changes in validation approaches these last 20 years. Continued progress is required and perspectives for future development of guidelines, regulations and standards for validation are presented here.

  1. [Reliability and Validity of the Korean Version of the Perinatal Post-Traumatic Stress Disorder Questionnaire].

    PubMed

    Park, Yu Kyung; Ju, Hyeon Ok; Na, Hunjoo

    2016-02-01

    The Perinatal Post-Traumatic Stress Disorder Questionnaire (PPQ) was designed to measure post-traumatic symptoms related to childbirth and symptoms during the postnatal period. The purpose of this study was to develop a translated Korean version of the PPQ and to evaluate the reliability and validity of the Korean PPQ. Participants were 196 mothers at one to 18 months after giving birth, and data were collected through e-mails. The PPQ was translated into Korean using the translation guideline from the World Health Organization. Cronbach's alpha and split-half reliability were used to evaluate the reliability of the PPQ. Exploratory Factor Analysis (EFA), Confirmatory Factor Analysis (CFA), and known-group validity were used to examine construct validity. Correlations of the PPQ with the Impact of Event Scale (IES), Beck Depression Inventory II (BDI-II), and Beck Anxiety Inventory (BAI) were used to test the criterion validity of the PPQ. Cronbach's alpha and the Spearman-Brown split-half correlation coefficient were 0.91 and 0.77, respectively. EFA identified a 3-factor solution including arousal, avoidance, and intrusion factors, and CFA revealed the strongest support for the 3-factor model. The correlations of the PPQ with IES, BDI-II, and BAI were .99, .60, and .72, respectively, pointing to a high level of criterion validity. The Korean version of the PPQ is a useful tool for screening and assessing mothers experiencing emotional distress related to childbirth and during the postnatal period. The PPQ also reflects the diagnostic standards for Post Traumatic Stress Disorder well.
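
    Cronbach's alpha, the internal consistency statistic reported above, can be sketched as follows. The responses are hypothetical and the function name is illustrative; the sketch uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores) and makes no claim about how the study computed its value.

```python
from statistics import variance

def cronbach_alpha(items):
    """items: list of per-item score lists, one entry per respondent in each list."""
    k = len(items)
    # Total score per respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    item_var_sum = sum(variance(scores) for scores in items)
    return (k / (k - 1)) * (1 - item_var_sum / variance(totals))

# Hypothetical responses: 3 items x 5 respondents.
items = [
    [2, 3, 4, 4, 5],
    [1, 3, 3, 4, 5],
    [2, 2, 4, 5, 5],
]
alpha = cronbach_alpha(items)
print(round(alpha, 3))
```

    Alpha approaches 1 as items move together across respondents and falls when item scores vary independently of one another.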

  2. Environmental education curriculum evaluation questionnaire: A reliability and validity study

    NASA Astrophysics Data System (ADS)

    Minner, Daphne Diane

    The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. 
The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating

  3. Validity of the Externalizing Spectrum Inventory in a Criminal Offender Sample: Relations with Disinhibitory Psychopathology, Personality, and Psychopathic Features

    PubMed Central

    Venables, Noah C.; Patrick, Christopher J.

    2013-01-01

    The Externalizing Spectrum Inventory (ESI; Krueger, Markon, Patrick, Benning, & Kramer, 2007) provides a self-report based method for indexing a range of correlated problem behaviors and traits in the domain of deficient impulse control. The ESI organizes lower-order behaviors and traits of this kind around higher-order factors encompassing general disinhibitory proneness, callous-aggression, and substance abuse. The current study used data from a male prisoner sample (N = 235) to evaluate the validity of ESI total and factor scores in relation to external criterion measures consisting of externalizing disorder symptoms (including child and adult antisocial deviance and substance-related problems) assessed via diagnostic interview, personality traits assessed by self-report, and psychopathic features as assessed by both interview and self-report. Results provide evidence for the validity of the ESI measurement model and point to its potential utility as a referent for research on the neurobiological correlates and etiological bases of externalizing proneness. PMID:21787091

  4. Validity of the Externalizing Spectrum Inventory in a criminal offender sample: relations with disinhibitory psychopathology, personality, and psychopathic features.

    PubMed

    Venables, Noah C; Patrick, Christopher J

    2012-03-01

    The Externalizing Spectrum Inventory (ESI; Krueger, Markon, Patrick, Benning, & Kramer, 2007) provides a self-report based method for indexing a range of correlated problem behaviors and traits in the domain of deficient impulse control. The ESI organizes lower order behaviors and traits of this kind around higher order factors encompassing general disinhibitory proneness, callous-aggression, and substance abuse. In the current study, we used data from a male prisoner sample (N = 235) to evaluate the validity of ESI total and factor scores in relation to external criterion measures consisting of externalizing disorder symptoms (including child and adult antisocial deviance and substance-related problems) assessed via diagnostic interviews, personality traits assessed with self-reports, and psychopathic features as assessed with both interviews and self-reports. Results provide evidence for the validity of the ESI measurement model and point to its potential usefulness as a referent for research on the neurobiological correlates and etiological bases of externalizing proneness.

  5. Psychometric validation of the PROQOL-HIV questionnaire, a new health-related quality of life instrument-specific to HIV disease.

    PubMed

    Duracinsky, Martin; Lalanne, Christophe; Le Coeur, Sophie; Herrmann, Susan; Berzins, Baiba; Armstrong, Andrew Richard; Lau, Joseph Tak Fai; Fournier, Isabelle; Chassany, Olivier

    2012-04-15

    This study reports the psychometric validation of a new HIV/AIDS-specific health-related quality of life (HRQL) questionnaire, the Patient Reported Outcomes Quality of Life-HIV. The instrument was developed simultaneously across Europe, North and South America, Africa, Asia, and Australia to assess multidimensional quality of life impairments in the era of highly active antiretroviral therapy. A cross-sectional study was performed in 8 countries. The pilot 70-item questionnaire was co-administered with the HIV symptoms index, the EQ-5D and Medical Outcomes Study-HIV questionnaires. Demographic and biomedical data were collected. After item analysis and reduction, convergent, discriminant, concurrent, and known-group validity were examined. Internal consistency and reliability scores were assessed using Cronbach alpha and intraclass correlation. The final sample of 791 patients was composed of 64% males (median age: 41 years, HIV diagnosis = 5 years); 13.8% were treatment naive. Item reduction yielded a 43-item form surveying 8 dimensions and 1 global health item that showed good convergent and discriminant validity and reliability (98% scaling success; Cronbach alphas 0.77-0.89). Correlations with EQ-5D and Medical Outcomes Study-HIV complied with concurrent validity expectations; likewise, correlations against the number of self-reported symptoms and depression showed good support for criterion validity. A test-retest study on French patients (n = 34) showed temporal stability (intraclass correlation coefficient = 0.86). Significant and meaningful differences of HRQL scores between countries were found. The Patient Reported Outcomes Quality of Life-HIV questionnaire is a valid and reliable instrument for assessing HRQL specific to HIV disease in different cultures and healthcare systems.

  6. Reliability and Validity of Survey Instruments to Measure Work-Related Fatigue in the Emergency Medical Services Setting: A Systematic Review.

    PubMed

    Patterson, P Daniel; Weaver, Matthew D; Fabio, Anthony; Teasley, Ellen M; Renn, Megan L; Curtis, Brett R; Matthews, Margaret E; Kroemer, Andrew J; Xun, Xiaoshuang; Bizhanova, Zhadyra; Weiss, Patricia M; Sequeira, Denisse J; Coppler, Patrick J; Lang, Eddy S; Higgins, J Stephen

    2018-02-15

    This study sought to systematically search the literature to identify reliable and valid survey instruments for fatigue measurement in the Emergency Medical Services (EMS) occupational setting. A systematic review design was used, with searches of six databases, including one website. The research question guiding the search was developed a priori and registered with the PROSPERO database of systematic reviews: "Are there reliable and valid instruments for measuring fatigue among EMS personnel?" (2016:CRD42016040097). The primary outcome of interest was criterion-related validity. Important outcomes of interest included reliability (e.g., internal consistency), and indicators of sensitivity and specificity. Members of the research team independently screened records from the databases. Full-text articles were evaluated by adapting the Bolster and Rourke system for categorizing findings of systematic reviews, and data abstracted from the body of literature were rated as favorable, unfavorable, mixed/inconclusive, or no impact. The Grading of Recommendations, Assessment, Development and Evaluation (GRADE) methodology was used to evaluate the quality of evidence. The search strategy yielded 1,257 unique records. Thirty-four unique experimental and non-experimental studies were determined relevant following full-text review. Nineteen studies reported on the reliability and/or validity of ten different fatigue survey instruments. Eighteen different studies evaluated the reliability and/or validity of four different sleepiness survey instruments. None of the retained studies reported sensitivity or specificity. Evidence quality was rated as very low across all outcomes. In this systematic review, limited evidence of the reliability and validity of 14 different survey instruments to assess the fatigue and/or sleepiness status of EMS personnel and related shift worker groups was identified.

  7. Validation of the Lollipop Test: A Diagnostic Screening Test of School Readiness.

    ERIC Educational Resources Information Center

    Chew, Alex L.; Morris, John D.

    1984-01-01

    The validity of the Lollipop Test: A Diagnostic Screening Test of School Readiness was examined using the Metropolitan Readiness Test (MRT), Level I, Form Q, as the criterion. Appreciable concurrent validity was found across test batteries. Implications for school readiness screening are discussed. (Author/BS)

  8. Reconsidering vocational interests for personnel selection: the validity of an interest-based selection test in relation to job knowledge, job performance, and continuance intentions.

    PubMed

    Van Iddekinge, Chad H; Putka, Dan J; Campbell, John P

    2011-01-01

    Although vocational interests have a long history in vocational psychology, they have received extremely limited attention within the recent personnel selection literature. We reconsider some widely held beliefs concerning the (low) validity of interests for predicting criteria important to selection researchers, and we review theory and empirical evidence that challenge such beliefs. We then describe the development and validation of an interests-based selection measure. Results of a large validation study (N = 418) reveal that interests predicted a diverse set of criteria—including measures of job knowledge, job performance, and continuance intentions—with corrected, cross-validated Rs that ranged from .25 to .46 across the criteria (mean R = .31). Interests also provided incremental validity beyond measures of general cognitive aptitude and facets of the Big Five personality dimensions in relation to each criterion. Furthermore, with a couple exceptions, the interest scales were associated with small to medium subgroup differences, which in most cases favored women and racial minorities. Taken as a whole, these results appear to call into question the prevailing thought that vocational interests have limited usefulness for selection.

  9. Validating Pedometer-Based Physical Activity Time against Accelerometer in Middle School Physical Education

    ERIC Educational Resources Information Center

    Gao, Zan; Lee, Amelia M.; Solmon, Melinda A.; Kosma, Maria; Carson, Russell L.; Zhang, Tao; Domangue, Elizabeth; Moore, Delilah

    2010-01-01

    The purpose of this study was to validate physical activity time in middle school physical education as measured by pedometers in relation to a criterion measure, namely, students' accelerometer determined moderate to vigorous physical activity (MVPA). Participants were 155 sixth to eighth graders participating in regularly scheduled physical…

  10. Comparison of the Incremental Validity of the Old and New MCAT.

    ERIC Educational Resources Information Center

    Wolf, Fredric M.; And Others

    The predictive and incremental validity of both the Old and New Medical College Admission Test (MCAT) was examined and compared with a sample of over 300 medical students. Results of zero order and incremental validity coefficients, as well as prediction models resulting from all possible subsets regression analyses using Mallow's Cp criterion,…

  11. Community validation of the IDEA study cognitive screen in rural Tanzania.

    PubMed

    Gray, William K; Paddick, Stella Maria; Collingwood, Cecilia; Kisoli, Aloyce; Mbowe, Godfrey; Mkenda, Sarah; Lissu, Carolyn; Rogathi, Jane; Kissima, John; Walker, Richard W; Mushi, Declare; Chaote, Paul; Ogunniyi, Adesola; Dotchin, Catherine L

    2016-11-01

    The dementia diagnosis gap in sub-Saharan Africa (SSA) is large, partly because of difficulties in screening for cognitive impairment in the community. As part of the Identification and Intervention for Dementia in Elderly Africans (IDEA) study, we aimed to validate the IDEA cognitive screen in a community-based sample in rural Tanzania. Study participants were recruited from people who attended screening days held in villages within the rural Hai district of Tanzania. Criterion validity was assessed against the gold standard clinical dementia diagnosis using DSM-IV criteria. Construct validity was assessed against age, education, sex, grip strength, and instrumental activities of daily living (IADLs). Internal consistency and floor and ceiling effects were also examined. During community screening, the IDEA cognitive screen had high criterion validity, with an area under the receiver operating characteristic curve of 0.855 (95% CI 0.794 to 0.915). Higher scores on the screen were significantly correlated with lower age, male sex, having attended school, better grip strength and improved performance in activities of daily living. Factor analysis revealed a single factor with an eigenvalue greater than one, although internal consistency was only moderate (Cronbach's alpha = 0.534). The IDEA cognitive screen had high criterion and construct validity and is suitable for use as a cognitive screening instrument in a community setting in SSA. Only moderate internal consistency may partly reflect the multi-domain nature of dementia as diagnosed clinically. Copyright © 2016 John Wiley & Sons, Ltd.
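
    The area under the ROC curve used above as the criterion validity statistic can be read as the probability that a randomly chosen case is ranked above a randomly chosen non-case. A minimal sketch of that pairwise definition follows; the scores are hypothetical "risk" values where higher means more likely impaired (for a screen where low scores indicate impairment, the scores would be negated first), and the function name is illustrative.

```python
def auc(case_scores, noncase_scores):
    """Pairwise AUC: share of (case, non-case) pairs ranked correctly, ties counted as half."""
    wins = 0.0
    for c in case_scores:
        for n in noncase_scores:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5
    return wins / (len(case_scores) * len(noncase_scores))

# Hypothetical risk scores for people with and without the criterion diagnosis.
cases = [7, 9, 6, 8]
noncases = [3, 5, 6, 2, 4]
print(auc(cases, noncases))
```

    A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation of the two groups.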

  12. Test validity and performance validity: considerations in providing a framework for development of an ability-focused neuropsychological test battery.

    PubMed

    Larrabee, Glenn J

    2014-11-01

    Literature on test validity and performance validity is reviewed to propose a framework for specification of an ability-focused battery (AFB). Factor analysis supports six domains of ability: first, verbal symbolic; secondly, visuoperceptual and visuospatial judgment and problem solving; thirdly, sensorimotor skills; fourthly, attention/working memory; fifthly, processing speed; finally, learning and memory (which can be divided into verbal and visual subdomains). The AFB should include at least three measures for each of the six domains, selected based on various criteria for validity including sensitivity to presence of disorder, sensitivity to severity of disorder, correlation with important activities of daily living, and containing embedded/derived measures of performance validity. Criterion groups should include moderate and severe traumatic brain injury, and Alzheimer's disease. Validation groups should also include patients with left and right hemisphere stroke, to determine measures sensitive to lateralized cognitive impairment and so that the moderating effects of auditory comprehension impairment and neglect can be analyzed on AFB measures. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. Biofeedback in Partial Weight Bearing: Validity of 3 Different Devices.

    PubMed

    van Lieshout, Remko; Stukstette, Mirelle J; de Bie, Rob A; Vanwanseele, Benedicte; Pisters, Martijn F

    2016-11-01

    Study Design: Controlled laboratory study to assess criterion-related validity, with a cross-sectional within-subject design. Background: Patients with orthopaedic conditions have difficulties complying with partial weight-bearing instructions. Technological advances have resulted in biofeedback devices that offer real-time feedback. However, the accuracy of these devices is mostly unknown. Inaccurate feedback can result in incorrect lower-limb loading and may lead to delayed healing. Objectives: To investigate validity of peak force measurements obtained using 3 different biofeedback devices under varying levels of partial weight-bearing categories. Methods: Validity of 3 biofeedback devices (OpenGo science, SmartStep, and SensiStep) was assessed. Healthy participants were instructed to walk at a self-selected speed with crutches under 3 different weight-bearing conditions, categorized as a percentage range of body weight: 1% to 20%, greater than 20% to 50%, and greater than 50% to 75%. Peak force data from the biofeedback devices were compared with the peak vertical ground reaction force measured with a force plate. Criterion validity was estimated using simple and regression-based Bland-Altman 95% limits of agreement and weighted kappas. Results: Fifty-five healthy adults (58% male) participated. Agreement with the gold standard was substantial for the SmartStep, moderate for OpenGo science, and slight for SensiStep (weighted κ = 0.76, 0.58, and 0.19, respectively). For the 1% to 20% and greater than 20% to 50% weight-bearing categories, both the OpenGo science and SmartStep had acceptable limits of agreement. For the weight-bearing category greater than 50% to 75%, none of the devices had acceptable agreement. Conclusion: The OpenGo science and SmartStep provided valid feedback in the lower weight-bearing categories, and the SensiStep showed poor validity of feedback in all weight-bearing categories. J Orthop Sports Phys Ther 2016;46(11):-1. Epub 12 Oct 2016. 
doi:10
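
    The simple Bland-Altman 95% limits of agreement named in the abstract above can be sketched as follows. Only the simple (not the regression-based) variant is shown; the paired readings are hypothetical percentages of body weight, and the function name is illustrative.

```python
from statistics import mean, stdev

def bland_altman_limits(device, reference):
    """Return (bias, lower limit, upper limit) for paired device and reference readings."""
    diffs = [d - r for d, r in zip(device, reference)]
    bias = mean(diffs)            # mean difference between the two methods
    spread = 1.96 * stdev(diffs)  # half-width of the 95% limits of agreement
    return bias, bias - spread, bias + spread

# Hypothetical paired peak-force readings (device vs. force plate), in % body weight.
device = [18, 22, 35, 41, 48]
force_plate = [20, 21, 33, 44, 50]

bias, lower, upper = bland_altman_limits(device, force_plate)
print(round(bias, 2), round(lower, 2), round(upper, 2))
```

    Whether limits of this width are "acceptable" is a judgment made against a clinically motivated threshold, which the sketch does not attempt to encode.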

  14. The Reliability and Validity of the Coopersmith Self-Esteem Inventory-Form B.

    ERIC Educational Resources Information Center

    Chiu, Lian-Hwang

    1985-01-01

    The purpose of this study was to determine the test-retest reliability and concurrent validity of the short form (Form B) of the Coopersmith Self-Esteem Inventory. Criterion measures for validity included: (1) sociometric measures; (2) teacher's popularity ranking; and, (3) self-esteem rating. (Author/LMO)

  15. Appearance motives to tan and not tan: evidence for validity and reliability of a new scale.

    PubMed

    Cafri, Guy; Thompson, J Kevin; Roehrig, Megan; Rojas, Ariz; Sperry, Steffanie; Jacobsen, Paul B; Hillhouse, Joel

    2008-04-01

    Risk for skin cancer is increased by UV exposure and decreased by sun protection. Appearance reasons to tan and not tan have consistently been shown to be related to intentions and behaviors to UV exposure and protection. This study was designed to determine the factor structure of appearance motives to tan and not tan, evaluate the extent to which this factor structure is gender invariant, test for mean differences in the identified factors, and evaluate internal consistency, temporal stability, and criterion-related validity. Five-hundred eighty-nine females and 335 male college students were used to test confirmatory factor analysis models within and across gender groups, estimate latent mean differences, and use the correlation coefficient and Cronbach's alpha to further evaluate the reliability and validity of the identified factors. A measurement invariant (i.e., factor-loading invariant) model was identified with three higher-order factors: sociocultural influences to tan (lower order factors: media, friends, family, significant others), appearance reasons to tan (general, acne, body shape), and appearance reasons not to tan (skin aging, immediate skin damage). Females had significantly higher means than males on all higher-order factors. All subscales had evidence of internal consistency, temporal stability, and criterion-related validity. This study offers a framework and measurement instrument that has evidence of validity and reliability for evaluating appearance-based motives to tan and not tan.

  16. Measuring physical activity in young people with cerebral palsy: validity and reliability of the ActivPAL™ monitor.

    PubMed

    Bania, Theofani

    2014-09-01

    We determined the criterion validity and the retest reliability of the ActivPAL™ monitor in young people with diplegic cerebral palsy (CP). Activity monitor data were compared with the criterion of video recording for 10 participants. For the retest reliability, activity monitor data were collected from 24 participants on two occasions. Participants had to have diplegic CP and be between 14 and 22 years of age. They also had to be of Gross Motor Function Classification System level II or III. Outcomes were time spent in standing, number of steps (physical activity) and time spent in sitting (sedentary behaviour). For criterion validity, coefficients of determination were all high (r² ≥ 0.96), and limits of group agreement were relatively narrow, but limits of agreement for individuals were narrow only for number of steps (≥5.5%). Relative reliability was high for number of steps (intraclass correlation coefficient = 0.87) and moderate for time spent in sitting and lying, and time spent in standing (intraclass correlation coefficients = 0.60-0.66). For groups, changes of up to 7% could be due to measurement error with 95% confidence, but for individuals, changes as high as 68% could be due to measurement error. The results support the criterion validity and the retest reliability of the ActivPAL™ to measure physical activity and sedentary behaviour in groups of young people with diplegic CP but not in individuals. Copyright © 2014 John Wiley & Sons, Ltd.

  17. Clinical validity of prototype personality disorder ratings in adolescents.

    PubMed

    Defife, Jared A; Haggerty, Greg; Smith, Scott W; Betancourt, Luis; Ahmed, Zain; Ditkowsky, Keith

    2015-01-01

    A growing body of research shows that personality pathology in adolescents is clinically distinctive and frequently stable into adulthood. A reliable and useful method for rating personality pathology in adolescent patients has the potential to enhance conceptualization, dissemination, and treatment effectiveness. The aim of this study is to examine the clinical validity of a prototype matching approach (derived from the Shedler Westen Assessment Procedure-Adolescent Version) for quantifying personality pathology in an adolescent inpatient sample. Sixty-six adolescent inpatients and their parents or legal guardians completed forms of the Child Behavior Checklist (CBCL) assessing emotional and behavioral problems. Clinical criterion variables including suicide history, substance use, and fights with peers were also assessed. Patients' individual and group therapists on the inpatient unit completed personality prototype ratings. Prototype diagnoses demonstrated substantial reliability (median intraclass correlation coefficient =.75) across independent ratings from individual and group therapists. Personality prototype ratings correlated with the CBCL scales and clinical criterion variables in anticipated and meaningful ways. As seen in prior research with adult samples, prototype personality ratings show clinical validity across independent clinician raters previously unfamiliar with the approach, and they are meaningfully related to clinical symptoms, behavioral problems, and adaptive functioning.

  18. Clinical Validity of Prototype Personality Disorder Ratings in Adolescents

    PubMed Central

    DeFife, Jared A.; Haggerty, Greg; Smith, Scott W.; Betancourt, Luis; Ahmed, Zain; Ditkowsky, Keith

    2015-01-01

    A growing body of research shows that personality pathology in adolescents is clinically distinctive and frequently stable into adulthood. A reliable and useful method for rating personality pathology in adolescent patients has the potential to enhance conceptualization, dissemination, and treatment effectiveness. The aim of this study is to examine the clinical validity of a prototype matching approach (derived from the Shedler Westen Assessment Procedure – Adolescent Version) for quantifying personality pathology in an adolescent inpatient sample. Sixty-six adolescent inpatients and their parents or legal guardians completed forms of the Child Behavior Checklist (CBCL) assessing emotional and behavioral problems. Clinical criterion variables including suicide history, substance use, and fights with peers were also assessed. Patients’ individual and group therapists on the inpatient unit completed personality prototype ratings. Prototype diagnoses demonstrated substantial reliability (median ICC = .75) across independent ratings from individual and group therapists. Personality prototype ratings correlated with the CBCL scales and clinical criterion variables in anticipated and meaningful ways. As seen in prior research with adult samples, prototype personality ratings show clinical validity across independent clinician raters previously unfamiliar with the approach, and they are meaningfully related to clinical symptoms, behavioral problems, and adaptive functioning. PMID:25457971

  19. Use of the recognition heuristic depends on the domain's recognition validity, not on the recognition validity of selected sets of objects.

    PubMed

    Pohl, Rüdiger F; Michalkiewicz, Martha; Erdfelder, Edgar; Hilbig, Benjamin E

    2017-07-01

    According to the recognition-heuristic theory, decision makers solve paired comparisons in which one object is recognized and the other not by recognition alone, inferring that recognized objects have higher criterion values than unrecognized ones. However, the success (and thus usefulness) of this heuristic depends on the validity of recognition as a cue, and adaptive decision making, in turn, requires that decision makers are sensitive to it. To this end, decision makers could base their evaluation of the recognition validity either on the selected set of objects (the set's recognition validity), or on the underlying domain from which the objects were drawn (the domain's recognition validity). In two experiments, we manipulated the recognition validity both in the selected set of objects and between domains from which the sets were drawn. The results clearly show that use of the recognition heuristic depends on the domain's recognition validity, not on the set's recognition validity. In other words, participants treat all sets as roughly representative of the underlying domain and adjust their decision strategy adaptively (only) with respect to the more general environment rather than the specific items they are faced with.

  20. The Physical Education and School Sport Environment Inventory: Preliminary Validation and Reliability

    ERIC Educational Resources Information Center

    Fairclough, Stuart J.; Hilland, Toni A.; Vinson, Don; Stratton, Gareth

    2012-01-01

    The study purpose was to assess preliminary validity and reliability of the Physical Education and School Sport Environment Inventory (PESSEI), which was designed to audit physical education (PE) and school sport spaces and resources. PE teachers from eight English secondary schools completed the PESSEI. Criterion validity was assessed by…

  1. [Reliability and validity analysis of simplified Chinese version of QOL questionnaire of olfactory disorders].

    PubMed

    Jin, X F; Wang, J; Li, Y J; Liu, J F; Ni, D F

    2016-09-20

    Objective: To cross-culturally translate the Questionnaire of Olfactory Disorders (QOD) into a simplified Chinese version and evaluate its reliability and validity in clinical use. Method: A simplified Chinese version of the QOD was evaluated for test-retest reliability, split-half reliability and internal consistency. It was then evaluated for validity, including content validity, criterion-related validity and responsiveness. Criterion-related validity was assessed by comparison with the Medical Outcomes Study 36-item Short Form Health Survey (SF-36) and the World Health Organization Quality of Life-BREF (WHOQOL-BREF). Result: A total of 239 patients with olfactory dysfunction were enrolled and tested, of whom 195 completed all three surveys (QOD, SF-36, WHOQOL-BREF). The test-retest reliabilities of the QOD-parosmia statements (QOD-P), QOD-quality of life (QOD-QoL), and QOD-visual analogue scale (QOD-VAS) sections were 0.799 (P < 0.01), 0.781 (P < 0.01), and 0.488 (P < 0.01), respectively, and the Cronbach's α coefficients were 0.477, 0.812, and 0.889, respectively. The split-half reliability of QOD-QoL was 0.89. There was no correlation between the QOD-P section and the SF-36, but there were statistically significant correlations between the QOD-QoL and QOD-VAS sections and the SF-36. There was no correlation between the QOD-P section and the WHOQOL-BREF, but there were statistically significant correlations between the QOD-QoL and QOD-VAS sections with the SF-36 in most sections. Conclusion: The simplified Chinese version of the QOD was shown to be a reliable and valid questionnaire for evaluating patients with olfactory dysfunction living in mainland China. The QOD-P section needs further modification to properly suit patients with a Chinese cultural and knowledge background. Copyright© by the Editorial Department of Journal of Clinical Otorhinolaryngology Head and Neck Surgery.

  2. A Comparison of Three Job Engagement Measures: Examining their Factorial and Criterion-Related Validity.

    PubMed

    Wefald, Andrew J; Mills, Maura J; Smith, Michael R; Downey, Ronald G

    2012-03-01

    Engagement is an emerging job attitude that purports to measure employees' psychological presence at and involvement in their work. This research compares three academic approaches to engagement, and makes recommendations regarding the most appropriate conceptualisation and measurement of the construct in future research. The current research also investigates whether any of these three approaches to engagement contribute unique variance to the prediction of turnover intentions above and beyond the predictive capacity of alternative constructs. An online survey was taken by 382 employees and managers from a mid-sized financial institution. Results failed to support either a multi- or unidimensional factor structure for the Utrecht Work Engagement Scale (UWES) engagement measure. For the Shirom-Melamed Vigor Measure (SMVM), a multi-dimensional structure was identified as a good fit, while a unidimensional structure fit poorly. The uni-factorial structure of Britt's engagement measure was confirmed. The Schaufeli measure of engagement was a strong predictor of work outcomes; however, when controlling for job satisfaction and affective commitment, that measure lost its ability to predict intentions to leave. Two components of the Shirom vigor measure held their predictive validity. Collectively, these findings suggest that the Shirom vigor measure may provide better insight into whether and how much a person is 'into' his or her job. The Schaufeli measure was a good predictor of important work outcomes, but when job satisfaction and affective commitment were controlled, it lost its predictive validity. We were not able to confirm the three-factor structure of the Schaufeli measure. Two components of the Shirom vigor measure predicted turnover intentions after controlling for job satisfaction and affective commitment, suggesting less overlap with those constructs than the Schaufeli measure of engagement. This research adds important information on the nature of

  3. Development and validation of the Chinese version of the Diabetes Management Self-efficacy Scale.

    PubMed

    Vivienne Wu, Shu-Fang; Courtney, Mary; Edwards, Helen; McDowell, Jan; Shortridge-Baggett, Lillie M; Chang, Pei-Jen

    2008-04-01

    The purpose of this study was to translate the Diabetes Management Self-Efficacy Scale (DMSES) into Chinese and test the validity and reliability of the instrument within a Taiwanese population. A two-stage design was used for this study. Stage I consisted of a multi-stepped process of forward and backward translation, using focus groups and consensus meetings to translate the 20-item Australia/English version DMSES to Chinese and test content validity. Stage II established the psychometric properties of the Chinese version DMSES (C-DMSES) by examining the criterion, convergent and construct validity, internal consistency and stability testing. The sample for Stage II comprised 230 patients with type 2 diabetes aged 30 years or more from a diabetes outpatient clinic in Taiwan. Three items were modified to better reflect Chinese practice. The C-DMSES obtained a total average CVI score of .86. The convergent validity of the C-DMSES correlated well with the validated measure of the General Self-Efficacy Scale in measuring self-efficacy (r=.55; p<.01). Criterion-related validity showed that the C-DMSES was a significant predictor of the Summary of Diabetes Self-Care Activities scores (Beta=.58; t=10.75, p<.01). Factor analysis supported the C-DMSES being composed of four subscales. Good internal consistency (Cronbach's alpha=.77 to .93) and test-retest reliability (Pearson correlation coefficient r=.86, p<.01) were found. The C-DMSES is a brief and psychometrically sound measure for evaluation of self-efficacy towards management of diabetes by persons with type 2 diabetes in Chinese populations.

  4. Statistical Validation of a Web-Based GIS Application and Its Applicability to Cardiovascular-Related Studies.

    PubMed

    Lee, Jae Eun; Sung, Jung Hye; Malouhi, Mohamad

    2015-12-22

    There is abundant evidence that neighborhood characteristics are significantly linked to the health of the inhabitants of a given space within a given time frame. This study aims to statistically validate a web-based GIS application designed to support cardiovascular-related research, developed by the NIH-funded Research Centers in Minority Institutions (RCMI) Translational Research Network (RTRN) Data Coordinating Center (DCC), and to discuss its applicability to cardiovascular studies. Geo-referencing, geocoding and geospatial analyses were conducted for 500 randomly selected home addresses in a U.S. southeastern metropolitan area. The correlation coefficient, factor analysis and Cronbach's alpha (α) were estimated to quantify measures of the internal consistency, reliability and construct/criterion/discriminant validity of the cardiovascular-related geospatial variables (walk score, number of hospitals, fast food restaurants, parks and sidewalks). Cronbach's α for CVD GEOSPATIAL variables was 95.5%, implying successful internal consistency. Walk scores were significantly correlated with number of hospitals (r = 0.715; p < 0.0001), fast food restaurants (r = 0.729; p < 0.0001), parks (r = 0.773; p < 0.0001) and sidewalks (r = 0.648; p < 0.0001) within a mile from homes. They were also significantly associated with diversity index (r = 0.138, p = 0.0023), median household incomes (r = -0.181; p < 0.0001), and owner occupied rates (r = -0.440; p < 0.0001). However, walk scores were not significantly correlated with median age, vulnerability, unemployment rate, labor force, or population growth rate. Our data demonstrate that geospatial data generated by the web-based application were internally consistent and showed satisfactory validity. Therefore, the GIS application may be useful in cardiovascular-related studies that aim to investigate the potential impact of geospatial factors on diseases and/or the long-term effect of clinical trials.

  5. Psychological Flexibility of Nurses in a Cancer Hospital: Preliminary Validation of a Chinese Version of the Work-related Acceptance and Action Questionnaire

    PubMed Central

    Xu, Xianghua; Liu, Xiangyu; Ou, Meijun; Xie, Chanjuan; Chen, Yongyi

    2018-01-01

    Objective: To translate the English Work-related Acceptance and Action Questionnaire (WAAQ), make cross-cultural adaptations, and examine its psychometric properties when used by Chinese oncology nurses. Methods: After translation, the psychometric properties of the Chinese WAAQ were analyzed among 417 nurses, and content validity was determined by six experts. Results: Item-level content validity index (CVI) values were between 0.83 and 1.00; scale-level CVI/universal agreement (S-CVI/UA) and S-CVI/average were 0.86 and 0.98, respectively, which indicated good content validity. The correlation of the Chinese WAAQ with the AAQ-II (rs = −0.247, P < 0.001) suggested criterion validity, and its correlations with the General Health Questionnaire-12 (−0.250, <0.001), the General Self-Efficacy Scale (0.491, <0.001) and the Utrecht Work Engagement Scale (UWES) (0.439, <0.001) suggested convergent validity. Exploratory factor analysis identified a seven-item, one-factor structure of the WAAQ. The Chinese version of the WAAQ had high internal consistency (Cronbach's α = 0.920), with item-total correlation coefficients of 0.702–0.828 (P < 0.05), split-half reliability of 0.933, and test-retest reliability of 0.772. Conclusions: The Chinese WAAQ is a reliable and valid tool for assessing psychological flexibility in Chinese oncology nurses. PMID:29379839
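
    Internal consistency figures such as the Cronbach's α reported in the record above are computed from item-level responses. The following is a minimal sketch of the standard formula, α = k/(k − 1) × (1 − sum of item variances / variance of total scores). The function name and the toy responses are assumptions made for illustration; they are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    items: list of k lists, one per questionnaire item, each holding the
    scores of the same n respondents in the same order (hypothetical layout).
    """
    k = len(items)
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    n = len(items[0])
    # Total score of each respondent across all items.
    totals = [sum(item[i] for item in items) for i in range(n)]
    # Sum of the sample variances of the individual items.
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Toy data (invented): three items answered by four respondents.
toy_items = [
    [1, 2, 3, 4],
    [2, 2, 3, 5],
    [1, 3, 3, 4],
]
print(round(cronbach_alpha(toy_items), 3))
```

    Sample variances (n − 1 denominator) are used for both the items and the totals; any consistent choice of denominator gives the same α because it cancels in the ratio.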

  6. Validation of a Spanish version of the Spine Functional Index.

    PubMed

    Cuesta-Vargas, Antonio I; Gabel, Charles P

    2014-06-27

    The Spine Functional Index (SFI) is a recently published, robust and clinimetrically valid patient reported outcome measure. The purpose of this study was the adaptation and validation of a Spanish-version (SFI-Sp) with cultural and linguistic equivalence. A two stage observational study was conducted. The SFI was cross-culturally adapted to Spanish through double forward and backward translation then validated for its psychometric characteristics. Participants (n = 226) with various spine conditions of >12 weeks duration completed the SFI-Sp and a region specific measure: for the back, the Roland Morris Questionnaire (RMQ) and Backache Index (BADIX); for the neck, the Neck Disability Index (NDI); for general health the EQ-5D and SF-12. The full sample was employed to determine internal consistency, concurrent criterion validity by region and health, construct validity and factor structure. A subgroup (n = 51) was used to determine reliability at seven days. The SFI-Sp demonstrated high internal consistency (α = 0.85) and reliability (r = 0.96). The factor structure was one-dimensional and supported construct validity. Criterion specific validity for function was high with the RMQ (r = 0.79), moderate with the BADIX (r = 0.59) and low with the NDI (r = 0.46). For general health it was low with the EQ-5D and inversely correlated (r = -0.42) and fair with the Physical and Mental Components of the SF-12 and inversely correlated (r = -0.56 and r = -0.48), respectively. The study limitations included the lack of longitudinal data regarding other psychometric properties, specifically responsiveness. The SFI-Sp was demonstrated as a valid and reliable spine-regional outcome measure. The psychometric properties were comparable to and supported those of the English-version, however further longitudinal investigations are required.

  7. INCLEN Diagnostic Tool for Autism Spectrum Disorder (INDT-ASD): development and validation.

    PubMed

    Juneja, Monica; Mishra, Devendra; Russell, Paul S S; Gulati, Sheffali; Deshmukh, Vaishali; Tudu, Poma; Sagar, Rajesh; Silberberg, Donald; Bhutani, Vinod K; Pinto, Jennifer M; Durkin, Maureen; Pandey, Ravindra M; Nair, M K C; Arora, Narendra K

    2014-05-01

    To develop and validate the INCLEN Diagnostic Tool for Autism Spectrum Disorder (INDT-ASD). Diagnostic test evaluation by cross sectional design. Four tertiary pediatric neurology centers in Delhi and Thiruvananthapuram, India. Children aged 2-9 years were enrolled in the study. INDT-ASD and Childhood Autism Rating Scale (CARS) were administered in a randomly decided sequence by a trained psychologist, followed by an expert evaluation by DSM-IV-TR diagnostic criteria (gold standard). Psychometric parameters of diagnostic accuracy, validity (construct, criterion and convergent) and internal consistency. 154 children (110 boys, mean age 64.2 mo) were enrolled. The overall diagnostic accuracy (AUC=0.97, 95% CI 0.93, 0.99; P<0.001) and validity (sensitivity 98%, specificity 95%, positive predictive value 91%, negative predictive value 99%) of INDT-ASD for autism spectrum disorder were high, taking expert diagnosis using DSM-IV-TR as gold standard. The concordance rate between the INDT-ASD and expert diagnosis for 'ASD group' was 82.52% [Cohen's k=0.89; 95% CI (0.82, 0.97); P=0.001]. The internal consistency of INDT-ASD was 0.96. The convergent validity with CARS (r = 0.73, P= 0.001) and divergent validity with Binet-Kamat Test of intelligence (r = -0.37; P=0.004) were significantly high. INDT-ASD has a 4-factor structure explaining 85.3% of the variance. INDT-ASD has high diagnostic accuracy, adequate content validity, good internal consistency, high criterion validity, high to moderate convergent validity, and 4-factor construct validity for diagnosis of autism spectrum disorder.
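
    The criterion-validity figures in the record above (sensitivity, specificity, predictive values, Cohen's kappa) all derive from a 2×2 table of test result against gold-standard diagnosis. The sketch below shows the usual definitions. The function name and the counts are hypothetical and chosen only to make the arithmetic easy to follow; the abstract does not report the underlying table.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Accuracy metrics from a 2x2 table (test result vs. gold standard).

    tp/fn: gold-standard positives the test flagged / missed.
    fp/tn: gold-standard negatives the test flagged / cleared.
    """
    n = tp + fp + fn + tn
    observed_agreement = (tp + tn) / n
    # Agreement expected by chance, from the row and column marginals.
    chance_agreement = (
        (tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)
    ) / (n * n)
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "kappa": (observed_agreement - chance_agreement)
        / (1 - chance_agreement),
    }


# Invented counts for illustration only (not the INDT-ASD data).
example = diagnostic_metrics(tp=45, fp=5, fn=5, tn=95)
for name, value in example.items():
    print(f"{name}: {value:.2f}")
```

    Predictive values depend on how common the condition is in the sample, so they do not transfer directly to settings with a different prevalence, whereas sensitivity and specificity are properties of the test against the chosen gold standard.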

  8. Validation in the clinical process: four settings for objectification of the subjectivity of understanding.

    PubMed

    Beland, H

    1994-12-01

    Clinical material is presented for discussion with the aim of exemplifying the author's conceptions of validation in a number of sessions and in psychoanalytic research and of making them verifiable, susceptible to consensus and/or falsifiable. Since Freud's postscript to the Dora case, the first clinical validation in the history of psychoanalysis, validation has been group-related and society-related, that is to say, it combines the evidence of subjectivity with the consensus of the research community (the scientific community). Validation verifies the conformity of the unconscious transference meaning with the analyst's understanding. The deciding criterion is the patient's reaction to the interpretation. In terms of the theory of science, validation in the clinical process corresponds to experimental testing of truth in the sphere of inanimate nature. Four settings of validation can be distinguished: the analyst's self-supervision during the process of understanding, which goes from incomprehension to comprehension (container-contained, PS-->D, selected fact); the patient's reaction to the interpretation (insight) and the analyst's assessment of the reaction; supervision and second thoughts; and discussion in groups and publications leading to consensus. It is a peculiarity of psychoanalytic research that in the event of positive validation the three criteria of truth (evidence, consensus and utility) coincide.

  9. Criterion-Validity of Commercially Available Physical Activity Tracker to Estimate Step Count, Covered Distance and Energy Expenditure during Sports Conditions

    PubMed Central

    Wahl, Yvonne; Düking, Peter; Droszez, Anna; Wahl, Patrick; Mester, Joachim

    2017-01-01

    Background: In recent years, there has been increasing development of physical activity trackers (Wearables). For recreational people, testing of these devices under walking or light jogging conditions might be sufficient. For (elite) athletes, however, scientific trustworthiness needs to be given for a broad spectrum of velocities or even fast changes in velocities reflecting the demands of the sport. Therefore, the aim was to evaluate the validity of eleven Wearables for monitoring step count, covered distance and energy expenditure (EE) under laboratory conditions with different constant and varying velocities. Methods: Twenty healthy sport students (10 men, 10 women) performed a running protocol consisting of four 5 min stages of different constant velocities (4.3; 7.2; 10.1; 13.0 km·h−1), a 5 min period of intermittent velocity, and a 2.4 km outdoor run (10.1 km·h−1) while wearing eleven different Wearables (Bodymedia Sensewear, Beurer AS80, Polar Loop, Garmin Vivofit, Garmin Vivosmart, Garmin Vivoactive, Garmin Forerunner 920XT, Fitbit Charge, Fitbit Charge HR, Xiaomi MiBand, Withings Pulse Ox). Step count, covered distance, and EE were evaluated by comparing each Wearable with a criterion method (Optogait system and manual counting for step count, treadmill for covered distance and indirect calorimetry for EE). Results: All Wearables, except Bodymedia Sensewear, Polar Loop, and Beurer AS80, revealed good validity (small MAPE, good ICC) for all constant and varying velocities for monitoring step count. For covered distance, all Wearables showed a very low ICC (<0.1) and high MAPE (up to 50%), revealing no good validity. The measurement of EE was acceptable for the Garmin, Fitbit and Withings Wearables (small to moderate MAPE), while Bodymedia Sensewear, Polar Loop, and Beurer AS80 showed a high MAPE up to 56% for all test conditions. Conclusion: In our study, most Wearables provide an acceptable level of validity for step counts at different constant and

  10. [Elaboration and validation of a tool to measure psychological well-being: WBMMS].

    PubMed

    Massé, R; Poulin, C; Dassa, C; Lambert, J; Bélair, S; Battaglini, M A

    1998-01-01

    Psychological well-being scales used in epidemiologic surveys usually show high construct validity. The content validation, however, is less convincing since these scales rest on lists of items that reflect the theoretical model of the authors. In this study we present results of the construct and criterion validation of a new Well-Being Manifestations Measure Scale (WBMMS) founded on an initial list of manifestations derived from an original content validation in a general population. It is concluded that national and public health epidemiologic surveys should include both measures of positive and negative mental health.

  11. Detecting symptom exaggeration in combat veterans using the MMPI-2 symptom validity scales: a mixed group validation.

    PubMed

    Tolin, David F; Steenkamp, Maria M; Marx, Brian P; Litz, Brett T

    2010-12-01

    Although validity scales of the Minnesota Multiphasic Personality Inventory-2 (MMPI-2; J. N. Butcher, W. G. Dahlstrom, J. R. Graham, A. Tellegen, & B. Kaemmer, 1989) have proven useful in the detection of symptom exaggeration in criterion-group validation (CGV) studies, usually comparing instructed feigners with known patient groups, the application of these scales has been problematic when assessing combat veterans undergoing posttraumatic stress disorder (PTSD) examinations. Mixed group validation (MGV) was employed to determine the efficacy of MMPI-2 exaggeration scales in compensation-seeking (CS) and noncompensation-seeking (NCS) veterans. Unlike CGV, MGV allows for a mix of exaggerating and nonexaggerating individuals in each group, does not require that the exaggeration versus nonexaggerating status of any individual be known, and can be adjusted for different base-rate estimates. MMPI-2 responses of 377 male veterans were examined according to CS versus NCS status. MGV was calculated using 4 sets of base-rate estimates drawn from the literature. The validity scales generally performed well (adequate sensitivity, specificity, and efficiency) under most base-rate estimations, and most produced cutoff scores that showed adequate detection of symptom exaggeration, regardless of base-rate assumptions. These results support the use of MMPI-2 validity scales for PTSD evaluations in veteran populations, even under varying base rates of symptom exaggeration.

  12. Validity, responsiveness, minimal detectable change, and minimal clinically important change of the Pediatric Motor Activity Log in children with cerebral palsy.

    PubMed

    Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling

    2012-01-01

    This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and posttreatment, including the self-care, mobility, and cognition subscale, the total performance of the Functional Independence Measure in children (WeeFIM), and the grasping and visual-motor integration of the Peabody Developmental Motor Scales. Pearson correlation coefficients were calculated. Responsiveness was examined using the paired t test and the standardized response mean, the minimal detectable change was captured at the 90% confidence level, and the minimal clinically important change was estimated using anchor-based and distribution-based approaches. The PMAL-QOM showed fair concurrent validity at pretreatment and posttreatment and predictive validity, whereas the PMAL-AOU had fair concurrent validity at posttreatment only. The PMAL-AOU and PMAL-QOM were both markedly responsive to change after treatment. Improvement of at least 0.67 points on the PMAL-AOU and 0.66 points on the PMAL-QOM can be considered as a true change, not measurement error. A mean change has to exceed the range of 0.39-0.94 on the PMAL-AOU and the range of 0.38-0.74 on the PMAL-QOM to be regarded as clinically important change. Copyright © 2011 Elsevier Ltd. All rights reserved.
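
    The responsiveness and minimal detectable change statistics named in the record above can be sketched with the formulas commonly used for them: the standardized response mean (mean change divided by the standard deviation of change) and MDC at the 90% level (1.645 × √2 × SEM, with SEM = SD × √(1 − reliability)). The abstract does not state which exact formulas the authors applied, so this is an assumption, as are the function names and all numbers in the example.

```python
from math import sqrt
from statistics import mean, stdev


def standardized_response_mean(pre, post):
    """Mean of the paired change scores divided by their standard deviation."""
    changes = [after - before for before, after in zip(pre, post)]
    return mean(changes) / stdev(changes)


def minimal_detectable_change(sd_baseline, reliability, z=1.645):
    """MDC from baseline SD and a test-retest reliability coefficient.

    z=1.645 corresponds to the 90% confidence level; sqrt(2) accounts for
    measurement error being present in both the pre and post scores.
    """
    sem = sd_baseline * sqrt(1 - reliability)  # standard error of measurement
    return z * sqrt(2) * sem


# Invented scores for illustration only (not the PMAL data).
pre_scores = [1.0, 2.0, 3.0, 2.5]
post_scores = [2.0, 3.5, 3.0, 3.5]
print(round(standardized_response_mean(pre_scores, post_scores), 2))
print(round(minimal_detectable_change(sd_baseline=1.0, reliability=0.75), 2))
```

    A change smaller than the MDC cannot be distinguished from measurement error at the chosen confidence level, which is why the abstract treats its MDC values as the threshold for "true change".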

  13. Reliability and Validity of the Musculoskeletal Tumor Society Scoring System for the Upper Extremity in Japanese Patients.

    PubMed

    Uehara, Kosuke; Ogura, Koichi; Akiyama, Toru; Shinoda, Yusuke; Iwata, Shintaro; Kobayashi, Eisuke; Tanzawa, Yoshikazu; Yonemoto, Tsukasa; Kawano, Hirotaka; Kawai, Akira

    2017-09-01

    The Musculoskeletal Tumor Society (MSTS) scoring system developed in 1993 is a widely used disease-specific evaluation tool for assessment of physical function in patients with musculoskeletal tumors; however, only a few studies have confirmed its reliability and validity. The aim of this study was to validate the MSTS scoring system for the upper extremity (MSTS-UE) in Japanese patients with musculoskeletal tumors for use by others in research. Does the MSTS-UE have: (1) sufficient reliability and internal consistency; (2) adequate construct validity; and (3) reasonable criterion validity in comparison to the Toronto Extremity Salvage Score (TESS) or SF-36? Reliability was performed using test-retest analysis, and internal consistency was evaluated with Cronbach's alpha coefficient. Construct validity was evaluated using a scree plot to confirm the construct number and the Akaike information criterion network. Criterion validity was evaluated by comparing the MSTS-UE with the TESS and SF-36. The test-retest reliability with intraclass correlation coefficient (0.95; 95% CI, 0.91-0.97) was excellent, and internal consistency with Cronbach's α (0.7; 95% CI, 0.53-0.81) was acceptable. There were no ceiling and floor effects. The Akaike Information Criterion network showed that lifting ability, pain, and dexterity played central roles among the components. The MSTS-UE showed substantial correlation with the TESS scoring scale (r = 0.75; p < 0.001) and fair correlation with the SF-36 physical component summary (r = 0.37; p = 0.007). Although the MSTS-UE showed slight correlation with the SF-36 mental component summary, the emotional acceptance component of the MSTS-UE showed fair correlation (r = 0.29; p = 0.039). We can conclude that the MSTS is not an adequate measure of general health-related quality of life; however, this system was designed mainly to be a simple measure of function in a single extremity. To evaluate the mental state of patients with

  14. Random Qualitative Validation: A Mixed-Methods Approach to Survey Validation

    ERIC Educational Resources Information Center

    Van Duzer, Eric

    2012-01-01

    The purpose of this paper is to introduce the process and value of Random Qualitative Validation (RQV) in the development and interpretation of survey data. RQV is a method of gathering clarifying qualitative data that improves the validity of the quantitative analysis. This paper is concerned with validity in relation to the participants'…

  15. The development and validity of the Salford Gait Tool: an observation-based clinical gait assessment tool.

    PubMed

    Toro, Brigitte; Nester, Christopher J; Farren, Pauline C

    2007-03-01

    To develop the construct, content, and criterion validity of the Salford Gait Tool (SF-GT) and to evaluate agreement between gait observations using the SF-GT and kinematic gait data. Tool development and comparative evaluation. University in the United Kingdom. For designing construct and content validity, convenience samples of 10 children with hemiplegic, diplegic, and quadriplegic cerebral palsy (CP) and 152 physical therapy students and 4 physical therapists were recruited. For developing criterion validity, kinematic gait data of 13 gait clusters containing 56 children with hemiplegic, diplegic, and quadriplegic CP and 11 neurologically intact children was used. For clinical evaluation, a convenience sample of 23 pediatric physical therapists participated. We developed a sagittal plane observational gait assessment tool through a series of design, test, and redesign iterations. The tool's grading system was calibrated using kinematic gait data of 13 gait clusters and was evaluated by comparing the agreement of gait observations using the SF-GT with kinematic gait data. Criterion standard kinematic gait data. There was 58% mean agreement based on grading categories and 80% mean agreement based on degree estimations evaluated with the least significant difference method. The new SF-GT has good concurrent criterion validity.

  16. [Design and validation of a questionnaire for psychosocial nursing diagnosis in Primary Care].

    PubMed

    Brito-Brito, Pedro Ruymán; Rodríguez-Álvarez, Cristobalina; Sierra-López, Antonio; Rodríguez-Gómez, José Ángel; Aguirre-Jaime, Armando

    2012-01-01

    To develop a valid, reliable and easy-to-use questionnaire for a psychosocial nursing diagnosis. The study was performed in two phases: first phase, questionnaire design and construction; second phase, validity and reliability tests. A bank of items was constructed using the NANDA classification as a theoretical framework. Each item was assigned a Likert scale or dichotomous response. The combination of responses to the items constituted the diagnostic rules to assign up to 28 labels. A group of experts carried out the validity test for content. Other validated scales were used as reference standards for the criterion validity tests. Forty-five nurses provided the questionnaire to the patients on three separate occasions over a period of three weeks, and the other validated scales only once to 188 randomly selected patients in Primary Care centres in Tenerife (Spain). Validity tests for construct confirmed the six dimensions of the questionnaire with 91% of total variance explained. Validity tests for criterion showed a specificity of 66%-100%, and showed high correlations with the reference scales when the questionnaire was assigning nursing diagnoses. Reliability tests showed agreement of 56%-91% (P<.001), and a 93% internal consistency. The Questionnaire for Psychosocial Nursing Diagnosis was called CdePS, and included 61 items. The CdePS is a valid, reliable and easy-to-use tool in Primary Care centres to improve the assigning of a psychosocial nursing diagnosis. Copyright © 2011 Elsevier España, S.L. All rights reserved.

  17. Validating SPICES as a Screening Tool for Frailty Risks among Hospitalized Older Adults

    PubMed Central

    Aronow, Harriet Udin; Borenstein, Jeff; Haus, Flora; Braunstein, Glenn D.; Bolton, Linda Burnes

    2014-01-01

    Older patients are vulnerable to adverse hospital events related to frailty. SPICES, a common screening protocol to identify risk factors in older patients, alerts nurses to initiate care plans to reduce the probability of patient harm. However, there is little published validating the association between SPICES and measures of frailty and adverse outcomes. This paper used data from a prospective cohort study on frailty among 174 older adult inpatients to validate SPICES. Almost all patients met one or more SPICES criteria. The sum of SPICES was significantly correlated with age and other well-validated assessments for vulnerability, comorbid conditions, and depression. Individuals meeting two or more SPICES criteria had a risk of adverse hospital events three times greater than individuals with either no or one criterion. Results suggest that as a screening tool used within 24 hours of admission, SPICES is both valid and predictive of adverse events. PMID:24876954

  18. [Validation of the Scale of Hope in Terminal Illness for relatives brief version (SHTI-b). Validity and reliability analysis].

    PubMed

    Villacieros, M; Bermejo, J C; Hassoun, H

    2017-12-29

    Bermejo and Villacieros' Scale of Hope in Terminal Disease (SHTD) specifically collects meanings of hope facing terminal disease, including considerations relating to psycho-emotional support and that have a transcendental sense. The objective of this paper is to validate the SHTD abbreviated and rephrased to adapt all the items to a single domain. Starting from the published SHTD, an exploratory factor analysis (EFA) was carried out with a sample of 177 valid questionnaires. In a second study, with another sample of 180 valid questionnaires, a confirmatory factor analysis (CFA) and a correlation analysis with other measurements of spiritual wellbeing (Functional Assessment of Chronic Illness Therapy-Sp) and hope (Herth Hope Index) were done. A bidimensional model with satisfactory goodness of fit index values was obtained (GFI = 0.991; CFI = 0.984; SRMR = 0.08; RMSEA = 0.057); the Relations of Transcendence factor obtained a Cronbach's alpha of 0.872 and Personal Relations an alpha of 0.762. The correlations of the SHTI-rb with external measures were: r = 0.527 with FACIT; r = 0.266 with HHI; r = 0.667 with the Spirituality subscale of FACIT; and r = 0.348 with the Interrelation factor of HHI. The Relations of Transcendence subscale correlated with both Layout and Expectation and Interrelation of HHI (r = 0.162 and r = 0.329 respectively), while the scale of Personal Relations only correlated with Interrelation of HHI (r = 0.244). The Scale of Hope in Terminal Illness for relatives (brief version) is a valid and reliable specific instrument for terminal patients.

  19. Validity and reliability of the Brazilian version of the Work Ability Index questionnaire.

    PubMed

    Martinez, Maria Carmen; Latorre, Maria do Rosário Dias de Oliveira; Fischer, Frida Marina

    2009-06-01

    To evaluate the validity and reliability of the Portuguese language version of a work ability index. Cross sectional survey of a sample of 475 workers from an electrical company in the state of Sao Paulo, Southeastern Brazil (spread across ten municipalities in the Campinas area), carried out in 2005. The following aspects of the Brazilian version of the Work Ability Index were evaluated: construct validity, using factorial exploratory analysis, and discriminant capacity, by comparing mean Work Ability Index scores in two groups with different absenteeism levels; criterion validity, by determining the correlation between self-reported health and Work Ability Index score; and reliability, using Cronbach's alpha to determine the internal consistency of the questionnaire. Factorial analysis indicated three factors in the work ability construct: issues pertaining to 'mental resources' (20.6% of the variance), self-perceived work ability (18.9% of the variance), and presence of diseases and health-related limitations (18.4% of the variance). The index was capable of discriminating workers according to levels of absenteeism, identifying a significantly lower (p<0.0001) mean score among subjects with high absenteeism (37.2 points) when compared to those with low absenteeism (42.3 points). Criterion validity analysis showed a correlation between the index and all dimensions of health status analyzed (p<0.0001). Reliability of the index was high, with a Cronbach's alpha of 0.72. The Brazilian version of the Work Ability Index showed satisfactory psychometric properties with respect to construct validity, thus constituting an appropriate option for evaluating work ability in both individual and population-based settings.
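    The internal-consistency statistic reported above, Cronbach's alpha, can be sketched from its standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). This is a minimal illustration only: the function name and the toy data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Hypothetical helper for illustration; uses sample variances throughout.
    """
    k = len(responses[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across respondents.
    item_variances = [variance([row[j] for row in responses]) for j in range(k)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Toy data (invented): three respondents, three perfectly consistent items.
toy = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
alpha = cronbach_alpha(toy)
```

    With perfectly consistent items the statistic reaches its maximum of 1; real questionnaires, such as the one in this record (alpha = 0.72), fall below that.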

  20. Measuring personality functioning in older adults: construct validity of the Severity Indices of Personality Functioning - Short Form (SIPP-SF).

    PubMed

    Rossi, Gina; Debast, Inge; van Alphen, S P J

    2017-07-01

    The dimensional personality disorders model in the Diagnostic and Statistical Manual (DSM)-5 section III conceptually differentiates impaired personality functioning (criterion A) from the presence of pathological traits (criterion B). This study is the first to specifically address the measurement of criterion A in older adults. Moreover, the convergent/divergent validity of criterion A and criterion B will be compared in younger and older age groups. The Severity Indices of Personality Functioning - Short Form (SIPP-SF) was administered in older (N = 171) and younger adults (N = 210). The factorial structure was analyzed with exploratory structural equation modeling. Differences in convergent/divergent validity between personality functioning (SIPP-SF) and pathological traits (Personality Inventory for DSM-5; Dimensional Assessment of Personality Pathology-Basic Questionnaire) were examined across age groups. Identity Integration, Relational Capacities, Responsibility, Self-Control, and Social Concordance were corroborated as higher order domains. Although the SIPP-SF domains measured unique variation, some high correlations with pathological traits referred to overlapping constructs. Moreover, in older adults, personality functioning was more strongly related to Psychoticism, Disinhibition, Antagonism and Dissocial Behavior compared to younger adults. The SIPP-SF construct validity was demonstrated in terms of a structure of five higher order domains of personality functioning. The instrument is promising as a possible measure of impaired personality functioning in older adults. As such, it is a useful clinical tool to follow up effects of therapy on levels of personality functioning. Moreover, traits were associated with different degrees of personality functioning across age groups.

  1. Educational testing validity and reliability in pharmacy and medical education literature.

    PubMed

    Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J

    2013-12-16

    To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; p<0.001). While there were more scholarship of teaching and learning (SoTL) articles in pharmacy education compared to medical education, validity and reliability reporting was limited in the pharmacy education literature.

  2. The Reliability, Validity, and Evaluation of the Objective Structured Clinical Examination in Podiatry (Chiropody).

    ERIC Educational Resources Information Center

    Woodburn, Jim; Sutcliffe, Nick

    1996-01-01

    The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…

  3. Criterion-Referenced Testing in Foreign Language Teaching.

    ERIC Educational Resources Information Center

    Takala, Sauli

    A review of literature serves as the basis for a discussion of various aspects of criterion-referenced tests. The aspects discussed are: teaching and evaluation objectives, criterion- and norm-referenced measurement, stages in construction of criterion-referenced tests, construction and selection of items, test validity, and test reliability.…

  4. Criterion Validity Evidence for the easyCBM© CCSS Math Measures: Grades 6-8. Technical Report #1402

    ERIC Educational Resources Information Center

    Anderson, Daniel; Rowley, Brock; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    The easyCBM© CCSS Math tests were developed to help inform teachers' instructional decisions by providing relevant information on students' mathematical skills, relative to the Common Core State Standards (CCSS). This technical report describes a study to explore the validity of the easyCBM© CCSS Math tests by evaluating the relation between…

  5. English Placement Testing, Multiple Measures, and Disproportionate Impact: An Analysis of the Criterion- and Content-Related Validity Evidence for the Reading & Writing Placement Tests in the San Diego Community College District.

    ERIC Educational Resources Information Center

    Armstrong, William B.

    As part of an effort to statistically validate the placement tests used in California's San Diego Community College District (SDCCD) a study was undertaken to review the criteria- and content-related validity of the Assessment and Placement Services (APS) reading and writing tests. Evidence of criteria and content validity was gathered from…

  6. Validity of the Miller forensic assessment of symptoms test in psychiatric inpatients.

    PubMed

    Veazey, Connie H; Wagner, Alisha L; Hays, J Ray; Miller, Holly A

    2005-06-01

    This study investigated the validity of the Miller Forensic Assessment of Symptoms Test (M-FAST), a brief measure of malingering, in an inpatient psychiatric sample of 70. Among those patients who also completed the Personality Assessment Inventory (N=44), Total M-FAST score was related in the expected directions to the Personality Assessment Inventory validity scales and indexes, providing evidence for concurrent validity of the M-FAST. With the PAI malingering index used as a criterion, we examined the diagnostic efficiency of the M-FAST and found a cut score of 8 represented the best balance of sensitivity, specificity, positive predictive power, and negative predictive power. Based on this cut-score of 8, 16% of the population was classified as malingering. The M-FAST appears to be an excellent rapid screen for symptom exaggeration in this population and setting.
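    The four diagnostic-efficiency indices named above (sensitivity, specificity, positive predictive power, negative predictive power) all come from one 2x2 table of screen result against criterion. The sketch below assumes that a score at or above the cut counts as a positive screen; the abstract does not say whether the comparison is inclusive, and the function name and data are hypothetical.

```python
def diagnostic_efficiency(scores, criterion_positive, cut):
    """Return sensitivity, specificity, PPP and NPP for a given cut score.

    Assumption: score >= cut is treated as a positive screen.
    """
    tp = fp = tn = fn = 0
    for score, is_case in zip(scores, criterion_positive):
        flagged = score >= cut
        if flagged and is_case:
            tp += 1
        elif flagged and not is_case:
            fp += 1
        elif not flagged and is_case:
            fn += 1
        else:
            tn += 1

    def ratio(num, den):
        # Undefined when a margin of the 2x2 table is empty.
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "positive_predictive_power": ratio(tp, tp + fp),
        "negative_predictive_power": ratio(tn, tn + fn),
    }


# Invented example data, not from the study.
scores = [2, 9, 8, 3, 10, 7, 1, 12]
criterion = [False, True, True, False, False, True, False, True]
result = diagnostic_efficiency(scores, criterion, cut=8)
```

    Repeating the call over a range of cut values is one simple way to look for the cut that best balances the four indices.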

  7. Are validated outcome measures used in distal radial fractures truly valid?

    PubMed Central

    Nienhuis, R. W.; Bhandari, M.; Goslings, J. C.; Poolman, R. W.; Scholtes, V. A. B.

    2016-01-01

    Objectives Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the methodological quality of these studies. Our aim was to systematically review the methodological quality of clinimetric studies that evaluated measurement properties of PROMs used in patients with distal radial fractures, and to make recommendations for the selection of PROMs based on the level of evidence of each individual measurement property. Methods A systematic literature search was performed in PubMed, EMbase, CINAHL and PsycINFO databases to identify relevant clinimetric studies. Two reviewers independently assessed the methodological quality of the studies on measurement properties, using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Level of evidence (strong / moderate / limited / lacking) for each measurement property per PROM was determined by combining the methodological quality and the results of the different clinimetric studies. Results In all, 19 out of 1508 identified unique studies were included, in which 12 PROMs were rated. The Patient-rated wrist evaluation (PRWE) and the Disabilities of Arm, Shoulder and Hand questionnaire (DASH) were evaluated on most measurement properties. The evidence for the PRWE is moderate that its reliability, validity (content and hypothesis testing), and responsiveness are good. The evidence is limited that its internal consistency and cross-cultural validity are good, and its measurement error is acceptable. There is no evidence for its structural and criterion validity. The evidence for the DASH is moderate that its responsiveness is good. The evidence is limited that its reliability and the

  8. Parents' assessment of circadian preference in elementary school-aged children: Validity and relations to educational outcomes.

    PubMed

    Scherrer, Vsevolod; Roberts, Richard; Preckel, Franzis

    2016-01-01

    Meta-analyses suggest that morning-oriented students obtain better school grades than evening-oriented students. This finding has generally been found for students in high school using self-report data for the assessment of circadian preference. Two studies (N = 2718/192) investigated whether these findings generalize across samples (i.e. elementary school-aged students) and methods (i.e. parent reports). These studies also explored whether the relation between circadian preference and school achievement could be explained within an expectancy-value framework. To this end, the Lark-Owl Chronotype Indicator (LOCI) was modified to obtain parents' evaluations of their children's circadian preference, while students completed a battery of assessments designed to explore the test-criterion evidence. Structural equation modeling and correlational analyses revealed: (1) morning and evening orientation were two separable factors of children's circadian preference; (2) correlations with behavioral (e.g. sleep and eating times) and psychological (e.g. cognitive ability) data supported the test-criterion validity of both factors; (3) morning orientation was positively related to school achievement and (4) consistent with an expectancy-value framework this relation was mediated by children's academic self-concept (ASC). These findings have important research and policy implications for considering circadian preference in the schooling of elementary students.

  9. Social anxiety questionnaire (SAQ): Development and preliminary validation.

    PubMed

    Łakuta, Patryk

    2018-05-30

    The Social Anxiety Questionnaire (SAQ) was designed to assess five dimensions of social anxiety as posited by the Clark and Wells' (1995; Clark, 2001) cognitive model. The development of the SAQ involved generation of an item pool, followed by a verification of content validity and the theorized factor structure (Study 1). The final version of the SAQ was then assessed for reliability, temporal stability (test re-test reliability), and construct, criterion-related, and contrasted-group validity (Study 2, 3, and 4). Following a systematic process, the results provide support for the SAQ as a reliable and both theoretically and empirically valid measure. A five-factor structure of the SAQ, verified and replicated through confirmatory factor analyses, reflects five dimensions of social anxiety: negative self-processing; self-focused attention and self-monitoring; safety behaviours; somatic and cognitive symptoms; and anticipatory and post-event rumination. Results suggest that the SAQ possesses good psychometric properties, while recognizing that additional validation is a required future research direction. It is important to replicate these findings in diverse populations, including a large clinical sample. The SAQ is a promising measure that supports social anxiety as a multidimensional construct, and the foundational role of self-focused cognitive processes in generation and maintenance of social anxiety symptoms. The findings make a significant contribution to the literature; moreover, the SAQ is the first instrument that assesses all of the specific cognitive-affective, physiological, attitudinal, and attention processes related to social anxiety proposed by the Clark-Wells model. Copyright © 2018 Elsevier B.V. All rights reserved.

  10. Cross-cultural adaptation and validation of the Ankle Osteoarthritis Scale for use in French-speaking populations.

    PubMed

    Angers, Magalie; Svotelis, Amy; Balg, Frederic; Allard, Jean-Pascal

    2016-04-01

    The Ankle Osteoarthritis Scale (AOS) is a self-administered score specific for ankle osteoarthritis (OA) with excellent reliability and strong construct and criterion validity. Many recent randomized multicentre trials have used the AOS, and the involvement of the French-speaking population is limited by the absence of a French version. Our goal was to develop a French version and validate the psychometric properties to assure equivalence to the original English version. Translation was performed according to American Association of Orthopaedic Surgeons (AAOS) 2000 guidelines for cross-cultural adaptation. Similar to the validation process of the English AOS, we evaluated the psychometric properties of the French version (AOS-Fr): criterion validity (AOS-Fr v. Western Ontario and McMaster Universities Arthritis Index [WOMAC] and SF-36 scores), construct validity (AOS-Fr correlation to single heel-lift test), and reliability (AOS-Fr test-retest). Sixty healthy individuals tested a prefinal version of the AOS-Fr for comprehension, leading to modifications and a final version that was approved by C. Saltzman, author of the AOS. We then recruited patients with ankle OA for evaluation of the AOS-Fr psychometric properties. Twenty-eight patients with ankle OA participated in the evaluation. The AOS-Fr showed strong criterion validity (AOS:WOMAC r = 0.709 and AOS:SF-36 r = -0.654) and construct validity (r = 0.664) and proved to be reliable (test-retest intraclass correlation coefficient = 0.922). The AOS-Fr is a reliable and valid score equivalent to the English version in terms of psychometric properties, thus is available for use in multicentre trials.

  11. Validity and relative validity of a novel digital approach for 24-h dietary recall in athletes.

    PubMed

    Baker, Lindsay B; Heaton, Lisa E; Stein, Kimberly W; Nuccio, Ryan P; Jeukendrup, Asker E

    2014-04-30

    We developed a digital dietary analysis tool for athletes (DATA) using a modified 24-h recall method and an integrated, customized nutrient database. The purpose of this study was to assess DATA's validity and relative validity by measuring its agreement with registered dietitians' (RDs) direct observations (OBSERVATION) and 24-h dietary recall interviews using the USDA 5-step multiple-pass method (INTERVIEW), respectively. Fifty-six athletes (14-20 y) completed DATA and INTERVIEW in randomized counter-balanced order. OBSERVATION (n = 26) consisted of RDs recording participants' food/drink intake in a 24-h period and were completed the day prior to DATA and INTERVIEW. Agreement among methods was estimated using a repeated measures t-test and Bland-Altman analysis. The paired differences (with 95% confidence intervals) between DATA and OBSERVATION were not significant for carbohydrate (10.1%, -1.2-22.7%) and protein (14.1%, -3.2-34.5%) but were significant for energy (14.4%, 1.2-29.3%). There were no differences between DATA and INTERVIEW for energy (-1.1%, -9.1-7.7%), carbohydrate (0.2%, -7.1-8.0%) or protein (-2.7%, -11.3-6.7%). Bland-Altman analysis indicated significant positive correlations between absolute values of the differences and the means for OBSERVATION vs. DATA (r = 0.40 and r = 0.47 for energy and carbohydrate, respectively) and INTERVIEW vs. DATA (r = 0.52, r = 0.29, and r = 0.61 for energy, carbohydrate, and protein, respectively). There were also wide 95% limits of agreement (LOA) for most method comparisons. The mean bias ratio (with 95% LOA) for OBSERVATION vs. DATA was 0.874 (0.551-1.385) for energy, 0.906 (0.522-1.575) for carbohydrate, and 0.895 (0.395-2.031) for protein. The mean bias ratio (with 95% LOA) for INTERVIEW vs. DATA was 1.016 (0.538-1.919) for energy, 0.995 (0.563-1.757) for carbohydrate, and 1.031 (0.514-2.068) for protein. DATA has good relative validity for group-level comparisons in athletes. However, there are large variations
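    A mean bias ratio with 95% limits of agreement, as reported in this record, is commonly obtained by running the Bland-Altman calculation on log-transformed values and back-transforming the result. The abstract does not state that the authors did exactly this, so the sketch below is an assumption about the general technique, with a hypothetical function name and invented intake values.

```python
import math
from statistics import mean, stdev


def ratio_limits_of_agreement(method_a, method_b):
    """Bland-Altman bias and 95% limits of agreement on the ratio scale.

    Assumption: paired, strictly positive measurements; analysis is done on
    log(a / b) and back-transformed with exp().
    """
    log_ratios = [math.log(a / b) for a, b in zip(method_a, method_b)]
    bias = mean(log_ratios)
    spread = stdev(log_ratios)  # sample standard deviation
    lower = math.exp(bias - 1.96 * spread)
    upper = math.exp(bias + 1.96 * spread)
    return math.exp(bias), lower, upper


# Invented energy intakes (kcal) from two hypothetical methods.
observed = [1000.0, 2000.0, 4000.0]
recalled = [1000.0, 1000.0, 1000.0]
bias_ratio, loa_low, loa_high = ratio_limits_of_agreement(observed, recalled)
```

    Limits computed this way are symmetric around the bias on the log scale, so the bias ratio is the geometric mean of the lower and upper limits. A ratio of 1 means no systematic difference between the two methods.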

  12. Failure mode and effects analysis outputs: are they valid?

    PubMed Central

    2012-01-01

    Background Failure Mode and Effects Analysis (FMEA) is a prospective risk assessment tool that has been widely used within the aerospace and automotive industries and has been utilised within healthcare since the early 1990s. The aim of this study was to explore the validity of FMEA outputs within a hospital setting in the United Kingdom. Methods Two multidisciplinary teams each conducted an FMEA for the use of vancomycin and gentamicin. Four different validity tests were conducted: · Face validity: by comparing the FMEA participants’ mapped processes with observational work. · Content validity: by presenting the FMEA findings to other healthcare professionals. · Criterion validity: by comparing the FMEA findings with data reported on the trust’s incident report database. · Construct validity: by exploring the relevant mathematical theories involved in calculating the FMEA risk priority number. Results Face validity was positive as the researcher documented the same processes of care as mapped by the FMEA participants. However, other healthcare professionals identified potential failures missed by the FMEA teams. Furthermore, the FMEA groups failed to include failures related to omitted doses; yet these were the failures most commonly reported in the trust’s incident database. Calculating the RPN by multiplying severity, probability and detectability scores was deemed invalid because it is based on calculations that breach the mathematical properties of the scales used. Conclusion There are significant methodological challenges in validating FMEA. It is a useful tool to aid multidisciplinary groups in mapping and understanding a process of care; however, the results of our study cast doubt on its validity. FMEA teams are likely to need different sources of information, besides their personal experience and knowledge, to identify potential failures. As for FMEA’s methodology for scoring failures, there were discrepancies between the teams’ estimates

  13. Failure mode and effects analysis outputs: are they valid?

    PubMed

    Shebl, Nada Atef; Franklin, Bryony Dean; Barber, Nick

    2012-06-10

    Failure Mode and Effects Analysis (FMEA) is a prospective risk assessment tool that has been widely used within the aerospace and automotive industries and has been utilised within healthcare since the early 1990s. The aim of this study was to explore the validity of FMEA outputs within a hospital setting in the United Kingdom. Two multidisciplinary teams each conducted an FMEA for the use of vancomycin and gentamicin. Four different validity tests were conducted: Face validity: by comparing the FMEA participants' mapped processes with observational work. Content validity: by presenting the FMEA findings to other healthcare professionals. Criterion validity: by comparing the FMEA findings with data reported on the trust's incident report database. Construct validity: by exploring the relevant mathematical theories involved in calculating the FMEA risk priority number. Face validity was positive as the researcher documented the same processes of care as mapped by the FMEA participants. However, other healthcare professionals identified potential failures missed by the FMEA teams. Furthermore, the FMEA groups failed to include failures related to omitted doses; yet these were the failures most commonly reported in the trust's incident database. Calculating the RPN by multiplying severity, probability and detectability scores was deemed invalid because it is based on calculations that breach the mathematical properties of the scales used. There are significant methodological challenges in validating FMEA. It is a useful tool to aid multidisciplinary groups in mapping and understanding a process of care; however, the results of our study cast doubt on its validity. FMEA teams are likely to need different sources of information, besides their personal experience and knowledge, to identify potential failures. As for FMEA's methodology for scoring failures, there were discrepancies between the teams' estimates and similar incidents reported on the trust's incident
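    The risk priority number criticised in this record is the product of three ordinal scores. The sketch below illustrates one consequence of that multiplication: very different failure profiles can receive the same RPN, and many integers in the nominal range can never occur. The 1-10 scale is an assumption (the abstract does not give the scale used), and the function names are hypothetical.

```python
from itertools import product


def rpn(severity, probability, detectability):
    """Risk priority number as the plain product of three ordinal scores."""
    return severity * probability * detectability


def distinct_rpn_values(scale_max=10):
    """All RPN values reachable when each score runs from 1 to scale_max."""
    scale = range(1, scale_max + 1)
    return {rpn(s, p, d) for s, p, d in product(scale, repeat=3)}


# A severe-but-rare failure and a mild-but-frequent, hard-to-detect one
# receive the same number (scores are invented).
severe_rare = rpn(9, 1, 1)
mild_frequent = rpn(1, 3, 3)

reachable = distinct_rpn_values(10)
```

    Because the scores are ordinal ranks and not measured quantities, treating their product as a meaningful magnitude is the step the authors describe as breaching the mathematical properties of the scales.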

  14. Spanish version of the screening Örebro musculoskeletal pain questionnaire: a cross-cultural adaptation and validation.

    PubMed

    Cuesta-Vargas, Antonio Ignacio; González-Sánchez, Manuel

    2014-10-29

    Spanish is one of the five most spoken languages in the world. There is currently no published Spanish version of the Örebro Musculoskeletal Pain Questionnaire (OMPQ). The aim of the present study is to describe the process of translating the OMPQ into Spanish and to perform an analysis of reliability, internal structure, internal consistency and concurrent criterion-related validity. Translation and psychometric testing. Two independent translators translated the OMPQ into Spanish. From both translations a consensus version was achieved. A backward translation was made to verify and resolve any semantic or conceptual problems. A total of 104 patients (67 men/37 women) with a mean age of 53.48 (±11.63), suffering from chronic musculoskeletal disorders, twice completed a Spanish version of the OMPQ. Statistical analysis was performed to evaluate the reliability, the internal structure, internal consistency and concurrent criterion-related validity with reference to the gold standard questionnaire SF-12v2. All variables except "Coping" showed a rate above 0.85 on reliability. The internal structure calculation through exploratory factor analysis indicated that 75.2% of the variance can be explained with six components with an eigenvalue higher than 1 and 52.1% with only three components higher than 10% of variance explained. In the concurrent criterion-related validity, several significant correlations were seen close to 0.6, exceeding that value in the correlation between general health and total value of the OMPQ. The Spanish version of the screening questionnaire OMPQ can be used to identify Spanish patients with musculoskeletal pain at risk of developing a chronic disability.

  15. Validity and Bias of Academic Achievement Measures in the First Year of Elementary School

    ERIC Educational Resources Information Center

    Hammes, Patricia Simone; Bigras, Marc; Crepaldi, Maria Aparecida

    2016-01-01

    We tested the criterion-related validity and potential bias of two measures of pupils' academic achievement: the Teacher Rating Scale (TRS) and the Mathematics and Literacy Achievement Tests (MLTs). These measures are representative of assessment methods largely used in the elementary school. The aims were: (1) to verify the extent to which TRS…

  16. PTSD’s risky behavior criterion: Relation with DSM-5 PTSD symptom clusters and psychopathology

    PubMed Central

    Contractor, Ateka A.; Weiss, Nicole H.; Dranger, Paula; Ruggero, Camilo; Armour, Cherie

    2017-01-01

    A new symptom criterion of reckless and self-destructive behaviors (E2) was recently added to posttraumatic stress disorder’s (PTSD) diagnostic criteria in DSM-5, which is unsurprising given the well-established relation between PTSD and risky behaviors. Researchers have questioned the significance and incremental validity of this symptom criterion within PTSD’s symptomatology. Unprecedented to our knowledge, we aim to compare trauma-exposed groups differing on their endorsement status of the risky behavior symptom on several psychopathology constructs (PTSD, depression, distress tolerance, rumination, anger). The sample included 123 trauma-exposed participants seeking mental health treatment (M age=35.70; 68.30% female) who completed self-report questionnaires assessing PTSD symptoms, depression, rumination, distress tolerance, and anger. Results of independent samples t-tests indicated that participants who endorsed the E2 criterion at a clinically significant level reported significantly greater PTSD subscale severity; depression severity; rumination facets of repetitive thoughts, counterfactual thinking, and problem-focused thinking; and anger reactions; and significantly less absorption and regulation (distress tolerance facets) compared to participants who did not endorse the E2 criterion at a clinically significant level. Results indicate the utility of the E2 criterion in identifying trauma-exposed individuals with greater posttraumatic distress, and emphasize the importance of targeting such behaviors in treatment. PMID:28285248

  17. The reliability and validity of a sexual functioning questionnaire.

    PubMed

    Corty, E W; Althof, S E; Kurit, D M

    1996-01-01

    The present study assessed the reliability and validity of a measure of sexual functioning, the CMSH-SFQ, for male patients and their partners. The CMSH-SFQ measures erectile and orgasmic functioning, sexual drive, frequency of sexual behavior, and sexual satisfaction. Test-retest reliability was assessed with 19 males and 19 females for the baseline CMSH-SFQ. Criterion validity was measured by comparing the answers of 25 male patients to those of their partners at baseline and follow-up. The majority of items had acceptable levels of reliability and validity. The CMSH-SFQ provides a reliable and valid device that can be used to measure global sexual functioning in men and their partners and may be used to evaluate the efficacy of treatments for sexual dysfunctions. Limitations and suggestions for use of the CMSH-SFQ are addressed.

  18. PTSD's risky behavior criterion: Relation with DSM-5 PTSD symptom clusters and psychopathology.

    PubMed

    Contractor, Ateka A; Weiss, Nicole H; Dranger, Paula; Ruggero, Camilo; Armour, Cherie

    2017-06-01

    A new symptom criterion of reckless and self-destructive behaviors (E2) was recently added to posttraumatic stress disorder's (PTSD) diagnostic criteria in DSM-5, which is unsurprising given the well-established relation between PTSD and risky behaviors. Researchers have questioned the significance and incremental validity of this symptom criterion within PTSD's symptomatology. Unprecedented to our knowledge, we aim to compare trauma-exposed groups differing on their endorsement status of the risky behavior symptom on several psychopathology constructs (PTSD, depression, distress tolerance, rumination, anger). The sample included 123 trauma-exposed participants seeking mental health treatment (M age=35.70; 68.30% female) who completed self-report questionnaires assessing PTSD symptoms, depression, rumination, distress tolerance, and anger. Results of independent samples t-tests indicated that participants who endorsed the E2 criterion at a clinically significant level reported significantly greater PTSD subscale severity; depression severity; rumination facets of repetitive thoughts, counterfactual thinking, and problem-focused thinking; and anger reactions; and significantly less absorption and regulation (distress tolerance facets) compared to participants who did not endorse the E2 criterion at a clinically significant level. Results indicate the utility of the E2 criterion in identifying trauma-exposed individuals with greater posttraumatic distress, and emphasize the importance of targeting such behaviors in treatment. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.

  19. The development and validation of the Incivility from Customers Scale.

    PubMed

    Wilson, Nicole L; Holmvall, Camilla M

    2013-07-01

    Scant research has examined customers as sources of workplace incivility, despite evidence suggesting that mistreatment is more common from organizational outsiders, including customers, than from organizational members (Grandey, Kern, & Frone, 2007; Schat & Kelloway, 2005). As an important step in extending the literature on customer incivility, we conducted two studies to develop and validate a measure of this construct. Study 1 used focus groups of retail and restaurant employees (n = 30) to elicit a list of uncivil customer behaviors, based on which we wrote initial scale items. Study 2 used a correlational survey design (n = 439) to pare down the number of scale items to 10 and to garner reliability and validity evidence for the scale. Exploratory and confirmatory factor analyses show that the scale is unidimensional and distinguishable from measures of the related, but distinct, constructs of interpersonal justice and psychological aggression from customers. Reliability analyses show that the scale is internally consistent. Significant correlations between the scale and individuals' job satisfaction, turnover intentions, and general and job-specific psychological strain provide evidence of criterion-related validity. Hierarchical regression analyses show that the scale significantly predicts three of four organizational and personal strain outcomes over and above a workplace incivility measure adapted for customer incivility, providing some evidence of incremental validity. Limitations and future research directions are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  20. Validating the Multidimensional Measure of Cultural Identity Scales for Latinos Among Latina Mothers and Daughters

    PubMed Central

    Dillon, Frank R.; Félix-Ortiz, Maria; Rice, Christopher; De La Rosa, Mario; Rojas, Patria; Duan, Rui

    2009-01-01

    The psychometric properties of the Multidimensional Measure of Cultural Identity Scales for Latinos (MMCISL; Félix-Ortiz, Newcomb, & Myers, 1994) have never been examined in an adult Latina sample representing various levels of nativity and nationality. The rationale for the study was to confirm the factor structure and psychometric properties of the MMCISL with a predominantly immigrant sample of Latina mothers and daughters (n = 316). Adequate reliability estimates were found for 6 of the original 10 scales. Confirmatory factor analyses provided evidence of construct validity for the reliable scales. The Preferred Latino Affiliation scale was the only scale to meet strict measurement invariance criteria across mothers and daughters. Criterion validity was evidenced by relations between the Familiarity with Latino Culture scale and all criterion variables. Implications for acculturation and cultural identity research involving the MMCISL are discussed. PMID:19364206

  1. Development and Validation of the Family Beliefs Inventory: A Measure of Unrealistic Beliefs among Parents and Adolescents.

    ERIC Educational Resources Information Center

    Roehling, Patricia Vincent; Robin, Arthur L.

    1986-01-01

    Evaluated the criterion-related validity of the Family Beliefs Inventory, a new self-report measure of unreasonable beliefs regarding parent-adolescent relationships. Distressed fathers displayed more unreasonable beliefs concerning ruination, obedience, perfectionism, and malicious intent than nondistressed fathers. Distressed adolescents…

  2. The validation of a home food inventory.

    PubMed

    Fulkerson, Jayne A; Nelson, Melissa C; Lytle, Leslie; Moe, Stacey; Heitzler, Carrie; Pasch, Keryn E

    2008-11-04

    Home food inventories provide an efficient method for assessing home food availability; however, few are validated. The present study's aim was to develop and validate a home food inventory that is easily completed by research participants in their homes and includes a comprehensive range of both healthful and less healthful foods that are associated with obesity. A home food inventory (HFI) was developed and tested with two samples. Sample 1 included 51 adult participants and six trained research staff who independently completed the HFI in participants' homes. Sample 2 included 342 families in which parents completed the HFI and the Diet History Questionnaire (DHQ) and students completed three 24-hour dietary recall interviews. HFI items assessed 13 major food categories as well as two categories assessing ready-access to foods in the kitchen and the refrigerator. An obesogenic household food availability score was also created. To assess criterion validity, participants' and research staffs' assessment of home food availability were compared (staff = gold standard). Criterion validity was evaluated with kappa, sensitivity, and specificity. Construct validity was assessed with correlations of five HFI major food category scores with servings of the same foods and associated nutrients from the DHQ and dietary recalls. Kappa statistics for all 13 major food categories and the two ready-access categories ranged from 0.61 to 0.83, indicating substantial agreement. Sensitivity ranged from 0.69 to 0.89, and specificity ranged from 0.86 to 0.95. Spearman correlations between staff and participant major food category scores ranged from 0.71 to 0.97. Correlations between the HFI scores and food group servings and nutrients on the DHQ (parents) were all significant (p < .05) while about half of associations between the HFI and dietary recall interviews (adolescents) were significant (p < .05). The obesogenic home food availability score was significantly associated (p
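
    The record above evaluates criterion validity with kappa, sensitivity, and specificity against staff ratings as the gold standard. The sketch below shows how those three statistics can be derived from paired present/absent ratings. It is illustrative only: the function name and the ten example ratings are made up, and it assumes both categories occur in the gold-standard ratings.

```python
def agreement_stats(participant, staff):
    """Compare binary ratings (1 = item present) with a gold standard."""
    if len(participant) != len(staff) or not staff:
        raise ValueError("ratings must be non-empty and of equal length")
    pairs = list(zip(participant, staff))
    n = len(pairs)
    tp = sum(1 for p, s in pairs if p == 1 and s == 1)
    tn = sum(1 for p, s in pairs if p == 0 and s == 0)
    fp = sum(1 for p, s in pairs if p == 1 and s == 0)
    fn = sum(1 for p, s in pairs if p == 0 and s == 1)

    sensitivity = tp / (tp + fn)  # share of truly present items detected
    specificity = tn / (tn + fp)  # share of truly absent items rejected
    observed = (tp + tn) / n      # raw agreement
    # Agreement expected by chance from the marginal totals.
    expected = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / n**2
    kappa = (observed - expected) / (1 - expected)
    return {"kappa": kappa, "sensitivity": sensitivity, "specificity": specificity}


# Hypothetical ratings for ten inventory items.
staff_ratings = [1, 1, 1, 1, 0, 0, 0, 0, 1, 0]
participant_ratings = [1, 1, 1, 0, 0, 0, 0, 1, 1, 0]
stats = agreement_stats(participant_ratings, staff_ratings)
print(stats)
```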

  3. Reliability and validity of cervical position measurements in individuals with and without chronic neck pain.

    PubMed

    Dunleavy, Kim; Neil, Joseph; Tallon, Allison; Adamo, Diane E

    2015-09-01

    The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to: (1) establish criterion validity of CROM FHP and UCA compared to Optotrak data, (2) establish relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) compare UCA and FHP in patients with and without pain in habitual postures. (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and -1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements was good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups; UCA was significantly more extended in the pain group (P<0·05). Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome measure. Cervical range of motion UCA
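
    The record above reports minimal detectable change (MDC95) alongside reliability. A common way to obtain MDC95 is from the standard error of measurement, SEM = SD × sqrt(1 − ICC), with MDC95 = 1.96 × sqrt(2) × SEM. The sketch below applies that textbook formula; whether the authors used exactly this computation is an assumption, and the input values are hypothetical.

```python
from math import sqrt


def mdc95(sd, icc):
    """Minimal detectable change at 95% confidence.

    sd: between-subject standard deviation of the measurement.
    icc: reliability coefficient, expected in the range 0..1.
    """
    if not 0 <= icc <= 1:
        raise ValueError("icc must lie between 0 and 1")
    sem = sd * sqrt(1 - icc)     # standard error of measurement
    return 1.96 * sqrt(2) * sem  # sqrt(2): error is present in both measurements


# Hypothetical example: SD of 2.0 cm and ICC of 0.75.
print(round(mdc95(2.0, 0.75), 2))
```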

  4. Reliability and validity of cervical position measurements in individuals with and without chronic neck pain

    PubMed Central

    Neil, Joseph; Tallon, Allison; Adamo, Diane E.

    2015-01-01

    Objectives The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to: (1) establish criterion validity of CROM FHP and UCA compared to Optotrak data, (2) establish relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) compare UCA and FHP in patients with and without pain in habitual postures. Methods (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. Results (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and −1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements was good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups; UCA was significantly more extended in the pain group (P<0·05). Discussion Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome

  5. Validity of Factors of the Psychopathy Checklist–Revised in Female Prisoners

    PubMed Central

    Kennealy, Patrick J.; Hicks, Brian M.; Patrick, Christopher J.

    2008-01-01

    The validity of the Psychopathy Checklist–Revised (PCL-R) has been examined extensively in men, but its validity for women remains understudied. Specifically, the correlates of the general construct of psychopathy and its components as assessed by PCL-R total, factor, and facet scores have yet to be examined in depth. Based on previous research conducted with male offenders, a large female inmate sample was used to examine the patterns of relations between total, factor, and facet scores on the PCL-R and various criterion variables. These variables include ratings of psychopathy based on Cleckley’s criteria, symptoms of antisocial personality disorder, and measures of substance use and abuse, criminal behavior, institutional misconduct, interpersonal aggression, normal range personality, intellectual functioning, and social background variables. Results were highly consistent with past findings in male samples and provide further evidence for the construct validity of the PCL-R two-factor and four-facet models across genders. PMID:17986651

  6. Validation of the Dutch Eating Behaviour Questionnaire (DEBQ) among Maltese women.

    PubMed

    Dutton, Elaine; Dovey, Terence M

    2016-12-01

    The main aim of this study was to assess the dimensional structure of the Maltese version of the Dutch Eating Behaviour Questionnaire (DEBQ) and evaluate the instrument's validity and reliability among Maltese women (N = 586). Exploratory factor analysis reflected the theoretical structure of three factors; emotional, restrained and external eating which was supported by a Confirmatory Factor analysis. Minor issues with specific items in the Emotional and External eating scale were identified and discussed. Criterion-related validity was ascertained through correlations with the EAT-26. The study also assessed the DEBQ's predictive value in differentiating between BMI groups and between dieters and weight maintainers. The results suggest that the Maltese DEBQ is a psychometrically valid and reliable instrument for assessing eating behaviours with women in the Maltese community. The study also highlights the critical role of Emotional and Restrained eating in dieting and overweight Maltese women. Copyright © 2016 Elsevier Ltd. All rights reserved.

  7. Quantifying Human Movement Using the Movn Smartphone App: Validation and Field Study

    PubMed Central

    2017-01-01

    Background The use of embedded smartphone sensors offers opportunities to measure physical activity (PA) and human movement. Big data—which includes billions of digital traces—offers scientists a new lens to examine PA in fine-grained detail and allows us to track people’s geocoded movement patterns to determine their interaction with the environment. Objective The objective of this study was to examine the validity of the Movn smartphone app (Moving Analytics) for collecting PA and human movement data. Methods The criterion and convergent validity of the Movn smartphone app for estimating energy expenditure (EE) were assessed in both laboratory and free-living settings, compared with indirect calorimetry (criterion reference) and a stand-alone accelerometer that is commonly used in PA research (GT1m, ActiGraph Corp, convergent reference). A supporting cross-validation study assessed the consistency of activity data when collected across different smartphone devices. Global positioning system (GPS) and accelerometer data were integrated with geographical information software to demonstrate the feasibility of geospatial analysis of human movement. Results A total of 21 participants contributed to linear regression analysis to estimate EE from Movn activity counts (standard error of estimation [SEE]=1.94 kcal/min). The equation was cross-validated in an independent sample (N=42, SEE=1.10 kcal/min). During laboratory-based treadmill exercise, EE from Movn was comparable to calorimetry (bias=0.36 [−0.07 to 0.78] kcal/min, t82=1.66, P=.10) but overestimated as compared with the ActiGraph accelerometer (bias=0.93 [0.58-1.29] kcal/min, t89=5.27, P<.001). The absolute magnitude of criterion biases increased as a function of locomotive speed (F1,4=7.54, P<.001) but was relatively consistent for the convergent comparison (F1,4=1.26, P<.29). Furthermore, 95% limits of agreement were consistent for criterion and convergent biases, and EE from Movn was strongly

  8. Simulated Driving Assessment (SDA) for Teen Drivers: Results from a Validation Study

    PubMed Central

    McDonald, Catherine C.; Kandadai, Venk; Loeb, Helen; Seacrist, Thomas S.; Lee, Yi-Ching; Winston, Zachary; Winston, Flaura K.

    2015-01-01

    Background Driver error and inadequate skill are common critical reasons for novice teen driver crashes, yet few validated, standardized assessments of teen driving skills exist. The purpose of this study was to evaluate the construct and criterion validity of a newly developed Simulated Driving Assessment (SDA) for novice teen drivers. Methods The SDA's 35-minute simulated drive incorporates 22 variations of the most common teen driver crash configurations. Driving performance was compared for 21 inexperienced teens (age 16–17 years, provisional license ≤90 days) and 17 experienced adults (age 25–50 years, license ≥5 years, drove ≥100 miles per week, no collisions or moving violations ≤3 years). SDA driving performance (Error Score) was based on driving safety measures derived from simulator and eye-tracking data. Negative driving outcomes included simulated collisions or run-off-the-road incidents. A professional driving evaluator/instructor reviewed videos of SDA performance (DEI Score). Results The SDA demonstrated construct validity: 1.) Teens had a higher Error Score than adults (30 vs. 13, p=0.02); 2.) For each additional error committed, the relative risk of a participant's propensity for a simulated negative driving outcome increased by 8% (95% CI: 1.05–1.10, p<0.01). The SDA demonstrated criterion validity: Error Score was correlated with DEI Score (r=−0.66, p<0.001). Conclusions This study supports the concept of validated simulated driving tests like the SDA to assess novice driver skill in complex and hazardous driving scenarios. The SDA, as a standard protocol to evaluate teen driver performance, has the potential to facilitate screening and assessment of teen driving readiness and could be used to guide targeted skill training. PMID:25740939

  9. Assessment of Fearless Dominance and Impulsive Antisociality via Normal Personality Measures: Convergent Validity, Criterion Validity, and Developmental Change

    PubMed Central

    Witt, Edward A.; Donnellan, M. Brent; Blonigen, Daniel M.; Krueger, Robert F.; Conger, Rand D.

    2009-01-01

    This report provides evidence for the reliability, validity, and developmental course of the psychopathic personality traits of Fearless Dominance (FD) and Impulsive Antisociality (IA) as assessed by items from Multidimensional Personality Questionnaire (MPQ; Patrick, Curtin, & Tellegen, 2002). In Study 1, MPQ-based measures of FD and IA were strongly correlated with their corresponding composite scores from the Psychopathic Personality Inventory-Revised (Lilienfeld & Widows, 2005). In Study 2, FD and IA had relatively distinct associations with measures of normal and maladaptive personality traits. In Study 3, FD and IA had substantial retest coefficients during the transition to adulthood and both traits showed average declines with an especially substantial drop in IA. In Study 4, FD and IA were correlated with measures of internalizing and externalizing problems in ways consistent with previous research and theory. Collectively, these results provide important information about the assessment of FD and IA. PMID:19365767

  10. Reliability, construct and criterion validity of the KIDSCREEN-10 score: a short measure for children and adolescents’ well-being and health-related quality of life

    PubMed Central

    Erhart, Michael; Rajmil, Luis; Herdman, Michael; Auquier, Pascal; Bruil, Jeanet; Power, Mick; Duer, Wolfgang; Abel, Thomas; Czemy, Ladislav; Mazur, Joanna; Czimbalmos, Agnes; Tountas, Yannis; Hagquist, Curt; Kilroe, Jean

    2010-01-01

    Background To assess the criterion and construct validity of the KIDSCREEN-10 well-being and health-related quality of life (HRQoL) score, a short version of the KIDSCREEN-52 and KIDSCREEN-27 instruments. Methods The child self-report and parent report versions of the KIDSCREEN-10 were tested in a sample of 22,830 European children and adolescents aged 8–18 and their parents (n = 16,237). Correlation with the KIDSCREEN-52 and associations with other generic HRQoL measures, physical and mental health, and socioeconomic status were examined. Score differences by age, gender, and country were investigated. Results Correlations between the 10-item KIDSCREEN score and KIDSCREEN-52 scales ranged from r = 0.24 to 0.72 (r = 0.27–0.72) for the self-report version (proxy-report version). Coefficients below r = 0.5 were observed for the KIDSCREEN-52 dimensions Financial Resources and Being Bullied only. Cronbach alpha was 0.82 (0.78), test–retest reliability was ICC = 0.70 (0.67) for the self- (proxy-)report version. Correlations between other children self-completed HRQoL questionnaires and KIDSCREEN-10 ranged from r = 0.43 to r = 0.63 for the KIDSCREEN children self-report and r = 0.22–0.40 for the KIDSCREEN parent proxy report. Known group differences in HRQoL between physically/mentally healthy and ill children were observed in the KIDSCREEN-10 self and proxy scores. Associations with self-reported psychosomatic complaints were r = −0.52 (−0.36) for the KIDSCREEN-10 self-report (proxy-report). Statistically significant differences in KIDSCREEN-10 self and proxy scores were found by socioeconomic status, age, and gender. Conclusions Our results indicate that the KIDSCREEN-10 provides a valid measure of a general HRQoL factor in children and adolescents, but the instrument does not represent well most of the single dimensions of the original KIDSCREEN-52. Test–retest reliability was slightly below a priori defined thresholds. PMID:20668950
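
    The record above reports internal consistency as Cronbach's alpha. As a rough illustration of what that coefficient measures, the sketch below computes alpha = k/(k − 1) × (1 − Σ item variances / variance of total scores) for a small response matrix. The data are invented and the function name is hypothetical; it is not the KIDSCREEN scoring procedure.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for rows of respondents and columns of items."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    items = list(zip(*scores))  # transpose to one tuple per item
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: four respondents, three items.
responses = [
    [1, 2, 1],
    [2, 2, 3],
    [3, 4, 3],
    [4, 3, 5],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```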

  11. Publishing nutrition research: validity, reliability, and diagnostic test assessment in nutrition-related research.

    PubMed

    Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara

    2010-03-01

    This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.

  12. Development and Validation of a Multidimensional Measure of Family Supportive Supervisor Behaviors (FSSB)

    PubMed Central

    Hammer, Leslie B.; Kossek, Ellen Ernst; Yragui, Nanette L.; Bodner, Todd E.; Hanson, Ginger C.

    2011-01-01

    Due to growing work-family demands, supervisors need to effectively exhibit family supportive supervisor behaviors (FSSB). Drawing on social support theory and using data from two samples of lower wage workers, the authors develop and validate a measure of FSSB, defined as behaviors exhibited by supervisors that are supportive of families. FSSB is conceptualized as a multidimensional superordinate construct with four subordinate dimensions: emotional support, instrumental support, role modeling behaviors, and creative work-family management. Results from multilevel confirmatory factor analyses and multilevel regression analyses provide evidence of construct, criterion-related, and incremental validity. The authors found FSSB to be significantly related to work-family conflict, work-family positive spillover, job satisfaction, and turnover intentions over and above measures of general supervisor support. PMID:21660254

  13. [Internal consistency and criterion validity and reliability of the Mexican Version of the Child Behavior Checklist 1.5-5 (CBCL/1.5-5)].

    PubMed

    Albores-Gallo, Lilia; Hernández-Guzmán, Laura; Hasfura-Buenaga, Cecilia; Navarro-Luna, Enrique

    To investigate the validity and internal consistency of the Mexican version of the CBCL/1.5-5 that assesses the most common psychopathology in pre-school children in clinical and epidemiological settings. A total of 438 parents from two groups, clinical-psychiatric (N= 62) and community (N= 376) completed the CBCL/1.5-5/Mexican version. The internal consistency was high for total problems α=0.95, and internalized α=0.89 and externalized α=0.91 subscales. The test-retest reliability (one week) using the intraclass correlation coefficient (ICC) was ≥ 0.95 for the internalized, externalized, and total problems subscales. The ROC curve for the criterion status of clinically-referred vs. non-referred using the total problems scale ≥ 24 resulted in an AUC (area under curve) of 0.77, a specificity of 0.73, and a sensitivity of 0.70. The CBCL/1.5-5/Mexican version is a reliable and valid tool. Copyright © 2016 Sociedad Chilena de Pediatría. Publicado por Elsevier España, S.L.U. All rights reserved.
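
    The record above summarizes criterion validity with an ROC analysis: an AUC plus sensitivity and specificity at a cutoff score. The sketch below shows one way these quantities can be computed from two groups of scores. The AUC is taken as the probability that a randomly chosen referred case outscores a randomly chosen non-referred case (ties count one half). The scores are invented and the function names are hypothetical.

```python
def auc(referred, non_referred):
    """Area under the ROC curve via pairwise comparison of scores."""
    wins = 0.0
    for r in referred:
        for c in non_referred:
            if r > c:
                wins += 1.0
            elif r == c:
                wins += 0.5  # ties contribute half a win
    return wins / (len(referred) * len(non_referred))


def sens_spec_at_cutoff(referred, non_referred, cutoff):
    """Sensitivity and specificity when scores >= cutoff are flagged."""
    sensitivity = sum(1 for s in referred if s >= cutoff) / len(referred)
    specificity = sum(1 for s in non_referred if s < cutoff) / len(non_referred)
    return sensitivity, specificity


# Hypothetical total-problem scores for the two groups.
clinic = [30, 26, 22, 40]
community = [10, 24, 18, 22, 5]
print(auc(clinic, community))
print(sens_spec_at_cutoff(clinic, community, cutoff=24))
```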

  14. Validity and relative validity of a novel digital approach for 24-h dietary recall in athletes

    PubMed Central

    2014-01-01

    Background We developed a digital dietary analysis tool for athletes (DATA) using a modified 24-h recall method and an integrated, customized nutrient database. The purpose of this study was to assess DATA’s validity and relative validity by measuring its agreement with registered dietitians’ (RDs) direct observations (OBSERVATION) and 24-h dietary recall interviews using the USDA 5-step multiple-pass method (INTERVIEW), respectively. Methods Fifty-six athletes (14–20 y) completed DATA and INTERVIEW in randomized counter-balanced order. OBSERVATION (n = 26) consisted of RDs recording participants’ food/drink intake in a 24-h period and were completed the day prior to DATA and INTERVIEW. Agreement among methods was estimated using a repeated measures t-test and Bland-Altman analysis. Results The paired differences (with 95% confidence intervals) between DATA and OBSERVATION were not significant for carbohydrate (10.1%, -1.2–22.7%) and protein (14.1%, -3.2–34.5%) but were significant for energy (14.4%, 1.2–29.3%). There were no differences between DATA and INTERVIEW for energy (-1.1%, -9.1–7.7%), carbohydrate (0.2%, -7.1–8.0%) or protein (-2.7%, -11.3–6.7%). Bland-Altman analysis indicated significant positive correlations between absolute values of the differences and the means for OBSERVATION vs. DATA (r = 0.40 and r = 0.47 for energy and carbohydrate, respectively) and INTERVIEW vs. DATA (r = 0.52, r = 0.29, and r = 0.61 for energy, carbohydrate, and protein, respectively). There were also wide 95% limits of agreement (LOA) for most method comparisons. The mean bias ratio (with 95% LOA) for OBSERVATION vs. DATA was 0.874 (0.551-1.385) for energy, 0.906 (0.522-1.575) for carbohydrate, and 0.895 (0.395-2.031) for protein. The mean bias ratio (with 95% LOA) for INTERVIEW vs. DATA was 1.016 (0.538-1.919) for energy, 0.995 (0.563-1.757) for carbohydrate, and 1.031 (0.514-2.068) for protein. Conclusion DATA has good relative
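
    The record above assesses agreement between methods with Bland-Altman analysis. The sketch below shows the basic difference-based form: the mean of the paired differences (bias) and the 95% limits of agreement, bias ± 1.96 × SD of the differences. Note that the record reports bias as a ratio, which suggests a ratio or log-scale variant; the simpler additive form here is a stand-in, and the intake values are invented.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need at least two paired measurements")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)  # half-width of the 95% limits of agreement
    return bias, bias - spread, bias + spread


# Hypothetical paired intake estimates from two methods.
tool = [10, 12, 14, 16]
observed = [9, 12, 13, 14]
bias, lower, upper = bland_altman(tool, observed)
print(bias, round(lower, 2), round(upper, 2))
```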

  15. Systematic review of the concurrent and predictive validity of MRI biomarkers in OA

    PubMed Central

    Hunter, D.J.; Zhang, W.; Conaghan, Philip G.; Hirko, K.; Menashe, L.; Li, L.; Reichmann, W.M.; Losina, E.

    2012-01-01

    SUMMARY Objective To summarize literature on the concurrent and predictive validity of MRI-based measures of osteoarthritis (OA) structural change. Methods An online literature search was conducted of the OVID, EMBASE, CINAHL, PsychInfo and Cochrane databases of articles published up to the time of the search, April 2009. 1338 abstracts obtained with this search were preliminarily screened for relevance by two reviewers. Of these, 243 were selected for data extraction for this analysis on validity as well as separate reviews on discriminant validity and diagnostic performance. Of these, 142 manuscripts included data pertinent to concurrent validity and 61 manuscripts for the predictive validity review. For this analysis we extracted data on criterion (concurrent and predictive) validity from both longitudinal and cross-sectional studies for all synovial joint tissues as it relates to MRI measurement in OA. Results Concurrent validity of MRI in OA has been examined compared to symptoms, radiography, histology/pathology, arthroscopy, CT, and alignment. The relation of bone marrow lesions, synovitis and effusion to pain was moderate to strong. There was a weak or no relation of cartilage morphology or meniscal tears to pain. The relation of cartilage morphology to radiographic OA and radiographic joint space was inconsistent. There was a higher frequency of meniscal tears, synovitis and other features in persons with radiographic OA. The relation of cartilage to other constructs including histology and arthroscopy was stronger. Predictive validity of MRI in OA has been examined for ability to predict total knee replacement (TKR), change in symptoms, radiographic progression as well as MRI progression. Quantitative cartilage volume change and presence of cartilage defects or bone marrow lesions are potential predictors of TKR. Conclusion MRI has inherent strengths and unique advantages in its ability to visualize multiple individual tissue pathologies relating to pain

  16. Creation of a computer self-efficacy measure: analysis of internal consistency, psychometric properties, and validity.

    PubMed

    Howard, Matt C

    2014-10-01

    Computer self-efficacy is an often studied construct that has been shown to be related to an array of important individual outcomes. Unfortunately, existing measures of computer self-efficacy suffer from several deficiencies, including criterion contamination, outdated wording, and/or inadequate psychometric properties. For this reason, the current article presents the creation of a new computer self-efficacy measure. In Study 1, an over-representative item list is created and subsequently reduced through exploratory factor analysis to create an initial measure, and the discriminant validity of this initial measure is tested. In Study 2, the unidimensional factor structure of the initial measure is supported through confirmatory factor analysis and further reduced into a final, 12-item measure. In Study 3, the convergent and criterion validity of the 12-item measure is tested. Overall, this three study process demonstrates that the new computer self-efficacy measure has superb psychometric properties and internal reliability, and demonstrates excellent evidence for several aspects of validity. It is hoped that the 12-item computer self-efficacy measure will be utilized in future research on computer self-efficacy, which is discussed in the current article.

  17. Reliability and validity of the Adolescent Stress Questionnaire in a sample of European adolescents - the HELENA study

    PubMed Central

    2011-01-01

    Background Since stress is hypothesized to play a role in the etiology of obesity during adolescence, research on associations between adolescent stress and obesity-related parameters and behaviours is essential. Due to lack of a well-established recent stress checklist for use in European adolescents, the study investigated the reliability and validity of the Adolescent Stress Questionnaire (ASQ) for assessing perceived stress in European adolescents. Methods The ASQ was translated into the languages of the participating cities (Ghent, Stockholm, Vienna, Zaragoza, Pecs and Athens) and was implemented within the HELENA cross-sectional study. A total of 1140 European adolescents provided a valid ASQ, comprising 10 component scales, used for internal reliability (Cronbach α) and construct validity (confirmatory factor analysis or CFA). Contributions of socio-demographic (gender, age, pubertal stage, socio-economic status) characteristics to the ASQ score variances were investigated. Two-hundred adolescents also provided valid saliva samples for cortisol analysis to compare with the ASQ scores (criterion validity). Test-retest reliability was investigated using two ASQ assessments from 37 adolescents. Results Cronbach α-values of the ASQ scales (0.57 to 0.88) demonstrated a moderate internal reliability of the ASQ, and intraclass correlation coefficients (0.45 to 0.84) established an insufficient test-retest reliability of the ASQ. The adolescents' gender (girls had higher stress scores than boys) and pubertal stage (those in a post-pubertal development had higher stress scores than others) significantly contributed to the variance in ASQ scores, while their age and socio-economic status did not. CFA results showed that the original scale construct fitted moderately with the data in our European adolescent population. Only in boys, four out of 10 ASQ scale scores were a significant positive predictor for baseline wake-up salivary cortisol, suggesting a rather poor

  18. Development and validation of the simulation-based learning evaluation scale.

    PubMed

    Hung, Chang-Chiao; Liu, Hsiu-Chen; Lin, Chun-Chih; Lee, Bih-O

    2016-05-01

    The instruments that evaluate a student's perception of receiving simulated training are English versions and have not been tested for reliability or validity. The aim of this study was to develop and validate a Chinese version Simulation-Based Learning Evaluation Scale (SBLES). Four stages were conducted to develop and validate the SBLES. First, specific desired competencies were identified according to the National League for Nursing and Taiwan Nursing Accreditation Council core competencies. Next, the initial item pool was comprised of 50 items related to simulation that were drawn from the literature of core competencies. Content validity was established by use of an expert panel. Finally, exploratory factor analysis and confirmatory factor analysis were conducted for construct validity, and Cronbach's coefficient alpha determined the scale's internal consistency reliability. Two hundred and fifty students who had experienced simulation-based learning were invited to participate in this study. Two hundred and twenty-five students completed and returned questionnaires (response rate=90%). Six items were deleted from the initial item pool and one was added after an expert panel review. Exploratory factor analysis with varimax rotation revealed 37 items remaining in five factors which accounted for 67% of the variance. The construct validity of SBLES was substantiated in a confirmatory factor analysis that revealed a good fit of the hypothesized factor structure. The findings tally with the criterion of convergent and discriminant validity. The range of internal consistency for five subscales was .90 to .93. Items were rated on a 5-point scale from 1 (strongly disagree) to 5 (strongly agree). The results of this study indicate that the SBLES is valid and reliable. The authors recommend that the scale could be applied in the nursing school to evaluate the effectiveness of simulation-based learning curricula. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Development and validation of a cerebral oximeter capable of absolute accuracy.

    PubMed

    MacLeod, David B; Ikeda, Keita; Vacchiano, Charles; Lobbestael, Aaron; Wahr, Joyce A; Shaw, Andrew D

    2012-12-01

    Cerebral oximetry may be a valuable monitor, but few validation data are available, and most report the change from baseline rather than absolute accuracy, which may be affected by individuals whose oximetric values are outside the expected range. The authors sought to develop and validate a cerebral oximeter capable of absolute accuracy. An in vivo research study. A university human physiology laboratory. Healthy human volunteers were enrolled in calibration and validation studies of 2 cerebral oximetric sensors, the Nonin 8000CA and 8004CA. The 8000CA validation study identified 5 individuals with atypical cerebral oxygenation values; their data were used to design the 8004CA sensor, which subsequently underwent calibration and validation. Volunteers were taken through a stepwise hypoxia protocol to a minimum saturation of peripheral oxygen. Arteriovenous saturation (70% jugular bulb venous saturation and 30% arterial saturation) at 6 hypoxic plateaus was used as the reference value for the cerebral oximeter. Absolute accuracy was defined using a combination of the bias and precision of the paired saturations (A(RMS)). In the validation study for the 8000CA sensor (n = 9, 106 plateaus), relative accuracy was an A(RMS) of 2.7, with an absolute accuracy of 8.1, meeting the criteria for a relative (trend) monitor, but not an absolute monitor. In the validation study for the 8004CA sensor (n = 11, 119 plateaus), the A(RMS) of the 8004CA was 4.1, meeting the prespecified success criterion of <5.0. The Nonin cerebral oximeter using the 8004CA sensor can provide absolute data on regional cerebral saturation compared with arteriovenous saturation, even in subjects previously shown to have values outside the normal population distribution curves. Copyright © 2012 Elsevier Inc. All rights reserved.
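
    The abstract defines absolute accuracy as a combination of bias and precision, A(RMS), without giving the formula. A common definition, assumed here, is the root mean square of the paired differences between device and reference readings; it equals sqrt(bias^2 + SD^2) when the SD of the differences is computed with divisor n. The function name and readings below are hypothetical.

```python
import math


def accuracy_rms(device, reference):
    """Return (bias, sd, rms) of paired differences device - reference.

    Assumed definition of A(RMS): root mean square of the differences.
    sd uses divisor n so that rms**2 == bias**2 + sd**2 holds exactly.
    """
    if len(device) != len(reference) or not device:
        raise ValueError("need two equal-length, non-empty sequences")
    diffs = [d - r for d, r in zip(device, reference)]
    n = len(diffs)
    bias = sum(diffs) / n
    sd = math.sqrt(sum((x - bias) ** 2 for x in diffs) / n)
    rms = math.sqrt(sum(x * x for x in diffs) / n)
    return bias, sd, rms


# Made-up saturation readings (%), not data from the study.
bias, sd, rms = accuracy_rms([72, 65, 58, 70], [70, 66, 55, 68])
```

    Here the differences are 2, -1, 3 and 2, giving a bias of 1.5, an SD of 1.5 and an RMS error of about 2.12, which would fall under a success criterion of <5.0.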

  20. The development and preliminary validation of the behavior, environment, and changeability survey (BECS).

    PubMed

    Walsh, Jennifer R; Hebert, Angel; Byrd-Bredbenner, Carol; Carey, Gale; Colby, Sarah; Brown-Esters, Onikia N; Greene, Geoffrey; Hoerr, Sharon; Horacek, Tanya; Kattelmann, Kendra; Kidd, Tandalayo; Koenings, Mallory; Phillips, Beatrice; Shelnutt, Karla P; White, Adrienne A

    2012-01-01

    To develop and test the validity of the Behavior, Environment, and Changeability Survey (BECS) for identifying the importance and changeability of nutrition, exercise, and stress management behavior and related aspects of the environment. A cross-sectional, online survey of the BECS and selected validated instruments. Ten state universities. A convenience sample of college students (n = 1,283), ages 18-24 years. Principal component analysis was used to confirm a 6-component structure of the BECS in 2 independent samples for the purpose of cross-validation. Internal consistency was measured and construct and criterion-related analyses were conducted to test the reliability and validity of the BECS subscales. Six components representing 34 BECS items were revealed from the original 69 items and explained 64% of the total variance. Six scales were retained, and internal consistency of each ranged from α = .82 to .93. BECS Nutrition Behavior and Nutrition Changeability scale scores were highest for participants in action/maintenance Stages of Change for fruit and vegetable intake. There is strong support for the use of the BECS when planning health programs to gain insight into behavior that young adults are willing to improve, specifically related to nutrition, exercise, and sleep. Copyright © 2012 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.

  1. Nursing Intensive-Care Satisfaction Scale [NICSS]: Development and validation of a patient-centred instrument.

    PubMed

    Romero-García, Marta; de la Cueva-Ariza, Laura; Benito-Aracil, Llucia; Lluch-Canut, Teresa; Trujols-Albet, Joan; Martínez-Momblan, Maria Antonia; Juvé-Udina, Maria-Eulàlia; Delgado-Hito, Pilar

    2018-06-01

    The aim of this study was to develop and validate the Nursing Intensive-Care Satisfaction Scale to measure satisfaction with nursing care from the critical care patient's perspective. Instruments that measure satisfaction with nursing care have been designed and validated without taking the patient's perspective into consideration. Despite the benefits and advances in measuring satisfaction with nursing care, no instrument is specifically designed to assess satisfaction in intensive care units. Instrument development. The population comprised all patients discharged (January 2013 - January 2015) from three Intensive Care Units of a third level hospital (N = 200). All assessment instruments were given to discharged patients and, to analyse temporal stability, only the questionnaire was given again 48 hours later. The validation process of the scale included the analysis of internal consistency and temporal stability; construct validity through a confirmatory factor analysis; and criterion validity. Reliability was 0.95. The intraclass correlation coefficient for the total scale was 0.83, indicating good temporal stability. Construct validity showed an acceptable fit and a factorial structure with four factors, in accordance with the theoretical model, with the Consequences factor being the best correlated with the other factors. Criterion validity showed correlations ranging from low to high (range: 0.42-0.68). The scale has been designed and validated incorporating the perspective of critical care patients. Thanks to its reliability and validity, this questionnaire can be used both in research and in clinical practice. The scale offers a possibility to assess and develop interventions to improve patient satisfaction with nursing care. © 2018 John Wiley & Sons Ltd.

  2. The Reliability and Validity of the Self-Reported Drinking Measures in the Army’s Health Risk Appraisal Survey

    PubMed Central

    Bell, Nicole S.; Williams, Jeffrey O.; Senier, Laura; Strowman, Shelley R.; Amoroso, Paul J.

    2007-01-01

    Background The reliability and validity of self-reported drinking behaviors from the Army Health Risk Appraisal (HRA) survey are unknown. Methods We compared demographics and health experiences of those who completed the HRA with those who did not (1991–1998). We also evaluated the reliability and validity of eight HRA alcohol-related items, including the CAGE, weekly drinking quantity, and drinking and driving measures. We used Cohen’s κ and Pearson’s r to assess reliability and convergent validity. To assess criterion (predictive) validity, we used proportional hazards and logistical regression models predicting alcohol-related hospitalizations and alcohol-related separations from the Army, respectively. Results A total of 404,966 soldiers completed an HRA. No particular demographic group seems to be over- or underrepresented. Although few respondents skipped alcohol items, those who did tended to be older and of minority race. The alcohol items demonstrate a reasonable degree of reliability, with Cronbach’s α = 0.69 and test-retest reliability associations in the 0.75–0.80 range for most items over 2- to 30-day interims between surveys. The alcohol measures showed good criterion-related validity: those consuming more than 21 drinks per week were at 6 times the risk for subsequent alcohol-related hospitalization versus those who abstained from drinking (hazard ratio, 6.36; 95% confidence interval=5.79, 6.99). Those who said their friends worried about their drinking were almost 5 times more likely to be discharged due to alcoholism (risk ratio, 4.9; 95% confidence interval=4.00, 6.04) and 6 times more likely to experience an alcohol-related hospitalization (hazard ratio, 6.24; 95% confidence interval=5.74, 6.77). Conclusions The Army’s HRA alcohol items seem to elicit reliable and valid responses. Because HRAs contain identifiers, alcohol use can be linked with subsequent health and occupational outcomes, making the HRA a useful epidemiological

  3. Factor Structure, Reliability and Criterion Validity of the Autism-Spectrum Quotient (AQ): A Study in Dutch Population and Patient Groups

    PubMed Central

    Bartels, Meike; Cath, Danielle C.; Boomsma, Dorret I.

    2008-01-01

    The factor structure of the Dutch translation of the Autism-Spectrum Quotient (AQ; a continuous, quantitative measure of autistic traits) was evaluated with confirmatory factor analyses in a large general population and student sample. The criterion validity of the AQ was examined in three matched patient groups (autism spectrum conditions (ASC), social anxiety disorder, and obsessive–compulsive disorder). A two factor model, consisting of a “Social interaction” factor and “Attention to detail” factor could be identified. The internal consistency and test–retest reliability of the AQ were satisfactory. High total AQ and factor scores were specific to ASC patients. Men scored higher than women and science students higher than non-science students. The Dutch translation of the AQ is a reliable instrument to assess autism spectrum conditions. PMID:18302013

  4. The Predictive Validity of the Minnesota Reading Assessment for Students in Postsecondary Vocational Education Programs.

    ERIC Educational Resources Information Center

    Brown, James M.; Chang, Gerald

    1982-01-01

    The predictive validity of the Minnesota Reading Assessment (MRA) when used to project potential performance of postsecondary vocational-technical education students was examined. Findings confirmed the MRA to be a valid predictor, although the error in prediction varied between the criterion variables. (Author/GK)

  5. Psychometric validation of a condom self-efficacy scale in Korean.

    PubMed

    Cha, EunSeok; Kim, Kevin H; Burke, Lora E

    2008-01-01

    When an instrument is translated for use in cross-cultural research, it needs to account for cultural factors without distorting the psychometric properties of the instrument. To validate the psychometric properties of the condom self-efficacy scale (CSE) originally developed for American adolescents and young adults after translating the scale to Korean (CSE-K) to determine its suitability for cross-cultural research among Korean college students. A cross-sectional, correlational design was used with an exploratory survey methodology through self-report questionnaires. A convenience sample of 351 students, aged 18 to 25 years, were recruited at a university in Seoul, Korea. The participants completed the CSE-K and the intention of condom use scales after they were translated from English to Korean using a combined translation technique. A demographic and sex history questionnaire, which included an item to assess actual condom usage, was also administered. Mean, variance, reliability, criterion validity, and factorial validity using confirmatory factor analysis were assessed in the CSE-K. Norms for the CSE-K were similar, but not identical, to norms for the English version. The means of all three subscales were lower for the CSE-K than for the original CSE; however, the obtained variance in CSE-K was roughly similar with the original CSE. The Cronbach's alpha coefficient for the total scale was higher for the CSE-K (.91) than that for either the CSE (.85) or CSE in Thai (.85). Criterion validity and construct validity of the CSE-K were confirmed. The CSE-K was a reliable and valid scale in measuring condom self-efficacy among Korean college students. The findings suggest that the CSE was an appropriate instrument to conduct cross-cultural research on sexual behavior in adolescents and young adults.

  6. Validation of the SCOFF questionnaire for screening of eating disorders among Mexican university students.

    PubMed

    Sanchez-Armass, Omar; Raffaelli, Marcela; Andrade, Flavia Cristina Drumond; Wiley, Angela R; Noyola, Aida Nacielli Morales; Arguelles, Alejandra Cepeda; Aradillas-Garcia, Celia

    2017-03-01

    To evaluate the criterion validity and diagnostic utility of the SCOFF, a brief eating disorder (ED) screening instrument, in a Mexican sample. The study was conducted in two phases in 2012. Phase I involved the administration of self-report measures [the SCOFF and the Eating Disorder Inventory-2 (EDI-2)] to 1057 students aged 17-56 years (M age = 21.0, SD = 3.4; 67% female) from three colleges at the Universidad Autónoma de San Luis Potosí, Mexico. In Phase II, a random subsample of these students (n = 104) participated in the eating disorder examination, a structured interview that yields ED diagnoses. Analyses were conducted to evaluate the SCOFF's criterion validity by examining (a) correlations between scores on the SCOFF and the EDI-2 and (b) the SCOFF's ability to differentiate diagnosed ED cases and non-cases. EDI-2 subscales showed high correlations with the SCOFF scores, providing initial evidence of criterion validity. A score of two points on the SCOFF optimized the sensitivity (78%) and specificity (84%). With this cutoff, the SCOFF correctly classified over half the cases (PPV = 58%) and screened out the majority of non-cases (NPV = 93%), providing further evidence of criterion validity. Analyses were repeated separately for men and women, yielding gender-specific information on the SCOFF's performance. Taken as a whole, results indicated that the SCOFF can be a useful tool for identifying Mexican university students who are at risk of eating disorders.
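
    Sensitivity, specificity, PPV and NPV all come from the same 2x2 table of screening result against diagnosis. The sketch below shows the arithmetic. The cell counts are hypothetical: the abstract reports only percentages, and these counts were chosen solely so that the outputs land near the reported values for a sample of 104. The function name is likewise an assumption.

```python
def screening_metrics(tp, fp, fn, tn):
    """Diagnostic accuracy measures from the four cells of a 2x2 table.

    tp/fn: diagnosed cases that screened positive/negative.
    fp/tn: non-cases that screened positive/negative.
    """
    return {
        "sensitivity": tp / (tp + fn),  # share of cases flagged
        "specificity": tn / (tn + fp),  # share of non-cases cleared
        "ppv": tp / (tp + fp),          # share of positives that are cases
        "npv": tn / (tn + fn),          # share of negatives that are non-cases
    }


# Hypothetical counts (NOT from the study), summing to 104.
metrics = screening_metrics(tp=18, fp=13, fn=5, tn=68)
rounded = {name: round(value, 2) for name, value in metrics.items()}
```

    The example illustrates why PPV can be modest even when sensitivity and specificity are fairly high: with relatively few true cases in the sample, false positives make up a sizeable share of all positive screens.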

  7. Assessing the environmental characteristics of cycling routes to school: a study on the reliability and validity of a Google Street View-based audit.

    PubMed

    Vanwolleghem, Griet; Van Dyck, Delfien; Ducheyne, Fabian; De Bourdeaudhuij, Ilse; Cardon, Greet

    2014-06-10

    Google Street View provides a valuable and efficient alternative to observe the physical environment compared to on-site fieldwork. However, studies on the use, reliability and validity of Google Street View in a cycling-to-school context are lacking. We aimed to study the intra-, inter-rater reliability and criterion validity of EGA-Cycling (Environmental Google Street View Based Audit - Cycling to school), a newly developed audit using Google Street View to assess the physical environment along cycling routes to school. Parents (n = 52) of 11-to-12-year old Flemish children, who mostly cycled to school, completed a questionnaire and identified their child's cycling route to school on a street map. Fifty cycling routes of 11-to-12-year olds were identified and physical environmental characteristics along the identified routes were rated with EGA-Cycling (5 subscales; 37 items), based on Google Street View. To assess reliability, two researchers performed the audit. Criterion validity of the audit was examined by comparing the ratings based on Google Street View with ratings through on-site assessments. Intra-rater reliability was high (kappa range 0.47-1.00). Large variations in the inter-rater reliability (kappa range -0.03-1.00) and criterion validity scores (kappa range -0.06-1.00) were reported, with acceptable inter-rater reliability values for 43% of all items and acceptable criterion validity for 54% of all items. EGA-Cycling can be used to assess physical environmental characteristics along cycling routes to school. However, to assess the micro-environment specifically related to cycling, on-site assessments have to be added.
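
    The kappa values above measure agreement between two sets of categorical ratings after correcting for chance, which is why they can fall below zero. A minimal sketch of Cohen's kappa for two raters follows; the function name and the toy ratings are illustrative assumptions, not audit data.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of categorical ratings."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two equal-length, non-empty sequences")
    n = len(ratings_a)
    # Observed proportion of items on which the two ratings agree.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    p_expected = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    if p_expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_observed - p_expected) / (1 - p_expected)


# Toy example: on-site vs. Street View rating of one item on four routes.
kappa_partial = cohen_kappa(["y", "y", "n", "n"], ["y", "n", "n", "n"])
# Complete disagreement yields a negative kappa.
kappa_negative = cohen_kappa(["y", "n"], ["n", "y"])
```

    In the first toy case the raters agree on 3 of 4 routes (0.75) against a chance expectation of 0.50, so kappa is 0.5; in the second, agreement is worse than chance and kappa is -1.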

  8. Assessing Sleep Disturbance in Low Back Pain: The Validity of Portable Instruments

    PubMed Central

    Alsaadi, Saad M.; McAuley, James H.; Hush, Julia M.; Bartlett, Delwyn J.; McKeough, Zoe M.; Grunstein, Ronald R.; Dungan, George C.; Maher, Chris G.

    2014-01-01

    Although portable instruments have been used in the assessment of sleep disturbance for patients with low back pain (LBP), the accuracy of the instruments in detecting sleep/wake episodes for this population is unknown. This study investigated the criterion validity of two portable instruments (Armband and Actiwatch) for assessing sleep disturbance in patients with LBP. 50 patients with LBP performed simultaneous overnight sleep recordings in a university sleep laboratory. All 50 participants were assessed by Polysomnography (PSG) and the Armband and a subgroup of 33 participants wore an Actiwatch. Criterion validity was determined by calculating epoch-by-epoch agreement, sensitivity, specificity and prevalence- and bias-adjusted kappa (PABAK) for sleep versus wake between each instrument and PSG. The relationship between PSG and the two instruments was assessed using intraclass correlation coefficients (ICC 2, 1). The study participants showed symptoms of sub-threshold insomnia (mean ISI = 13.2, 95% CI = 6.36) and poor sleep quality (mean PSQI = 9.20, 95% CI = 4.27). Observed agreement with PSG was 85% and 88% for the Armband and Actiwatch. Sensitivity was 0.90 for both instruments and specificity was 0.54 and 0.67 and PABAK of 0.69 and 0.77 for the Armband and Actiwatch respectively. The ICC (95%CI) was 0.76 (0.61 to 0.86) and 0.80 (0.46 to 0.92) for total sleep time, 0.52 (0.29 to 0.70) and 0.55 (0.14 to 0.77) for sleep efficiency, 0.64 (0.45 to 0.78) and 0.52 (0.23 to 0.73) for wake after sleep onset and 0.13 (−0.15 to 0.39) and 0.33 (−0.05 to 0.63) for sleep onset latency, for the Armband and Actiwatch, respectively. The findings showed that both instruments have varied criterion validity across the sleep parameters from excellent validity for measures of total sleep time, good validity for measures of sleep efficiency and wake after onset to poor validity for sleep onset latency. PMID:24763506
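
    The prevalence- and bias-adjusted kappa depends only on the observed agreement and the number of categories: PABAK = (k * Po - 1) / (k - 1), which reduces to 2 * Po - 1 for a two-category sleep/wake classification. A small sketch, with an assumed function name, applied to the agreement percentages quoted above:

```python
def pabak(observed_agreement, categories=2):
    """Prevalence- and bias-adjusted kappa from the observed agreement Po."""
    if not 0.0 <= observed_agreement <= 1.0:
        raise ValueError("observed agreement must be a proportion in [0, 1]")
    if categories < 2:
        raise ValueError("need at least two categories")
    return (categories * observed_agreement - 1) / (categories - 1)


# Observed agreement with PSG as quoted in the abstract (rounded percentages).
pabak_armband = pabak(0.85)
pabak_actiwatch = pabak(0.88)
```

    The rounded inputs give roughly 0.70 and 0.76, close to the reported 0.69 and 0.77; the small gap is presumably due to the agreement percentages having been rounded before publication.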

  9. Assessing sleep disturbance in low back pain: the validity of portable instruments.

    PubMed

    Alsaadi, Saad M; McAuley, James H; Hush, Julia M; Bartlett, Delwyn J; McKeough, Zoe M; Grunstein, Ronald R; Dungan, George C; Maher, Chris G

    2014-01-01

    Although portable instruments have been used in the assessment of sleep disturbance for patients with low back pain (LBP), the accuracy of the instruments in detecting sleep/wake episodes for this population is unknown. This study investigated the criterion validity of two portable instruments (Armband and Actiwatch) for assessing sleep disturbance in patients with LBP. 50 patients with LBP performed simultaneous overnight sleep recordings in a university sleep laboratory. All 50 participants were assessed by Polysomnography (PSG) and the Armband and a subgroup of 33 participants wore an Actiwatch. Criterion validity was determined by calculating epoch-by-epoch agreement, sensitivity, specificity and prevalence- and bias-adjusted kappa (PABAK) for sleep versus wake between each instrument and PSG. The relationship between PSG and the two instruments was assessed using intraclass correlation coefficients (ICC 2, 1). The study participants showed symptoms of sub-threshold insomnia (mean ISI = 13.2, 95% CI = 6.36) and poor sleep quality (mean PSQI = 9.20, 95% CI = 4.27). Observed agreement with PSG was 85% and 88% for the Armband and Actiwatch. Sensitivity was 0.90 for both instruments and specificity was 0.54 and 0.67 and PABAK of 0.69 and 0.77 for the Armband and Actiwatch respectively. The ICC (95%CI) was 0.76 (0.61 to 0.86) and 0.80 (0.46 to 0.92) for total sleep time, 0.52 (0.29 to 0.70) and 0.55 (0.14 to 0.77) for sleep efficiency, 0.64 (0.45 to 0.78) and 0.52 (0.23 to 0.73) for wake after sleep onset and 0.13 (-0.15 to 0.39) and 0.33 (-0.05 to 0.63) for sleep onset latency, for the Armband and Actiwatch, respectively. The findings showed that both instruments have varied criterion validity across the sleep parameters from excellent validity for measures of total sleep time, good validity for measures of sleep efficiency and wake after onset to poor validity for sleep onset latency.

  10. Validity Arguments for Diagnostic Assessment Using Automated Writing Evaluation

    ERIC Educational Resources Information Center

    Chapelle, Carol A.; Cotos, Elena; Lee, Jooyoung

    2015-01-01

    Two examples demonstrate an argument-based approach to validation of diagnostic assessment using automated writing evaluation (AWE). "Criterion"®, was developed by Educational Testing Service to analyze students' papers grammatically, providing sentence-level error feedback. An interpretive argument was developed for its use as part of…

  11. German validation of the Conners Adult ADHD Rating Scales (CAARS) II: reliability, validity, diagnostic sensitivity and specificity.

    PubMed

    Christiansen, H; Kis, B; Hirsch, O; Matthies, S; Hebebrand, J; Uekermann, J; Abdel-Hamid, M; Kraemer, M; Wiltfang, J; Graf, E; Colla, M; Sobanski, E; Alm, B; Rösler, M; Jacob, C; Jans, T; Huss, M; Schimmelmann, B G; Philipsen, A

    2012-07-01

    The German version of the Conners Adult ADHD Rating Scales (CAARS) has proven to show very high model fit in confirmatory factor analyses with the established factors inattention/memory problems, hyperactivity/restlessness, impulsivity/emotional lability, and problems with self-concept in both large healthy control and ADHD patient samples. This study now presents data on the psychometric properties of the German CAARS-self-report (CAARS-S) and observer-report (CAARS-O) questionnaires. CAARS-S/O and questions on sociodemographic variables were filled out by 466 patients with ADHD, 847 healthy control subjects that already participated in two prior studies, and a total of 896 observer data sets were available. Cronbach's alpha was calculated to obtain internal reliability coefficients. Pearson correlations were performed to assess test-retest reliability, and concurrent, criterion, and discriminant validity. Receiver Operating Characteristics (ROC-analyses) were used to establish sensitivity and specificity for all subscales. Coefficient alphas ranged from .74 to .95, and test-retest reliability from .85 to .92 for the CAARS-S, and from .65 to .85 for the CAARS-O. All CAARS subscales, except problems with self-concept, correlated significantly with the Barratt Impulsiveness Scale (BIS), but not with the Wender Utah Rating Scale (WURS). Criterion validity was established with ADHD subtype and diagnosis based on DSM-IV criteria. Sensitivity and specificity were high for all four subscales. The reported results confirm our previous study and show that the German CAARS-S/O do indeed represent a reliable and cross-culturally valid measure of current ADHD symptoms in adults. Copyright © 2011 Elsevier Masson SAS. All rights reserved.

  12. Validation of the M. D. Anderson Symptom Inventory multiple myeloma module

    PubMed Central

    2013-01-01

    Background The symptom burden associated with multiple myeloma (MM) is often severe. Presently, no instrument comprehensively assesses disease-related and treatment-related symptoms in patients with MM. We sought to validate a module of the M. D. Anderson Symptom Inventory (MDASI) developed specifically for patients with MM (MDASI-MM). Methods The MDASI-MM was developed with clinician input, cognitive debriefing, and literature review, and administered to 132 patients undergoing induction chemotherapy or stem cell transplantation. We demonstrated the MDASI-MM’s reliability (Cronbach α values); criterion validity (item and subscale correlations between the MDASI-MM and the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ-C30) and the EORTC MM module (QLQ-MY20)), and construct validity (differences between groups by performance status). Ratings from transplant patients were examined to demonstrate the MDASI-MM’s sensitivity in detecting the acute worsening of symptoms post-transplantation. Results The MDASI-MM demonstrated excellent correlations with subscales of the 2 EORTC instruments, strong ability to distinguish clinically different patient groups, high sensitivity in detecting change in patients’ performance status, and high reliability. Cognitive debriefing confirmed that the MDASI-MM encompasses the breadth of symptoms relevant to patients with MM. Conclusion The MDASI-MM is a valid, reliable, comprehensive-yet-concise tool that is recommended as a uniform symptom assessment instrument for patients with MM. PMID:23384030

  13. Validation of a German Version of the Ethical Leadership at Work Questionnaire by Kalshoven et al. (2011)

    PubMed Central

    Steinmann, Barbara; Nübold, Annika; Maier, Günter W.

    2016-01-01

    The present study evaluates the psychometric properties of a German version of the Ethical Leadership at Work questionnaire (ELW-D), and further embeds the construct of ethical leadership within its nomological network. Confirmatory factor analyses (CFAs) based on the total sample of N = 363 employees support the assumed seven-factor structure of the German translation. Within a sub-sample of N = 133, the ELW-D shows positive correlations with related leadership behaviors (transformational leadership, contingent reward, and servant leadership), and negative correlations with destructive ones (passive leadership, autocratic leadership, and abusive supervision), approving convergent validity of the scale. Comparisons of correlated correlation coefficients reveal restrictions of its discriminant validity. In support of the criterion-related validity (N = 100), the ELW-D relates to work-related attitudes (e.g., job satisfaction, satisfaction with the leader, trust in the leader) and follower behaviors (e.g., extra effort, organizational citizenship behavior) in the way expected. Besides, ELW-D-dimensions show incremental validity over and above the Ethical Leadership Scale, emphasizing the added value of this questionnaire. PMID:27065916

  14. Validation of a German Version of the Ethical Leadership at Work Questionnaire by Kalshoven et al. (2011).

    PubMed

    Steinmann, Barbara; Nübold, Annika; Maier, Günter W

    2016-01-01

    The present study evaluates the psychometric properties of a German version of the Ethical Leadership at Work questionnaire (ELW-D), and further embeds the construct of ethical leadership within its nomological network. Confirmatory factor analyses (CFAs) based on the total sample of N = 363 employees support the assumed seven-factor structure of the German translation. Within a sub-sample of N = 133, the ELW-D shows positive correlations with related leadership behaviors (transformational leadership, contingent reward, and servant leadership), and negative correlations with destructive ones (passive leadership, autocratic leadership, and abusive supervision), approving convergent validity of the scale. Comparisons of correlated correlation coefficients reveal restrictions of its discriminant validity. In support of the criterion-related validity (N = 100), the ELW-D relates to work-related attitudes (e.g., job satisfaction, satisfaction with the leader, trust in the leader) and follower behaviors (e.g., extra effort, organizational citizenship behavior) in the way expected. Besides, ELW-D-dimensions show incremental validity over and above the Ethical Leadership Scale, emphasizing the added value of this questionnaire.

  15. Transcultural Adaptation and Validation of the German Version of the Vocal Tract Discomfort Scale.

    PubMed

    Lukaschyk, Julia; Brockmann-Bauser, Meike; Beushausen, Ulla

    2017-03-01

    Currently, there is no standardized German questionnaire to assess vocal tract discomfort in voice patients. The aim of this study was to evaluate the internal consistency, reliability, and validity of the German version of the Vocal Tract Discomfort (VTD) Scale. This is a cross-sectional study. First, a cross-cultural translation and adaptation from English to German was performed. One hundred seven patients between the ages of 18 and 76 with voice disorders were divided into two different diagnosis-related groups (organic and functional voice disorder) and 50 vocally healthy adults were included. All participants completed the VTD Scale and the Voice Handicap Index (VHI). The internal consistency of the VTD Scale was analyzed through Cronbach's α coefficient. Pearson correlation between the VTD Scale and VHI total scores was used to determine criterion validity. The VTD Scale score differences related to diagnosis groups were assessed with analysis of variance. Excellent internal consistency was found (α = 0.919, P < 0.05), and criterion validity was confirmed by a high correlation between the total VTD Scale and VHI (r = 0.674). There was a significant difference between the diagnosis groups' total VTD Scale score (F[4.135] = 15.114, P = 0.000). Furthermore, the vocally healthy adults had significantly lower values than the two diagnosis groups (x̄: 11.48, s = 8.340). The German version of the VTD Scale has an excellent internal consistency and reliability, and shows high clinical validity. Thus, it is a useful instrument in voice diagnostics. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  16. [Preventable drug-related morbidity: determining valid indicators for primary care in Portugal].

    PubMed

    Guerreiro, Mara Pereira; Cantrill, Judith A; Martins, Ana Paula

    2007-01-01

    Preventable drug-related morbidity (PDRM) indicators are operational measures of therapeutic risk management. These clinical indicators, which cover a wide range of drugs, combine process and outcome in the same instrument. They were developed in the US and have been validated for primary care settings in the US, UK and Canada. This study is part of a research programme; it aimed to determine a valid set of PDRM indicators for adult patients in primary care in Portugal. Face validity of 61 US- and UK-derived indicators translated to Portuguese was preliminarily determined by means of a postal questionnaire using a purposive sample of four Portuguese pharmacists with different backgrounds. Preliminary content validity of indicators approved in the previous stage was determined by cross-checking each definition of PDRM with standard drug information sources in Portugal. Face and content validity of indicators yielded by preliminary work were then established by a 37-member expert panel (20 community pharmacists and 17 general practitioners) using a two-round Delphi survey. Data were analysed using SPSS release 11.5. Nineteen indicators were ruled out in preliminary validation. Changes were made in the content of eight of the remaining 42 indicators; these were related to differences in the drugs being marketed and patterns of drug monitoring between countries. Thirty-five indicators were consensus approved as PDRM for adult patients in Portuguese primary care by the Delphi panel.

  17. Statistical methodology: II. Reliability and validity assessment in study design, Part B.

    PubMed

    Karras, D J

    1997-02-01

    Validity measures the correspondence between a test and other purported measures of the same or similar qualities. When a reference standard exists, a criterion-based validity coefficient can be calculated. If no such standard is available, the concepts of content and construct validity may be used, but quantitative analysis may not be possible. The Pearson and Spearman tests of correlation are often used to assess the correspondence between tests, but do not account for measurement biases and may yield misleading results. Techniques that measure intertest differences may be more meaningful in validity assessment, and the kappa statistic is useful for analyzing categorical variables. Questionnaires often can be designed to allow quantitative assessment of reliability and validity, although this may be difficult. Inclusion of homogeneous questions is necessary to assess reliability. Analysis is enhanced by using Likert scales or similar techniques that yield ordinal data. Validity assessment of questionnaires requires careful definition of the scope of the test and comparison with previously validated tools.
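
    The point that correlation ignores measurement bias can be shown with a few lines of arithmetic. In the sketch below, a hypothetical new test reads exactly 5 units higher than the reference on every subject: Pearson's r is a perfect 1.0, yet the mean difference between the tests exposes the systematic offset. The data and function names are made up for illustration.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def mean_difference(reference, new):
    """Average of new - reference: a simple estimate of systematic bias."""
    return sum(b - a for a, b in zip(reference, new)) / len(reference)


# Made-up measurements: the new test is biased upward by a constant 5 units.
reference_scores = [10, 12, 15, 20, 22]
new_scores = [value + 5 for value in reference_scores]

r = pearson_r(reference_scores, new_scores)
offset = mean_difference(reference_scores, new_scores)
```

    A validity assessment that reported only r would call the two tests interchangeable, while the difference-based summary shows they are not.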

  18. The Development and Validation of a Life Experience Inventory for the Identification of Creative Electrical Engineers.

    ERIC Educational Resources Information Center

    Michael, William B.; Colson, Kenneth R.

    1979-01-01

    The construction and validation of the Life Experience Inventory (LEI) for the identification of creative electrical engineers are described. Using the number of patents held or pending as a criterion measure, the LEI was found to have high concurrent validity. (JKS)

  19. Validation of the preschool and primary school form of a questionnaire assessing parents' childrearing behavior.

    PubMed

    Meunier, Jean-Christophe; Roskam, Isabelle

    2009-01-01

    This study presents a validation of a scale that assesses parents' childrearing behavior toward young children. The scale was validated on 565 parents of 2- to 7-year-old children. The current results replicated the factor solution of the original scale designed for parents of school-aged children. The scale demonstrated good psychometric properties: moderate to high internal consistency, the expected relations with criterion variables (parental self-efficacy beliefs, child's behavior and personality), and discriminative properties according to the parents' gender and educational level, the child's age and gender, and the difference between referred and nonreferred children.

  20. A newer and broader definition of burnout: validation of the "Burnout Clinical Subtype Questionnaire (BCSQ-36)".

    PubMed

    Montero-Marín, Jesús; García-Campayo, Javier

    2010-06-02

    Burnout syndrome has been clinically characterised by a series of three subtypes: frenetic, underchallenged, and worn-out, with reference to coping strategies for stress and frustration at work with different degrees of dedication. The aims of the study are to present an operating definition of these subtypes in order to assess their reliability and convergent validity with respect to a standard burnout criterion and to examine differences with regard to sex and the temporary nature of work contracts. An exploratory factor analysis was performed by the main component method on a range of items devised by experts. The sample was composed of 409 employees of the University of Zaragoza, Spain. The reliability of the scales was assessed with Cronbach's alpha, convergent validity in relation to the Maslach Burnout Inventory with Pearson's r, and differences with Student's t-test and the Mann-Whitney U test. The factorial validity and reliability of the scales were good. The subtypes presented relations of differing degrees with the criterion dimensions, which were greater when dedication to work was lower. The frenetic profile presented fewer relations with the criterion dimensions while the worn-out profile presented relations of the greatest magnitude. Sex was not influential in establishing differences. However, the temporary nature of work contracts was found to have an effect: temporary employees exhibited higher scores in the frenetic profile (p < 0.001), while permanent employees did so in the underchallenged (p = 0.018) and worn-out (p < 0.001) profiles. The classical Maslach description of burnout does not include the frenetic profile; therefore, these patients are not recognised. The developed questionnaire may be a useful tool for the design and appraisal of specific preventive and treatment approaches based on the type of burnout experienced.
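
    For readers unfamiliar with the reliability coefficient named above, the following is a minimal sketch of Cronbach's alpha using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the sample scores are hypothetical and do not come from the study.

```python
from statistics import variance

def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    items: list of k lists, one per item, each holding that item's scores
    for the same respondents in the same order.
    """
    k = len(items)
    # Total scale score for each respondent.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(item) for item in items)
    return (k / (k - 1)) * (1 - item_variance_sum / variance(totals))

# Hypothetical data: 3 items answered by 5 respondents.
example_items = [
    [2, 3, 4, 4, 5],
    [1, 3, 3, 4, 5],
    [2, 2, 4, 5, 5],
]
print(round(cronbach_alpha(example_items), 3))
```

    Sample variances are used throughout; because the same denominator appears in both the item and total variances, population variances would give the same alpha.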

  1. Reliability and Validity of the Behavioral Addiction Measure for Video Gaming.

    PubMed

    Sanders, James L; Williams, Robert J

    2016-01-01

    Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.

  2. Construct Validity of the Relationship Profile Test: Links with measures of psychopathology and adult attachment

    PubMed Central

    Haggerty, Greg; Bornstein, Robert F.; Khalid, Mohammad; Sharma, Vishal; Riaz, Usman; Blanchard, Mark; Siefert, Caleb J; Sinclair, Samuel J.

    2015-01-01

    This study assessed the construct validity of the Relationship Profile Test (RPT; Bornstein & Languirand, 2003) with a substance abuse sample. One hundred-eight substance abuse patients completed the RPT, Experiences in Close Relationships Scale (ECR-SF; Wei, Russell, Mallinckrodt, & Vogel, 2007), Personality Assessment Inventory (PAI; Morey, 1991), and Symptom Checklist-90-Revised (SCL-90-R: Derogatis 1983). Results suggest that the RPT has good construct validity when compared against theoretically related broadband measures of personality, psychopathology and adult attachment. Overall, healthy dependency was negatively related to measures of psychopathology and insecure attachment, and overdependence was positively related to measures of psychopathology and attachment anxiety. Many of the predictions regarding RPT detachment and the criterion measures were not supported. Implications of these findings are discussed. PMID:26620463

  3. Translation and validation of the Canadian diabetes risk assessment questionnaire in China.

    PubMed

    Guo, Jia; Shi, Zhengkun; Chen, Jyu-Lin; Dixon, Jane K; Wiley, James; Parry, Monica

    2018-01-01

    To adapt the Canadian Diabetes Risk Assessment Questionnaire for the Chinese population and to evaluate its psychometric properties. A cross-sectional study was conducted with a convenience sample of 194 individuals aged 35-74 years from October 2014 to April 2015. The Canadian Diabetes Risk Assessment Questionnaire was adapted and translated for the Chinese population. Test-retest reliability was conducted to measure stability. Criterion and convergent validity of the adapted questionnaire were assessed using 2-hr 75 g oral glucose tolerance tests and the Finnish Diabetes Risk Scores, respectively. Sensitivity and specificity were evaluated to establish its predictive validity. The test-retest reliability was 0.988. Adequate validity of the adapted questionnaire was demonstrated by positive correlations found between the scores and 2-hr 75 g oral glucose tolerance tests (r = .343, p < .001) and with the Finnish Diabetes Risk Scores (r = .738, p < .001). The area under receiver operating characteristic curve was 0.705 (95% CI .632, .778), demonstrating moderate diagnostic value at a cutoff score of 30. The sensitivity was 73%, with a positive predictive value of 57% and negative predictive value of 78%. Our results provided evidence supporting the translation consistency, content validity, convergent validity, criterion validity, sensitivity, and specificity of the translated Canadian Diabetes Risk Assessment Questionnaire with minor modifications. This paper provides clinical, practical, and methodological information on how to adapt a diabetes risk calculator between cultures for public health nurses. © 2017 Wiley Periodicals, Inc.

  4. [Spanish version of the Satisfaction With Decision scale: cross-cultural adaptation, validity and reliability].

    PubMed

    Chabrera, Carolina; Areal, Joan; Font, Albert; Caro, Mónica; Bonet, Marta; Zabalegui, Adelaida

    2015-01-01

    The aim of this study is to develop a Spanish version of the Satisfaction With Decision scale (SWDs) and analyse the psychometric properties of validity and reliability. An observational, descriptive study and validation of a tool to measure satisfaction with the decision. Urology, Radiation oncology, and Medical oncology Departments of the Hospital Universitari Germans Trias i Pujol, Institut Català d'Oncologia and the Institut Oncològic del Vallès - Hospital General de Catalunya. A total of 170 participants diagnosed with prostate cancer who could read and write in Spanish and gave their informed consent. A translation, back-translation and cross-cultural adaptation to Spanish was performed on the SWDs. The content validity, criterion validity, construct validity and reliability (internal consistency and stability) of the Spanish version were evaluated. The SWDs contains 6 items with 5-item Likert scales. A Spanish version (ESD) was obtained that was linguistically and conceptually equivalent to the original version. For criterion validity, the correlation of the ESD with "satisfaction with the decision" measured on a linear analogue scale was significant (r=0.63, P<.01) for all items. The factorial analysis showed a unique dimension to explain 82.08% of the variance. The ESD showed excellent results in terms of internal consistency (Cronbach alpha=0.95) and good test-retest reliability with intraclass correlation coefficient of 0.711. The ESD is a validated Spanish scale to measure the satisfaction with the decisions taken in health, and demonstrates a correct validity and reliability. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.

  5. The French-Canadian validation of a disease-specific, patient-reported outcome measure for lupus.

    PubMed

    Bourré-Tessier, J; Clarke, A E; Kosinski, M; Mikolaitis-Preuss, R A; Bernatsky, S; Block, J A; Jolly, M

    2014-12-01

    The objective of this paper is to perform the cross-cultural validation of the French version of the LupusPRO, a disease-targeted patient-reported outcome measure, among systemic lupus erythematosus (SLE) patients in Canada. The French version of the LupusPRO and the MOS SF-36 were administered; demographic, clinical and serological characteristics were obtained. Disease activity (SELENA-SLEDAI and the Lupus Foundation of America definition of flare) and damage (SLICC/ACR SDI) were assessed. Physician disease activity and damage assessments were ascertained using visual analog scales. Internal consistency reliability (ICR), test-retest reliability (TRT), convergent and discriminant validity (against corresponding domains of the SF-36), criterion validity (against disease activity, damage or health status) and known group validity were tested. A total of 99 French-Canadian SLE patients participated (97% women, mean (SD) age 45.2 (14.5) years). The median (IQR) SELENA-SLEDAI and SDI were 3.5 (6.0) and 1.0 (2.0), respectively. The ICR of the LupusPRO domains ranged from 0.81 to 0.93 (except for lupus symptoms, procreation and coping), while TRT ranged from 0.72 to 0.95. Convergent and discriminant validity, criterion validity and known group validity against disease activity, damage and health status measures were observed. Confirmatory factor analysis showed a good fit. The LupusPRO has fair psychometric properties among French-Canadian patients with SLE. © The Author(s) 2014 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.

  6. Transcultural and psychometric validation of the Dispositional Resilience Scale (DRS-15) in Chinese adult women.

    PubMed

    Wong, Janet Yuen-Ha; Fong, Daniel Yee-Tak; Choi, Anna Wai-Man; Chan, Claudia Kor-Yee; Tiwari, Agnes; Chan, Ko Ling; Lai, Vincent; Logan, Tk; Bartone, Paul

    2014-11-01

    The aim of this study was to report translation and transcultural adaptation of the 15-item Dispositional Resilience Scale in traditional Chinese (C-DRS-15) and evaluate its psychometric properties. The DRS is a self-report instrument that measures psychological hardiness. We followed an international standard of cross-cultural translation and validation of patient-reported outcome measures to create the Chinese version. Then, the translated C-DRS-15 was validated on 542 Chinese women from a population-based sample in Hong Kong. The internal consistency and criterion-related validity were investigated. Exploratory and confirmatory factor analysis revealed that the C-DRS-15 was supported by a modified three-factor structure in our Chinese sample (RMSEA = .06, CFI = .94, TLI = .92, and SRMR = .06). The reliability (Cronbach's α coefficient = .78) and validity were satisfactory. Total resilience score was negatively correlated with depression (p < .001), with non-depressed women scoring higher on the C-DRS-15. The C-DRS-15 was demonstrated to be a reliable and valid measurement to assess hardiness in Chinese women.

  7. Validation of the Internet Gaming Disorder Scale - Short-Form (IGDS9-SF) in an Italian-speaking sample.

    PubMed

    Monacis, Lucia; Palo, Valeria de; Griffiths, Mark D; Sinatra, Maria

    2016-12-01

    Background and aims The inclusion of Internet Gaming Disorder (IGD) in Section III of the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders has increased the interest of researchers in the development of new standardized psychometric tools for the assessment of such a disorder. To date, the nine-item Internet Gaming Disorder Scale - Short-Form (IGDS9-SF) has only been validated in English, Portuguese, and Slovenian languages. Therefore, the aim of this investigation was to examine the psychometric properties of the IGDS9-SF in an Italian-speaking sample. Methods A total of 757 participants were recruited to the present study. Confirmatory factor analysis and multi-group analyses were applied to assess the construct validity. Reliability analyses comprised the average variance extracted, the standard error of measurement, and the factor determinacy coefficient. Convergent and criterion validities were established through the associations with other related constructs. The receiver operating characteristic curve analysis was used to determine an empirical cut-off point. Results Findings confirmed the single-factor structure of the instrument, its measurement invariance at the configural level, and the convergent and criterion validities. Satisfactory levels of reliability and a cut-off point of 21 were obtained. Discussion and conclusions The present study provides validity evidence for the use of the Italian version of the IGDS9-SF and may foster research into gaming addiction in the Italian context.

  8. Reliability and Validity of the Japanese Version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ)

    PubMed Central

    Nakano, Hideki; Kodama, Takayuki; Ukai, Kazumasa; Kawahara, Satoru; Horikawa, Shiori; Murata, Shin

    2018-01-01

    In this study, we aimed to (1) translate the English version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ), which assesses motor imagery ability, into Japanese, and (2) investigate the reliability and validity of the Japanese KVIQ. We enrolled 28 healthy adults in this study. We used Cronbach’s alpha coefficients to assess reliability reflected by the internal consistency. Additionally, we assessed validity reflected by the criterion-related validity between the Japanese KVIQ and the Japanese version of the Movement Imagery Questionnaire-Revised (MIQ-R) with Spearman’s rank correlation coefficients. The Cronbach’s alpha coefficients for the KVIQ-20 were 0.88 (Visual) and 0.91 (Kinesthetic), which indicates high reliability. There was a significant positive correlation between the Japanese KVIQ-20 (Total) and the Japanese MIQ-R (Total) (r = 0.86, p < 0.01). Our results suggest that the Japanese KVIQ is an assessment that is a reliable and valid index of motor imagery ability. PMID:29724042

  9. Reliability and Validity of the Japanese Version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ).

    PubMed

    Nakano, Hideki; Kodama, Takayuki; Ukai, Kazumasa; Kawahara, Satoru; Horikawa, Shiori; Murata, Shin

    2018-05-02

    In this study, we aimed to (1) translate the English version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ), which assesses motor imagery ability, into Japanese, and (2) investigate the reliability and validity of the Japanese KVIQ. We enrolled 28 healthy adults in this study. We used Cronbach’s alpha coefficients to assess reliability reflected by the internal consistency. Additionally, we assessed validity reflected by the criterion-related validity between the Japanese KVIQ and the Japanese version of the Movement Imagery Questionnaire-Revised (MIQ-R) with Spearman’s rank correlation coefficients. The Cronbach’s alpha coefficients for the KVIQ-20 were 0.88 (Visual) and 0.91 (Kinesthetic), which indicates high reliability. There was a significant positive correlation between the Japanese KVIQ-20 (Total) and the Japanese MIQ-R (Total) (r = 0.86, p < 0.01). Our results suggest that the Japanese KVIQ is an assessment that is a reliable and valid index of motor imagery ability.

  10. A Note on the Incremental Validity of Aggregate Predictors.

    ERIC Educational Resources Information Center

    Day, H. D.; Marshall, David

    Three computer simulations were conducted to show that very high aggregate predictive validity coefficients can occur when the across-case variability in absolute score stability occurring in both the predictor and criterion matrices is quite small. In light of the increase in internal consistency reliability achieved by the method of aggregation…

  11. Job Embeddedness Demonstrates Incremental Validity When Predicting Turnover Intentions for Australian University Employees

    PubMed Central

    Heritage, Brody; Gilbert, Jessica M.; Roberts, Lynne D.

    2016-01-01

    Job embeddedness is a construct that describes the manner in which employees can be enmeshed in their jobs, reducing their turnover intentions. Recent questions regarding the properties of quantitative job embeddedness measures, and their predictive utility, have been raised. Our study compared two competing reflective measures of job embeddedness, examining their convergent, criterion, and incremental validity, as a means of addressing these questions. Cross-sectional quantitative data from 246 Australian university employees (146 academic; 100 professional) was gathered. Our findings indicated that the two compared measures of job embeddedness were convergent when total scale scores were examined. Additionally, job embeddedness was capable of demonstrating criterion and incremental validity, predicting unique variance in turnover intention. However, this finding was not readily apparent with one of the compared job embeddedness measures, which demonstrated comparatively weaker evidence of validity. We discuss the theoretical and applied implications of these findings, noting that job embeddedness has a complementary place among established determinants of turnover intention. PMID:27199817

  12. Toward a Process-Focused Model of Test Score Validity: Improving Psychological Assessment in Science and Practice

    ERIC Educational Resources Information Center

    Bornstein, Robert F.

    2011-01-01

    Although definitions of validity have evolved considerably since L. J. Cronbach and P. E. Meehl's classic (1955) review, contemporary validity research continues to emphasize correlational analyses assessing predictor-criterion relationships, with most outcome criteria being self-reports. The present article describes an alternative way of…

  13. Reliability and validity of a tool to measure the severity of tongue thrust in children: the Tongue Thrust Rating Scale.

    PubMed

    Serel Arslan, S; Demir, N; Karaduman, A A

    2017-02-01

    This study aimed to develop a scale called Tongue Thrust Rating Scale (TTRS), which categorised tongue thrust in children in terms of its severity during swallowing, and to investigate its validity and reliability. The study describes the developmental phase of the TTRS and presents its content and criterion-based validity and interobserver and intra-observer reliability. For content validation, seven experts assessed the steps in the scale over two Delphi rounds. Two physical therapists evaluated videos of 50 children with cerebral palsy (mean age, 57·9 ± 16·8 months), using the TTRS to test criterion-based validity, interobserver and intra-observer reliability. The Karaduman Chewing Performance Scale (KCPS) and Drooling Severity and Frequency Scale (DSFS) were used for criterion-based validity. All the TTRS steps were deemed necessary. The content validity index was 0·857. A very strong positive correlation was found between two examinations by one physical therapist, which indicated intra-observer reliability (r = 0·938, P < 0·001). A very strong positive correlation was also found between the TTRS scores of two physical therapists, indicating interobserver reliability (r = 0·892, P < 0·001). There was also a strong positive correlation between the TTRS and KCPS (r = 0·724, P < 0·001) and a very strong positive correlation between the TTRS scores and DSFS (r = 0·822 and r = 0·755; P < 0·001). These results demonstrated the criterion-based validity of the TTRS. The TTRS is a valid, reliable and clinically easy-to-use functional instrument to document the severity of tongue thrust in children. © 2016 John Wiley & Sons Ltd.

  14. Screening for cognitive impairment in older individuals. Validation study of a computer-based test.

    PubMed

    Green, R C; Green, J; Harrison, J M; Kutner, M H

    1994-08-01

    This study examined the validity of a computer-based cognitive test that was recently designed to screen the elderly for cognitive impairment. Criterion-related validity was examined by comparing test scores of impaired patients and normal control subjects. Construct-related validity was computed through correlations between computer-based subtests and related conventional neuropsychological subtests. University center for memory disorders. Fifty-two patients with mild cognitive impairment by strict clinical criteria and 50 unimpaired, age- and education-matched control subjects. Control subjects were rigorously screened by neurological, neuropsychological, imaging, and electrophysiological criteria to identify and exclude individuals with occult abnormalities. Using a cut-off total score of 126, this computer-based instrument had a sensitivity of 0.83 and a specificity of 0.96. Using a prevalence estimate of 10%, predictive values, positive and negative, were 0.70 and 0.96, respectively. Computer-based subtests correlated significantly with conventional neuropsychological tests measuring similar cognitive domains. Thirteen (17.8%) of 73 volunteers with normal medical histories were excluded from the control group, with unsuspected abnormalities on standard neuropsychological tests, electroencephalograms, or magnetic resonance imaging scans. Computer-based testing is a valid screening methodology for the detection of mild cognitive impairment in the elderly, although this particular test has important limitations. Broader applications of computer-based testing will require extensive population-based validation. Future studies should recognize that normal control subjects without a history of disease who are typically used in validation studies may have a high incidence of unsuspected abnormalities on neurodiagnostic studies.
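
    The step from sensitivity and specificity to predictive values at an assumed prevalence follows from Bayes' rule, and can be sketched as below. The function name is hypothetical; the inputs are the rounded figures quoted in the abstract (sensitivity 0.83, specificity 0.96, prevalence 10%). With those inputs the positive predictive value comes out at about 0.70, as reported. The negative predictive value computed from the same rounded inputs is about 0.98 rather than the reported 0.96; the abstract does not say how its figure was derived, so the difference is left unexplained here.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a test applied at a given prevalence."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    false_pos = (1 - specificity) * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv

# Rounded figures taken from the abstract above.
ppv, npv = predictive_values(sensitivity=0.83, specificity=0.96, prevalence=0.10)
print(f"PPV = {ppv:.2f}, NPV = {npv:.2f}")
```

    Because both values depend on prevalence, the same test would show a much lower positive predictive value in a population where impairment is rarer.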

  15. Does the decision in a validation process of a surrogate endpoint change with level of significance of treatment effect? A proposal on validation of surrogate endpoints.

    PubMed

    Sertdemir, Y; Burgut, R

    2009-01-01

    In recent years the use of surrogate end points (S) has become an interesting issue. In clinical trials, it is important to get treatment outcomes as early as possible. For this reason there is a need for surrogate endpoints (S) which are measured earlier than the true endpoint (T). However, before a surrogate endpoint can be used it must be validated. For a candidate surrogate endpoint, for example time to recurrence, the validation result may change dramatically between clinical trials. The aim of this study is to show how the validation criterion (R(2)(trial)) proposed by Buyse et al. is influenced by the magnitude of treatment effect with an application using real data. The criterion R(2)(trial) proposed by Buyse et al. (2000) is applied to the four data sets from colon cancer clinical trials (C-01, C-02, C-03 and C-04). Each clinical trial is analyzed separately for treatment effect on survival (true endpoint) and recurrence free survival (surrogate endpoint) and this analysis is done also for each center in each trial. Results are used for standard validation analysis. The centers were grouped by the Wald statistic in 3 equal groups. Validation criteria R(2)(trial) were 0.641 95% CI (0.432-0.782), 0.223 95% CI (0.008-0.503), 0.761 95% CI (0.550-0.872) and 0.560 95% CI (0.404-0.687) for C-01, C-02, C-03 and C-04 respectively. The R(2)(trial) criteria changed with the Wald statistics observed for the centers used in the validation process: the higher the Wald statistic group, the higher the R(2)(trial) values observed. Recurrence free survival is not a good surrogate for overall survival in clinical trials with non-significant treatment effects and is a moderate surrogate in trials with significant treatment effects. This shows that the level of significance of treatment effect should be taken into account in the validation process of surrogate endpoints.

  16. The Trait Emotional Intelligence Questionnaire: Internal Structure, Convergent, Criterion, and Incremental Validity in an Italian Sample

    ERIC Educational Resources Information Center

    Andrei, Federica; Smith, Martin M.; Surcinelli, Paola; Baldaro, Bruno; Saklofske, Donald H.

    2016-01-01

    This study investigated the structure and validity of the Italian translation of the Trait Emotional Intelligence Questionnaire. Data were self-reported from 227 participants. Confirmatory factor analysis supported the four-factor structure of the scale. Hierarchical regressions also demonstrated its incremental validity beyond demographics, the…

  17. Validation of the Weight Concerns Scale Applied to Brazilian University Students.

    PubMed

    Dias, Juliana Chioda Ribeiro; da Silva, Wanderson Roberto; Maroco, João; Campos, Juliana Alvares Duarte Bonini

    2015-06-01

    The aim of this study was to evaluate the validity and reliability of the Portuguese version of the Weight Concerns Scale (WCS) when applied to Brazilian university students. The scale was completed by 1084 university students from Brazilian public education institutions. A confirmatory factor analysis was conducted. The stability of the model in independent samples was assessed through multigroup analysis, and the invariance was estimated. Convergent, concurrent, divergent, and criterion validities as well as internal consistency were estimated. Results indicated that the one-factor model presented an adequate fit to the sample and values of convergent validity. The concurrent validity with the Body Shape Questionnaire and divergent validity with the Maslach Burnout Inventory for Students were adequate. Internal consistency was adequate, and the factorial structure was invariant in independent subsamples. The results present a simple and short instrument capable of precisely and accurately assessing concerns with weight among Brazilian university students. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. Validation of the Australian Propensity for Angry Driving Scale (Aus-PADS).

    PubMed

    Leal, Nerida L; Pachana, Nancy A

    2009-09-01

    The present study used a university sample to assess the test-retest reliability and validity of the Australian Propensity for Angry Driving Scale (Aus-PADS). The scale has stability over time, and convergent validity was established, as Aus-PADS scores correlated significantly with established anger and impulsivity measures. Discriminant validity was also established, as Aus-PADS scores did not correlate with Venturesomeness scores. The Aus-PADS has demonstrated criterion validity, as scores were correlated with behavioural measures, such as yelling at other drivers, gesturing at other drivers, and feeling angry but not doing anything. Aus-PADS scores reliably predicted the frequency of these behaviours over and above other study variables. No significant relationship between aggressive driving and crash involvement was observed. It was concluded that the Aus-PADS is a reliable and valid tool appropriate for use in Australian research, and that the potential relationship between aggressive driving and crash involvement warrants further investigation with a more representative (and diverse) driver sample.

  19. Validity of the Digital Inclinometer and iPhone When Measuring Thoracic Spine Rotation.

    PubMed

    Bucke, Jonathan; Spencer, Simon; Fawcett, Louise; Sonvico, Lawrence; Rushton, Alison; Heneghan, Nicola R

    2017-09-01

    Spinal axial rotation is required for many functional and sporting activities. Eighty percent of axial rotation occurs in the thoracic spine. Existing measures of thoracic spine rotation commonly involve laboratory equipment, use a seated position, and include lumbar motion. A simple performance-based outcome measure would allow clinicians to evaluate isolated thoracic spine rotation. Currently, no valid measure exists. To explore the criterion and concurrent validity of a digital inclinometer (DI) and iPhone Clinometer app (iPhone) for measuring thoracic spine rotation using the heel-sit position. Controlled laboratory study. University laboratory. A total of 23 asymptomatic healthy participants (14 men, 9 women; age = 25.82 ± 4.28 years, height = 170.26 ± 8.01 cm, mass = 67.50 ± 9.46 kg, body mass index = 23.26 ± 2.79) were recruited from a student population. We took DI and iPhone measurements of thoracic spine rotation in the heel-sit position concurrently with dual-motion analysis (laboratory measure) and ultrasound imaging of the underlying bony tissue motion (reference standard). To determine the criterion and concurrent validity, we used the Pearson product moment correlation coefficient (r, 2 tailed) and Bland-Altman plots. The DI (r = 0.88, P < .001) and iPhone (r = 0.88, P < .001) demonstrated strong criterion validity. Both also had strong concurrent validity (r = 0.98, P < .001). Bland-Altman plots illustrated mean differences of 5.82° (95% confidence interval [CI] = 20.37°, -8.73°) and 4.94° (95% CI = 19.23°, -9.35°) between the DI and iPhone, respectively, and the reference standard and 0.87° (95% CI = 6.79°, -5.05°) between the DI and iPhone. The DI and iPhone provided valid measures of thoracic spine rotation in the heel-sit position. Both can be used in clinical practice to assess thoracic spine rotation, which may be valuable when evaluating thoracic dysfunction.
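
    A Bland-Altman analysis of the kind mentioned above summarises agreement between two methods by the mean of their paired differences and an interval around it. The sketch below uses the conventional 95% limits of agreement (mean difference ± 1.96 standard deviations of the differences); whether the intervals quoted in the abstract were computed exactly this way is an assumption, and the function name and sample readings are hypothetical.

```python
from statistics import mean, stdev

def bland_altman(method_a, method_b):
    """Return (mean difference, lower limit, upper limit) for paired readings.

    Limits are the conventional 95% limits of agreement:
    mean difference +/- 1.96 * sample SD of the differences.
    """
    differences = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(differences)
    spread = 1.96 * stdev(differences)
    return bias, bias - spread, bias + spread

# Hypothetical rotation readings in degrees from two devices.
device = [42.0, 55.0, 48.0, 60.0, 51.0]
reference = [38.0, 47.0, 45.0, 52.0, 49.0]

bias, lower, upper = bland_altman(device, reference)
print(f"mean difference = {bias:.2f}, limits = ({lower:.2f}, {upper:.2f})")
```

    Unlike a correlation coefficient, these figures are in the units of the measurement, so they show directly how far a single reading from one method may lie from the other.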

  20. Concurrent validity and clinical usefulness of several individually administered tests of children's social-emotional cognition.

    PubMed

    McKown, Clark

    2007-03-01

    In this study, the validity of 5 tests of children's social-emotional cognition, defined as their encoding, memory, and interpretation of social information, was tested. Participants were 126 clinic-referred children between the ages of 5 and 17. All 5 tests were evaluated in terms of their (a) concurrent validity, (b) incremental validity, and (c) clinical usefulness in predicting social functioning. Tests included measures of nonverbal sensitivity, social language, and social problem solving. Criterion measures included parent and teacher report of social functioning. Analyses support the concurrent validity of all measures, and the incremental validity and clinical usefulness of tests of pragmatic language and problem solving.

  1. Validation of Cost-Effectiveness Criterion for Evaluating Noise Abatement Measures

    DOT National Transportation Integrated Search

    1999-04-01

    This project will provide the Texas Department of Transportation (TxDOT) with information about the effects of the current cost-effectiveness criterion. The project has reviewed (1) the cost-effectiveness criteria used by other states, (2) the noise b...

  2. Validity of the posttraumatic stress disorders (PTSD) checklist in pregnant women.

    PubMed

    Gelaye, Bizu; Zheng, Yinnan; Medina-Mora, Maria Elena; Rondon, Marta B; Sánchez, Sixto E; Williams, Michelle A

    2017-05-12

    The PTSD Checklist-civilian (PCL-C) is one of the most commonly used self-report measures of PTSD symptoms; however, little is known about its validity when used in pregnancy. This study aims to evaluate the reliability and validity of the PCL-C as a screen for detecting PTSD symptoms among pregnant women. A total of 3372 pregnant women who attended their first prenatal care visit in Lima, Peru participated in the study. We assessed the reliability of the PCL-C items using Cronbach's alpha. Criterion validity and performance characteristics of the PCL-C were assessed against an independent, blinded Clinician-Administered PTSD Scale (CAPS) interview using measures of sensitivity, specificity and receiver operating characteristic (ROC) curves. We tested construct validity using exploratory and confirmatory factor analytic approaches. The reliability of the PCL-C was excellent (Cronbach's alpha = 0.90). ROC analysis showed that a cut-off score of 26 offered optimal discriminatory power, with a sensitivity of 0.86 (95% CI: 0.78-0.92) and a specificity of 0.63 (95% CI: 0.62-0.65). The area under the ROC curve was 0.75 (95% CI: 0.71-0.78). A three-factor solution was extracted using exploratory factor analysis and was further complemented with three other models using confirmatory factor analysis (CFA). In a CFA, a three-factor model based on DSM-IV symptom structure had reasonable fit statistics with comparative fit index of 0.86 and root mean square error of approximation of 0.09. The Spanish-language version of the PCL-C may be used as a screening tool for pregnant women. The PCL-C has good reliability, criterion validity and factorial validity. The optimal cut-off score obtained by maximizing the sensitivity and specificity should be considered cautiously; women who screened positive may require further investigation to confirm PTSD diagnosis.
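
As a rough illustration of how sensitivity and specificity at a cut-off score are obtained, the sketch below scores made-up data against a reference diagnosis. It assumes that a score at or above the cut-off counts as screen-positive, which the abstract does not specify; the function name and the data are hypothetical.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Return (sensitivity, specificity) of a screen at a given cut-off.

    Assumption: score >= cutoff is treated as screen-positive.
    """
    tp = fn = tn = fp = 0
    for score, diseased in zip(scores, has_condition):
        positive = score >= cutoff
        if diseased and positive:
            tp += 1        # correctly flagged case
        elif diseased:
            fn += 1        # missed case
        elif positive:
            fp += 1        # false alarm
        else:
            tn += 1        # correctly cleared non-case
    return tp / (tp + fn), tn / (tn + fp)


# Made-up checklist scores and reference-standard diagnoses.
scores = [30, 25, 40, 20, 27, 18]
has_condition = [True, True, True, False, False, False]
sens, spec = sensitivity_specificity(scores, has_condition, cutoff=26)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

Repeating the calculation over every candidate cut-off yields the points of an ROC curve, from which a cut-off that balances the two quantities can be chosen.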

  3. The cross-validated AUC for MCP-logistic regression with high-dimensional data.

    PubMed

    Jiang, Dingfeng; Huang, Jian; Zhang, Ying

    2013-10-01

    We propose a cross-validated area under the receiver operating characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selection. The CV-AUC criterion is specifically designed for optimizing the classification performance for binary outcome data. To implement the proposed approach, we derive an efficient coordinate descent algorithm to compute the MCP-logistic regression solution surface. Simulation studies are conducted to evaluate the finite sample performance of the proposed method and its comparison with the existing methods including the Akaike information criterion (AIC), Bayesian information criterion (BIC) or Extended BIC (EBIC). The model selected based on the CV-AUC criterion tends to have a larger predictive AUC and smaller classification error than those with tuning parameters selected using the AIC, BIC or EBIC. We illustrate the application of the MCP-logistic regression with the CV-AUC criterion on three microarray datasets from the studies that attempt to identify genes related to cancers. Our simulation studies and data examples demonstrate that the CV-AUC is an attractive method for tuning parameter selection for penalized methods in high-dimensional logistic regression models.
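
The general idea of choosing a tuning parameter by cross-validated AUC can be sketched as follows. This is not the authors' implementation: the MCP fitting step is omitted, held-out predictions are taken as given, and averaging per-fold AUCs is an assumption, since the abstract does not say how folds are combined. The names `auc` and `select_by_cv_auc` and all data are hypothetical.

```python
def auc(scores, labels):
    """AUC as the probability that a positive outscores a negative (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def select_by_cv_auc(fold_predictions):
    """Pick the tuning value with the largest mean held-out AUC.

    fold_predictions maps each candidate tuning value to a list of
    (scores, labels) pairs, one pair per held-out fold.
    Assumption: per-fold AUCs are simply averaged.
    """
    mean_auc = {
        value: sum(auc(s, y) for s, y in folds) / len(folds)
        for value, folds in fold_predictions.items()
    }
    best = max(mean_auc, key=mean_auc.get)
    return best, mean_auc


# Made-up held-out predictions for two candidate penalty values.
fold_predictions = {
    0.1: [([0.9, 0.2, 0.6, 0.4], [1, 0, 1, 0]), ([0.7, 0.3, 0.2, 0.8], [1, 0, 0, 1])],
    1.0: [([0.5, 0.5, 0.6, 0.4], [1, 0, 1, 0]), ([0.4, 0.3, 0.6, 0.8], [1, 0, 0, 1])],
}
best, mean_auc = select_by_cv_auc(fold_predictions)
print(best, mean_auc)
```

Unlike AIC or BIC, a criterion of this kind targets ranking performance on unseen data directly, which matches the abstract's emphasis on classification of binary outcomes.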

  4. Validation of a Portuguese version of the Information Needs in Cardiac Rehabilitation (INCR) scale in Brazil.

    PubMed

    Ghisi, Gabriela Lima de Melo; Dos Santos, Rafaella Zulianello; Bonin, Christiani Batista Decker; Roussenq, Suellen; Grace, Sherry L; Oh, Paul; Benetti, Magnus

    2014-01-01

    To translate, culturally adapt and psychometrically validate the Information Needs in Cardiac Rehabilitation (INCR) tool to Portuguese. The identification of information needs is considered the first step to improve knowledge that ultimately could improve health outcomes. The Portuguese version generated was tested in 300 cardiac rehabilitation patients (CR) (34% women; mean age = 61.3 ± 2.1 years old). Test-retest reliability was assessed using intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and the criterion validity was assessed with regard to patients' education and duration in CR. All 9 subscales were considered internally consistent (α > 0.7). Significant differences between mean total needs and educational level (p < 0.05) and duration in CR (p = 0.03) supported criterion validity. The overall mean (4.6 ± 0.4), as well as the means of the 9 subscales were high (emergency/safety was the greatest need). The Portuguese INCR was demonstrated to have sufficient reliability, consistency and validity. Copyright © 2014 Elsevier Inc. All rights reserved.
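
Cronbach's alpha, the internal-consistency statistic used in this and several neighbouring records, can be computed with the standard formula k/(k-1) × (1 − Σ item variances / variance of total scores). The sketch below is a minimal illustration on made-up responses; the function name `cronbach_alpha` is hypothetical and sample variances are assumed.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Uses sample variances; total scores must not all be identical,
    otherwise the total variance is zero and alpha is undefined.
    """
    k = len(responses[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    item_vars = [variance([row[i] for row in responses]) for i in range(k)]
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Made-up responses: 4 respondents answering 3 items on a 1-5 scale.
responses = [[3, 4, 3], [4, 5, 5], [2, 2, 3], [5, 5, 4]]
print(f"alpha = {cronbach_alpha(responses):.2f}")
```

The 0.7 threshold quoted in the abstract is a common rule of thumb rather than a property of the formula.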

  5. Psychometric Validation of a Short Acculturation Scale for Korean Immigrants

    PubMed Central

    Choi, Sarah E.; Reed, Preston L.

    2011-01-01

    Background Acculturation has been identified as a determinant of health behavior and outcome among ethnic minorities. The high prevalence of lifestyle related chronic diseases and risk factors among Korean immigrants calls for a valid short acculturation scale to use in clinical practice and health research settings. Objectives To validate the psychometric properties of a Short Acculturation Scale originally developed for Hispanics (SASH) after translating the scale to Korean (SAS-K) to determine its suitability for use with Korean immigrants. Method A self-administered questionnaire was completed by 143 Korean immigrants with type 2 diabetes aged 30–80 years from a Korean community in Southern California. Confirmatory factor analysis, criterion validity, and internal reliability were utilized to evaluate the psychometric properties of the SAS-K. Results Using a second-order confirmatory factor analysis, a three-factor structure [χ2 (51) = 121.49, p < .001, CFI = .950, SRMR = .055, RMSEA = .099] was confirmed. The SAS-K was associated positively with length of residence, age of arrival, and English proficiency. Reliability for the total SAS-K was .93. Cronbach’s alpha coefficients for each subscale of the SAS-K ranged from .80 (social relations) to .95 (media). Discussion The 12-item, easy-to-use SAS-K demonstrated satisfactory reliability and validity and thus is an appropriate instrument for measuring acculturation in Korean immigrants. The short nature and ease of administration of the SAS-K makes it an ideal choice for healthcare providers and researchers to assess acculturation levels quickly and easily, and to further the development and use of more culture-appropriate interventions. PMID:21677595

  6. Relative validity of a semiquantitative food frequency questionnaire designed for schoolchildren in western Greece

    PubMed Central

    Roumelioti, Maria; Leotsinidis, Michalis

    2009-01-01

    Background The use of food frequency questionnaires (FFQs) has become increasingly important in epidemiologic studies. During the past few decades, a wide variety of nutritional studies have used the semiquantitative FFQ as a tool for assessing and evaluating dietary intake. One of the main concerns in a dietary analysis is the validity of the collected dietary data. Methods This paper discusses several methodological and statistical issues related to the validation of a semiquantitative FFQ. This questionnaire was used to assess the nutritional habits of schoolchildren in western Greece. For validation purposes, we selected 200 schoolchildren and contacted their respective parents. We evaluated the relative validity of 400 FFQs (200 children's FFQs and 200 parents' FFQs). Results The correlations between the children's and the parents' questionnaire responses showed that the questionnaire we designed was appropriate for fulfilling the purposes of our study and in ranking subjects according to food group intake. Conclusion Our study shows that the semiquantitative FFQ provides a reasonably reliable measure of dietary intake and corroborates the relative validity of our questionnaire. PMID:19196469

  7. Visual reproduction subtest of the Wechsler Memory Scale-Revised: analysis of construct validity.

    PubMed

    Williams, M A; Rich, M A; Reed, L K; Jackson, W T; LaMarche, J A; Boll, T J

    1998-11-01

    This study assessed the construct validity of Visual Reproduction (VR) Cards A (Flags) and B (Boxes) from the original Wechsler Memory Scale (WMS) compared to Flags and Boxes from the revised edition of the WMS (WMS-R). Independent raters scored Flags and Boxes using both the original and revised scoring criteria and correlations were obtained with age, education, IQ, and four separate criterion memory measures. Results show that for Flags, there is a tendency for the revised scoring criteria to produce improved construct validity. For Boxes, however, there was a trend in the opposite direction, with the revised scoring criteria demonstrating worse construct validity. Factor analysis suggests that Flags are a more distinct measure of visual memory, whereas Boxes are more complex and significantly associated with conceptual reasoning abilities. Using the revised scoring criteria, Boxes were found to be more strongly related to IQ than Flags. This difference was not found using the original scoring criteria.

  8. Design for validation: An approach to systems validation

    NASA Technical Reports Server (NTRS)

    Carter, William C.; Dunham, Janet R.; Laprie, Jean-Claude; Williams, Thomas; Howden, William; Smith, Brian; Lewis, Carl M. (Editor)

    1989-01-01

    Every complex system built is validated in some manner. Computer validation begins with review of the system design. As systems became too complicated for one person to review, validation began to rely on the application of ad hoc methods by many individuals. As the cost of the changes mounted and the expense of failure increased, more organized procedures became essential. Attempts at devising and carrying out those procedures showed that validation is indeed a difficult technical problem. The successful transformation of the validation process into a systematic series of formally sound, integrated steps is necessary if the liability inherent in the future digital-system-based avionic and space systems is to be minimized. A suggested framework and timetable for the transformation are presented. Basic working definitions of two pivotal ideas (validation and system life-cycle) are provided and show how the two concepts interact. Many examples are given of past and present validation activities by NASA and others. A conceptual framework is presented for the validation process. Finally, important areas are listed for ongoing development of the validation process at NASA Langley Research Center.

  9. Validation of a home food inventory among low-income Spanish- and Somali-speaking families.

    PubMed

    Hearst, Mary O; Fulkerson, Jayne A; Parke, Michelle; Martin, Lauren

    2013-07-01

    To refine and validate an existing home food inventory (HFI) for low-income Somali- and Spanish-speaking families. Formative assessment was conducted using two focus groups, followed by revisions of the HFI, translation of written materials and instrument validation in participants’ homes. Twin Cities Metropolitan Area, Minnesota, USA. Thirty low-income families with children of pre-school age (fifteen Spanish-speaking; fifteen Somali-speaking) completed the HFI simultaneously with, but independently of, a trained staff member. Analysis consisted of calculation of both item-specific and average food group kappa coefficients, specificity, sensitivity and Spearman’s correlation between participants’ and staff scores as a means of assessing criterion validity of individual items, food categories and the obesogenic score. The formative assessment revealed the need for few changes/additions for food items typically found in Spanish-speaking households. Somali-speaking participants requested few additions, but many deletions, including frozen processed food items, non-perishable produce and many sweets as they were not typical food items kept in the home. Generally, all validity indices were within an acceptable range, with the exception of values associated with items such as ‘whole wheat bread’ (k = 0.16). The obesogenic score (presence of high-fat, high-energy foods) had high criterion validity with k = 0.57, sensitivity = 91.8%, specificity = 70.6% and Spearman correlation = 0.78. The revised HFI is a valid assessment tool for use among Spanish and Somali households. This instrument refinement and validation process can be replicated with other population groups.
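
The kappa coefficients reported above measure agreement between two raters beyond what chance would produce. A minimal sketch of Cohen's kappa for one item follows, assuming two raters (for example a participant and a staff member) recording presence (1) or absence (0) across households. The function name and the data are hypothetical.

```python
def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: (observed - expected agreement) / (1 - expected agreement).

    Undefined (division by zero) when both raters use a single identical category.
    """
    n = len(rater_a)
    categories = set(rater_a) | set(rater_b)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    expected = sum(
        (rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories
    )
    return (observed - expected) / (1 - expected)


# Made-up presence/absence records for one food item in four homes.
participant = [1, 1, 0, 0]
staff = [1, 0, 0, 0]
print(f"kappa = {cohen_kappa(participant, staff):.2f}")
```

Because kappa discounts chance agreement, an item that nearly every household has (or lacks) can show high raw agreement yet a low kappa.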

  10. Validation of the Internet Gaming Disorder Scale – Short-Form (IGDS9-SF) in an Italian-speaking sample

    PubMed Central

    Monacis, Lucia; de Palo, Valeria; Griffiths, Mark D.; Sinatra, Maria

    2016-01-01

    Background and aims The inclusion of Internet Gaming Disorder (IGD) in Section III of the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders has increased the interest of researchers in the development of new standardized psychometric tools for the assessment of such a disorder. To date, the nine-item Internet Gaming Disorder Scale – Short-Form (IGDS9-SF) has only been validated in English, Portuguese, and Slovenian languages. Therefore, the aim of this investigation was to examine the psychometric properties of the IGDS9-SF in an Italian-speaking sample. Methods A total of 757 participants were recruited to the present study. Confirmatory factor analysis and multi-group analyses were applied to assess the construct validity. Reliability analyses comprised the average variance extracted, the standard error of measurement, and the factor determinacy coefficient. Convergent and criterion validities were established through the associations with other related constructs. The receiver operating characteristic curve analysis was used to determine an empirical cut-off point. Results Findings confirmed the single-factor structure of the instrument, its measurement invariance at the configural level, and the convergent and criterion validities. Satisfactory levels of reliability and a cut-off point of 21 were obtained. Discussion and conclusions The present study provides validity evidence for the use of the Italian version of the IGDS9-SF and may foster research into gaming addiction in the Italian context. PMID:27876422

  11. Validity of the inexpensive Stepping Meter in counting steps in free living conditions: a pilot study

    PubMed Central

    De Cocker, K; Cardon, G; De Bourdeaudhuij, I

    2006-01-01

    Objectives To evaluate if inexpensive Stepping Meters are valid in counting steps in adults in free living conditions. Methods For six days, 35 healthy volunteers wore a criterion Yamax Digiwalker and five Stepping Meters every day until all 973 pedometers had been tested. Steps were recorded daily, and the differences between counts from the Digiwalker and the Stepping Meter were expressed as a percentage of the valid value of the Digiwalker step counts. The criterion used to determine if a Stepping Meter was valid was a maximum deviation of 10% from the Digiwalker step counts. Results A total of 252 (25.9%) Stepping Meters met the criterion, whereas 74.1% made an overestimation or underestimation of more than 10%. In more than one third (36.6%) of the invalid Stepping Meters, the deviation was greater than 50%. Most (64.8%) of the invalid pedometers overestimated the actual steps taken. Conclusions Inexpensive Stepping Meters cannot be used in community interventions as they will give participants the wrong message. PMID:16790485
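
The validity rule in this abstract (a maximum deviation of 10% from the criterion pedometer) can be sketched as below. Treating a deviation of exactly 10% as acceptable is an assumption; the function names and step counts are hypothetical.

```python
def percent_deviation(test_steps, criterion_steps):
    """Signed deviation of the test device, as a percentage of the criterion count."""
    return 100 * (test_steps - criterion_steps) / criterion_steps


def meets_criterion(test_steps, criterion_steps, max_deviation=10.0):
    """True if the absolute deviation is within max_deviation percent.

    Assumption: the boundary value itself counts as valid.
    """
    return abs(percent_deviation(test_steps, criterion_steps)) <= max_deviation


# Made-up daily counts: (test pedometer, criterion pedometer).
days = [(1050, 1000), (1600, 1000), (800, 1000)]
for test_steps, criterion_steps in days:
    deviation = percent_deviation(test_steps, criterion_steps)
    print(f"{deviation:+.1f}% valid={meets_criterion(test_steps, criterion_steps)}")
```

The sign of the deviation distinguishes overestimation from underestimation, which the abstract reports separately.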

  12. Translating and validating a Training Needs Assessment tool into Greek

    PubMed Central

    Markaki, Adelais; Antonakis, Nikos; Hicks, Carolyn M; Lionis, Christos

    2007-01-01

    Background The translation and cultural adaptation of widely accepted, psychometrically tested tools is regarded as an essential component of effective human resource management in the primary care arena. The Training Needs Assessment (TNA) is a widely used, valid instrument, designed to measure professional development needs of health care professionals, especially in primary health care. This study aims to describe the translation, adaptation and validation of the TNA questionnaire into the Greek language and discuss possibilities of its use in primary care settings. Methods A modified version of the English self-administered questionnaire consisting of 30 items was used. Internationally recommended methodology, mandating forward translation, backward translation, reconciliation and pretesting steps, was followed. Tool validation included assessing item internal consistency, using Cronbach's alpha coefficient. Reproducibility (test-retest reliability) was measured by the kappa correlation coefficient. Criterion validity was calculated for selected parts of the questionnaire by correlating respondents' research experience with relevant research item scores. An exploratory factor analysis highlighted how the items group together, using a Varimax (oblique) rotation and subsequent Cronbach's alpha assessment. Results The psychometric properties of the Greek version of the TNA questionnaire for nursing staff employed in primary care were good. Internal consistency of the instrument was very good: Cronbach's alpha was found to be 0.985 (p < 0.001) and the kappa coefficient for reproducibility was found to be 0.928 (p < 0.0001). Significant positive correlations were found between respondents' current performance levels on each of the research items and amount of research involvement, indicating good criterion validity in the areas tested.
Factor analysis revealed seven factors with eigenvalues of > 1.0, KMO (Kaiser-Meyer-Olkin) measure of sampling adequacy = 0.680 and

  13. Reliability and validity of two multidimensional self-reported physical activity questionnaires in people with chronic low back pain.

    PubMed

    Carvalho, Flávia A; Morelhão, Priscila K; Franco, Marcia R; Maher, Chris G; Smeets, Rob J E M; Oliveira, Crystian B; Freitas Júnior, Ismael F; Pinto, Rafael Z

    2017-02-01

    Although there is some evidence for reliability and validity of self-report physical activity (PA) questionnaires in the general adult population, it is unclear whether we can assume similar measurement properties in people with chronic low back pain (LBP). To determine the test-retest reliability of the International Physical Activity Questionnaire (IPAQ) long-version and the Baecke Physical Activity Questionnaire (BPAQ) and their criterion-related validity against data derived from accelerometers in patients with chronic LBP. Cross-sectional study. Patients with non-specific chronic LBP were recruited. Each participant attended the clinic twice (one week interval) and completed self-report PA. Accelerometer measures >7 days included time spent in moderate-and-vigorous physical activity, steps/day, counts/minute, and vector magnitude counts/minute. Intraclass Correlation Coefficients (ICC) and the Bland and Altman method were used to determine reliability, and Spearman's rho correlation was used for criterion-related validity. A total of 73 patients were included in our analyses. The reliability analyses revealed that the BPAQ and its subscales have moderate to excellent reliability (ICC2,1: 0.61 to 0.81), whereas IPAQ and most IPAQ domains (except walking) showed poor reliability (ICC2,1: 0.20 to 0.40). The Bland and Altman method revealed larger discrepancies for the IPAQ. For the validity analysis, questionnaire and accelerometer measures showed at best fair correlation (rho < 0.37). Although the BPAQ showed better reliability than the IPAQ long-version, neither questionnaire demonstrated acceptable validity against accelerometer data. These findings suggest that questionnaire and accelerometer PA measures should not be used interchangeably in this population. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Cross-cultural adaptation and validation of the French version of the Expanded Prostate cancer Index Composite questionnaire for health-related quality of life in prostate cancer patients.

    PubMed

    Anota, Amélie; Mariet, Anne-Sophie; Maingon, Philippe; Joly, Florence; Bosset, Jean-François; Guizard, Anne-Valérie; Bittard, Hugues; Velten, Michel; Mercier, Mariette

    2016-12-06

    Health-related quality of life (HRQoL) has been positioned as one of the major endpoints in oncology. Thus, there is a need to validate cancer-site specific survey instruments. This study aimed to perform a transcultural adaptation of the 50-item Expanded Prostate cancer Index Composite (EPIC) questionnaire for HRQoL in prostate cancer patients and to validate the psychometric properties of the French-language version. The EPIC questionnaire measures urinary, bowel, sexual and hormonal domains. The first step, corresponding to transcultural adaptation of the original English version of the EPIC, was performed according to the back translation technique. The second step, comprising the validation of the psychometric properties of the EPIC questionnaire, was performed in patients under treatment for localized prostate cancer (Treatment group) and in patients cured of prostate cancer (Cured group). The EORTC QLQ-C30 and QLQ-PR25 prostate cancer module were also completed by patients to assess criterion validity. Two assessments were performed, i.e., before and at the end of treatment for the Treatment group, to assess sensitivity to change; and at 2 weeks' interval in the Cured group to assess test-retest reliability. Psychometric properties were explored according to classical test theory. The first step showed overall good acceptability and understanding of the questionnaire. In the second step, 215 patients were included from January 2012 to June 2014: 125 in the Treatment group, and 90 in the Cured group. All domains exhibited good internal consistency, except the bowel domain (Cronbach's α = 0.61). No floor effect was observed. Test-retest reliability assessed in the Cured group was acceptable, except for bowel function (intraclass coefficient = 0.68). Criterion validity was good for each domain and subscale. Construct validity was not demonstrated for the hormonal and bowel domains. Sensitivity to change was exhibited for 5/8 subscales and 2/4 summary

  15. The validity and reliability of diagnoses of work-related mental ill-health.

    PubMed

    O'Neill, E; McNamee, R; Agius, R; Gittins, M; Hussey, L; Turner, S

    2008-11-01

    To establish the reliability and validity of work-related mental ill-health diagnoses. A UK-based surveillance scheme for work-related ill-health involving occupational physicians (OPs) reporting suggests that mental ill-health incidence is increasing by around 13% per year, with anxiety, depression and "other work-related stress" being the most common diagnoses. There have been no studies of the validity and reliability of such diagnoses. Given the existence of a large network of psychiatrists (PSYs) also involved in surveillance of work-related ill-health, an opportunity arose to measure the concurrent validity and reliability of work-related mental ill-health diagnoses. 100 anonymised summaries of cases previously reported by OPs or PSYs were collected; each was sent to 5 PSYs and 5 OPs, who assigned a diagnosis and judged whether the case was work-related. Concurrent validity of the ill-health aspect of the diagnoses, and of the opinion as to work-relatedness, was assessed by comparing the overall classifications of cases by OPs and PSYs. Reliability of the diagnostic classification was measured by kappa matrices. Diagnostic proportions for PSYs and OPs demonstrated good agreement for anxiety, depression, anxiety plus depression and "stress" (11%, 34%, 27%, 14%) and (14%, 30%, 27%, 17%), respectively. In both groups, kappa coefficients were high for a psychotic diagnosis (0.78, 95% CI: 0.74 to 0.83), but not as high for anxiety (0.27, 95% CI: 0.23 to 0.32), depression (0.34, 95% CI: 0.29 to 0.38) and "stress" (0.15, 95% CI: 0.10 to 0.19). The odds ratio of classifying a case as work-related among PSYs compared to OPs was 2.39 (95% CI: 1.68 to 3.38), p<0.001. The overall agreement between OPs and PSYs on mental ill-health diagnoses suggests that OP diagnoses are valid for epidemiological purposes. However, the within-group reliability of the diagnosis "stress" is low. Given differences in judgements about work-relatedness, further research is needed to

  16. Validity and reliability of sleep time questionnaires in children and adolescents: A systematic review and meta-analysis.

    PubMed

    Nascimento-Ferreira, Marcus V; Collese, Tatiana S; de Moraes, Augusto César F; Rendo-Urteaga, Tara; Moreno, Luis A; Carvalho, Heráclito B

    2016-12-01

    Sleep duration has been associated with several health outcomes in children and adolescents. As an extensive number of questionnaires are currently used to investigate sleep schedule or sleep time, we performed a systematic review of criterion validation of sleep time questionnaires for children and adolescents, considering accelerometers as the reference method. We found a strong correlation between questionnaires and accelerometers for weeknights and a moderate correlation for weekend nights. When considering only studies performing a reliability assessment of the used questionnaires, a significant increase in the correlations for both weeknights and weekend nights was observed. In conclusion, moderate to strong criterion validity of sleep time questionnaires was observed; however, the reliability assessment of the questionnaires showed strong validation performance. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. Factorial validity of the Delta Healthy Eating Attitudes Scale related to diet quality

    USDA-ARS's Scientific Manuscript database

    The objective of this study was to establish factorial validity of decisional balance (DB), self-efficacy (SE) and social support (SS) psychosocial constructs and convergent validity related to diet quality. A sample (N=319) of predominantly African American women from the Mississippi Communities fo...

  18. Reliability and validity of the Chinese CECA10 questionnaire for Chinese patients with condyloma acuminata

    PubMed Central

    Guo, Xinying; Wu, Xinjuan; Guo, Aimin; Zhao, Yanwei

    2018-01-01

    Condyloma acuminata (CA) is a sexually transmitted disease that affects quality of life (QOL). CECA10 is an English-language questionnaire for assessing QOL in patients with CA, but there is no equivalent in China. This study aimed to develop a validated and reliable Chinese version of CECA10. The Chinese CECA10 was developed from the English version by forward translation, back translation, comparison with the original, cultural adjustments, and a pre-test (5 patients). The Chinese CECA10 and EuroQol Five Dimensions Three Level Questionnaire (EQ-5D-3L) were administered to patients with CA. Content validity (item/scale content validity indexes, I-CVI/S-CVI), test-retest reliability (intraclass coefficient, ICC), internal consistency (Cronbach α), criterion validity (comparison with the Dermatology Life Quality Index, DLQI, using Spearman correlation analysis), construct validity (exploratory factor analysis), and discriminant validity (between subgroups based on number of warts, number of recurrences, or number of sites involved) were assessed. The Chinese CECA10 had good test-retest reliability (ICC = 0.98, P < .001), internal consistency (Cronbach α values of 0.88, 0.84, and 0.83 for the total questionnaire, psychological dimension, and sexual dimension, respectively), content validity (I-CVI = 1 for all items), and criterion validity (r = -0.50, P < .001). Exploratory factor analysis extracted 2 factors with a cumulative contribution of 61.75%; the factor loading with each item was >0.4. Discriminant validity was not high. The mean CECA10 and EQ-VAS scores of 211 patients with CA (28.19 ± 7.16 years; 139 males) were 34.56 ± 19.01 and 64.64 ± 19.28, respectively. The Chinese CECA10 has good reliability and validity for evaluating the QOL of Chinese patients with CA. PMID:29489693

  19. Convergent and Divergent Validity of the Learning Transfer System Inventory

    ERIC Educational Resources Information Center

    Holton, Elwood F., III; Bates, Reid A.; Bookter, Annette I.; Yamkovenko, V. Bogdan

    2007-01-01

    The Learning Transfer System Inventory (LTSI) was developed to identify a select set of factors with the potential to substantially enhance or inhibit transfer of learning to the work environment. It has undergone a variety of validation studies, including construct, criterion, and crosscultural studies. However, the convergent and divergent…

  20. Fracture mechanics validity limits

    NASA Technical Reports Server (NTRS)

    Lambert, Dennis M.; Ernst, Hugo A.

    1994-01-01

    Fracture behavior is characteristic of a dramatic loss of strength compared to elastic deformation behavior. Fracture parameters have been developed and exhibit a range within which each is valid for predicting growth. Each is limited by the assumptions made in its development: all are defined within a specific context. For example, the stress intensity parameter, K, and the crack driving force, G, are derived using an assumption of linear elasticity. To use K or G, the zone of plasticity must be small compared to the physical dimensions of the object being loaded. This ensures an elastic response, and in this context, K and G will work well. Rice's J-integral has been used beyond the limits imposed on K and G. J requires an assumption of nonlinear elasticity, which is not characteristic of real material behavior, but is thought to be a reasonable approximation if unloading is kept to a minimum. As well, the constraint cannot change dramatically (typically, the crack extension is limited to ten percent of the initial remaining ligament length). Rice et al. investigated the properties required of J-type parameters, J(sub x), and showed that the time rate, dJ(sub x)/dt, must not be a function of the crack extension rate, da/dt. Ernst devised the modified-J parameter, J(sub M), that meets this criterion. J(sub M) correlates fracture data to much higher crack growth than does J. Ultimately, a limit of the validity of J(sub M) is anticipated, and this has been estimated to be at a crack extension of about 40 percent of the initial remaining ligament length. None of the various parameters can be expected to describe fracture in an environment of gross plasticity, in which case the process is better described by deformation parameters, e.g., stress and strain. In the current study, various schemes to identify the onset of the plasticity-dominated behavior, i.e., the end of fracture mechanics validity, are presented. Each validity limit parameter is developed in

  1. Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese.

    PubMed

    Shinga-Ishihara, Chikako; Nakai, Yukie; Milgrom, Peter; Murakami, Kaori; Matsumoto-Nakano, Michiyo

    2014-01-02

    Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman's correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years by using the intraclass correlation coefficient analysis. The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that construct the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency was low to acceptable (Cronbach's alpha = 0.67 for the total scale, 0.46-0.61 for each subscale). Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, 47.4

  2. Criterion validity of a Wechsler-III Scale Short Form in a sample of Brazilian elderly

    PubMed Central

    Banhato, Eliane Ferreira Carvalho; Leite, Isabel Cristina Gonçalves; Guedes, Danielle Viveiros; Chaoubah, Alfredo

    2010-01-01

    Although a normative process, changes in cognitive functioning vary among older adults. The differential diagnosis between normal and pathological aging must be made early using psychometrically adequate measures. Objectives: To assess the evidence of criterion validity of a Short Form (SF) of the Wechsler-III Scale containing eight subtests (SF8) by determining its sensitivity, specificity, positive and negative predictive values and cut-off points for Brazilian elderly from different age groups. Methods: 168 individuals, aged 60 years or above, living in the community or in an institution, were assigned to case and control groups, and investigated according to age range. Measures included a sociodemographic questionnaire, the Mini-Mental State Examination (MMSE), Verbal Fluency Test, Clock-Drawing Test and the SF8. Results: More than two thirds of the sample was women (73.8%), mean age was 74.5 years (SD=8.9), mean education was 6.2 years (SD=4.8) and 40.5% were widows/widowers. In the total sample, the best cut-off point for the SF8 was 142 while cut offs among individuals aged 60 to 69 years, 70 to 79 years, and more than 80 years were 160, 129 and 129, respectively. Conclusions: The results demonstrated the importance of different cut-off points for different age ranges. Sensitivity and specificity values of the SF8 were sufficiently high to warrant the use of the SF8 as an instrument to identify cognitive impairment in the elderly. PMID:29213688
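    As a rough illustration of the quantities named in this abstract, the sketch below computes sensitivity, specificity, and positive and negative predictive values for a single cut-off. The cut-off of 142 is taken from the abstract; the scores, the case labels, the function name `diagnostic_metrics`, and the rule that a score at or below the cut-off counts as test-positive are assumptions made for the example and are not details reported by the study.

```python
def diagnostic_metrics(scores, is_case, cutoff):
    """Compare a cut-off rule against reference case/control labels.

    Assumption for this sketch: a score at or below `cutoff` is test-positive.
    """
    tp = fp = tn = fn = 0
    for score, case in zip(scores, is_case):
        positive = score <= cutoff
        if positive and case:
            tp += 1
        elif positive and not case:
            fp += 1
        elif not positive and case:
            fn += 1
        else:
            tn += 1

    def ratio(numerator, denominator):
        # Return None when a metric is undefined (empty denominator).
        return numerator / denominator if denominator else None

    return {
        "sensitivity": ratio(tp, tp + fn),  # cases correctly flagged
        "specificity": ratio(tn, tn + fp),  # controls correctly cleared
        "ppv": ratio(tp, tp + fp),          # flagged people who are cases
        "npv": ratio(tn, tn + fn),          # cleared people who are controls
    }


# Hypothetical scores: four cases followed by four controls.
example_scores = [120, 135, 150, 128, 165, 170, 140, 155]
example_labels = [True, True, True, True, False, False, False, False]
metrics = diagnostic_metrics(example_scores, example_labels, cutoff=142)
print(metrics)
```

    Repeating the call over a range of cut-offs, separately for each age group, is one way to see why a single threshold may not suit every age range.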

  3. Criterion validity of a Wechsler-III Scale Short Form in a sample of Brazilian elderly.

    PubMed

    Banhato, Eliane Ferreira Carvalho; Leite, Isabel Cristina Gonçalves; Guedes, Danielle Viveiros; Chaoubah, Alfredo

    2010-01-01

    Although a normative process, changes in cognitive functioning vary among older adults. The differential diagnosis between normal and pathological aging must be made early using psychometrically adequate measures. To assess the evidence of criterion validity of a Short Form (SF) of the Wechsler-III Scale containing eight subtests (SF8) by determining its sensitivity, specificity, positive and negative predictive values and cut-off points for Brazilian elderly from different age groups. 168 individuals, aged 60 years or above, living in the community or in an institution, were assigned to case and control groups, and investigated according to age range. Measures included a sociodemographic questionnaire, the Mini-Mental State Examination (MMSE), Verbal Fluency Test, Clock-Drawing Test and the SF8. More than two thirds of the sample was women (73.8%), mean age was 74.5 years (SD=8.9), mean education was 6.2 years (SD=4.8) and 40.5% were widows/widowers. In the total sample, the best cut-off point for the SF8 was 142 while cut offs among individuals aged 60 to 69 years, 70 to 79 years, and more than 80 years were 160, 129 and 129, respectively. The results demonstrated the importance of different cut-off points for different age ranges. Sensitivity and specificity values of the SF8 were sufficiently high to warrant the use of the SF8 as an instrument to identify cognitive impairment in the elderly.

  4. Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka.

    PubMed

    de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon

    2016-01-01

    Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach's α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. © 2015 APJPH.

  5. Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka

    PubMed Central

    de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon

    2016-01-01

    Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach’s α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. PMID:26712893

  6. Validity of field expedient devices to assess core temperature during exercise in the cold.

    PubMed

    Bagley, James R; Judelson, Daniel A; Spiering, Barry A; Beam, William C; Bartolini, J Albert; Washburn, Brian V; Carney, Keven R; Muñoz, Colleen X; Yeargin, Susan W; Casa, Douglas J

    2011-12-01

    Exposure to cold environments affects human performance and physiological function. Major medical organizations recommend rectal temperature (T(REC)) to evaluate core body temperature (T(CORE)) during exercise in the cold; however, other field expedient devices claim to measure T(CORE). The purpose of this study was to determine if field expedient devices provide valid measures of T(CORE) during rest and exercise in the cold. Participants included 13 men and 12 women (age = 24 +/- 3 yr, height = 170.7 +/- 10.6 cm, mass = 73.4 +/- 16.7 kg, body fat = 18 +/- 7%) who reported being healthy and at least recreationally active. During 150 min of cold exposure, subjects sequentially rested for 30 min, cycled for 90 min (heart rate = 120-140 bpm), and rested for an additional 30 min. Investigators compared aural (T(AUR)), expensive axillary (T(AXLe)), inexpensive axillary (T(AXLi)), forehead (T(FOR)), gastrointestinal (T(GI)), expensive oral (T(ORLe)), inexpensive oral (T(ORLi)), and temporal (T(TEM)) temperatures to T(REC) every 15 min. Researchers used mean difference between each device and T(REC) (i.e., mean bias) as the primary criterion for validity. T(AUR), T(AXLe), T(AXLi), T(FOR), T(ORLe), T(ORLi), and T(TEM) provided significantly lower measures compared to T(REC) and fell below our validity criterion. T(GI) significantly exceeded T(REC) at three of eleven time points, but no significant difference existed between mean T(REC) and T(GI) across time. Only T(GI) achieved our validity criterion and compared favorably to T(REC). T(GI) offers a valid measurement with which to assess T(CORE) during rest and exercise in the cold; athletic trainers, mountain rescuers, and military medical personnel should avoid other field expedient devices in similar conditions.
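    The mean-bias criterion described in this abstract can be sketched in a few lines. The readings below are invented, the function names are hypothetical, and the acceptance limit is left as a caller-supplied argument because the abstract does not state the threshold the authors used.

```python
from statistics import mean


def mean_bias(device, reference):
    """Mean of paired differences (device minus reference)."""
    if len(device) != len(reference):
        raise ValueError("device and reference must be paired readings")
    return mean(d - r for d, r in zip(device, reference))


def within_limit(bias, limit):
    """True when the absolute mean bias does not exceed `limit`.

    `limit` is an assumption supplied by the caller; it is not from the study.
    """
    return abs(bias) <= limit


# Hypothetical readings in degrees Celsius at four time points.
t_rec = [37.0, 37.2, 37.4, 37.5]   # reference (rectal)
t_gi = [37.1, 37.2, 37.5, 37.5]    # gastrointestinal device
t_oral = [36.0, 36.1, 36.5, 36.4]  # oral device

for name, readings in (("T(GI)", t_gi), ("T(ORL)", t_oral)):
    bias = mean_bias(readings, t_rec)
    print(name, round(bias, 3), within_limit(bias, limit=0.3))
```

    A negative bias means the device reads lower than the reference, which is the pattern the abstract reports for every device except T(GI).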

  7. SANSMIC Validation.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Weber, Paula D.; Rudeen, David Keith; Lord, David L.

    2014-08-01

    SANSMIC is solution mining software that was developed and utilized by SNL in its role as geotechnical advisor to the US DOE SPR for planning purposes. Three SANSMIC leach modes - withdrawal, direct, and reverse leach - have been revalidated with multiple test cases for each mode. The withdrawal mode was validated using high quality data from recent leach activity while the direct and reverse modes utilized data from historical cavern completion reports. Withdrawal results compared very well with observed data, including the location and size of shelves due to string breaks with relative leached volume differences ranging from 6 - 10% and relative radius differences from 1.5 - 3%. Profile comparisons for the direct mode were very good with relative leached volume differences ranging from 6 - 12% and relative radius differences from 5 - 7%. First, second, and third reverse configurations were simulated in order to validate SANSMIC over a range of relative hanging string and OBI locations. The first-reverse was simulated reasonably well with relative leached volume differences ranging from 1 - 9% and relative radius differences from 5 - 12%. The second-reverse mode showed the largest discrepancies in leach profile. Leached volume differences ranged from 8 - 12% and relative radius differences from 1 - 10%. In the third-reverse, relative leached volume differences ranged from 10 - 13% and relative radius differences were ~4%. Comparisons to historical reports were quite good, indicating that SANSMIC is essentially the same as documented and validated in the early 1980's.

  8. Development and validation of a knowledge test for health professionals regarding lifestyle modification.

    PubMed

    Talip, Whadi-ah; Steyn, Nelia P; Visser, Marianne; Charlton, Karen E; Temple, Norman

    2003-09-01

    We wanted to develop and validate a test that assesses the knowledge and practices of health professionals (HPs) with regard to the role of nutrition, physical activity, and smoking cessation (lifestyle modification) in chronic diseases of lifestyle. A descriptive cross-sectional validation study was carried out. The validation design consisted of two phases, namely 1) test planning and development and 2) test evaluation. The study sample consisted of five groups of HPs: dietitians, dietetic interns, general practitioners, medical students, and nurses. The overall response rate was 58%, resulting in a sample size of 186 participants. A test was designed to evaluate the knowledge and practices of HPs. The test was first evaluated by an expert group to ensure content, construct, and face validity. Thereafter, the questionnaire was tested on five groups of HPs to test for criterion validity. Internal consistency was evaluated by Cronbach's alpha. An expert panel ensured content, construct, and face validity of the test. Groups with the most training and exposure to nutrition (dietitians and dietetic interns) had the highest group mean score, ranging from 61% to 88%, whereas those with limited nutrition training (general practitioners, medical students, and nurses) had significantly lower scores, ranging from 26% to 80%. This result demonstrated criterion validity. Internal consistency of the overall test demonstrated a Cronbach's alpha of 0.99. Most HPs identified the mass media as their main source of information on lifestyle modification. These HPs also identified lack of time, lack of patient compliance, and lack of knowledge as barriers that prevent them from providing counseling on lifestyle modification. The results of this study showed that this test instrument identifies groups of health professionals with adequate training (knowledge) in lifestyle modification and those who require further training (knowledge).
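    Several records in this listing report internal consistency as Cronbach's alpha. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), follows. The response matrix is invented and the function name `cronbach_alpha` is hypothetical; nothing here reproduces the study's data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    n_items = len(responses[0])
    if n_items < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_variances = [
        variance([row[i] for row in responses]) for i in range(n_items)
    ]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical data: five respondents answering three items.
example = [
    [3, 4, 3],
    [4, 5, 4],
    [2, 2, 3],
    [5, 5, 5],
    [1, 2, 1],
]
print(round(cronbach_alpha(example), 3))
```

    Values very close to 1, such as the 0.99 reported here, can also indicate that many items are near-duplicates of each other, so alpha is usually read alongside the item content.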

  9. Construct validity of the Korean Women's Abuse Intolerance Scale.

    PubMed

    Choi, Myunghan; Phillips, Linda R; Figueredo, Aurelio José; Insel, Katheleen; Min, Sung-Kil

    2008-01-01

    Domestic violence against married women has persisted throughout Korean history. However, very little empirical research has been conducted in Korea about domestic violence, its causes, or women's responses. To develop and test psychometrically the Korean Women's Abuse Intolerance Scale (KWAIS) to measure women's propensity or desire to leave abusive husbands in Korea. The first phase of the investigation involved qualitative research to explore the themes of women's responses to domestic violence and the development of the instrument. The second phase was a preliminary study conducted to examine women's responses to domestic violence. In the third phase, construct validity of the scale was established, using a sample of 184 married women living in Korea. The KWAIS adequately supported the underlying theory of women's responses to domestic violence, demonstrating strong content validity, high internal consistency (Cronbach's alpha of .98), and criterion-related validity evidenced by significant correlations that supported hypotheses among abuse intolerance and abuse (r = .69), traditional family ideology (r = -.78), marital satisfaction (r = -.85), attitude toward power ascription (r = .63), and collectivism (r = -.88). Factor analysis yielded a four-factor structure, explaining 78.4% of the common variance. Factor loadings ranged from .65 to .93. The findings for the psychometric properties of the KWAIS established its potential as a research instrument in measuring Korean women's propensity or desire to leave abusive husbands. Future studies need to focus on determining the predictive validity of the KWAIS and evaluating cross-cultural differences in women's propensity or desire to leave abusive husbands.

  10. [Validity and reliability of Korean version of the Family Management Measure (Korean FaMM) for families with children having chronic illness].

    PubMed

    Kim, Dong Hee; Im, Yeo Jin

    2013-02-01

    To develop and test the validity and reliability of the Korean version of the Family Management Measure (Korean FaMM) to assess applicability for families with children having chronic illnesses. The Korean FaMM was articulated through forward-backward translation methods. Internal consistency reliability, construct and criterion validity were calculated using PASW WIN (19.0) and AMOS (20.0). Survey data were collected from 341 mothers of children suffering from chronic disease enrolled in a university hospital in Seoul, South Korea. The Korean version of FaMM showed reliable internal consistency with Cronbach's alpha for the total scale of .69-.91. Factor loadings of the 53 items on the six sub-scales ranged from 0.28-0.84. The model of six subscales for the Korean FaMM was validated by exploratory and confirmatory factor analysis (χ²<.001, RMR<.05, GFI, AGFI, NFI, NNFI>.08). Criterion validity compared to the Parental Stress Index (PSI) showed significant correlation. The findings of this study demonstrate that the Korean FaMM showed satisfactory construct and criterion validity and reliability. It is useful to measure Korean family's management style with their children who have a chronic illness.

  11. Parental self-efficacy in childhood overweight: validation of the Lifestyle Behavior Checklist in the Netherlands.

    PubMed

    Gerards, Sanne M P L; Hummel, Karin; Dagnelie, Pieter C; de Vries, Nanne K; Kremers, Stef P J

    2013-01-18

    Evaluating whether parental challenges and self-efficacy toward managing children's lifestyle behaviors are successfully addressed by interventions requires valid instruments. The Lifestyle Behavior Checklist (LBC) has recently been developed in the Australian context. It consists of two subscales: the Problem scale, which measures parental perceptions of children's behavioral problems related to overweight and obesity, and the Confidence scale, measuring parental self-efficacy in dealing with these problems. The aim of the current study was to systematically translate the questionnaire into Dutch and to evaluate its internal consistency, construct validity and test-retest reliability. The LBC was systematically translated by four experts at Maastricht University. In total, 392 parents of 3- to 13-year-old children were invited to fill out two successive online questionnaires with a two-week interval. Of these, 273 parents responded to the first questionnaire (test, response rate = 69.6%), and of the 202 who could be invited for the second questionnaire (retest), 100 responded (response rate = 49.5%). We assessed the questionnaire's internal consistency (Cronbach's α), construct validity (Spearman's Rho correlation tests, using the criterion measures: restrictiveness, nurturance, and psychological control), and test-retest reliability (Spearman's Rho correlation tests). Both scales had high internal consistency (Cronbach's α ≥ 0.90). Spearman correlation coefficients indicated acceptable test-retest reliability for both the Problem scale (rs = 0.74) and the Confidence scale (rs = 0.70). The LBC Problem scale was significantly correlated to all criterion scales (nurturance, restrictiveness, psychological control) in the hypothesized direction, and the LBC Confidence scale was significantly correlated with nurturance and psychological control in the hypothesized direction, but not with restrictiveness. The Dutch translation of the LBC was found to be a reliable and

  12. Construct Validation of a Multidimensional Computerized Adaptive Test for Fatigue in Rheumatoid Arthritis

    PubMed Central

    Nikolaus, Stephanie; Bode, Christina; Taal, Erik; Vonkeman, Harald E.; Glas, Cees A. W.; van de Laar, Mart A. F. J.

    2015-01-01

    Objective: Multidimensional computerized adaptive testing enables precise measurements of patient-reported outcomes at an individual level across different dimensions. This study examined the construct validity of a multidimensional computerized adaptive test (CAT) for fatigue in rheumatoid arthritis (RA). Methods: The ‘CAT Fatigue RA’ was constructed based on a previously calibrated item bank. It contains 196 items and three dimensions: ‘severity’, ‘impact’ and ‘variability’ of fatigue. The CAT was administered to 166 patients with RA. They also completed a traditional, multidimensional fatigue questionnaire (BRAF-MDQ) and the SF-36 in order to examine the CAT’s construct validity. A priori criterion for construct validity was that 75% of the correlations between the CAT dimensions and the subscales of the other questionnaires were as expected. Furthermore, comprehensive use of the item bank, measurement precision and score distribution were investigated. Results: The a priori criterion for construct validity was supported for two of the three CAT dimensions (severity and impact but not for variability). For severity and impact, 87% of the correlations with the subscales of the well-established questionnaires were as expected but for variability, 53% of the hypothesised relations were found. Eighty-nine percent of the items were selected between one and 137 times for CAT administrations. Measurement precision was excellent for the severity and impact dimensions, with more than 90% of the CAT administrations reaching a standard error below 0.32. The variability dimension showed good measurement precision with 90% of the CAT administrations reaching a standard error below 0.44. No floor- or ceiling-effects were found for the three dimensions. Conclusion: The CAT Fatigue RA showed good construct validity and excellent measurement precision on the dimensions severity and impact. The dimension variability had less ideal measurement characteristics

  13. Criterion validity and clinical usefulness of Attention Deficit Hyperactivity Disorder Rating Scale IV in attention deficit hyperactivity disorder (ADHD) as a function of method and age.

    PubMed

    López-Villalobos, José A; Andrés-De Llano, Jesús; López-Sánchez, María V; Rodríguez-Molinero, Luis; Garrido-Redondo, Mercedes; Sacristán-Martín, Ana M; Martínez-Rivera, María T; Alberola-López, Susana

    2017-02-01

    The aim of this research is to analyze Attention Deficit Hyperactivity Disorder Rating Scales IV (ADHD RS-IV) criterion validity and its clinical usefulness for the assessment of Attention Deficit Hyperactivity Disorder (ADHD) as a function of assessment method and age. A sample was obtained from an epidemiological study (n = 1095, 6-16 years). Clinical cases of ADHD (ADHD-CL) were selected by dimensional ADHD RS-IV and later by clinical interview (DSM-IV). ADHD-CL cases were compared with four categorical results of ADHD RS-IV provided by parents (CATPA), teachers (CATPR), either parents or teachers (CATPAOPR) and both parents and teachers (CATPA&PR). Criterion validity and clinical usefulness of the answer modalities to ADHD RS-IV were studied. ADHD-CL rate was 6.9% in childhood, 6.2% in preadolescence and 6.9% in adolescence. Alternative methods to the clinical interview led to increased numbers of ADHD cases in all age groups analyzed, in the following sequence: CATPAOPR > CATPR > CATPA > CATPA&PR > ADHD-CL. CATPA&PR was the procedure with the greatest validity, specificity and clinical usefulness in all three age groups, particularly in childhood. Isolated use of ADHD RS-IV leads to an increase in ADHD cases compared to clinical interview, and varies depending on the procedure used.

  14. Validity of the occupational sitting and physical activity questionnaire.

    PubMed

    Chau, Josephine Y; Van Der Ploeg, Hidde P; Dunn, Scott; Kurko, John; Bauman, Adrian E

    2012-01-01

    Sitting at work is an emerging occupational health risk. Few instruments designed for use in population-based research measure occupational sitting and standing as distinct behaviors. This study aimed to develop and validate a brief measure of occupational sitting and physical activity. A convenience sample (n = 99, 61% female) was recruited from two medium-sized workplaces and by word-of-mouth in Sydney, Australia. Participants completed the newly developed Occupational Sitting and Physical Activity Questionnaire (OSPAQ) and a modified version of the MONICA Optional Study on Physical Activity Questionnaire (modified MOSPA-Q) twice, 1 wk apart. Participants also wore an ActiGraph accelerometer for the 7 d in between the test and retest. Analyses determined test-retest reliability with intraclass correlation coefficients and assessed criterion validity against accelerometers using the Spearman ρ. The test-retest intraclass correlation coefficients for occupational sitting, standing, and walking for OSPAQ ranged from 0.73 to 0.90, while that for the modified MOSPA-Q ranged from 0.54 to 0.89. Comparison of sitting measures with accelerometers showed higher Spearman correlations for the OSPAQ (r = 0.65) than for the modified MOSPA-Q (r = 0.52). Criterion validity correlations for occupational standing and walking measures were comparable for both instruments with accelerometers (standing: r = 0.49; walking: r = 0.27-0.29). The OSPAQ has excellent test-retest reliability and moderate validity for estimating time spent sitting and standing at work and is comparable to existing occupational physical activity measures for assessing time spent walking at work. The OSPAQ brief instrument measures sitting and standing at work as distinct behaviors and would be especially suitable in national health surveys, prospective cohort studies, and other studies that are limited by space constraints for questionnaire items.
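    Criterion validity in this record is a Spearman rank correlation between self-report and accelerometer values. The sketch below shows one common way to compute it: convert each variable to ranks (averaging ranks for ties) and take the Pearson correlation of the ranks. The minutes-per-day figures and all function names are made up for illustration.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        tied_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = tied_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman_rho(x, y):
    """Spearman correlation as the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical sitting time (minutes per workday) for five participants.
questionnaire = [300, 420, 360, 480, 240]
accelerometer = [310, 380, 400, 450, 200]
print(round(spearman_rho(questionnaire, accelerometer), 2))
```

    Because only the ordering matters, the coefficient is unaffected by a questionnaire that systematically over- or under-reports minutes, which is why it is often paired with a measure of absolute agreement.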

  15. Factor structure and criterion validity across the full scale and ten short forms of the CES-D among Chinese adolescents.

    PubMed

    Yang, Wenhui; Xiong, Ge; Garrido, Luis Eduardo; Zhang, John X; Wang, Meng-Cheng; Wang, Chong

    2018-04-16

    We systematically examined the factor structure and criterion validity across the full scale and 10 short forms of the Center for Epidemiological Studies Depression Scale (CES-D) with Chinese youth. Participants were 5,434 Chinese adolescents in Grades 7 to 12 who completed the full CES-D; 612 of them further completed a structured diagnostic interview with the major depressive disorder (MDD) module of the Kiddie Schedule for Affective Disorder and Schizophrenia for School-age Children. Using a split-sample approach, a series of 4-, 3-, 2-, and 1-factor models were tested using exploratory structural equation modeling and cross-validated using confirmatory factor analysis; the dimensionality was also evaluated by parallel analysis in conjunction with the scree test and aided by factor mixture analysis. The results indicated that a single-factor model of depression with a wording method factor fitted the data well, and was the optimal structure underlying the scores of the full and shortened CES-D. Additionally, receiver operating characteristic curve analyses for MDD case detection showed that the CES-D full-scale scores accurately detected MDD youth (area under the curve [AUC] = .84). Furthermore, the short-form scores produced comparable AUCs with the full scale (.82 to .85), as well as similar levels of sensitivity and specificity when using optimal cutoffs. These findings suggest that depression among Chinese adolescents can be adequately measured and screened for by a single-factor structure underlying the CES-D scores, and that the short forms provide a viable alternative to the full instrument. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
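    The area under the ROC curve reported here has a simple interpretation: it is the probability that a randomly chosen case scores higher than a randomly chosen non-case, with ties counted as one half. The sketch below computes it directly from that definition. The scores are invented and the function name `auc_from_scores` is hypothetical; this is not the procedure or software the authors used.

```python
def auc_from_scores(case_scores, noncase_scores):
    """AUC as the share of case/non-case pairs in which the case scores higher.

    Ties contribute 0.5, matching the Mann-Whitney formulation.
    """
    if not case_scores or not noncase_scores:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for case in case_scores:
        for noncase in noncase_scores:
            if case > noncase:
                wins += 1.0
            elif case == noncase:
                wins += 0.5
    return wins / (len(case_scores) * len(noncase_scores))


# Hypothetical screening scores for interviewed participants.
cases = [30, 25, 22, 18]
noncases = [10, 15, 20, 22, 8]
print(auc_from_scores(cases, noncases))
```

    The pairwise loop is quadratic in sample size, which is fine for a sketch; rank-based formulas give the same value more efficiently on large samples.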

  16. Alberta infant motor scale: reliability and validity when used on preterm infants in Taiwan.

    PubMed

    Jeng, S F; Yau, K I; Chen, L C; Hsiao, S F

    2000-02-01

    The goal of this study was to examine the reliability and validity of measurements obtained with the Alberta Infant Motor Scale (AIMS) for evaluation of preterm infants in Taiwan. Two independent groups of preterm infants were used to investigate the reliability (n=45) and validity (n=41) for the AIMS. In the reliability study, the AIMS was administered to the infants by a physical therapist, and infant performance was videotaped. The performance was then rescored by the same therapist and by 2 other therapists to examine the intrarater and interrater reliability. In the validity study, the AIMS and the Bayley Motor Scale were administered to the infants at 6 and 12 months of age to examine criterion-related validity. Intraclass correlation coefficients (ICCs) for intrarater and interrater reliability of measurements obtained with the AIMS were high (ICC=.97-.99). The AIMS scores correlated with the Bayley Motor Scale scores at 6 and 12 months (r=.78 and .90), although the AIMS scores at 6 months were only moderately predictive of the motor function at 12 months (r=.56). The results suggest that measurements obtained with the AIMS have acceptable reliability and concurrent validity but limited predictive value for evaluating preterm Taiwanese infants.

  17. A new test for the assessment of working memory in clinical settings: Validation and norming of a month ordering task.

    PubMed

    Buekenhout, Imke; Leitão, José; Gomes, Ana A

    2018-05-24

    Month ordering tasks have been used in experimental settings to obtain measures of working memory (WM) capacity in older/clinical groups based solely on their face validity. We sought to assess the appropriateness of using a month ordering task in other contexts, including clinical settings, as a psychometrically sound WM assessment. To this end, we constructed a month ordering task (ucMOT), studied its reliability (internal consistency and temporal stability), and gathered construct-related and criterion-related validity evidence for its use as a WM assessment. The ucMOT proved to be internally consistent and temporally stable, and analyses of the criterion-related validity evidence revealed that its scores predicted the efficiency of language comprehension processes known to depend crucially on WM resources, namely, processes involved in pronoun interpretation. Furthermore, all ucMOT items discriminated between younger and older age groups; the global scores were significantly correlated with scores on well-established WM tasks and presented lower correlations with instruments that evaluate different (although related) processes, namely, inhibition and processing speed. We conclude that the ucMOT possesses solid psychometric properties. Accordingly, we acquired normative data for the Portuguese population, which we present as a regression-based algorithm that yields z scores adjusted for age, gender, and years of formal education. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  18. Validation of the Gifted Rating Scales–School Form in China

    PubMed Central

    Li, Huijun; Pfeiffer, Steven I.; Petscher, Yaacov; Kumtepe, Alper T.; Mo, Guofang

    2015-01-01

    The Gifted Rating Scales–School Form (GRS-S), a teacher-completed rating scale, is designed to identify five types of giftedness and motivation. This study examines the reliability and validity of a Chinese-translated version of the GRS-S with a sample of Chinese elementary and middle school students (N = 499). The Chinese GRS-S was found to have high internal consistency. Results of the confirmatory factor analysis corroborated the six-factor solution of the original GRS-S. Comparison of the GRS-S scores and measures of academic performance provide preliminary support for the criterion validity of the Chinese-translated GRS-S. Significant age and gender differences on the Chinese GRS-S were found. Results provide preliminary support for the Chinese version of the GRS-S as a reliable and valid measure of giftedness for Chinese students. PMID:26346730

  19. High resolution ion chamber array delivery quality assurance for robotic radiosurgery: Commissioning and validation.

    PubMed

    Blanck, Oliver; Masi, Laura; Chan, Mark K H; Adamczyk, Sebastian; Albrecht, Christian; Damme, Marie-Christin; Loutfi-Krauss, Britta; Alraun, Manfred; Fehr, Roman; Ramm, Ulla; Siebert, Frank-Andre; Stelljes, Tenzin Sonam; Poppinga, Daniela; Poppe, Björn

    2016-06-01

    High precision radiosurgery demands comprehensive delivery-quality-assurance techniques. The use of a liquid-filled ion-chamber-array for robotic-radiosurgery delivery-quality-assurance was investigated and validated using several test scenarios and routine patient plans. Preliminary evaluation consisted of beam profile validation and analysis of source-detector-distance and beam-incidence-angle response dependence. The delivery-quality-assurance analysis is performed in four steps: (1) Array-to-plan registration, (2) Evaluation with standard Gamma-Index criteria (local-dose-difference⩽2%, distance-to-agreement⩽2mm, pass-rate⩾90%), (3) Dose profile alignment and dose distribution shift until maximum pass-rate is found, and (4) Final evaluation with 1mm distance-to-agreement criterion. Test scenarios consisted of intended phantom misalignments, dose miscalibrations, and undelivered Monitor Units. Preliminary method validation was performed on 55 clinical plans in five institutions. The 1000SRS profile measurements showed sufficient agreement compared with a microDiamond detector for all collimator sizes. The relative response can change by up to 2.2% per 10cm source-detector-distance change, but remains within 1% for the clinically relevant source-detector-distance range. Planned and measured dose under different beam-incidence-angles showed deviations below 1% for angles between 0° and 80°. Small intended errors were detected by the 1mm distance-to-agreement criterion, while the 2mm criterion failed to reveal some of these deviations. All analyzed delivery-quality-assurance clinical patient plans were within our tight tolerance criteria. We demonstrated that a high-resolution liquid-filled ion-chamber-array can be suitable for robotic radiosurgery delivery-quality-assurance and that small errors can be detected with a tight distance-to-agreement criterion. Further improvement may come from beam specific correction for incidence angle and source

  20. Validation of the Proficiency Examination for Diagnostic Radiologic Technology. Final Report.

    ERIC Educational Resources Information Center

    Educational Testing Service, Princeton, NJ.

    The validity of the Proficiency Examination for Diagnostic Radiologic Technology was investigated, using 140 radiologic technologists who took both the written Proficiency Examination and a performance test. As an additional criterion measure of job proficiency, supervisors' assessments were obtained for 128 of the technologists. The resulting…

  1. British isles lupus assessment group 2004 index is valid for assessment of disease activity in systemic lupus erythematosus

    PubMed Central

    Yee, Chee-Seng; Farewell, Vernon; Isenberg, David A; Rahman, Anisur; Teh, Lee-Suan; Griffiths, Bridget; Bruce, Ian N; Ahmad, Yasmeen; Prabu, Athiveeraramapandian; Akil, Mohammed; McHugh, Neil; D'Cruz, David; Khamashta, Munther A; Maddison, Peter; Gordon, Caroline

    2007-01-01

    Objective To determine the construct and criterion validity of the British Isles Lupus Assessment Group 2004 (BILAG-2004) index for assessing disease activity in systemic lupus erythematosus (SLE). Methods Patients with SLE were recruited into a multicenter cross-sectional study. Data on SLE disease activity (scores on the BILAG-2004 index, Classic BILAG index, and Systemic Lupus Erythematosus Disease Activity Index 2000 [SLEDAI-2K]), investigations, and therapy were collected. Overall BILAG-2004 and overall Classic BILAG scores were determined by the highest score achieved in any of the individual systems in the respective index. Erythrocyte sedimentation rates (ESRs), C3 levels, C4 levels, anti–double-stranded DNA (anti-dsDNA) levels, and SLEDAI-2K scores were used in the analysis of construct validity, and increase in therapy was used as the criterion for active disease in the analysis of criterion validity. Statistical analyses were performed using ordinal logistic regression for construct validity and logistic regression for criterion validity. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Results Of the 369 patients with SLE, 92.7% were women, 59.9% were white, 18.4% were Afro-Caribbean and 18.4% were South Asian. Their mean ± SD age was 41.6 ± 13.2 years and mean disease duration was 8.8 ± 7.7 years. More than 1 assessment was obtained on 88.6% of the patients, and a total of 1,510 assessments were obtained. Increasing overall scores on the BILAG-2004 index were associated with increasing ESRs, decreasing C3 levels, decreasing C4 levels, elevated anti-dsDNA levels, and increasing SLEDAI-2K scores (all P < 0.01). Increase in therapy was observed more frequently in patients with overall BILAG-2004 scores reflecting higher disease activity. Scores indicating active disease (overall BILAG-2004 scores of A and B) were significantly associated with increase in therapy (odds ratio [OR] 19.3, P

  2. The revised Generalized Expectancy for Success Scale: a validity and reliability study.

    PubMed

    Hale, W D; Fiedler, L R; Cochran, C D

    1992-07-01

    The Generalized Expectancy for Success Scale (GESS; Fibel & Hale, 1978) was revised and assessed for reliability and validity. The revised version was administered to 199 college students along with other conceptually related measures, including the Rosenberg Self-Esteem Scale, the Life Orientation Test, and Rotter's Internal-External Locus of Control Scale. One subsample of students also completed the Eysenck Personality Inventory, while another subsample performed a criterion-related task that involved risk taking. Item analysis yielded 25 items with correlations of .45 or higher with the total score. Results indicated high internal consistency and test-retest reliability.

  3. Validity, Responsiveness, Minimal Detectable Change, and Minimal Clinically Important Change of "Pediatric Balance Scale" in Children with Cerebral Palsy

    ERIC Educational Resources Information Center

    Chen, Chia-ling; Shen, I-hsuan; Chen, Chung-yao; Wu, Ching-yi; Liu, Wen-Yu; Chung, Chia-ying

    2013-01-01

    This study examined criterion-related validity and clinimetric properties of the pediatric balance scale ("PBS") in children with cerebral palsy (CP). Forty-five children with CP (age range: 19-77 months) and their parents participated in this study. At baseline and at follow up, Pearson correlation coefficients were used to determine…

  4. Validation of Diagnostic Measures Based on Latent Class Analysis: A Step Forward in Response Bias Research

    ERIC Educational Resources Information Center

    Thomas, Michael L.; Lanyon, Richard I.; Millsap, Roger E.

    2009-01-01

    The use of criterion group validation is hindered by the difficulty of classifying individuals on latent constructs. Latent class analysis (LCA) is a method that can be used for determining the validity of scales meant to assess latent constructs without such a priori classifications. The authors used this method to examine the ability of the L…

  5. Construct Validity: Advances in Theory and Methodology

    PubMed Central

    Strauss, Milton E.; Smith, Gregory T.

    2008-01-01

    Measures of psychological constructs are validated by testing whether they relate to measures of other constructs as specified by theory. Each test of relations between measures reflects on the validity of both the measures and the theory driving the test. Construct validation concerns the simultaneous process of measure and theory validation. In this chapter, we review the recent history of validation efforts in clinical psychological science that has led to this perspective, and we review five recent advances in validation theory and methodology of importance for clinical researchers. These are: the emergence of nonjustificationist philosophy of science; an increasing appreciation for theory and the need for informative tests of construct validity; valid construct representation in experimental psychopathology; the need to avoid representing multidimensional constructs with a single score; and the emergence of effective new statistical tools for the evaluation of convergent and discriminant validity. PMID:19086835

  6. Reliability and Validity of the Lichtenberg Financial Decision Screening Scale.

    PubMed

    Lichtenberg, Peter A; Teresi, Jeanne A; Ocepek-Welikson, Katja; Eimicke, Joseph P

    2017-03-01

    The scarcity of empirically validated assessment instruments continues to impede the work of professionals in a number of fields, including medicine, finance, and estate planning; adult protective services; and criminal justice. More importantly, it impedes their ability to effectively assist and, in some cases, protect their clients. Other professionals (e.g. legal, financial, medical, mental health services) are in a position to prevent financial exploitation and would benefit from access to new instruments. The Lichtenberg Financial Decision Screening Scale (LFDSS) was introduced in 2016, along with evidence for its convergent validity (Lichtenberg et al., 2016). Using a sample of 213 participants, this study investigated the internal consistency of the LFDSS and its criterion validity based on ratings by professionals using the scale. Results demonstrate that the LFDSS has excellent internal consistency and clinical utility properties. This paper provides support for use of the LFDSS as a reliable and valid instrument. The LFDSS and instructions for its use are included in the article, along with information about online tools and support.

  7. Explicating Validity

    ERIC Educational Resources Information Center

    Kane, Michael T.

    2016-01-01

    How we choose to use a term depends on what we want to do with it. If "validity" is to be used to support a score interpretation, validation would require an analysis of the plausibility of that interpretation. If validity is to be used to support score uses, validation would require an analysis of the appropriateness of the proposed…

  8. [Measurement of unemployment-related psychological stress: Validation of the Unemployment Stress Scale (USS)].

    PubMed

    Szabóné Kapuvári, Virág; Martos, Tamás

    2016-01-01

    Unemployment and responses to it are topical questions in psychology. Nevertheless, psychological instruments for measuring this phenomenon are often lacking. This article therefore presents the Unemployment Stress Scale (USS). The aim of our study was to develop the USS and test its validity and reliability. A total of 287 unemployed adults took part in the study. Besides the USS, we used the Beck Depression Scale, the Spielberger Anxiety Scale (TRAIT), the Sense of Coherence Scale (Hungarian version), and the Rosenberg Self-Esteem Scale. According to our results, the USS showed excellent criterion and construct validity. Test-retest results indicate that the scale is useful (Cronbach's alpha: 0.88 and 0.86 in the respective samples). Moreover, the scale correlates strongly with the Spielberger Anxiety Scale (TRAIT) and the Beck Depression Scale. These characteristics of the new scale show that we found a factor, independent of self-esteem and sense of coherence, that represents the level of stress in the situation of unemployment. The scale is a professional instrument for measuring stress attributable to unemployment. The USS can be useful in clinical practice because measuring the actual level of unemployment stress can help protect the unemployed person and allows professionals to help earlier in a crisis of this kind.

  9. Validity and reliability of bioelectrical impedance analysis and skinfold thickness in predicting body fat in military personnel.

    PubMed

    Aandstad, Anders; Holtberget, Kristian; Hageberg, Rune; Holme, Ingar; Anderssen, Sigmund A

    2014-02-01

    Previous studies show that body composition is related to injury risk and physical performance in soldiers. Thus, valid methods for measuring body composition in military personnel are needed. The frequently used body mass index method is not a valid measure of body composition in soldiers, but the reliability and validity of alternative field methods have been less investigated in military personnel. Thus, we carried out test and retest of skinfold (SKF), single frequency bioelectrical impedance analysis (SF-BIA), and multifrequency bioelectrical impedance analysis measurements in 65 male and female soldiers. Several validated equations were used to predict percent body fat from these methods. Dual-energy X-ray absorptiometry was also measured, and acted as the criterion method. Results showed that SF-BIA was the most reliable method in both genders. In women, SF-BIA was also the most valid method, whereas SKF or a combination of SKF and SF-BIA produced the highest validity in men. Reliability and validity varied substantially among the equations examined. The best methods and equations produced test-retest 95% limits of agreement below ±1% points, whereas the corresponding validity figures were ±3.5% points. Each investigator and practitioner must consider whether such measurement errors are acceptable for their specific use. Reprint & Copyright © 2014 Association of Military Surgeons of the U.S.
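
    The 95% limits of agreement quoted in the abstract above are conventionally obtained with the Bland and Altman method: the mean of the paired differences plus or minus 1.96 standard deviations of those differences. The following is a minimal sketch of that calculation, not the authors' analysis; the function name and the body-fat values are hypothetical and serve only as illustration.

```python
from statistics import mean, stdev


def limits_of_agreement(method_a, method_b, z=1.96):
    """Return (bias, lower, upper) for paired measurements.

    bias  = mean of the paired differences (method_a - method_b)
    lower = bias - z * SD of the differences
    upper = bias + z * SD of the differences
    """
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need two equally long series with at least 2 pairs")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation (n - 1 denominator)
    return bias, bias - z * sd, bias + z * sd


# Hypothetical percent-body-fat values: a field method vs. a criterion method.
field = [18.2, 22.5, 15.1, 27.3, 20.0]
criterion = [19.0, 21.7, 16.4, 26.1, 21.2]
bias, lower, upper = limits_of_agreement(field, criterion)
print(f"bias={bias:.2f}, 95% limits of agreement=({lower:.2f}, {upper:.2f})")
```

    Applied to test and retest values from one method, the same function gives reliability limits; applied to a field method against the criterion method, it gives the validity limits.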

  10. Reliability and validity of the Spanish Language Wechsler Adult Intelligence Scale (3rd Edition) in a sample of American, urban, Spanish-speaking Hispanics.

    PubMed

    Renteria, Laura; Li, Susan Tinsley; Pliskin, Neil H

    2008-05-01

    The utility of the Spanish WAIS-III was investigated by examining its reliability and validity among 100 Spanish-speaking participants. Results indicated that the internal consistency of the subtests was satisfactory, but inadequate for Letter Number Sequencing. Criterion validity was adequate. Convergent and discriminant validity results were generally similar to the North American normative sample. Paired sample t-tests suggested that the WAIS-III may underestimate ability when compared to the criterion measures that were utilized to assess validity. This study provides support for the use of the Spanish WAIS-III in urban Hispanic populations, but also suggests that caution be used when administering specific subtests, due to the nature of the Latin American alphabet and potential test bias.

  11. Assessment of a condition-specific quality-of-life measure for patients with developmentally absent teeth: validity and reliability testing.

    PubMed

    Akram, A J; Ireland, A J; Postlethwaite, K C; Sandy, J R; Jerreat, A S

    2013-11-01

    This article describes the process of validity and reliability testing of a condition-specific quality-of-life measure for patients with hypodontia presenting for orthodontic treatment. The development of the instrument is described in a previous article. The setting was the Royal Devon and Exeter NHS Foundation Trust and Musgrove Park Hospital, Taunton. The child perception questionnaire was used as a standard against which to test criterion validity. The Bland and Altman method was used to check agreement between the two questionnaires. Construct validity was tested using principal component analysis on the four sections of the questionnaire. Test-retest reliability was tested using the intraclass correlation coefficient and the Bland and Altman method. Cronbach's alpha was used to test internal consistency reliability. Overall the questionnaire showed good reliability, criterion and construct validity. This together with previous evidence of good face and content validity suggests that the instrument may prove useful in clinical practice and further research. This study has demonstrated that the newly developed condition-specific quality-of-life questionnaire is both valid and reliable for use in young patients with hypodontia. © 2013 John Wiley & Sons A/S. Published by Blackwell Publishing Ltd.

  12. [Reliability and validity of warning signs checklist for screening psychological, behavioral and developmental problems of children].

    PubMed

    Huang, X N; Zhang, Y; Feng, W W; Wang, H S; Cao, B; Zhang, B; Yang, Y F; Wang, H M; Zheng, Y; Jin, X M; Jia, M X; Zou, X B; Zhao, C X; Robert, J; Jing, Jin

    2017-06-02

    Objective: To evaluate the reliability and validity of the warning signs checklist developed by the National Health and Family Planning Commission of the People's Republic of China (NHFPC), so as to determine the screening effectiveness of warning signs for developmental problems of early childhood. Method: Stratified random sampling was used to assess the reliability and validity of the warning signs checklist, and 2,110 children 0 to 6 years of age (1,513 low-risk subjects and 597 high-risk subjects) were recruited from 11 provinces of China. The reliability evaluation for the warning signs included the test-retest reliability and interrater reliability. With the use of the Age and Stage Questionnaire (ASQ) and the Gesell Development Diagnosis Scale (GESELL) as the criterion scales, criterion validity was assessed by determining the correlation and consistency between the screening results of the warning signs and the criterion scales. Result: In terms of the warning signs, the screening positive rates at different ages ranged from 10.8% (21/141) to 26.2% (51/137). The median (interquartile) testing time for each subject was 1 (0.6) minute. Both the test-retest reliability and interrater reliability of the warning signs reached 0.7 or above, indicating that the stability was good. In terms of validity assessment, there was remarkable consistency between the ASQ and the warning signs, with a Kappa value of 0.63. With the use of GESELL as criterion, it was determined that the sensitivity of the warning signs in children with suspected developmental delay was 82.2%, and the specificity was 77.7%. The overall Youden index was 0.6. Conclusion: The reliability and validity of the warning signs checklist for screening early childhood developmental problems have met the basic requirements of psychological screening scales, with the characteristics of short testing time and easy operation. Thus, this warning signs checklist can be used for screening psychological and behavioral problems of early childhood
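
    The Youden index reported in the abstract above follows directly from the stated sensitivity and specificity: J = sensitivity + specificity - 1. The short sketch below reproduces that arithmetic from the two published percentages; the function name is an assumption chosen for illustration.

```python
def youden_index(sensitivity: float, specificity: float) -> float:
    """Youden's J statistic; both inputs are proportions in [0, 1]."""
    if not (0.0 <= sensitivity <= 1.0 and 0.0 <= specificity <= 1.0):
        raise ValueError("sensitivity and specificity must be proportions")
    return sensitivity + specificity - 1.0


# Values taken from the abstract: sensitivity 82.2%, specificity 77.7%.
j = youden_index(0.822, 0.777)
print(round(j, 3))  # 0.599, which rounds to the reported 0.6
```

    J ranges from 0 (a screen no better than chance) to 1 (perfect classification), so a value near 0.6 reflects moderate overall discrimination.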

  13. Assessing the Validity of Self-Reported Stress-Related Growth

    ERIC Educational Resources Information Center

    Frazier, Patricia A.; Kaler, Matthew E.

    2006-01-01

    The purpose of these studies was to assess the validity of self-reported stress-related growth (SRG). In Study 1, individuals with breast cancer (n = 70) generally did not report greater well-being than a matched comparison group (n = 70). In Study 2, there were no significant differences in well-being between undergraduate students who said that…

  14. The local lymph node assay and the assessment of relative potency: status of validation.

    PubMed

    Basketter, David A; Gerberick, Frank; Kimber, Ian

    2007-08-01

    For the prediction of skin sensitization potential, the local lymph node assay (LLNA) is a fully validated alternative to guinea-pig tests. More recently, information from LLNA dose-response analyses has been used to assess the relative potency of skin sensitizing chemicals. These data are then deployed for risk assessment and risk management. In this commentary, the utility and validity of these relative potency measurements are reviewed. It is concluded that the LLNA does provide a valuable assessment of relative sensitizing potency in the form of the estimated concentration of a chemical required to produce a threefold stimulation of draining lymph node cell proliferation compared with concurrent controls (EC3 value) and that all reasonable validation requirements have been addressed successfully. EC3 measurements are reproducible in both intra- and interlaboratory evaluations and are stable over time. It has been shown also, by several independent groups, that EC3 values correlate closely with data on relative human skin sensitization potency. Consequently, the recommendation made here is that LLNA EC3 measurements should now be regarded as a validated method for the determination of the relative potency of skin sensitizing chemicals, a conclusion that has already been reached by a number of independent expert groups.

  15. Validity and reliability of the Japanese version of the Newest Vital Sign: a preliminary study.

    PubMed

    Kogure, Takamichi; Sumitani, Masahiko; Suka, Machi; Ishikawa, Hirono; Odajima, Takeshi; Igarashi, Ataru; Kusama, Makiko; Okamoto, Masako; Sugimori, Hiroki; Kawahara, Kazuo

    2014-01-01

    Health literacy (HL) refers to the ability to obtain, process, and understand basic health information and services, and is thus needed to make appropriate health decisions. The Newest Vital Sign (NVS) comprises 6 questions about an ice cream nutrition label and assesses HL numeracy skills. We developed a Japanese version of the NVS (NVS-J) and evaluated the validity and reliability of the NVS-J in patients with chronic pain. The translation of the original NVS into Japanese was achieved as per the published guidelines. An observational study was subsequently performed to evaluate the validity and reliability of the NVS-J in 43 Japanese patients suffering from chronic pain. Factor analysis with promax rotation, using the Kaiser criterion (eigenvalues ≥1.0), and a scree plot revealed that the main component of the NVS-J consists of three determinative factors, and each factor consists of two NVS-J items. As evidence of criterion-related validity, the total NVS-J score was significantly correlated with the total score of Ishikawa et al.'s self-rated HL Questionnaire, the clinical global assessment of comprehensive HL level, cognitive function, and the Brinkman index. In addition, Cronbach's alpha coefficient for the total score of the NVS-J was adequate (alpha = 0.72). This study demonstrated that the NVS-J has good validity and reliability. Further, the NVS-J consists of three determinative factors: "basic numeracy ability," "complex numeracy ability," and "serious-minded ability." These three HL abilities comprise a 3-step hierarchical structure. Adequate HL should be promoted in chronic pain patients to enable coping, improve functioning, and increase activities of daily living (ADLs) and quality of life (QOL).
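
    Cronbach's alpha, the internal-consistency coefficient cited in this and several other abstracts in this listing, is computed from the item variances and the variance of the total score: alpha = k/(k - 1) × (1 - Σ item variances / total-score variance), where k is the number of items. The sketch below is one way to compute it and is not taken from any of the studies; the function name and the 0/1 item responses are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items table of scores.

    scores: list of rows, one row per respondent, one column per item.
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    items = list(zip(*scores))  # transpose: one tuple per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return n_items / (n_items - 1) * (1 - item_var_sum / total_var)


# Hypothetical right/wrong (1/0) answers of five respondents to six items.
responses = [
    [1, 1, 1, 1, 0, 1],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0, 0],
    [1, 1, 0, 1, 1, 0],
]
print(round(cronbach_alpha(responses), 2))
```

    Sample variances (n - 1 denominator) are used for both the items and the total; the ratio, and therefore alpha, is the same if population variances are used consistently instead.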

  16. Assessment of sedentary behaviors and transport-related activities by questionnaire: a validation study.

    PubMed

    Mensah, Keitly; Maire, Aurélia; Oppert, Jean-Michel; Dugas, Julien; Charreire, Hélène; Weber, Christiane; Simon, Chantal; Nazare, Julie-Anne

    2016-08-09

    Comprehensive assessment of sedentary behavior (SB) and physical activity (PA), including transport-related activities (TRA), is required to design innovative PA promotion strategies. There are few validated instruments that simultaneously assess the different components of human movement according to their context of practice (e.g. work, transport, leisure). We examined test-retest reliability and validity of the Sedentary, Transportation and Activity Questionnaire (STAQ), a newly developed questionnaire dedicated to assessing context-specific SB, TRA and PA. Ninety-six subjects (51 women) kept a contextualized activity-logbook and wore a hip accelerometer (Actigraph GT3X+(TM)) for a 7-day or 14-day period, at the end of which they completed the STAQ. Activity-energy expenditure was measured in a subgroup of 45 subjects using the doubly labeled water (DLW) method. Test-retest reliability was assessed using intraclass correlation coefficients (ICC) in a subgroup of 32 subjects who filled in the questionnaire twice, one month apart. Accelerometry was annotated using the logbook to obtain total and context-specific objective estimates of SB. Spearman correlations, Bland-Altman plots and ICC were used to analyze validity, with logbook, accelerometry and DLW data as validity criteria. Test-retest reliability was fair for total sitting time (ICC = 0.52), good to excellent for work sitting time (ICC = 0.71), transport-related walking (ICC = 0.61) and car use (ICC = 0.67), and leisure screen-related SB (ICC = 0.64-0.79), but poor for total sitting time during leisure and transport-related contexts. For validity, compared to accelerometry, significant correlations were found for STAQ estimates of total (r = 0.54) and context-specific sitting times with stronger correlations for work sitting time (r = 0.88), and screen times (TV/DVD viewing: r = 0.46; other screens: r = 0.42) than for transport (r = 0.35) or leisure-related sitting-times (r

  17. Reliability and validity of the Tilburg Frailty Indicator (TFI) among Chinese community-dwelling older people.

    PubMed

    Dong, Lijuan; Liu, Na; Tian, Xiaoyu; Qiao, Xiaoxia; Gobbens, Robbert J J; Kane, Robert L; Wang, Cuili

    2017-11-01

    To translate the Tilburg Frailty Indicator (TFI) into Chinese and assess its reliability and validity. A sample of 917 community-dwelling older people, aged ≥60 years, in a Chinese city was included between August 2015 and March 2016. Construct validity was assessed using alternative measures corresponding to the TFI items, including self-rated health status (SRH), unintentional weight loss, walking speed, timed-up-and-go tests (TUGT), making telephone calls, grip strength, exhaustion, Short Portable Mental Status Questionnaire (SPMSQ), Geriatric Depression scale (GDS-15), emotional role, Adaptability Partnership Growth Affection and Resolve scale (APGAR) and Social Support Rating Scale (SSRS). Fried's phenotype and frailty index were measured to evaluate criterion validity. Adverse health outcomes (ADL and IADL disability, healthcare utilization, GDS-15, SSRS) were used to assess predictive (concurrent) validity. The internal consistency reliability was good (Cronbach's α=0.71). The test-retest reliability was strong (r=0.88). Kappa coefficients showed agreement between the TFI items and corresponding alternative measures. Alternative measures correlated as expected with the three domains of the TFI, with the exception that alternative psychological measures had similar correlations with the psychological and physical domains of the TFI. The Chinese TFI had excellent criterion validity, with AUCs regarding physical phenotype and frailty index of 0.87 and 0.86, respectively. The predictive (concurrent) validities of the adverse health outcomes and healthcare utilization were acceptable (AUCs: 0.65-0.83). The Chinese TFI has good validity and reliability as an integral instrument to measure frailty of older people living in the community in China. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Development and Validation of the Spanish-English Language Proficiency Scale (SELPS)

    ERIC Educational Resources Information Center

    Smyk, Ekaterina; Restrepo, M. Adelaida; Gorin, Joanna S.; Gray, Shelley

    2013-01-01

    Purpose: This study examined the development and validation of a criterion-referenced Spanish-English Language Proficiency Scale (SELPS) that was designed to assess the oral language skills of sequential bilingual children ages 4-8. This article reports results for the English proficiency portion of the scale. Method: The SELPS assesses syntactic…

  19. Development and Validation of Personality Disorder Spectra Scales for the MMPI-2-RF.

    PubMed

    Sellbom, Martin; Waugh, Mark H; Hopwood, Christopher J

    2018-01-01

    The purpose of this study was to develop and validate a set of MMPI-2-RF (Ben-Porath & Tellegen, 2008/2011) personality disorder (PD) spectra scales. These scales could serve the purpose of assisting with DSM-5 PD diagnosis and help link categorical and dimensional conceptions of personality pathology within the MMPI-2-RF. We developed and provided initial validity results for scales corresponding to the 10 PD constructs listed in the DSM-5 using data from student, community, clinical, and correctional samples. Initial validation efforts indicated good support for criterion validity with an external PD measure as well as with dimensional personality traits included in the DSM-5 alternative model for PDs. Construct validity results using psychosocial history and therapists' ratings in a large clinical sample were generally supportive as well. Overall, these brief scales provide clinicians using MMPI-2-RF data with estimates of DSM-5 PD constructs that can support cross-model connections between categorical and dimensional assessment approaches.

  20. Validation of the Arabic Version of the Internet Gaming Disorder-20 Test.

    PubMed

    Hawi, Nazir S; Samaha, Maya

    2017-04-01

    In recent years, researchers have been trying to shed light on gaming addiction and its association with different psychiatric disorders and psychological determinants. The latest edition of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) included Internet Gaming Disorder (IGD) in its Section 3 as a condition for further empirical study and proposed nine criteria for the diagnosis of IGD. The 20-item Internet Gaming Disorder (IGD-20) Test was developed as a valid and reliable tool to assess gaming addiction based on the nine criteria set by the DSM-5. The aim of this study is to validate an Arabic version of the IGD-20 Test. The Arabic version of IGD-20 will not only help in identifying Arabic-speaking pathological gamers but also stimulate cross-cultural studies that could contribute to an area in need of more research for insight and treatment. After a process of translation and back-translation and with the participation of a sizable sample of Arabic-speaking adolescents, the present study conducted a psychometric validation of the IGD-20 Test. Our confirmatory factor analysis showed the validity of the Arabic version of the IGD-20 Test. The one-factor model of the Arabic IGD-20 Test had very good psychometric properties, and it fitted the sample data extremely well. In addition, correlation analysis between the IGD-20 Test and the daily duration of gameplay on weekdays and weekends revealed significant positive relationships that supported criterion-related validity. Thus, the Arabic version of the IGD-20 Test is a valid and reliable measure of IGD among Arabic-speaking populations.

  1. Reliability and validity of the Outcome Expectations for Exercise Scale-2.

    PubMed

    Resnick, Barbara

    2005-10-01

    Development of a reliable and valid measure of outcome expectations for exercise for older adults will help establish the relationship between outcome expectations and exercise and facilitate the development of interventions to increase physical activity in older adults. The purpose of this study was to test the reliability and validity of the Outcome Expectations for Exercise-2 Scale (OEE-2), a 13-item measure with two subscales: positive OEE (POEE) and negative OEE (NOEE). The OEE-2 scale was given to 161 residents in a continuing-care retirement community. There was some evidence of validity based on confirmatory factor analysis, Rasch-analysis INFIT and OUTFIT statistics, and convergent validity and test criterion relationships. There was some evidence for reliability of the OEE-2 based on alpha coefficients, person- and item-separation reliability indexes, and R² values. Based on analyses, suggested revisions are provided for future use of the OEE-2. Although ongoing reliability and validity testing are needed, the OEE-2 scale can be used to identify older adults with low outcome expectations for exercise, and interventions can then be implemented to strengthen these expectations and improve exercise behavior.

  2. Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese

    PubMed Central

    2014-01-01

    Background Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. Methods The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman’s correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years by using the intraclass correlation coefficient analysis. Results The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that construct the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency was low to acceptable (Cronbach’s alpha = 0.67 for the total scale, 0.46-0.61 for each subscale). Mean dietary

  3. Health Sciences-Evidence Based Practice questionnaire (HS-EBP) for measuring transprofessional evidence-based practice: Creation, development and psychometric validation.

    PubMed

    Fernández-Domínguez, Juan Carlos; de Pedro-Gómez, Joan Ernest; Morales-Asencio, José Miguel; Bennasar-Veny, Miquel; Sastre-Fullana, Pedro; Sesé-Abad, Albert

    2017-01-01

    Most of the EBP measuring instruments available to date present limitations both in the operationalisation of the construct and also in the rigour of their psychometric development, as revealed in the literature review performed. The aim of this paper is to provide rigorous and adequate reliability and validity evidence of the scores of a new transdisciplinary psychometric tool, the Health Sciences Evidence-Based Practice (HS-EBP), for measuring the construct EBP in Health Sciences professionals. A pilot study and a subsequent two-stage validation test sample were conducted to progressively refine the instrument until a reduced 60-item version with a five-factor latent structure was obtained. Reliability was analysed through both Cronbach's alpha coefficient and intraclass correlations (ICC). Latent structure was contrasted using confirmatory factor analysis (CFA) following a model comparison approach. Evidence of criterion validity of the scores obtained was achieved by considering attitudinal resistance to change, burnout, and quality of professional life as criterion variables; while convergent validity was assessed using the Spanish version of the Evidence-Based Practice Questionnaire (EBPQ-19). Adequate evidence of both reliability and ICC was obtained for the five dimensions of the questionnaire. According to the CFA model comparison, the best fit corresponded to the five-factor model (RMSEA = 0.049; CI 90% RMSEA = [0.047; 0.050]; CFI = 0.99). Adequate criterion and convergent validity evidence was also provided. Finally, the HS-EBP showed the capability to find differences between EBP training levels as important evidence of decision validity. Reliability and validity evidence obtained regarding the HS-EBP confirm the adequate operationalisation of the EBP construct as a process put into practice to respond to every clinical situation arising in the daily practice of professionals in health sciences (transprofessional). The tool could be useful for EBP individual

  4. The Queensland high risk foot form (QHRFF) – is it a reliable and valid clinical research tool for foot disease?

    PubMed Central

    2014-01-01

    Background Foot disease complications, such as foot ulcers and infection, contribute to considerable morbidity and mortality. These complications are typically precipitated by “high-risk factors”, such as peripheral neuropathy and peripheral arterial disease. High-risk factors are more prevalent in specific “at risk” populations such as diabetes, kidney disease and cardiovascular disease. To the best of the authors’ knowledge a tool capturing multiple high-risk factors and foot disease complications in multiple at risk populations has yet to be tested. This study aimed to develop and test the validity and reliability of a Queensland High Risk Foot Form (QHRFF) tool. Methods The study was conducted in two phases. Phase one developed a QHRFF using an existing diabetes foot disease tool, literature searches, stakeholder groups and expert panel. Phase two tested the QHRFF for validity and reliability. Four clinicians, representing different levels of expertise, were recruited to test validity and reliability. Three cohorts of patients were recruited; one tested criterion measure reliability (n = 32), another tested criterion validity and inter-rater reliability (n = 43), and another tested intra-rater reliability (n = 19). Validity was determined using sensitivity, specificity and positive predictive values (PPV). Reliability was determined using Kappa, weighted Kappa and intra-class correlation (ICC) statistics. Results A QHRFF tool containing 46 items across seven domains was developed. Criterion measure reliability of at least moderate categories of agreement (Kappa > 0.4; ICC > 0.75) was seen in 91% (29 of 32) tested items. Criterion validity of at least moderate categories (PPV > 0.7) was seen in 83% (60 of 72) tested items. Inter- and intra-rater reliability of at least moderate categories (Kappa > 0.4; ICC > 0.75) was seen in 88% (84 of 96) and 87% (20 of 23) tested items respectively. Conclusions The QHRFF had

  5. Traditional Masculinity and Femininity: Validation of a New Scale Assessing Gender Roles.

    PubMed

    Kachel, Sven; Steffens, Melanie C; Niedlich, Claudia

    2016-01-01

    Gender stereotype theory suggests that men are generally perceived as more masculine than women, whereas women are generally perceived as more feminine than men. Several scales have been developed to measure fundamental aspects of gender stereotypes (e.g., agency and communion, competence and warmth, or instrumentality and expressivity). Although omitted in later versions, Bem's original Sex Role Inventory included the items "masculine" and "feminine" in addition to more specific gender-stereotypical attributes. We argue that it is useful to be able to measure these two core concepts in a reliable, valid, and parsimonious way. We introduce a new and brief scale, the Traditional Masculinity-Femininity (TMF) scale, designed to assess central facets of self-ascribed masculinity-femininity. Studies 1-2 used known-groups approaches (participants differing in gender and sexual orientation) to validate the scale and provide evidence of its convergent validity. As expected, the TMF reliably measured a one-dimensional masculinity-femininity construct. Moreover, the TMF correlated moderately with other gender-related measures. Demonstrating incremental validity, the TMF predicted gender and sexual orientation better than established adjective-based measures. Furthermore, the TMF was connected to criterion characteristics, such as judgments as straight by laypersons for the whole sample, voice pitch characteristics for the female subsample, and contact to gay men for the male subsample, and outperformed other gender-related scales. Taken together, as long as gender differences continue to exist, we suggest that the TMF provides a valuable methodological addition for research into gender stereotypes.

  6. Traditional Masculinity and Femininity: Validation of a New Scale Assessing Gender Roles

    PubMed Central

    Kachel, Sven; Steffens, Melanie C.; Niedlich, Claudia

    2016-01-01

    Gender stereotype theory suggests that men are generally perceived as more masculine than women, whereas women are generally perceived as more feminine than men. Several scales have been developed to measure fundamental aspects of gender stereotypes (e.g., agency and communion, competence and warmth, or instrumentality and expressivity). Although omitted in later versions, Bem's original Sex Role Inventory included the items “masculine” and “feminine” in addition to more specific gender-stereotypical attributes. We argue that it is useful to be able to measure these two core concepts in a reliable, valid, and parsimonious way. We introduce a new and brief scale, the Traditional Masculinity-Femininity (TMF) scale, designed to assess central facets of self-ascribed masculinity-femininity. Studies 1–2 used known-groups approaches (participants differing in gender and sexual orientation) to validate the scale and provide evidence of its convergent validity. As expected, the TMF reliably measured a one-dimensional masculinity-femininity construct. Moreover, the TMF correlated moderately with other gender-related measures. Demonstrating incremental validity, the TMF predicted gender and sexual orientation better than established adjective-based measures. Furthermore, the TMF was connected to criterion characteristics, such as judgments as straight by laypersons for the whole sample, voice pitch characteristics for the female subsample, and contact to gay men for the male subsample, and outperformed other gender-related scales. Taken together, as long as gender differences continue to exist, we suggest that the TMF provides a valuable methodological addition for research into gender stereotypes. PMID:27458394

  7. Development and Validation of the Masculine Attributes Questionnaire

    PubMed Central

    Cho, Junhan; Kogan, Steven M.

    2017-01-01

    The present study describes the development and validation of the Masculine Attributes Questionnaire (MAQ). The purpose of this study was to develop a theoretically and empirically grounded measure of masculine attributes for sexual health research with African American young men. Consistent with Whitehead’s theory, the MAQ items were hypothesized to comprise two components representing reputation-based and respect-based attributes. The sample included 505 African American men aged 19 to 22 years (M = 20.29, SD = 1.10) living in resource-poor communities in the rural South. Convergent and discriminant validity of the MAQ were assessed by examining the associations of masculinity attributes with psychosocial factors. Criterion validity was assessed by examining the extent to which the MAQ subscales predicted sexual risk behavior outcomes. Consistent with study hypotheses, the MAQ was composed of (a) reputation-based attributes oriented toward sexual prowess, toughness, and authority-defying behavior and (b) respect-based attributes oriented toward economic independence, socially approved levels of hard work and education, and committed romantic relationships. Reputation-based attributes were associated positively with street code and negatively related to academic orientation, vocational engagement, and self-regulation, whereas respect-based attributes were associated positively with academic and vocational orientations and self-regulation. Finally, reputation-based attributes predicted sexual risk behaviors including concurrent sexual partnerships, multiple sexual partners, marijuana use, and incarceration, net of the influence of respect-based attributes. The development of the MAQ provides a new measure that permits systematic quantitative investigation of the associations between African American men’s masculinity ideology and sexual risk behavior. PMID:28413906

  8. Development and Validation of the Masculine Attributes Questionnaire.

    PubMed

    Cho, Junhan; Kogan, Steven M

    2017-07-01

    The present study describes the development and validation of the Masculine Attributes Questionnaire (MAQ). The purpose of this study was to develop a theoretically and empirically grounded measure of masculine attributes for sexual health research with African American young men. Consistent with Whitehead's theory, the MAQ items were hypothesized to comprise two components representing reputation-based and respect-based attributes. The sample included 505 African American men aged 19 to 22 years (M = 20.29, SD = 1.10) living in resource-poor communities in the rural South. Convergent and discriminant validity of the MAQ were assessed by examining the associations of masculinity attributes with psychosocial factors. Criterion validity was assessed by examining the extent to which the MAQ subscales predicted sexual risk behavior outcomes. Consistent with study hypotheses, the MAQ was composed of (a) reputation-based attributes oriented toward sexual prowess, toughness, and authority-defying behavior and (b) respect-based attributes oriented toward economic independence, socially approved levels of hard work and education, and committed romantic relationships. Reputation-based attributes were associated positively with street code and negatively related to academic orientation, vocational engagement, and self-regulation, whereas respect-based attributes were associated positively with academic and vocational orientations and self-regulation. Finally, reputation-based attributes predicted sexual risk behaviors including concurrent sexual partnerships, multiple sexual partners, marijuana use, and incarceration, net of the influence of respect-based attributes. The development of the MAQ provides a new measure that permits systematic quantitative investigation of the associations between African American men's masculinity ideology and sexual risk behavior.

  9. Research Diagnostic Criteria for Temporomandibular Disorders: Validity of Axis I Diagnoses

    PubMed Central

    Truelove, Edmond; Pan, Wei; Look, John O.; Mancl, Lloyd A.; Ohrbach, Richard K.; Velly, Ana; Huggins, Kimberly; Lenton, Patricia; Schiffman, Eric L.

    2011-01-01

    AIMS To estimate the criterion validity of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Axis I TMD diagnoses. METHODS A combined total of 614 TMD community and clinic cases and 91 controls were examined at 3 study sites. RDC/TMD Axis I diagnoses were algorithmically derived from an examination performed by calibrated dental hygienists. Reference standards (Gold Standards) were established by means of consensus diagnoses rendered by 2 TMD experts using all available clinical data, including imaging studies. Validity of the RDC/TMD Axis I TMD diagnoses was estimated relative to reference-standard diagnoses (gold standard diagnoses). Target sensitivity and specificity were set a priori at ≥ 0.70 and ≥ 0.95, respectively. RESULTS Target sensitivity and specificity were not observed for any of the 8 RDC/TMD diagnoses. The highest validity was achieved for Group Ia myofascial pain (sensitivity 0.65, specificity 0.92) and Group Ib myofascial pain with limited opening (sensitivity 0.79, specificity 0.92). Target sensitivity and specificity were observed only when both Group I diagnoses were combined (0.87 and 0.98, respectively). For Group II (disc displacements) and Group III (arthralgia, arthritis, arthrosis) diagnoses, all estimates for sensitivity were below target (0.03 to 0.53), and specificity ranged from below to on target (0.86 to 0.99). CONCLUSION The RDC/TMD Axis I TMD diagnoses did not reach the targets set at sensitivity of ≥ 0.70 and specificity of ≥ 0.95. Target validity was obtained only for myofascial pain without differentiation between normal and limited opening. Revision of the current Axis I TMD diagnostic algorithms is warranted to improve their validity. PMID:20213030
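
    The target check described in this abstract can be sketched in a few lines of Python. This is only an illustration: the function name `meets_targets` and the dictionary layout are assumptions made for the example, and the figures are the three sensitivity/specificity pairs quoted in the abstract, not the full set of 8 diagnoses.

```python
# Minimal sketch: compare reported sensitivity/specificity pairs against
# the a priori targets quoted in the abstract (>= 0.70 and >= 0.95).
# Function and variable names are hypothetical, chosen for illustration.

TARGET_SENSITIVITY = 0.70
TARGET_SPECIFICITY = 0.95

# (sensitivity, specificity) as reported in the abstract
reported = {
    "Group Ia": (0.65, 0.92),
    "Group Ib": (0.79, 0.92),
    "Group I combined": (0.87, 0.98),
}


def meets_targets(sensitivity: float, specificity: float) -> bool:
    """Return True only if BOTH targets are reached."""
    return (sensitivity >= TARGET_SENSITIVITY
            and specificity >= TARGET_SPECIFICITY)


results = {name: meets_targets(*pair) for name, pair in reported.items()}

for name, ok in results.items():
    sens, spec = reported[name]
    status = "meets targets" if ok else "below target"
    print(f"{name}: sensitivity={sens:.2f}, specificity={spec:.2f} -> {status}")
```

    Requiring both thresholds at once is what makes Group Ib fall short despite its sensitivity of 0.79: its specificity of 0.92 is below 0.95.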

  10. Validation of the Acoustic Voice Quality Index in the Japanese Language.

    PubMed

    Hosokawa, Kiyohito; Barsties, Ben; Iwahashi, Toshihiko; Iwahashi, Mio; Kato, Chieri; Iwaki, Shinobu; Sasai, Hisanori; Miyauchi, Akira; Matsushiro, Naoki; Inohara, Hidenori; Ogawa, Makoto; Maryn, Youri

    2017-03-01

    The Acoustic Voice Quality Index (AVQI) is a multivariate construct for quantification of overall voice quality based on the analysis of continuous speech and sustained vowel. The stability and validity of the AVQI is well established in several language families. However, the Japanese language has distinct characteristics with respect to several parameters of articulatory and phonatory physiology. The aim of the study was to confirm the criterion-related concurrent validity of AVQI, as well as its responsiveness to change and diagnostic accuracy for voice assessment in the Japanese-speaking population. This is a retrospective study. A total of 336 voice recordings, which included 69 pairs of voice recordings (before and after therapeutic interventions), were eligible for the study. The auditory-perceptual judgment of overall voice quality was evaluated by five experienced raters. The concurrent validity, responsiveness to change, and diagnostic accuracy of the AVQI were estimated. The concurrent validity and responsiveness to change based on the overall voice quality was indicated by high correlation coefficients 0.828 and 0.767, respectively. Receiver operating characteristic analysis revealed an excellent diagnostic accuracy for discrimination between dysphonic and normophonic voices (area under the curve: 0.905). The best threshold level for the AVQI of 3.15 corresponded with a sensitivity of 72.5% and specificity of 95.2%, with the positive and negative likelihood ratios of 15.1 and 0.29, respectively. We demonstrated the validity of the AVQI as a tool for assessment of overall voice quality and that of voice therapy outcomes in the Japanese-speaking population. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
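
    The likelihood ratios quoted in this abstract follow from the reported sensitivity and specificity by the standard definitions LR+ = sensitivity / (1 - specificity) and LR- = (1 - sensitivity) / specificity. A minimal sketch, in which the function name `likelihood_ratios` is an assumption made for illustration:

```python
# Minimal sketch: derive positive and negative likelihood ratios from
# sensitivity and specificity. The function name is hypothetical.


def likelihood_ratios(sensitivity: float, specificity: float) -> tuple[float, float]:
    """Return (LR+, LR-) for a binary diagnostic test."""
    if not 0.0 <= sensitivity <= 1.0:
        raise ValueError("sensitivity must lie in [0, 1]")
    if not 0.0 < specificity < 1.0:
        # specificity of 0 or 1 would make one of the ratios undefined
        raise ValueError("specificity must lie strictly between 0 and 1")
    lr_pos = sensitivity / (1.0 - specificity)
    lr_neg = (1.0 - sensitivity) / specificity
    return lr_pos, lr_neg


# Values reported for the AVQI threshold of 3.15
lr_pos, lr_neg = likelihood_ratios(sensitivity=0.725, specificity=0.952)
print(f"LR+ = {lr_pos:.1f}, LR- = {lr_neg:.2f}")  # LR+ = 15.1, LR- = 0.29
```

    The computed values agree with the 15.1 and 0.29 reported in the abstract.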

  11. Assessing Procedural Competence: Validity Considerations.

    PubMed

    Pugh, Debra M; Wood, Timothy J; Boulet, John R

    2015-10-01

    Simulation-based medical education (SBME) offers opportunities for trainees to learn how to perform procedures and to be assessed in a safe environment. However, SBME research studies often lack robust evidence to support the validity of the interpretation of the results obtained from tools used to assess trainees' skills. The purpose of this paper is to describe how a validity framework can be applied when reporting and interpreting the results of a simulation-based assessment of skills related to performing procedures. The authors discuss various sources of validity evidence because they relate to SBME. A case study is presented.

  12. On Validity Theory and Test Validation

    ERIC Educational Resources Information Center

    Sireci, Stephen G.

    2007-01-01

    Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…

  13. The Chinese version of the Outcome Expectations for Exercise scale: validation study.

    PubMed

    Lee, Ling-Ling; Chiu, Yu-Yun; Ho, Chin-Chih; Wu, Shu-Chen; Watson, Roger

    2011-06-01

    Estimates of the reliability and validity of the English nine-item Outcome Expectations for Exercise (OEE) scale have been tested and found to be valid for use in various settings, particularly among older people, with good internal consistency and validity. Data on the use of the OEE scale among older Chinese people living in the community and how cultural differences might affect the administration of the OEE scale are limited. To test the validity and reliability of the Chinese version of the Outcome Expectations for Exercise scale among older people. A cross-sectional validation study was designed to test the Chinese version of the OEE scale (OEE-C). Reliability was examined by testing both the internal consistency for the overall scale and the squared multiple correlation coefficient for the single item measure. The validity of the scale was tested on the basis of both a traditional psychometric test and a confirmatory factor analysis using structural equation modelling. The Mokken Scaling Procedure (MSP) was used to investigate if there were any hierarchical, cumulative sets of items in the measure. The OEE-C scale was tested in a group of older people in Taiwan (n=108, mean age=77.1). There was acceptable internal consistency (alpha=.85) and model fit in the scale. Evidence of the validity of the measure was demonstrated by the tests for criterion-related validity and construct validity. There was a statistically significant correlation between exercise outcome expectations and exercise self-efficacy (r=.34, p<.01). An analysis of the Mokken Scaling Procedure found that nine items of the scale were all retained in the analysis and the resulting scale was reliable and statistically significant (p=.0008). The results obtained in the present study provided acceptable levels of reliability and validity evidence for the Chinese Outcome Expectations for Exercise scale when used with older people in Taiwan. Future testing of the OEE-C scale needs to be carried out

  14. CFD validation experiments for hypersonic flows

    NASA Technical Reports Server (NTRS)

    Marvin, Joseph G.

    1992-01-01

    A roadmap for CFD code validation is introduced. The elements of the roadmap are consistent with air-breathing vehicle design requirements and related to the important flow path components: forebody, inlet, combustor, and nozzle. Building block and benchmark validation experiments are identified along with their test conditions and measurements. Based on evaluation criteria, recommendations for an initial CFD validation database are given, and gaps are identified where future experiments could provide new validation data.

  15. Is the Simple Shoulder Test a valid outcome instrument for shoulder arthroplasty?

    PubMed

    Hsu, Jason E; Russ, Stacy M; Somerson, Jeremy S; Tang, Anna; Warme, Winston J; Matsen, Frederick A

    2017-10-01

    The Simple Shoulder Test (SST) is a brief, inexpensive, and widely used patient-reported outcome tool, but it has not been rigorously evaluated for patients having shoulder arthroplasty. The goal of this study was to rigorously evaluate the validity of the SST for outcome assessment in shoulder arthroplasty using a systematic review of the literature and an analysis of its properties in a series of 408 surgical cases. SST scores, 36-Item Short Form Health Survey scores, and satisfaction scores were collected preoperatively and 2 years postoperatively. Responsiveness was assessed by comparing preoperative and 2-year postoperative scores. Criterion validity was determined by correlating the SST with the 36-Item Short Form Health Survey. Construct validity was tested through 5 clinical hypotheses regarding satisfaction, comorbidities, insurance status, previous failed surgery, and narcotic use. Scores after arthroplasty improved from 3.9 ± 2.8 to 10.2 ± 2.3 (P < .001). The change in SST correlated strongly with patient satisfaction (P < .001). The SST had large Cohen's d effect sizes and standardized response means. Construct validity was supported by significant differences between satisfied and unsatisfied patients, those with more severe and less severe comorbidities, those with workers' compensation or Medicaid and other types of insurance, those with and without previous failed shoulder surgery, and those taking and those not taking narcotic pain medication before surgery (P < .005). These data combined with a systematic review of the literature demonstrate that the SST is a valid and responsive patient-reported outcome measure for assessing the outcomes of shoulder arthroplasty. Copyright © 2017 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
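
    The abstract reports large Cohen's d effect sizes without giving the values or the exact formula used. As a rough illustration only, the sketch below applies one common convention (mean difference divided by the root mean square of the two standard deviations, treating the time points as equal-sized groups) to the preoperative and postoperative summary statistics quoted above. The function name `cohens_d` is hypothetical, and the result should not be read as the figure the authors obtained.

```python
import math

# Minimal sketch: Cohen's d from two means and standard deviations,
# assuming equal group sizes so the pooled SD is the root mean square
# of the two SDs. This convention is an assumption; the abstract does
# not state which formula the authors used.


def cohens_d(mean_a: float, sd_a: float, mean_b: float, sd_b: float) -> float:
    """Standardized mean difference (b - a) using an equal-n pooled SD."""
    pooled_sd = math.sqrt((sd_a ** 2 + sd_b ** 2) / 2.0)
    if pooled_sd == 0.0:
        raise ValueError("pooled standard deviation is zero")
    return (mean_b - mean_a) / pooled_sd


# Summary statistics quoted in the abstract: 3.9 ± 2.8 before surgery,
# 10.2 ± 2.3 two years after.
d = cohens_d(mean_a=3.9, sd_a=2.8, mean_b=10.2, sd_b=2.3)
print(f"Cohen's d (equal-n pooled SD convention): {d:.2f}")
```

    Under this convention the value comes out well above the conventional 0.8 threshold for a large effect, which is consistent with the abstract's description. A paired-data formula based on the SD of the change scores (the standardized response mean) would need data not given here.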

  16. Simulated Driving Assessment (SDA) for teen drivers: results from a validation study.

    PubMed

    McDonald, Catherine C; Kandadai, Venk; Loeb, Helen; Seacrist, Thomas S; Lee, Yi-Ching; Winston, Zachary; Winston, Flaura K

    2015-06-01

    Driver error and inadequate skill are common critical reasons for novice teen driver crashes, yet few validated, standardised assessments of teen driving skills exist. The purpose of this study is to evaluate the construct and criterion validity of a newly developed Simulated Driving Assessment (SDA) for novice teen drivers. The SDA's 35 min simulated drive incorporates 22 variations of the most common teen driver crash configurations. Driving performance was compared for 21 inexperienced teens (age 16-17 years, provisional license ≤90 days) and 17 experienced adults (age 25-50 years, license ≥5 years, drove ≥100 miles per week, no collisions or moving violations ≤3 years). SDA driving performance (Error Score) was based on driving safety measures derived from simulator and eye-tracking data. Negative driving outcomes included simulated collisions or run-off-the-road incidents. A professional driving evaluator/instructor (DEI Score) reviewed videos of SDA performance. The SDA demonstrated construct validity: (1) teens had a higher Error Score than adults (30 vs. 13, p=0.02); (2) for each additional error committed, the RR of a participant's propensity for a simulated negative driving outcome increased by 8% (95% CI 1.05 to 1.10, p<0.01). The SDA demonstrated criterion validity: Error Score was correlated with DEI Score (r=-0.66, p<0.001). This study supports the concept of validated simulated driving tests like the SDA to assess novice driver skill in complex and hazardous driving scenarios. The SDA, as a standard protocol to evaluate teen driver performance, has the potential to facilitate screening and assessment of teen driving readiness and could be used to guide targeted skill training. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  17. Validity and Reliability of a New Instrument to Measure Cancer-Related Fatigue in Adolescents

    PubMed Central

    Hinds, Pamela S.; Hockenberry, Marilyn; Tong, Xin; Rai, Shesh N.; Gattuso, Jamie S.; McCarthy, Kathleen; Pui, Ching-Hon; Srivastava, Deo Kumar

    2008-01-01

    Adolescents undergoing treatment for cancer rate fatigue as their most prevalent and intense cancer- and treatment-related effect. Parents and staff rate it similarly. Despite its reported prevalence, intensity, and distressing effects, cancer-related fatigue in adolescents is not routinely assessed during or after cancer treatment. We contend that the insufficient clinical attention is primarily due to the lack of a reliable and valid self-report instrument with which adolescent cancer-related fatigue can be measured. Our aim was to determine the reliability and construct validity of a new instrument and its ability to measure change in fatigue over time. Initial testing involved 64 adolescents undergoing curative treatment of cancer who completed the Fatigue Scale-Adolescent (FS-A) at two to four key points in treatment in one of four studies. Internal consistency estimates ranged from 0.67 to 0.95. Validity estimates involving the FS-A with the parent version ranged from 0.13 to 0.76; estimates involving the staff version and the Reynolds Depression Scale were 0.27 and 0.87 respectively. Additional validity findings included significant fatigue differences between anemic and non-anemic patients (P = 0.042) and the emergence of four factors in an exploratory factor analysis. Findings further indicate that the FS-A can be used to measure change over time (t = 2.55, P <0.01). In summary, the FS-A has moderate to strong reliability and impressive validity coefficients for a new research instrument. PMID:17629669

  18. Are implicit self-esteem measures valid for assessing individual and cultural differences?

    PubMed

    Falk, Carl F; Heine, Steven J; Takemura, Kosuke; Zhang, Cathy X J; Hsu, Chih-Wei

    2015-02-01

    Our research utilized two popular theoretical conceptualizations of implicit self-esteem: 1) implicit self-esteem as a global automatic reaction to the self; and 2) implicit self-esteem as a context/domain specific construct. Under this framework, we present an extensive search for implicit self-esteem measure validity among different cultural groups (Study 1) and under several experimental manipulations (Study 2). In Study 1, Euro-Canadians (N = 107), Asian-Canadians (N = 187), and Japanese (N = 112) completed a battery of implicit self-esteem, explicit self-esteem, and criterion measures. Included implicit self-esteem measures were either popular or provided methodological improvements upon older methods. Criterion measures were sampled from previous research on implicit self-esteem and included self-report and independent ratings. In Study 2, Americans (N = 582) completed a shorter battery of these same types of measures under either a control condition, an explicit prime meant to activate the self-concept in a particular context, or a prime meant to activate self-competence related implicit attitudes. Across both studies, explicit self-esteem measures far outperformed implicit self-esteem measures in all cultural groups and under all experimental manipulations. Implicit self-esteem measures are not valid for individual or cross-cultural comparisons. We speculate that individuals may not form implicit associations with the self as an attitudinal object. © 2013 Wiley Periodicals, Inc.

  19. Measurement of alienation among adolescents: construct validity of three scales on powerlessness, meaninglessness and social isolation.

    PubMed

    Rayce, Signe Boe; Kreiner, Svend; Damsgaard, Mogens Trab; Nielsen, Tine; Holstein, Bjørn Evald

    2017-01-01

    Psychological alienation is an important concept in the study of adolescents' health and behavior but no gold standard for measuring alienation among adolescents exists. There is a need for new scales with high validity for use in adolescent health and social research. The purpose of the present study was to develop and validate alienation scales in accordance with Seeman's conceptualization of alienation focusing on three independent variants specifically relevant in adolescent health research: powerlessness, meaninglessness and social isolation. Cross-sectional data from 3083 adolescents aged 13 to 15 years from the Danish contribution to the cross-national study Health Behaviour in School-aged Children (HBSC) were used. We identified and developed items, addressed content and face validity through interviews, and examined the criterion-related construct validity of the scales using graphical loglinear Rasch models (GLLRM). The three scales each comprised three to five face valid items. The powerlessness scale reflected the adolescent's expectancy as to whether his/her behavior can determine the outcome or reinforcement he/she seeks. The meaninglessness scale reflected the expectancy as to whether satisfactory predictions regarding the effects of one's behavior are possible. Finally, the social isolation scale reflected whether the adolescent had a low expectancy for inclusion and social acceptance. All scales contained some uniform local dependency and differential item functioning, but only to a limited degree, which could be accounted for using GLLRM. Thus the scales fitted GLLRMs and can therefore be considered to be essentially construct valid and essentially objective. The three alienation scales appear to be content and face valid and fulfill the psychometric properties of a good construct valid reflective scale. This suggests that the scales may be appropriate in future large-scale surveys to examine the relation between alienation and a range of

  20. 22 CFR 42.72 - Validity of visas.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 22 Foreign Relations 1 2012-04-01 2012-04-01 false Validity of visas. 42.72 Section 42.72 Foreign Relations DEPARTMENT OF STATE VISAS VISAS: DOCUMENTATION OF IMMIGRANTS UNDER THE IMMIGRATION AND NATIONALITY ACT, AS AMENDED Issuance of Immigrant Visas § 42.72 Validity of visas. (a) Period of validity. With...

  1. 22 CFR 42.72 - Validity of visas.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... 22 Foreign Relations 1 2011-04-01 2011-04-01 false Validity of visas. 42.72 Section 42.72 Foreign Relations DEPARTMENT OF STATE VISAS VISAS: DOCUMENTATION OF IMMIGRANTS UNDER THE IMMIGRATION AND NATIONALITY ACT, AS AMENDED Issuance of Immigrant Visas § 42.72 Validity of visas. (a) Period of validity. With...

  2. 22 CFR 42.72 - Validity of visas.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 22 Foreign Relations 1 2013-04-01 2013-04-01 false Validity of visas. 42.72 Section 42.72 Foreign Relations DEPARTMENT OF STATE VISAS VISAS: DOCUMENTATION OF IMMIGRANTS UNDER THE IMMIGRATION AND NATIONALITY ACT, AS AMENDED Issuance of Immigrant Visas § 42.72 Validity of visas. (a) Period of validity. With...

  3. 22 CFR 42.72 - Validity of visas.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 22 Foreign Relations 1 2014-04-01 2014-04-01 false Validity of visas. 42.72 Section 42.72 Foreign Relations DEPARTMENT OF STATE VISAS VISAS: DOCUMENTATION OF IMMIGRANTS UNDER THE IMMIGRATION AND NATIONALITY ACT, AS AMENDED Issuance of Immigrant Visas § 42.72 Validity of visas. (a) Period of validity. With...

  4. 22 CFR 51.4 - Validity of passports.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... 22 Foreign Relations 1 2011-04-01 2011-04-01 false Validity of passports. 51.4 Section 51.4 Foreign Relations DEPARTMENT OF STATE NATIONALITY AND PASSPORTS PASSPORTS General § 51.4 Validity of passports. (a) Signature of bearer. A passport book is valid only when signed by the bearer in the space...

  5. 22 CFR 51.4 - Validity of passports.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 22 Foreign Relations 1 2013-04-01 2013-04-01 false Validity of passports. 51.4 Section 51.4 Foreign Relations DEPARTMENT OF STATE NATIONALITY AND PASSPORTS PASSPORTS General § 51.4 Validity of passports. (a) Signature of bearer. A passport book is valid only when signed by the bearer in the space...

  6. 22 CFR 51.4 - Validity of passports.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 22 Foreign Relations 1 2014-04-01 2014-04-01 false Validity of passports. 51.4 Section 51.4 Foreign Relations DEPARTMENT OF STATE NATIONALITY AND PASSPORTS PASSPORTS General § 51.4 Validity of passports. (a) Signature of bearer. A passport book is valid only when signed by the bearer in the space...

  7. 22 CFR 51.4 - Validity of passports.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... 22 Foreign Relations 1 2010-04-01 2010-04-01 false Validity of passports. 51.4 Section 51.4 Foreign Relations DEPARTMENT OF STATE NATIONALITY AND PASSPORTS PASSPORTS General § 51.4 Validity of passports. (a) Signature of bearer. A passport book is valid only when signed by the bearer in the space...

  8. 22 CFR 51.4 - Validity of passports.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 22 Foreign Relations 1 2012-04-01 2012-04-01 false Validity of passports. 51.4 Section 51.4 Foreign Relations DEPARTMENT OF STATE NATIONALITY AND PASSPORTS PASSPORTS General § 51.4 Validity of passports. (a) Signature of bearer. A passport book is valid only when signed by the bearer in the space...

  9. Assessment of radiation-induced xerostomia: validation of the Italian version of the xerostomia questionnaire in head and neck cancer patients.

    PubMed

    Pellegrino, Federica; Groff, Elena; Bastiani, Luca; Fattori, Bruno; Sotti, Guido

    2015-04-01

    Xerostomia is the most common acute and late side effect of radiation treatment for head and neck cancer. Affecting taste perception, chewing, swallowing and speech, xerostomia is also the major cause of decreased quality of life. The aims of this study were to validate the Italian translation of the self-reported eight-item xerostomia questionnaire (XQ) and determine its psychometric properties in patients treated with radiotherapy for head and neck cancer. An observational cross-sectional study was conducted in the Radiotherapy Unit of the Veneto Institute of Oncology - IOV in Padua. The XQ was translated according to international guidelines and filled out by 102 patients. Construct validity was assessed using principal component analysis, internal consistency using Cronbach's α coefficient and test-retest reliability at 1-month interval using the intraclass correlation coefficient (ICC). Criterion-related validity was evaluated to compare the Italian version of XQ with the European Organization for Research and Treatment of Cancer (EORTC) Core Quality-of-Life Questionnaire (QLQ-C30) and its Head and Neck Cancer Module (QLQ-H&N35). Cronbach's α for the Italian version of XQ was strong at α = 0.93, test-retest reliability was also strong (0.79) and factor analysis confirmed that the questionnaire was one-dimensional. Criterion-related validity was excellent with high association with the EORTC QLQ-H&N35 xerostomia and sticky saliva scales. The Italian version of XQ has excellent psychometric properties and can be used to evaluate the impact of emerging radiation delivery techniques aiming at preventing xerostomia.
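
    The internal-consistency statistic reported above, Cronbach's α, can be illustrated with a minimal sketch. The function name and the toy response matrix below are hypothetical and are not taken from the study; the code only shows the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores), using sample variances.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Hypothetical helper for illustration; uses sample (n-1) variances.
    """
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # transpose: one tuple per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Toy data (invented): 5 respondents answering 4 items.
example = [
    [3, 4, 3, 4],
    [1, 2, 1, 1],
    [4, 4, 5, 4],
    [2, 2, 2, 3],
    [5, 4, 5, 5],
]
alpha = cronbach_alpha(example)
print(round(alpha, 3))
```

    Values close to 1 indicate that the items vary together; the study itself reports α = 0.93 for the eight-item XQ.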

  10. Development of a framework for international certification by OIE of diagnostic tests validated as fit for purpose.

    PubMed

    Wright, P; Edwards, S; Diallo, A; Jacobson, R

    2006-01-01

    Historically, the OIE has focused on test methods applicable to trade and the international movement of animals and animal products. With its expanding role as the World Organisation for Animal Health, the OIE has recognised the need to evaluate test methods relative to specific diagnostic applications other than trade. In collaboration with its international partners, the OIE solicited input from experts through consultants' meetings on the development of guidelines for validation and certification of diagnostic assays for infectious animal diseases. Recommendations from the first meeting were formally adopted and have subsequently been acted upon by the OIE. A validation template has been developed that specifically requires a test to be fit or suited for its intended purpose (e.g. as a screening or a confirmatory test). This is a key criterion for validation. The template incorporates four distinct stages of validation, each of which has bearing on the evaluation of fitness for purpose. The OIE has just recently created a registry for diagnostic tests that fulfil these validation requirements. Assay developers are invited to submit validation dossiers to the OIE for evaluation by a panel of experts. Recognising that validation is an incremental process, test methods achieving at least the first stages of validation may be provisionally accepted. To provide additional confidence in assay performance, the OIE, through its network of Reference Laboratories, has embarked on the development of evaluation panels. These panels would contain specially selected test samples that would assist in verifying fitness for purpose.

  11. Validation of the mothers object relations scales in 2–4 year old children and comparison with the child–parent relationship scale

    PubMed Central

    2013-01-01

    Background The quality of the parent–child relationship has an important effect on a wide range of child outcomes. The evaluation of interventions to promote healthy parenting and family relationships is dependent on outcome measures which can quantify the quality of parent–child relationships. Between the Mothers’ Object Relations – Short Form (MORS-SF) scale for babies and the Child–parent Relationship Scale (C-PRS) there is an age gap where no validated scales are available. We report the development and testing of an adaptation of the MORS-SF; the MORS (Child) scale and its use in children from the age of 2 years to 4 years. This scale aims to capture the nature of the parent–child relationship in a form which is short enough to be used in population surveys and intervention evaluations. Methods Construct and criterion validity, item salience and internal consistency were assessed in a sample of 166 parents of children aged 2–4 years old and compared with that of the C-PRS. The performance of the MORS (Child) as part of a composite measure with the HOME inventory was compared with that of the C-PRS using data collected in a randomised controlled trial and the national evaluation of Sure Start. Results MORS (Child) performed well in children aged 2–4 with high construct and criterion validity, item salience and internal consistency. One item in the C-PRS failed to load on either subscale and parents found this scale slightly more difficult to complete than the MORS (Child). The two measures performed very similarly in a factor analysis with the HOME inventory producing almost identical loadings. Conclusions Adapting the MORS-SF for children aged 2–4 years old produces a scale to assess parent–child relationships that is easy to use and outperforms the more commonly used C-PRS in several respects. PMID:23518176

  12. Concurrent Validity and Classification Accuracy of Curriculum-Based Measurement for Written Expression

    ERIC Educational Resources Information Center

    Furey, William M.; Marcotte, Amanda M.; Hintze, John M.; Shackett, Caroline M.

    2016-01-01

    The study presents a critical analysis of written expression curriculum-based measurement (WE-CBM) metrics derived from 3- and 10-min test lengths. Criterion validity and classification accuracy were examined for Total Words Written (TWW), Correct Writing Sequences (CWS), Percent Correct Writing Sequences (%CWS), and Correct Minus Incorrect…

  13. Utility of ultrasound for body fat assessment: validity and reliability compared to a multicompartment criterion.

    PubMed

    Smith-Ryan, Abbie E; Blue, Malia N M; Trexler, Eric T; Hirsch, Katie R

    2018-03-01

    Measurement of body composition to assess health risk and prevention is expanding. Accurate portable techniques are needed to facilitate use in clinical settings. This study evaluated the accuracy and repeatability of a portable ultrasound (US) in comparison with a four-compartment criterion for per cent body fat (%Fat) in overweight/obese adults. Fifty-one participants (mean ± SD; age: 37·2 ± 11·3 years; BMI: 31·6 ± 5·2 kg m⁻²) were measured for %Fat using US (GE Logiq-e) and skinfolds. A subset of 36 participants completed a second day of the same measurements, to determine reliability. US and skinfold %Fat were calculated using the seven-site Jackson-Pollock equation. The Wang 4C model was used as the criterion method for %Fat. Compared to a gold standard criterion, US %Fat (36·4 ± 11·8%; P = 0·001; standard error of estimate [SEE] = 3·5%) was significantly higher than the criterion (33·0 ± 8·0%), but not different than skinfolds (35·3 ± 5·9%; P = 0·836; SEE = 4·5%). US resulted in good reliability, with no significant differences from Day 1 (39·95 ± 15·37%) to Day 2 (40·01 ± 15·42%). Relative consistency was 0·96, and standard error of measure was 0·94%. Although US overpredicted %Fat compared to the criterion, a moderate SEE for US is suggestive of a practical assessment tool in overweight individuals. %Fat differences reported from these field-based techniques are less than reported by other single-measurement laboratory methods and therefore may have utility in a clinical setting. This technique may also accurately track changes. © 2016 Scandinavian Society of Clinical Physiology and Nuclear Medicine. Published by John Wiley & Sons Ltd.

  14. Development and validation of health related quality of life questionnaire (Indian scenario) in diabetic foot ulcer patients.

    PubMed

    Kateel, Ramya; Augustine, Alfred J; Ullal, Sheetal; Prabhu, Shivananda; Bhat, Rahul; Adhikari, Prabha

    2017-12-01

    To develop and validate a Health Related Quality of Life Questionnaire in Diabetic Foot Ulcer Patients (HRQLQDFU) for the Indian scenario. This study was conducted in two phases. The first phase was development of the HRQLQDFU, which included literature search and expert interview. The second phase was validation of the HRQLQDFU, which included face validation, content validation and construct validation. Face validation was done by ten diabetic foot ulcer patients, ten practicing nurses and ten care givers. They were asked to read and respond to the questionnaire and report any difficulty in understanding the questions. Further, they were asked to add any item to the questionnaire which according to them has a significant effect on quality of life. Content validation was done by six subject experts who judged the content relevance of the questionnaire with scores ranging from zero to four; zero being least relevant and four being most relevant. A content validity index was calculated for each question. Questions with a content validity index ≥ 0.8 were selected for the study. Reliability was tested by calculating Cronbach's alpha. In the development phase a questionnaire containing 37 questions with six domains was developed. None of the patients had difficulty in understanding the questions. After content validation a new questionnaire containing 20 questions was developed. Cronbach's alpha was 0.86, which shows good reliability. The new health related quality of life questionnaire on diabetic foot ulcer patients for an Indian scenario is validated and can be a reliable measure of quality of life in diabetic foot ulcer patients. Copyright © 2017 Diabetes India. Published by Elsevier Ltd. All rights reserved.
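
    The item-selection step described above (six experts, ratings from zero to four, items kept when the content validity index is at least 0.8) can be sketched as follows. The abstract does not say how the index was computed; this sketch assumes the common item-level definition, the proportion of experts who rate an item as relevant, and assumes that a rating of 3 or 4 counts as relevant. The ratings and question labels are invented.

```python
def item_cvi(ratings, relevant_from=3):
    """Proportion of experts rating the item as relevant.

    Assumption: ratings >= relevant_from (3 on the 0-4 scale) count as
    "relevant"; the source abstract does not state its rule.
    """
    return sum(r >= relevant_from for r in ratings) / len(ratings)


# Hypothetical ratings from six experts for three questions.
ratings_by_item = {
    "Q1": [4, 4, 3, 4, 3, 4],
    "Q2": [2, 3, 1, 4, 2, 3],
    "Q3": [4, 3, 3, 2, 4, 4],
}

cvi = {q: item_cvi(r) for q, r in ratings_by_item.items()}
retained = [q for q, v in cvi.items() if v >= 0.8]   # threshold from the abstract
print(cvi, retained)
```

    Under these assumptions, with six raters an item passes the 0.8 threshold only if at least five of them rate it as relevant.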

  15. Item Development and Validity Testing for a Self- and Proxy Report: The Safe Driving Behavior Measure

    PubMed Central

    Classen, Sherrilene; Winter, Sandra M.; Velozo, Craig A.; Bédard, Michel; Lanford, Desiree N.; Brumback, Babette; Lutz, Barbara J.

    2010-01-01

    OBJECTIVE We report on item development and validity testing of a self-report older adult safe driving behaviors measure (SDBM). METHOD On the basis of theoretical frameworks (Precede–Proceed Model of Health Promotion, Haddon’s matrix, and Michon’s model), existing driving measures, and previous research and guided by measurement theory, we developed items capturing safe driving behavior. Item development was further informed by focus groups. We established face validity using peer reviewers and content validity using expert raters. RESULTS Peer review indicated acceptable face validity. Initial expert rater review yielded a scale content validity index (CVI) rating of 0.78, with 44 of 60 items rated ≥0.75. Sixteen unacceptable items (≤0.5) required major revision or deletion. The next CVI scale average was 0.84, indicating acceptable content validity. CONCLUSION The SDBM has relevance as a self-report to rate older drivers. Future pilot testing of the SDBM comparing results with on-road testing will define criterion validity. PMID:20437917

  16. Developing a contributing factor classification scheme for Rasmussen's AcciMap: Reliability and validity evaluation.

    PubMed

    Goode, N; Salmon, P M; Taylor, N Z; Lenné, M G; Finch, C F

    2017-10-01

    One factor potentially limiting the uptake of Rasmussen's (1997) Accimap method by practitioners is the lack of a contributing factor classification scheme to guide accident analyses. This article evaluates the intra- and inter-rater reliability and criterion-referenced validity of a classification scheme developed to support the use of Accimap by led outdoor activity (LOA) practitioners. The classification scheme has two levels: the system level describes the actors, artefacts and activity context in terms of 14 codes; the descriptor level breaks the system level codes down into 107 specific contributing factors. The study involved 11 LOA practitioners using the scheme on two separate occasions to code a pre-determined list of contributing factors identified from four incident reports. Criterion-referenced validity was assessed by comparing the codes selected by LOA practitioners to those selected by the method creators. Mean intra-rater reliability scores at the system (M = 83.6%) and descriptor (M = 74%) levels were acceptable. Mean inter-rater reliability scores were not consistently acceptable for both coding attempts at the system level (M_T1 = 68.8%; M_T2 = 73.9%), and were poor at the descriptor level (M_T1 = 58.5%; M_T2 = 64.1%). Mean criterion-referenced validity scores at the system level were acceptable (M_T1 = 73.9%; M_T2 = 75.3%). However, they were not consistently acceptable at the descriptor level (M_T1 = 67.6%; M_T2 = 70.8%). Overall, the results indicate that the classification scheme does not currently satisfy reliability and validity requirements, and that further work is required. The implications for the design and development of contributing factors classification schemes are discussed. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Trait and state anxiety across academic evaluative contexts: development and validation of the MTEA-12 and MSEA-12 scales.

    PubMed

    Sotardi, Valerie A

    2018-05-01

    Educational measures of anxiety focus heavily on students' experiences with tests yet overlook other assessment contexts. In this research, two brief multiscale questionnaires were developed and validated to measure trait evaluation anxiety (MTEA-12) and state evaluation anxiety (MSEA-12) for use in various assessment contexts in non-clinical, educational settings. The research included a cross-sectional analysis of self-report data using authentic assessment settings in which evaluation anxiety was measured. Instruments were tested using a validation sample of 241 first-year university students in New Zealand. Scale development included component structures for state and trait scales based on existing theoretical frameworks. Analyses using confirmatory factor analysis and descriptive statistics indicate that the scales are reliable and structurally valid. Multivariate general linear modeling using subscales from the MTEA-12, MSEA-12, and student grades suggests adequate criterion-related validity. Initial predictive validity was indicated in that one relevant MTEA-12 factor explained between 21% and 54% of the variance in three MSEA-12 factors. Results document MTEA-12 and MSEA-12 as reliable measures of trait and state dimensions of evaluation anxiety for test and writing contexts. Initial estimates suggest that the scales have promising validity, and recommendations for further validation are outlined.

  18. Transformational and transactional leadership: a meta-analytic test of their relative validity.

    PubMed

    Judge, Timothy A; Piccolo, Ronald F

    2004-10-01

    This study provided a comprehensive examination of the full range of transformational, transactional, and laissez-faire leadership. Results (based on 626 correlations from 87 sources) revealed an overall validity of .44 for transformational leadership, and this validity generalized over longitudinal and multisource designs. Contingent reward (.39) and laissez-faire (-.37) leadership had the next highest overall relations; management by exception (active and passive) was inconsistently related to the criteria. Surprisingly, there were several criteria for which contingent reward leadership had stronger relations than did transformational leadership. Furthermore, transformational leadership was strongly correlated with contingent reward (.80) and laissez-faire (-.65) leadership. Transformational and contingent reward leadership generally predicted criteria controlling for the other leadership dimensions, although transformational leadership failed to predict leader job performance. (c) 2004 APA, all rights reserved

  19. Manual unloading of the lumbar spine: can it identify immediate responders to mechanical traction in a low back pain population? A study of reliability and criterion referenced predictive validity

    PubMed Central

    Swanson, Brian T.; Riley, Sean P.; Cote, Mark P.; Leger, Robin R.; Moss, Isaac L.; Carlos, John

    2016-01-01

    Background To date, no research has examined the reliability or predictive validity of manual unloading tests of the lumbar spine to identify potential responders to lumbar mechanical traction. Purpose To determine: (1) the intra- and inter-rater reliability of a manual unloading test of the lumbar spine and (2) the criterion referenced predictive validity for the manual unloading test. Methods Ten volunteers with low back pain (LBP) underwent a manual unloading test to establish reliability. In a separate procedure, 30 consecutive patients with LBP (age 50·86±11·51) were assessed for pain in their most provocative standing position (visual analog scale (VAS) 49·53±25·52 mm). Patients were assessed with a manual unloading test in their most provocative position followed by a single application of intermittent mechanical traction. Post traction, pain in the provocative position was reassessed and utilized as the outcome criterion. Results The test of unloading demonstrated substantial intra- and inter-rater reliability (K = 1·00, P = 0·002 and K = 0·737, P = 0·001, respectively). There were statistically significant within-group differences for pain response following traction for patients with a positive manual unloading test (P<0·001), while patients with a negative manual unloading test did not demonstrate a statistically significant change (P>0·05). There were significant between-group differences for proportion of responders to traction based on manual unloading response (P = 0·031), and manual unloading response demonstrated a moderate to strong relationship with traction response (Phi = 0·443, P = 0·015). Discussion and conclusion The manual unloading test appears to be a reliable test and has a moderate to strong correlation with pain relief that exceeds minimal clinically important difference (MCID) following traction, supporting the validity of this test. PMID:27559274

  20. The Individualized Classroom Assessment Scoring System (inCLASS): Preliminary Reliability and Validity of a System for Observing Preschoolers’ Competence in Classroom Interactions

    PubMed Central

    Downer, Jason T.; Booren, Leslie M.; Lima, Olivia K.; Luckner, Amy E.; Pianta, Robert C.

    2012-01-01

    This paper introduces the Individualized Classroom Assessment Scoring System (inCLASS), an observation tool that targets children’s interactions in preschool classrooms with teachers, peers, and tasks. In particular, initial evidence is reported of the extent to which the inCLASS meets the following psychometric criteria: inter-rater reliability, normal distributions and adequate range, construct validity, and criterion-related validity. These initial findings suggest that the inCLASS has the potential to provide an authentic, contextualized assessment of young children’s classroom behaviors. Future directions for research with the inCLASS are discussed. PMID:23175598

  1. Psychometric examination and factorial validity of the Exercise Dependence Scale-Revised in Italian exercisers.

    PubMed

    Costa, Sebastiano; Cuzzocrea, Francesca; Hausenblas, Heather A; Larcan, Rosalba; Oliva, Patrizia

    2012-12-01

    Background and aims The purpose of this study was to verify the factorial structure, internal validity, reliability, and criterion validity of the 21-item Exercise Dependence Scale-Revised (EDS-R) in an Italian sample. Methods Italian voluntary (N = 519) users of gyms who had a history of regular exercise for over a year completed the EDS-R and measures of exercise frequency. Results and conclusions Confirmatory factor analyses demonstrated a good fit to the hypothesized 7-factor model, and adequate internal consistency for the scale was evidenced. Criterion validity was evidenced by significant correlations between all the subscales of the EDS and exercise frequency. Finally, individuals at risk for exercise dependence reported more exercise behavior compared to the nondependent-symptomatic and nondependent-asymptomatic groups. These results suggest that the seven subscales of the Italian version of the EDS are measuring the construct of exercise dependence as defined by the DSM-IV criteria for substance dependence and also confirm previous research using the EDS-R in other languages. More research is needed to examine the psychometric properties of the EDS-R in diverse populations with various research designs.

  2. Development of a Valid and Reliable Knee Articular Cartilage Condition-Specific Study Methodological Quality Score.

    PubMed

    Harris, Joshua D; Erickson, Brandon J; Cvetanovich, Gregory L; Abrams, Geoffrey D; McCormick, Frank M; Gupta, Anil K; Verma, Nikhil N; Bach, Bernard R; Cole, Brian J

    2014-02-01

    Condition-specific questionnaires are important components in evaluation of outcomes of surgical interventions. No condition-specific study methodological quality questionnaire exists for evaluation of outcomes of articular cartilage surgery in the knee. To develop a reliable and valid knee articular cartilage-specific study methodological quality questionnaire. Cross-sectional study. A stepwise, a priori-designed framework was created for development of a novel questionnaire. Relevant items to the topic were identified and extracted from a recent systematic review of 194 investigations of knee articular cartilage surgery. In addition, relevant items from existing generic study methodological quality questionnaires were identified. Items for a preliminary questionnaire were generated. Redundant and irrelevant items were eliminated, and acceptable items modified. The instrument was pretested and items weighed. The instrument, the MARK score (Methodological quality of ARticular cartilage studies of the Knee), was tested for validity (criterion validity) and reliability (inter- and intraobserver). A 19-item, 3-domain MARK score was developed. The 100-point scale score demonstrated face validity (focus group of 8 orthopaedic surgeons) and criterion validity (strong correlation to Cochrane Quality Assessment score and Modified Coleman Methodology Score). Interobserver reliability for the overall score was good (intraclass correlation coefficient [ICC], 0.842), and for all individual items of the MARK score, acceptable to perfect (ICC, 0.70-1.000). Intraobserver reliability ICC assessed over a 3-week interval was strong for 2 reviewers (≥0.90). The MARK score is a valid and reliable knee articular cartilage condition-specific study methodological quality instrument. This condition-specific questionnaire may be used to evaluate the quality of studies reporting outcomes of articular cartilage surgery in the knee.

  3. A novel cuffless device for self-measurement of blood pressure: concept, performance and clinical validation.

    PubMed

    Boubouchairopoulou, N; Kollias, A; Chiu, B; Chen, B; Lagou, S; Anestis, P; Stergiou, G S

    2017-07-01

    A pocket-size cuffless electronic device for self-measurement of blood pressure (BP) has been developed (Freescan, Maisense Inc., Zhubei, Taiwan). The device estimates BP within 10 s using three embedded electrodes and one force sensor that is applied over the radial pulse to evaluate the pulse wave. Before use, basic anthropometric characteristics are recorded on the device, and individualized initial calibration is required based on a standard BP measurement performed using an upper-arm BP monitor. The device performance in providing valid BP readings was evaluated in 313 normotensive and hypertensive adults in three study phases during which the device sensor was upgraded. A formal validation study of a prototype device against mercury sphygmomanometer was performed according to the American National Standards Institute/Association for the Advancement of Medical Instrumentation/International Organization for Standardization (ANSI/AAMI/ISO) 2013 protocol. The test device succeeded in obtaining a valid BP measurement (three successful readings within up to five attempts) in 55-72% of the participants, which reached 87% with device sensor upgrade. For the validation study, 125 adults were recruited and 85 met the protocol requirements for inclusion. The mean device-observers BP difference was 3.2±6.7 (s.d.) mm Hg for systolic and 2.6±4.6 mm Hg for diastolic BP (criterion 1). The estimated s.d. (inter-subject variability) were 5.83 and 4.17 mm Hg respectively (criterion 2). These data suggest that this prototype cuffless BP monitor provides valid self-measurements in the vast majority of adults, and satisfies the BP measurement accuracy criteria of the ANSI/AAMI/ISO 2013 validation protocol.
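
    The "criterion 1" figures quoted above are the mean and standard deviation of the paired device-minus-observer differences. A minimal sketch of that calculation follows. The function name and the five toy readings are invented, and the default limits (mean difference within 5 mm Hg, s.d. at most 8 mm Hg) are the commonly cited criterion 1 thresholds, stated here as an assumption rather than quoted from the abstract; a real validation pools many paired readings across all subjects.

```python
from statistics import mean, stdev


def criterion1(device, reference, max_mean=5.0, max_sd=8.0):
    """Mean and s.d. of device-minus-reference differences, plus a pass flag.

    max_mean and max_sd are assumed limits (mm Hg); check the protocol text
    before relying on them.
    """
    diffs = [d - r for d, r in zip(device, reference)]
    m, sd = mean(diffs), stdev(diffs)
    return m, sd, abs(m) <= max_mean and sd <= max_sd


# Toy systolic readings in mm Hg (invented for illustration).
device_sbp = [122, 135, 118, 141, 129]
observer_sbp = [120, 130, 121, 138, 125]

m, sd, passes = criterion1(device_sbp, observer_sbp)
print(f"mean diff = {m:.1f}, s.d. = {sd:.2f}, passes = {passes}")
```

    Applied to the reported systolic result (3.2 ± 6.7 mm Hg), both values fall inside these assumed limits, which is consistent with the abstract's conclusion.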

  4. A criterion for maximum resin flow in composite materials curing process

    NASA Astrophysics Data System (ADS)

    Lee, Woo I.; Um, Moon-Kwang

    1993-06-01

    On the basis of Springer's resin flow model, a criterion for maximum resin flow in autoclave curing is proposed. Validity of the criterion was proved for two resin systems (Fiberite 976 and Hercules 3501-6 epoxy resin). The parameter required for the criterion can be easily estimated from the measured resin viscosity data. The proposed criterion can be used in establishing the proper cure cycle to ensure maximum resin flow and, thus, the maximum compaction.

  5. A reliable and valid questionnaire was developed to measure computer vision syndrome at the workplace.

    PubMed

    Seguí, María del Mar; Cabrero-García, Julio; Crespo, Ana; Verdú, José; Ronda, Elena

    2015-06-01

    To design and validate a questionnaire to measure visual symptoms related to exposure to computers in the workplace. Our computer vision syndrome questionnaire (CVS-Q) was based on a literature review and validated through discussion with experts and performance of a pretest, pilot test, and retest. Content validity was evaluated by occupational health, optometry, and ophthalmology experts. Rasch analysis was used in the psychometric evaluation of the questionnaire. Criterion validity was determined by calculating the sensitivity and specificity, receiver operator characteristic curve, and cutoff point. Test-retest repeatability was tested using the intraclass correlation coefficient (ICC) and concordance by Cohen's kappa (κ). The CVS-Q was developed with wide consensus among experts and was well accepted by the target group. It assesses the frequency and intensity of 16 symptoms using a single rating scale (symptom severity) that fits the Rasch rating scale model well. The questionnaire has sensitivity and specificity over 70% and achieved good test-retest repeatability both for the scores obtained [ICC = 0.802; 95% confidence interval (CI): 0.673, 0.884] and CVS classification (κ = 0.612; 95% CI: 0.384, 0.839). The CVS-Q has acceptable psychometric properties, making it a valid and reliable tool to control the visual health of computer workers, and can potentially be used in clinical trials and outcome research. Copyright © 2015 Elsevier Inc. All rights reserved.
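
    Criterion validity in the sense used above (sensitivity and specificity at a cutoff point) can be sketched as follows. The scores, reference diagnoses, and the cutoff of 6 are invented for illustration; the abstract does not report the CVS-Q cutoff, and the function name is hypothetical.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Classify score >= cutoff as positive and compare with the reference."""
    tp = fn = tn = fp = 0
    for score, truth in zip(scores, has_condition):
        positive = score >= cutoff
        if truth and positive:
            tp += 1          # condition present, test positive
        elif truth:
            fn += 1          # condition present, test negative
        elif positive:
            fp += 1          # condition absent, test positive
        else:
            tn += 1          # condition absent, test negative
    return tp / (tp + fn), tn / (tn + fp)


# Toy questionnaire totals and reference diagnoses (invented).
scores = [2, 6, 7, 8, 3, 5, 9, 1]
has_condition = [False, False, True, True, False, True, True, False]

sens, spec = sensitivity_specificity(scores, has_condition, cutoff=6)
print(sens, spec)
```

    Repeating the calculation over a range of cutoffs gives the points of a receiver operating characteristic curve, from which a cutoff can then be chosen.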

  6. Standards for Evaluating Criterion-Referenced Tests.

    ERIC Educational Resources Information Center

    Walker, Clinton B.

    Standards for evaluating criterion-referenced tests are presented. Twenty-one standards, grouped in three categories, are discussed. Category one is defined as measurement properties and is comprised of conceptual validity, including description of the domain, test item agreement with objectives, and item representativeness of the objectives; and…

  7. Validity, discriminative ability, and reliability of the hearing-related quality of life questionnaire for adolescents.

    PubMed

    Rachakonda, Tara; Jeffe, Donna B; Shin, Jennifer J; Mankarious, Leila; Fanning, Robert J; Lesperance, Marci M; Lieu, Judith E C

    2014-02-01

    The prevalence of hearing loss (HL) in adolescents has grown over the past decade, but hearing-related quality of life (QOL) has not been well-measured. We sought to develop a reliable, valid measure of hearing-related QOL for adolescents, the Hearing Environments And Reflection on Quality of Life (HEAR-QL). Multisite observational study. Adolescents with HL and siblings without HL were recruited from five centers. Participants completed the HEAR-QL and validated questionnaires measuring generic pediatric QOL (PedsQL), depression and anxiety (RCADS-25), and hearing-related QOL for adults (HHIA) to determine construct and discriminant validity. Participants completed the HEAR-QL 2 weeks later for test-retest reliability. We used exploratory principal components analysis to determine the HEAR-QL factor structure and measured reliability. Sensitivity and specificity of the HEAR-QL, PedsQL, HHIA, and RCADS-25 were assessed. We compared scores on all surveys between those with normal hearing, unilateral, and bilateral HL. A total of 233 adolescents (13-18 years old) participated: 179 with HL, 54 without HL. The original 45-item HEAR-QL was shortened to 28 items after determining factor structure. The resulting HEAR-QL-28 demonstrated excellent reliability (Cronbach's alpha = 0.95) and construct validity (HHIA: r = .845, PedsQL: r = .587; RCADS-25: r = .433). The HEAR-QL-28 displayed excellent discriminant validity, with higher area under the curve (0.932) than the PedsQL (0.597) or RCADS-25 (0.529). Teens with bilateral HL using hearing devices reported worse QOL on the HEAR-QL and HHIA than peers with HL not using devices. The HEAR-QL is a sensitive, reliable, and valid measure of hearing-related QOL for adolescents. Level of evidence: 2b. © 2013 The American Laryngological, Rhinological and Otological Society, Inc.

  8. Assessing Anger Expression: Construct Validity of Three Emotion Expression-Related Measures

    PubMed Central

    Jasinski, Matthew J.; Lumley, Mark A.; Latsch, Deborah V.; Schuster, Erik; Kinner, Ellen; Burns, John W.

    2016-01-01

    Self-report measures of emotional expression are common, but their validity to predict objective emotional expression, particularly of anger, is unclear. We tested the validity of the Anger Expression Inventory (AEI; Spielberger et al., 1985), Emotional Approach Coping Scale (EAC; Stanton, Kirk, Cameron & Danoff-Burg, 2000), and Toronto Alexithymia Scale-20 (TAS-20; Bagby, Taylor, & Parker, 1994) to predict objective anger expression in 95 adults with chronic back pain. Participants attempted to solve a difficult computer maze by following the directions of a confederate who treated them rudely and unjustly. Participants then expressed their feelings for 4 minutes. Blinded raters coded the videos for anger expression, and a software program analyzed expression transcripts for anger-related words. Analyses related each questionnaire to anger expression. The AEI anger-out scale predicted greater anger expression, as expected, but AEI anger-in did not. The EAC emotional processing scale predicted less anger expression, but the EAC emotional expression scale was unrelated to anger expression. Finally, the TAS-20 predicted greater anger expression. Findings support the validity of the AEI anger-out scale but raise questions about the other measures. The assessment of emotional expression by self-report is complex and perhaps confounded by general emotional experience, the specificity or generality of the emotion(s) assessed, and self-awareness limitations. Performance-based or clinician-rated measures of emotion expression are needed. PMID:27248355

  9. Evaluating Maintenance Performance: The Development of Graphic Symbolic Substitutes for Criterion Referenced Job Task Performance Tests for Electronic Maintenance. Final Report.

    ERIC Educational Resources Information Center

    Shriver, Edgar L.; Foley, John P., Jr.

    A battery of criterion referenced Job Task Performance Tests (JTPT) was developed because paper and pencil tests of job knowledge and electronic theory had very poor criterion-related or empirical validity with respect to the ability of electronic maintenance men to perform their job. Although the original JTPT required the use of actual…

  10. STR-validator: an open source platform for validation and process control.

    PubMed

    Hansson, Oskar; Gill, Peter; Egeland, Thore

    2014-11-01

    This paper addresses two problems faced when short tandem repeat (STR) systems are validated for forensic purposes: (1) validation is extremely time consuming and expensive, and (2) there is strong consensus about what to validate but not how. The first problem is solved by powerful data processing functions to automate calculations. Utilising an easy-to-use graphical user interface, strvalidator (hereafter referred to as STR-validator) can greatly increase the speed of validation. The second problem is exemplified by a series of analyses, and subsequent comparison with published material, highlighting the need for a common validation platform. If adopted by the forensic community STR-validator has the potential to standardise the analysis of validation data. This would not only facilitate information exchange but also increase the pace at which laboratories are able to switch to new technology. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  11. Polish translation and validation of the Pelvic Organ Prolapse/Urinary Incontinence Sexual Questionnaire, IUGA-Revised (PISQ-IR).

    PubMed

    Grzybowska, Magdalena Emilia; Piaskowska-Cala, Justyna; Wydra, Dariusz Grzegorz

    2017-12-29

    The aim of the study was to translate into Polish the Pelvic Organ Prolapse/Incontinence Sexual Questionnaire, IUGA-Revised (PISQ-IR), which evaluates sexual function in sexually active (SA) and not SA (NSA) women with pelvic floor disorders (PFD), and to validate the Polish version. After translation, back-translation and cognitive interviews, the final version of PISQ-IR was established. The study group included 252 women with PFD (124 NSA and 128 SA). All women underwent clinical evaluation and completed the PISQ-IR. For test-retest reliability, the questionnaire was administered to 99 patients twice at an interval of 2 weeks. The analysis of criterion validity required the subjects to complete self-reported measures. Internal consistency and criterion validity were assessed separately for NSA and SA women for the PISQ-IR subscales. The mean age of the women was 60.9 ± 10.6 years and their mean BMI was 27.9 ± 4.9 kg/m². Postmenopausal women constituted 82.5% of the study group. Urinary incontinence (UI) was diagnosed in 60 women (23.8%), pelvic organ prolapse (POP) in 90 (35.7%), and UI and POP in 102 (40.5%). Fecal incontinence was reported by 45 women (17.9%). The PISQ-IR Polish version proved to have good internal consistency in NSA women (α 0.651 to 0.857) and SA women (α 0.605 to 0.887), and strong reliability in all subscales (Pearson's coefficient 0.759-0.899; p < 0.001). Criterion validity confirmed moderate to strong correlations between PISQ-IR scores and self-reported measures in SA subscales, as well as the SA summary score, and weak to moderate correlations in NSA women. The PISQ-IR Polish version is a valid tool for evaluating sexual function in women with PFD.

  12. Development and psychometric validation of a scale to assess information needs in cardiac rehabilitation: the INCR Tool.

    PubMed

    Ghisi, Gabriela Lima de Melo; Grace, Sherry L; Thomas, Scott; Evans, Michael F; Oh, Paul

    2013-06-01

    To develop and psychometrically validate a tool to assess information needs in cardiac rehabilitation (CR) patients. After a literature search, 60 information items divided into 11 areas of needs were identified. To establish content validity, they were reviewed by an expert panel (N=10). Refined items were pilot-tested in 34 patients on a 5-point Likert-scale from 1 "really not helpful" to 5 "very important". A final version was generated and psychometrically tested in 203 CR patients. Test-retest reliability was assessed via the intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and criterion validity was assessed with regard to patient's education and duration in CR. Five items were excluded after ICC analysis as well as one area of needs. All 10 areas were considered internally consistent (Cronbach's alpha>0.7). Criterion validity was supported by significant differences in mean scores by educational level (p<0.05) and duration in CR (p<0.001). The mean total score was 4.08 ± 0.53. Patients rated safety as their greatest information need. The INCR Tool was demonstrated to have good reliability and validity. This is an appropriate tool for application in clinical and research settings, assessing patients' needs during CR and as part of education programming. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  13. Development of a framework for international certification by the OIE of diagnostic tests validated as fit for purpose.

    PubMed

    Wright, P; Edwards, S; Diallo, A; Jacobson, R

    2007-01-01

    Historically, the OIE has focussed on test methods applicable to trade and the international movement of animals and animal products. With its expanding role as the World Organisation for Animal Health, the OIE has recognised the need to evaluate test methods relative to specific diagnostic applications other than trade. In collaboration with its international partners, the OIE solicited input from experts through consultants' meetings on the development of guidelines for validation and certification of diagnostic assays for infectious animal diseases. Recommendations from the first meeting were formally adopted and have subsequently been acted upon by the OIE. A validation template has been developed that specifically requires a test to be fit or suited for its intended purpose (e.g. as a screening or a confirmatory test). This is a key criterion for validation. The template incorporates four distinct stages of validation, each of which has bearing on the evaluation of fitness for purpose. The OIE has just recently created a registry for diagnostic tests that fulfil these validation requirements. Assay developers are invited to submit validation dossiers to the OIE for evaluation by a panel of experts. Recognising that validation is an incremental process, test methods achieving at least the first stages of validation may be provisionally accepted. To provide additional confidence in assay performance, the OIE, through its network of Reference Laboratories, has embarked on the development of evaluation panels. These panels would contain specially selected test samples that would assist in verifying fitness for purpose.

  14. Validity and reliability of Optojump photoelectric cells for estimating vertical jump height.

    PubMed

    Glatthorn, Julia F; Gouge, Sylvain; Nussbaumer, Silvio; Stauffacher, Simone; Impellizzeri, Franco M; Maffiuletti, Nicola A

    2011-02-01

    Vertical jump is one of the most prevalent acts performed in several sport activities. It is therefore important to ensure that the measurements of vertical jump height made as a part of research or athlete support work have adequate validity and reliability. The aim of this study was to evaluate concurrent validity and reliability of the Optojump photocell system (Microgate, Bolzano, Italy) with force plate measurements for estimating vertical jump height. Twenty subjects were asked to perform maximal squat jumps and countermovement jumps, and flight time-derived jump heights obtained by the force plate were compared with those provided by Optojump, to examine its concurrent (criterion-related) validity (study 1). Twenty other subjects completed the same jump series on 2 different occasions (separated by 1 week), and jump heights of session 1 were compared with session 2, to investigate test-retest reliability of the Optojump system (study 2). Intraclass correlation coefficients (ICCs) for validity were very high (0.997-0.998), even if a systematic difference was consistently observed between force plate and Optojump (-1.06 cm; p < 0.001). Test-retest reliability of the Optojump system was excellent, with ICCs ranging from 0.982 to 0.989, low coefficients of variation (2.7%), and low random errors (±2.81 cm). The Optojump photocell system demonstrated strong concurrent validity and excellent test-retest reliability for the estimation of vertical jump height. We propose the following equation that allows force plate and Optojump results to be used interchangeably: force plate jump height (cm) = 1.02 × Optojump jump height + 0.29. In conclusion, the use of Optojump photoelectric cells is legitimate for field-based assessments of vertical jump height.
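    The conversion equation reported in this abstract can be sketched as a small helper. The function name is hypothetical and the coefficients are taken directly from the abstract; the sketch assumes heights in centimetres and says nothing about the range over which the authors fitted the relationship.

```python
def optojump_to_force_plate(optojump_height_cm: float) -> float:
    """Estimate force-plate jump height (cm) from an Optojump reading (cm).

    Uses the linear equation quoted in the abstract:
        force plate height = 1.02 * Optojump height + 0.29
    The function name is an illustrative assumption, not part of the study.
    """
    return 1.02 * optojump_height_cm + 0.29


if __name__ == "__main__":
    # Hypothetical Optojump readings, in cm.
    for reading in (25.0, 30.0, 40.0):
        converted = optojump_to_force_plate(reading)
        print(f"Optojump {reading:.1f} cm -> force plate {converted:.2f} cm")
```

    Because the slope is close to 1 and the intercept is small, the adjustment amounts to roughly one centimetre at typical jump heights, which matches the systematic difference the abstract describes.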

  15. Minimizing false positive error with multiple performance validity tests: response to Bilder, Sugar, and Hellemann (2014 this issue).

    PubMed

    Larrabee, Glenn J

    2014-01-01

    Bilder, Sugar, and Hellemann (2014 this issue) contend that empirical support is lacking for use of multiple performance validity tests (PVTs) in evaluation of the individual case, differing from the conclusions of Davis and Millis (2014), and Larrabee (2014), who found no substantial increase in false positive rates using a criterion of failure of ≥ 2 PVTs and/or Symptom Validity Tests (SVTs) out of multiple tests administered. Reconsideration of data presented in Larrabee (2014) supports a criterion of ≥ 2 out of up to 7 PVTs/SVTs, as keeping false positive rates close to and in most cases below 10% in cases with bona fide neurologic, psychiatric, and developmental disorders. Strategies to minimize risk of false positive error are discussed, including (1) adjusting individual PVT cutoffs or criterion for number of PVTs failed, for examinees who have clinical histories placing them at risk for false positive identification (e.g., severe TBI, schizophrenia), (2) using the history of the individual case to rule out conditions known to result in false positive errors, (3) using normal performance in domains mimicked by PVTs to show that sufficient native ability exists for valid performance on the PVT(s) that have been failed, and (4) recognizing that as the number of PVTs/SVTs failed increases, the likelihood of valid clinical presentation decreases, with a corresponding increase in the likelihood of invalid test performance and symptom report.

  16. [Validation of a dietary habits questionnaire related to fats and sugars intake].

    PubMed

    Aráuz Hernández, Ana Gladys; Roselló Araya, Marlene; Guzmán Padilla, Sonia; Padilla Vargas, Gioconda

    2008-12-01

    The objective of this study was to design and validate a psychometric tool to measure dietary practices related to the intake of fats and sugars in a sample of overweight and obese adults. Classical test theory was applied. The validated construct was dietary habits, and the following theoretical dimensions were utilized: exclusion, modification, substitution and replacement. These had been previously defined in similar studies conducted in other countries. The tool was validated with 139 adults, males and females, with body mass indexes equal to or higher than 25. Construct validity for each section of the tool was obtained through factor analysis. The final tool was made up of 47 items. Cronbach's Alpha reliability coefficient was 0.948, which indicates a highly satisfactory internal consistency. Using scree plot data and factor analysis of the four proposed theoretical dimensions of behavior, items were fused into two dimensions with a cumulative variance of 58%. These were renamed "elimination" and "modification". Cronbach's Alphas were 0.906 and 0.873, respectively, indicating a high level of reliability for construct measurement. Results show the need to adapt foreign tools to our socio-cultural context before utilizing them in interventions intended to modify dietary patterns, since these are interrelated with other aspects of the culture itself.
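    Several records in this listing report Cronbach's alpha as their internal-consistency statistic. A minimal sketch of the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), is given below. The function name and the response matrix are invented for illustration and do not come from any of the studies listed.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondent rows (one score per item).

    `responses` is a hypothetical input format: each inner list holds one
    respondent's scores on all k items. Sample variances are used throughout.
    """
    k = len(responses[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across respondents.
    item_variances = [variance([row[i] for row in responses]) for i in range(k)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Made-up data: five respondents answering three items.
    data = [
        [2, 3, 3],
        [4, 4, 5],
        [1, 2, 2],
        [3, 3, 4],
        [5, 4, 5],
    ]
    print(f"alpha = {cronbach_alpha(data):.3f}")
```

    Values near 1 indicate that the items vary together; the usual rule of thumb quoted in these abstracts treats values above 0.7 as acceptable.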

  17. Validity of two alternative systems for measuring vertical jump height.

    PubMed

    Leard, John S; Cirillo, Melissa A; Katsnelson, Eugene; Kimiatek, Deena A; Miller, Tim W; Trebincevic, Kenan; Garbalosa, Juan C

    2007-11-01

    Vertical jump height is frequently used by coaches, health care professionals, and strength and conditioning professionals to objectively measure function. The purpose of this study is to determine the concurrent validity of the jump and reach method (Vertec) and the contact mat method (Just Jump) in assessing vertical jump height when compared with the criterion reference 3-camera motion analysis system. Thirty-nine college students, 25 females and 14 males between the ages of 18 and 25 (mean age 20.65 years), were instructed to perform the countermovement jump. Reflective markers were placed at the base of the individual's sacrum for the 3-camera motion analysis system to measure vertical jump height. The subject was then instructed to stand on the Just Jump mat beneath the Vertec and perform the jump. Measurements were recorded from each of the 3 systems simultaneously for each jump. The Pearson r statistic between the video and the jump and reach (Vertec) was 0.906. The Pearson r between the video and contact mat (Just Jump) was 0.967. Both correlations were significant at the 0.01 level. Analysis of variance showed a significant difference among the 3 means F(2,235) = 5.51, p < 0.05. The post hoc analysis showed a significant difference between the criterion reference (M = 0.4369 m) and the Vertec (M = 0.3937 m, p = 0.005) but not between the criterion reference and the Just Jump system (M = 0.4420 m, p = 0.972). The Just Jump method of measuring vertical jump height is a valid measure when compared with the 3-camera system. The Vertec was found to have a high correlation with the criterion reference, but the mean differed significantly. This study indicates that a higher degree of confidence is warranted when comparing Just Jump results with a 3-camera system study.

  18. Development and validation of a new tool to measure Iranian pregnant women's empowerment.

    PubMed

    Borghei, N S; Taghipour, A; Roudsari, R Latifnejad; Keramat, A

    2016-03-15

    Empowering pregnant women improves their health and reduces maternal mortality, but there is a lack of suitable tools to measure women's empowerment in some cultures. This study aimed to design and validate a questionnaire for measuring the dimensions of empowerment among Iranian pregnant women. After a literature review, and face and content validity testing, a 38-item questionnaire was developed and tested on a sample of 161 pregnant women. Factor analysis grouped the items into 3 subscales: educational empowerment (e.g. prenatal training), autonomy (e.g. financial independency and mental ability) and sociopolitical empowerment (e.g. involvement in social and political activities). Criterion validity testing showed a strong positive correlation of the total scale and subscales scores with the Kameda and the Spritzer empowerment scales. Cronbach alpha was 0.92 for total empowerment. A total of 32 items remained in the Self-Structured Pregnancy Empowerment Questionnaire, which is a valid new tool to measure the dimensions of pregnant women's empowerment.

  19. [Validation of a scale to assess the labour quality of life in public hospitals from Tlaxcala].

    PubMed

    Hernández-Vicente, Irma Alejandra; Lumbreras-Guzmán, Marivel; Méndez-Hernández, Pablo; Rojas-Lima, Elodia; Cervantes-Rodríguez, Margarita; Juárez-Flores, Clara Arlina

    2017-01-01

    To validate a scale for assessing the labour quality of life in public hospitals (LQL-PH) from Tlaxcala, Mexico. The instrument was validated among 669 health workers from six hospitals from the Ministry of Health of Tlaxcala, Mexico. Content validity was assessed by expert inquiry, construct validity by factor analysis, criterion validity by comparison with other scales, and reliability with Cronbach's Alpha. The factor analysis uncovered four dimensions: "individual welfare", "conditions and labour environment", "organization", and "well-being accomplished by the work"; reliability was 0.921. Workers who perceived better LQL-PH were those under 50 years old, with a temporary contract, with less seniority in the job, with a daytime weekend work schedule, and with an academic degree. The LQL-PH proved to be a psychometrically valid and reliable instrument. It is advisable to test this scale in other public and private health institutions, as well as its relationship with key health care indicators of labour performance and management.

  20. Further development and construct validation of MMPI-2-RF indices of global psychopathy, fearless-dominance, and impulsive-antisociality in a sample of incarcerated women.

    PubMed

    Phillips, Tasha R; Sellbom, Martin; Ben-Porath, Yossef S; Patrick, Christopher J

    2014-02-01

    Replicating and extending research by Sellbom et al. (M. Sellbom, Y. S. Ben-Porath, C. J. Patrick, D. B. Wygant, D. M. Gartland, & K. P. Stafford, 2012, Development and Construct Validation of the MMPI-2-RF Measures of Global Psychopathy, Fearless-Dominance, and Impulsive-Antisociality, Personality Disorders: Theory, Research, and Treatment, 3, 17-38), the current study examined the criterion-related validity of three self-report indices of psychopathy that were derived from scores on the Minnesota Multiphasic Personality Inventory (MMPI)-2-Restructured Form (MMPI-2-RF; Y. S. Ben-Porath & A. Tellegen, 2008, Minnesota Multiphasic Personality Inventory-2-Restructured Form: Manual for Administration, Scoring, and Interpretation, Minneapolis, MN: University of Minnesota Press). We estimated psychopathy indices by regressing scores from the Psychopathic Personality Inventory (PPI; S. O. Lilienfeld & B. P. Andrews, 1996, Development and Preliminary Validation of a Self-Report Measure of Psychopathic Personality Traits in Noncriminal Populations, Journal of Personality Assessment, 66, 488-524) and its two distinct facets, Fearless-Dominance and Impulsive-Antisociality, onto conceptually selected MMPI-2-RF scales. Data for a newly collected sample of 230 incarcerated women were combined with existing data from Sellbom et al.'s (2012) male correctional and mixed-gender college samples to establish regression equations with optimal generalizability. Correlation and regression analyses were then used to examine associations between the MMPI-2-RF-based estimates of PPI psychopathy and criterion measures (i.e., other well-established measures of psychopathy and conceptually related personality traits), and to evaluate whether gender moderated these associations. The MMPI-2-RF-based psychopathy indices correlated as expected with criterion measures and showed only one significant moderating effect for gender, namely, in the association between psychopathy and narcissism. 
    These…

  1. The Transition Readiness Assessment Questionnaire (TRAQ): its factor structure, reliability, and validity.

    PubMed

    Wood, David L; Sawicki, Gregory S; Miller, M David; Smotherman, Carmen; Lukens-Bull, Katryne; Livingood, William C; Ferris, Maria; Kraemer, Dale F

    2014-01-01

    National consensus statements recommend that providers regularly assess the transition readiness skills of adolescents and young adults (AYA). In 2010 we developed a 29-item version of the Transition Readiness Assessment Questionnaire (TRAQ). We reevaluated item performance and factor structure, and reassessed the TRAQ's reliability and validity. We surveyed youth from 3 academic clinics in Jacksonville, Florida; Chapel Hill, North Carolina; and Boston, Massachusetts. Participants were AYA with special health care needs aged 14 to 21 years. From a convenience sample of 306 patients, we conducted item reduction strategies and exploratory factor analysis (EFA). On a second convenience sample of 221 patients, we conducted confirmatory factor analysis (CFA). Internal reliability was assessed by Cronbach's alpha, and criterion validity analyses were conducted using the Wilcoxon rank sum test and mixed linear models. The item reduction and EFA resulted in a 20-item scale with 5 identified subscales. The CFA conducted on a second sample provided a good fit to the data. The overall scale has high reliability (Cronbach's alpha = .94) and good reliability for 4 of the 5 subscales (Cronbach's alpha ranging from .90 to .77 in the pooled sample). Each of the 5 subscale scores was significantly higher for adolescents aged 18 years and older versus those younger than 18 (P < .0001) in both univariate and multivariate analyses. The 20-item, 5-factor structure for the TRAQ is supported by EFA and CFA on independent samples and has good internal reliability and criterion validity. Additional work is needed to expand or revise the TRAQ subscales and test their predictive validity. Copyright © 2014 Academic Pediatric Association. Published by Elsevier Inc. All rights reserved.

  2. Health Sciences-Evidence Based Practice questionnaire (HS-EBP) for measuring transprofessional evidence-based practice: Creation, development and psychometric validation

    PubMed Central

    Fernández-Domínguez, Juan Carlos; de Pedro-Gómez, Joan Ernest; Morales-Asencio, José Miguel; Sastre-Fullana, Pedro; Sesé-Abad, Albert

    2017-01-01

    Introduction: Most of the EBP measuring instruments available to date present limitations both in the operationalisation of the construct and in the rigour of their psychometric development, as revealed in the literature review performed. The aim of this paper is to provide rigorous and adequate reliability and validity evidence of the scores of a new transdisciplinary psychometric tool, the Health Sciences Evidence-Based Practice (HS-EBP), for measuring the construct EBP in Health Sciences professionals. Methods: A pilot study and a subsequent two-stage validation test sample were conducted to progressively refine the instrument until a reduced 60-item version with a five-factor latent structure was obtained. Reliability was analysed through both Cronbach's alpha coefficient and intraclass correlations (ICC). Latent structure was contrasted using confirmatory factor analysis (CFA) following a model comparison approach. Evidence of criterion validity of the scores obtained was achieved by considering attitudinal resistance to change, burnout, and quality of professional life as criterion variables, while convergent validity was assessed using the Spanish version of the Evidence-Based Practice Questionnaire (EBPQ-19). Results: Adequate evidence of both reliability and ICC was obtained for the five dimensions of the questionnaire. According to the CFA model comparison, the best fit corresponded to the five-factor model (RMSEA = 0.049; CI 90% RMSEA = [0.047; 0.050]; CFI = 0.99). Adequate criterion and convergent validity evidence was also provided. Finally, the HS-EBP showed the capability to find differences between EBP training levels, which is important evidence of decision validity. Conclusions: Reliability and validity evidence obtained regarding the HS-EBP confirm the adequate operationalisation of the EBP construct as a process put into practice to respond to every clinical situation arising in the daily practice of professionals in health sciences (transprofessional). The…

  3. Cross-cultural adaptation and validation of the Italian Psychosocial Impact of Dental Aesthetics Questionnaire (PIDAQ).

    PubMed

    Bucci, Rosaria; Rongo, Roberto; Zito, Eugenio; Galeotti, Angela; Valletta, Rosa; D'Antò, Vincenzo

    2015-03-01

    To validate and cross-culturally adapt the Italian version of the Psychosocial Impact of Dental Aesthetics Questionnaire (PIDAQ) among Italian young adults. After translation, back translation, and cross-cultural adaptation of the English PIDAQ, a first version of the Italian questionnaire was pretested. The final Italian PIDAQ was administered to 598 subjects aged 18-30 years, along with two other instruments: the aesthetic component of the index of orthodontic treatment need (IOTN-AC) and the perception of occlusion scale (POS), which identified the self-reporting grade of malocclusion. Structural validity was assessed by means of factorial analysis, internal consistency was measured with Cronbach's alpha coefficient (α), convergent validity was assessed by means of Spearman correlation, and test-retest reliability was calculated with intra-class correlation coefficient (ICC) and standard measurement error. Criterion validity was evaluated by multivariate and univariate analysis of variance with Bonferroni post hoc tests. The α of the Italian PIDAQ domains ranged between 0.79 and 0.92. The ICC was between 0.81 and 0.90. The mean scores of each PIDAQ domain showed a statistically significant difference when analysed according to the IOTN-AC and POS scores. The satisfactory psychometric properties make PIDAQ a usable tool for future studies on oral health-related quality of life among Italian young adults.

  4. Modeling Group Differences in OLS and Orthogonal Regression: Implications for Differential Validity Studies

    ERIC Educational Resources Information Center

    Kane, Michael T.; Mroch, Andrew A.

    2010-01-01

    In evaluating the relationship between two measures across different groups (i.e., in evaluating "differential validity") it is necessary to examine differences in correlation coefficients and in regression lines. Ordinary least squares (OLS) regression is the standard method for fitting lines to data, but its criterion for optimal fit…

  5. Validation of scores of use of inhalation devices: valoration of errors

    PubMed Central

    Zambelli-Simões, Letícia; Martins, Maria Cleusa; Possari, Juliana Carneiro da Cunha; Carvalho, Greice Borges; Coelho, Ana Carla Carvalho; Cipriano, Sonia Lucena; de Carvalho-Pinto, Regina Maria; Cukier, Alberto; Stelmach, Rafael

    2015-01-01

    Objective: To validate two scores quantifying the ability of patients to use metered dose inhalers (MDIs) or dry powder inhalers (DPIs); to identify the most common errors made during their use; and to identify the patients in need of an educational program for the use of these devices. Methods: This study was conducted in three phases: validation of the reliability of the inhaler technique scores; validation of the contents of the two scores using a convenience sample; and testing for criterion validation and discriminant validation of these instruments in patients who met the inclusion criteria. Results: The convenience sample comprised 16 patients. Interobserver disagreement was found in 19% and 25% of the DPI and MDI scores, respectively. After expert analysis on the subject, the scores were modified and were applied in 72 patients. The most relevant difficulty encountered during the use of both types of devices was the maintenance of total lung capacity after a deep inhalation. The degree of correlation of the scores by observer was 0.97 (p < 0.0001). There was good interobserver agreement in the classification of patients as able/not able to use a DPI (50%/50% and 52%/58%; p < 0.01) and an MDI (49%/51% and 54%/46%; p < 0.05). Conclusions: The validated scores allow the identification and correction of inhaler technique errors during consultations and, as a result, improvement in the management of inhalation devices. PMID:26398751

  6. Validity, sensitivity and specificity of the mentation, behavior and mood subscale of the UPDRS.

    PubMed

    Holroyd, Suzanne; Currie, Lillian J; Wooten, G Frederick

    2008-06-01

    The unified Parkinson's disease rating scale (UPDRS) is the most widely used tool to rate the severity and the stage of Parkinson's disease (PD). However, the mentation, behavior and mood (MBM) subscale of the UPDRS has received little investigation regarding its validity and sensitivity. Three items of this subscale were compared to criterion tests to examine validity, sensitivity and specificity. Ninety-seven patients with idiopathic PD were assessed on the UPDRS. Scores on three items of the MBM subscale, intellectual impairment, thought disorder and depression, were compared to criterion tests, the telephone interview for cognition status (TICS), psychiatric assessment for psychosis and the geriatric depression scale (GDS). Non-parametric tests of association were performed to examine concurrent validity of the MBM items. The sensitivities, specificities and optimal cutoff scores for each MBM item were estimated by receiver operating characteristic (ROC) curve analysis. The MBM items demonstrated low to moderate correlation with the criterion tests, and the sensitivity and specificity were not strong. Even using a score of 7.0 on the items of the MBM demonstrated a sensitivity/specificity of only 0.19/0.48 for intellectual impairment, 0.60/0.72 for thought disorder and 0.61/0.87 for depression. Using a more appropriate cutoff of 2.0 revealed sensitivities of 0.01, 0.38 and 0.13 respectively. The MBM subscale items of intellectual impairment, thought disorder and depression are not appropriate for screening or diagnostic purposes. Tools such as the TICS and the GDS should be considered instead.
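
    The cutoff-based sensitivity and specificity reported in the record above can be illustrated with a minimal sketch. The function name, the "score >= cutoff means screen-positive" rule, and the data are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Return (sensitivity, specificity) for a screening item.

    Assumption: a score at or above `cutoff` counts as screen-positive,
    and `has_condition` holds the criterion-test result for each patient.
    """
    tp = fn = tn = fp = 0
    for score, truth in zip(scores, has_condition):
        positive = score >= cutoff
        if truth and positive:
            tp += 1
        elif truth:
            fn += 1
        elif positive:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case with and one without the condition")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical item scores and criterion-test outcomes for ten patients.
item_scores = [0, 1, 2, 3, 4, 0, 1, 2, 1, 0]
criterion = [False, False, True, True, True, False, True, False, False, False]

sens, spec = sensitivity_specificity(item_scores, criterion, cutoff=2)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

    Repeating the call over every candidate cutoff gives the points of an ROC curve, from which a cutoff can be chosen.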

  7. Development and content validity of a screening instrument for gaming addiction in adolescents: the Gaming Addiction Identification Test (GAIT).

    PubMed

    Vadlin, Sofia; Åslund, Cecilia; Nilsson, Kent W

    2015-08-01

    This study describes the development of a screening tool for gaming addiction in adolescents - the Gaming Addiction Identification Test (GAIT). Its development was based on the research literature on gaming and addiction. An expert panel comprising professional raters (n = 7), experiential adolescent raters (n = 10), and parent raters (n = 10) estimated the content validity of each item (I-CVI) as well as of the whole scale (S-CVI/Ave), and participated in a cognitive interview about the GAIT scale. The mean scores for both I-CVI and S-CVI/Ave ranged between 0.97 and 0.99 compared with the lowest recommended I-CVI value of 0.78 and the S-CVI/Ave value of 0.90. There were no sex differences and no differences between expert groups regarding ratings in content validity. No differences in the overall evaluation of the scale emerged in the cognitive interviews. Our conclusions were that GAIT showed good content validity in capturing gaming addiction. The GAIT needs further investigation into its psychometric properties of construct validity (convergent and divergent validity) and criterion-related validity, as well as its reliability in both clinical settings and in community settings with adolescents. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
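
    The content validity indices named in the record above can be sketched as follows. This assumes the commonly used convention of a 4-point relevance scale on which ratings of 3 or 4 count as "relevant"; the record does not state the rating scale, and the ratings below are hypothetical.

```python
def item_cvi(ratings, relevant_min=3):
    """I-CVI: share of raters scoring the item as relevant.

    Assumption: 4-point relevance scale, with 3 and 4 counted as relevant.
    """
    return sum(r >= relevant_min for r in ratings) / len(ratings)


def scale_cvi_ave(rating_matrix, relevant_min=3):
    """S-CVI/Ave: mean of the I-CVI values across all items."""
    i_cvis = [item_cvi(item, relevant_min) for item in rating_matrix]
    return sum(i_cvis) / len(i_cvis)


# Hypothetical ratings: one row per item, one column per rater.
ratings = [
    [4, 4, 3, 4, 4],
    [4, 3, 2, 4, 4],
    [3, 4, 4, 4, 3],
]

for number, item in enumerate(ratings, start=1):
    print(f"item {number}: I-CVI = {item_cvi(item):.2f}")
print(f"S-CVI/Ave = {scale_cvi_ave(ratings):.2f}")
```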

  8. Feelings about culture scales: development, factor structure, reliability, and validity.

    PubMed

    Maffini, Cara S; Wong, Y Joel

    2015-04-01

    Although measures of cultural identity, values, and behavior exist in the multicultural psychological literature, there is currently no measure that explicitly assesses ethnic minority individuals' positive and negative affect toward culture. Therefore, we developed 2 new measures called the Feelings About Culture Scale--Ethnic Culture and Feelings About Culture Scale--Mainstream American Culture and tested their psychometric properties. In 6 studies, we piloted the measures, conducted factor analyses to clarify their factor structure, and examined reliability and validity. The factor structure revealed 2 dimensions reflecting positive and negative affect for each measure. Results provided evidence for convergent, discriminant, criterion-related, and incremental validity as well as the reliability of the scales. The Feelings About Culture Scales are the first known measures to examine both positive and negative affect toward an individual's ethnic culture and mainstream American culture. The focus on affect captures dimensions of psychological experiences that differ from cognitive and behavioral constructs often used to measure cultural orientation. These measures can serve as a valuable contribution to both research and counseling by providing insight into the nuanced affective experiences ethnic minority individuals have toward culture. (c) 2015 APA, all rights reserved.

  9. Linguistic Validation of the M. D. Anderson Symptom Inventory in Persian-Speaking Iranian Cancer Patients.

    PubMed

    Saadatpour, Leila; Hemati, Simin; Habibi, Farzaneh; Behzadi, Erfan; Hashemi-Jazi, Marsa Sadat; Kheirabadi, Gholamreza; Mirbagher, Leila; Gholamrezaei, Ali

    2015-09-01

    Various symptoms frequently affect cancer patients' quality of life. Appropriate assessment of these symptoms provides valuable data for cancer management. This study aimed to validate the Persian version of the M. D. Anderson Symptom Inventory (MDASI-P). This cross-sectional study was conducted at four cancer treatment centers in two cities in Iran. Breast cancer and colorectal cancer patients aged 18 years and older were consecutively included in the study. The standard forward-backward translation method was applied. Patients completed the MDASI-P along with the previously validated Persian version of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30 (EORTC QLQ-C30). Construct validity (factor analysis), criterion validity (against the EORTC QLQ-C30), and reliability (Cronbach's alpha) were analyzed. A total of 146 breast cancer and 94 colorectal cancer patients were studied. Factor analysis for the symptom severity items resulted in a three-factor solution, further reduced to a two-factor solution: general symptoms and gastrointestinal symptoms. Correlation of the MDASI-P symptom severity items with corresponding EORTC QLQ-C30 symptom items (r = 0.48-0.75) and MDASI-P interference items with corresponding EORTC QLQ-C30 functioning domains (r = -0.46 to -0.23) supported the criterion validity. Cronbach's alpha was 0.90, 0.88, and 0.77 for the total questionnaire, symptom severity items, and the interference subscale, respectively. The MDASI-P is a feasible, valid, and reliable instrument for evaluation of symptoms in Persian-speaking cancer patients and can be used to improve symptom management in these patients. Copyright © 2015 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
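
    Cronbach's alpha, the reliability statistic reported in the record above, can be computed with the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The sketch below is illustrative only; the function name and the response data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items table of scores.

    `rows` holds one list per respondent, one entry per item.
    Sample variances (n - 1 denominator) are used throughout.
    """
    k = len(rows[0])
    if k < 2 or len(rows) < 2:
        raise ValueError("need at least two items and two respondents")
    item_variance_sum = sum(variance(column) for column in zip(*rows))
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: five respondents, three items.
responses = [
    [3, 4, 3],
    [2, 2, 3],
    [4, 5, 4],
    [1, 2, 2],
    [5, 4, 5],
]

print(f"alpha = {cronbach_alpha(responses):.2f}")
```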

  10. Translation, Cross-cultural Adaptation and Psychometric Validation of the Korean-Language Cardiac Rehabilitation Barriers Scale (CRBS-K).

    PubMed

    Baek, Sora; Park, Hee-Won; Lee, Yookyung; Grace, Sherry L; Kim, Won-Seok

    2017-10-01

    To perform a translation and cross-cultural adaptation of the Cardiac Rehabilitation Barriers Scale (CRBS) for use in Korea, followed by psychometric validation. The CRBS was developed to assess patients' perception of the degree to which patient, provider and health system-level barriers affect their cardiac rehabilitation (CR) participation. The CRBS consists of 21 items (barriers to adherence) rated on a 5-point Likert scale. The first phase was to translate and cross-culturally adapt the CRBS to the Korean language. After back-translation, both versions were reviewed by a committee. The face validity was assessed in a sample of Korean patients (n=53) with history of acute myocardial infarction that did not participate in CR through semi-structured interviews. The second phase was to assess the construct and criterion validity of the Korean translation as well as internal reliability, through administration of the translated version in 104 patients, principal component analysis with varimax rotation and cross-referencing against CR use, respectively. The length, readability, and clarity of the questionnaire were rated well, demonstrating face validity. Analysis revealed a six-factor solution, demonstrating construct validity. Cronbach's alpha was greater than 0.65. Barriers rated highest included not knowing about CR and not being contacted by a program. The mean CRBS score was significantly higher among non-attendees (2.71±0.26) than CR attendees (2.51±0.18) (p<0.01). The Korean version of CRBS has demonstrated face, content and criterion validity, suggesting it may be useful for assessing barriers to CR utilization in Korea.

  11. 22 CFR 42.72 - Validity of visas.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... Relations DEPARTMENT OF STATE VISAS VISAS: DOCUMENTATION OF IMMIGRANTS UNDER THE IMMIGRATION AND NATIONALITY ACT, AS AMENDED Issuance of Immigrant Visas § 42.72 Validity of visas. (a) Period of validity. With the exception indicated herein, the period of validity of an immigrant visa shall not exceed six...

  12. Validation and Adjustment of the Leipzig-Halifax Acute Aortic Dissection Type A Scorecard.

    PubMed

    Mejàre-Berggren, Hanna; Olsson, Christian

    2017-11-01

    The novel Leipzig-Halifax (LH) scorecard for acute aortic dissection type A (AADA) stratifies risk of in-hospital death based on age, malperfusion syndromes, critical preoperative state, and coronary disease. The study aim was to externally validate the LH scorecard performance and, if adequate, propose adjustments. All consecutive AADA patients operated on from 1996 to 2016 (n = 509) were included to generate an external validation cohort. Variables related to in-hospital death were analyzed using univariable and multivariable analysis. The LH scorecard was applied to the validation cohort, compared with the original study, and variable selection was adjusted using validation measures for discrimination and calibration. In-hospital mortality rate was 17.7% (LH cohort 18.7%). Critical preoperative state and Penn class non-Aa were independent predictors (odds ratio [OR] 2.42 and 2.45, respectively) of in-hospital death. The LH scorecard was adjusted to include Penn class non-Aa, critical preoperative state, and coronary disease. Assessing discrimination, area under receiver operator characteristic curve for the LH scorecard was 0.61 versus 0.66 for the new scorecard (p = 0.086). In-hospital mortality rates in low-, medium-, and high-risk groups were 14%, 15%, and 48%, respectively (LH scorecard) versus 11%, 23%, and 43%, respectively (new scorecard), and goodness-of-fit p value was 0.01 versus 0.86, indicating better calibration by the new scorecard. A lower Akaike information criterion value, 464 versus 448, favored the new scorecard. Through adjustment of the LH scorecard after external validation, prognostic performance improved. Further validated, the LH scorecard could be a valuable risk prediction tool. Copyright © 2017 The Society of Thoracic Surgeons. Published by Elsevier Inc. All rights reserved.

  13. Validity and validation of expert (Q)SAR systems.

    PubMed

    Hulzebos, E; Sijm, D; Traas, T; Posthumus, R; Maslankiewicz, L

    2005-08-01

    At a recent workshop in Setubal (Portugal) principles were drafted to assess the suitability of (quantitative) structure-activity relationships ((Q)SARs) for assessing the hazards and risks of chemicals. In the present study we applied some of the Setubal principles to test the validity of three (Q)SAR expert systems and validate the results. These principles include a mechanistic basis, the availability of a training set and validation. ECOSAR, BIOWIN and DEREK for Windows have a mechanistic or empirical basis. ECOSAR has a training set for each QSAR. For half of the structural fragments the number of chemicals in the training set is >4. Based on structural fragments and log Kow, ECOSAR uses linear regression to predict ecotoxicity. Validating ECOSAR for three 'valid' classes results in predictivity of ≥ 64%. BIOWIN uses (non-)linear regressions to predict the probability of biodegradability based on fragments and molecular weight. It has a large training set and predicts non-ready biodegradability well. DEREK for Windows predictions are supported by a mechanistic rationale and literature references. The structural alerts in this program have been developed with a training set of positive and negative toxicity data. However, to support the prediction only a limited number of chemicals in the training set is presented to the user. DEREK for Windows predicts effects by 'if-then' reasoning. The program predicts best for mutagenicity and carcinogenicity. Each structural fragment in ECOSAR and DEREK for Windows needs to be evaluated and validated separately.

  14. Validity and reliability of three commonly used quality of life measures in a large European population of coronary heart disease patients.

    PubMed

    De Smedt, Delphine; Clays, Els; Doyle, Frank; Kotseva, Kornelia; Prugger, Christof; Pająk, Andrzej; Jennings, Catriona; Wood, David; De Bacquer, Dirk

    2013-09-01

    To investigate the validity and reliability of the EuroQol-5D (EQ-5D), the 12-item Short-Form Health Survey (SF-12v2), and the Hospital Anxiety and Depression Scale (HADS) in a stable coronary population. Cross-sectional study EUROASPIRE III. Quality of life data (QoL) were available on 8745 patients hospitalized for coronary artery bypass graft (CABG), percutaneous coronary intervention (PCI), acute myocardial infarction (AMI), or myocardial ischemia. They were interviewed and examined at least 6 months after their hospital admission. Reliability and validity of the 3 instruments were tested. Internal consistency, and discriminative, convergent, criterion and construct validity were assessed. Cronbach's alpha indicated good internal consistency for all measures (0.73 to 0.87). Discriminative validity analyses confirmed significant QoL differences between known groups: age, gender, educational level. In addition, all hypothesized correlations between QoL constructs (convergent validity) and items (criterion validity) were confirmed with significant correlations. Confirmatory factor analyses indicated good construct validity for HADS and SF-12v2. On country-specific level, results were roughly similar. The EQ-5D as well as the SF-12v2 and the HADS are reliable and valid instruments for use in a stable coronary population, both on aggregate European level and on country-specific level. However, our results must be generalized with caution, because EUROASPIRE III patients might not be representative for all patients with stable coronary heart disease. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  15. The 2002 NIMH Provisional Diagnostic Criteria for Depression of Alzheimer's Disease (PDC-dAD): Gauging their Validity over a Decade Later.

    PubMed

    Sepehry, Amir A; Lee, Philip E; Hsiung, Ging-Yuek R; Beattie, B Lynn; Feldman, Howard H; Jacova, Claudia

    2017-01-01

    Presented herein is evidence for criterion, content, and convergent/discriminant validity of the NIMH-Provisional Diagnostic Criteria for depression of Alzheimer's Disease (PDC-dAD) that were formulated to address depression in Alzheimer's disease (AD). Using meta-analytic and systematic review methods, we examined criterion validity evidence in epidemiological and clinical studies comparing the PDC-dAD to Diagnostic and Statistical Manual of Mental Disorders fourth edition (DSM-IV), and International Classification of Disease (ICD 9) depression diagnostic criteria. We estimated prevalence of depression by PDC, DSM, and ICD with an omnibus event rate effect-size. We also examined diagnostic agreement between PDC and DSM. To gauge content validity, we reviewed rates of symptom endorsement for each diagnostic approach. Finally, we examined the PDC's relationship with assessment scales (global cognition, neuropsychiatric, and depression definition) for convergent validity evidence. The aggregate evidence supports the validity of the PDC-dAD. Our findings suggest that depression in AD differs from other depressive disorders including Major Depressive Disorder (MDD) in that dAD is more prevalent, with generally a milder presentation and with unique features not captured by the DSM. Although the PDC are the current standard for diagnosis of depression in AD, we identified the need for their further optimization based on predictive validity evidence.

  16. Identifying dyspepsia in the Greek population: translation and validation of a questionnaire

    PubMed Central

    Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos

    2006-01-01

    Background: Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. Methods: The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Cronbach, the reproducibility (test – retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. Results: The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Cronbach's alpha was found to be 0.88 (95% CI: 0.81–0.93), suggesting that all items were appropriate to measure. Kappa coefficient for

  17. Identifying dyspepsia in the Greek population: translation and validation of a questionnaire.

    PubMed

    Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos

    2006-03-04

    Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Cronbach, the reproducibility (test - retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Cronbach's alpha was found to be 0.88 (95% CI: 0.81-0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test - retest
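
    The kappa coefficient used in the two dyspepsia records above, both for test-retest reproducibility and for agreement with the diagnosis in the patient record, can be sketched as below. This is the standard unweighted Cohen's kappa for two sets of labels; the function name and the data are hypothetical, and the study's own computation may differ in detail.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Unweighted Cohen's kappa for two equal-length sequences of labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("sequences must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: share of cases where the two sources match.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from the marginal frequency of each label.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(
        count_a[label] * count_b[label] for label in set(count_a) | set(count_b)
    ) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1 - expected)


# Hypothetical data: 1 = dyspepsia, 0 = no dyspepsia.
questionnaire = [1, 1, 0, 0, 1, 0, 1, 0, 0, 1]
patient_record = [1, 1, 0, 0, 1, 0, 0, 0, 1, 1]

print(f"kappa = {cohen_kappa(questionnaire, patient_record):.2f}")
```

    Kappa discounts the agreement expected by chance, which is why it is usually preferred over raw percent agreement for this kind of comparison.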

  18. [Validation and verification of microbiology methods].

    PubMed

    Camaró-Sala, María Luisa; Martínez-García, Rosana; Olmos-Martínez, Piedad; Catalá-Cuenca, Vicente; Ocete-Mochón, María Dolores; Gimeno-Cardona, Concepción

    2015-01-01

    Clinical microbiologists should ensure, to the maximum level allowed by the scientific and technical development, the reliability of the results. This implies that, in addition to meeting the technical criteria to ensure their validity, they must be performed with a number of conditions that allows comparable results to be obtained, regardless of the laboratory that performs the test. In this sense, the use of recognized and accepted reference methods is the most effective tool for these guarantees. The activities related to verification and validation of analytical methods have become very important, as there is continuous development, as well as updating techniques and increasingly complex analytical equipment, and an interest of professionals to ensure quality processes and results. The definitions of validation and verification are described, along with the different types of validation/verification, and the types of methods, and the level of validation necessary depending on the degree of standardization. The situations in which validation/verification is mandatory and/or recommended are discussed, including those particularly related to validation in Microbiology. It stresses the importance of promoting the use of reference strains as controls in Microbiology and the use of standard controls, as well as the importance of participation in External Quality Assessment programs to demonstrate technical competence. The emphasis is on how to calculate some of the parameters required for validation/verification, such as the accuracy and precision. The development of these concepts can be found in the microbiological process SEIMC number 48: «Validation and verification of microbiological methods» www.seimc.org/protocols/microbiology. Copyright © 2013 Elsevier España, S.L.U. y Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.

  19. Validity of the Aberrant Behavior Checklist in a Clinical Sample of Toddlers

    ERIC Educational Resources Information Center

    Karabekiroglu, Koray; Aman, Michael G.

    2009-01-01

    We investigated the congruent and criterion validity of the Aberrant Behavior Checklist (ABC) in a clinical sample of toddlers seen over 1 year in Turkey. All consecutive patients (N = 93), 14-43 months old (mean, 30.6 mos.), in a child psychiatry outpatient clinic were included. The ABC, Autism Behavior Checklist (AuBC), and Child Behavior…

  20. Homework Stress: Construct Validation of a Measure

    ERIC Educational Resources Information Center

    Katz, Idit; Buzukashvili, Tamara; Feingold, Liat

    2012-01-01

    This article presents 2 studies aimed at validating a measure of stress experienced by children and parents around the issue of homework, applying Benson's program of validation (Benson, 1998). Study 1 provides external validity of the measure by supporting hypothesized relations between stress around homework and students' and parents' positive…

  1. 45 CFR 162.1011 - Valid code sets.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 45 Public Welfare 1 2012-10-01 2012-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...

  2. 45 CFR 162.1011 - Valid code sets.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 45 Public Welfare 1 2013-10-01 2013-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...

  3. 45 CFR 162.1011 - Valid code sets.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 45 Public Welfare 1 2010-10-01 2010-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...

  4. 45 CFR 162.1011 - Valid code sets.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 45 Public Welfare 1 2014-10-01 2014-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare Department of Health and Human Services ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...

  5. 45 CFR 162.1011 - Valid code sets.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 45 Public Welfare 1 2011-10-01 2011-10-01 false Valid code sets. 162.1011 Section 162.1011 Public Welfare DEPARTMENT OF HEALTH AND HUMAN SERVICES ADMINISTRATIVE DATA STANDARDS AND RELATED REQUIREMENTS ADMINISTRATIVE REQUIREMENTS Code Sets § 162.1011 Valid code sets. Each code set is valid within the dates...

  6. BMI-based body size guides for women and men: development and validation of a novel pictorial method to assess weight-related concepts.

    PubMed

    Harris, C V; Bradlyn, A S; Coffman, J; Gunel, E; Cottrell, L

    2008-02-01

    To develop and evaluate two BMI-based instruments to determine perceptions of weight status, particularly perceptions of overweight and obesity, using pictorial images of women and men. Pictures of adults with known BMI values were used to construct gender-specific body size guides (BSGs) containing 10 bodies that ranged from underweight to class III obesity. Figures were standardized and a composite face was added to each. The BSGs were administered to 400 adults to assess the psychometric properties of the instruments and weight perceptions. High correlations between the BMIs of respondents and the BMIs of the current body selected by respondents provided strong support for the criterion-related validity of the BSGs, and the logical pattern of responses to items assessing perception of weight categories supported construct validity for the scales. Test-retest reliability, assessed by correlations for both current and ideal body, was also high, despite the lengthy 6-month testing interval. Respondents' perceptions of the bodies within specific weight categories indicated that a majority failed to recognize the overweight female as overweight and perceived the overweight male as normal weight. Obese bodies were generally unrecognized as such until the bodies reached the higher levels of obesity (that is, BMI values >39). Perception of weight was influenced by the respondents' weight status and gender. Psychometric analyses indicated the BSGs are valid and reliable instruments. These results, coupled with the face validity of the scales and the relationship between the bodies and BMI values, indicate the BSGs offer advantages over existing instruments for researchers of weight perception and body image. Administration of the scales to an adult sample confirmed that overweight and obesity are under-recognized. Increased efforts to improve public understanding of these terms are needed and the BSGs may provide useful tools for this purpose.

  7. Bayesian cross-validation for model evaluation and selection, with application to the North American Breeding Bird Survey

    USGS Publications Warehouse

    Link, William; Sauer, John R.

    2016-01-01

    The analysis of ecological data has changed in two important ways over the last 15 years. The development and easy availability of Bayesian computational methods has allowed and encouraged the fitting of complex hierarchical models. At the same time, there has been increasing emphasis on acknowledging and accounting for model uncertainty. Unfortunately, the ability to fit complex models has outstripped the development of tools for model selection and model evaluation: familiar model selection tools such as Akaike's information criterion and the deviance information criterion are widely known to be inadequate for hierarchical models. In addition, little attention has been paid to the evaluation of model adequacy in context of hierarchical modeling, i.e., to the evaluation of fit for a single model. In this paper, we describe Bayesian cross-validation, which provides tools for model selection and evaluation. We describe the Bayesian predictive information criterion and a Bayesian approximation to the BPIC known as the Watanabe-Akaike information criterion. We illustrate the use of these tools for model selection, and the use of Bayesian cross-validation as a tool for model evaluation, using three large data sets from the North American Breeding Bird Survey.
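
    As a rough illustration of the Watanabe-Akaike information criterion mentioned in the record above, the sketch below uses the commonly cited form WAIC = -2 * (lppd - p_waic), computed from pointwise log-likelihoods over posterior draws. The function name and input layout are assumptions, and the paper's exact formulation may differ.

```python
import math
from statistics import variance


def waic(log_lik):
    """WAIC on the deviance scale from pointwise log-likelihoods.

    Assumption: log_lik[s][i] is the log-likelihood of observation i
    under posterior draw s, with at least two draws.
    """
    n_draws = len(log_lik)
    if n_draws < 2:
        raise ValueError("need at least two posterior draws")
    lppd = 0.0    # log pointwise predictive density
    p_waic = 0.0  # effective number of parameters (variance form)
    for column in zip(*log_lik):
        # Log of the mean likelihood across draws, computed stably.
        peak = max(column)
        lppd += peak + math.log(sum(math.exp(v - peak) for v in column) / n_draws)
        p_waic += variance(column)
    return -2.0 * (lppd - p_waic)


# Hypothetical log-likelihoods: three posterior draws, four observations.
draws = [
    [-1.2, -0.8, -2.1, -1.5],
    [-1.0, -0.9, -2.4, -1.4],
    [-1.3, -0.7, -2.0, -1.6],
]

print(f"WAIC = {waic(draws):.3f}")
```

    When comparing candidate models fitted to the same data, the model with the lower WAIC is preferred under this convention.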

  8. The Internalized Stigma of Mental Illness (ISMI) scale: validation of the Japanese version.

    PubMed

    Tanabe, Yosuke; Hayashi, Kunihiko; Ideno, Yuki

    2016-04-29

    The present study investigated the reliability and validity of a Japanese version of the Internalized Stigma of Mental Illness (ISMI) scale, designed to assess internalized stigma experienced by people with mental illness. A survey was conducted with 173 outpatients with mental illness who attended psychiatric clinics on a regular basis. A retest was conducted with 51 participants to evaluate the scale's psychometric properties. The alpha coefficient for the overall internal consistency was 0.91, and the coefficients of the individual ISMI subscales ranged from 0.57 to 0.81. The test-retest reliability was r = 0.85 (n = 51, P < 0.01). In terms of criterion-related validity, the Japanese version of the ISMI scale was significantly correlated with the Beck Depression Inventory (r = 0.61, P < 0.01), the Rosenberg Self-Esteem Scale (r = -0.53, P < 0.01), and the Empowerment Scale (r = -0.52, P < 0.01). In addition, factor analyses of the ISMI items demonstrated a four-factor solution for the alienation, stereotype endorsement, discrimination experience, and social withdrawal subscales, with the stigma resistance items excluded. The Japanese version of the ISMI scale demonstrated similar reliability and validity to the original English version. Therefore, the Japanese version of the ISMI scale may be an effective and valid tool to measure internalized stigma among Japanese people who have a mental illness.

  9. A Controlled Evaluation of the Distress Criterion for Binge Eating Disorder

    ERIC Educational Resources Information Center

    Grilo, Carlos M.; White, Marney A.

    2011-01-01

    Objective: Research has examined various aspects of the validity of the research criteria for binge eating disorder (BED) but has yet to evaluate the utility of Criterion C, "marked distress about binge eating." This study examined the significance of the marked distress criterion for BED using 2 complementary comparison groups. Method:…

  10. A CFD validation roadmap for hypersonic flows

    NASA Technical Reports Server (NTRS)

    Marvin, Joseph G.

    1992-01-01

    A roadmap for computational fluid dynamics (CFD) code validation is developed. The elements of the roadmap are consistent with air-breathing vehicle design requirements and related to the important flow path components: forebody, inlet, combustor, and nozzle. Building block and benchmark validation experiments are identified along with their test conditions and measurements. Based on evaluation criteria, recommendations for an initial CFD validation data base are given and gaps identified where future experiments would provide the needed validation data.

  11. A CFD validation roadmap for hypersonic flows

    NASA Technical Reports Server (NTRS)

    Marvin, Joseph G.

    1993-01-01

    A roadmap for computational fluid dynamics (CFD) code validation is developed. The elements of the roadmap are consistent with air-breathing vehicle design requirements and related to the important flow path components: forebody, inlet, combustor, and nozzle. Building block and benchmark validation experiments are identified along with their test conditions and measurements. Based on evaluation criteria, recommendations for an initial CFD validation data base are given and gaps identified where future experiments would provide the needed validation data.

  12. Reliability and validity of the Children's Fear Survey Schedule-Dental Subscale for Arabic-speaking children: a cross-sectional study.

    PubMed

    El-Housseiny, Azza A; Alsadat, Farah A; Alamoudi, Najlaa M; El Derwi, Douaa A; Farsi, Najat M; Attar, Moaz H; Andijani, Basil M

    2016-04-14

    Early recognition of dental fear is essential for the effective delivery of dental care. This study aimed to test the reliability and validity of the Arabic version of the Children's Fear Survey Schedule-Dental Subscale (CFSS-DS). A school-based sample of 1546 children was randomly recruited. The Arabic version of the CFSS-DS was completed by children during class time. The scale was tested for internal consistency and test-retest reliability. To test criterion validity, children's behavior was assessed using the Frankl scale during dental examination, and results were compared with children's CFSS-DS scores. To test the scale's construct validity, scores on "fear of going to the dentist soon" were correlated with CFSS-DS scores. Factor analysis was also used. The Arabic version of the CFSS-DS showed high reliability regarding both test-retest reliability (intraclass correlation = 0.83, p < 0.001) and internal consistency (Cronbach's α = 0.88). It showed good criterion validity: children with negative behavior had significantly higher fear scores (t = 13.67, p < 0.001). It also showed moderate construct validity (Spearman's rho correlation, r = 0.53, p < 0.001). Factor analysis identified the following factors: "fear of invasive dental procedures," "fear of less invasive dental procedures" and "fear of strangers." The Arabic version of the CFSS-DS is a reliable and valid measure of dental fear in Arabic-speaking children. Pediatric dentists and researchers may use this validated version of the CFSS-DS to measure dental fear in Arabic-speaking children.

  13. Reynolds Adolescent Depression Scale - Second Edition: initial validation of the Korean version.

    PubMed

    Hyun, Myung-Sun; Nam, Kyoung-A; Kang, Hee Sun; Reynolds, William M

    2009-03-01

    This paper is a report of a study conducted to test the validity and reliability of the Reynolds Adolescent Depression Scale - Second Edition in Korean culture. Depression is a significant mental health problem in adolescents. The Reynolds Adolescent Depression Scale - Second Edition has been shown to be a useful tool to assess depression in adolescents, with extensive research on this measure having been conducted in western cultures. Measures developed in western cultures need to be tested and validated before being used in Asian cultures. The participants were a convenience sample of 440 Korean adolescents with a mean age of 13.78 years (sd = 0.95) from grades 7 to 9 in three public middle schools in South Korea. A cross-sectional design was used. Back-translation was used to create the Korean version, with additional testing for cultural meaning and comprehension. The data were collected at the end of 2004. Internal consistency reliability for the Korean version of the Reynolds Adolescent Depression Scale - Second Edition was 0.89, with subscale reliability ranging from 0.66 to 0.81. Evidence for criterion-related, convergent and discriminant validity for the Korean version of the Reynolds Adolescent Depression Scale - Second Edition was found. Confirmatory factor analysis supported the 4-factor structure of Reynolds Adolescent Depression Scale - Second Edition. Our results support the validity and reliability for the Korean version of the Reynolds Adolescent Depression Scale - Second Edition as a measure of depression and suggest that it can be used to screen students and to evaluate the effectiveness of preventive interventions in school settings.

  14. Measuring assessment standards in undergraduate medical programs: Development and validation of AIM tool.

    PubMed

    Sajjad, Madiha; Khan, Rehan Ahmed; Yasmeen, Rahila

    2018-01-01

    To develop a tool to evaluate faculty perceptions of assessment quality in an undergraduate medical program. The Assessment Implementation Measure (AIM) tool was developed by a mixed method approach. A preliminary questionnaire developed through literature review was submitted to a panel of 10 medical education experts for a three-round 'Modified Delphi technique'. Panel agreement of > 75% was considered the criterion for inclusion of items in the questionnaire. Cognitive pre-testing of five faculty members was conducted. A pilot study was done with 30 randomly selected faculty members. Content validity index (CVI) was calculated for individual items (I-CVI) and composite scale (S-CVI). Cronbach's alpha was calculated to determine the internal consistency reliability of the tool. The final AIM tool had 30 items after the Delphi process. S-CVI was 0.98 with the S-CVI/Avg method and 0.86 by S-CVI/UA method, suggesting good content validity. An I-CVI cut-off value of < 0.9 was taken as the criterion for item deletion. Cognitive pre-testing revealed good item interpretation. Cronbach's alpha calculated for the AIM was 0.9, whereas Cronbach's alpha for the four domains ranged from 0.67 to 0.80. 'AIM' is a relevant and useful instrument with good content validity and reliability of results, and may be used to evaluate the teachers' perceptions about assessment quality.
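
    The I-CVI, S-CVI/Avg and S-CVI/UA values reported above follow commonly used definitions of the content validity index. The sketch below illustrates those definitions under stated assumptions: experts rate each item on a 4-point relevance scale and ratings of 3 or 4 count as "relevant" (the abstract does not state the scale used). The function names and the toy ratings are hypothetical and do not reproduce the study's data.

```python
def item_cvi(ratings, relevant_min=3):
    """I-CVI: proportion of experts rating the item as relevant.

    Assumes a 4-point scale where ratings >= relevant_min count as relevant.
    """
    return sum(r >= relevant_min for r in ratings) / len(ratings)


def scale_cvi(rating_matrix, relevant_min=3):
    """Return (I-CVIs, S-CVI/Avg, S-CVI/UA) for an items-by-experts matrix.

    S-CVI/Avg is the mean of the I-CVIs; S-CVI/UA is the proportion of
    items that every expert rated as relevant (universal agreement).
    """
    i_cvis = [item_cvi(ratings, relevant_min) for ratings in rating_matrix]
    s_cvi_avg = sum(i_cvis) / len(i_cvis)
    s_cvi_ua = sum(v == 1.0 for v in i_cvis) / len(i_cvis)
    return i_cvis, s_cvi_avg, s_cvi_ua


# Toy ratings: three items, each rated by four experts (illustrative only).
toy_ratings = [
    [4, 4, 3, 4],
    [4, 3, 2, 4],
    [3, 4, 4, 4],
]
i_cvis, s_cvi_avg, s_cvi_ua = scale_cvi(toy_ratings)
```

    Because S-CVI/UA requires unanimous agreement, it is never higher than S-CVI/Avg, which is consistent with the ordering of the two values in the abstract (0.86 versus 0.98).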

  15. Validity Evidence in Scale Development: The Application of Cross Validation and Classification-Sequencing Validation

    ERIC Educational Resources Information Center

    Acar, Tülin

    2014-01-01

    In literature, it has been observed that many enhanced criteria are limited by factor analysis techniques. Besides examinations of statistical structure and/or psychological structure, such validity studies as cross validation and classification-sequencing studies should be performed frequently. The purpose of this study is to examine cross…

  16. Reproducibility and validity of the Shanghai Men's Health Study physical activity questionnaire.

    PubMed

    Jurj, Adriana L; Wen, Wanqing; Xiang, Yong-Bing; Matthews, Charles E; Liu, Dake; Zheng, Wei; Shu, Xiao-Ou

    2007-05-15

    Reproducibility and validity of the physical activity questionnaire (PAQ) used in the Shanghai Men's Health Study (2003-2006, People's Republic of China) were evaluated in a random sample of 196 participants aged 40-74 years. Participants completed a PAQ at baseline and again 1 year later, 12 monthly 7-day physical activity recalls, and four quarterly 1-week physical activity logs. Reproducibility was evaluated by using the two PAQs and validity by comparing the PAQs with 1-year averages of the two criterion measures: 7-day physical activity recall and physical activity log. The PAQ had moderate to high reproducibility for measuring adult exercise participation (kappa = 0.60) and energy expenditure (r(s) = 0.68), nonexercise activities (correlation coefficients = 0.42-0.68), and total daily energy expenditure (r(s) = 0.68, kappa(quartiles) = 0.47). Correlations between the PAQ and criterion measures of adult exercise were 0.45 (7-day physical activity recall) and 0.51 (physical activity log) for the first PAQ and 0.62 (7-day physical activity recall) and 0.71 (physical activity log) for the second PAQ. Correlations between PAQ nonexercise activities and the physical activity log and 7-day physical activity recall were 0.31-0.86. Correlations for total energy expenditure were high (0.62-0.77). Results indicate that the Shanghai Men's Health Study PAQ has reasonable reproducibility and validity for classifying men by their level of exercise and nonexercise activities in this cohort.
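
    The r(s) values above are Spearman rank correlations between paired measurements, for example the first and second PAQ. As an illustrative sketch only, assuming paired numeric scores with ties given average ranks, the coefficient can be computed as the Pearson correlation of the ranks; the function names and the toy values are hypothetical and are not the study's data.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy paired scores from two administrations (illustrative values only).
first_admin = [1, 2, 3, 4, 5]
second_admin = [2, 1, 4, 3, 5]
rho = spearman(first_admin, second_admin)
```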

  17. An entropy-based nonparametric test for the validation of surrogate endpoints.

    PubMed

    Miao, Xiaopeng; Wang, Yong-Cheng; Gangopadhyay, Ashis

    2012-06-30

    We present a nonparametric test to validate surrogate endpoints based on measure of divergence and random permutation. This test is a proposal to directly verify the Prentice statistical definition of surrogacy. The test does not impose distributional assumptions on the endpoints, and it is robust to model misspecification. Our simulation study shows that the proposed nonparametric test outperforms the practical test of the Prentice criterion in terms of both robustness of size and power. We also evaluate the performance of three leading methods that attempt to quantify the effect of surrogate endpoints. The proposed method is applied to validate magnetic resonance imaging lesions as the surrogate endpoint for clinical relapses in a multiple sclerosis trial. Copyright © 2012 John Wiley & Sons, Ltd.

  18. Extending Structural Analyses of the Rosenberg Self-Esteem Scale to Consider Criterion-Related Validity: Can Composite Self-Esteem Scores Be Good Enough?

    PubMed

    Donnellan, M Brent; Ackerman, Robert A; Brecheen, Courtney

    2016-01-01

    Although the Rosenberg Self-Esteem Scale (RSES) is the most widely used measure of global self-esteem in the literature, there are ongoing disagreements about its factor structure. This methodological debate informs how the measure should be used in substantive research. Using a sample of 1,127 college students, we test the overall fit of previously specified models for the RSES, including a newly proposed bifactor solution (McKay, Boduszek, & Harvey, 2014). We extend previous work by evaluating how various latent factors from these structural models are related to a set of criterion variables frequently studied in the self-esteem literature. A strict unidimensional model poorly fit the data, whereas models that accounted for correlations between negatively and positively keyed items tended to fit better. However, global factors from viable structural models had similar levels of association with criterion variables and with the pattern of results obtained with a composite global self-esteem variable calculated from observed scores. Thus, we did not find compelling evidence that different structural models had substantive implications, thereby reducing (but not eliminating) concerns about the integrity of the self-esteem literature based on overall composite scores for the RSES.

  19. Validation of the Persian version of the Daily Spiritual Experiences Scale (DSES) in Pregnant Women: A Proper Tool to Assess Spirituality Related to Mental Health.

    PubMed

    Saffari, Mohsen; Amini, Hossein; Sheykh-Oliya, Zarindokht; Pakpour, Amir H; Koenig, Harold G

    2017-12-01

    Assessing spirituality in healthy pregnant women may lead to supportive interventions that will improve their care. A psychometrically valid measure such as the Daily Spiritual Experiences Scale (DSES) may be helpful in this regard. The current study sought to adapt a Persian version of DSES for use in pregnancy. A total of 377 pregnant women were recruited from three general hospitals located in Tehran, Iran. Administered scales were the DSES, Duke University Religion Index, Santa Clara Strength of Religious Faith scale, and Depression Anxiety Stress Scale, as well as demographic measures. Reliability of the DSES was tested using Cronbach's alpha for internal consistency and the intraclass correlation coefficient (ICC) for test-retest stability. Scale validity was assessed by criterion-related tests, known-groups comparison, and exploratory factor analysis. Participants' mean age was 27.7 (4.1), and most were nulliparous (70%). The correlation coefficient between individual items on the scale and the total score was greater than 0.30 in most cases. Cronbach's alpha for the scale was 0.90. The ICC for 2-week test-retest reliability was high (0.86). Relationships between similar and dissimilar scales indicated acceptable convergent and divergent validity. The factor structure of the scale indicated a single factor that explained 59% of the variance. The DSES was found to be a reliable and valid measure of spirituality in pregnant Iranian women. This scale may be used to examine the relationship between spirituality and health outcomes, research that may lead to supportive interventions in this population.

  20. Reliability and validity: Part II.

    PubMed

    Davis, Debora Winders

    2004-01-01

    Determining measurement reliability and validity involves complex processes. There is usually room for argument about most instruments. It is important that the researcher clearly describes the processes upon which she made the decision to use a particular instrument, and presents the evidence available showing that the instrument is reliable and valid for the current purposes. In some cases, the researcher may need to conduct pilot studies to obtain evidence upon which to decide whether the instrument is valid for a new population or a different setting. In all cases, the researcher must present a clear and complete explanation for the choices she has made regarding reliability and validity. The consumer must then judge the degree to which the researcher has provided adequate and theoretically sound rationale. Although I have tried to touch on most of the important concepts related to measurement reliability and validity, it is beyond the scope of this column to be exhaustive. There are textbooks devoted entirely to specific measurement issues if readers require more in-depth knowledge.