good criterion validity: Topics by Science.gov

Sample records for good criterion validity

Discriminative and Criterion Validity of the Autism Spectrum Identity Scale (ASIS)

ERIC Educational Resources Information Center

McDonald, T. A. M.

2017-01-01

Individuals on the autism spectrum face stigma that can influence identity development. Previous research on the 22-item Autism Spectrum Identity Scale (ASIS) reported a four-factor structure with strong split-sample cross-validation and good internal consistency. This study reports the discriminative and criterion validity of the ASIS with other…
Concurrent criterion validity of the safe driving behavior measure: a predictor of on-road driving outcomes.

PubMed

Classen, Sherrilene; Wang, Yanning; Winter, Sandra M; Velozo, Craig A; Lanford, Desiree N; Bédard, Michel

2013-01-01

We determined the concurrent criterion validity of the Safe Driving Behavior Measure (SDBM) for on-road outcomes (passing or failing the on-road test as determined by a certified driving rehabilitation specialist) among older drivers and their family members-caregivers. On the basis of ratings from 168 older drivers and 168 family members-caregivers, we calculated receiver operating characteristic curves. The drivers' area under the curve (AUC) was .620 (95% confidence interval [CI] = .514-.725, p = .043). The family members-caregivers' AUC was .726 (95% CI = .622-.829, p ≤ .01). Older drivers' ratings showed statistically significant yet poor concurrent criterion validity, but family members-caregivers' ratings showed good concurrent criterion validity for the criterion on-road driving test. Continuing research with a more representative sample is being pursued to confirm the SDBM's concurrent criterion validity. This screening tool may be useful for generalist practitioners to use in making decisions regarding driving. Copyright © 2013 by the American Occupational Therapy Association, Inc.
Concurrent Criterion Validity of the Safe Driving Behavior Measure: A Predictor of On-Road Driving Outcomes

PubMed Central

Wang, Yanning; Winter, Sandra M.; Velozo, Craig A.; Lanford, Desiree N.; Bédard, Michel

2013-01-01

We determined the concurrent criterion validity of the Safe Driving Behavior Measure (SDBM) for on-road outcomes (passing or failing the on-road test as determined by a certified driving rehabilitation specialist) among older drivers and their family members–caregivers. On the basis of ratings from 168 older drivers and 168 family members–caregivers, we calculated receiver operating characteristic curves. The drivers’ area under the curve (AUC) was .620 (95% confidence interval [CI] = .514–.725, p = .043). The family members–caregivers’ AUC was .726 (95% CI = .622–.829, p ≤ .01). Older drivers’ ratings showed statistically significant yet poor concurrent criterion validity, but family members–caregivers’ ratings showed good concurrent criterion validity for the criterion on-road driving test. Continuing research with a more representative sample is being pursued to confirm the SDBM’s concurrent criterion validity. This screening tool may be useful for generalist practitioners to use in making decisions regarding driving. PMID:23245789
The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

PubMed

Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

2017-10-23

Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (p< 0.05), suggesting a good relationship between the two core stability measures. Test-retest reliability was (ICC3,3) = 0.953 (p< 0.05), indicating excellent consistency between the repeated DNS-HS measurements. Criterion validity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
Reliability and criterion validity of two applications of the iPhone™ to measure cervical range of motion in healthy participants

PubMed Central

2013-01-01

Summary of background data Recent smartphones, such as the iPhone, are often equipped with an accelerometer and magnetometer, which, through software applications, can perform various inclinometric functions. Although these applications are intended for recreational use, they have the potential to measure and quantify range of motion. The purpose of this study was to estimate the intra and inter-rater reliability as well as the criterion validity of the clinometer and compass applications of the iPhone in the assessment cervical range of motion in healthy participants. Methods The sample consisted of 28 healthy participants. Two examiners measured cervical range of motion of each participant twice using the iPhone (for the estimation of intra and inter-reliability) and once with the CROM (for the estimation of criterion validity). Estimates of reliability and validity were then established using the intraclass correlation coefficient (ICC). Results We observed a moderate intra-rater reliability for each movement (ICC = 0.65-0.85) but a poor inter-rater reliability (ICC < 0.60). For the criterion validity, the ICCs are moderate (>0.50) to good (>0.65) for movements of flexion, extension, lateral flexions and right rotation, but poor (<0.50) for the movement left rotation. Conclusion We found good intra-rater reliability and lower inter-rater reliability. When compared to the gold standard, these applications showed moderate to good validity. However, before using the iPhone as an outcome measure in clinical settings, studies should be done on patients presenting with cervical problems. PMID:23829201
Validity and Reliability of the Upper Extremity Work Demands Scale.

PubMed

Jacobs, Nora W; Berduszek, Redmar J; Dijkstra, Pieter U; van der Sluis, Corry K

2017-12-01

Purpose To evaluate validity and reliability of the upper extremity work demands (UEWD) scale. Methods Participants from different levels of physical work demands, based on the Dictionary of Occupational Titles categories, were included. A historical database of 74 workers was added for factor analysis. Criterion validity was evaluated by comparing observed and self-reported UEWD scores. To assess structural validity, a factor analysis was executed. For reliability, the difference between two self-reported UEWD scores, the smallest detectable change (SDC), test-retest reliability and internal consistency were determined. Results Fifty-four participants were observed at work and 51 of them filled in the UEWD twice with a mean interval of 16.6 days (SD 3.3, range = 10-25 days). Criterion validity of the UEWD scale was moderate (r = .44, p = .001). Factor analysis revealed that 'force and posture' and 'repetition' subscales could be distinguished with Cronbach's alpha of .79 and .84, respectively. Reliability was good; there was no significant difference between repeated measurements. An SDC of 5.0 was found. Test-retest reliability was good (intraclass correlation coefficient for agreement = .84) and all item-total correlations were >.30. There were two pairs of highly related items. Conclusion Reliability of the UEWD scale was good, but criterion validity was moderate. Based on current results, a modified UEWD scale (2 items removed, 1 item reworded, divided into 2 subscales) was proposed. Since observation appeared to be an inappropriate gold standard, we advise to investigate other types of validity, such as construct validity, in further research.
Assessment of the Validity of the Research Diagnostic Criteria for Temporomandibular Disorders: Overview and Methodology

PubMed Central

Schiffman, Eric L.; Truelove, Edmond L.; Ohrbach, Richard; Anderson, Gary C.; John, Mike T.; List, Thomas; Look, John O.

2011-01-01

AIMS The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. An overview is presented, including Axis I and II methodology and descriptive statistics for the study participant sample. This paper details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. Validity testing for the Axis II biobehavioral instruments was based on previously validated reference standards. METHODS The Axis I reference standards were based on the consensus of 2 criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion exam reliability was also assessed within study sites. RESULTS Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas ≥ 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion exam agreement with reference standards was excellent (k ≥ 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). CONCLUSION The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods. PMID:20213028
The Research Diagnostic Criteria for Temporomandibular Disorders. I: overview and methodology for assessment of validity.

PubMed

Schiffman, Eric L; Truelove, Edmond L; Ohrbach, Richard; Anderson, Gary C; John, Mike T; List, Thomas; Look, John O

2010-01-01

The purpose of the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Validation Project was to assess the diagnostic validity of this examination protocol. The aim of this article is to provide an overview of the project's methodology, descriptive statistics, and data for the study participant sample. This article also details the development of reliable methods to establish the reference standards for assessing criterion validity of the Axis I RDC/TMD diagnoses. The Axis I reference standards were based on the consensus of two criterion examiners independently performing a comprehensive history, clinical examination, and evaluation of imaging. Intersite reliability was assessed annually for criterion examiners and radiologists. Criterion examination reliability was also assessed within study sites. Study participant demographics were comparable to those of participants in previous studies using the RDC/TMD. Diagnostic agreement of the criterion examiners with each other and with the consensus-based reference standards was excellent with all kappas > or = 0.81, except for osteoarthrosis (moderate agreement, k = 0.53). Intrasite criterion examiner agreement with reference standards was excellent (k > or = 0.95). Intersite reliability of the radiologists for detecting computed tomography-disclosed osteoarthrosis and magnetic resonance imaging-disclosed disc displacement was good to excellent (k = 0.71 and 0.84, respectively). The Validation Project study population was appropriate for assessing the reliability and validity of the RDC/TMD Axis I and II. The reference standards used to assess the validity of Axis I TMD were based on reliable and clinically credible methods.
Criterion and content validity of a novel structured haggling contingent valuation question format versus the bidding game and binary with follow-up format.

PubMed

Onwujekwe, Obinna

2004-02-01

Contingent valuation question formats that will be used to elicit willingness to pay for goods and services need to be relevant to the area they will be used in order for responses to be valid. A novel contingent valuation question format called the "structured haggling technique" (SH) that resembles the bargaining system in Nigerian markets was designed and its criterion and content validity compared with those of the bidding game (BG) and binary-with-follow-up (BWFU) technique. This was achieved by determining the willingness to pay (WTP) for insecticide-treated nets (ITNs) in Southeast Nigeria. Content validity was determined through observation of actual trading of untreated nets together with interviews with sellers and consumers. Criterion validity was determined by comparing stated and actual WTP. Stated WTP was determined using a questionnaire administered to 810 household heads and actual WTP was determined by offering the nets for sale to all respondents one month later. The phi (correlation) coefficient was used to compare criterion validity across question formats. The phi coefficients were SH (0.60: 95% C.I. 0.50-0.71), BG (0.42: 95% C.I. 0.29-0.54) and the BWFU (0.32: 95% C.I. 0.20-0.44), implying that the BG and SH had similar levels of criterion-validity while the BWFU was the least criterion-valid. However, the SH was the most content-valid. It is necessary to validate the findings in other areas where haggling is common. Future studies should establish the content validity of question formats in the contexts in which they will be used before administering questionnaires.
Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka.

PubMed

de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon

2016-01-01

Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach's α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. © 2015 APJPH.
Development and Validation of a Measure of Quality of Life for the Young Elderly in Sri Lanka

PubMed Central

de Silva, Sudirikku Hennadige Padmal; Jayasuriya, Anura Rohan; Rajapaksa, Lalini Chandika; de Silva, Ambepitiyawaduge Pubudu; Barraclough, Simon

2016-01-01

Sri Lanka has one of the fastest aging populations in the world. Measurement of quality of life (QoL) in the elderly needs instruments developed that encompass the sociocultural settings. An instrument was developed to measure QoL in the young elderly in Sri Lanka (QLI-YES), using accepted methods to generate and reduce items. The measure was validated using a community sample. Construct, criterion and predictive validity and reliability were tested. A first-order model of 24 items with 6 domains was found to have good fit indices (CMIN/df = 1.567, RMR = 0.05, CFI = 0.95, and RMSEA = 0.053). Both criterion and predictive validity were demonstrated. Good internal consistency reliability (Cronbach’s α = 0.93) was shown. The development of the QLI-YES using a societal perspective relevant to the social and cultural beliefs has resulted in a robust and valid instrument to measure QoL for the young elderly in Sri Lanka. PMID:26712893
Psychometric evaluation of the Swedish version of Rosenberg's self-esteem scale.

PubMed

Eklund, Mona; Bäckström, Martin; Hansson, Lars

2018-04-01

The widely used Rosenberg's self-esteem scale (RSES) has not been evaluated for psychometric properties in Sweden. This study aimed at analyzing its factor structure, internal consistency, criterion, convergent and discriminant validity, sensitivity to change, and whether a four-graded Likert-type response scale increased its reliability and validity compared to a yes/no response scale. People with mental illness participating in intervention studies to (1) promote everyday life balance (N = 223) or (2) remedy self-stigma (N = 103) were included. Both samples completed the RSES and questionnaires addressing quality of life and sociodemographic data. Sample 1 also completed instruments chosen to assess convergent and discriminant validity: self-mastery (convergent validity), level of functioning and occupational engagement (discriminant validity). Confirmatory factor analysis (CFA), structural equation modeling, and conventional inferential statistics were used. Based on both samples, the Swedish RSES formed one factor and exhibited high internal consistency (>0.90). The two response scales were equivalent. Criterion validity in relation to quality of life was demonstrated. RSES could distinguish between women and men (women scoring lower) and between diagnostic groups (people with depression scoring lower). Correlations >0.5 with variables chosen to reflect convergent validity and around 0.2 with variables used to address discriminant validity further highlighted the construct validity of RSES. The instrument also showed sensitivity to change. The Swedish RSES exhibited a one-component factor structure and showed good psychometric properties in terms of good internal consistency, criterion, convergent and discriminant validity, and sensitivity to change. The yes/no and the four-graded Likert-type response scales worked equivalently.
[Evaluation of Suicide Risk Levels in Hospitals: Validity and Reliability Tests].

PubMed

Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen

2018-05-01

Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m = .54, p < .01). Within the scope of construct validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Statistical Validation of Surrogate Endpoints: Another Look at the Prentice Criterion and Other Criteria.

PubMed

Saraf, Sanatan; Mathew, Thomas; Roy, Anindya

2015-01-01

For the statistical validation of surrogate endpoints, an alternative formulation is proposed for testing Prentice's fourth criterion, under a bivariate normal model. In such a setup, the criterion involves inference concerning an appropriate regression parameter, and the criterion holds if the regression parameter is zero. Testing such a null hypothesis has been criticized in the literature since it can only be used to reject a poor surrogate, and not to validate a good surrogate. In order to circumvent this, an equivalence hypothesis is formulated for the regression parameter, namely the hypothesis that the parameter is equivalent to zero. Such an equivalence hypothesis is formulated as an alternative hypothesis, so that the surrogate endpoint is statistically validated when the null hypothesis is rejected. Confidence intervals for the regression parameter and tests for the equivalence hypothesis are proposed using bootstrap methods and small sample asymptotics, and their performances are numerically evaluated and recommendations are made. The choice of the equivalence margin is a regulatory issue that needs to be addressed. The proposed equivalence testing formulation is also adopted for other parameters that have been proposed in the literature on surrogate endpoint validation, namely, the relative effect and proportion explained.
The development and validity of the Salford Gait Tool: an observation-based clinical gait assessment tool.

PubMed

Toro, Brigitte; Nester, Christopher J; Farren, Pauline C

2007-03-01

To develop the construct, content, and criterion validity of the Salford Gait Tool (SF-GT) and to evaluate agreement between gait observations using the SF-GT and kinematic gait data. Tool development and comparative evaluation. University in the United Kingdom. For designing construct and content validity, convenience samples of 10 children with hemiplegic, diplegic, and quadriplegic cerebral palsy (CP) and 152 physical therapy students and 4 physical therapists were recruited. For developing criterion validity, kinematic gait data of 13 gait clusters containing 56 children with hemiplegic, diplegic, and quadriplegic CP and 11 neurologically intact children was used. For clinical evaluation, a convenience sample of 23 pediatric physical therapists participated. We developed a sagittal plane observational gait assessment tool through a series of design, test, and redesign iterations. The tool's grading system was calibrated using kinematic gait data of 13 gait clusters and was evaluated by comparing the agreement of gait observations using the SF-GT with kinematic gait data. Criterion standard kinematic gait data. There was 58% mean agreement based on grading categories and 80% mean agreement based on degree estimations evaluated with the least significant difference method. The new SF-GT has good concurrent criterion validity.
The psychometric properties of the Portuguese version of the Personality Inventory for DSM-5.

PubMed

Pires, Rute; Sousa Ferreira, Ana; Guedes, David

2017-10-01

The DSM-5 Section III proposes a hybrid dimensional-categorical model of conceptualizing personality and its disorders that includes assessment of impairments in personality functioning (criterion A) and maladaptive personality traits (criterion B). The Personality Inventory for the DSM-5 is a new dimensional tool, composed of 220 items organized into 25 facets that delineate five higher order domains of clinically relevant personality differences, and was developed to operationalize the DSM-5 model of pathological personality traits. The current studies address the internal consistency (study 1), the test-retest reliability (study 2) and the criterion validity (studies 3 and 4) of the Portuguese version of the PID-5 in samples of native speaking psychology students. Results indicated good internal consistency reliabilities and good temporal stability reliabilities for the majority of the PID-5 traits. The correlational pattern of the PID-5 traits with two measures of personality was in accordance with theoretical expectations and showed its concurrent validity. © 2017 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
[Development and Validation of the Academic Resilience Inventory for Nursing Students in Taiwan].

PubMed

Li, Cheng-Chieh; Wei, Chi-Fang; Tung, Yuk-Ying

2017-10-01

Failure to cope with learning pressures has been shown to influence the learning achievement and professional performance of nursing students. In order to enable nursing students to adapt successfully to their academic stress, it is essential to explore their academic resilience in the process of learning. To develop the Academic Resilience Inventory for Nursing Students (ARINS) and to test its reliability and validity. A total of 611 nursing students in central and southern Taiwan were recruited as participants. We divided the sample into two subsamples randomly using R software. The first sample was used to conduct item analysis and exploratory factor analysis. The other sample was used to conduct confirmatory factor analysis, cross validation, and criterion-related validity. There are 15 items in the ARINS, with cognitive maturity, emotional regulation, and help-seeking behavior used as the measurement indicators of academic resilience in nursing students. The assessed goodness-of-fit index indicates that the model fit the data well based upon the CFA and has good convergent validity and discriminant validity. Criterion-related validity was supported by the correlation among ARINS, learning performance and attitude, hope and optimistic, and depression. The ARINS has good reliability and validation and is a suitable measure of academic resilience in nursing students. It is helpful for nursing students to examine their academic stress and coping efficacy in the learning process.
Introducing the Professionalism Mini-Evaluation Exercise (P-MEX) in Japan: results from a multicenter, cross-sectional study.

PubMed

Tsugawa, Yusuke; Ohbu, Sadayoshi; Cruess, Richard; Cruess, Sylvia; Okubo, Tomoya; Takahashi, Osamu; Tokuda, Yasuharu; Heist, Brian S; Bito, Seiji; Itoh, Toshiyuki; Aoki, Akiko; Chiba, Tsutomu; Fukui, Tsuguya

2011-08-01

Despite the growing importance of and interest in medical professionalism, there is no standardized tool for its measurement. The authors sought to verify the validity, reliability, and generalizability of the Professionalism Mini-Evaluation Exercise (P-MEX), a previously developed and tested tool, in the context of Japanese hospitals. A multicenter, cross-sectional evaluation study was performed to investigate the validity, reliability, and generalizability of the P-MEX in seven Japanese hospitals. In 2009-2010, 378 evaluators (attending physicians, nurses, peers, and junior residents) completed 360-degree assessments of 165 residents and fellows using the P-MEX. The content validity and criterion-related validity were examined, and the construct validity of the P-MEX was investigated by performing confirmatory factor analysis through a structural equation model. The reliability was tested using generalizability analysis. The contents of the P-MEX achieved good acceptance in a preliminary working group, and the poststudy survey revealed that 302 (79.9%) evaluators rated the P-MEX items as appropriate, indicating good content validity. The correlation coefficient between P-MEX scores and external criteria was 0.78 (P < .001), demonstrating good criterion-related validity. Confirmatory factor analysis verified high path coefficient (0.60-0.99) and adequate goodness of fit of the model. The generalizability analysis yielded a high dependability coefficient, suggesting good reliability, except when evaluators were peers or junior residents. Findings show evidence of adequate validity, reliability, and generalizability of the P-MEX in Japanese hospital settings. The P-MEX is the only evaluation tool for medical professionalism verified in both a Western and East Asian cultural context.
Validity and extension of the SCS-CN method for computing infiltration and rainfall-excess rates

NASA Astrophysics Data System (ADS)

Mishra, Surendra Kumar; Singh, Vijay P.

2004-12-01

A criterion is developed for determining the validity of the Soil Conservation Service curve number (SCS-CN) method. According to this criterion, the existing SCS-CN method is found to be applicable when the potential maximum retention, S, is less than or equal to twice the total rainfall amount. The criterion is tested using published data of two watersheds. Separating the steady infiltration from capillary infiltration, the method is extended for predicting infiltration and rainfall-excess rates. The extended SCS-CN method is tested using 55 sets of laboratory infiltration data on soils varying from Plainfield sand to Yolo light clay, and the computed and observed infiltration and rainfall-excess rates are found to be in good agreement.
Assessing traumatic event exposure: general issues and preliminary findings for the Stressful Life Events Screening Questionnaire.

PubMed

Goodman, L A; Corcoran, C; Turner, K; Yuan, N; Green, B L

1998-07-01

This article reviews the psychometric properties of the Stressful Life Events Screening Questionnaire (SLESQ), a recently developed trauma history screening measure, and discusses the complexities involved in assessing trauma exposure. There are relatively few general measures of exposure to a variety of types of traumatic events, and most of those that exist have not been subjected to rigorous psychometric evaluation. The SLESQ showed good test-retest reliability, with a median kappa of .73, adequate convergent validity (with a lengthier interview) with a median kappa of .64, and good discrimination between Criterion A and non-Criterion A events. The discussion addresses some of the challenges of assessing traumatic event exposure along the dimensions of defining traumatic events, assessment methodologies, reporting consistency, and incident validation.

Assessment of a condition-specific quality-of-life measure for patients with developmentally absent teeth: validity and reliability testing.

PubMed

Akram, A J; Ireland, A J; Postlethwaite, K C; Sandy, J R; Jerreat, A S

2013-11-01

This article describes the process of validity and reliability testing of a condition-specific quality-of-life measure for patients with hypodontia presenting for orthodontic treatment. The development of the instrument is described in a previous article. Royal Devon and Exeter NHS Foundation Trust & Musgrove Park Hospital, Taunton. The child perception questionnaire was used as a standard against which to test criterion validity. The Bland and Altman method was used to check agreement between the two questionnaires. Construct validity was tested using principal component analysis on the four sections of the questionnaire. Test-retest reliability was tested using intraclass correlation coefficient and Bland and Altman method. Cronbach's alpha was used to test internal consistency reliability. Overall the questionnaire showed good reliability, criterion and construct validity. This together with previous evidence of good face and content validity suggests that the instrument may prove useful in clinical practice and further research. This study has demonstrated that the newly developed condition-specific quality-of-life questionnaire is both valid and reliable for use in young patients with hypodontia. © 2013 John Wiley & Sons A/S. Published by Blackwell Publishing Ltd.
Design and validation of a comprehensive fecal incontinence questionnaire.

PubMed

Macmillan, Alexandra K; Merrie, Arend E H; Marshall, Roger J; Parry, Bryan R

2008-10-01

Fecal incontinence can have a profound effect on quality of life. Its prevalence remains uncertain because of stigma, lack of consistent definition, and dearth of validated measures. This study was designed to develop a valid clinical and epidemiologic questionnaire, building on current literature and expertise. Patients and experts undertook face validity testing. Construct validity, criterion validity, and test-retest reliability was undertaken. Construct validity comprised factor analysis and internal consistency of the quality of life scale. The validity of known groups was tested against 77 control subjects by using regression models. Questionnaire results were compared with a stool diary for criterion validity. Test-retest reliability was calculated from repeated questionnaire completion. The questionnaire achieved good face validity. It was completed by 104 patients. The quality of life scale had four underlying traits (factor analysis) and high internal consistency (overall Cronbach alpha = 0.97). Patients and control subjects answered the questionnaire significantly differently (P < 0.01) in known-groups validity testing. Criterion validity assessment found mean differences close to zero. Median reliability for the whole questionnaire was 0.79 (range, 0.35-1). This questionnaire compares favorably with other available instruments, although the interpretation of stool consistency requires further research. Its sensitivity to treatment still needs to be investigated.
Correlates of the MMPI-2-RF in a college setting.

PubMed

Forbey, Johnathan D; Lee, Tayla T C; Handel, Richard W

2010-12-01

The current study examined empirical correlates of scores on Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; A. Tellegen & Y. S. Ben-Porath, 2008; Y. S. Ben-Porath & A. Tellegen, 2008) scales in a college setting. The MMPI-2-RF and six criterion measures (assessing anger, assertiveness, sex roles, cognitive failures, social avoidance, and social fear) were administered to 846 college students (nmen = 264, nwomen = 582) to examine the convergent and discriminant validity of scores on the MMPI-2-RF Specific Problems and Interest scales. Results demonstrated evidence of generally good convergent score validity for the selected MMPI-2-RF scales, reflected in large effect size correlations with criterion measure scores. Further, MMPI-2-RF scale scores demonstrated adequate discriminant validity, reflected in relatively low comparative median correlations between scores on MMPI-2-RF substantive scale sets and criterion measures. Limitations and future directions are discussed.
The Marital Disaffection Scale: An Inventory for Assessing Emotional Estrangement in Marriage.

ERIC Educational Resources Information Center

Kayser, Karen

1996-01-01

Describes a self-report scale measuring levels of disaffection toward one's spouse. A questionnaire containing the Marital Disaffection Scale (MDS) and other disaffection measures of marital happiness was administered to 76 spouses. Results indicated good criterion-related validity, discriminant validity, and interitem reliability. Findings…
Criterion validity study of the cervical range of motion (CROM) device for rotational range of motion on healthy adults.

PubMed

Tousignant, Michel; Smeesters, Cécil; Breton, Anne-Marie; Breton, Emilie; Corriveau, Hélène

2006-04-01

This study compared range of motion (ROM) measurements using a cervical range of motion device (CROM) and an optoelectronic system (OPTOTRAK). To examine the criterion validity of the CROM for the measurement of cervical ROM on healthy adults. Whereas measurements of cervical ROM are recognized as part of the assessment of patients with neck pain, few devices are available in clinical settings. Two papers published previously showed excellent criterion validity for measurements of cervical flexion/extension and lateral flexion using the CROM. Subjects performed neck rotation, flexion/extension, and lateral flexion while sitting on a wooden chair. The ROM values were measured by the CROM as well as the OPTOTRAK. The cervical rotational ROM values using the CROM demonstrated a good to excellent linear relationship with those using the OPTOTRAK: right rotation, r = 0.89 (95% confidence interval, 0.81-0.94), and left rotation, r = 0.94 (95% confidence interval, 0.90-0.97). Similar results were also obtained for flexion/extension and lateral flexion ROM values. The CROM showed excellent criterion validity for measurements of cervical rotation. We propose using ROM values measured by the CROM as outcome measures for patients with neck pain.
Measurement properties of depression questionnaires in patients with diabetes: a systematic review.

PubMed

van Dijk, Susan E M; Adriaanse, Marcel C; van der Zwaan, Lennart; Bosmans, Judith E; van Marwijk, Harm W J; van Tulder, Maurits W; Terwee, Caroline B

2018-06-01

To conduct a systematic review on measurement properties of questionnaires measuring depressive symptoms in adult patients with type 1 or type 2 diabetes. A systematic review of the literature in MEDLINE, EMbase and PsycINFO was performed. Full text, original articles, published in any language up to October 2016 were included. Eligibility for inclusion was independently assessed by three reviewers who worked in pairs. Methodological quality of the studies was evaluated by two independent reviewers using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Quality of the questionnaires was rated per measurement property, based on the number and quality of the included studies and the reported results. Of 6286 unique hits, 21 studies met our criteria evaluating nine different questionnaires in multiple settings and languages. The methodological quality of the included studies was variable for the different measurement properties: 9/15 studies scored 'good' or 'excellent' on internal consistency, 2/5 on reliability, 0/1 on content validity, 10/10 on structural validity, 8/11 on hypothesis testing, 1/5 on cross-cultural validity, and 4/9 on criterion validity. For the CES-D, there was strong evidence for good internal consistency, structural validity, and construct validity; moderate evidence for good criterion validity; and limited evidence for good cross-cultural validity. The PHQ-9 and WHO-5 also performed well on several measurement properties. However, the evidence for structural validity of the PHQ-9 was inconclusive. The WHO-5 was less extensively researched and originally not developed to measure depression. Currently, the CES-D is best supported for measuring depressive symptoms in diabetes patients.
Systematic review of measurement properties of questionnaires measuring somatization in primary care patients.

PubMed

Sitnikova, Kate; Dijkstra-Kersten, Sandra M A; Mokkink, Lidwine B; Terluin, Berend; van Marwijk, Harm W J; Leone, Stephanie S; van der Horst, Henriëtte E; van der Wouden, Johannes C

2017-12-01

The aim of this review is to critically appraise the evidence on measurement properties of self-report questionnaires measuring somatization in adult primary care patients and to provide recommendations about which questionnaires are most useful for this purpose. We assessed the methodological quality of included studies using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. To draw overall conclusions about the quality of the questionnaires, we conducted an evidence synthesis using predefined criteria for judging the measurement properties. We found 24 articles on 9 questionnaires. Studies on the Patient Health Questionnaire-15 (PHQ-15) and the Four-Dimensional Symptom Questionnaire (4DSQ) somatization subscale prevailed and covered the broadest range of measurement properties. These questionnaires had the best internal consistency, test-retest reliability, structural validity, and construct validity. The PHQ-15 also had good criterion validity, whereas the 4DSQ somatization subscale was validated in several languages. The Bodily Distress Syndrome (BDS) checklist had good internal consistency and structural validity. Some evidence was found for good construct validity and criterion validity of the Physical Symptom Checklist (PSC-51) and good construct validity of the Symptom Check-List (SCL-90-R) somatization subscale. However, these three questionnaires were only studied in a small number of primary care studies. Based on our findings, we recommend the use of either the PHQ-15 or 4DSQ somatization subscale for somatization in primary care. Other questionnaires, such as the BDS checklist, PSC-51 and the SCL-90-R somatization subscale show promising results but have not been studied extensively in primary care. Copyright © 2017 Elsevier Inc. All rights reserved.
Developing a short measure of organizational justice: a multisample health professionals study.

PubMed

Elovainio, Marko; Heponiemi, Tarja; Kuusio, Hannamaria; Sinervo, Timo; Hintsa, Taina; Aalto, Anna-Mari

2010-11-01

To develop and test the validity of a short version of the original questionnaire measuring organizational justice. The study samples comprised working physicians (N = 2792) and registered nurses (n = 2137) from the Finnish Health Professionals study. Structural equation modelling was applied to test structural validity, using the justice scales. Furthermore, criterion validity was explored with well-being (sleeping problems) and health indicators (psychological distress/self-rated health). The short version of the organizational justice questionnaire (eight items) provides satisfactory psychometric properties (internal consistency, a good model fit of the data). All scales were associated with an increased risk of sleeping problems and psychological distress, indicating satisfactory criterion validity. This short version of the organizational justice questionnaire provides a useful tool for epidemiological studies focused on health-adverse effects of work environment.
The Reliability, Validity, and Evaluation of the Objective Structured Clinical Examination in Podiatry (Chiropody).

ERIC Educational Resources Information Center

Woodburn, Jim; Sutcliffe, Nick

1996-01-01

The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Testing fine motor coordination via telehealth: effects of video characteristics on reliability and validity.

PubMed

Hoenig, Helen M; Amis, Kristopher; Edmonds, Carol; Morgan, Michelle S; Landerman, Lawrence; Caves, Kevin

2017-01-01

Background There is limited research about the effects of video quality on the accuracy of assessments of physical function. Methods A repeated measures study design was used to assess reliability and validity of the finger-nose test (FNT) and the finger-tapping test (FTT) carried out with 50 veterans who had impairment in gross and/or fine motor coordination. Videos were scored by expert raters under eight differing conditions, including in-person, high definition video with slow motion review and standard speed videos with varying bit rates and frame rates. Results FTT inter-rater reliability was excellent with slow motion video (ICC 0.98-0.99) and good (ICC 0.59) under the normal speed conditions. Inter-rater reliability for FNT 'attempts' was excellent (ICC 0.97-0.99) for all viewing conditions; for FNT 'misses' it was good to excellent (ICC 0.89) with slow motion review but substantially worse (ICC 0.44) on the normal speed videos. FTT criterion validity (i.e. compared to slow motion review) was excellent (β = 0.94) for the in-person rater and good ( β = 0.77) on normal speed videos. Criterion validity for FNT 'attempts' was excellent under all conditions ( r ≥ 0.97) and for FNT 'misses' it was good to excellent under all conditions ( β = 0.61-0.81). Conclusions In general, the inter-rater reliability and validity of the FNT and FTT assessed via video technology is similar to standard clinical practices, but is enhanced with slow motion review and/or higher bit rate.
An Improvement of the Anisotropy and Formability Predictions of Aluminum Alloy Sheets

NASA Astrophysics Data System (ADS)

Banabic, D.; Comsa, D. S.; Jurco, P.; Wagner, S.; Vos, M.

2004-06-01

The paper presents an yield criterion for orthotropic sheet metals and its implementation in a theoretical model in order to calculate the Forming Limit Curves. The proposed yield criterion has been validated for two aluminum alloys: AA3103-0 and AA5182-0, respectively. The biaxial tensile test of cross specimens has been used for the determination of the experimental yield locus. The new yield criterion has been implemented in the Marciniak-Kuczynski model for the calculus of limit strains. The calculated Forming Limit Curves have been compared with the experimental ones, determined by frictionless test: bulge test, plane strain test and uniaxial tensile test. The predicted Forming Limit Curves using the new yield criterion are in good agreement with the experimental ones.
Psychometric Properties of the Adapted Skillstreaming Checklist for High-Functioning Children with ASD

ERIC Educational Resources Information Center

Lopata, Christopher; Rodgers, Jonathan D.; Donnelly, James P.; Thomeer, Marcus L.; McDonald, Christin A.; Volker, Martin A.

2017-01-01

This study examined the reliability and criterion-related validity of parent ratings on the Adapted Skillstreaming Checklist (ASC) for a sample of 275 high-functioning children, ages 6-12 years, with ASD. Internal consistency for the total sample was 0.92. For two subsamples, test-retest reliability was very good at the 6-week and good at the…
Developing and testing the patient-centred innovation questionnaire for hospital nurses.

PubMed

Huang, Ching-Yuan; Weng, Rhay-Hung; Wu, Tsung-Chin; Lin, Tzu-En; Hsu, Ching-Tai; Hung, Chiu-Hsia; Tsai, Yu-Chen

2018-03-01

Develop the patient-centred innovation questionnaire for hospital nurses and establish its validity and reliability. Patient-centred care has been adopted by health care managers in their efforts to improve health care quality. It is regarded as a core concept for developing innovation. A cross-sectional study was employed to collect data from hospital nurses in Taiwan. This study was divided into two stages: pilot study and main study. In the main study, 596 valid responses were collected. This study adopted reliability analysis, exploratory factor analysis, confirmatory factor analysis and selected nurse innovation scale as a criterion to test criterion-related validity. Five-dimension patient-centred innovation questionnaire was proposed: access and practicability, co-ordination and communication, sharing power and responsibility, care continuity, family and person focus. Each dimension demonstrated a reliability of 0.89-0.98. All dimensions had acceptable convergent and discriminate validity. The patient-centred innovation questionnaire and nurse innovation scale exhibited a significantly positive correlation. Patient-centred innovation questionnaire not only had a good theoretical basis but also had sufficient reliability and construct validity, and criterion-related validity. Patient-centred innovation questionnaire could give a measure for evaluating the implementation of patient-centred care and could be used as a management tool during the process of nurse innovation. © 2017 John Wiley & Sons Ltd.
Measuring assessment standards in undergraduate medical programs: Development and validation of AIM tool.

PubMed

Sajjad, Madiha; Khan, Rehan Ahmed; Yasmeen, Rahila

2018-01-01

To develop a tool to evaluate faculty perceptions of assessment quality in an undergraduate medical program. The Assessment Implementation Measure (AIM) tool was developed by a mixed method approach. A preliminary questionnaire developed through literature review was submitted to a panel of 10 medical education experts for a three-round 'Modified Delphi technique'. Panel agreement of > 75% was considered the criterion for inclusion of items in the questionnaire. Cognitive pre-testing of five faculty members was conducted. Pilot study was done with 30 randomly selected faculty members. Content validity index (CVI) was calculated for individual items (I-CVI) and composite scale (S-CVI). Cronbach's alpha was calculated to determine the internal consistency reliability of the tool. The final AIM tool had 30 items after the Delphi process. S-CVI was 0.98 with the S-CVI/Avg method and 0.86 by S-CVI/UA method, suggesting good content validity. Cut-off value of < 0.9 I-CVI was taken as criterion for item deletion. Cognitive pre-testing revealed good item interpretation. Cronbach's alpha calculated for the AIM was 0.9, whereas Cronbach's alpha for the four domains ranged from 0.67 to 0.80. 'AIM' is a relevant and useful instrument with good content validity and reliability of results, and may be used to evaluate the teachers´ perceptions about assessment quality.
Nutrition screening tools: does one size fit all? A systematic review of screening tools for the hospital setting.

PubMed

van Bokhorst-de van der Schueren, Marian A E; Guaitoli, Patrícia Realino; Jansma, Elise P; de Vet, Henrica C W

2014-02-01

Numerous nutrition screening tools for the hospital setting have been developed. The aim of this systematic review is to study construct or criterion validity and predictive validity of nutrition screening tools for the general hospital setting. A systematic review of English, French, German, Spanish, Portuguese and Dutch articles identified via MEDLINE, Cinahl and EMBASE (from inception to the 2nd of February 2012). Additional studies were identified by checking reference lists of identified manuscripts. Search terms included key words for malnutrition, screening or assessment instruments, and terms for hospital setting and adults. Data were extracted independently by 2 authors. Only studies expressing the (construct, criterion or predictive) validity of a tool were included. 83 studies (32 screening tools) were identified: 42 studies on construct or criterion validity versus a reference method and 51 studies on predictive validity on outcome (i.e. length of stay, mortality or complications). None of the tools performed consistently well to establish the patients' nutritional status. For the elderly, MNA performed fair to good, for the adults MUST performed fair to good. SGA, NRS-2002 and MUST performed well in predicting outcome in approximately half of the studies reviewed in adults, but not in older patients. Not one single screening or assessment tool is capable of adequate nutrition screening as well as predicting poor nutrition related outcome. Development of new tools seems redundant and will most probably not lead to new insights. New studies comparing different tools within one patient population are required. Copyright © 2013 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
Measuring Sexual Motives: A Test of the Psychometric Properties of the Sexual Motivations Scale.

PubMed

Jardin, Charles; Garey, Lorra; Zvolensky, Michael J

2017-01-01

Sexual motives refer to functions served by sexual behavior. The Sex Motivations Scale (SMS) has frequently been used to assess sexual motives. At its development, the SMS demonstrated good internal consistency; convergent, divergent, and criterion validity; and configural invariance across sex, age, and Caucasians and African Americans. Yet the metric and scalar invariance of the SMS has not been examined, nor has the measurement invariance of the SMS across Hispanic and Asian Americans, sexual minority status, and relationship status been tested. The criterion validity of the SMS also has yet to be examined for nonintercourse sexual behaviors, such as sexting. The present study aimed to address these gaps in a diverse sample of 2,201 college students (77.60% female; M age = 22.06; 27.84% Caucasian). Results further affirmed the configural, metric, and scalar invariance of the SMS. The convergent and divergent validity of the SMS was supported in relation to positive and negative affect and attachment patterns; and specific SMS subscales demonstrated associations with sexual intercourse behaviors and sexting, supporting the criterion validity of the SMS. These findings suggest the relevance of the SMS in assessing sexual motives across diverse populations and behaviors.
Development and Validation of Triarchic Construct Scales from the Psychopathic Personality Inventory

PubMed Central

Hall, Jason R.; Drislane, Laura E.; Patrick, Christopher J.; Morano, Mario; Lilienfeld, Scott O.; Poythress, Norman G.

2014-01-01

The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy—the Psychopathic Personality Inventory (PPI). A consensus rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and non-offender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy. PMID:24447280
Development and validation of Triarchic construct scales from the psychopathic personality inventory.

PubMed

Hall, Jason R; Drislane, Laura E; Patrick, Christopher J; Morano, Mario; Lilienfeld, Scott O; Poythress, Norman G

2014-06-01

The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy-the Psychopathic Personality Inventory (PPI). A consensus-rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and nonoffender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy.
Developing a tool to measure satisfaction among health professionals in sub-Saharan Africa

PubMed Central

2013-01-01

Background In sub-Saharan Africa, lack of motivation and job dissatisfaction have been cited as causes of poor healthcare quality and outcomes. Measurement of health workers’ satisfaction adapted to sub-Saharan African working conditions and cultures is a challenge. The objective of this study was to develop a valid and reliable instrument to measure satisfaction among health professionals in the sub-Saharan African context. Methods A survey was conducted in Senegal and Mali in 2011 among 962 care providers (doctors, midwives, nurses and technicians) practicing in 46 hospitals (capital, regional and district). The participation rate was very high: 97% (937/962). After exploratory factor analysis (EFA), construct validity was assessed through confirmatory factor analysis (CFA). The discriminant validity of our subscales was evaluated by comparing the average variance extracted (AVE) for each of the constructs with the squared interconstruct correlation (SIC), and finally for criterion validity, each subscale was tested with two hypotheses. Two dimensions of reliability were assessed: internal consistency with Cronbach’s alpha subscales and stability over time using a test-retest process. Results Eight dimensions of satisfaction encompassing 24 items were identified and validated using a process that combined psychometric analyses and expert opinions: continuing education, salary and benefits, management style, tasks, work environment, workload, moral satisfaction and job stability. All eight dimensions demonstrated significant discriminant validity. The final model showed good performance, with a root mean square error of approximation (RMSEA) of 0.0508 (90% CI: 0.0448 to 0.0569) and a comparative fit index (CFI) of 0.9415. The concurrent criterion validity of the eight dimensions was good. Reliability was assessed based on internal consistency, which was good for all dimensions but one (moral satisfaction < 0.70). Test-retest showed satisfactory temporal stability (intra class coefficient range: 0.60 to 0.91). Conclusions Job satisfaction is a complex construct; this study provides a multidimensional instrument whose content, construct and criterion validities were verified to ensure its suitability for the sub-Saharan African context. When using these subscales in further studies, the variability of the reliability of the subscales should be taken in to account for calculating the sample sizes. The instrument will be useful in evaluative studies which will help guide interventions aimed at improving both the quality of care and its effectiveness. PMID:23826720
PubMed

Steagall, Paulo V M; Monteiro, Beatriz P; Lavoie, Anne-Marie; Frank, Diane; Troncy, Eric; Luna, Stelio P L; Brondani, Juliana T

2017-01-01

Validation of the French version of the UNESP-Botucatu multidimensional composite pain scale for assessing postoperative pain in cats. The aim of this study was to validate the French version of the UNESP-Botucatu multidimensional composite pain scale (MCPS-Fr) to assess postoperative pain in cats. Two veterinarians and one DVM student identified three domains of behavior based on video analyses: "psychomotor change", "protection of the painful area" and "physiological variables". Internal consistency was excellent (Cronbach's alpha coefficient of 0.94, 0.90 and 0.61, respectively). Criterion validity was good to very good when evaluations from the three observers were compared with a "gold standard". Inter- and intra-rater reliability for each scale item were good to very good. The optimal cut-off point identified with a ROC curve was > 7 (scale range 0-30 points), with a sensitivity of 97.8% and specificity of 99.1%. The MCPS-Fr is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy.(Translated by Dr. Beatriz Monteiro).

COMFORT scale: a reliable and valid method to measure the amount of stress of ventilated preterm infants.

PubMed

Wielenga, J M; De Vos, R; de Leeuw, R; De Haan, R J

2004-01-01

Assessment of clinimetric properties and diagnostic quality of a stress measurement scale (COMFORT scale). Sample of an open population. Neonatology department (Neonatal Intensive Care Unit), Academic Medical Centre/Emma Children's Hospital, Amsterdam, The Netherlands. One clinical expert and 9 observers observed ventilated premature born babies simultaneously. Criterion validity was assessed by correlating the COMFORT scale with the clinical judgment regarding the amount of stress. Interobserver reliability was assessed on the clinical judgment as well as on the COMFORT scale. Diagnostic qualities were evaluated with a ROC curve. On 19 ventilated prematurely born babies (mean gestational age 30 weeks, mean birth weight 1385 gm), one clinical expert and 9 observers made 30 paired observations. The criterion validity of the COMFORT scale was good (Pearson's r of 0.84). The interobserver reliability of the clinical judgment was very good (weighted Kappa 0.84). The interobserver reliability of each item varied from good to almost perfect (weighted Kappa of 0.64 for muscle tone to 1.00 on heart rate). The reliability of the total COMFORT scale score was satisfying (intra-class correlation coefficient of 0.94). The diagnostic quality of the COMFORT scale was excellent, at a cut-off point of 20 the sensitivity was 100 percent, the specificity was 77 percent, and the area under the curve (AUC) of 0.95. In this first evaluation, the COMFORT scale appears to be a valid and reliable measurement tool to assess the stress of ventilated prematurely born babies.
Validity and reliability of three commonly used quality of life measures in a large European population of coronary heart disease patients.

PubMed

De Smedt, Delphine; Clays, Els; Doyle, Frank; Kotseva, Kornelia; Prugger, Christof; Pająk, Andrzej; Jennings, Catriona; Wood, David; De Bacquer, Dirk

2013-09-01

To investigate the validity and reliability of the EuroQol-5D (EQ-5D), the 12-item Short-Form Health Survey (SF-12v2), and the Hospital Anxiety and Depression Scale (HADS) in a stable coronary population. Cross-sectional study EUROASPIRE III. Quality of life data (QoL) were available on 8745 patients hospitalized for coronary artery bypass graft (CABG), percutaneous coronary intervention (PCI), acute myocardial infarction (AMI), or myocardial ischemia. They were interviewed and examined at least 6 months after their hospital admission. Reliability and validity of the 3 instruments were tested. Internal consistency, and discriminative, convergent, criterion and construct validity were assessed. Cronbach's alpha indicated good internal consistency for all measures (0.73 to 0.87). Discriminative validity analyses confirmed significant QoL differences between known groups: age, gender, educational level. In addition, all hypothesized correlations between QoL constructs (convergent validity) and items (criterion validity) were confirmed with significant correlations. Confirmatory factor analyses indicated good construct validity for HADS and SF-12v2. On country-specific level, results were roughly similar. The EQ-5D as well as the SF-12v2 and the HADS are reliable and valid instruments for use in a stable coronary population, both on aggregate European level and on country-specific level. However, our results must be generalized with caution, because EUROASPIRE III patients might not be representative for all patients with stable coronary heart disease. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Validity and reliability of the Japanese version of the FIM + FAM in patients with cerebrovascular accident.

PubMed

Miki, Emi; Yamane, Shingo; Yamaoka, Mai; Fujii, Hiroe; Ueno, Hiroka; Kawahara, Toshie; Tanaka, Keiko; Tamashiro, Hiroaki; Inoue, Eiji; Okamoto, Takatsugu; Kuriyama, Masaru

2016-09-01

The study aim was to investigate the validity and reliability of the Functional Independence Measure and Functional Assessment Measure (FIM + FAM), which is unfamiliar in Japan, by using its Japanese version (FIM + FAM-j) in patients with cerebrovascular accident (CVA). Forty-two CVA patients participated. Criterion validity was examined by correlating the full scale and subscales of FIM + FAM-j with several well-established measurements using Spearman's correlation coefficient. Reliability was evaluated by internal consistency (tested by Cronbach's alpha coefficient) and intra-rater reliability (tested by Kendall's tau correlation coefficient). Good-to-excellent criterion validity was found between the full scale and motor subscales of the FIM + FAM-j and the Barthel Index, National Institutes of Health Stroke Scale, modified Rankin Scale, and lower extremity Brunnstrom Recovery Stage. High internal consistency was observed within the full-scale FIM + FAM-j and the motor and cognitive subscales (Cronbach's alphas were 0.968, 0.954, and 0.948, respectively). Additionally, good intra-rater reliability was observed within the full scale and motor subscales, and excellent reliability for the cognitive subscales (taus were 0.83, 0.80, and 0.98, respectively). This study showed that the FIM + FAM-j demonstrated acceptable levels of validity and reliability when used for CVA as a measure of disability.
The Perceived Leadership Communication Questionnaire (PLCQ): Development and Validation.

PubMed

Schneider, Frank M; Maier, Michaela; Lovrekovic, Sara; Retzbach, Andrea

2015-01-01

The Perceived Leadership Communication Questionnaire (PLCQ) is a short, reliable, and valid instrument for measuring leadership communication from both perspectives of the leader and the follower. Drawing on a communication-based approach to leadership and following a theoretical framework of interpersonal communication processes in organizations, this article describes the development and validation of a one-dimensional 6-item scale in four studies (total N = 604). Results from Study 1 and 2 provide evidence for the internal consistency and factorial validity of the PLCQ's self-rating version (PLCQ-SR)-a version for measuring how leaders perceive their own communication with their followers. Results from Study 3 and 4 show internal consistency, construct validity, and criterion validity of the PLCQ's other-rating version (PLCQ-OR)-a version for measuring how followers perceive the communication of their leaders. Cronbach's α had an average of.80 over the four studies. All confirmatory factor analyses yielded good to excellent model fit indices. Convergent validity was established by average positive correlations of.69 with subdimensions of transformational leadership and leader-member exchange scales. Furthermore, nonsignificant correlations with socially desirable responding indicated discriminant validity. Last, criterion validity was supported by a moderately positive correlation with job satisfaction (r =.31).
Reliability and validity of cervical position measurements in individuals with and without chronic neck pain.

PubMed

Dunleavy, Kim; Neil, Joseph; Tallon, Allison; Adamo, Diane E

2015-09-01

The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to establish: (1) criterion validity of CROM FHP and UCA compared to Optotrak data, (2) relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) to compare UCA and FHP in patients with and without pain in habitual postures. (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and -1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements were good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups, UCA was significantly more extended in the pain group (P<0·05). Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome measure. Cervical range of motion UCA measurements showed good criterion validity, excellent test-retest reliability, and achievable MDC95 in asymptomatic and symptomatic participants. Differences of more than 6° are required to exceed error. Cervical range of motion UCA shows promise as a useful reliable and valid measurement, particularly as patients with cervical pain exhibited significantly more extended angles.
Reliability and validity of cervical position measurements in individuals with and without chronic neck pain

PubMed Central

Neil, Joseph; Tallon, Allison; Adamo, Diane E.

2015-01-01

Objectives The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to establish: (1) criterion validity of CROM FHP and UCA compared to Optotrak data, (2) relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) to compare UCA and FHP in patients with and without pain in habitual postures. Methods (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. Results (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and −1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements were good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups, UCA was significantly more extended in the pain group (P<0·05). Discussion Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome measure. Cervical range of motion UCA measurements showed good criterion validity, excellent test–retest reliability, and achievable MDC95 in asymptomatic and symptomatic participants. Differences of more than 6° are required to exceed error. Cervical range of motion UCA shows promise as a useful reliable and valid measurement, particularly as patients with cervical pain exhibited significantly more extended angles. PMID:26917936
A systematic review of reliability and objective criterion-related validity of physical activity questionnaires.

PubMed

Helmerhorst, Hendrik J F; Brage, Søren; Warren, Janet; Besson, Herve; Ekelund, Ulf

2012-08-31

Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA) and in particular by physical activity questionnaires (PAQs) remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs.A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible.In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62-0.71 for existing, and 0.74-0.76 for new PAQs. Median validity coefficients ranged from 0.30-0.39 for existing, and from 0.25-0.41 for new PAQs.Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument.
A systematic review of reliability and objective criterion-related validity of physical activity questionnaires

PubMed Central

2012-01-01

Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA) and in particular by physical activity questionnaires (PAQs) remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument. PMID:22938557
The Transition Readiness Assessment Questionnaire (TRAQ): its factor structure, reliability, and validity.

PubMed

Wood, David L; Sawicki, Gregory S; Miller, M David; Smotherman, Carmen; Lukens-Bull, Katryne; Livingood, William C; Ferris, Maria; Kraemer, Dale F

2014-01-01

National consensus statements recommend that providers regularly assess the transition readiness skills of adolescent and young adults (AYA). In 2010 we developed a 29-item version of Transition Readiness Assessment Questionnaire (TRAQ). We reevaluated item performance and factor structure, and reassessed the TRAQ's reliability and validity. We surveyed youth from 3 academic clinics in Jacksonville, Florida; Chapel Hill, North Carolina; and Boston, Massachusetts. Participants were AYA with special health care needs aged 14 to 21 years. From a convenience sample of 306 patients, we conducted item reduction strategies and exploratory factor analysis (EFA). On a second convenience sample of 221 patients, we conducted confirmatory factor analysis (CFA). Internal reliability was assessed by Cronbach's alpha and criterion validity. Analyses were conducted by the Wilcoxon rank sum test and mixed linear models. The item reduction and EFA resulted in a 20-item scale with 5 identified subscales. The CFA conducted on a second sample provided a good fit to the data. The overall scale has high reliability overall (Cronbach's alpha = .94) and good reliability for 4 of the 5 subscales (Cronbach's alpha ranging from .90 to .77 in the pooled sample). Each of the 5 subscale scores were significantly higher for adolescents aged 18 years and older versus those younger than 18 (P < .0001) in both univariate and multivariate analyses. The 20-item, 5-factor structure for the TRAQ is supported by EFA and CFA on independent samples and has good internal reliability and criterion validity. Additional work is needed to expand or revise the TRAQ subscales and test their predictive validity. Copyright © 2014 Academic Pediatric Association. Published by Elsevier Inc. All rights reserved.
Validation of the German version of the Nurse-Work Instability Scale: baseline survey findings of a prospective study of a cohort of geriatric care workers

PubMed Central

2013-01-01

Background A prospective study of a cohort of nursing staff from nursing homes was undertaken to validate the Nurse-Work Instability Scale (Nurse-WIS). Baseline investigation data was used to test reliability, construct validity and criterion validity. Method A survey of nursing staff from nursing homes was conducted using a questionnaire containing the Nurse-WIS along with other survey instruments (including SF-12, WAI, SPE). The self-reported number of days’ sick leave taken and if a pension for reduced work capacity was drawn were recorded. The reliability of the scale was checked by item difficulty (P), item discrimination (rjt) and by internal consistency according to Cronbach’s coefficient. The hypotheses for checking construct validity were tested on the basis of correlations. Pearson’s chi-square was used to test concurrent criterion validity; discriminant validity was tested by means of binary logistic regression. Results 396 persons answered the questionnaire (21.3% response rate). More than 80% were female and mostly work full-time in a rotating shift pattern. Following the test for item discrimination, two items were removed from the Nurse-WIS test. According to Cronbach’s (0.927) the scale provides a high degree of measuring accuracy. All hypotheses and assumptions used to test validity were confirmed: As the Nurse-WIS risk increases, health-related quality of life, work ability and job satisfaction decline. Depressive symptoms and a poor subjective prognosis of earning capacity are also more frequent. Musculoskeletal disorders and impairments of psychological well-being are more frequent. Age also influences the Nurse-WIS result. While 12.0% of those below the age of 35 had an increased risk, the figure for those aged over 55 was 50%. Conclusion This study is the first validation study of the Nurse-WIS to date. The Nurse-WIS shows good reliability, good validity and a good level of measuring accuracy. It appears to be suitable for recording prevention and rehabilitation needs among health care workers. If, in the follow-up, the Nurse-WIS likewise proves to be a reliable screening instrument with good predictive validity, it could ensure that suitable action is taken at an early stage, thereby helping to counteract early retirement and the anticipated shortage of health care workers. PMID:24330532
The Work-Health-Check (WHC): a brief new tool for assessing psychosocial stress in the workplace.

PubMed

Gadinger, M C; Schilling, O; Litaker, D; Fischer, J E

2012-01-01

Brief, psychometrically robust questionnaires assessing work-related psychosocial stressors are lacking. The purpose of the study is to evaluate the psychometric properties of a brief new questionnaire for assessing sources of work-related psychosocial stress. Managers, blue- and white-collar workers (n= 628 at measurement point one, n=459 at measurement point two), sampled from an online panel of a German marketing research institute. We either developed or identified appropriate items from existing questionnaires for ten scales, which are conceptually based in work stress models and reflected either work-related demands or resources. Factorial structure was evaluated by confirmatory factor analyses (CFA). Scale reliability was assessed by Cronbach's Alpha, and test-retest; correlations with work-related efforts demonstrated convergent and discriminant validity for the demand and resource scales, respectively. Scale correlations with health indicators tested criterion validity. All scales had satisfactory reliability (Cronbach's Alpha: 0.74-0.93, retest reliabilities: 0.66-0.81). CFA supported the anticipated factorial structure. Significant correlations between job-related efforts and demand scales (mean r=0.44) and non-significant correlations with the resource scales (mean r=0.07) suggested good convergent and discriminant validity, respectively. Scale correlations with health indicators demonstrated good criterion validity. The WHC appears to be a brief, psychometrically robust instrument for assessing work-related psychosocial stressors.
Translating and validating a Training Needs Assessment tool into Greek

PubMed Central

Markaki, Adelais; Antonakis, Nikos; Hicks, Carolyn M; Lionis, Christos

2007-01-01

Background The translation and cultural adaptation of widely accepted, psychometrically tested tools is regarded as an essential component of effective human resource management in the primary care arena. The Training Needs Assessment (TNA) is a widely used, valid instrument, designed to measure professional development needs of health care professionals, especially in primary health care. This study aims to describe the translation, adaptation and validation of the TNA questionnaire into Greek language and discuss possibilities of its use in primary care settings. Methods A modified version of the English self-administered questionnaire consisting of 30 items was used. Internationally recommended methodology, mandating forward translation, backward translation, reconciliation and pretesting steps, was followed. Tool validation included assessing item internal consistency, using the alpha coefficient of Cronbach. Reproducibility (test – retest reliability) was measured by the kappa correlation coefficient. Criterion validity was calculated for selected parts of the questionnaire by correlating respondents' research experience with relevant research item scores. An exploratory factor analysis highlighted how the items group together, using a Varimax (oblique) rotation and subsequent Cronbach's alpha assessment. Results The psychometric properties of the Greek version of the TNA questionnaire for nursing staff employed in primary care were good. Internal consistency of the instrument was very good, Cronbach's alpha was found to be 0.985 (p < 0.001) and Kappa coefficient for reproducibility was found to be 0.928 (p < 0.0001). Significant positive correlations were found between respondents' current performance levels on each of the research items and amount of research involvement, indicating good criterion validity in the areas tested. Factor analysis revealed seven factors with eigenvalues of > 1.0, KMO (Kaiser-Meyer-Olkin) measure of sampling adequacy = 0.680 and Bartlett's test of sphericity, p < 0.001. Conclusion The translated and adapted Greek version is comparable with the original English instrument in terms of validity and reliability and it is suitable to assess professional development needs of nursing staff in Greek primary care settings. PMID:17474989
Translation and validation of the Cancer-Related Fatigue Scale in Greek in a sample of patients with advanced prostate cancer.

PubMed

Charalambous, Andreas; Kaite, Charis; Constantinou, Marianna; Kouta, Christiana

2016-12-02

To translate and validate the Cancer-Related Fatigue (CRF) Scale in the Greek language. A cross-sectional descriptive design was used in order to translate and validate the CRF Scale in Greek. Factor analyses were performed to understand the psychometric properties of the scale and to establish construct, criterion and convergent validity. Outpatients' oncology clinics of two public hospitals in Cyprus. 148 patients with advanced prostate cancer undergoing chemotherapy. The Cancer Fatigue Scale (CFS) had good stability (test-retest reliability r=0.79, p<0.001) and good internal consistency (Cronbach's α coefficient for all 15 items α=0.916). Furthermore, the Kaiser-Meyer-Olkin Measure of Sampling Adequacy (KMO value) was found to be 0.743 and considered to be satisfactory (>0.5). The correlations between the CFS physical scale (CFS-FS scale) and the European Organization for Research and Treatment of Cancer (EORTC) QLQ-C30 physical subscales were found to be significant (r=-0.715). The same occurred between CFS cognitive and EORTC cognitive subscale (r=-0.579). Overall, the criterion validity was verified. The same occurs for the convergent validity of the CFS since all correlations with the Global Health Status (q29-q30) were found to be significant. This is the first validation study of the CRF Scale in Greek and warrant of its use in the assessment of prostate cancer patient's related fatigue. However, further testing and validation is needed in the early stages of the disease and in patients in later chemotherapy cycles. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Translation and validation of the Cancer-Related Fatigue Scale in Greek in a sample of patients with advanced prostate cancer

PubMed Central

Kaite, Charis; Constantinou, Marianna; Kouta, Christiana

2016-01-01

Objective To translate and validate the Cancer-Related Fatigue (CRF) Scale in the Greek language. Design A cross-sectional descriptive design was used in order to translate and validate the CRF Scale in Greek. Factor analyses were performed to understand the psychometric properties of the scale and to establish construct, criterion and convergent validity. Setting Outpatients' oncology clinics of two public hospitals in Cyprus. Participants 148 patients with advanced prostate cancer undergoing chemotherapy. Results The Cancer Fatigue Scale (CFS) had good stability (test–retest reliability r=0.79, p<0.001) and good internal consistency (Cronbach's α coefficient for all 15 items α=0.916). Furthermore, the Kaiser-Meyer-Olkin Measure of Sampling Adequacy (KMO value) was found to be 0.743 and considered to be satisfactory (>0.5). The correlations between the CFS physical scale (CFS-FS scale) and the European Organization for Research and Treatment of Cancer (EORTC) QLQ-C30 physical subscales were found to be significant (r=−0.715). The same occurred between CFS cognitive and EORTC cognitive subscale (r=−0.579). Overall, the criterion validity was verified. The same occurs for the convergent validity of the CFS since all correlations with the Global Health Status (q29–q30) were found to be significant. Conclusions This is the first validation study of the CRF Scale in Greek and warrant of its use in the assessment of prostate cancer patient's related fatigue. However, further testing and validation is needed in the early stages of the disease and in patients in later chemotherapy cycles. PMID:27913557
Reliability and validity of the Tilburg Frailty Indicator (TFI) among Chinese community-dwelling older people.

PubMed

Dong, Lijuan; Liu, Na; Tian, Xiaoyu; Qiao, Xiaoxia; Gobbens, Robbert J J; Kane, Robert L; Wang, Cuili

2017-11-01

To translate the Tilburg Frailty Indicator (TFI) into Chinese and assess its reliability and validity. A sample of 917 community-dwelling older people, aged ≥60 years, in a Chinese city was included between August 2015 and March 2016. Construct validity was assessed using alternative measures corresponding to the TFI items, including self-rated health status (SRH), unintentional weight loss, walking speed, timed-up-and-go tests (TUGT), making telephone calls, grip strength, exhaustion, Short Portable Mental Status Questionnaire (SPMSQ), Geriatric Depression scale (GDS-15), emotional role, Adaptability Partnership Growth Affection and Resolve scale (APGAR) and Social Support Rating Scale (SSRS). Fried's phenotype and frailty index were measured to evaluate criterion validity. Adverse health outcomes (ADL and IADL disability, healthcare utilization, GDS-15, SSRS) were used to assess predictive (concurrent) validity. The internal consistency reliability was good (Cronbach's α=0.71). The test-retest reliability was strong (r=0.88). Kappa coefficients showed agreements between the TFI items and corresponding alternative measures. Alternative measures correlated as expected with the three domains of TFI, with an exclusion that alternative psychological measures had similar correlations with psychological and physical domains of the TFI. The Chinese TFI had excellent criterion validity with the AUCs regarding physical phenotype and frailty index of 0.87 and 0.86, respectively. The predictive (concurrent) validities of the adverse health outcomes and healthcare utilization were acceptable (AUCs: 0.65-0.83). The Chinese TFI has good validity and reliability as an integral instrument to measure frailty of older people living in the community in China. Copyright © 2017 Elsevier B.V. All rights reserved.
Evaluation of Criterion Validity for Scales with Congeneric Measures

ERIC Educational Resources Information Center

Raykov, Tenko

2007-01-01

A method for estimating criterion validity of scales with homogeneous components is outlined. It accomplishes point and interval estimation of interrelationship indices between composite scores and criterion variables and is useful for testing hypotheses about criterion validity of measurement instruments. The method can also be used with missing…
Tobacco Use Prevention for the Young (TUPY-S): Development, Validity and Reliability of an Interactive Multimedia Strategy from the Adolescents’ Perspective in Malaysia

PubMed Central

Zin, Faridah Mohd; Hillaluddin, Azlin Hilma; Mustaffa, Jamaludin

2017-01-01

Objective: This study aims to develop, validate and determine the reliability of an interactive multimedia strategy to prevent tobacco use among the young (TUPY-S) from an adolescents’ perspective. Methods: A descriptive study design was utilized. A modular instruction guideline by Russel (1974) was followed in the entire process, comprising a feasibility study, a review of existing modules, specification of the objectives, identification of the construct criterion items, learner analysis and entry behavior specification, establishment of the sequence instruction and media selection, a tryout with students and a field test. Result: Feasibility was agreed among the researchers and the school authorities. Culturally suitable rigorously developed tobacco use preventive strategies delivered using information technology (IT) are lacking in the literature. The objective of TUPY-S is to prevent tobacco use among adolescents living in Malaysia. Identified construct criterion items include knowledge, attitude, intention to use, self-efficacy, and refusal skill. The target population was early adolescents belonging to generation-Z. Content was developed from the adolescents’ perspective and delivered using IT in Malay language. Content validity, assessed by six experts in the field and module development, was good at 86%. The students’ tryout showed satisfactory face validity subjectively and objectively (85.5%) and high alpha Cronbach reliability (0.91). Conclusion: TUPY-S was confirmed to suit early adolescents of the current generation living in Malaysia. It demonstrated good content validity among the experts, satisfactory face validity and reliability among the target population. TUPY-S is ready to be evaluated for its effectiveness among early adolescents. PMID:28612599
Reliability and criterion-related validity of a new repeated agility test

PubMed Central

Makni, E; Jemni, M; Elloumi, M; Chamari, K; Nabli, MA; Padulo, J; Moalla, W

2016-01-01

The study aimed to assess the reliability and the criterion-related validity of a new repeated sprint T-test (RSTT) that includes intense multidirectional intermittent efforts. The RSTT consisted of 7 maximal repeated executions of the agility T-test with 25 s of passive recovery rest in between. Forty-five team sports players performed two RSTTs separated by 3 days to assess the reliability of best time (BT) and total time (TT) of the RSTT. The intra-class correlation coefficient analysis revealed a high relative reliability between test and retest for BT and TT (>0.90). The standard error of measurement (<0.50) showed that the RSTT has a good absolute reliability. The minimal detectable change values for BT and TT related to the RSTT were 0.09 s and 0.58 s, respectively. To check the criterion-related validity of the RSTT, players performed a repeated linear sprint (RLS) and a repeated sprint with changes of direction (RSCD). Significant correlations between the BT and TT of the RLS, RSCD and RSTT were observed (p<0.001). The RSTT is, therefore, a reliable and valid measure of the intermittent repeated sprint agility performance. As this ability is required in all team sports, it is suggested that team sports coaches, fitness coaches and sports scientists consider this test in their training follow-up. PMID:27274109
Validation of the Spanish Addiction Severity Index Multimedia Version (S-ASI-MV).

PubMed

Butler, Stephen F; Redondo, José Pedro; Fernandez, Kathrine C; Villapiano, Albert

2009-01-01

This study aimed to develop and test the reliability and validity of a Spanish adaptation of the ASI-MV, a computer administered version of the Addiction Severity Index, called the S-ASI-MV. Participants were 185 native Spanish-speaking adult clients from substance abuse treatment facilities serving Spanish-speaking clients in Florida, New Mexico, California, and Puerto Rico. Participants were administered the S-ASI-MV as well as Spanish versions of the general health subscale of the SF-36, the work and family unit subscales of the Social Adjustment Scale Self-Report, the Michigan Alcohol Screening Test, the alcohol and drug subscales of the Personality Assessment Inventory, and the Hopkins Symptom Checklist-90. Three-to-five-day test-retest reliability was examined along with criterion validity, convergent/discriminant validity, and factorial validity. Measurement invariance between the English and Spanish versions of the ASI-MV was also examined. The S-ASI-MV demonstrated good test-retest reliability (ICCs for composite scores between .59 and .93), criterion validity (rs for composite scores between .66 and .87), and convergent/discriminant validity. Factorial validity and measurement invariance were demonstrated. These results compared favorably with those reported for the original interviewer version of the ASI and the English version of the ASI-MV.
[Reliability and validity of warning signs checklist for screening psychological, behavioral and developmental problems of children].

PubMed

Huang, X N; Zhang, Y; Feng, W W; Wang, H S; Cao, B; Zhang, B; Yang, Y F; Wang, H M; Zheng, Y; Jin, X M; Jia, M X; Zou, X B; Zhao, C X; Robert, J; Jing, Jin

2017-06-02

Objective: To evaluate the reliability and validity of warning signs checklist developed by the National Health and Family Planning Commission of the People's Republic of China (NHFPC), so as to determine the screening effectiveness of warning signs on developmental problems of early childhood. Method: Stratified random sampling method was used to assess the reliability and validity of checklist of warning sign and 2 110 children 0 to 6 years of age(1 513 low-risk subjects and 597 high-risk subjects) were recruited from 11 provinces of China. The reliability evaluation for the warning signs included the test-retest reliability and interrater reliability. With the use of Age and Stage Questionnaire (ASQ) and Gesell Development Diagnosis Scale (GESELL) as the criterion scales, criterion validity was assessed by determining the correlation and consistency between the screening results of warning signs and the criterion scales. Result: In terms of the warning signs, the screening positive rates at different ages ranged from 10.8%(21/141) to 26.2%(51/137). The median (interquartile) testing time for each subject was 1(0.6) minute. Both the test-retest reliability and interrater reliability of warning signs reached 0.7 or above, indicating that the stability was good. In terms of validity assessment, there was remarkable consistency between ASQ and warning signs, with the Kappa value of 0.63. With the use of GESELL as criterion, it was determined that the sensitivity of warning signs in children with suspected developmental delay was 82.2%, and the specificity was 77.7%. The overall Youden index was 0.6. Conclusion: The reliability and validity of warning signs checklist for screening early childhood developmental problems have met the basic requirements of psychological screening scales, with the characteristics of short testing time and easy operation. Thus, this warning signs checklist can be used for screening psychological and behavioral problems of early childhood, especially in community settings.

Validation of the Chinese Version of the Quality of Nursing Work Life Scale

PubMed Central

Fu, Xia; Xu, Jiajia; Song, Li; Li, Hua; Wang, Jing; Wu, Xiaohua; Hu, Yani; Wei, Lijun; Gao, Lingling; Wang, Qiyi; Lin, Zhanyi; Huang, Huigen

2015-01-01

Quality of Nursing Work Life (QNWL) serves as a predictor of a nurse’s intent to leave and hospital nurse turnover. However, QNWL measurement tools that have been validated for use in China are lacking. The present study evaluated the construct validity of the QNWL scale in China. A cross-sectional study was conducted conveniently from June 2012 to January 2013 at five hospitals in Guangzhou, which employ 1938 nurses. The participants were asked to complete the QNWL scale and the World Health Organization Quality of Life abbreviated version (WHOQOL-BREF). A total of 1922 nurses provided the final data used for analyses. Sixty-five nurses from the first investigated division were re-measured two weeks later to assess the test-retest reliability of the scale. The internal consistency reliability of the QNWL scale was assessed using Cronbach’s α. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC). Criterion-relation validity was assessed using the correlation of the total scores of the QNWL and the WHOQOL-BREF. Construct validity was assessed with the following indices: χ2 statistics and degrees of freedom; relative mean square error of approximation (RMSEA); the Akaike information criterion (AIC); the consistent Akaike information criterion (CAIC); the goodness-of-fit index (GFI); the adjusted goodness of fit index; and the comparative fit index (CFI). The findings demonstrated high internal consistency (Cronbach’s α = 0.912) and test-retest reliability (interclass correlation coefficient = 0.74) for the QNWL scale. The chi-square test (χ2 = 13879.60, df [degree of freedom] = 813 P = 0.0001) was significant. The RMSEA value was 0.091, and AIC = 1806.00, CAIC = 7730.69, CFI = 0.93, and GFI = 0.74. The correlation coefficient between the QNWL total scores and the WHOQOL-BREF total scores was 0.605 (p<0.01). The QNWL scale was reliable and valid in Chinese-speaking nurses and could be used as a clinical and research instrument for measuring work-related factors among nurses in China. PMID:25950838
Reliability and validity of the Chinese CECA10 questionnaire for Chinese patients with condyloma acuminata

PubMed Central

Guo, Xinying; Wu, Xinjuan; Guo, Aimin; Zhao, Yanwei

2018-01-01

Abstract Condyloma acuminata (CA) is a sexually transmitted disease that affects quality of life (QOL). CECA10 is an English-language questionnaire for assessing QOL in patients with CA, but there is no equivalent in China. This study aimed to develop a validated and reliable Chinese version of CECA10. The Chinese CECA10 was developed from the English version by forward translation, back translation, comparison with the original, cultural adjustments, and a pre-test (5 patients). The Chinese CECA10 and EuroQol Five Dimensions Three Level Questionnaire (EQ-5D-3L) was administered to patients with CA. Content validity (item/scale content validity indexes, I-CVI/S-CVI), test–retest reliability (intraclass coefficient, ICC), internal consistency (Cronbach α), criterion validity (comparison with the Dermatology Life Quality Index, DLQL, using Spearman correlation analysis), construct validity (exploratory factor analysis), and discriminant validity (between subgroups based on number of warts, number of recurrences, or number of sites involved) were assessed. The Chinese CECA10 had good test–retest reliability (ICC = 0.98, P < .001), internal consistency (Cronbach α values of 0.88, 0.84, and 0.83 for the total questionnaire, psychological dimension, and sexual dimension, respectively), content validity (I-CVI = 1 for all items), and criterion validity (r = -0.50, P < .001). Exploratory factor analysis extracted 2 factors with a cumulative contribution of 61.75%; the factor loading with each item was >0.4. Discriminant validity was not high. The mean CECA10 and EQ-VAS scores of 211 patients with CA (28.19 ± 7.16 years; 139 males) were 34.56 ± 19.01 and 64.64 ± 19.28, respectively. The Chinese CECA10 has good reliability and validity for evaluating the QOL of Chinese patients with CA. PMID:29489693
from the Adolescents’ Perspective in Malaysia

PubMed

Mohd Zin, Faridah; Hillaluddin, Azlin Hilma; Mustaffa, Jamaludin

2017-05-01

Objective: This study aims to develop, validate and determine the reliability of an interactive multimedia strategy to prevent tobacco use among the young (TUPY-S) from an adolescents’ perspective. Methods: A descriptive study design was utilized. A modular instruction guideline by Russel (1974) was followed in the entire process, comprising a feasibility study, a review of existing modules, specification of the objectives, identification of the construct criterion items, learner analysis and entry behavior specification, establishment of the sequence instruction and media selection, a tryout with students and a field test. Result: Feasibility was agreed among the researchers and the school authorities. Culturally suitable rigorously developed tobacco use preventive strategies delivered using information technology (IT) are lacking in the literature. The objective of TUPY-S is to prevent tobacco use among adolescents living in Malaysia. Identified construct criterion items include knowledge, attitude, intention to use, self-efficacy, and refusal skill. The target population was early adolescents belonging to generation-Z. Content was developed from the adolescents’ perspective and delivered using IT in Malay language. Content validity, assessed by six experts in the field and module development, was good at 86%. The students’ tryout showed satisfactory face validity subjectively and objectively (85.5%) and high alpha Cronbach reliability (0.91). Conclusion: TUPY-S was confirmed to suit early adolescents of the current generation living in Malaysia. It demonstrated good content validity among the experts, satisfactory face validity and reliability among the target population. TUPY-S is ready to be evaluated for its effectiveness among early adolescents. Creative Commons Attribution License
A Rapid Assessment Tool for affirming good practice in midwifery education programming.

PubMed

Fullerton, Judith T; Johnson, Peter; Lobe, Erika; Myint, Khine Haymar; Aung, Nan Nan; Moe, Thida; Linn, Nay Aung

2016-03-01

to design a criterion-referenced assessment tool that could be used globally in a rapid assessment of good practices and bottlenecks in midwifery education programs. a standard tool development process was followed, to generate standards and reference criteria; followed by external review and field testing to document psychometric properties. review of standards and scoring criteria were conducted by stakeholders around the globe. Field testing of the tool was conducted in Myanmar. eleven of Myanmar׳s 22 midwifery education programs participated in the assessment. the clinimetric tool was demonstrated to have content validity and high inter-rater reliability in use. a globally validated tool, and accompanying user guide and handbook are now available for conducting rapid assessments of compliance with good practice criteria in midwifery education programming. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
A Model for Estimating the Reliability and Validity of Criterion-Referenced Measures.

ERIC Educational Resources Information Center

Edmonston, Leon P.; Randall, Robert S.

A decision model designed to determine the reliability and validity of criterion referenced measures (CRMs) is presented. General procedures which pertain to the model are discussed as to: Measures of relationship, Reliability, Validity (content, criterion-oriented, and construct validation), and Item Analysis. The decision model is presented in…
Discriminant Validity Assessment: Use of Fornell & Larcker criterion versus HTMT Criterion

NASA Astrophysics Data System (ADS)

Hamid, M. R. Ab; Sami, W.; Mohmad Sidek, M. H.

2017-09-01

Assessment of discriminant validity is a must in any research that involves latent variables for the prevention of multicollinearity issues. Fornell and Larcker criterion is the most widely used method for this purpose. However, a new method has emerged for establishing the discriminant validity assessment through heterotrait-monotrait (HTMT) ratio of correlations method. Therefore, this article presents the results of discriminant validity assessment using these methods. Data from previous study was used that involved 429 respondents for empirical validation of value-based excellence model in higher education institutions (HEI) in Malaysia. From the analysis, the convergent, divergent and discriminant validity were established and admissible using Fornell and Larcker criterion. However, the discriminant validity is an issue when employing the HTMT criterion. This shows that the latent variables under study faced the issue of multicollinearity and should be looked into for further details. This also implied that the HTMT criterion is a stringent measure that could detect the possible indiscriminant among the latent variables. In conclusion, the instrument which consisted of six latent variables was still lacking in terms of discriminant validity and should be explored further.
Development, Validation, and Fairness of a Biographical Data Questionnaire for the Air Traffic Control Specialist Occupation

DTIC Science & Technology

2012-12-01

Development and validation. ABA, BQ , and criterion data were extracted from AT- SAT concurrent, criterion- related validation database. Overall, 1,232...dependent on responses to the other instrument. 3 A subset of 260 controllers in the AT- SAT dataset had full and complete ABA, BQ , and criterion data (i.e... SAT cases with ABA, BQ , and criterion data (n=260) was very small, making fairness analyses with the validation sample impractical. However, the
Development and validation of the irritable bowel syndrome scale under the system of quality of life instruments for chronic diseases QLICD-IBS: combinations of classical test theory and generalizability theory.

PubMed

Lei, Pingguang; Lei, Guanghe; Tian, Jianjun; Zhou, Zengfen; Zhao, Miao; Wan, Chonghua

2014-10-01

This paper is aimed to develop the irritable bowel syndrome (IBS) scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-IBS) by the modular approach and validate it by both classical test theory and generalizability theory. The QLICD-IBS was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interview, and quantitative statistical procedures. One hundred twelve inpatients with IBS were used to provide the data measuring QOL three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability, and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t tests and also G studies and D studies of generalizability theory analysis. Multi-trait scaling analysis, correlation, and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. Test-retest reliability coefficients (Pearson r and intra-class correlation (ICC)) for the overall score and all domains were higher than 0.80; the internal consistency α for all domains at two measurements were higher than 0.70 except for the social domain (0.55 and 0.67, respectively). The overall score and scores for all domains/facets had statistically significant changes after treatments with moderate or higher effect size standardized response mean (SRM) ranging from 0.72 to 1.02 at domain levels. G coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-IBS has good validity, reliability, responsiveness, and some highlights and can be used as the quality of life instrument for patients with IBS.
Home Healthcare Nurses' Job Satisfaction Scale: refinement and psychometric testing.

PubMed

Ellenbecker, Carol H; Byleckie, James J

2005-10-01

This paper describes a study to further develop and test the psychometric properties of the Home Healthcare Nurses' Job Satisfaction Scale, including reliability and construct and criterion validity. Numerous scales have been developed to measure nurses' job satisfaction. Only one, the Home Healthcare Nurses' Job Satisfaction Scale, has been designed specifically to measure job satisfaction of home healthcare nurses. The Home Healthcare Nurses' Job Satisfaction Scale is based on a theoretical model that integrates the findings of empirical research related to job satisfaction. A convenience sample of 340 home healthcare nurses completed the Home Healthcare Nurses' Job Satisfaction Scale and the Mueller and McCloskey Satisfaction Scale, which was used to test criterion validity. Factor analysis was used for testing and refinement of the theory-based assignment of items to constructs. Reliability was assessed by Cronbach's alpha internal consistency reliability coefficients. The data were collected in 2003. Nine factors contributing to home healthcare nurses' job satisfaction emerged from the factor analysis and were strongly supported by the underlying theory. Factor loadings were all above 0.4. Cronbach's alpha coefficients for each of the nine subscales ranged from 0.64 to 0.83; the alpha for the global scale was 0.89. The correlations between the Home Healthcare Nurses' Job Satisfaction Scale and Mueller and McCloskey Satisfaction Scale was 0.79, indicating good criterion-related validity. The Home Healthcare Nurses' Job Satisfaction Scale has potential as a reliable and valid scale for measurement of job satisfaction of home healthcare nurses.
Evidence for the Criterion Validity and Clinical Utility of the Pathological Narcissism Inventory

ERIC Educational Resources Information Center

Thomas, Katherine M.; Wright, Aidan G. C.; Lukowitsky, Mark R.; Donnellan, M. Brent; Hopwood, Christopher J.

2012-01-01

In this study, the authors evaluated aspects of criterion validity and clinical utility of the grandiosity and vulnerability components of the Pathological Narcissism Inventory (PNI) using two undergraduate samples (N = 299 and 500). Criterion validity was assessed by evaluating the correlations of narcissistic grandiosity and narcissistic…
Validation of the preschool and primary school form of a questionnaire assessing parents' childrearing behavior.

PubMed

Meunier, Jean-Christophe; Roskam, Isabelle

2009-01-01

This study presents a validation of a scale that assesses parents' childrearing behavior toward young children. The scale was validated on 565 parents of 2- to 7-year-old children. The current results replicated the factor solution of the original scale designed for parents of school-aged children. The scale demonstrated good psychometric properties: moderate to high internal consistency, the expected relations with criterion variables (parental self-efficacy beliefs, child's behavior and personality), and discriminative properties according to the parents' gender and educational level, the child's age and gender, and the difference between referred and nonreferred children.
Psychometric properties and differential explanation of a short measure of effort-reward imbalance at work: a study of industrial workers in Germany.

PubMed

Li, Jian; Loerbroks, Adrian; Jarczok, Marc N; Schöllgen, Ina; Bosch, Jos A; Mauss, Daniel; Siegrist, Johannes; Fischer, Joachim E

2012-09-01

We test the psychometric properties of a short version of the Effort-Reward Imbalance (ERI) questionnaire in addition to testing an interaction term of this model's main components on health functioning. A self-administered survey was conducted in a sample of 2,738 industrial workers (77% men with mean age 41.6 years) from a large manufacturing company in Southern Germany. The internal consistency reliability, structural validity, and criterion validity were analyzed. Satisfactory internal consistencies of the three scales: "Effort", "reward", and "overcommitment", were obtained (Cronbach's alpha coefficients 0.77, 0.82, and 0.83, respectively). Confirmatory factor analysis showed a good model fit of the data with the theoretical structure (AGFI = 0.94, RMSEA = 0.060). Evidence of criterion validity was demonstrated. Importantly, a significant synergistic interaction effect of ERI and overcommitment on poor mental health functioning was observed (odds ratio 6.74 (95% CI 5.32-8.52); synergy index 1.78 (95% CI 1.25-2.55)). This short version of the ERI questionnaire is a reliable and valid tool for epidemiological research on occupational health. Copyright © 2012 Wiley Periodicals, Inc.
Psychometric examination and factorial validity of the Exercise Dependence Scale-Revised in Italian exercisers.

PubMed

Costa, Sebastiano; Cuzzocrea, Francesca; Hausenblas, Heather A; Larcan, Rosalba; Oliva, Patrizia

2012-12-01

Background and aims The purpose of this study was to verify the factorial structure, internal validity, reliability, and criterion validity of the 21-item Exercise Dependence Scale-Revised (EDS-R) in an Italian sample. Methods Italian voluntary (N = 519) users of gyms who had a history of regular exercise for over a year completed the EDS-R and measures of exercise frequency. Results and conclusions Confirmatory factor analyses demonstrated a good fit to the hypothesized 7-factor model, and adequate internal consistency for the scale was evidenced. Criterion validity was evidenced by significant correlations among all the subscale of the EDS and exercise frequency. Finally, individuals at risk for exercise dependence reported more exercise behavior compared to the nondependent-symptomatic and nondependent-asymptomatic groups. These results suggest that the seven subscales of the Italian version of the EDS are measuring the construct of exercise dependence as defined by the DSM-IV criteria for substance dependence and also confirm previous research using the EDS-R in other languages. More research is needed to examine the psychometric properties of the EDS-R in diverse populations with various research designs.
The Motivational Value Systems Questionnaire (MVSQ): Psychometric Analysis Using a Forced Choice Thurstonian IRT Model

PubMed Central

Merk, Josef; Schlotz, Wolff; Falter, Thomas

2017-01-01

This study presents a new measure of value systems, the Motivational Value Systems Questionnaire (MVSQ), which is based on a theory of value systems by psychologist Clare W. Graves. The purpose of the instrument is to help people identify their personal hierarchies of value systems and thus become more aware of what motivates and demotivates them in work-related contexts. The MVSQ is a forced-choice (FC) measure, making it quicker to complete and more difficult to intentionally distort, but also more difficult to assess its psychometric properties due to ipsativity of FC data compared to rating scales. To overcome limitations of ipsative data, a Thurstonian IRT (TIRT) model was fitted to the questionnaire data, based on a broad sample of N = 1,217 professionals and students. Comparison of normative (IRT) scale scores and ipsative scores suggested that MVSQ IRT scores are largely freed from restrictions due to ipsativity and thus allow interindividual comparison of scale scores. Empirical reliability was estimated using a sample-based simulation approach which showed acceptable and good estimates and, on average, slightly higher test-retest reliabilities. Further, validation studies provided evidence on both construct validity and criterion-related validity. Scale score correlations and associations of scores with both age and gender were largely in line with theoretically- and empirically-based expectations, and results of a multitrait-multimethod analysis supports convergent and discriminant construct validity. Criterion validity was assessed by examining the relation of value system preferences to departmental affiliation which revealed significant relations in line with prior hypothesizing. These findings demonstrate the good psychometric properties of the MVSQ and support its application in the assessment of value systems in work-related contexts. PMID:28979228
The Motivational Value Systems Questionnaire (MVSQ): Psychometric Analysis Using a Forced Choice Thurstonian IRT Model.

PubMed

Merk, Josef; Schlotz, Wolff; Falter, Thomas

2017-01-01

This study presents a new measure of value systems, the Motivational Value Systems Questionnaire (MVSQ), which is based on a theory of value systems by psychologist Clare W. Graves. The purpose of the instrument is to help people identify their personal hierarchies of value systems and thus become more aware of what motivates and demotivates them in work-related contexts. The MVSQ is a forced-choice (FC) measure, making it quicker to complete and more difficult to intentionally distort, but also more difficult to assess its psychometric properties due to ipsativity of FC data compared to rating scales. To overcome limitations of ipsative data, a Thurstonian IRT (TIRT) model was fitted to the questionnaire data, based on a broad sample of N = 1,217 professionals and students. Comparison of normative (IRT) scale scores and ipsative scores suggested that MVSQ IRT scores are largely freed from restrictions due to ipsativity and thus allow interindividual comparison of scale scores. Empirical reliability was estimated using a sample-based simulation approach which showed acceptable and good estimates and, on average, slightly higher test-retest reliabilities. Further, validation studies provided evidence on both construct validity and criterion-related validity. Scale score correlations and associations of scores with both age and gender were largely in line with theoretically- and empirically-based expectations, and results of a multitrait-multimethod analysis supports convergent and discriminant construct validity. Criterion validity was assessed by examining the relation of value system preferences to departmental affiliation which revealed significant relations in line with prior hypothesizing. These findings demonstrate the good psychometric properties of the MVSQ and support its application in the assessment of value systems in work-related contexts.
Internalized HIV Stigma and Disclosure Concerns: Development and Validation of Two Scales in Spanish-Speaking Populations.

PubMed

Hernansaiz-Garrido, Helena; Alonso-Tapia, Jesús

2017-01-01

Internalized stigma and disclosure concerns are key elements for the study of mental health in people living with HIV. Since no measures of these constructs were available for Spanish population, this study sought to develop such instruments, to analyze their reliability and validity and to provide a short version. A heterogeneous sample of 458 adults from different Spanish-speaking countries completed the HIV-Internalized Stigma Scale and the HIV-Disclosure Concerns Scale, along with the Hospital Anxiety and Depression Scale, Rosenberg's Self-esteem Scale and other socio-demographic variables. Reliability and correlation analyses, exploratory factor analyses, path analyses with latent variables, and ANOVAs were conducted to test the scales' psychometric properties. The scales showed good reliability in terms of internal consistency and temporal stability, as well as good sensitivity and factorial and criterion validity. The HIV-Internalized Stigma Scale and the HIV-Disclosure Concerns Scale are reliable and valid means to assess these variables in several contexts.
Validation of a short measure of effort-reward imbalance in the workplace: evidence from China.

PubMed

Li, Jian; Loerbroks, Adrian; Shang, Li; Wege, Natalia; Wahrendorf, Morten; Siegrist, Johannes

2012-01-01

Work stress is an emergent risk in occupational health in China, and its measurement is still a critical issue. The aim of this study was to examine the reliability and validity of a short version of the effort-reward imbalance (ERI) questionnaire in a sample of Chinese workers. A community-based survey was conducted in 1,916 subjects aged 30-65 years with paid employment (971 men and 945 women). Acceptable internal consistencies of the three scales, effort, reward and overcommitment, were obtained. Confirmatory factor analysis showed a good model fit of the data with the theoretical structure (goodness-of-fit index = 0.95). Evidence of criterion validity was demonstrated, as all three scales were independently associated with elevated odds ratios of both poor physical and mental health. Based on the findings of our study, this short version of the ERI questionnaire is considered to be a reliable and valid tool for measuring psychosocial work environment in Chinese working populations.
Examining the validity of self-reports on scales measuring students' strategic processing.

PubMed

Samuelstuen, Marit S; Bråten, Ivar

2007-06-01

Self-report inventories trying to measure strategic processing at a global level have been much used in both basic and applied research. However, the validity of global strategy scores is open to question because such inventories assess strategy perceptions outside the context of specific task performance. The primary aim was to examine the criterion-related and construct validity of the global strategy data obtained with the Cross-Curricular Competencies (CCC) scale. Additionally, we wanted to compare the validity of these data with the validity of data obtained with a task-specific self-report inventory focusing on the same types of strategies. The sample included 269 10th-grade students from 12 different junior high schools. Global strategy use as assessed with the CCC was compared with task-specific strategy use reported in three different reading situations. Moreover, relationships between scores on the CCC and scores on measures of text comprehension were examined and compared with relationships between scores on the task-specific strategy measure and the same comprehension measures. The comparison between the CCC strategy scores and the task-specific strategy scores suggested only modest criterion-related validity for the data obtained with the global strategy inventory. The CCC strategy scores were also not related to the text comprehension measures, indicating poor construct validity. In contrast, the task-specific strategy scores were positively related to the comprehension measures, indicating good construct validity. Attempts to measure strategic processing at a global level seem to have limited validity and utility.
Changing abilities vs. changing tasks: Examining validity degradation with test scores and college performance criteria both assessed longitudinally.

PubMed

Dahlke, Jeffrey A; Kostal, Jack W; Sackett, Paul R; Kuncel, Nathan R

2018-05-03

We explore potential explanations for validity degradation using a unique predictive validation data set containing up to four consecutive years of high school students' cognitive test scores and four complete years of those students' college grades. This data set permits analyses that disentangle the effects of predictor-score age and timing of criterion measurements on validity degradation. We investigate the extent to which validity degradation is explained by criterion dynamism versus the limited shelf-life of ability scores. We also explore whether validity degradation is attributable to fluctuations in criterion variability over time and/or GPA contamination from individual differences in course-taking patterns. Analyses of multiyear predictor data suggest that changes to the determinants of performance over time have much stronger effects on validity degradation than does the shelf-life of cognitive test scores. The age of predictor scores had only a modest relationship with criterion-related validity when the criterion measurement occasion was held constant. Practical implications and recommendations for future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
[Psychometric validation in Spanish of the Brazilian short version of the Primary Care Assessment Tools-users questionnaire for the evaluation of the orientation of health systems towards primary care].

PubMed

Vázquez Peña, Fernando; Harzheim, Erno; Terrasa, Sergio; Berra, Silvina

2017-02-01

To validate the Brazilian short version of the PCAT for adult patients in Spanish. Analysis of secondary data from studies made to validate the extended version of the PCAT questionnaire. City of Córdoba, Argentina. Primary health care. The sample consisted of 46% of parents, whose children were enrolled in secondary education in three institutes in the city of Cordoba, and the remaining 54% were adult users of the National University of Cordoba Health Insurance. Pearson's correlation coefficient comparing the extended and short versions. Goodness-of-fit indices in confirmatory factor analysis, composite reliability, average variance extracted, and Cronbach's alpha values, in order to assess the construct validity and the reliability of the short version. The values of Pearson's correlation coefficient between this short version and the long version were high .818 (P<.001), implying a very good criterion validity. The indicators of good global adjustment to the confirmatory factor analysis were good. The value of composite reliability was good (.802), but under the variance media extracted: .3306, since 3 variables had weak factorials loads. The Cronbach's alpha was acceptable (.85). The short version of the PCAT-users developed in Brazil showed an acceptable psychometric performance in Spanish as a quick assessment tool, in a comparative study with the extended version. Copyright © 2016 Elsevier España, S.L.U. All rights reserved.

Assessing the criterion validity of four highly abbreviated measures from the Minimal Assessment of Cognitive Function in Multiple Sclerosis (MACFIMS).

PubMed

Gromisch, Elizabeth S; Zemon, Vance; Holtzer, Roee; Chiaravalloti, Nancy D; DeLuca, John; Beier, Meghan; Farrell, Eileen; Snyder, Stacey; Schairer, Laura C; Glukhovsky, Lisa; Botvinick, Jason; Sloan, Jessica; Picone, Mary Ann; Kim, Sonya; Foley, Frederick W

2016-10-01

Cognitive dysfunction is prevalent in multiple sclerosis. As self-reported cognitive functioning is unreliable, brief objective screening measures are needed. Utilizing widely used full-length neuropsychological tests, this study aimed to establish the criterion validity of highly abbreviated versions of the Brief Visuospatial Memory Test - Revised (BVMT-R), Symbol Digit Modalities Test (SDMT), Delis-Kaplan Executive Function System (D-KEFS) Sorting Test, and Controlled Oral Word Association Test (COWAT) in order to begin developing an MS-specific screening battery. Participants from Holy Name Medical Center and the Kessler Foundation were administered one or more of these four measures. Using test-specific criterion to identify impairment at both -1.5 and -2.0 SD, receiver-operating-characteristic (ROC) analyses of BVMT-R Trial 1, Trial 2, and Trial 1 + 2 raw data (N = 286) were run to calculate the classification accuracy of the abbreviated version, as well as the sensitivity and specificity. The same methods were used for SDMT 30-s and 60-s (N = 321), D-KEFS Sorting Free Card Sort 1 (N = 120), and COWAT letters F and A (N = 298). Using these definitions of impairment, each analysis yielded high classification accuracy (89.3 to 94.3%). BVMT-R Trial 1, SDMT 30-s, D-KEFS Free Card Sort 1, and COWAT F possess good criterion validity in detecting impairment on their respective overall measure, capturing much of the same information as the full version. Along with the first two trials of the California Verbal Learning Test - Second Edition (CVLT-II), these five highly abbreviated measures may be used to develop a brief screening battery.
Substance versus style: a new look at social desirability in motivating contexts.

PubMed

Smith, D Brent; Ellingson, Jill E

2002-04-01

Although there is an emerging consensus that social desirability does not meaningfully affect criterion-related validity, several researchers have reaffirmed the argument that social desirability degrades the construct validity of personality measures. Yet, most research demonstrating the adverse consequences of faking for construct validity uses a fake-good instruction set. The consequence of such a manipulation is to exacerbate the effects of response distortion beyond what would be expected under realistic circumstances (e.g., an applicant setting). The research reported in this article was designed to assess these issues by using real-world contexts not influenced by artificial instructions. Results suggest that response distortion has little impact on the construct validity of personality measures used in selection contexts.
Development, pilot testing and psychometric validation of a short version of the coronary artery disease education questionnaire: The CADE-Q SV.

PubMed

Ghisi, Gabriela Lima de Melo; Sandison, Nicole; Oh, Paul

2016-03-01

To develop, pilot test and psychometrically validate a shorter version of the coronary artery disease education questionnaire (CADE-Q), called CADE-Q SV. Based on previous versions of the CADE-Q, cardiac rehabilitation (CR) experts developed 20 items divided into 5 knowledge domains to comprise the first version of the CADE-Q SV. To establish content validity, they were reviewed by an expert panel (N=12). Refined items were pilot-tested in 20 patients, in which clarity was provided. A final version was generated and psychometrically-tested in 132CR patients. Test-retest reliability was assessed via the intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and criterion validity with regard to patients' education and duration in CR. All ICC coefficients meet the minimum recommended standard. All domains were considered internally consistent (α>0.7). Criterion validity was supported by significant differences in mean scores by educational level (p<0.01) and duration in CR (p<0.05). Knowledge about exercise and nutrition was higher than knowledge about medical condition. The CADE-Q SV was demonstrated to have good reliability and validity. This is a short, quick and appropriate tool for application in clinical and research settings, assessing patients' knowledge during CR and as part of education programming. Copyright © 2015. Published by Elsevier Ireland Ltd.
Development and psychometric validation of a scale to assess information needs in cardiac rehabilitation: the INCR Tool.

PubMed

Ghisi, Gabriela Lima de Melo; Grace, Sherry L; Thomas, Scott; Evans, Michael F; Oh, Paul

2013-06-01

To develop and psychometrically validate a tool to assess information needs in cardiac rehabilitation (CR) patients. After a literature search, 60 information items divided into 11 areas of needs were identified. To establish content validity, they were reviewed by an expert panel (N=10). Refined items were pilot-tested in 34 patients on a 5-point Likert-scale from 1 "really not helpful" to 5 "very important". A final version was generated and psychometrically tested in 203 CR patients. Test-retest reliability was assessed via the intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and criterion validity was assessed with regard to patient's education and duration in CR. Five items were excluded after ICC analysis as well as one area of needs. All 10 areas were considered internally consistent (Cronbach's alpha>0.7). Criterion validity was supported by significant differences in mean scores by educational level (p<0.05) and duration in CR (p<0.001). The mean total score was 4.08 ± 0.53. Patients rated safety as their greatest information need. The INCR Tool was demonstrated to have good reliability and validity. This is an appropriate tool for application in clinical and research settings, assessing patients' needs during CR and as part of education programming. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
The Psychometric Parameters of the Farsi Form of the Arabic Scale of Death Anxiety

PubMed Central

Abdel-Khalek, Ahmed M.; Lester, David

2017-01-01

The aim of this study was to describe the psychometric properties of the Farsi Form of the Arabic Scale of Death Anxiety (ASDA). The original scale was first translated into Farsi by language experts using the back translation procedure and then administered to a total of 252 Iranian college students and 52 psychiatric outpatients from psychiatric and psychological clinics. The one-week test-retest reliability of the Farsi version in a sample of college students was 0.78, indicating good temporal stability and corroborating the trait-like nature of scores. Cronbach's α was 0.90 for the college students and 0.92 for the psychiatric outpatients, indicating high internal consistency. Scale scores correlated 0.46 with Death Obsession Scale scores, 0.56 with Death Depression Scale scores, 0.41 with Death Anxiety Scale scores, and 0.40 with Wish to be Dead Scale scores, indicating good construct and criterion-related validity. A principal component analysis with a Varimax rotation yielded four factors in the sample of Iranian college students, indicating a lack of homogeneity in the content of the scale. Male students obtained a significant higher mean score than did females. It was concluded that the Farsi ASDA had good internal consistency, temporal stability, criterion-related validity, and a factor structure reflecting important features of death anxiety. In general, the Farsi ASDA could be recommended for use in research on death anxiety among Iranian college students and psychiatric outpatients. PMID:28698887
The Psychometric Parameters of the Farsi Form of the Arabic Scale of Death Anxiety.

PubMed

Dadfar, Mahboubeh; Abdel-Khalek, Ahmed M; Lester, David; Atef Vahid, Mohammad Kazem

2017-01-01

The aim of this study was to describe the psychometric properties of the Farsi Form of the Arabic Scale of Death Anxiety (ASDA). The original scale was first translated into Farsi by language experts using the back translation procedure and then administered to a total of 252 Iranian college students and 52 psychiatric outpatients from psychiatric and psychological clinics. The one-week test-retest reliability of the Farsi version in a sample of college students was 0.78, indicating good temporal stability and corroborating the trait-like nature of scores. Cronbach's α was 0.90 for the college students and 0.92 for the psychiatric outpatients, indicating high internal consistency. Scale scores correlated 0.46 with Death Obsession Scale scores, 0.56 with Death Depression Scale scores, 0.41 with Death Anxiety Scale scores, and 0.40 with Wish to be Dead Scale scores, indicating good construct and criterion-related validity. A principal component analysis with a Varimax rotation yielded four factors in the sample of Iranian college students, indicating a lack of homogeneity in the content of the scale. Male students obtained a significant higher mean score than did females. It was concluded that the Farsi ASDA had good internal consistency, temporal stability, criterion-related validity, and a factor structure reflecting important features of death anxiety. In general, the Farsi ASDA could be recommended for use in research on death anxiety among Iranian college students and psychiatric outpatients.
The French-Canadian validation of a disease-specific, patient-reported outcome measure for lupus.

PubMed

Bourré-Tessier, J; Clarke, A E; Kosinski, M; Mikolaitis-Preuss, R A; Bernatsky, S; Block, J A; Jolly, M

2014-12-01

The objective of this paper is to perform the cross-cultural validation of the French version of the LupusPRO, a disease-targeted patient-reported outcome measure, among systemic lupus erythematosus (SLE) patients in Canada. The French version of the LupusPRO and the MOS SF-36 were administered; demographic, clinical and serological characteristics were obtained. Disease activity (SELENA-SLEDAI and the Lupus Foundation of America definition of flare) and damage (SLICC/ACR SDI) were assessed. Physician disease activity and damage assessments were ascertained using visual analog scales. Internal consistency reliability (ICR), test-retest reliability (TRT), convergent and discriminant validity (against corresponding domains of the SF-36), criterion validity (against disease activity, damage or health status) and known group validity were tested. A total of 99 French-Canadian SLE patients participated (97% women, mean (SD) age 45.2 (14.5) years). The median (IQR) SELENA-SLEDAI and SDI were 3.5 (6.0) and 1.0 (2.0), respectively. The ICR of the LupusPRO domains ranged from 0.81 to 0.93 (except for lupus symptoms, procreation and coping), while TRT ranged from 0.72 to 0.95. Convergent and discriminant validity, criterion validity and known group validity against disease activity, damage and health status measures were observed. Confirmatory factor analysis showed a good fit. The LupusPRO has fair psychometric properties among French-Canadian patients with SLE. © The Author(s) 2014 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
Development and psychometric testing of the Protective Reasons Against Suicide Inventory for assessing older Chinese-speaking outpatients in primary care settings.

PubMed

Wang, Yi-Wen; Tsai, Yun-Fang; Lee, Shwu-Hua; Chen, Ying-Jen; Chen, Hsiu-Fang

2016-07-01

To develop and psychometrically test the Protective Reasons against Suicide Inventory among older Chinese-speaking outpatients. Tools currently exist to test reasons for living among individuals of all ages in western countries, but few are available to assess older adults' protective reasons against suicide in Asia. A cross-sectional survey to investigate protective reasons against suicide among older Chinese-speaking outpatients. The Protective Reasons against Suicide Inventory was developed based on individual interviews with 83 older outpatients in Taiwan, the literature and the authors' clinical experiences. The resulting Inventory was examined in 2013 for content validity, face validity, construct validity, criterion-related validity, internal consistency reliability and test-retest reliability. The Inventory had excellent content validity and face validity. Factor analysis yielded a seven-factor solution, accounting for 87·7% of the variance. Scores on the global Inventory and its subscales tended to be higher in outpatients diagnosed without suicidal ideation than in outpatients diagnosed with suicidal ideation, indicating good criterion validity. Inventory reliability and the intraclass correlation coefficient were satisfactory. The Protective Reasons against Suicide Inventory can be completed in 5 minutes and is perceived as easy to complete. Moreover, the Inventory yielded highly acceptable parameters for validity and reliability. The Protective Reasons against Suicide Inventory can be used to assess older Chinese-speaking outpatients for factors that protect them from attempting suicide. © 2016 John Wiley & Sons Ltd.
The Italian version of the Depression Anxiety Stress Scales-21: Factor structure and psychometric properties on community and clinical samples.

PubMed

Bottesi, Gioia; Ghisi, Marta; Altoè, Gianmarco; Conforti, Erica; Melli, Gabriele; Sica, Claudio

2015-07-01

The Depression Anxiety Stress Scales-21 (DASS-21) is the short version of a self-report measure that was originally developed to provide maximum differentiation between depressive and anxious symptoms. Despite encouraging evidence, the factor structure and other features of the DASS-21 are yet to be firmly established. A community sample of 417 participants and two clinical groups (32 depressive patients and 25 anxious patients) completed the Italian version of the DASS-21 along with several measures of psychopathology. Confirmatory factor analyses suggested that the DASS-21 is a measure of general distress plus three additional orthogonal dimensions (anxiety, depression, and stress). The internal consistency and temporal stability of the measure were good; each DASS-21 scale correlated more strongly with a measure of a similar construct, demonstrating good convergent and divergent validity. Lastly, the DASS-21 demonstrated good criterion-oriented validity. The validity of the Italian DASS-21 and its utility, both for community and clinical individuals, are supported. Copyright © 2015 Elsevier Inc. All rights reserved.
Financial decision-making abilities and financial exploitation in older African Americans: Preliminary validity evidence for the Lichtenberg Financial Decision Rating Scale (LFDRS).

PubMed

Lichtenberg, Peter A; Ficker, Lisa J; Rahman-Filipiak, Annalise

2016-01-01

This study examines preliminary evidence for the Lichtenberg Financial Decision Rating Scale (LFDRS), a new person-centered approach to assessing capacity to make financial decisions, and its relationship to self-reported cases of financial exploitation in 69 older African Americans. More than one third of individuals reporting financial exploitation also had questionable decisional abilities. Overall, decisional ability score and current decision total were significantly associated with cognitive screening test and financial ability scores, demonstrating good criterion validity. Study findings suggest that impaired decisional abilities may render older adults more vulnerable to financial exploitation, and that the LFDRS is a valid tool.
Cross-cultural adaptation and validation of the French version of the Expanded Prostate cancer Index Composite questionnaire for health-related quality of life in prostate cancer patients.

PubMed

Anota, Amélie; Mariet, Anne-Sophie; Maingon, Philippe; Joly, Florence; Bosset, Jean-François; Guizard, Anne-Valérie; Bittard, Hugues; Velten, Michel; Mercier, Mariette

2016-12-06

Health-related quality of life (HRQoL) has been positioned as one of the major endpoints in oncology. Thus, there is a need to validate cancer-site specific survey instruments. This study aimed to perform a transcultural adaptation of the 50-item Expanded Prostate cancer Index Composite (EPIC) questionnaire for HRQoL in prostate cancer patients and to validate the psychometric properties of the French-language version. The EPIC questionnaire measures urinary, bowel, sexual and hormonal domains. The first step, corresponding to transcultural adaptation of the original English version of the EPIC was performed according to the back translation technique. The second step, comprising the validation of the psychometric properties of the EPIC questionnaire, was performed in patients under treatment for localized prostate cancer (treatment group) and in patients cured of prostate cancer (cured group). The EORTC QLQ-C30 and QLQ-PR25 prostate cancer module were also completed by patients to assess criterion validity. Two assessments were performed, i.e., before and at the end of treatment for the Treatment group, to assess sensitivity to change; and at 2 weeks' interval in the Cured group to assess test-retest reliability. Psychometric properties were explored according to classical test theory. The first step showed overall good acceptability and understanding of the questionnaire. In the second step, 215 patients were included from January 2012 to June 2014: 125 in the Treatment group, and 90 in the Cured group. All domains exhibited good internal consistency, except the bowel domain (Cronbach's α = 0.61). No floor effect was observed. Test-retest reliability assessed in the cured group was acceptable, expect for bowel function (intraclass coefficient = 0.68). Criterion validity was good for each domain and subscale. Construct validity was not demonstrated for the hormonal and bowel domains. Sensitivity to change was exhibited for 5/8 subscales and 2/4 summary scores for patients who experienced toxicities during treatment. The French EPIC questionnaire seems to have adequate psychometric properties, comparable to those exhibited by the original English-language version, except for the construct validity, which was not available in original version.
INCLEN Diagnostic Tool for Autism Spectrum Disorder (INDT-ASD): development and validation.

PubMed

Juneja, Monica; Mishra, Devendra; Russell, Paul S S; Gulati, Sheffali; Deshmukh, Vaishali; Tudu, Poma; Sagar, Rajesh; Silberberg, Donald; Bhutani, Vinod K; Pinto, Jennifer M; Durkin, Maureen; Pandey, Ravindra M; Nair, M K C; Arora, Narendra K

2014-05-01

To develop and validate INCLEN Diagnostic Tool for Autism Spectrum Disorder (INDT-ASD). Diagnostic test evaluation by cross sectional design. Four tertiary pediatric neurology centers in Delhi and Thiruvanthapuram, India. Children aged 2-9 years were enrolled in the study. INDT-ASD and Childhood Autism Rating Scale (CARS) were administered in a randomly decided sequence by trained psychologist, followed by an expert evaluation by DSM-IV TR diagnostic criteria (gold standard). Psychometric parameters of diagnostic accuracy, validity (construct, criterion and convergent) and internal consistency. 154 children (110 boys, mean age 64.2 mo) were enrolled. The overall diagnostic accuracy (AUC=0.97, 95% CI 0.93, 0.99; P<0.001) and validity (sensitivity 98%, specificity 95%, positive predictive value 91%, negative predictive value 99%) of INDT-ASD for Autism spectrum disorder were high, taking expert diagnosis using DSM-IV-TR as gold standard. The concordance rate between the INDT-ASD and expert diagnosis for 'ASD group' was 82.52% [Cohen's k=0.89; 95% CI (0.82, 0.97); P=0.001]. The internal consistency of INDT-ASD was 0.96. The convergent validity with CARS (r = 0.73, P= 0.001) and divergent validity with Binet-Kamat Test of intelligence (r = -0.37; P=0.004) were significantly high. INDT-ASD has a 4-factor structure explaining 85.3% of the variance. INDT-ASD has high diagnostic accuracy, adequate content validity, good internal consistency high criterion validity and high to moderate convergent validity and 4-factor construct validity for diagnosis of Autistm spectrum disorder.
Nursing Intensive-Care Satisfaction Scale [NICSS]: Development and validation of a patient-centred instrument.

PubMed

Romero-García, Marta; de la Cueva-Ariza, Laura; Benito-Aracil, Llucia; Lluch-Canut, Teresa; Trujols-Albet, Joan; Martínez-Momblan, Maria Antonia; Juvé-Udina, Maria-Eulàlia; Delgado-Hito, Pilar

2018-06-01

The aim of this study was to develop and validate the Nursing Intensive-Care Satisfaction Scale to measures satisfaction with nursing care from the critical care patient's perspective. Instruments that measure satisfaction with nursing cares have been designed and validated without taking the patient's perspective into consideration. Despite the benefits and advances in measuring satisfaction with nursing care, none instrument is specifically designed to assess satisfaction in intensive care units. Instrument development. The population were all discharged patients (January 2013 - January 2015) from three Intensive Care Units of a third level hospital (N = 200). All assessment instruments were given to discharged patients and 48 hours later, to analyse the temporal stability, only the questionnaire was given again. The validation process of the scale included the analysis of internal consistency, temporal stability; validity of construct through a confirmatory factor analysis; and criterion validity. Reliability was 0.95. The intraclass correlation coefficient for the total scale was 0.83 indicating a good temporal stability. Construct validity showed an acceptable fit and factorial structure with four factors, in accordance with the theoretical model, being Consequences factor the best correlated with other factors. Criterion validity, presented a correlation between low and high (range: 0.42-0.68). The scale has been designed and validated incorporating the perspective of critical care patients. Thanks to its reliability and validity, this questionnaire can be used both in research and in clinical practice. The scale offers a possibility to assess and develop interventions to improve patient satisfaction with nursing care. © 2018 John Wiley & Sons Ltd.
Validity, Responsiveness, Minimal Detectable Change, and Minimal Clinically Important Change of the Pediatric Motor Activity Log in Children with Cerebral Palsy

ERIC Educational Resources Information Center

Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling

2012-01-01

This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and…
The Cognitive Abilities Scale--Second Edition Preschool Form: Studies of Concurrent Criterion-Related, Construct, and Predictive Criterion-Related Validity

ERIC Educational Resources Information Center

Swanson, Jennifer R.; Bradley-Johnson, Sharon; Johnson, C. Merle; O'Dell, Anna Rubenaker

2009-01-01

Three studies examine the validity of the Preschool Form of the Cognitive Abilities Scale--Second Edition (CAS-2). Significant high concurrent criterion-related validity correlations, corrected for restricted range, are found between the CAS-2 and the Detroit Test of Learning Ability--Primary: Third Edition for 26 three-year-olds (r[subscript c] =…
Psychometric Validation of the Academic Motivation Scale in a Dental Student Sample.

PubMed

Orsini, Cesar; Binnie, Vivian; Evans, Phillip; Ledezma, Priscilla; Fuentes, Fernando; Villegas, Maria J

2015-08-01

The Academic Motivation Scale is one of the most frequently used instruments to assess academic motivation. It relies on the self-determination theory of human motivation. However, motivation has been understudied in dental education. Therefore, to address the lack of valid instruments to assess academic motivation in dental education and contribute to future research in the field, the aim of this study was to analyze the psychometric properties of this instrument in a sample of dental students. Participants were 989 Chilean undergraduate dental students (86% response rate) who completed a survey containing a Chilean face-valid version of the Spanish Academic Motivation Scale and three other motivation-related instruments to assess the survey's construct and criterion validity. Later, 76 of the students (out of 100 invited) took the survey again to assess its test-retest stability. The instrument's construct validity was supported by the superior goodness of fit of the seven-subscale Academic Motivation Scale over competing models through confirmatory factor analysis and by the expected correlations among its subscales. The concurrent criterion validity was supported by the confirmation of correlations between its subscales and external criteria. Adequate internal consistency and test-retest correlations were also found. The evidence from this study suggests that the Academic Motivation Scale is a preliminarily valid and reliable instrument to assess motivation in the predoctoral dental context. Future research in this area is needed to confirm or refute these results.
Implementation and validation of collapsed cone superposition for radiopharmaceutical dosimetry of photon emitters

NASA Astrophysics Data System (ADS)

Sanchez-Garcia, Manuel; Gardin, Isabelle; Lebtahi, Rachida; Dieudonné, Arnaud

2015-10-01

Two collapsed cone (CC) superposition algorithms have been implemented for radiopharmaceutical dosimetry of photon emitters. The straight CC (SCC) superposition method uses a water energy deposition kernel (EDKw) for each electron, positron and photon components, while the primary and scatter CC (PSCC) superposition method uses different EDKw for primary and once-scattered photons. PSCC was implemented only for photons originating from the nucleus, precluding its application to positron emitters. EDKw are linearly scaled by radiological distance, taking into account tissue density heterogeneities. The implementation was tested on 100, 300 and 600 keV mono-energetic photons and 18F, 99mTc, 131I and 177Lu. The kernels were generated using the Monte Carlo codes MCNP and EGSnrc. The validation was performed on 6 phantoms representing interfaces between soft-tissues, lung and bone. The figures of merit were γ (3%, 3 mm) and γ (5%, 5 mm) criterions corresponding to the computation comparison on 80 absorbed doses (AD) points per phantom between Monte Carlo simulations and CC algorithms. PSCC gave better results than SCC for the lowest photon energy (100 keV). For the 3 isotopes computed with PSCC, the percentage of AD points satisfying the γ (5%, 5 mm) criterion was always over 99%. A still good but worse result was found with SCC, since at least 97% of AD-values verified the γ (5%, 5 mm) criterion, except a value of 57% for the 99mTc with the lung/bone interface. The CC superposition method for radiopharmaceutical dosimetry is a good alternative to Monte Carlo simulations while reducing computation complexity.
Implementation and validation of collapsed cone superposition for radiopharmaceutical dosimetry of photon emitters.

PubMed

Sanchez-Garcia, Manuel; Gardin, Isabelle; Lebtahi, Rachida; Dieudonné, Arnaud

2015-10-21

Two collapsed cone (CC) superposition algorithms have been implemented for radiopharmaceutical dosimetry of photon emitters. The straight CC (SCC) superposition method uses a water energy deposition kernel (EDKw) for each electron, positron and photon components, while the primary and scatter CC (PSCC) superposition method uses different EDKw for primary and once-scattered photons. PSCC was implemented only for photons originating from the nucleus, precluding its application to positron emitters. EDKw are linearly scaled by radiological distance, taking into account tissue density heterogeneities. The implementation was tested on 100, 300 and 600 keV mono-energetic photons and (18)F, (99m)Tc, (131)I and (177)Lu. The kernels were generated using the Monte Carlo codes MCNP and EGSnrc. The validation was performed on 6 phantoms representing interfaces between soft-tissues, lung and bone. The figures of merit were γ (3%, 3 mm) and γ (5%, 5 mm) criterions corresponding to the computation comparison on 80 absorbed doses (AD) points per phantom between Monte Carlo simulations and CC algorithms. PSCC gave better results than SCC for the lowest photon energy (100 keV). For the 3 isotopes computed with PSCC, the percentage of AD points satisfying the γ (5%, 5 mm) criterion was always over 99%. A still good but worse result was found with SCC, since at least 97% of AD-values verified the γ (5%, 5 mm) criterion, except a value of 57% for the (99m)Tc with the lung/bone interface. The CC superposition method for radiopharmaceutical dosimetry is a good alternative to Monte Carlo simulations while reducing computation complexity.
An Independent Psychometric Evaluation of the PROMS Measure of Music Perception Skills.

PubMed

Kunert, Richard; Willems, Roel M; Hagoort, Peter

2016-01-01

The Profile of Music Perception Skills (PROMS) is a recently developed measure of perceptual music skills which has been shown to have promising psychometric properties. In this paper we extend the evaluation of its brief version to three kinds of validity using an individual difference approach. The brief PROMS displays good discriminant validity with working memory, given that it does not correlate with backward digit span (r = .04). Moreover, it shows promising criterion validity (association with musical training (r = .45), musicianship status (r = .48), and self-rated musical talent (r = .51)). Finally, its convergent validity, i.e. relation to an unrelated measure of music perception skills, was assessed by correlating the brief PROMS to harmonic closure judgment accuracy. Two independent samples point to good convergent validity of the brief PROMS (r = .36; r = .40). The same association is still significant in one of the samples when including self-reported music skill in a partial correlation (rpartial = .30; rpartial = .17). Overall, the results show that the brief version of the PROMS displays a very good pattern of construct validity. Especially its tuning subtest stands out as a valuable part for music skill evaluations in Western samples. We conclude by briefly discussing the choice faced by music cognition researchers between different musical aptitude measures of which the brief PROMS is a well evaluated example.
An Independent Psychometric Evaluation of the PROMS Measure of Music Perception Skills

PubMed Central

Willems, Roel M.; Hagoort, Peter

2016-01-01

The Profile of Music Perception Skills (PROMS) is a recently developed measure of perceptual music skills which has been shown to have promising psychometric properties. In this paper we extend the evaluation of its brief version to three kinds of validity using an individual difference approach. The brief PROMS displays good discriminant validity with working memory, given that it does not correlate with backward digit span (r = .04). Moreover, it shows promising criterion validity (association with musical training (r = .45), musicianship status (r = .48), and self-rated musical talent (r = .51)). Finally, its convergent validity, i.e. relation to an unrelated measure of music perception skills, was assessed by correlating the brief PROMS to harmonic closure judgment accuracy. Two independent samples point to good convergent validity of the brief PROMS (r = .36; r = .40). The same association is still significant in one of the samples when including self-reported music skill in a partial correlation (rpartial = .30; rpartial = .17). Overall, the results show that the brief version of the PROMS displays a very good pattern of construct validity. Especially its tuning subtest stands out as a valuable part for music skill evaluations in Western samples. We conclude by briefly discussing the choice faced by music cognition researchers between different musical aptitude measures of which the brief PROMS is a well evaluated example. PMID:27398805

Investigation of limit state criteria for amorphous metals

NASA Astrophysics Data System (ADS)

Comanici, A. M.; Sandovici, A.; Barsanescu, P. D.

2016-08-01

The name of amorphous metals is assigned to metals that have a non-crystalline structure, but they are also very similar to glass if we look into their properties. A very distinguished feature is the fact that amorphous metals, also known as metallic glasses, show a good electrical conductivity. The extension of the limit state criteria for different materials makes this type of alloy a choice to validate the new criterions. Using a new criterion developed for biaxial and triaxial state of stress, the results are investigated in order to determine the applicability of the mathematical model for these amorphous metals. Especially for brittle materials, it is extremely important to find suitable fracture criterion. Mohr-Coulomb criterion, which is permitting a linear failure envelope, is often used for very brittle materials. But for metallic glasses this criterion is not consistent with the experimental determinations. For metallic glasses, and other high-strength materials, Rui Tao Qu and Zhe Feng Zhang proposed a failure envelope modeling with an ellipse in σ-τ coordinates. In this paper this model is being developed for principal stresses space. It is also proposed a method for transforming σ-τ coordinates in principal stresses coordinates and the theoretical results are consistent with the experimental ones.
Construct Validation of a Multidimensional Computerized Adaptive Test for Fatigue in Rheumatoid Arthritis

PubMed Central

Nikolaus, Stephanie; Bode, Christina; Taal, Erik; Vonkeman, Harald E.; Glas, Cees A. W.; van de Laar, Mart A. F. J.

2015-01-01

Objective Multidimensional computerized adaptive testing enables precise measurements of patient-reported outcomes at an individual level across different dimensions. This study examined the construct validity of a multidimensional computerized adaptive test (CAT) for fatigue in rheumatoid arthritis (RA). Methods The ‘CAT Fatigue RA’ was constructed based on a previously calibrated item bank. It contains 196 items and three dimensions: ‘severity’, ‘impact’ and ‘variability’ of fatigue. The CAT was administered to 166 patients with RA. They also completed a traditional, multidimensional fatigue questionnaire (BRAF-MDQ) and the SF-36 in order to examine the CAT’s construct validity. A priori criterion for construct validity was that 75% of the correlations between the CAT dimensions and the subscales of the other questionnaires were as expected. Furthermore, comprehensive use of the item bank, measurement precision and score distribution were investigated. Results The a priori criterion for construct validity was supported for two of the three CAT dimensions (severity and impact but not for variability). For severity and impact, 87% of the correlations with the subscales of the well-established questionnaires were as expected but for variability, 53% of the hypothesised relations were found. Eighty-nine percent of the items were selected between one and 137 times for CAT administrations. Measurement precision was excellent for the severity and impact dimensions, with more than 90% of the CAT administrations reaching a standard error below 0.32. The variability dimension showed good measurement precision with 90% of the CAT administrations reaching a standard error below 0.44. No floor- or ceiling-effects were found for the three dimensions. Conclusion The CAT Fatigue RA showed good construct validity and excellent measurement precision on the dimensions severity and impact. The dimension variability had less ideal measurement characteristics, pointing to the need to recalibrate the CAT item bank with a two-dimensional model, solely consisting of severity and impact. PMID:26710104
Development and validation of the coronary heart disease scale under the system of quality of life instruments for chronic diseases QLICD-CHD: combinations of classical test theory and Generalizability Theory.

PubMed

Wan, Chonghua; Li, Hezhan; Fan, Xuejin; Yang, Ruixue; Pan, Jiahua; Chen, Wenru; Zhao, Rong

2014-06-04

Quality of life (QOL) for patients with coronary heart disease (CHD) is now concerned worldwide with the specific instruments being seldom and no one developed by the modular approach. This paper is aimed to develop the CHD scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-CHD) by the modular approach and validate it by both classical test theory and Generalizability Theory. The QLICD-CHD was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interview, pre-testing and quantitative statistical procedures. 146 inpatients with CHD were used to provide the data measuring QOL three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t-tests and also G studies and D studies of Genralizability Theory analysis. Multi-trait scaling analysis, correlation and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. The internal consistency α and test-retest reliability coefficients (Pearson r and Intra-class correlations ICC) for the overall instrument and all domains were higher than 0.70 and 0.80 respectively; The overall and all domains except for social domain had statistically significant changes after treatments with moderate effect size SRM (standardized response mea) ranging from 0.32 to 0.67. G-coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-CHD has good validity, reliability, and moderate responsiveness and some highlights, and can be used as the quality of life instrument for patients with CHD. However, in order to obtain better reliability, the numbers of items for social domain should be increased or the items' quality, not quantity, should be improved.
[Spanish version of the Satisfaction With Decision scale: cross-cultural adaptation, validity and reliability].

PubMed

Chabrera, Carolina; Areal, Joan; Font, Albert; Caro, Mónica; Bonet, Marta; Zabalegui, Adelaida

2015-01-01

The aim of this study is to develop a Spanish version of the Satisfaction With Decision scale (SWDs) and analyse the psychometric properties of validity and reliability. An observational, descriptive study and validation of a tool to measure satisfaction with the decision. Urology, Radiation oncology, and Medical oncology Departments of the Hospital Universitari Germans Trias i Pujol, Institut Català d'Oncologia and the Institut Oncològic del Vallès - Hospital General de Catalunya. A total of 170 participants diagnosed with prostate cancer, and who could read and write in Spanish and gave their informed consent. A translation, back-translation and cross-cultural adaptation to Spanish was performed on the SWDs. The content validity, criterion validity, construct validity and reliability (internal consistency and stability) of the Spanish version were evaluated. The SWDs contains 6 items with 5-item Likert scales. A Spanish version (ESD) was obtained that was linguistically and conceptually equivalent to the original version. Criterion validity, the ESD correlated with "satisfaction with the decision" using a linear analogue scale, was significant (r=0.63, P<.01) for all items. The factorial analysis showed a unique dimension to explain 82.08% of the variance. The ESD showed excellent results in terms of internal consistency (Cronbach alpha=0.95) and good test-retest reliability with intraclass correlation coefficient of 0.711. The ESD is a validated Spanish scale to measure the satisfaction with the decisions taken in health, and demonstrates a correct validity and reliability. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Development and testing of the cancer multidisciplinary team meeting observational tool (MDT-MOT)

PubMed Central

Harris, Jenny; Taylor, Cath; Sevdalis, Nick; Jalil, Rozh; Green, James S.A.

2016-01-01

Abstract Objective To develop a tool for independent observational assessment of cancer multidisciplinary team meetings (MDMs), and test criterion validity, inter-rater reliability/agreement and describe performance. Design Clinicians and experts in teamwork used a mixed-methods approach to develop and refine the tool. Study 1 observers rated pre-determined optimal/sub-optimal MDM film excerpts and Study 2 observers independently rated video-recordings of 10 MDMs. Setting Study 2 included 10 cancer MDMs in England. Participants Testing was undertaken by 13 health service staff and a clinical and non-clinical observer. Intervention None. Main Outcome Measures Tool development, validity, reliability/agreement and variability in MDT performance. Results Study 1: Observers were able to discriminate between optimal and sub-optimal MDM performance (P ≤ 0.05). Study 2: Inter-rater reliability was good for 3/10 domains. Percentage of absolute agreement was high (≥80%) for 4/10 domains and percentage agreement within 1 point was high for 9/10 domains. Four MDTs performed well (scored 3+ in at least 8/10 domains), 5 MDTs performed well in 6–7 domains and 1 MDT performed well in only 4 domains. Leadership and chairing of the meeting, the organization and administration of the meeting, and clinical decision-making processes all varied significantly between MDMs (P ≤ 0.01). Conclusions MDT-MOT demonstrated good criterion validity. Agreement between clinical and non-clinical observers (within one point on the scale) was high but this was inconsistent with reliability coefficients and warrants further investigation. If further validated MDT-MOT might provide a useful mechanism for the routine assessment of MDMs by the local workforce to drive improvements in MDT performance. PMID:27084499
Development and testing of the cancer multidisciplinary team meeting observational tool (MDT-MOT).

PubMed

Harris, Jenny; Taylor, Cath; Sevdalis, Nick; Jalil, Rozh; Green, James S A

2016-06-01

To develop a tool for independent observational assessment of cancer multidisciplinary team meetings (MDMs), and test criterion validity, inter-rater reliability/agreement and describe performance. Clinicians and experts in teamwork used a mixed-methods approach to develop and refine the tool. Study 1 observers rated pre-determined optimal/sub-optimal MDM film excerpts and Study 2 observers independently rated video-recordings of 10 MDMs. Study 2 included 10 cancer MDMs in England. Testing was undertaken by 13 health service staff and a clinical and non-clinical observer. None. Tool development, validity, reliability/agreement and variability in MDT performance. Study 1: Observers were able to discriminate between optimal and sub-optimal MDM performance (P ≤ 0.05). Study 2: Inter-rater reliability was good for 3/10 domains. Percentage of absolute agreement was high (≥80%) for 4/10 domains and percentage agreement within 1 point was high for 9/10 domains. Four MDTs performed well (scored 3+ in at least 8/10 domains), 5 MDTs performed well in 6-7 domains and 1 MDT performed well in only 4 domains. Leadership and chairing of the meeting, the organization and administration of the meeting, and clinical decision-making processes all varied significantly between MDMs (P ≤ 0.01). MDT-MOT demonstrated good criterion validity. Agreement between clinical and non-clinical observers (within one point on the scale) was high but this was inconsistent with reliability coefficients and warrants further investigation. If further validated MDT-MOT might provide a useful mechanism for the routine assessment of MDMs by the local workforce to drive improvements in MDT performance. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
Media ratings for movies, music, video games, and television: a review of the research and recommendations for improvements.

PubMed

Gentile, Douglas A; Humphrey, Jeremy; Walsh, David A

2005-06-01

This article review is organized by studies that are relevant for testing the reliability and validity of ratings systems. Specifically, the interrater reliability, consistency, temporal stability, content validity, construct validity, and criterion validity of media ratings systems are reviewed. Data that are related to testing the "forbidden fruit" and "tainted fruit" hypotheses also are reviewed. Several changes are recommended to improve the ratings systems, including the creation of a universal ratings system that could be applied equally to all media. The research reviewed here can provide a guide for how to construct a reliable, valid, and more useful ratings system. This is important because the decisions that parents make regarding their children's media use can be only as good as the information to which the parents have access.
Criterion and concurrent validity of Conners Adult ADHD Diagnostic Interview for DSM-IV (CAADID) Spanish version.

PubMed

Ramos-Quiroga, Josep Antoni; Bosch, Rosa; Richarte, Vanesa; Valero, Sergi; Gómez-Barros, Nuria; Nogueira, Mariana; Palomar, Gloria; Corrales, Montse; Sáez-Francàs, Naia; Corominas, Margarida; Real, Alberto; Vidal, Raquel; Chalita, Pablo J; Casas, Miguel

2012-01-01

Attention deficit hyperactivity disorder (ADHD) is a common neuropsychiatric disorder in adulthood. Its diagnosis requires a retrospective evaluation of ADHD symptoms in childhood, the continuity of these symptoms in adulthood, and a differential diagnosis. For these reasons, diagnosis of ADHD in adults is a complex process which needs effective diagnostic tools. To analyse the criterion validity of the CAADID semi-structured interview, Spanish version, and the concurrent validity compared with other ADHD severity scales. An observational case-control study was conducted on 691 patients with ADHD. They were out-patients treated in a program for adults with ADHD in a hospital. A sensitivity of 98.86%, specificity 67.68%, positive predictive value 90.77% and a negative predictive value 94.87% were observed. Diagnostic precision was 91.46%. The kappa index concordance between the clinical diagnostic interview and the CAADID was 0.88. Good concurrent validity was obtained, the CAADID correlated significantly with WURS scale (r=0.522, P<.01), ADHD Rating Scale (r=0.670, P<.0.1) and CAARS (self-rating version; r=0.656, P<.01 and observer-report r=0.514, P<.01). CAADID is a valid and useful tool for the diagnosis of ADHD in adults for clinical, as well as for research purposes. Copyright © 2012 SEP y SEPB. Published by Elsevier España, S.L. All rights reserved.
Workaholism in Brazil: measurement and individual differences.

PubMed

Romeo, Marina; Yepes-Baldó, Montserrat; Berger, Rita; Netto Da Costa, Francisco Franco

2014-01-01

The aim of this research is the measurement and assessment of individual differences of workaholism in Brazil, an important issue which affects the competitiveness of companies. The WART 15-PBV was applied to a sample of 153 managers from companies located in Brazil, 82 (53.6%) women and 71 (46.4%) men. Ages ranged from 20 to 69 years with an average value of 41 (SD=9.06). We analyzed, on one hand, the factor structure of the questionnaire, its internal consistency and convergent (with the Dutch Work Addiction Scale - DUWAS) and criterion validity (with General Health Questionnaire GHQ). On the other hand, we analyzed individual gender differences on workaholism. WART15-PBV has good psychometric properties, and evidence for convergent and criterion validity. Females and males differed on Impaired Communication / Self-Absorption dimension. This dimension has a direct effect only on mens health perception, while Compulsive tendencies dimension has a direct effect for both genders. The findings suggest the WART15-PBV is a valid measure of workaholism that would contribute to the workers health and their professional and personal life, in order to encourage adequate conditions in the workplace taking into account workers individual differences.
Polish translation and validation of the Pelvic Organ Prolapse/Urinary Incontinence Sexual Questionnaire, IUGA-Revised (PISQ-IR).

PubMed

Grzybowska, Magdalena Emilia; Piaskowska-Cala, Justyna; Wydra, Dariusz Grzegorz

2017-12-29

The aim of the study was to translate into Polish the Pelvic Organ Prolapse/Incontinence Sexual Questionnaire, IUGA-Revised (PISQ-IR), which evaluates sexual function in sexually active (SA) and not SA (NSA) women with pelvic floor disorders (PFD), and to validate the Polish version. After translation, back-translation and cognitive interviews, the final version of PISQ-IR was established. The study group included 252 women with PFD (124 NSA and 128 SA). All women underwent clinical evaluation and completed the PISQ-IR. For test-retest reliability, the questionnaire was administered to 99 patients twice at an interval of 2 weeks. The analysis of criterion validity required the subjects to complete self-reported measures. Internal consistency and criterion validity were assessed separately for NSA and SA women for the PISQ-IR subscales. The mean age of the women was 60.9 ± 10.6 years and their mean BMI was 27.9 ± 4.9 kg/m 2 . Postmenopausal women constituted 82.5% of the study group. Urinary incontinence (UI) was diagnosed in 60 women (23.8%), pelvic organ prolapse (POP) in 90 (35.7%), and UI and POP in 102 (40.5%). Fecal incontinence was reported by 45 women (17.9%). The PISQ-IR Polish version proved to have good internal consistency in NSA women (α 0.651 to 0.857) and SA women (α 0.605 to 0.887), and strong reliability in all subscales (Pearson's coefficient 0.759-0.899; p < 0.001). Criterion validity confirmed moderate to strong correlations between PISQ-IR scores and self-reported measures in SA subscales, as well the SA summary score, and weak to moderate correlations in NSA women. The PISQ-IR Polish version is a valid tool for evaluating sexual function in women with PFD.
A newer and broader definition of burnout: validation of the "Burnout Clinical Subtype Questionnaire (BCSQ-36)".

PubMed

Montero-Marín, Jesús; García-Campayo, Javier

2010-06-02

Burnout syndrome has been clinically characterised by a series of three subtypes: frenetic, underchallenged, and worn-out, with reference to coping strategies for stress and frustration at work with different degrees of dedication. The aims of the study are to present an operating definition of these subtypes in order to assess their reliability and convergent validity with respect to a standard burnout criterion and to examine differences with regard to sex and the temporary nature of work contracts. An exploratory factor analysis was performed by the main component method on a range of items devised by experts. The sample was composed of 409 employees of the University of Zaragoza, Spain. The reliability of the scales was assessed with Cronbach's alpha, convergent validity in relation to the Maslach Burnout Inventory with Pearson's r, and differences with Student's t-test and the Mann-Whitney U test. The factorial validity and reliability of the scales were good. The subtypes presented relations of differing degrees with the criterion dimensions, which were greater when dedication to work was lower. The frenetic profile presented fewer relations with the criterion dimensions while the worn-out profile presented relations of the greatest magnitude. Sex was not influential in establishing differences. However, the temporary nature of work contracts was found to have an effect: temporary employees exhibited higher scores in the frenetic profile (p < 0.001), while permanent employees did so in the underchallenged (p = 0.018) and worn-out (p < 0.001) profiles. The classical Maslach description of burnout does not include the frenetic profile; therefore, these patients are not recognised. The developed questionnaire may be a useful tool for the design and appraisal of specific preventive and treatment approaches based on the type of burnout experienced.
Psychometric properties and validation of the Reasons for Living Inventory in an outpatient clinical population in Malaysia.

PubMed

Aishvarya, S; Maniam, T; Karuthan, C; Sidi, Hatta; Ruzyanei, Nik; Oei, T P S

2014-01-01

The Reasons For Living Inventory has been shown to have good psychometric properties in Western populations for the past three decades. The present study examined the psychometric properties and factor structure of English and Malay version of the Reasons For Living (RFL) Inventory in a sample of clinical outpatients in Malaysia. The RFL is designed to assess an individual's various reasons for not committing suicide. A total of 483 participants (283 with psychiatric illnesses and 200 with non-psychiatric medical illnesses) completed the RFL and other self-report instruments. Results of the EFA (exploratory factor analysis) and CFA (confirmatory factor analysis) supported the fit for the six-factor oblique model as the best-fitting model. The internal consistency of the RFL was α=.94 and it was found to be high with good concurrent, criterion and discriminative validities. Thus, the RFL is a reliable and valid instrument to measure the various reasons for not committing suicide among psychiatry and medical outpatients in Malaysia. © 2014.
An Application of Practical Strategies in Assessing the Criterion-Related Validity of Credentialing Examinations.

ERIC Educational Resources Information Center

Fidler, James R.

1993-01-01

Criterion-related validities of 2 laboratory practitioner certification examinations for medical technologists (MTs) and medical laboratory technicians (MLTs) were assessed for 81 MT and 70 MLT examinees. Validity coefficients are presented for both measures. Overall, summative ratings yielded stronger validity coefficients than ratings based on…
Validation of a combined health literacy and numeracy instrument for patients with type 2 diabetes.

PubMed

Luo, Huabin; Patil, Shivajirao P; Wu, Qiang; Bell, Ronny A; Cummings, Doyle M; Adams, Alyssa D; Hambidge, Bertha; Craven, Kay; Gao, Fei

2018-05-20

This study aimed to validate a new consolidated measure of health literacy and numeracy (health literacy scale [HLS] plus the subjective numeracy scale [SNS]) in patients with type 2 diabetes (T2DM). A convenience sample (N = 102) of patients with T2DM was recruited from an academic family medicine center in the southeastern US between September-December 2017. Participants completed a questionnaire that included the composite HLS/SNS (22 questions) and a commonly used objective measure of health literacy-S-TOFHLA (40 questions). Internal reliability of the HLS/SNS was assessed using Cronbach's alpha. Criterion and construct validity was assessed against the S-TOFHLA. The composite HLS/SNS had good internal reliability (Cronbach's alpha = 0.83). A confirmatory factor analysis revealed there were four factors in the new instrument. Model fit indices showed good model-data fit (RMSEA = 0.08). The Spearman's rank order correlation coefficient between the HLS/SNS and the S-TOFHLA was 0.45 (p < 0.01). Our study suggests that the composite HLS/SNS is a reliable, valid instrument. Published by Elsevier B.V.
Inflammatory bowel disease-specific health-related quality of life instruments: a systematic review of measurement properties.

PubMed

Chen, Xin-Lin; Zhong, Liang-Huan; Wen, Yi; Liu, Tian-Wen; Li, Xiao-Ying; Hou, Zheng-Kun; Hu, Yue; Mo, Chuan-Wei; Liu, Feng-Bin

2017-09-15

This review aims to critically appraise and compare the measurement properties of inflammatory bowel disease (IBD)-specific health-related quality of life instruments. Medline, EMBASE and ISI Web of Knowledge were searched from their inception to May 2016. IBD-specific instruments for patients with Crohn's disease, ulcerative colitis or IBD were enrolled. The basic characteristics and domains of the instruments were collected. The methodological quality of measurement properties and measurement properties of the instruments were assessed. Fifteen IBD-specific instruments were included, which included twelve instruments for adult IBD patients and three for paediatric IBD patients. All of the instruments were developed in North American and European countries. The following common domains were identified: IBD-related symptoms, physical, emotional and social domain. The methodological quality was satisfactory for content validity; fair in internal consistency, reliability, structural validity, hypotheses testing and criterion validity; and poor in measurement error, cross-cultural validity and responsiveness. For adult IBD patients, the IBDQ-32 and its short version (SIBDQ) had good measurement properties and were the most widely used worldwide. For paediatric IBD patients, the IMPACT-III had good measurement properties and had more translated versions. Most methodological quality should be promoted, especially measurement error, cross-cultural validity and responsiveness. The IBDQ-32 was the most widely used instrument with good reliability and validity, followed by the SIBDQ and IMPACT-III. Further validation studies are necessary to support the use of other instruments.
Transcultural adaptation and initial validation of Brazilian-Portuguese version of the Basel assessment of adherence to immunosuppressive medications scale (BAASIS) in kidney transplants

PubMed Central

2013-01-01

Background Transplant recipients are expected to adhere to a lifelong immunosuppressant therapeutic regimen. However, nonadherence to treatment is an underestimated problem for which no properly validated measurement tool is available for Portuguese-speaking patients. We aimed to initially validate the Basel Assessment of Adherence to Immunosuppressive Medications Scale (BAASIS®) to accurately estimate immunosuppressant nonadherence in Brazilian transplant patients. Methods The BAASIS® (English version) was transculturally adapted and its psychometric properties were assessed. The transcultural adaptation was performed using the Guillemin protocol. Psychometric testing included reliability (intraobserver and interobserver reproducibility, agreement, Kappa coefficient, and the Cronbach’s alpha) and validity (content, criterion, and construct validities). Results The final version of the transculturally adapted BAASIS® was pretested, and no difficulties in understanding its content were found. The intraobserver and interobserver reproducibility variances (0.007 and 0.003, respectively), the Cronbach’s alpha (0.7), Kappa coefficient (0.88) and the agreement (95.2%) suggest accuracy, preciseness and reliability. For construct validity, exploratory factorial analysis demonstrated unidimensionality of the first three questions (r = 0.76, r = 0.80, and r = 0.68). For criterion validity, the adapted BAASIS® was correlated with another self-report instrument, the Measure of Adherence to Treatment, and showed good congruence (r = 0.65). Conclusions The BAASIS® has adequate psychometric properties and may be employed in advance to measure adherence to posttransplant immunosuppressant treatments. This instrument will be the first one validated to use in this specific transplant population and in the Portuguese language. PMID:23692889
Reliability and validity of the Children's Fear Survey Schedule-Dental Subscale for Arabic-speaking children: a cross-sectional study.

PubMed

El-Housseiny, Azza A; Alsadat, Farah A; Alamoudi, Najlaa M; El Derwi, Douaa A; Farsi, Najat M; Attar, Moaz H; Andijani, Basil M

2016-04-14

Early recognition of dental fear is essential for the effective delivery of dental care. This study aimed to test the reliability and validity of the Arabic version of the Children's Fear Survey Schedule-Dental Subscale (CFSS-DS). A school-based sample of 1546 children was randomly recruited. The Arabic version of the CFSS-DS was completed by children during class time. The scale was tested for internal consistency and test-retest reliability. To test criterion validity, children's behavior was assessed using the Frankl scale during dental examination, and results were compared with children's CFSS-DS scores. To test the scale's construct validity, scores on "fear of going to the dentist soon" were correlated with CFSS-DS scores. Factor analysis was also used. The Arabic version of the CFSS-DS showed high reliability regarding both test-retest reliability (intraclass correlation = 0.83, p < 0.001) and internal consistency (Cronbach's α = 0.88). It showed good criterion validity: children with negative behavior had significantly higher fear scores (t = 13.67, p < 0.001). It also showed moderate construct validity (Spearman's rho correlation, r = 0.53, p < 0.001). Factor analysis identified the following factors: "fear of invasive dental procedures," "fear of less invasive dental procedures" and "fear of strangers." The Arabic version of the CFSS-DS is a reliable and valid measure of dental fear in Arabic-speaking children. Pediatric dentists and researchers may use this validated version of the CFSS-DS to measure dental fear in Arabic-speaking children.
Criterion-Related Validity: Assessing the Value of Subscores

ERIC Educational Resources Information Center

Davison, Mark L.; Davenport, Ernest C., Jr.; Chang, Yu-Feng; Vue, Kory; Su, Shiyang

2015-01-01

Criterion-related profile analysis (CPA) can be used to assess whether subscores of a test or test battery account for more criterion variance than does a single total score. Application of CPA to subscore evaluation is described, compared to alternative procedures, and illustrated using SAT data. Considerations other than validity and reliability…
Development and Initial Validation of the Multicultural Personality Inventory (MPI).

PubMed

Ponterotto, Joseph G; Fietzer, Alexander W; Fingerhut, Esther C; Woerner, Scott; Stack, Lauren; Magaldi-Dopman, Danielle; Rust, Jonathan; Nakao, Gen; Tsai, Yu-Ting; Black, Natasha; Alba, Renaldo; Desai, Miraj; Frazier, Chantel; LaRue, Alyse; Liao, Pei-Wen

2014-01-01

Two studies summarize the development and initial validation of the Multicultural Personality Inventory (MPI). In Study 1, the 115-item prototype MPI was administered to 415 university students where exploratory factor analysis resulted in a 70-item, 7-factor model. In Study 2, the 70-item MPI and theoretically related companion instruments were administered to a multisite sample of 576 university students. Confirmatory factory analysis found the 7-factor structure to be a relatively good fit to the data (Comparative Fit Index =.954; root mean square error of approximation =.057), and MPI factors predicted variance in criterion variables above and beyond the variance accounted for by broad personality traits (i.e., Big Five). Study limitations and directions for further validation research are specified.
Validity of the Eating Attitudes Test and the Eating Disorders Inventory in Bulimia Nervosa.

ERIC Educational Resources Information Center

Gross, Janet; And Others

1986-01-01

Assessed criterion and concurrent validity of the Eating Attitudes Test and the Eating Disorder Inventory in 82 women with bulimia nervosa. Both tests demonstrated criterion validity by discriminating bulimia nervosa subjects from normals. Only weak support was found for concurrent validity within bulimia subjects. Recommends combination of…

Detecting depression among adolescents in Santiago, Chile: sex differences.

PubMed

Araya, Ricardo; Montero-Marin, Jesus; Barroilhet, Sergio; Fritsch, Rosemarie; Gaete, Jorge; Montgomery, Alan

2013-04-23

Depression among adolescents is common but most cases go undetected. Brief questionnaires offer an opportunity to identify probable cases but properly validated cut-off points are often unavailable, especially in non-western countries. Sex differences in the prevalence of depression become marked in adolescence and this needs to be accounted when establishing cut-off points. This study involved adolescents attending secondary state schools in Santiago, Chile. We compared the self-reported Beck Depression Inventory-II with a psychiatric interview to ascertain diagnosis. General psychometric features were estimated before establishing the criterion validity of the BDI-II. The BDI-II showed good psychometric properties with good internal consistency, a clear unidimensional factorial structure, and good capacity to discriminate between cases and non-cases of depression. Optimal cut-off points to establish caseness for depression were much higher for girls than boys. Sex discrepancies were primarily explained by differences in scores among those with depression rather than among those without depression. It is essential to validate scales with the populations intended to be used with. Sex differences are often ignored when applying cut-off points, leading to substantial misclassification. Early detection of depression is essential if we think that early intervention is a clinically important goal.
An Evaluation of Available Models for Estimating the Reliability and Validity of Criterion Referenced Measures.

ERIC Educational Resources Information Center

Oakland, Thomas

New strategies for evaluation criterion referenced measures (CRM) are discussed. These strategies examine the following issues: (1) the use of normed referenced measures (NRM) as CRM and then estimating the reliability and validity of such measures in terms of variance from an arbitrarily specified criterion score, (2) estimation of the…
Development of the Informing Relatives Inventory (IRI): Assessing Index Patients' Knowledge, Motivation and Self-Efficacy Regarding the Disclosure of Hereditary Cancer Risk Information to Relatives.

PubMed

de Geus, Eveline; Aalfs, Cora M; Menko, Fred H; Sijmons, Rolf H; Verdam, Mathilde G E; de Haes, Hanneke C J M; Smets, Ellen M A

2015-08-01

Despite the use of genetic services, counselees do not always share hereditary cancer information with at-risk relatives. Reasons for not informing relatives may be categorized as a lack of: knowledge, motivation, and/or self-efficacy. This study aims to develop and test the psychometric properties of the Informing Relatives Inventory, a battery of instruments that intend to measure counselees' knowledge, motivation, and self-efficacy regarding the disclosure of hereditary cancer risk information to at-risk relatives. Guided by the proposed conceptual framework, existing instruments were selected and new instruments were developed. We tested the instruments' acceptability, dimensionality, reliability, and criterion-related validity in consecutive index patients visiting the Clinical Genetics department with questions regarding hereditary breast and/or ovarian cancer or colon cancer. Data of 211 index patients were included (response rate = 62%). The Informing Relatives Inventory (IRI) assesses three barriers in disclosure representing seven domains. Instruments assessing index patients' (positive) motivation and self-efficacy were acceptable and reliable and suggested good criterion-related validity. Psychometric properties of instruments assessing index patients knowledge were disputable. These items were moderately accepted by index patients and the criterion-related validity was weaker. This study presents a first conceptual framework and associated inventory (IRI) that improves insight into index patients' barriers regarding the disclosure of genetic cancer information to at-risk relatives. Instruments assessing (positive) motivation and self-efficacy proved to be reliable measurements. Measuring index patients knowledge appeared to be more challenging. Further research is necessary to ensure IRI's dimensionality and sensitivity to change.
Criterion-Validity of Commercially Available Physical Activity Tracker to Estimate Step Count, Covered Distance and Energy Expenditure during Sports Conditions

PubMed Central

Wahl, Yvonne; Düking, Peter; Droszez, Anna; Wahl, Patrick; Mester, Joachim

2017-01-01

Background: In the past years, there was an increasing development of physical activity tracker (Wearables). For recreational people, testing of these devices under walking or light jogging conditions might be sufficient. For (elite) athletes, however, scientific trustworthiness needs to be given for a broad spectrum of velocities or even fast changes in velocities reflecting the demands of the sport. Therefore, the aim was to evaluate the validity of eleven Wearables for monitoring step count, covered distance and energy expenditure (EE) under laboratory conditions with different constant and varying velocities. Methods: Twenty healthy sport students (10 men, 10 women) performed a running protocol consisting of four 5 min stages of different constant velocities (4.3; 7.2; 10.1; 13.0 km·h−1), a 5 min period of intermittent velocity, and a 2.4 km outdoor run (10.1 km·h−1) while wearing eleven different Wearables (Bodymedia Sensewear, Beurer AS 80, Polar Loop, Garmin Vivofit, Garmin Vivosmart, Garmin Vivoactive, Garmin Forerunner 920XT, Fitbit Charge, Fitbit Charge HR, Xaomi MiBand, Withings Pulse Ox). Step count, covered distance, and EE were evaluated by comparing each Wearable with a criterion method (Optogait system and manual counting for step count, treadmill for covered distance and indirect calorimetry for EE). Results: All Wearables, except Bodymedia Sensewear, Polar Loop, and Beurer AS80, revealed good validity (small MAPE, good ICC) for all constant and varying velocities for monitoring step count. For covered distance, all Wearables showed a very low ICC (<0.1) and high MAPE (up to 50%), revealing no good validity. The measurement of EE was acceptable for the Garmin, Fitbit and Withings Wearables (small to moderate MAPE), while Bodymedia Sensewear, Polar Loop, and Beurer AS80 showed a high MAPE up to 56% for all test conditions. Conclusion: In our study, most Wearables provide an acceptable level of validity for step counts at different constant and intermittent running velocities reflecting sports conditions. However, the covered distance, as well as the EE could not be assessed validly with the investigated Wearables. Consequently, covered distance and EE should not be monitored with the presented Wearables, in sport specific conditions. PMID:29018355
Development and validation of the Chinese version of the Diabetes Management Self-efficacy Scale.

PubMed

Vivienne Wu, Shu-Fang; Courtney, Mary; Edwards, Helen; McDowell, Jan; Shortridge-Baggett, Lillie M; Chang, Pei-Jen

2008-04-01

The purpose of this study was to translate the Diabetes Management Self-Efficacy Scale (DMSES) into Chinese and test the validity and reliability of the instrument within a Taiwanese population. A two-stage design was used for this study. Stage I consisted of a multi-stepped process of forward and backward translation, using focus groups and consensus meetings to translate the 20-item Australia/English version DMSES to Chinese and test content validity. Stage II established the psychometric properties of the Chinese version DMSES (C-DMSES) by examining the criterion, convergent and construct validity, internal consistency and stability testing. The sample for Stage II comprised 230 patients with type 2 diabetes aged 30 years or more from a diabetes outpatient clinic in Taiwan. Three items were modified to better reflect Chinese practice. The C-DMSES obtained a total average CVI score of .86. The convergent validity of the C-DMSES correlated well with the validated measure of the General Self-Efficacy Scale in measuring self-efficacy (r=.55; p<.01). Criterion-related validity showed that the C-DMSES was a significant predictor of the Summary of Diabetes Self-Care Activities scores (Beta=.58; t=10.75, p<.01). Factor analysis supported the C-DMSES being composed of four subscales. Good internal consistency (Cronbach's alpha=.77 to .93) and test-retest reliability (Pearson correlation coefficient r=.86, p<.01) were found. The C-DMSES is a brief and psychometrically sound measure for evaluation of self-efficacy towards management of diabetes by persons with type 2 diabetes in Chinese populations.
First evaluation of the Behavioral Addiction Indoor Tanning Screener (BAITS) in a nationwide representative sample.

PubMed

Diehl, K; Görig, T; Breitbart, E W; Greinert, R; Hillhouse, J J; Stapleton, J L; Schneider, S

2018-01-01

Evidence suggests that indoor tanning may have addictive properties. However, many instruments for measuring indoor tanning addiction show poor validity and reliability. Recently, a new instrument, the Behavioral Addiction Indoor Tanning Screener (BAITS), has been developed. To test the validity and reliability of the BAITS by using a multimethod approach. We used data from the first wave of the National Cancer Aid Monitoring on Sunbed Use, which included a cognitive pretest (August 2015) and a Germany-wide representative survey (October to December 2015). In the cognitive pretest 10 users of tanning beds were interviewed and 3000 individuals aged 14-45 years were included in the representative survey. Potential symptoms of indoor tanning addiction were measured using the BAITS, a brief screening survey with seven items (answer categories: yes vs. no). Criterion validity was assessed by comparing the results of BAITS with usage parameters. Additionally, we tested internal consistency and construct validity. A total of 19·7% of current and 1·8% of former indoor tanning users were screened positive for symptoms of a potential indoor tanning addiction. We found significant associations between usage parameters and the BAITS (criterion validity). Internal consistency (reliability) was good (Kuder-Richardson-20, 0·854). The BAITS was shown to be a homogeneous construct (construct validity). Compared with other short instruments measuring symptoms of a potential indoor tanning addiction, the BAITS seems to be a valid and reliable tool. With its short length and the binary items the BAITS is easy to use in large surveys. © 2017 British Association of Dermatologists.
Validation of the Chinese version of functional assessment of anorexia-cachexia therapy (FAACT) scale for measuring quality of life in cancer patients with cachexia.

PubMed

Zhou, Ting; Yang, Kaixiang; Thapa, Sudip; Fu, Qiang; Jiang, Yongsheng; Yu, Shiying

2017-04-01

The assessment of quality of life (QOL) is an important part of cachexia management for cancer patients. Functional assessment of anorexia-cachexia therapy (FAACT), a specific QOL instrument for cachexia patients, has not been validated in Chinese population. The aim of this study was to validate the FAACT scale in Chinese cancer patients for its future use. Eligible cancer patients were included in our study. Patients' demographic and clinical characteristics were collected from the electronic medical records. Patients were asked to complete the Chinese version of FAACT scale and the MD Anderson symptom inventory (MDASI), and then the reliability and validity were analyzed. A total of 285 patients were enrolled in our study, data of 241 patients were evaluated. Coefficients of Cronbach's alpha, test-retest and split-half analyses were all greater than 0.8, which indicated an excellent reliability for FAACT scale. In item-subscale correlation analysis and factor analysis, good construct validity for FAACT scale was found. The correlation between FAACT and MDASI interference subscale showed reasonable criterion-related validity, and for further clinical validation, the FAACT scale showed excellent discriminative validity for distinguishing patients in different cachexia status and in different performance status. The Chinese version of FAACT scale has good reliability and validity and is suitable for measuring QOL of cachexia patients in Chinese population.
Development of a job stressor scale for nurses caring for patients with intractable neurological diseases.

PubMed

Ando, Yukako; Kataoka, Tsuyoshi; Okamura, Hitoshi; Tanaka, Katsutoshi; Kobayashi, Toshio

2013-12-01

The purpose of this research is to verify the reliability and validity of a job stressor scale for nurses caring for patients with intractable neurological diseases. A mail survey was conducted using a self-report questionnaire. The subjects were 263 nurses and assistant nurses working in wards specializing in intractable neurological diseases. The response rate was 71.9% (valid response rate, 66.2%). With regard to reliability, internal consistency and stability were assessed. Internal consistency was examined via Cronbach's alpha. For stability, the test-retest method was performed and stability was examined via intraclass correlation coefficients. With regard to validity, factor validity, criterion-related validity, and content validity were assessed. Exploratory factor analysis was used for factor validity. For criterion-related validity, an existing scale was used as an external criterion; concurrent validity was examined via Spearman's rank correlation coefficients. As a result of analysis, there were 26 items in the scale created with an eight factor structure. Cronbach's a for the 26 items was 0.90; with the exception of two factors, alpha for all of the individual sub-factors was high at 0.7 or higher. The intraclass correlation coefficient for the 26 items was 0.89 (p < 0.001). With regard to criterion-related validity, concurrent validity was confirmed and the correlation coefficient with an external criterion was 0.73 (p < 0.001). For content validity, subjects who responded that "The questionnaire represents a stressor well or to a degree" accounted for 81% of the total responses. Reliability and validity were confirmed, so the scale created in the current research is a usable scale.
Criterion-Related Validity of the TOEFL iBT Listening Section. TOEFL iBT Research Report. RR-09-02

ERIC Educational Resources Information Center

Sawaki, Yasuyo; Nissan, Susan

2009-01-01

The study investigated the criterion-related validity of the "Test of English as a Foreign Language"[TM] Internet-based test (TOEFL[R] iBT) Listening section by examining its relationship to a criterion measure designed to reflect language-use tasks that university students encounter in everyday academic life: listening to academic…
Assessing Sleep Disturbance in Low Back Pain: The Validity of Portable Instruments

PubMed Central

Alsaadi, Saad M.; McAuley, James H.; Hush, Julia M.; Bartlett, Delwyn J.; McKeough, Zoe M.; Grunstein, Ronald R.; Dungan, George C.; Maher, Chris G.

2014-01-01

Although portable instruments have been used in the assessment of sleep disturbance for patients with low back pain (LBP), the accuracy of the instruments in detecting sleep/wake episodes for this population is unknown. This study investigated the criterion validity of two portable instruments (Armband and Actiwatch) for assessing sleep disturbance in patients with LBP. 50 patients with LBP performed simultaneous overnight sleep recordings in a university sleep laboratory. All 50 participants were assessed by Polysomnography (PSG) and the Armband and a subgroup of 33 participants wore an Actiwatch. Criterion validity was determined by calculating epoch-by-epoch agreement, sensitivity, specificity and prevalence and bias- adjusted kappa (PABAK) for sleep versus wake between each instrument and PSG. The relationship between PSG and the two instruments was assessed using intraclass correlation coefficients (ICC 2, 1). The study participants showed symptoms of sub-threshold insomnia (mean ISI = 13.2, 95% CI = 6.36) and poor sleep quality (mean PSQI = 9.20, 95% CI = 4.27). Observed agreement with PSG was 85% and 88% for the Armband and Actiwatch. Sensitivity was 0.90 for both instruments and specificity was 0.54 and 0.67 and PABAK of 0.69 and 0.77 for the Armband and Actiwatch respectively. The ICC (95%CI) was 0.76 (0.61 to 0.86) and 0.80 (0.46 to 0.92) for total sleep time, 0.52 (0.29 to 0.70) and 0.55 (0.14 to 0.77) for sleep efficiency, 0.64 (0.45 to 0.78) and 0.52 (0.23 to 0.73) for wake after sleep onset and 0.13 (−0.15 to 0.39) and 0.33 (−0.05 to 0.63) for sleep onset latency, for the Armband and Actiwatch, respectively. The findings showed that both instruments have varied criterion validity across the sleep parameters from excellent validity for measures of total sleep time, good validity for measures of sleep efficiency and wake after onset to poor validity for sleep onset latency. PMID:24763506
Assessing sleep disturbance in low back pain: the validity of portable instruments.

PubMed

Alsaadi, Saad M; McAuley, James H; Hush, Julia M; Bartlett, Delwyn J; McKeough, Zoe M; Grunstein, Ronald R; Dungan, George C; Maher, Chris G

2014-01-01

Although portable instruments have been used in the assessment of sleep disturbance for patients with low back pain (LBP), the accuracy of the instruments in detecting sleep/wake episodes for this population is unknown. This study investigated the criterion validity of two portable instruments (Armband and Actiwatch) for assessing sleep disturbance in patients with LBP. 50 patients with LBP performed simultaneous overnight sleep recordings in a university sleep laboratory. All 50 participants were assessed by Polysomnography (PSG) and the Armband and a subgroup of 33 participants wore an Actiwatch. Criterion validity was determined by calculating epoch-by-epoch agreement, sensitivity, specificity and prevalence and bias- adjusted kappa (PABAK) for sleep versus wake between each instrument and PSG. The relationship between PSG and the two instruments was assessed using intraclass correlation coefficients (ICC 2, 1). The study participants showed symptoms of sub-threshold insomnia (mean ISI = 13.2, 95% CI = 6.36) and poor sleep quality (mean PSQI = 9.20, 95% CI = 4.27). Observed agreement with PSG was 85% and 88% for the Armband and Actiwatch. Sensitivity was 0.90 for both instruments and specificity was 0.54 and 0.67 and PABAK of 0.69 and 0.77 for the Armband and Actiwatch respectively. The ICC (95%CI) was 0.76 (0.61 to 0.86) and 0.80 (0.46 to 0.92) for total sleep time, 0.52 (0.29 to 0.70) and 0.55 (0.14 to 0.77) for sleep efficiency, 0.64 (0.45 to 0.78) and 0.52 (0.23 to 0.73) for wake after sleep onset and 0.13 (-0.15 to 0.39) and 0.33 (-0.05 to 0.63) for sleep onset latency, for the Armband and Actiwatch, respectively. The findings showed that both instruments have varied criterion validity across the sleep parameters from excellent validity for measures of total sleep time, good validity for measures of sleep efficiency and wake after onset to poor validity for sleep onset latency.
Evaluation of Measurement Instrument Criterion Validity in Finite Mixture Settings

ERIC Educational Resources Information Center

Raykov, Tenko; Marcoulides, George A.; Li, Tenglong

2016-01-01

A method for evaluating the validity of multicomponent measurement instruments in heterogeneous populations is discussed. The procedure can be used for point and interval estimation of criterion validity of linear composites in populations representing mixtures of an unknown number of latent classes. The approach permits also the evaluation of…
Evaluation of Validity and Reliability for Hierarchical Scales Using Latent Variable Modeling

ERIC Educational Resources Information Center

Raykov, Tenko; Marcoulides, George A.

2012-01-01

A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
An appraisal of the psychometric properties of the Clinician version of the Apathy Evaluation Scale (AES-C).

PubMed

Clarke, Diana E; Van Reekum, Robert; Patel, Jigisha; Simard, Martine; Gomez, Everlyne; Streiner, David L

2007-01-01

This article examines the psychometric properties of the clinician version of the Apathy Evaluation Scale (AES-C) to determine its ability to characterize, quantify and differentiate apathy. Critical appraisals of the item-reduction processes, effectiveness of the administration, coding and scoring procedures, and the reliability and validity of the scale were carried out. For training, administration and rating of the AES-C, clearer guidelines, including a more standardized list of verbal and non-verbal apathetic cues, are needed. There is evidence of high internal consistency for the scale across studies. In addition, the original study reported good test-retest and inter-rater reliability coefficients. However, there is a lack of replication on these more stable and informative measures of reliability and as such they warrant further investigation. The research evidence confirms that the AES-C shows good discriminant, convergent and criterion validity. However, evidence of its predictive validity is limited. As this aspect of validity refers to the scale's ability to predict future outcomes, which is important for treatment and rehabilitation planning, further assessment of the predictive validity of the AES-C is needed. In conclusion, the AES-C is a reliable and valid measure for the characterization and quantification of apathy. Copyright (c) 2007 John Wiley & Sons, Ltd.
Development and Validation of a Questionnaire on Breastfeeding Intentions, Attitudes and Knowledge of a Sample of Croatian Secondary-School Students.

PubMed

Čatipović, Marija; Marković, Martina; Grgurić, Josip

2018-04-27

Validating a questionnaire/instrument before proceeding to the field for data collection is important. An 18-item breastfeeding intention, 39-item attitude and 44-item knowledge questionnaire was validated in a Croatian sample of secondary-school students ( N = 277). For the intentions, principal component analysis (PCA) yielded a four-factor solution with 8 items explaining 68.3% of the total variance. Cronbach’s alpha (0.71) indicated satisfactory internal consistency. For the attitudes, PCA showed a seven-factor structure with 33 items explaining 58.41% of total variance. Cronbach’s alpha (0.87) indicated good internal consistency. There were 13 knowledge questions that were retained after item analysis, showing good internal consistency (KR20 = 0.83). In terms of criterion validity, the questionnaire differentiated between students who received breastfeeding education compared to students who were not educated in breastfeeding. Correlations between intentions and attitudes (r = 0.49), intentions and knowledge (r = 0.29), and attitudes and knowledge (r = 0.38) confirmed concurrent validity. The final instrument is reliable and valid for data collection on breastfeeding. Therefore, the instrument is recommended for evaluation of breastfeeding education programs aimed at upper-grade elementary and secondary school students.
Development and Validation of a Questionnaire on Breastfeeding Intentions, Attitudes and Knowledge of a Sample of Croatian Secondary-School Students

PubMed Central

Marković, Martina; Grgurić, Josip

2018-01-01

Background: Validating a questionnaire/instrument before proceeding to the field for data collection is important. Methods: An 18-item breastfeeding intention, 39-item attitude and 44-item knowledge questionnaire was validated in a Croatian sample of secondary-school students (N = 277). Results: For the intentions, principal component analysis (PCA) yielded a four-factor solution with 8 items explaining 68.3% of the total variance. Cronbach’s alpha (0.71) indicated satisfactory internal consistency. For the attitudes, PCA showed a seven-factor structure with 33 items explaining 58.41% of total variance. Cronbach’s alpha (0.87) indicated good internal consistency. There were 13 knowledge questions that were retained after item analysis, showing good internal consistency (KR20 = 0.83). In terms of criterion validity, the questionnaire differentiated between students who received breastfeeding education compared to students who were not educated in breastfeeding. Correlations between intentions and attitudes (r = 0.49), intentions and knowledge (r = 0.29), and attitudes and knowledge (r = 0.38) confirmed concurrent validity. Conclusions: The final instrument is reliable and valid for data collection on breastfeeding. Therefore, the instrument is recommended for evaluation of breastfeeding education programs aimed at upper-grade elementary and secondary school students. PMID:29702616
Construct Validity of the Relationship Profile Test: Links with measures of psychopathology and adult attachment

PubMed Central

Haggerty, Greg; Bornstein, Robert F.; Khalid, Mohammad; Sharma, Vishal; Riaz, Usman; Blanchard, Mark; Siefert, Caleb J; Sinclair, Samuel J.

2015-01-01

This study assessed the construct validity of the Relationship Profile Test (RPT; Bornstein & Languirand, 2003) with a substance abuse sample. One hundred-eight substance abuse patients completed the RPT, Experiences in Close Relationships Scale (ECR-SF; Wei, Russell, Mallinckrodt, & Vogel, 2007), Personality Assessment Inventory (PAI; Morey, 1991), and Symptom Checklist-90-Revised (SCL-90-R: Derogatis 1983). Results suggest that the RPT has good construct validity when compared against theoretically related broadband measures of personality, psychopathology and adult attachment. Overall, health hependency was negatively related to measures of psychopathology and insecure attachment, and overdependence was positively related to measures of psychopathology and attachment anxiety. Many of the predictions regarding RPT detachment and the criterion measures were not supported. Implications of these findings are discussed. PMID:26620463
A systematic review of the measurement properties of the European Organisation for Research and Treatment of Cancer In-patient Satisfaction with Care Questionnaire, the EORTC IN-PATSAT32.

PubMed

Neijenhuijs, Koen I; Jansen, Femke; Aaronson, Neil K; Brédart, Anne; Groenvold, Mogens; Holzner, Bernhard; Terwee, Caroline B; Cuijpers, Pim; Verdonck-de Leeuw, Irma M

2018-05-07

The EORTC IN-PATSAT32 is a patient-reported outcome measure (PROM) to assess cancer patients' satisfaction with in-patient health care. The aim of this study was to investigate whether the initial good measurement properties of the IN-PATSAT32 are confirmed in new studies. Within the scope of a larger systematic review study (Prospero ID 42017057237), a systematic search was performed of Embase, Medline, PsycINFO, and Web of Science for studies that investigated measurement properties of the IN-PATSAT32 up to July 2017. Study quality was assessed, data were extracted, and synthesized according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) methodology. Nine studies were included in this review. The evidence on reliability and construct validity were rated as sufficient and of the quality of the evidence as moderate. The evidence on structural validity was rated as insufficient and of low quality. The evidence on internal consistency was indeterminate. Measurement error, responsiveness, criterion validity, and cross-cultural validity were not reported in the included studies. Measurement error could be calculated for two studies and was judged indeterminate. In summary, the IN-PATSAT32 performs as expected with respect to reliability and construct validity. No firm conclusions can be made yet whether the IN-PATSAT32 also performs as well with respect to structural validity and internal consistency. Further research on these measurement properties of the PROM is therefore needed as well as on measurement error, responsiveness, criterion validity, and cross-cultural validity. For future studies, it is recommended to take the COSMIN methodology into account.
The Validation of a Case-Based, Cumulative Assessment and Progressions Examination

PubMed Central

Coker, Adeola O.; Copeland, Jeffrey T.; Gottlieb, Helmut B.; Horlen, Cheryl; Smith, Helen E.; Urteaga, Elizabeth M.; Ramsinghani, Sushma; Zertuche, Alejandra; Maize, David

2016-01-01

Objective. To assess content and criterion validity, as well as reliability of an internally developed, case-based, cumulative, high-stakes third-year Annual Student Assessment and Progression Examination (P3 ASAP Exam). Methods. Content validity was assessed through the writing-reviewing process. Criterion validity was assessed by comparing student scores on the P3 ASAP Exam with the nationally validated Pharmacy Curriculum Outcomes Assessment (PCOA). Reliability was assessed with psychometric analysis comparing student performance over four years. Results. The P3 ASAP Exam showed content validity through representation of didactic courses and professional outcomes. Similar scores on the P3 ASAP Exam and PCOA with Pearson correlation coefficient established criterion validity. Consistent student performance using Kuder-Richardson coefficient (KR-20) since 2012 reflected reliability of the examination. Conclusion. Pharmacy schools can implement internally developed, high-stakes, cumulative progression examinations that are valid and reliable using a robust writing-reviewing process and psychometric analyses. PMID:26941435
Creation and Initial Validation of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale

PubMed Central

Steele, Catriona M.; Namasivayam-MacDonald, Ashwini M.; Guida, Brittany T.; Cichero, Julie A.; Duivestein, Janice; MRSc; Hanson, Ben; Lam, Peter; Riquelme, Luis F.

2018-01-01

Objective To assess consensual validity, interrater reliability, and criterion validity of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale, a new functional outcome scale intended to capture the severity of oropharyngeal dysphagia, as represented by the degree of diet texture restriction recommended for the patient. Design Participants assigned International Dysphagia Diet Standardisation Initiative Functional Diet Scale scores to 16 clinical cases. Consensual validity was measured against reference scores determined by an author reference panel. Interrater reliability was measured overall and across quartile subsets of the dataset. Criterion validity was evaluated versus Functional Oral Intake Scale (FOIS) scores assigned by survey respondents to the same case scenarios. Feedback was requested regarding ease and likelihood of use. Setting Web-based survey. Participants Respondents (NZ170) from 29 countries. Interventions Not applicable. Main Outcome Measures Consensual validity (percent agreement and Kendall t), criterion validity (Spearman rank correlation), and interrater reliability (Kendall concordance and intraclass coefficients). Results The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed strong consensual validity, criterion validity, and interrater reliability. Scenarios involving liquid-only diets, transition from nonoral feeding, or trial diet advances in therapy showed the poorest consensus, indicating a need for clear instructions on how to score these situations. The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed greater sensitivity than the FOIS to specific changes in diet. Most (>70%) respondents indicated enthusiasm for implementing the International Dysphagia Diet Standardisation Initiative Functional Diet Scale. Conclusions This initial validation study suggests that the International Dysphagia Diet Standardisation Initiative Functional Diet Scale has strong consensual and criterion validity and can be used reliably by clinicians to capture diet texture restriction and progression in people with dysphagia. PMID:29428348

Creation and Initial Validation of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale.

PubMed

Steele, Catriona M; Namasivayam-MacDonald, Ashwini M; Guida, Brittany T; Cichero, Julie A; Duivestein, Janice; Hanson, Ben; Lam, Peter; Riquelme, Luis F

2018-05-01

To assess consensual validity, interrater reliability, and criterion validity of the International Dysphagia Diet Standardisation Initiative Functional Diet Scale, a new functional outcome scale intended to capture the severity of oropharyngeal dysphagia, as represented by the degree of diet texture restriction recommended for the patient. Participants assigned International Dysphagia Diet Standardisation Initiative Functional Diet Scale scores to 16 clinical cases. Consensual validity was measured against reference scores determined by an author reference panel. Interrater reliability was measured overall and across quartile subsets of the dataset. Criterion validity was evaluated versus Functional Oral Intake Scale (FOIS) scores assigned by survey respondents to the same case scenarios. Feedback was requested regarding ease and likelihood of use. Web-based survey. Respondents (N=170) from 29 countries. Not applicable. Consensual validity (percent agreement and Kendall τ), criterion validity (Spearman rank correlation), and interrater reliability (Kendall concordance and intraclass coefficients). The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed strong consensual validity, criterion validity, and interrater reliability. Scenarios involving liquid-only diets, transition from nonoral feeding, or trial diet advances in therapy showed the poorest consensus, indicating a need for clear instructions on how to score these situations. The International Dysphagia Diet Standardisation Initiative Functional Diet Scale showed greater sensitivity than the FOIS to specific changes in diet. Most (>70%) respondents indicated enthusiasm for implementing the International Dysphagia Diet Standardisation Initiative Functional Diet Scale. This initial validation study suggests that the International Dysphagia Diet Standardisation Initiative Functional Diet Scale has strong consensual and criterion validity and can be used reliably by clinicians to capture diet texture restriction and progression in people with dysphagia. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Toward a Measure of Accountability in Nursing: A Three-Stage Validation Study.

PubMed

Drach-Zahavy, Anat; Leonenko, Marina; Srulovici, Einav

2018-06-04

To develop and psychometrically evaluate a three-dimensional questionnaire suitable for evaluating personal and organizational accountability in nurses. Accountability is defined as a three-dimensional value, directing professionals to take responsibility for their decisions and actions, to be willing to explain them (transparency) and to be judged according to society's accepted values (answerability). Despite the relatively clear definition, measurement of accountability lags well behind. Existing self-report questionnaires do not fully capture the complexity of the concept; nor do they capture the different sources of accountability (e.g., personal accountability, organizational accountability). A three-stage measure development. Data were collected during 2015-2016. In Phase 1, an initial database of items (N = 74) was developed, based on literature review and qualitative study, establishing face and content validity. In Phase 2, the face, content, construct and criterion-related validity of the initial questionnaires (19 items for personal and organizational accountability questionnaire) was established with a sample of 229 nurses. In Phase 3, the final questionnaires (19 items each) were validated with a new sample of 329 nurses and established construct validity. The final version of the instruments comprised 19 items, suitable for assessing personal and organizational accountability. The questionnaire referred to the dimensions of responsibility, transparency and answerability. The findings established the instrument's content, construct and criterion-related validity, as well as good internal reliability. The questionnaire portrays accountability in nursing, by capturing nurses' subjective perceptions of accountability dimensions (responsibility, transparency, answerability), as demonstrated by personal and organizational values. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
The Servant Leadership Survey: Development and Validation of a Multidimensional Measure.

PubMed

van Dierendonck, Dirk; Nuijten, Inge

2011-09-01

PURPOSE: The purpose of this paper is to describe the development and validation of a multi-dimensional instrument to measure servant leadership. DESIGN/METHODOLOGY/APPROACH: Based on an extensive literature review and expert judgment, 99 items were formulated. In three steps, using eight samples totaling 1571 persons from The Netherlands and the UK with a diverse occupational background, a combined exploratory and confirmatory factor analysis approach was used. This was followed by an analysis of the criterion-related validity. FINDINGS: The final result is an eight-dimensional measure of 30 items: the eight dimensions being: standing back, forgiveness, courage, empowerment, accountability, authenticity, humility, and stewardship. The internal consistency of the subscales is good. The results show that the Servant Leadership Survey (SLS) has convergent validity with other leadership measures, and also adds unique elements to the leadership field. Evidence for criterion-related validity came from studies relating the eight dimensions to well-being and performance. IMPLICATIONS: With this survey, a valid and reliable instrument to measure the essential elements of servant leadership has been introduced. ORIGINALITY/VALUE: The SLS is the first measure where the underlying factor structure was developed and confirmed across several field studies in two countries. It can be used in future studies to test the underlying premises of servant leadership theory. The SLS provides a clear picture of the key servant leadership qualities and shows where improvements can be made on the individual and organizational level; as such, it may also offer a valuable starting point for training and leadership development.
Spanish version of the Mattis Dementia Rating Scale-2 for early detection of Alzheimer's disease and mild cognitive impairment.

PubMed

Boycheva, Elina; Contador, Israel; Fernández-Calvo, Bernardino; Ramos-Campos, Francisco; Puertas-Martín, Verónica; Villarejo-Galende, Alberto; Bermejo-Pareja, Félix

2018-06-01

We aimed to analyse the clinical utility of the Mattis Dementia Rating Scale (MDRS-2) for early detection of Alzheimer's disease (AD) and amnestic mild cognitive impairment (MCI) in a sample of Spanish older adults. A total of 125 participants (age = 75.12 ± 6.83, years of education =7.08 ± 3.57) were classified in three diagnostic groups: 45 patients with mild AD, 37 with amnestic MCI-single and multiple domain and 43 cognitively healthy controls (HCs). Reliability, criterion validity and diagnostic accuracy of the MDRS-2 (total and subscales) were analysed. The MDRS-2 scores, adjusted by socio-demographic characteristics, were calculated through hierarchical multiple regression analysis. The global scale had adequate reliability (α = 0.736) and good criterion validity (r = 0.760, p < .001) with the Mini-Mental State Examination. The optimal cut-off point between AD patients and HCs was 124 (sensitivity [Se] = 97% and specificity [Sp] = 95%), whereas 131 (Se = 89%, Sp = 81%) was the optimal cut-off point between MCI and HCs. An optimal cut-off point of 123 had good Se (0.97), but poor Sp (0.56) to differentiate AD and MCI groups. The Memory and Initiation/Perseveration subscales had the highest discriminative capacity between the groups. The MDRS-2 is a reliable and valid instrument for the assessment of cognitive impairment in Spanish older adults. In particular, optimal capacity emerged for the detection of early AD and MCI. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Children's Behavior in the Postanesthesia Care Unit: The Development of the Child Behavior Coding System-PACU (CBCS-P)

PubMed Central

Tan, Edwin T.; Martin, Sarah R.; Fortier, Michelle A.; Kain, Zeev N.

2012-01-01

Objective To develop and validate a behavioral coding measure, the Children's Behavior Coding System-PACU (CBCS-P), for children's distress and nondistress behaviors while in the postanesthesia recovery unit. Methods A multidisciplinary team examined videotapes of children in the PACU and developed a coding scheme that subsequently underwent a refinement process (CBCS-P). To examine the reliability and validity of the coding system, 121 children and their parents were videotaped during their stay in the PACU. Participants were healthy children undergoing elective, outpatient surgery and general anesthesia. The CBCS-P was utilized and objective data from medical charts (analgesic consumption and pain scores) were extracted to establish validity. Results Kappa values indicated good-to-excellent (κ's > .65) interrater reliability of the individual codes. The CBCS-P had good criterion validity when compared to children's analgesic consumption and pain scores. Conclusions The CBCS-P is a reliable, observational coding method that captures children's distress and nondistress postoperative behaviors. These findings highlight the importance of considering context in both the development and application of observational coding schemes. PMID:22167123
Development and Validation of Personality Disorder Spectra Scales for the MMPI-2-RF.

PubMed

Sellbom, Martin; Waugh, Mark H; Hopwood, Christopher J

2018-01-01

The purpose of this study was to develop and validate a set of MMPI-2-RF (Ben-Porath & Tellegen, 2008/2011) personality disorder (PD) spectra scales. These scales could serve the purpose of assisting with DSM-5 PD diagnosis and help link categorical and dimensional conceptions of personality pathology within the MMPI-2-RF. We developed and provided initial validity results for scales corresponding to the 10 PD constructs listed in the DSM-5 using data from student, community, clinical, and correctional samples. Initial validation efforts indicated good support for criterion validity with an external PD measure as well as with dimensional personality traits included in the DSM-5 alternative model for PDs. Construct validity results using psychosocial history and therapists' ratings in a large clinical sample were generally supportive as well. Overall, these brief scales provide clinicians using MMPI-2-RF data with estimates of DSM-5 PD constructs that can support cross-model connections between categorical and dimensional assessment approaches.
The Counselor Evaluation Rating Scale: A Valid Criterion of Counselor Effectiveness?

ERIC Educational Resources Information Center

Jones, Lawrence K.

1974-01-01

The validity of recent recommendations regarding the use of certain factors of the 16 Personality Factor Questionnaire (16PF) to select persons for counselor training programs, where the CERS was the criterion measure, is challenged. (Author)
Development of an opioid-related Overdose Risk Behavior Scale (ORBS).

PubMed

Pouget, Enrique R; Bennett, Alex S; Elliott, Luther; Wolfson-Stofko, Brett; Almeñana, Ramona; Britton, Peter C; Rosenblum, Andrew

2017-01-01

Drug overdose has emerged as the leading cause of injury-related death in the United States, driven by prescription opioid (PO) misuse, polysubstance use, and use of heroin. To better understand opioid-related overdose risks that may change over time and across populations, there is a need for a more comprehensive assessment of related risk behaviors. Drawing on existing research, formative interviews, and discussions with community and scientific advisors an opioid-related Overdose Risk Behavior Scale (ORBS) was developed. Military veterans reporting any use of heroin or POs in the past month were enrolled using venue-based and chain referral recruitment. The final scale consisted of 25 items grouped into 5 subscales eliciting the number of days in the past 30 during which the participant engaged in each behavior. Internal reliability, test-retest reliability and criterion validity were assessed using Cronbach's alpha, intraclass correlations (ICC) and Pearson's correlations with indicators of having overdosed during the past 30 days, respectivelyInternal reliability, test-retest reliability and criterion validity were assessed using Cronbach's alpha, intraclass correlations (ICC) and Pearson's correlations with indicators of having overdosed during the past 30 days, respectively. Data for 220 veterans were analyzed. The 5 subscales-(A) Adherence to Opioid Dosage and Therapeutic Purposes; (B) Alternative Methods of Opioid Administration; (C) Solitary Opioid Use; (D) Use of Nonprescribed Overdose-associated Drugs; and (E) Concurrent Use of POs, Other Psychoactive Drugs and Alcohol-generally showed good internal reliability (alpha range = 0.61 to 0.88), test-retest reliability (ICC range = 0.81 to 0.90), and criterion validity (r range = 0.22 to 0.66). The subscales were internally consistent with each other (alpha = 0.84). The scale mean had an ICC value of 0.99, and correlations with validators ranged from 0.44 to 0.56. These results constitute preliminary evidence for the reliability and validity of the new scale. If further validated, it could help improve overdose prevention and response research and could help improve the precision of overdose education and prevention efforts.
Spanish translation, cross-cultural adaptation, and validation of the Questionnaire for Diabetes-Related Foot Disease (Q-DFD)

PubMed Central

Castillo-Tandazo, Wilson; Flores-Fortty, Adolfo; Feraud, Lourdes; Tettamanti, Daniel

2013-01-01

Purpose To translate, cross-culturally adapt, and validate the Questionnaire for Diabetes-Related Foot Disease (Q-DFD), originally created and validated in Australia, for its use in Spanish-speaking patients with diabetes mellitus. Patients and methods The translation and cross-cultural adaptation were based on international guidelines. The Spanish version of the survey was applied to a community-based (sample A) and a hospital clinic-based sample (samples B and C). Samples A and B were used to determine criterion and construct validity comparing the survey findings with clinical evaluation and medical records, respectively; while sample C was used to determine intra- and inter-rater reliability. Results After completing the rigorous translation process, only four items were considered problematic and required a new translation. In total, 127 patients were included in the validation study: 76 to determine criterion and construct validity and 41 to establish intra- and inter-rater reliability. For an overall diagnosis of diabetes-related foot disease, a substantial level of agreement was obtained when we compared the Q-DFD with the clinical assessment (kappa 0.77, sensitivity 80.4%, specificity 91.5%, positive likelihood ratio [LR+] 9.46, negative likelihood ratio [LR−] 0.21); while an almost perfect level of agreement was obtained when it was compared with medical records (kappa 0.88, sensitivity 87%, specificity 97%, LR+ 29.0, LR− 0.13). Survey reliability showed substantial levels of agreement, with kappa scores of 0.63 and 0.73 for intra- and inter-rater reliability, respectively. Conclusion The translated and cross-culturally adapted Q-DFD showed good psychometric properties (validity, reproducibility, and reliability) that allow its use in Spanish-speaking diabetic populations. PMID:24039434
Development of a Valid and Reliable Knee Articular Cartilage Condition-Specific Study Methodological Quality Score.

PubMed

Harris, Joshua D; Erickson, Brandon J; Cvetanovich, Gregory L; Abrams, Geoffrey D; McCormick, Frank M; Gupta, Anil K; Verma, Nikhil N; Bach, Bernard R; Cole, Brian J

2014-02-01

Condition-specific questionnaires are important components in evaluation of outcomes of surgical interventions. No condition-specific study methodological quality questionnaire exists for evaluation of outcomes of articular cartilage surgery in the knee. To develop a reliable and valid knee articular cartilage-specific study methodological quality questionnaire. Cross-sectional study. A stepwise, a priori-designed framework was created for development of a novel questionnaire. Relevant items to the topic were identified and extracted from a recent systematic review of 194 investigations of knee articular cartilage surgery. In addition, relevant items from existing generic study methodological quality questionnaires were identified. Items for a preliminary questionnaire were generated. Redundant and irrelevant items were eliminated, and acceptable items modified. The instrument was pretested and items weighed. The instrument, the MARK score (Methodological quality of ARticular cartilage studies of the Knee), was tested for validity (criterion validity) and reliability (inter- and intraobserver). A 19-item, 3-domain MARK score was developed. The 100-point scale score demonstrated face validity (focus group of 8 orthopaedic surgeons) and criterion validity (strong correlation to Cochrane Quality Assessment score and Modified Coleman Methodology Score). Interobserver reliability for the overall score was good (intraclass correlation coefficient [ICC], 0.842), and for all individual items of the MARK score, acceptable to perfect (ICC, 0.70-1.000). Intraobserver reliability ICC assessed over a 3-week interval was strong for 2 reviewers (≥0.90). The MARK score is a valid and reliable knee articular cartilage condition-specific study methodological quality instrument. This condition-specific questionnaire may be used to evaluate the quality of studies reporting outcomes of articular cartilage surgery in the knee.
Development and initial validation of the Pharmacist Frequency of Interprofessional Collaboration Instrument (FICI-P) in primary care.

PubMed

Van, Connie; Costa, Daniel; Mitchell, Bernadette; Abbott, Penny; Krass, Ines

2012-01-01

Existing validated measures of pharmacist-physician collaboration focus on measuring attitudes toward collaboration and do not measure frequency of collaborative interactions. To develop and validate an instrument to measure the frequency of collaboration between pharmacists and general practitioners (GPs) from the pharmacist's perspective. An 11-item Pharmacist Frequency of Interprofessional Collaboration Instrument (FICI-P) was developed and administered to 586 pharmacists in 8 divisions of general practice in New South Wales, Australia. The initial items were informed by a review of the literature in addition to interviews of pharmacists and GPs. Items were subjected to principal component and Rasch analyses to determine each item's and the overall measure's psychometric properties and for any needed refinements. Two hundred and twenty four (38%) of pharmacist surveys were completed and returned. Principal component analysis suggested removal of 1 item for a final 1-factor solution. The refined 10-item FICI-P demonstrated internal consistency reliability at Cronbach's alpha=0.90. After collapsing the original 5-point response scale to a 4-point response scale, the refined FICI-P demonstrated fit to the Rasch model. Criterion validity of the FICI-P was supported by the correlation of FICI-P scores with scores on a previously validated Physician-Pharmacist Collaboration Instrument. Validity was also supported by predicted differences in FICI-P scores between subgroups of respondents stratified on age, colocation with GPs, and interactions during the intern-training period. The refined 10-item FICI-P was shown to have good internal consistency, criterion validity, and fit to the Rasch model. The creation of such a tool may allow for the measure of impact in the evaluation of interventions designed to improve interprofessional collaboration between GPs and pharmacists. Copyright © 2012 Elsevier Inc. All rights reserved.
Design and validation of a three-instrument toolkit for the assessment of competence in electrocardiogram rhythm recognition.

PubMed

Hernández-Padilla, José M; Granero-Molina, José; Márquez-Hernández, Verónica V; Suthers, Fiona; López-Entrambasaguas, Olga M; Fernández-Sola, Cayetano

2017-06-01

Rapid and accurate interpretation of cardiac arrhythmias by nurses has been linked with safe practice and positive patient outcomes. Although training in electrocardiogram rhythm recognition is part of most undergraduate nursing programmes, research continues to suggest that nurses and nursing students lack competence in recognising cardiac rhythms. In order to promote patient safety, nursing educators must develop valid and reliable assessment tools that allow the rigorous assessment of this competence before nursing students are allowed to practise without supervision. The aim of this study was to develop and psychometrically evaluate a toolkit to holistically assess competence in electrocardiogram rhythm recognition. Following a convenience sampling technique, 293 nursing students from a nursing faculty in a Spanish university were recruited for the study. The following three instruments were developed and psychometrically tested: an electrocardiogram knowledge assessment tool (ECG-KAT), an electrocardiogram skills assessment tool (ECG-SAT) and an electrocardiogram self-efficacy assessment tool (ECG-SES). Reliability and validity (content, criterion and construct) of these tools were meticulously examined. A high Cronbach's alpha coefficient demonstrated the excellent reliability of the instruments (ECG-KAT=0.89; ECG-SAT=0.93; ECG-SES=0.98). An excellent context validity index (scales' average content validity index>0.94) and very good criterion validity were evidenced for all the tools. Regarding construct validity, principal component analysis revealed that all items comprising the instruments contributed to measure knowledge, skills or self-efficacy in electrocardiogram rhythm recognition. Moreover, known-groups analysis showed the tools' ability to detect expected differences in competence between groups with different training experiences. The three-instrument toolkit developed showed excellent psychometric properties for measuring competence in electrocardiogram rhythm recognition.
Validation of a single summary score for the Prolapse/Incontinence Sexual Questionnaire-IUGA revised (PISQ-IR).

PubMed

Constantine, Melissa L; Pauls, Rachel N; Rogers, Rebecca R; Rockwood, Todd H

2017-12-01

The Prolapse/Incontinence Sexual Questionnaire-International Urogynecology Association (IUGA) Revised (PISQ-IR) measures sexual function in women with pelvic floor disorders (PFDs) yet is unwieldy, with six individual subscale scores for sexually active women and four for women who are not. We hypothesized that a valid and responsive summary score could be created for the PISQ-IR. Item response data from participating women who completed a revised version of the PISQ-IR at three clinical sites were used to generate item weights using a magnitude estimation (ME) and Q-sort (Q) approaches. Item weights were applied to data from the original PISQ-IR validation to generate summary scores. Correlation and factor analysis methods were used to evaluate validity and responsiveness of summary scores. Weighted and nonweighted summary scores for the sexually active PISQ-IR demonstrated good criterion validity with condition-specific measures: Incontinence Severity Index = 0.12, 0.11, 0.11; Pelvic Floor Distress Inventory-20 = 0.39, 0.39, 0.12; Epidemiology of Prolapse and Incontinence Questionnaire-Q35 = 0.26 0,.25, 0.40); Female Sexual Functioning Index subscale total score = 0.72, 0.75, 0.72 for nonweighted, ME, and Q summary scores, respectively. Responsiveness evaluation showed weighted and nonweighted summary scores detected moderate effect sizes (Cohen's d > 0.5). Weighted items for those NSA demonstrated significant floor effects and did not meet criterion validity. A PISQ-IR summary score for use with sexually active women, nonweighted or calculated with ME or Q item weights, is a valid and reliable measure for clinical use. The summary scores provide value for assesing clinical treatment of pelvic floor disorders.
Comparing the Construct and Criterion-Related Validity of Ability-Based and Mixed-Model Measures of Emotional Intelligence

ERIC Educational Resources Information Center

Livingstone, Holly A.; Day, Arla L.

2005-01-01

Despite the popularity of the concept of emotional intelligence(EI), there is much controversy around its definition, measurement, and validity. Therefore, the authors examined the construct and criterion-related validity of an ability-based EI measure (Mayer Salovey Caruso Emotional Intelligence Test [MSCEIT]) and a mixed-model EI measure…
Reliability and criterion validity of an observation protocol for working technique assessments in cash register work.

PubMed

Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina

2016-06-01

We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.
Development of a questionnaire to measure heart disease risk knowledge in people with diabetes: the Heart Disease Fact Questionnaire.

PubMed

Wagner, Julie; Lacey, Kimberly; Chyun, Deborah; Abbott, Gina

2005-07-01

This paper describes a paper and pencil questionnaire that measures heart disease risk knowledge in people with diabetes. The Heart Disease Fact Questionnaire (HDFQ) is a 25-item questionnaire that was developed to tap into respondents' knowledge of major risk factors for the development of CHD. Approximately half of these items specifically address diabetes-related CHD risk factors. Based on extensive pilot data, the current study analyzed responses from 524 people with diabetes to assess the psychometric properties. The HDFQ is readable to an average 13-year old and imposes little burden. It shows good content and face validity. It demonstrates adequate internal consistency, with Kuder-Richardson-20 formula = 0.77 and good item-total correlations. Item analysis showed a desirable range in P-values. In discriminant function analyses, HDFQ scores differentiated respondents by knowledge of their own cardiovascular health, use of lipid lowering medications, health insurance status, and educational attainment, thus indicating good criterion related validity. This measure of heart disease risk knowledge is brief, understandable to respondents, and easy to administer and score. Its potential for use in research and practice is discussed. Future research should establish norms as well as investigate its test-retest reliability and predictive validity.
Validity of proposed DSM-5 diagnostic criteria for nicotine use disorder: results from 734 Israeli lifetime smokers

PubMed Central

Shmulewitz, D.; Wall, M.M.; Aharonovich, E.; Spivak, B.; Weizman, A.; Frisch, A.; Grant, B. F.; Hasin, D.

2013-01-01

Background The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) proposes aligning nicotine use disorder (NUD) criteria with those for other substances, by including the current DSM fourth edition (DSM-IV) nicotine dependence (ND) criteria, three abuse criteria (neglect roles, hazardous use, interpersonal problems) and craving. Although NUD criteria indicate one latent trait, evidence is lacking on: (1) validity of each criterion; (2) validity of the criteria as a set; (3) comparative validity between DSM-5 NUD and DSM-IV ND criterion sets; and (4) NUD prevalence. Method Nicotine criteria (DSM-IV ND, abuse and craving) and external validators (e.g. smoking soon after awakening, number of cigarettes per day) were assessed with a structured interview in 734 lifetime smokers from an Israeli household sample. Regression analysis evaluated the association between validators and each criterion. Receiver operating characteristic analysis assessed the association of the validators with the DSM-5 NUD set (number of criteria endorsed) and tested whether DSM-5 or DSM-IV provided the most discriminating criterion set. Changes in prevalence were examined. Results Each DSM-5 NUD criterion was significantly associated with the validators, with strength of associations similar across the criteria. As a set, DSM-5 criteria were significantly associated with the validators, were significantly more discriminating than DSM-IV ND criteria, and led to increased prevalence of binary NUD (two or more criteria) over ND. Conclusions All findings address previous concerns about the DSM-IV nicotine diagnosis and its criteria and support the proposed changes for DSM-5 NUD, which should result in improved diagnosis of nicotine disorders. PMID:23312475
Validity of the modified back-saver sit-and-reach test: a comparison with other protocols.

PubMed

Hui, S S; Yuen, P Y

2000-09-01

Studies have shown that the classical sit-and-reach (CSR) test, the modified sit-and-reach (MSR), and the newly developed back-saver sit-and-reach (BS) test have poor criterion-related validity in estimating low-back flexibility but yielded moderate criterion-related validity in hamstring flexibility. The V sit-and-reach (VSR) test was found to be practical but the validity has not been established. The purpose of this study was to propose a modified back-saver sit-and-reach (MBS) test, which incorporated all advantages of the various protocols, and to compare the criterion-related validity and reliability of all these tests. 158 college students (F = 96, and M = 62; age = 20.77 +/- 2.51) performed CSR, VSR, BS (left and right leg), and MBS (left and right leg) tests in a randomized order. Scores from each test were then correlated with the criterion measures. For all sit-reach tests, intraclass reliability (single trial) was very high (r = 0.89-0.98). MBS yielded significant and highest r with low-back and hamstring criterion for men (r = 0.47-0.67) and women (r = 0.23-0.54). The low-back and right hamstring validity of MBS for men were significantly (P < 0.01) higher than those from BS and CSR, whereas no differences in criterion-related validity were found between the MBS and other protocols in women. The ratings of perceived comfort among the sit-and-reach protocols were significantly different (P < 0.001) from each other. The rating for MBS was observed the most comfortable test as compared with other protocols. The MBS test is not only a reliable test for hamstring and low-back flexibility, it is also a more practical with improved validity for hamstring and low-back flexibility in men than previous protocols.
Criterion-Referenced Testing in Foreign Language Teaching.

ERIC Educational Resources Information Center

Takala, Sauli

A review of literature serves as the basis for a discussion of various aspects of criterion-referenced tests. The aspects discussed are: teaching and evaluation objectives, criterion- and norm-referenced measurement, stages in construction of criterion-referenced tests, construction and selection of items, test validity, and test reliability.…
Is comorbidity in the eating disorders related to perceptions of parenting? Criterion validity of the revised Young Parenting Inventory.

PubMed

Sheffield, Alexandra; Waller, Glenn; Emanuelli, Francesca; Murray, James

2006-01-01

Recent studies support the reliability and validity of the Young Parenting Inventory-Revised (YPI-R) and its use in investigating the role of parenting in the aetiology and maintenance of eating pathology. However, criterion validity has yet to be fully established. To investigate one aspect of criterion validity, this study examines the association between parenting and comorbid problems in the eating disorders (including general psychopathology and impulsivity). The participants were 124 women with eating disorders. They completed the YPI-R and the Brief Symptom Inventory (BSI; a measure of general psychopathology). They were also interviewed about their use of a number of impulsive behaviours. YPI-R scales were significant predictors of one of the nine BSI scales, and distinguished those patients who did or did not use specific impulsive behaviours. The criterion validity of the YPI-R is partially supported with regards to general psychopathology and impulsivity. The findings highlight the specificity of the parenting styles measured by the YPI-R, and the need for further research using this tool.

Investigation of detonation velocity in heterogeneous explosive system using the reactive Burgers' analog

NASA Astrophysics Data System (ADS)

Di Labbio, G.; Kiyanda, C. B.; Mi, X.; Higgins, A. J.; Nikiforakis, N.; Ng, H. D.

2016-06-01

In this study, the applicability of the Chapman-Jouguet (CJ) criterion is tested numerically for heterogeneous explosive media using a simple detonation analog. The analog system consists of a reactive Burgers' equation coupled with an Arrhenius type reaction wave, and the heterogeneity of the explosive media is mimicked using a discrete energy source approach. The governing equation is solved using a second order, finite-volume approach and the average propagation velocity of the discrete detonation is determined by tracking the leading shock front. Consistent with previous studies, the averaged velocity of the leading shock front from the unsteady numerical simulations is also found to be in good agreement with the velocity of a CJ detonation in a uniform medium wherein the energy source is spatially homogenized. These simulations have thus implications for whether the CJ criterion is valid to predict the detonation velocity in heterogeneous explosive media.
The Inclusion of In-Plane Stresses in Delamination Criteria

NASA Technical Reports Server (NTRS)

Fenske, Matthew T.

1999-01-01

A study of delamination failure was conducted with emphasis on delamination criteria. Evidence is presented which supports the inclusion of the in-plane stresses in addition to the interlaminar stress terms in delamination criteria. The delamination is characterized as the failure of a resin rich region in between ply sets. The entire six component stress state in this resin layer is calculated through a finite element analysis, averaged over a dimension of 1.75 ply thicknesses, and used in a Modified von Mises Delamination Criterion. This criterion builds onto previous criteria by including all six stress components in the interply resin layer. The MVMDC shows good correlation to experimental data. The results show that the treatment of delamination as the failure of a finite interply resin layer is a valid method and that the MVMDC, considering the full stress state, accurately indicates delamination for different laminate families.
Criterion-Related Validity of the Distance- and Time-Based Walk/Run Field Tests for Estimating Cardiorespiratory Fitness: A Systematic Review and Meta-Analysis.

PubMed

Mayorga-Vega, Daniel; Bocanegra-Parrilla, Raúl; Ornelas, Martha; Viciana, Jesús

2016-01-01

The main purpose of the present meta-analysis was to examine the criterion-related validity of the distance- and time-based walk/run tests for estimating cardiorespiratory fitness among apparently healthy children and adults. Relevant studies were searched from seven electronic bibliographic databases up to August 2015 and through other sources. The Hunter-Schmidt's psychometric meta-analysis approach was conducted to estimate the population criterion-related validity of the following walk/run tests: 5,000 m, 3 miles, 2 miles, 3,000 m, 1.5 miles, 1 mile, 1,000 m, ½ mile, 600 m, 600 yd, ¼ mile, 15 min, 12 min, 9 min, and 6 min. From the 123 included studies, a total of 200 correlation values were analyzed. The overall results showed that the criterion-related validity of the walk/run tests for estimating maximum oxygen uptake ranged from low to moderate (rp = 0.42-0.79), with the 1.5 mile (rp = 0.79, 0.73-0.85) and 12 min walk/run tests (rp = 0.78, 0.72-0.83) having the higher criterion-related validity for distance- and time-based field tests, respectively. The present meta-analysis also showed that sex, age and maximum oxygen uptake level do not seem to affect the criterion-related validity of the walk/run tests. When the evaluation of an individual's maximum oxygen uptake attained during a laboratory test is not feasible, the 1.5 mile and 12 min walk/run tests represent useful alternatives for estimating cardiorespiratory fitness. As in the assessment with any physical fitness field test, evaluators must be aware that the performance score of the walk/run field tests is simply an estimation and not a direct measure of cardiorespiratory fitness.
Development and Validation of Criterion-Referenced Clinically Relevant Fitness Standards for Maintaining Physical Independence in Later Years

ERIC Educational Resources Information Center

Rikli, Roberta E.; Jones, C. Jessie

2013-01-01

Purpose: To develop and validate criterion-referenced fitness standards for older adults that predict the level of capacity needed for maintaining physical independence into later life. The proposed standards were developed for use with a previously validated test battery for older adults--the Senior Fitness Test (Rikli, R. E., & Jones, C. J.…
Criterion Validity of the Mood and Feelings Questionnaire for Depressive Episodes in Clinic and Non-Clinic Subjects

ERIC Educational Resources Information Center

Daviss, W. Burleson; Birmaher, Boris; Melhem, Nadine A.; Axelson, David A.; Michaels, Shana M.; Brent, David A.

2006-01-01

Background: Previous measures of pediatric depression have shown inconsistent validity in groups with differing demographics, comorbid diagnoses, and clinic or non-clinic origins. The current study re-examines the criterion validity of child- and parent-versions of the Mood and Feelings Questionnaire (MFQ-C, MFQ-P) in a heterogeneous sample of…
Identifying and measuring stakeholder preferences for disease prioritisation: A case study of the pig industry in Australia.

PubMed

Brookes, V J; Hernández-Jover, M; Neslo, R; Cowled, B; Holyoake, P; Ward, M P

2014-01-01

We describe stakeholder preference modelling using a combination of new and recently developed techniques to elicit criterion weights to incorporate into a multi-criteria decision analysis framework to prioritise exotic diseases for the pig industry in Australia. Australian pig producers were requested to rank disease scenarios comprising nine criteria in an online questionnaire. Parallel coordinate plots were used to visualise stakeholder preferences, which aided identification of two diverse groups of stakeholders - one group prioritised diseases with impacts on livestock, and the other group placed more importance on diseases with zoonotic impacts. Probabilistic inversion was used to derive weights for the criteria to reflect the values of each of these groups, modelling their choice using a weighted sum value function. Validation of weights against stakeholders' rankings for scenarios based on real diseases showed that the elicited criterion weights for the group who prioritised diseases with livestock impacts were a good reflection of their values, indicating that the producers were able to consistently infer impacts from the disease information in the scenarios presented to them. The highest weighted criteria for this group were attack rate and length of clinical disease in pigs, and market loss to the pig industry. The values of the stakeholders who prioritised zoonotic diseases were less well reflected by validation, indicating either that the criteria were inadequate to consistently describe zoonotic impacts, the weighted sum model did not describe stakeholder choice, or that preference modelling for zoonotic diseases should be undertaken separately from livestock diseases. Limitations of this study included sampling bias, as the group participating were not necessarily representative of all pig producers in Australia, and response bias within this group. The method used to elicit criterion weights in this study ensured value trade-offs between a range of potential impacts, and that the weights were implicitly related to the scale of measurement of disease criteria. Validation of the results of the criterion weights against real diseases - a step rarely used in MCDA - added scientific rigour to the process. The study demonstrated that these are useful techniques for elicitation of criterion weights for disease prioritisation by stakeholders who are not disease experts. Preference modelling for zoonotic diseases needs further characterisation in this context. Copyright © 2013 Elsevier B.V. All rights reserved.
Care dependency of hospitalized children: testing the Care Dependency Scale for Paediatrics in a cross-cultural comparison.

PubMed

Tork, Hanan; Dassen, Theo; Lohrmann, Christa

2009-02-01

This paper is a report of a study to examine the psychometric properties of the Care Dependency Scale for Paediatrics in Germany and Egypt and to compare the care dependency of school-age children in both countries. Cross-cultural differences in care dependency of older adults have been documented in the literature, but little is known about the differences and similarities with regard to children's care dependency in different cultures. A convenience sample of 258 school-aged children from Germany and Egypt participated in the study in 2005. The reliability of the Care Dependency Scale for Paediatrics was assessed in terms of internal consistency and interrater reliability. Factor analysis (principal component analysis) was employed to verify the construct validity. A Visual Analogue Scale was used to investigate the criterion-related validity. Good internal consistency was detected both for the Arabic and German versions. Factor analysis revealed one factor for both versions. A Pearson's correlation between the Care Dependency Scale for Paediatrics and Visual Analogue Scale was statistically significant for both versions indicating criterion-related validity. Statistically significant differences between the participants were detected regarding the mean sum score on the Care Dependency Scale for Paediatrics. The Care Dependency Scale for Paediatrics is a reliable and valid tool for assessing the care dependency of children and is recommended for assessing the care dependency of children from different ethnic origins. Differences in care dependency between German and Egyptian children were detected, which might be due to cultural differences.
Chinese cross-cultural adaptation and validation of the Foot Function Index as tool to measure patients with foot and ankle functional limitations.

PubMed

González-Sánchez, Manuel; Ruiz-Muñoz, Maria; Li, Guang Zhi; Cuesta-Vargas, Antonio I

2018-08-01

To perform a cross-cultural adaptation and validation of the Foot Function Index (FFI) questionnaire to develop the Chinese version. Three hundred and six patients with foot and ankle neuromusculoskeletal diseases participated in this observational study. Construct validity, internal consistency and criterion validity were calculated for the FFI Chinese version after the translation and transcultural adaptation process. Internal consistency ranged from 0.996 to 0.998. Test-retest analysis ranged from 0.985 to 0.994; minimal detectable change 90: 2.270; standard error of measurement: 0.973. Load distribution of the three factors had an eigenvalue greater than 1. Chi-square value was 9738.14 (p < 0.001). Correlations with the three factors were significant between Factor 1 and the other two: r = -0.634 (Factor 2) and r = -0.191 (Factor 1). Foot Function Index (Taiwan Version), Short-Form 12 (Version 2) and EuroQol-5D were used for criterion validity. Factors 1 and 2 showed significant correlation with 15/16 and 14/16 scales and subscales, respectively. Foot Function Index Chinese version psychometric characteristics were good to excellent. Chinese researchers and clinicians may use this tool for foot and ankle assessment and monitoring. Implications for rehabilitation A cross-cultural adaptation of the FFI has been done from original version to Chinese. Consistent results and satisfactory psychometric properties of the Foot Function Index Chinese version have been reported. For Chinese speaking researcher and clinician FFI-Ch could be used as a tool to assess patients with foot disease.
Reliability and validity of tongue color analysis in the prediction of symptom patterns in terms of East Asian Medicine.

PubMed

Park, Young-Jae; Lee, Jin-Moo; Yoo, Seung-Yeon; Park, Young-Bae

2016-04-01

To examine whether color parameters of tongue inspection (TI) using a digital camera was reliable and valid, and to examine which color parameters serve as predictors of symptom patterns in terms of East Asian medicine (EAM). Two hundred female subjects' tongue substances were photographed by a mega-pixel digital camera. Together with the photographs, the subjects were asked to complete Yin deficiency, Phlegm pattern, and Cold-Heat pattern questionnaires. Using three sets of digital imaging software, each digital image was exposure- and white balance-corrected, and finally L* (luminance), a* (red-green balance), and b* (yellow-blue balance) values of the tongues were calculated. To examine intra- and inter-rater reliabilities and criterion validity of the color analysis method, three raters were asked to calculate color parameters for 20 digital image samples. Finally, four hierarchical regression models were formed. Color parameters showed good or excellent reliability (0.627-0.887 for intra-class correlation coefficients) and significant criterion validity (0.523-0.718 for Spearman's correlation). In the hierarchical regression models, age was a significant predictor of Yin deficiency (β = 0.192), and b* value of the tip of the tongue was a determinant predictor of Yin deficiency, Phlegm, and Heat patterns (β = - 0.212, - 0.172, and - 0.163). Luminance (L*) was predictive of Yin deficiency (β = -0.172) and Cold (β = 0.173) pattern. Our results suggest that color analysis of the tongue using the L*a*b* system is reliable and valid, and that color parameters partially serve as symptom pattern predictors in EAM practice.
Invited review: Animal-based indicators for on-farm welfare assessment for dairy goats.

PubMed

Battini, M; Vieira, A; Barbieri, S; Ajuda, I; Stilwell, G; Mattiello, S

2014-11-01

This paper reviews animal-based welfare indicators to develop a valid, reliable, and feasible on-farm welfare assessment protocol for dairy goats. The indicators were considered in the light of the 4 accepted principles (good feeding, good housing, good health, appropriate behavior) subdivided into 12 criteria developed by the European Welfare Quality program. We will only examine the practical indicators to be used on-farm, excluding those requiring the use of specific instruments or laboratory analysis and those that are recorded at the slaughterhouse. Body condition score, hair coat condition, and queuing at the feed barrier or at the drinker seem the most promising indicators for the assessment of the "good feeding" principle. As to "good housing," some indicators were considered promising for assessing "comfort around resting" (e.g., resting in contact with a wall) or "thermal comfort" (e.g., panting score for the detection of heat stress and shivering score for the detection of cold stress). Several indicators related to "good health," such as lameness, claw overgrowth, presence of external abscesses, and hair coat condition, were identified. As to the "appropriate behavior" principle, different criteria have been identified: agonistic behavior is largely used as the "expression of social behavior" criterion, but it is often not feasible for on-farm assessment. Latency to first contact and the avoidance distance test can be used as criteria for assessing the quality of the human-animal relationship. Qualitative behavior assessment seems to be a promising indicator for addressing the "positive emotional state" criterion. Promising indicators were identified for most of the considered criteria; however, no valid indicator has been identified for "expression of other behaviors." Interobserver reliability has rarely been assessed and warrants further attention; in contrast, short-term intraobserver reliability is frequently assessed and some studies consider mid- and long-term reliability. The feasibility of most of the reviewed indicators in commercial farms still needs to be carefully evaluated, as several studies were performed under experimental conditions. Our review highlights some aspects of goat welfare that have been widely studied, but some indicators need to be investigated further and drafted before being included in a valid, reliable, and feasible welfare assessment protocol. The indicators selected and examined may be an invaluable starting point for the development of an on-farm welfare assessment protocol for dairy goats. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Brief report: The Brief Alcohol Social Density Assessment (BASDA): convergent, criterion-related, and incremental validity.

PubMed

MacKillop, James; Acker, John D; Bollinger, Jared; Clifton, Allan; Miller, Joshua D; Campbell, W Keith; Goodie, Adam S

2013-09-01

Alcohol misuse is substantially influenced by social factors, but systematic assessments of social network drinking are typically lengthy. The goal of the present study was to provide further validation of a brief measure of social network alcohol use, the Brief Alcohol Social Density Assessment (BASDA), in a sample of emerging adults. Specifically, the study sought to examine the BASDA's convergent, criterion, and incremental validity in relation to well-established measures of drinking motives and problematic drinking. Participants were 354 undergraduates who were assessed using the BASDA, the Alcohol Use Disorders Identification Test (AUDIT), and the Drinking Motives Questionnaire. Significant associations were observed between the BASDA index of alcohol-related social density and alcohol misuse, social motives, and conformity motives, supporting convergent validity. Criterion-related validity was supported by evidence that significantly greater alcohol involvement was present in the social networks of individuals scoring at or above an AUDIT score of 8, a validated criterion for hazardous drinking. Finally, the BASDA index was significantly associated with alcohol misuse above and beyond drinking motives in relation to AUDIT scores, supporting incremental validity. Taken together, these findings provide further support for the BASDA as an efficient measure of drinking in an individual's social network. Methodological considerations as well as recommendations for future investigations in this area are discussed.
Validation of a French adaptation of the Harvard Trauma Questionnaire among torture survivors from sub-Saharan African countries

PubMed Central

de Fouchier, Capucine; Blanchet, Alain; Hopkins, William; Bui, Eric; Ait-Aoudia, Malik; Jehel, Louis

2012-01-01

Background To date no validated instrument in the French language exists to screen for posttraumatic stress disorder (PTSD) in survivors of torture and organized violence. Objective The aim of this study is to adapt and validate the Harvard Trauma Questionnaire (HTQ) to this population. Method The adapted version was administered to 52 French-speaking torture survivors, originally from sub-Saharan African countries, receiving psychological treatment in specialized treatment centers. A structured clinical interview for DSM was also conducted in order to assess if they met criteria for PTSD. Results Cronbach's alpha coefficient for the HTQ Part 4 was adequate (0.95). Criterion validity was evaluated using receiver operating characteristic curve analysis that generated good classification accuracy for PTSD (0.83). At the original cut-off score of 2.5, the HTQ demonstrated high sensitivity and specificity (0.87 and 0.73, respectively). Conclusion Results support the reliability and validity of the French version of the HTQ. PMID:23233870
Identifying Insomnia in Early Pregnancy: Validation of the Insomnia Symptoms Questionnaire (ISQ) in Pregnant Women.

PubMed

Okun, Michele L; Buysse, Daniel J; Hall, Martica H

2015-06-15

Although a substantial number of pregnant women report symptoms of insomnia, few studies have used a validated instrument to determine the prevalence in early gestation. Identification of insomnia in pregnancy is vital given the strong connection between insomnia and the incidence of depression, cardiovascular disease, or immune dysregulation. The goal of this paper is to provide additional psychometric evaluation and validation of the Insomnia Symptom Questionnaire (ISQ) and to establish prevalence rates of insomnia among a cohort of pregnant women during early gestation. The ISQ was evaluated in 143 pregnant women at 12 weeks gestation. The internal consistency and criterion validity of the dichotomized ISQ were compared to traditional measures of sleep from sleep diaries, actigraphy, and the Pittsburgh Sleep Quality Index using indices of sensitivity, specificity, positive and negative predictive value (PPV, NPV), and likelihood ratio (LR) tests. The ISQ identified 12.6% of the sample as meeting a case definition of insomnia, consistent with established diagnostic criteria. Good reliability was established with Cronbach α = 0.86. The ISQ had high specificity (most > 85%), but sensitivity, PPV, NPV, and LRs varied according to which sleep measure was used as the validating criterion. Insomnia is a health problem for many pregnant women at all stages in pregnancy. These data support the validity and reliability of the ISQ to identify insomnia in pregnant women. The ISQ is a short and cost-effective tool that can be quickly employed in large observational studies or in clinical practice where perinatal women are seen. A commentary on this article appears in this issue on page 593. © 2015 American Academy of Sleep Medicine.
The Multimedia Activity Recall for Children and Adolescents (MARCA): development and evaluation.

PubMed

Ridley, Kate; Olds, Tim S; Hill, Alison

2006-05-26

Self-report recall questionnaires are commonly used to measure physical activity, energy expenditure and time use in children and adolescents. However, self-report questionnaires show low to moderate validity, mainly due to inaccuracies in recalling activity in terms of duration and intensity. Aside from recall errors, inaccuracies in estimating energy expenditure from self-report questionnaires are compounded by a lack of data on the energy cost of everyday activities in children and adolescents. This article describes the development of the Multimedia Activity Recall for Children and Adolescents (MARCA), a computer-delivered use-of-time instrument designed to address both the limitations of self-report recall questionnaires in children, and the lack of energy cost data in children. The test-retest reliability of the MARCA was assessed using a sample of 32 children (aged 11.8 +/- 0.7 y) who undertook the MARCA twice within 24-h. Criterion validity was assessed by comparing self-reports with accelerometer counts collected on a sample of 66 children (aged 11.6 +/- 0.8 y). Content and construct validity were assessed by establishing whether data collected using the MARCA on 1429 children (aged 11.9 +/- 0.8 y) exhibited relationships and trends in children's physical activity consistent with established findings from a number of previous research studies. Test-retest reliability was high with intra-class coefficients ranging from 0.88 to 0.94. The MARCA demonstrated criterion validity comparable to other self-report instruments with Spearman coefficients ranging from rho = 0.36 to 0.45, and provided evidence of good content and construct validity. The MARCA is a valid and reliable self-report questionnaire, capable of a wide variety of flexible use-of-time analyses related to both physical activity and sedentary behaviour, and offers advantages over existing pen-and-paper questionnaires.
[Reliability and Validity of the Behavioral Check List for Preschool Children to Measure Attention Deficit Hyperactivity Behaviors].

PubMed

Tsuno, Kanami; Yoshimasu, Kouichi; Hayashi, Takashi; Tatsuta, Nozomi; Ito, Yuki; Kamijima, Michihiro; Nakai, Kunihiko

2018-01-01

Nowadays, attention deficit hyperactivity (ADH) problems are observed commonly among school-age children. However, questionnaires specific to ADH behaviors among preschool children are very few. The aim of this study was to investigate the reliability and validity of the 25-item Behavioral Check List (BCL), which was developed from interviews of parents with children who were diagnosed as having Attention-deficit/hyperactivity disorder (ADHD) and measures ADH behaviors in preschool age. We recruited 22 teachers from 10 nurseries/kindergartens in Miyagi Prefecture, Japan. A total of 138 preschool children were assessed using the BCL. To investigate inter-rater reliability, two teachers from each facility assess seven to twenty children in their class, and intraclass correlation coefficients (ICCs) were calculated. The teachers additionally answered questions in the 1/5-5 Caregiver-Teacher Report Form (C-TRF) to investigate the criterion validity of the BCL. To investigate structural validity, exploratory factor analysis with promax rotation and confirmatory factor analysis were performed. The internal consistency reliability of the BCL was good (α = 0.92) and correlation analyses also confirmed its excellent criterion validity. Although exploratory factor analysis for the BCL yielded a five-factor model that consisted of a factor structure different from that of the original one, the results were similar to the original six factors. The ICCs of the BCL were 0.38-0.99 and it was not high enough for inter-rater reliability in some facilities. However, there is a possibility to improve it by giving raters adequate explanations when using BCL. The present study showed acceptable levels of reliability and validity of the BCL among Japanese preschool children.
Toppling Trains.

ERIC Educational Resources Information Center

Parry, Malcolm

1998-01-01

Explains a novel way of approaching centripetal force: theory is used to predict an orbital period at which a toy train will topple from a circular track. The demonstration has elements of prediction (a criterion for a good model) and suspense (a criterion for a good demonstration). The demonstration proved useful in undergraduate physics and…
The Missing Middle in Validation Research

ERIC Educational Resources Information Center

Taylor, Erwin K.; Griess, Thomas

1976-01-01

In most selection validation research, only the upper and lower tails of the criterion distribution are used, often yielding misleading or incorrect results. Provides formulas and tables which enable the researcher to account more accurately for the distribution of criterion within the middle range of population. (Author/RW)
Criterion-Related Validity of Sit-and-Reach Tests for Estimating Hamstring and Lumbar Extensibility: a Meta-Analysis

PubMed Central

Mayorga-Vega, Daniel; Merino-Marban, Rafael; Viciana, Jesús

2014-01-01

The main purpose of the present meta-analysis was to examine the scientific literature on the criterion-related validity of sit-and-reach tests for estimating hamstring and lumbar extensibility. For this purpose relevant studies were searched from seven electronic databases dated up through December 2012. Primary outcomes of criterion-related validity were Pearson´s zero-order correlation coefficients (r) between sit-and-reach tests and hamstrings and/or lumbar extensibility criterion measures. Then, from the included studies, the Hunter- Schmidt´s psychometric meta-analysis approach was conducted to estimate population criterion- related validity of sit-and-reach tests. Firstly, the corrected correlation mean (rp), unaffected by statistical artefacts (i.e., sampling error and measurement error), was calculated separately for each sit-and-reach test. Subsequently, the three potential moderator variables (sex of participants, age of participants, and level of hamstring extensibility) were examined by a partially hierarchical analysis. Of the 34 studies included in the present meta-analysis, 99 correlations values across eight sit-and-reach tests and 51 across seven sit-and-reach tests were retrieved for hamstring and lumbar extensibility, respectively. The overall results showed that all sit-and-reach tests had a moderate mean criterion-related validity for estimating hamstring extensibility (rp = 0.46-0.67), but they had a low mean for estimating lumbar extensibility (rp = 0. 16-0.35). Generally, females, adults and participants with high levels of hamstring extensibility tended to have greater mean values of criterion-related validity for estimating hamstring extensibility. When the use of angular tests is limited such as in a school setting or in large scale studies, scientists and practitioners could use the sit-and-reach tests as a useful alternative for hamstring extensibility estimation, but not for estimating lumbar extensibility. Key Points Overall sit-and-reach tests have a moderate mean criterion-related validity for estimating hamstring extensibility, but they have a low mean validity for estimating lumbar extensibility. Among all the sit-and-reach test protocols, the Classic sit-and-reach test seems to be the best option to estimate hamstring extensibility. End scores (e.g., the Classic sit-and-reach test) are a better indicator of hamstring extensibility than the modifications that incorporate fingers-to-box distance (e.g., the Modified sit-and-reach test). When angular tests such as straight leg raise or knee extension tests cannot be used, sit-and-reach tests seem to be a useful field test alternative to estimate hamstring extensibility, but not to estimate lumbar extensibility. PMID:24570599
Validation by simulation of a clinical trial model using the standardized mean and variance criteria.

PubMed

Abbas, Ismail; Rovira, Joan; Casanovas, Josep

2006-12-01

To develop and validate a model of a clinical trial that evaluates the changes in cholesterol level as a surrogate marker for lipodystrophy in HIV subjects under alternative antiretroviral regimes, i.e., treatment with Protease Inhibitors vs. a combination of nevirapine and other antiretroviral drugs. Five simulation models were developed based on different assumptions, on treatment variability and pattern of cholesterol reduction over time. The last recorded cholesterol level, the difference from the baseline, the average difference from the baseline and level evolution, are the considered endpoints. Specific validation criteria based on a 10% minus or plus standardized distance in means and variances were used to compare the real and the simulated data. The validity criterion was met by all models for considered endpoints. However, only two models met the validity criterion when all endpoints were considered. The model based on the assumption that within-subjects variability of cholesterol levels changes over time is the one that minimizes the validity criterion, standardized distance equal to or less than 1% minus or plus. Simulation is a useful technique for calibration, estimation, and evaluation of models, which allows us to relax the often overly restrictive assumptions regarding parameters required by analytical approaches. The validity criterion can also be used to select the preferred model for design optimization, until additional data are obtained allowing an external validation of the model.
Numerical and Experimental Validation of a New Damage Initiation Criterion

NASA Astrophysics Data System (ADS)

Sadhinoch, M.; Atzema, E. H.; Perdahcioglu, E. S.; van den Boogaard, A. H.

2017-09-01

Most commercial finite element software packages, like Abaqus, have a built-in coupled damage model where a damage evolution needs to be defined in terms of a single fracture energy value for all stress states. The Johnson-Cook criterion has been modified to be Lode parameter dependent and this Modified Johnson-Cook (MJC) criterion is used as a Damage Initiation Surface (DIS) in combination with the built-in Abaqus ductile damage model. An exponential damage evolution law has been used with a single fracture energy value. Ultimately, the simulated force-displacement curves are compared with experiments to validate the MJC criterion. 7 out of 9 fracture experiments were predicted accurately. The limitations and accuracy of the failure predictions of the newly developed damage initiation criterion will be discussed shortly.

Using Item Response Theory to Develop a 60-Item Representation of the NEO PI-R Using the International Personality Item Pool: Development of the IPIP-NEO-60.

PubMed

Maples-Keller, Jessica L; Williamson, Rachel L; Sleep, Chelsea E; Carter, Nathan T; Campbell, W Keith; Miller, Joshua D

2017-10-31

Given advantages of freely available and modifiable measures, an increase in the use of measures developed from the International Personality Item Pool (IPIP), including the 300-item representation of the Revised NEO Personality Inventory (NEO PI-R; Costa & McCrae, 1992a ) has occurred. The focus of this study was to use item response theory to develop a 60-item, IPIP-based measure of the Five-Factor Model (FFM) that provides equal representation of the FFM facets and to test the reliability and convergent and criterion validity of this measure compared to the NEO Five Factor Inventory (NEO-FFI). In an undergraduate sample (n = 359), scores from the NEO-FFI and IPIP-NEO-60 demonstrated good reliability and convergent validity with the NEO PI-R and IPIP-NEO-300. Additionally, across criterion variables in the undergraduate sample as well as a community-based sample (n = 757), the NEO-FFI and IPIP-NEO-60 demonstrated similar nomological networks across a wide range of external variables (r ICC = .96). Finally, as expected, in an MTurk sample the IPIP-NEO-60 demonstrated advantages over the Big Five Inventory-2 (Soto & John, 2017 ; n = 342) with regard to the Agreeableness domain content. The results suggest strong reliability and validity of the IPIP-NEO-60 scores.
Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese.

PubMed

Shinga-Ishihara, Chikako; Nakai, Yukie; Milgrom, Peter; Murakami, Kaori; Matsumoto-Nakano, Michiyo

2014-01-02

Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman's correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years by using the intraclass correlation coefficient analysis. The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that construct the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency were low to acceptable (Cronbach's alpha = 0.67 for the total scale, 0.46-0.61 for each subscale). Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, 47.4 ± 14.1, and 40.6 ± 11.3 for the first and second administrations in the second sample. The distribution of Dentocult SM score was 6.8% (score = 0), 34.4% (score = 1), 39.4% (score = 2), and 19.4% (score = 3). Participants with higher scores were more likely to have higher dietary cariogenicity scores (p < 0.001; Kruskal-Wallis test). These results provide the preliminary evidence for the reliability and validity of the Japanese language Food Frequency Questionnaire.
Relapse Risk Assessment for Schizophrenia Patients (RASP): A New Self-Report Screening Tool.

PubMed

Velligan, Dawn; Carpenter, William; Waters, Heidi C; Gerlanc, Nicole M; Legacy, Susan N; Ruetsch, Charles

2018-01-01

The Relapse Assessment for Schizophrenia Patients (RASP) was developed as a six-question self-report screener that measures indicators of Increased Anxiety and Social Isolation to assess patient stability and predict imminent relapse. This paper describes the development and psychometric characteristics of the RASP. The RASP and Positive and Negative Syndrome Scale (PANSS) were administered to patients with schizophrenia (n=166) three separate times. Chart data were collected on a subsample of patients (n=81). Psychometric analyses of RASP included tests of reliability, construct validity, and concurrent validity of items. Factors from RASP were correlated with subscales from PANSS (sensitivity to change and criterion validity [agreement between RASP and evidence of relapse]). Test-retest reliability returned modest to strong agreement at the item level and strong agreement at the questionnaire level. RASP showed good item response curves and internal consistency for the total instrument and within each of the two subscales (Increased Anxiety and Social Isolation). RASP Total Score and subscales showed good concurrent validity when correlated with PANSS Total Score, Positive, Excitement, and Anxiety subscales. RASP correctly predicted relapse in 67% of cases, with good specificity and negative predictive power and acceptable positive predictive power and sensitivity. The reliability and validity data presented support the use of RASP in settings where addition of a brief self-report assessment of relapse risk among patients with schizophrenia may be of benefit. Ease of use and scoring, and the ability to administer without clinical supervision allows for routine administration and assessment of relapse risk.
Adaptive model training system and method

DOEpatents

Bickford, Randall L; Palnitkar, Rahul M; Lee, Vo

2014-04-15

An adaptive model training system and method for filtering asset operating data values acquired from a monitored asset for selectively choosing asset operating data values that meet at least one predefined criterion of good data quality while rejecting asset operating data values that fail to meet at least the one predefined criterion of good data quality; and recalibrating a previously trained or calibrated model having a learned scope of normal operation of the asset by utilizing the asset operating data values that meet at least the one predefined criterion of good data quality for adjusting the learned scope of normal operation of the asset for defining a recalibrated model having the adjusted learned scope of normal operation of the asset.
Adaptive model training system and method

DOEpatents

Bickford, Randall L; Palnitkar, Rahul M

2014-11-18

An adaptive model training system and method for filtering asset operating data values acquired from a monitored asset for selectively choosing asset operating data values that meet at least one predefined criterion of good data quality while rejecting asset operating data values that fail to meet at least the one predefined criterion of good data quality; and recalibrating a previously trained or calibrated model having a learned scope of normal operation of the asset by utilizing the asset operating data values that meet at least the one predefined criterion of good data quality for adjusting the learned scope of normal operation of the asset for defining a recalibrated model having the adjusted learned scope of normal operation of the asset.
Ethical leadership: meta-analytic evidence of criterion-related and incremental validity.

PubMed

Ng, Thomas W H; Feldman, Daniel C

2015-05-01

This study examines the criterion-related and incremental validity of ethical leadership (EL) with meta-analytic data. Across 101 samples published over the last 15 years (N = 29,620), we observed that EL demonstrated acceptable criterion-related validity with variables that tap followers' job attitudes, job performance, and evaluations of their leaders. Further, followers' trust in the leader mediated the relationships of EL with job attitudes and performance. In terms of incremental validity, we found that EL significantly, albeit weakly in some cases, predicted task performance, citizenship behavior, and counterproductive work behavior-even after controlling for the effects of such variables as transformational leadership, use of contingent rewards, management by exception, interactional fairness, and destructive leadership. The article concludes with a discussion of ways to strengthen the incremental validity of EL. (PsycINFO Database Record (c) 2015 APA, all rights reserved).
Scaffold percolative efficiency: in vitro evaluation of the structural criterion for electrospun mats.

PubMed

Heidarkhan Tehrani, Ashkan; Zadhoush, Ali; Karbasi, Saeed; Sadeghi-Aliabadi, Hojjat

2010-11-01

Fibrous scaffolds of engineered structures can be chosen as promising porous environments when an approved criterion validates their applicability for a specific medical purpose. For such biomaterials, this paper sought to investigate various structural characteristics in order to determine whether they are appropriate descriptors. A number of poly(3-hydroxybutyrate) scaffolds were electrospun; each of which possessed a distinguished architecture when their material and processing conditions were altered. Subsequent culture of mouse fibroblast cells (L929) was carried out to evaluate the cells viability on each scaffold after their attachment for 24 h and proliferation for 48 and 72 h. The scaffolds' porosity, pores number, pores size and distribution were quantified and none could establish a relationship with the viability results. Virtual reconstruction of the mats introduced an authentic criterion, "Scaffold Percolative Efficiency" (SPE), with which the above descriptors were addressed collectively. It was hypothesized to be able to quantify the efficacy of fibrous scaffolds by considering the integration of porosity and interconnectivity of the pores. There was a correlation of 80% as a good agreement between the SPE values and the spectrophotometer absorbance of viable cells; a viability of more than 350% in comparison to that of the controls.
The reliability and validity of a child and adolescent participation in decision-making questionnaire.

PubMed

O'Hare, L; Santin, O; Winter, K; McGuinness, C

2016-09-01

There is a growing impetus across the research, policy and practice communities for children and young people to participate in decisions that affect their lives. Furthermore, there is a dearth of general instruments that measure children and young people's views on their participation in decision-making. This paper presents the reliability and validity of the Child and Adolescent Participation in Decision-Making Questionnaire (CAP-DMQ) and specifically looks at a population of looked-after children, where a lack of participation in decision-making is an acute issue. The participants were 151 looked after children and adolescents between 10-23 years of age who completed the 10 item CAP-DMQ. Of the participants 113 were in receipt of an advocacy service that had an aim of increasing participation in decision-making with the remaining participants not having received this service. The results showed that the CAP-DMQ had good reliability (Cronbach's alpha = 0.94) and showed promising uni-dimensional construct validity through an exploratory factor analysis. The items in the CAP-DMQ also demonstrated good content validity by overlapping with prominent models of child and adolescent participation (Lundy 2007) and decision-making (Halpern 2014). A regression analysis showed that age and gender were not significant predictors of CAP-DMQ scores but receipt of advocacy was a significant predictor of scores (effect size d = 0.88), thus showing appropriate discriminant criterion validity. Overall, the CAP-DMQ showed good reliability and validity. Therefore, the measure has excellent promise for theoretical investigation in the area of child and adolescent participation in decision-making and equally shows empirical promise for use as a measure in evaluating services, which have increasing the participation of children and adolescents in decision-making as an intended outcome. © 2016 John Wiley & Sons Ltd.
Reliability and validity of the Chinese versions of self-efficacy and outcome expectations for osteoporosis medication adherence scales in Chinese immigrants.

PubMed

Qi, Bing-Bing; Resnick, Barbara

2014-01-01

To assess the psychometric properties of Chinese versions self-efficacy and outcome expectations on osteoporosis medication adherence (SEOMA-C and OEOMA-C) scales. Back-translated tools were assessed by internal consistency and R2 by structured equation modeling, confirmatory factor analyses, hypothesis testing, and criterion-related validity among 110 (81 females, 29 males) Mandarin-speaking immigrants (mean age = 63.44, SD = 9.63). The Cronbach's alpha for SEOMA-C and OEOMA-C is .904 and .937, respectively. There was fair and good fit of the measurement model to the data. Previous bone mineral density (BMD) testing, calcaneus BMD, self-efficacy for exercise, and osteoporosis medication adherence were positively related to SEOMA-C scores. These scales constitute some preliminary validity and reliability. Further refined and cultural sensitive items could be explored and added.
Validation of a Type 2 Diabetes Screening Tool in Rural Honduras

PubMed Central

Milton, Evan C.; Herman, William H.; Aiello, Allison E.; Danielson, Kris R.; Mendoza-Avelarez, Milton O.; Piette, John D.

2010-01-01

OBJECTIVE To validate a low-cost tool for identifying diabetic patients in rural areas of Latin America. RESEARCH DESIGN AND METHODS A regression equation incorporating postprandial time and a random plasma glucose was used to screen 800 adults in Honduras. Patients with a probability of diabetes of ≥20% were asked to return for a fasting plasma glucose (FPG). A random fifth of those with a screener-based probability of diabetes <20% were also asked to return for follow-up. The gold standard was an FPG ≥126 mg/dl. RESULTS The screener had very good test characteristics (area under the receiver operating characteristic curve = 0.89). Using the screening criterion of ≥0.42, the equation had a sensitivity of 74.1% and specificity of 97.2%. CONCLUSIONS This screener is a valid measure of diabetes risk in Honduras and could be used to identify diabetic patients in poor clinics in Latin America. PMID:19918008
Adaptation to Portuguese of the Depression, Anxiety and Stress Scales (DASS).

PubMed

Apóstolo, João Luís Alves; Mendes, Aida Cruz; Azeredo, Zaida Aguiar

2006-01-01

To adapt to Portuguese, of Portugal, the Depression, Anxiety and Stress Scales, a 21-item short scale (DASS 21), designed to measure depression, anxiety and stress. After translation and back-translation with the help of experts, the DASS 21 was administered to patients in external psychiatry consults (N=101), and its internal consistency, construct validity and concurrent validity were measured. The DASS 21 properties certify its quality to measure emotional states. The instrument reveals good internal consistency. Factorial analysis shows that the two-factor structure is more adequate. The first factor groups most of the items that theoretically assess anxiety and stress, and the second groups most of the items that assess depression, explaining, on the whole, 58.54% of total variance. The strong positive correlation between the DASS 21 and the Hospital Anxiety and Depression scale (HAD) confirms the hypothesis regarding the criterion validity, however, revealing fragilities as to the divergence between theoretically different constructs.
Psychopathy in Bulgaria: The cross-cultural generalizability of the Hare Psychopathy Checklist

PubMed Central

Wilson, Michael J.; Abramowitz, Carolyn; Vasilev, Georgi; Bozgunov, Kiril; Vassileva, Jasmin

2014-01-01

The generalizability of the psychopathy construct to Eastern European cultures has not been well-studied, and no prior studies have evaluated psychopathy in non-offender samples from this population. The current validation study examines the factor structure, internal consistency, and external validity of the Bulgarian translation of the Hare Psychopathy Checklist: Screening Version. Two hundred sixty-two Bulgarian adults from the general community were assessed, of which 185 had a history of substance dependence. Confirmatory factor analysis indicated good fit for the two-, three-, and four-factor models of psychopathy. Zero-order and partial correlation analyses were conducted between the two factors of psychopathy and criterion measures of antisocial behavior, internalizing and externalizing psychopathology, personality traits, addictive disorders and demographic characteristics. Relationships to external variables provided evidence for the convergent and discriminant validity of the psychopathy construct in a Bulgarian community sample. PMID:25313268
Developing and validating the Youth Conduct Problems Scale-Rwanda: a mixed methods approach.

PubMed

Ng, Lauren C; Kanyanganzi, Frederick; Munyanah, Morris; Mushashi, Christine; Betancourt, Theresa S

2014-01-01

This study developed and validated the Youth Conduct Problems Scale-Rwanda (YCPS-R). Qualitative free listing (n = 74) and key informant interviews (n = 47) identified local conduct problems, which were compared to existing standardized conduct problem scales and used to develop the YCPS-R. The YCPS-R was cognitive tested by 12 youth and caregiver participants, and assessed for test-retest and inter-rater reliability in a sample of 64 youth. Finally, a purposive sample of 389 youth and their caregivers were enrolled in a validity study. Validity was assessed by comparing YCPS-R scores to conduct disorder, which was diagnosed with the Mini International Neuropsychiatric Interview for Children, and functional impairment scores on the World Health Organization Disability Assessment Schedule Child Version. ROC analyses assessed the YCPS-R's ability to discriminate between youth with and without conduct disorder. Qualitative data identified a local presentation of youth conduct problems that did not match previously standardized measures. Therefore, the YCPS-R was developed solely from local conduct problems. Cognitive testing indicated that the YCPS-R was understandable and required little modification. The YCPS-R demonstrated good reliability, construct, criterion, and discriminant validity, and fair classification accuracy. The YCPS-R is a locally-derived measure of Rwandan youth conduct problems that demonstrated good psychometric properties and could be used for further research.
Measuring the needs of mental health patients in Greece: reliability and validity of the Greek version of the Camberwell assessment of need.

PubMed

Stefanatou, Pentagiotissa; Giannouli, Eleni; Konstantakopoulos, George; Vitoratou, Silia; Mavreas, Venetsanos

2014-11-01

Evaluation of mental health services based on patients' needs assessments has never taken place in Greece, although it is a crucial factor for the efficient use of their limited resources. To examine the inter-rater and test-retest reliability and the concurrent/convergent validity of the Greek research version of the Camberwell Assessment of Need-Research (CAN-R). A total of 53 schizophrenic patient-staff pairs were interviewed twice to test the inter-rater and test-retest reliability of the Greek version of the CAN-R. The World Health Organization Quality of Life-Brief Form (WHOQOL-BREF) and World Health Organization Disability Assessment Schedule-2.0 (WHODAS-2.0) were administered to the patients to examine concurrent validity. The inter-rater and test-retest reliability of patient and staff interviews for the 22 individual items and the eight summary scores of the instrument's four sections were good to excellent. Significant correlations emerged between CAN scores and the WHOQOL-BREF and WHODAS-2.0 domains for both patient and staff ratings, indicating good concurrent validity. Our results suggest that the Greek version of the CAN-R is a reliable instrument for assessing mental health patients' needs. Moreover, it is the first CAN-R validity study with satisfactory results using WHOQOL-BREF and WHODAS-2.0 as criterion variables. © The Author(s) 2013.
Earing Prediction in Cup Drawing using the BBC2008 Yield Criterion

NASA Astrophysics Data System (ADS)

Vrh, Marko; Halilovič, Miroslav; Starman, Bojan; Štok, Boris; Comsa, Dan-Sorin; Banabic, Dorel

2011-08-01

The paper deals with constitutive modelling of highly anisotropic sheet metals. It presents FEM based earing predictions in cup drawing simulation of highly anisotropic aluminium alloys where more than four ears occur. For that purpose the BBC2008 yield criterion, which is a plane-stress yield criterion formulated in the form of a finite series, is used. Thus defined criterion can be expanded to retain more or less terms, depending on the amount of given experimental data. In order to use the model in sheet metal forming simulations we have implemented it in a general purpose finite element code ABAQUS/Explicit via VUMAT subroutine, considering alternatively eight or sixteen parameters (8p and 16p version). For the integration of the constitutive model the explicit NICE (Next Increment Corrects Error) integration scheme has been used. Due to the scheme effectiveness the CPU time consumption for a simulation is comparable to the time consumption of built-in constitutive models. Two aluminium alloys, namely AA5042-H2 and AA2090-T3, have been used for a validation of the model. For both alloys the parameters of the BBC2008 model have been identified with a developed numerical procedure, based on a minimization of the developed cost function. For both materials, the predictions of the BBC2008 model prove to be in very good agreement with the experimental results. The flexibility and the accuracy of the model together with the identification and integration procedure guarantee the applicability of the BBC2008 yield criterion in industrial applications.
Validity and Reliability of Criterion-Referenced Measures: Issues and Procedures for Special Educators.

ERIC Educational Resources Information Center

Harris, Larry P.; Wolf, Steven R.

1979-01-01

The article focuses on the controversy over norm-referenced v criterion-referenced measures (CRM) in assessment of learning disorders. The authors contend that while the reliability of CRMs is generally indisputable, the validity of measures designed from local curricula is still dependent on the intuitive judgments of teachers. (Author/SBH)
Validation of the Military Entrance Physical Strength Capacity Test. Technical Report 610.

ERIC Educational Resources Information Center

Myers, David C.; And Others

A battery of physical ability tests was validated using a predictive, criterion-related strategy. The battery was given to 1,003 female soldiers and 980 male soldiers before they had begun Army Basic Training. Criterion measures which represented physical competency in Basic Training (physical proficiency tests, sick call, profiles, and separation…
The Benchmarking Capacity of a General Outcome Measure of Academic Language in Science and Social Studies

ERIC Educational Resources Information Center

Mooney, Paul; Lastrapes, Renée E.

2016-01-01

The amount of research evaluating the technical merits of general outcome measures of science and social studies achievement is growing. This study targeted criterion validity for critical content monitoring. Questions addressed the concurrent criterion validity of alternate presentation formats of critical content monitoring and the measure's…
Validation of a Criterion Referenced Test for Young Handicapped Children: PIPER.

ERIC Educational Resources Information Center

Strum, Irene; Shapiro, Madelaine

The purpose of this study was to validate the Prescriptive Instructional Program for Educational Readiness (PIPER) for utilization as a criterion referenced test (CRT) among learning disabled children. The program consisted of behavioral objectives and diagnostic and/or mastery tasks and activities for each objective in the area of gross motor…
Evaluation of Weighted Scale Reliability and Criterion Validity: A Latent Variable Modeling Approach

ERIC Educational Resources Information Center

Raykov, Tenko

2007-01-01

A method is outlined for evaluating the reliability and criterion validity of weighted scales based on sets of unidimensional measures. The approach is developed within the framework of latent variable modeling methodology and is useful for point and interval estimation of these measurement quality coefficients in counseling and education…

Meta-Analysis of Criterion Validity for Curriculum-Based Measurement in Written Language

ERIC Educational Resources Information Center

Romig, John Elwood; Therrien, William J.; Lloyd, John W.

2017-01-01

We used meta-analysis to examine the criterion validity of four scoring procedures used in curriculum-based measurement of written language. A total of 22 articles representing 21 studies (N = 21) met the inclusion criteria. Results indicated that two scoring procedures, correct word sequences and correct minus incorrect sequences, have acceptable…
Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

PubMed

Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

2018-01-01

The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.
Criterion-Related Validity of the Distance- and Time-Based Walk/Run Field Tests for Estimating Cardiorespiratory Fitness: A Systematic Review and Meta-Analysis

PubMed Central

Mayorga-Vega, Daniel; Bocanegra-Parrilla, Raúl; Ornelas, Martha; Viciana, Jesús

2016-01-01

Objectives The main purpose of the present meta-analysis was to examine the criterion-related validity of the distance- and time-based walk/run tests for estimating cardiorespiratory fitness among apparently healthy children and adults. Materials and Methods Relevant studies were searched from seven electronic bibliographic databases up to August 2015 and through other sources. The Hunter-Schmidt’s psychometric meta-analysis approach was conducted to estimate the population criterion-related validity of the following walk/run tests: 5,000 m, 3 miles, 2 miles, 3,000 m, 1.5 miles, 1 mile, 1,000 m, ½ mile, 600 m, 600 yd, ¼ mile, 15 min, 12 min, 9 min, and 6 min. Results From the 123 included studies, a total of 200 correlation values were analyzed. The overall results showed that the criterion-related validity of the walk/run tests for estimating maximum oxygen uptake ranged from low to moderate (rp = 0.42–0.79), with the 1.5 mile (rp = 0.79, 0.73–0.85) and 12 min walk/run tests (rp = 0.78, 0.72–0.83) having the higher criterion-related validity for distance- and time-based field tests, respectively. The present meta-analysis also showed that sex, age and maximum oxygen uptake level do not seem to affect the criterion-related validity of the walk/run tests. Conclusions When the evaluation of an individual’s maximum oxygen uptake attained during a laboratory test is not feasible, the 1.5 mile and 12 min walk/run tests represent useful alternatives for estimating cardiorespiratory fitness. As in the assessment with any physical fitness field test, evaluators must be aware that the performance score of the walk/run field tests is simply an estimation and not a direct measure of cardiorespiratory fitness. PMID:26987118
Increased Activity or Energy as a Primary Criterion for the Diagnosis of Bipolar Mania in DSM-5: Findings From the STEP-BD Study.

PubMed

Machado-Vieira, Rodrigo; Luckenbaugh, David A; Ballard, Elizabeth D; Henter, Ioline D; Tohen, Mauricio; Suppes, Trisha; Zarate, Carlos A

2017-01-01

DSM-5 describes "a distinct period of abnormally and persistently elevated, expansive, or irritable mood and abnormally and persistently increased activity or energy" as a primary criterion for mania. Thus, increased energy or activity is now considered a core symptom of manic and hypomanic episodes. Using data from the Systematic Treatment Enhancement Program for Bipolar Disorder study, the authors analyzed point prevalence data obtained at the initial visit to assess the diagnostic validity of this new DSM-5 criterion. The study hypothesis was that the DSM-5 criterion would alter the prevalence of mania and/or hypomania. The authors compared prevalence, clinical characteristics, validators, and outcome in patients meeting the DSM-5 criteria (i.e., DSM-IV criteria plus the DSM-5 criterion of increased activity or energy) and those who did not meet the new DSM-5 criterion (i.e., who only met DSM-IV criteria). All 4,360 participants met DSM-IV criteria for bipolar disorder, and 310 met DSM-IV criteria for a manic or hypomanic episode. When the new DSM-5 criterion of increased activity or energy was added as a coprimary symptom, the prevalence of mania and hypomania was reduced. Although minor differences were noted in clinical and concurrent validators, no changes were observed in longitudinal outcomes. The findings confirm that including increased activity or energy as part of DSM-5 criterion A decreases the prevalence of manic and hypomanic episodes but does not affect longitudinal clinical outcomes.
Could situational judgement tests be used for selection into dental foundation training?

PubMed

Patterson, F; Ashworth, V; Mehra, S; Falcon, H

2012-07-13

To pilot and evaluate a machine-markable situational judgement test (SJT) designed to select candidates into UK dental foundation training. Single centre pilot study. UK postgraduate deanery in 2010. Seventy-four candidates attending interview for dental foundation training in Oxford and Wessex Deaneries volunteered to complete the situational judgement test. The situational judgement test was developed to assess relevant professional attributes for dentistry (for example, empathy and integrity) in a machine-markable format. Test content was developed by subject matter experts working with experienced psychometricians. Evaluation of psychometric properties of the pilot situational judgement test (for example, reliability, validity and fairness). Scores in the dental foundation training selection process (short-listing and interviews) were used to examine criterion-related validity. Candidates completed an evaluation questionnaire to examine candidate reactions and face validity of the new test. Forty-six candidates were female and 28 male; mean age was 23.5-years-old (range 22-32). Situational judgement test scores were normally distributed and the test showed good internal reliability when corrected for test length (α = 0.74). Situational judgement test scores positively correlated with the management, leadership and professionalism interview (N = 50; r = 0.43, p <0.01) but not with the clinical skills interview, providing initial evidence of criterion-related validity as the situational judgement test is designed to test non-cognitive professional attributes beyond clinical knowledge. Most candidates perceived the situational judgement test as relevant to dentistry, appropriate for their training level, and fair. This initial pilot study suggests that a situational judgement test is an appropriate and innovative method to measure professional attributes (eg empathy and integrity) for selection into foundation training. Further research will explore the long-term predictive validity of the situational judgement test once candidates have entered training.
2016 Revisions to the 2010/2011 fibromyalgia diagnostic criteria.

PubMed

Wolfe, Frederick; Clauw, Daniel J; Fitzcharles, Mary-Ann; Goldenberg, Don L; Häuser, Winfried; Katz, Robert L; Mease, Philip J; Russell, Anthony S; Russell, Irwin Jon; Walitt, Brian

2016-12-01

The provisional criteria of the American College of Rheumatology (ACR) 2010 and the 2011 self-report modification for survey and clinical research are widely used for fibromyalgia diagnosis. To determine the validity, usefulness, potential problems, and modifications required for the criteria, we assessed multiple research reports published in 2010-2016 in order to provide a 2016 update to the criteria. We reviewed 14 validation studies that compared 2010/2011 criteria with ACR 1990 classification and clinical criteria, as well as epidemiology, clinical, and databank studies that addressed important criteria-level variables. Based on definitional differences between 1990 and 2010/2011 criteria, we interpreted 85% sensitivity and 90% specificity as excellent agreement. Against 1990 and clinical criteria, the median sensitivity and specificity of the 2010/2011 criteria were 86% and 90%, respectively. The 2010/2011 criteria led to misclassification when applied to regional pain syndromes, but when a modified widespread pain criterion (the "generalized pain criterion") was added misclassification was eliminated. Based on the above data and clinic usage data, we developed a (2016) revision to the 2010/2011 fibromyalgia criteria. Fibromyalgia may now be diagnosed in adults when all of the following criteria are met: CONCLUSIONS: The fibromyalgia criteria have good sensitivity and specificity. This revision combines physician and questionnaire criteria, minimizes misclassification of regional pain disorders, and eliminates the previously confusing recommendation regarding diagnostic exclusions. The physician-based criteria are valid for individual patient diagnosis. The self-report version of the criteria is not valid for clinical diagnosis in individual patients but is valid for research studies. These changes allow the criteria to function as diagnostic criteria, while still being useful for classification. Copyright © 2016 Elsevier Inc. All rights reserved.
Assessment of the psychometric properties of the Spanish language version of questionnaire ICIQ-Male Lower Urinary Tract Symptoms (ICIQ-MLUTS).

PubMed

Castro-Díaz, D M; Esteban-Fuertes, M; Salinas-Casado, J; Bustamante-Alarma, S; Gago-Ramos, J L; Galacho-Bech, A; García-Matres, M J; Rodríguez-Toves, L A; Zubiaur-Líbano, C; Collado-Serra, A; Batista-Miranda, J E; Ortiz-Gámiz, A

2014-03-01

To evaluate the psychometric properties of the Spanish version of the ICIQ-Male Lower Urinary Tract Symptoms Questionnaire (ICIQ-MLUTS): Feasibility (% of completion and ceiling/ground effects), reliability (Test-retest), convergent validity (vs Bladder Control Self-Assessment Questionnaire [BSAQ] and vs International Prostate Symptom Score [I-PSS]) and criterion validity (according to presence or absence of symptoms). This was an observational, non-interventionist and multicenter study. 223 male patients with lower urinary tract symptoms (LUTS), predominantly storage symptoms and aged 18-65, took part in the study. Patients completed the ICIQ-MLUTS (test), I-PSS and BSAQ questionnaires and referred their urinary symptoms in a single visit, with the exception of a subgroup composed by 49 patients that completed the questionnaire again 15 days after initial visit to evaluate test-retest reliability. The questionnaire includes 13 items divided in 2 sub-scales: Voiding symptoms (V) from 0-20 and Incontinence symptoms (I) from 0-24. Percentage of patients that completed all items: 98.84%. Ground effect is 0 and ceiling effect was under 6% in both sub-scales. Test-retest reliability: Intraclass correlation coefficient (ICC) ranged from 0.68 to 0.88, except on Delay. Kappa shows a good agreement, between 0.60 and 0.81, except for Nocturia. Convergent validity: Correlation (Spearman) between the questionnaire sub-scales scores and the rest of measures is statistically significant (P < .01 and P < .05). Criterion validity: Statistically significant differences (P < .05) between scores on ICIQ-MLUTS, from patients that refer experiencing symptoms and those who do not. The Spanish version of the ICIQ-MLUTS questionnaire shows adequate feasibility, reliability and validity. Copyright © 2013 AEU. Published by Elsevier Espana. All rights reserved.
Validity and reliability of the Japanese version of the Newest Vital Sign: a preliminary study.

PubMed

Kogure, Takamichi; Sumitani, Masahiko; Suka, Machi; Ishikawa, Hirono; Odajima, Takeshi; Igarashi, Ataru; Kusama, Makiko; Okamoto, Masako; Sugimori, Hiroki; Kawahara, Kazuo

2014-01-01

Health literacy (HL) refers to the ability to obtain, process, and understand basic health information and services, and is thus needed to make appropriate health decisions. The Newest Vital Sign (NVS) is comprised of 6 questions about an ice cream nutrition label and assesses HL numeracy skills. We developed a Japanese version of the NVS (NVS-J) and evaluated the validity and reliability of the NVS-J in patients with chronic pain. The translation of the original NVS into Japanese was achieved as per the published guidelines. An observational study was subsequently performed to evaluate the validity and reliability of the NVS-J in 43 Japanese patients suffering from chronic pain. Factor analysis with promax rotation, using the Kaiser criterion (eigenvalues ≥1.0), and a scree plot revealed that the main component of the NVS-J consists of three determinative factors, and each factor consists of two NVS-J items. The criterion-related validity of the total NVS-J score was significantly correlated with the total score of Ishikawa et al.'s self-rated HL Questionnaire, the clinical global assessment of comprehensive HL level, cognitive function, and the Brinkman index. In addition, Cronbach's coefficient for the total score of the NVS-J was adequate (alpha = 0.72). This study demonstrated that the NVS-J has good validity and reliability. Further, the NVS-J consists of three determinative factors: "basic numeracy ability," "complex numeracy ability," and "serious-minded ability." These three HL abilities comprise a 3-step hierarchical structure. Adequate HL should be promoted in chronic pain patients to enable coping, improve functioning, and increase activities of daily living (ADLs) and quality of life (QOL).
State of the art in the validation of screening methods for the control of antibiotic residues: is there a need for further development?

PubMed

Gaudin, Valérie

2017-09-01

Screening methods are used as a first-line approach to detect the presence of antibiotic residues in food of animal origin. The validation process guarantees that the method is fit-for-purpose, suited to regulatory requirements, and provides evidence of its performance. This article is focused on intra-laboratory validation. The first step in validation is characterisation of performance, and the second step is the validation itself with regard to pre-established criteria. The validation approaches can be absolute (a single method) or relative (comparison of methods), overall (combination of several characteristics in one) or criterion-by-criterion. Various approaches to validation, in the form of regulations, guidelines or standards, are presented and discussed to draw conclusions on their potential application for different residue screening methods, and to determine whether or not they reach the same conclusions. The approach by comparison of methods is not suitable for screening methods for antibiotic residues. The overall approaches, such as probability of detection (POD) and accuracy profile, are increasingly used in other fields of application. They may be of interest for screening methods for antibiotic residues. Finally, the criterion-by-criterion approach (Decision 2002/657/EC and of European guideline for the validation of screening methods), usually applied to the screening methods for antibiotic residues, introduced a major characteristic and an improvement in the validation, i.e. the detection capability (CCβ). In conclusion, screening methods are constantly evolving, thanks to the development of new biosensors or liquid chromatography coupled to tandem-mass spectrometry (LC-MS/MS) methods. There have been clear changes in validation approaches these last 20 years. Continued progress is required and perspectives for future development of guidelines, regulations and standards for validation are presented here.
Development and psychometric testing the Health of Body, Mind and Spirit Scale for assessing individuals who have drug abuse histories.

PubMed

Sun, Fan-Ko; Chiang, Chun-Ying; Lu, Chu-Yun; Yu, Pei-Jane; Liao, Tzu-Chiao; Lan, Chu-Mei

2018-03-01

To develop the Health of Body, Mind and Spirit Scale (HBMSS), which was designed to assess drug abusers' health condition. Helping drug abusers to become healthy is important to healthcare professionals. However, no instrument exists to assess drug abusers' state of health. A cross-sectional questionnaire survey was implemented to examine the validity of the HBMSS. Data were collected from 2015-2016 at one drug abuse prevention centre in Taiwan. Participants (N = 320) who had abused drugs were invited to complete a preliminary 64-item version of the HBMSS. An item analysis, criterion-related validity analysis (using the Relapse Prediction Scale [RPS] score), split-half reliability testing and confirmatory factor analysis (CFA) were conducted to examine the psychometric properties of the HBMSS. The final version of the HBMSS contained 15 items that were divided into three subscales: the health of the body, mind and spirit. Cronbach's α and split-half reliability coefficients were all above .85. The factor loading of each item was between .74-.95. The HBMSS had satisfactory criterion-related validity with the RPS score (r = -.50, p < .001). A second-order CFA was conducted on the HBMSS. The fit indexes were good, χ 2 = 184.060, df = 94, χ 2 /df = 1.958 (p = .000). The entire HBMSS and the subscales had satisfactory reliability and validity. Healthcare professionals could use the HBMSS to evaluate the condition of the health of individuals with a drug abuse history. © 2017 John Wiley & Sons Ltd.
A criterion for maximum resin flow in composite materials curing process

NASA Astrophysics Data System (ADS)

Lee, Woo I.; Um, Moon-Kwang

1993-06-01

On the basis of Springer's resin flow model, a criterion for maximum resin flow in autoclave curing is proposed. Validity of the criterion was proved for two resin systems (Fiberite 976 and Hercules 3501-6 epoxy resin). The parameter required for the criterion can be easily estimated from the measured resin viscosity data. The proposed criterion can be used in establishing the proper cure cycle to ensure maximum resin flow and, thus, the maximum compaction.
[Toward a deeper understanding of motivation towards exercise: measurement of integrated regulation in the Spanish context].

PubMed

González-Cutre, David; Sicilia, Álvaro; Fernández, Alberto

2010-11-01

The purpose of this study was to validate the Behavioural Regulation in Exercise Questionnaire in the Spanish context, including items to measure integrated regulation. Participants were 524 exercisers, mean age 29.59 years. The results revealed acceptable fit indices in the confirmatory factor analysis and good internal consistency (with a Cronbach alpha of .87 for integrated regulation). The diverse subscales also conformed to a simplex pattern and the factor structure was invariant across gender and age. Integrated regulation reflected high temporal stability over a 4-week period (ICC=.90). The criterion validity analysis of integrated regulation indicated that this variable was positively predicted by satisfaction of the needs for competence and autonomy. The results regarding the importance of measuring integrated regulation in exercise are discussed.
Danish VISA-A questionnaire with validation and reliability testing for Danish-speaking Achilles tendinopathy patients.

PubMed

Iversen, J V; Bartels, E M; Jørgensen, J E; Nielsen, T G; Ginnerup, C; Lind, M C; Langberg, H

2016-12-01

The VISA-A questionnaire has proven to be a valid and reliable tool for assessing severity of Achilles tendinopathy (AT). The aim was to translate and cross-culturally adapt the VISA-A questionnaire for a Danish-speaking AT population, and subsequently perform validity and reliability tests. Translation and following cross-cultural adaptation was performed as translation, synthesis, reverse translation, expert review, and pretesting. The final Danish version (VISA-A-DK) was tested for reliability on healthy controls (n = 75) and patients (n = 36). Tests for internal consistency, validity, and structure were performed on 71 patients. VISA-A-DK showed good reliability for patients (r = 0.80 ICC = 0.79) and healthy individuals (r = 0.98 ICC = 0.97). Internal consistency was 0.73 (Cronbach's alpha). The mean VISA-A-DK score in AT patients was 51 [47-55]. This was significantly lower than healthy controls with a score of 93 (90-95). Criterion validity was considered good when comparing the scores of the Danish version with the original version in both healthy individuals and patients. VISA-A-DK is a valid and reliable instrument and has shown compatible to the original version in assessment of AT patients. VISA-A-DK is a useful tool in the assessment of AT, both in research and in a clinical setting. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
38 CFR 18.442 - Admissions and recruitment.

Code of Federal Regulations, 2011 CFR

2011-07-01

... conduct periodic validity studies against the criterion of overall success in the education program or... use any test or criterion for admission that has a disproportionate, adverse effect on handicapped persons or any class of handicapped persons unless: (i) The test or criterion, as used by the recipient...
When is hub gene selection better than standard meta-analysis?

PubMed

Langfelder, Peter; Mischel, Paul S; Horvath, Steve

2013-01-01

Since hub nodes have been found to play important roles in many networks, highly connected hub genes are expected to play an important role in biology as well. However, the empirical evidence remains ambiguous. An open question is whether (or when) hub gene selection leads to more meaningful gene lists than a standard statistical analysis based on significance testing when analyzing genomic data sets (e.g., gene expression or DNA methylation data). Here we address this question for the special case when multiple genomic data sets are available. This is of great practical importance since for many research questions multiple data sets are publicly available. In this case, the data analyst can decide between a standard statistical approach (e.g., based on meta-analysis) and a co-expression network analysis approach that selects intramodular hubs in consensus modules. We assess the performance of these two types of approaches according to two criteria. The first criterion evaluates the biological insights gained and is relevant in basic research. The second criterion evaluates the validation success (reproducibility) in independent data sets and often applies in clinical diagnostic or prognostic applications. We compare meta-analysis with consensus network analysis based on weighted correlation network analysis (WGCNA) in three comprehensive and unbiased empirical studies: (1) Finding genes predictive of lung cancer survival, (2) finding methylation markers related to age, and (3) finding mouse genes related to total cholesterol. The results demonstrate that intramodular hub gene status with respect to consensus modules is more useful than a meta-analysis p-value when identifying biologically meaningful gene lists (reflecting criterion 1). However, standard meta-analysis methods perform as good as (if not better than) a consensus network approach in terms of validation success (criterion 2). The article also reports a comparison of meta-analysis techniques applied to gene expression data and presents novel R functions for carrying out consensus network analysis, network based screening, and meta analysis.
Concurrent Criterion Validity of the Ausburg Multidimensional Personality Instrument (AMPI) Clinical Scales among College Students

ERIC Educational Resources Information Center

Kelly, William E.; Lutz, Daniel

2014-01-01

The concurrent criterion validity of the Ausburg Multidimensional Personality Instrument (AMPI) clinical scales was examined. The AMPI and several scales purportedly measuring the same or similar constructs as those of the AMPI clinical scales were administered to two samples of college students (N = 134 and N = 118). The correlations between the…
The Validity of the Modified Sit-and-Reach Test in College-Age Students.

ERIC Educational Resources Information Center

Minkler, Sharin; Patterson, Patricia

1994-01-01

Reports a study that examined the criterion-related validity of the modified sit-and-reach test against criterion measures of hamstring and low back flexibility in college students. Results indicated the modified sit-and-reach test moderately related to hamstring flexibility, but its relation to low back flexibility was low. (SM)
Updating the Trainability Tests Literature on Black-White Subgroup Differences and Reconsidering Criterion-Related Validity

ERIC Educational Resources Information Center

Roth, Philip L.; Buster, Maury A.; Bobko, Philip

2011-01-01

A number of applied psychologists have suggested that trainability test Black-White ethnic group differences are low or relatively low (e.g., Siegel & Bergman, 1975), though data are scarce. Likewise, there are relatively few estimates of criterion-related validity for trainability tests predicting job performance (cf. Robertson & Downs,…
easyCBM® Reading Criterion Related Validity Evidence: Grades K-1. Technical Report #1309

ERIC Educational Resources Information Center

Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

2013-01-01

In this technical report, we present the results of a study to gather criterion-related evidence for Grade K-1 easyCBM® reading measures. We used correlations to examine the relation between the easyCBM® measures and other published measures with known reliability and validity evidence, including the Dynamic Indicators of Basic Early Literacy…
Development and Criterion Validity of Differentiated and Elevated Vocational Interests in Adolescence

ERIC Educational Resources Information Center

Hirschi, Andreas

2009-01-01

Interest differentiation and elevation are supposed to provide important information about a person's state of interest development, yet little is known about their development and criterion validity. The present study explored these constructs among a group of Swiss adolescents. Study 1 applied a cross-sectional design with 210 students in 11th…

What Is True Halving in the Payoff Matrix of Game Theory?

PubMed Central

Hasegawa, Eisuke; Yoshimura, Jin

2016-01-01

In game theory, there are two social interpretations of rewards (payoffs) for decision-making strategies: (1) the interpretation based on the utility criterion derived from expected utility theory and (2) the interpretation based on the quantitative criterion (amount of gain) derived from validity in the empirical context. A dynamic decision theory has recently been developed in which dynamic utility is a conditional (state) variable that is a function of the current wealth of a decision maker. We applied dynamic utility to the equal division in dove-dove contests in the hawk-dove game. Our results indicate that under the utility criterion, the half-share of utility becomes proportional to a player’s current wealth. Our results are consistent with studies of the sense of fairness in animals, which indicate that the quantitative criterion has greater validity than the utility criterion. We also find that traditional analyses of repeated games must be reevaluated. PMID:27487194
What Is True Halving in the Payoff Matrix of Game Theory?

PubMed

Ito, Hiromu; Katsumata, Yuki; Hasegawa, Eisuke; Yoshimura, Jin

2016-01-01

In game theory, there are two social interpretations of rewards (payoffs) for decision-making strategies: (1) the interpretation based on the utility criterion derived from expected utility theory and (2) the interpretation based on the quantitative criterion (amount of gain) derived from validity in the empirical context. A dynamic decision theory has recently been developed in which dynamic utility is a conditional (state) variable that is a function of the current wealth of a decision maker. We applied dynamic utility to the equal division in dove-dove contests in the hawk-dove game. Our results indicate that under the utility criterion, the half-share of utility becomes proportional to a player's current wealth. Our results are consistent with studies of the sense of fairness in animals, which indicate that the quantitative criterion has greater validity than the utility criterion. We also find that traditional analyses of repeated games must be reevaluated.
Empirical agreement in model validation.

PubMed

Jebeile, Julie; Barberousse, Anouk

2016-04-01

Empirical agreement is often used as an important criterion when assessing the validity of scientific models. However, it is by no means a sufficient criterion as a model can be so adjusted as to fit available data even though it is based on hypotheses whose plausibility is known to be questionable. Our aim in this paper is to investigate into the uses of empirical agreement within the process of model validation. Copyright © 2015 Elsevier Ltd. All rights reserved.
Convergent, discriminant, and criterion validity of DSM-5 traits.

PubMed

Yalch, Matthew M; Hopwood, Christopher J

2016-10-01

Section III of the Diagnostic and Statistical Manual of Mental Disorders (5th edi.; DSM-5; American Psychiatric Association, 2013) contains a system for diagnosing personality disorder based in part on assessing 25 maladaptive traits. Initial research suggests that this aspect of the system improves the validity and clinical utility of the Section II Model. The Computer Adaptive Test of Personality Disorder (CAT-PD; Simms et al., 2011) contains many similar traits as the DSM-5, as well as several additional traits seemingly not covered in the DSM-5. In this study we evaluate the convergent and discriminant validity between the DSM-5 traits, as assessed by the Personality Inventory for DSM-5 (PID-5; Krueger et al., 2012), and CAT-PD in an undergraduate sample, and test whether traits included in the CAT-PD but not the DSM-5 provide incremental validity in association with clinically relevant criterion variables. Results supported the convergent and discriminant validity of the PID-5 and CAT-PD scales in their assessment of 23 out of 25 DSM-5 traits. DSM-5 traits were consistently associated with 11 criterion variables, despite our having intentionally selected clinically relevant criterion constructs not directly assessed by DSM-5 traits. However, the additional CAT-PD traits provided incremental information above and beyond the DSM-5 traits for all criterion variables examined. These findings support the validity of pathological trait models in general and the DSM-5 and CAT-PD models in particular, while also suggesting that the CAT-PD may include additional traits for consideration in future iterations of the DSM-5 system. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
7 CFR 15b.30 - Admissions and recruitment.

Code of Federal Regulations, 2011 CFR

2011-01-01

... first year grades, but shall conduct periodic validity studies against the criterion of overall success... admitted; (2) May not make use of any test or criterion for admission that has a disproportionate, adverse effect on handicapped persons or any class of handicapped persons unless (i) the test or criterion, as...
Validity of the posttraumatic stress disorders (PTSD) checklist in pregnant women.

PubMed

Gelaye, Bizu; Zheng, Yinnan; Medina-Mora, Maria Elena; Rondon, Marta B; Sánchez, Sixto E; Williams, Michelle A

2017-05-12

The PTSD Checklist-civilian (PCL-C) is one of the most commonly used self-report measures of PTSD symptoms, however, little is known about its validity when used in pregnancy. This study aims to evaluate the reliability and validity of the PCL-C as a screen for detecting PTSD symptoms among pregnant women. A total of 3372 pregnant women who attended their first prenatal care visit in Lima, Peru participated in the study. We assessed the reliability of the PCL-C items using Cronbach's alpha. Criterion validity and performance characteristics of PCL-C were assessed against an independent, blinded Clinician-Administered PTSD Scale (CAPS) interview using measures of sensitivity, specificity and receiver operating characteristics (ROC) curves. We tested construct validity using exploratory and confirmatory factor analytic approaches. The reliability of the PCL-C was excellent (Cronbach's alpha =0.90). ROC analysis showed that a cut-off score of 26 offered optimal discriminatory power, with a sensitivity of 0.86 (95% CI: 0.78-0.92) and a specificity of 0.63 (95% CI: 0.62-0.65). The area under the ROC curve was 0.75 (95% CI: 0.71-0.78). A three-factor solution was extracted using exploratory factor analysis and was further complemented with three other models using confirmatory factor analysis (CFA). In a CFA, a three-factor model based on DSM-IV symptom structure had reasonable fit statistics with comparative fit index of 0.86 and root mean square error of approximation of 0.09. The Spanish-language version of the PCL-C may be used as a screening tool for pregnant women. The PCL-C has good reliability, criterion validity and factorial validity. The optimal cut-off score obtained by maximizing the sensitivity and specificity should be considered cautiously; women who screened positive may require further investigation to confirm PTSD diagnosis.
Examination of the MMPI-2 restructured form (MMPI-2-RF) validity scales in civil forensic settings: findings from simulation and known group samples.

PubMed

Wygant, Dustin B; Ben-Porath, Yossef S; Arbisi, Paul A; Berry, David T R; Freeman, David B; Heilbronner, Robert L

2009-11-01

The current study examined the effectiveness of the MMPI-2 Restructured Form (MMPI-2-RF; Ben-Porath and Tellegen, 2008) over-reporting indicators in civil forensic settings. The MMPI-2-RF includes three revised MMPI-2 over-reporting validity scales and a new scale to detect over-reported somatic complaints. Participants dissimulated medical and neuropsychological complaints in two simulation samples, and a known-groups sample used symptom validity tests as a response bias criterion. Results indicated large effect sizes for the MMPI-2-RF validity scales, including a Cohen's d of .90 for Fs in a head injury simulation sample, 2.31 for FBS-r, 2.01 for F-r, and 1.97 for Fs in a medical simulation sample, and 1.45 for FBS-r and 1.30 for F-r in identifying poor effort on SVTs. Classification results indicated good sensitivity and specificity for the scales across the samples. This study indicates that the MMPI-2-RF over-reporting validity scales are effective at detecting symptom over-reporting in civil forensic settings.
[Spanish validation of Game Addiction Scale for Adolescents (GASA)].

PubMed

Lloret Irles, Daniel; Morell Gomis, Ramon; Marzo Campos, Juan Carlos; Tirado González, Sonia

The aim of this study is to adapt and validate the Game Addiction Scale for Adolescents (GASA) to the Spanish youth population. Cultural adaptation and validation study. Secondary Education centres. Two independent studies were conducted on a group of 466 young people with a mean age of 15.27 years (13-18, SD: 1.83) and 48.7% ♀ and on another group of 566, with a mean age of 21.24 years (19-26; SD: 1.86) 44.1% ♀. Addiction to video games (GASA); Game behavior (Game habits usage questionnaire), Impulsiveness (Plutchik Impulsiveness Scale) and Group Pressure (Ad hoc questionnaire). The Spanish version of GASA has shown good reliability and true to the original scale factor structure. As regards criterion validity, GASA scores are significantly different according to four criteria related to problem gambling: Game intensity and frequency, impulsiveness, and peer pressure. The results show that the adapted version GASA is adequate and a valid tool for assessing problematic gaming behaviour. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Measurement versus prediction in the construction of patient-reported outcome questionnaires: can we have our cake and eat it?

PubMed

Smits, Niels; van der Ark, L Andries; Conijn, Judith M

2017-11-02

Two important goals when using questionnaires are (a) measurement: the questionnaire is constructed to assign numerical values that accurately represent the test taker's attribute, and (b) prediction: the questionnaire is constructed to give an accurate forecast of an external criterion. Construction methods aimed at measurement prescribe that items should be reliable. In practice, this leads to questionnaires with high inter-item correlations. By contrast, construction methods aimed at prediction typically prescribe that items have a high correlation with the criterion and low inter-item correlations. The latter approach has often been said to produce a paradox concerning the relation between reliability and validity [1-3], because it is often assumed that good measurement is a prerequisite of good prediction. To answer four questions: (1) Why are measurement-based methods suboptimal for questionnaires that are used for prediction? (2) How should one construct a questionnaire that is used for prediction? (3) Do questionnaire-construction methods that optimize measurement and prediction lead to the selection of different items in the questionnaire? (4) Is it possible to construct a questionnaire that can be used for both measurement and prediction? An empirical data set consisting of scores of 242 respondents on questionnaire items measuring mental health is used to select items by means of two methods: a method that optimizes the predictive value of the scale (i.e., forecast a clinical diagnosis), and a method that optimizes the reliability of the scale. We show that for the two scales different sets of items are selected and that a scale constructed to meet the one goal does not show optimal performance with reference to the other goal. The answers are as follows: (1) Because measurement-based methods tend to maximize inter-item correlations by which predictive validity reduces. (2) Through selecting items that correlate highly with the criterion and lowly with the remaining items. (3) Yes, these methods may lead to different item selections. (4) For a single questionnaire: Yes, but it is problematic because reliability cannot be estimated accurately. For a test battery: Yes, but it is very costly. Implications for the construction of patient-reported outcome questionnaires are discussed.
Automated Ecological Assessment of Physical Activity: Advancing Direct Observation.

PubMed

Carlson, Jordan A; Liu, Bo; Sallis, James F; Kerr, Jacqueline; Hipp, J Aaron; Staggs, Vincent S; Papa, Amy; Dean, Kelsey; Vasconcelos, Nuno M

2017-12-01

Technological advances provide opportunities for automating direct observations of physical activity, which allow for continuous monitoring and feedback. This pilot study evaluated the initial validity of computer vision algorithms for ecological assessment of physical activity. The sample comprised 6630 seconds per camera (three cameras in total) of video capturing up to nine participants engaged in sitting, standing, walking, and jogging in an open outdoor space while wearing accelerometers. Computer vision algorithms were developed to assess the number and proportion of people in sedentary, light, moderate, and vigorous activity, and group-based metabolic equivalents of tasks (MET)-minutes. Means and standard deviations (SD) of bias/difference values, and intraclass correlation coefficients (ICC) assessed the criterion validity compared to accelerometry separately for each camera. The number and proportion of participants sedentary and in moderate-to-vigorous physical activity (MVPA) had small biases (within 20% of the criterion mean) and the ICCs were excellent (0.82-0.98). Total MET-minutes were slightly underestimated by 9.3-17.1% and the ICCs were good (0.68-0.79). The standard deviations of the bias estimates were moderate-to-large relative to the means. The computer vision algorithms appeared to have acceptable sample-level validity (i.e., across a sample of time intervals) and are promising for automated ecological assessment of activity in open outdoor settings, but further development and testing is needed before such tools can be used in a diverse range of settings.
Automated Ecological Assessment of Physical Activity: Advancing Direct Observation

PubMed Central

Carlson, Jordan A.; Liu, Bo; Sallis, James F.; Kerr, Jacqueline; Papa, Amy; Dean, Kelsey; Vasconcelos, Nuno M.

2017-01-01

Technological advances provide opportunities for automating direct observations of physical activity, which allow for continuous monitoring and feedback. This pilot study evaluated the initial validity of computer vision algorithms for ecological assessment of physical activity. The sample comprised 6630 seconds per camera (three cameras in total) of video capturing up to nine participants engaged in sitting, standing, walking, and jogging in an open outdoor space while wearing accelerometers. Computer vision algorithms were developed to assess the number and proportion of people in sedentary, light, moderate, and vigorous activity, and group-based metabolic equivalents of tasks (MET)-minutes. Means and standard deviations (SD) of bias/difference values, and intraclass correlation coefficients (ICC) assessed the criterion validity compared to accelerometry separately for each camera. The number and proportion of participants sedentary and in moderate-to-vigorous physical activity (MVPA) had small biases (within 20% of the criterion mean) and the ICCs were excellent (0.82–0.98). Total MET-minutes were slightly underestimated by 9.3–17.1% and the ICCs were good (0.68–0.79). The standard deviations of the bias estimates were moderate-to-large relative to the means. The computer vision algorithms appeared to have acceptable sample-level validity (i.e., across a sample of time intervals) and are promising for automated ecological assessment of activity in open outdoor settings, but further development and testing is needed before such tools can be used in a diverse range of settings. PMID:29194358
Proposed modification of the criterion for the region of validity of the inverse-power expansion in diatomic long-range potentials

NASA Astrophysics Data System (ADS)

Ji, Bing; Tsai, Chin-Chun; Stwalley, William C.

1995-04-01

A modified internuclear distance criterion, RLR- m, as the lower bound for the region of validity of the inverse-power expansion of the diatomic long-range potential is proposed. This new criterion takes into account the spatial orientation of the atomic orbitals while retaining the simplicity of the traditional Le Roy radius, RLR for the interaction of S state atoms. Recent experimental and theoretical results for various excited states in Na 2 suggest that this proposed RLR- m is an appropriate generalization of RLR.
[Measurement properties of self-report questionnaires published in Korean nursing journals].

PubMed

Lee, Eun-Hyun; Kim, Chun-Ja; Kim, Eun Jung; Chae, Hyun-Ju; Cho, Soo-Yeon

2013-02-01

The purpose of this study was to evaluate measurement properties of self-report questionnaires for studies published in Korean nursing journals. Of 424 Korean nursing articles initially identified, 168 articles met the inclusion criteria. The methodological quality of the measurements used in the studies and interpretability were assessed using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. It consists of items on internal consistency, reliability, measurement error, content validity, construct validity including structural validity, hypothesis testing, cross-cultural validity, and criterion validity, and responsiveness. For each item of the COSMIN checklist, measurement properties are rated on a four-point scale: excellent, good, fair, and poor. Each measurement property is scored with worst score counts. All articles used the classical test theory for measurement properties. Internal consistency (72.6%), construct validity (56.5%), and content validity (38.2%) were most frequently reported properties being rated as 'excellent' by COSMIN checklist, whereas other measurement properties were rarely reported. A systematic review of measurement properties including interpretability of most instruments warrants further research and nursing-focused checklists assessing measurement properties should be developed to facilitate intervention outcomes across Korean studies.
Self-Reported Physical Activity within and outside the Neighborhood: Criterion-Related Validity of the Neighborhood Physical Activity Questionnaire in German Older Adults

ERIC Educational Resources Information Center

Bödeker, Malte; Bucksch, Jens; Wallmann-Sperlich, Birgit

2018-01-01

The Neighborhood Physical Activity Questionnaire allows to assess physical activity within and outside the neighborhood. Study objectives were to examine the criterion-related validity and health/functioning associations of Neighborhood Physical Activity Questionnaire-derived physical activity in German older adults. A total of 107 adults aged…
Effect of Items Direction (Positive or Negative) on the Factorial Construction and Criterion Related Validity in Likert Scale

ERIC Educational Resources Information Center

Naji Qasem, Mamun Ali; Ahmad Gul, Showkeen Bilal

2014-01-01

The study was conducted to know the effect of items direction (positive or negative) on the factorial construction and criterion related validity in Likert scale. The descriptive survey research method was used for the study and the sample consisted of 510 undergraduate students selected by used random sampling technique. A scale developed by…
Testing a Multi-Stage Screening System: Predicting Performance on Australia's National Achievement Test Using Teachers' Ratings of Academic and Social Behaviors

ERIC Educational Resources Information Center

Kettler, Ryan J.; Elliott, Stephen N.; Davies, Michael; Griffin, Patrick

2012-01-01

This study addresses the predictive validity of results from a screening system of academic enablers, with a sample of Australian elementary school students, when the criterion variable is end-of-year achievement. The investigation included (a) comparing the predictive validity of a brief criterion-referenced nomination system with more…
easyCBM® Reading Criterion Related Validity Evidence: Grades 2-5. Technical Report #1310

ERIC Educational Resources Information Center

Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

2013-01-01

In this technical report, we present the results of a study to gather criterion-related evidence for Grade 2-5 easyCBM® reading measures. We used correlations to examine the relation between the easyCBM® measures and other published measures with known reliability and validity evidence, including the Gates-MacGinitie Reading Tests and the Dynamic…
A Case for Transforming the Criterion of a Predictive Validity Study

ERIC Educational Resources Information Center

Patterson, Brian F.; Kobrin, Jennifer L.

2011-01-01

This study presents a case for applying a transformation (Box and Cox, 1964) of the criterion used in predictive validity studies. The goals of the transformation were to better meet the assumptions of the linear regression model and to reduce the residual variance of fitted (i.e., predicted) values. Using data for the 2008 cohort of first-time,…
A Controlled Evaluation of the Distress Criterion for Binge Eating Disorder

ERIC Educational Resources Information Center

Grilo, Carlos M.; White, Marney A.

2011-01-01

Objective: Research has examined various aspects of the validity of the research criteria for binge eating disorder (BED) but has yet to evaluate the utility of Criterion C, "marked distress about binge eating." This study examined the significance of the marked distress criterion for BED using 2 complementary comparison groups. Method:…
Determination of esophageal eosinophil counts and other histologic features of eosinophilic esophagitis by pathology trainees is highly accurate.

PubMed

Rusin, Spencer; Covey, Shannon; Perjar, Irina; Hollyfield, Johnny; Speck, Olga; Woodward, Kimberly; Woosley, John T; Dellon, Evan S

2017-04-01

Many studies of eosinophilic esophagitis (EoE) use expert pathology review, but it is unknown whether less experienced pathologists can reliably assess EoE histology. We aimed to determine whether trainee pathologists can accurately quantify esophageal eosinophil counts and identify associated histologic features of EoE, as compared with expert pathologists. We used a set of 40 digitized slides from patients with varying degrees of esophageal eosinophilia. Each of 6 trainee pathologists underwent a teaching session and used our validated protocol to determine eosinophil counts and associated EoE findings. The same slides had previously been evaluated by expert pathologists, and these results comprised the criterion standard. Eosinophil counts were correlated, and agreement was calculated for the diagnostic threshold of 15 eosinophils per high-power field as well as for associated EoE findings. Peak eosinophil counts were highly correlated between the trainees and the criterion standard (ρ ranged from 0.87 to 0.92; P<.001 for all). Peak counts were also highly correlated between trainees (0.75-0.91; P<.001), and results were similar for mean counts. Agreement was excellent for determining if a count exceeded the diagnostic threshold (κ ranged from 0.83 to 0.89; P<.001). Agreement was very good for eosinophil degranulation (κ = 0.54-0.83; P<.01) and spongiosis (κ = 0.44-0.87; P<.01) but was lower for eosinophil microabscesses (κ = 0.37-0.64; P<.01). In conclusion, using a teaching session, digitized slide set, and validated protocol, the agreement between pathology trainees and expert pathologists for determining eosinophil counts was excellent. Agreement was very good for eosinophil degranulation and spongiosis but less so for microabscesses. Copyright © 2016 Elsevier Inc. All rights reserved.

The reliability and criterion validity of 2D video assessment of single leg squat and hop landing.

PubMed

Herrington, Lee; Alenezi, Faisal; Alzhrani, Msaad; Alrayani, Hasan; Jones, Richard

2017-06-01

The objective was to assess the intra-tester, within and between day reliability of measurement of hip adduction (HADD) and frontal plane projection angles (FPPA) during single leg squat (SLS) and single leg landing (SLL) using 2D video and the validity of these measurements against those found during 3D motion capture. 15 healthy subjects had their SLS and SLL assessed using 3D motion capture and video analysis. Inter-tester reliability for both SLS and SLL when measuring FPPA and HADD show excellent correlations (ICC 2,1 0.97-0.99). Within and between day assessment of SLS and SLL showed good to excellent correlations for both variables (ICC 3,1 0.72-91). 2D FPPA measures were found to have good correlation with knee abduction angle in 3-D (r=0.79, p=0.008) during SLS, and also to knee abduction moment (r=0.65, p=0.009). 2D HADD showed very good correlation with 3D HADD during SLS (r=0.81, p=0.001), and a good correlation during SLL (r=0.62, p=0.013). All other associations were weak (r<0.4). This study suggests that 2D video kinematics have a reasonable association to what is being measured with 3D motion capture. Copyright © 2017 Elsevier Ltd. All rights reserved.
Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese

PubMed Central

2014-01-01

Background Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. Methods The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman’s correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years by using the intraclass correlation coefficient analysis. Results The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that construct the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency were low to acceptable (Cronbach’s alpha = 0.67 for the total scale, 0.46-0.61 for each subscale). Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, 47.4 ± 14.1, and 40.6 ± 11.3 for the first and second administrations in the second sample. The distribution of Dentocult SM score was 6.8% (score = 0), 34.4% (score = 1), 39.4% (score = 2), and 19.4% (score = 3). Participants with higher scores were more likely to have higher dietary cariogenicity scores (p < 0.001; Kruskal-Wallis test). Conclusions These results provide the preliminary evidence for the reliability and validity of the Japanese language Food Frequency Questionnaire. PMID:24383547
[The Family Questionnaire (FB-K) - A Short Version of the General Family Questionnaire and its Reliability and Validity].

PubMed

Sidor, Anna; Cierpka, Manfred

2016-01-01

A standardized assessment of a family system plays a crucial role in family therapy research and diagnostic, as well as in a family therapy itself. A 14-item short version of the General Family Questionnaire (FB-K) was designed to get a tool for assessing family functionality that is low time-consuming. The short version was developed by factor analysis from the long version FA-A. The quality criteria of the family questionnaire were verified in a control sample of 208 high-risk families four months after the birth of their child. The new family questionnaire demonstrates a very good reliability and a satisfactory 8-months-stability. The concurrent validity with the FACES scale "cohesion" is assured. Regarding the construct validity a positive correlation to the feeling of coherence was found. The family questionnaire shows a negative correlation to the maternal postnatal depressive symptoms, the degree of maternal stress burden, the dysfunctionality of the mother-child-relationship and impaired bonding. The values taken from a norm sample with infants are higher by trend and in the sample with children under 18 do not deviate from the values of the risk sample. FB-K covers two aspects of family functioning, the bond between family members and their willingness to communicate. The internal consistency of FB-K is excellent, the criterion and the construct validity are good.
Five-level emergency triage systems: variation in assessment of validity.

PubMed

Kuriyama, Akira; Urushidani, Seigo; Nakayama, Takeo

2017-11-01

Triage systems are scales developed to rate the degree of urgency among patients who arrive at EDs. A number of different scales are in use; however, the way in which they have been validated is inconsistent. Also, it is difficult to define a surrogate that accurately predicts urgency. This systematic review described reference standards and measures used in previous validation studies of five-level triage systems. We searched PubMed, EMBASE and CINAHL to identify studies that had assessed the validity of five-level triage systems and described the reference standards and measures applied in these studies. Studies were divided into those using criterion validity (reference standards developed by expert panels or triage systems already in use) and those using construct validity (prognosis, costs and resource use). A total of 57 studies examined criterion and construct validity of 14 five-level triage systems. Criterion validity was examined by evaluating (1) agreement between the assigned degree of urgency with objective standard criteria (12 studies), (2) overtriage and undertriage (9 studies) and (3) sensitivity and specificity of triage systems (7 studies). Construct validity was examined by looking at (4) the associations between the assigned degree of urgency and measures gauged in EDs (48 studies) and (5) the associations between the assigned degree of urgency and measures gauged after hospitalisation (13 studies). Particularly, among 46 validation studies of the most commonly used triages (Canadian Triage and Acuity Scale, Emergency Severity Index and Manchester Triage System), 13 and 39 studies examined criterion and construct validity, respectively. Previous studies applied various reference standards and measures to validate five-level triage systems. They either created their own reference standard or used a combination of severity/resource measures. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Are validated outcome measures used in distal radial fractures truly valid?

PubMed Central

Nienhuis, R. W.; Bhandari, M.; Goslings, J. C.; Poolman, R. W.; Scholtes, V. A. B.

2016-01-01

Objectives Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the methodological quality of these studies. Our aim was to systematically review the methodological quality of clinimetric studies that evaluated measurement properties of PROMs used in patients with distal radial fractures, and to make recommendations for the selection of PROMs based on the level of evidence of each individual measurement property. Methods A systematic literature search was performed in PubMed, EMbase, CINAHL and PsycINFO databases to identify relevant clinimetric studies. Two reviewers independently assessed the methodological quality of the studies on measurement properties, using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Level of evidence (strong / moderate / limited / lacking) for each measurement property per PROM was determined by combining the methodological quality and the results of the different clinimetric studies. Results In all, 19 out of 1508 identified unique studies were included, in which 12 PROMs were rated. The Patient-rated wrist evaluation (PRWE) and the Disabilities of Arm, Shoulder and Hand questionnaire (DASH) were evaluated on most measurement properties. The evidence for the PRWE is moderate that its reliability, validity (content and hypothesis testing), and responsiveness are good. The evidence is limited that its internal consistency and cross-cultural validity are good, and its measurement error is acceptable. There is no evidence for its structural and criterion validity. The evidence for the DASH is moderate that its responsiveness is good. The evidence is limited that its reliability and the validity on hypothesis testing are good. There is no evidence for the other measurement properties. Conclusion According to this systematic review, there is, at best, moderate evidence that the responsiveness of the PRWE and DASH are good, as are the reliability and validity of the PRWE. We recommend these PROMs in clinical studies in patients with distal radial fractures; however, more clinimetric studies of higher methodological quality are needed to adequately determine the other measurement properties. Cite this article: Dr Y. V. Kleinlugtenbelt. Are validated outcome measures used in distal radial fractures truly valid?: A critical assessment using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Bone Joint Res 2016;5:153–161. DOI: 10.1302/2046-3758.54.2000462. PMID:27132246
Extending Structural Analyses of the Rosenberg Self-Esteem Scale to Consider Criterion-Related Validity: Can Composite Self-Esteem Scores Be Good Enough?

PubMed

Donnellan, M Brent; Ackerman, Robert A; Brecheen, Courtney

2016-01-01

Although the Rosenberg Self-Esteem Scale (RSES) is the most widely used measure of global self-esteem in the literature, there are ongoing disagreements about its factor structure. This methodological debate informs how the measure should be used in substantive research. Using a sample of 1,127 college students, we test the overall fit of previously specified models for the RSES, including a newly proposed bifactor solution (McKay, Boduszek, & Harvey, 2014 ). We extend previous work by evaluating how various latent factors from these structural models are related to a set of criterion variables frequently studied in the self-esteem literature. A strict unidimensional model poorly fit the data, whereas models that accounted for correlations between negatively and positively keyed items tended to fit better. However, global factors from viable structural models had similar levels of association with criterion variables and with the pattern of results obtained with a composite global self-esteem variable calculated from observed scores. Thus, we did not find compelling evidence that different structural models had substantive implications, thereby reducing (but not eliminating) concerns about the integrity of the self-esteem literature based on overall composite scores for the RSES.
Achievement Emotions and Achievement Goals in Support of the Convergent, Divergent and Criterion Validity of the Spanish-Cognitive Test Anxiety Scale

ERIC Educational Resources Information Center

Sánchez-Rosas, Javier; Furlan, Luis Alberto

2017-01-01

Based on the control-value theory of achievement emotions and theory of achievement goals, this research provides evidence of convergent, divergent, and criterion validity of the Spanish Cognitive Test Anxiety Scale (S-CTAS). A sample of Argentinean undergraduates responded to several scales administered at three points. At time 1 and 3, the…
The Measurement of Executive Function at Age 3 Years: Psychometric Properties and Criterion Validity of a New Battery of Tasks

ERIC Educational Resources Information Center

Willoughby, Michael T.; Blair, Clancy B.; Wirth, R. J.; Greenberg, Mark

2010-01-01

In this study, the authors examined the psychometric properties and criterion validity of a newly developed battery of tasks that were designed to assess executive function (EF) abilities in early childhood. The battery was included in the 36-month assessment of the Family Life Project (FLP), a prospective longitudinal study of 1,292 children…
The Investigation of ADHD Prevalence in Kindergarten Children in Northeast Iran and a Determination of the Criterion Validity of Conners' Questionnaire via Clinical Interview

ERIC Educational Resources Information Center

Abdekhodaie, Zahra; Tabatabaei, Seyed Mahmood; Gholizadeh, Mortaza

2012-01-01

In this study, the prevalence of attention-deficit hyperactivity disorder (ADHD) in kindergarten children in northeast Iran was investigated, and the criterion validity of Conners' parent-teacher questionnaire was evaluated through the use of clinical interviews. This study was a cross-sectional descriptive research project with children in…
Evaluation of the Criterion and Convergent Validity of the Diagnostic Interview for Social and Communication Disorders in Young and Low-Functioning Children

ERIC Educational Resources Information Center

Maljaars, Jarymke; Noens, Ilse; Scholte, Evert; van Berckelaer-Onnes, Ina

2012-01-01

The Diagnostic Interview for Social and Communication Disorders (DISCO; Wing, 2006) is a standardized, semi-structured and interviewer-based schedule for diagnosis of autism spectrum disorder (ASD). The objective of this study was to evaluate the criterion and convergent validity of the DISCO-11 ICD-10 algorithm in young and low-functioning…
Psychometric validation of the PROQOL-HIV questionnaire, a new health-related quality of life instrument-specific to HIV disease.

PubMed

Duracinsky, Martin; Lalanne, Christophe; Le Coeur, Sophie; Herrmann, Susan; Berzins, Baiba; Armstrong, Andrew Richard; Lau, Joseph Tak Fai; Fournier, Isabelle; Chassany, Olivier

2012-04-15

This study reports the psychometric validation of a new HIV/AIDS-specific health-related quality of life (HRQL) questionnaire, the Patient Reported Outcomes Quality of Life-HIV. The instrument was developed simultaneously across Europe, North and South America, Africa, Asia, and Australia to assess multidimensional quality of life impairments in the era of highly active antiretroviral therapy. A cross-sectional study was performed in 8 countries. The pilot 70-item questionnaire was co-administered with the HIV symptoms index, the EQ-5D and Medical Outcomes Study-HIV questionnaires. Demographic and biomedical data were collected. After item analysis and reduction, convergent discriminant concurrent validity and known-group validity were examined. Internal consistency and reliability scores were assessed using Cronbach alpha and intraclass correlation. The final sample of 791 patients was composed of 64% males (median age: 41 years, HIV diagnosis = 5 years), 13.8% were treatment naive. Item reduction yielded a 43-item form surveying 8 dimensions and 1 global health item that showed good convergent and discriminant validity and reliability (98% scaling success; Cronbach alphas 0.77-0.89). Correlations with EQ-5D and Medical Outcomes Study-HIV complied with concurrent validity expectations; likewise, correlations against the number of self-reported symptoms and depression showed good support for criterion validity. A test-retest study on French patients (n = 34) showed temporal stability (intraclass correlation coefficient = 0.86). Significant and meaningful differences of HRQL scores between countries were found. The Patient Reported Outcomes Quality of Life-HIV questionnaire is a valid and reliable instrument for assessing HRQL specific to HIV disease in different cultures and healthcare systems.
Validation of the Tuebingen CD-25 Inventory as a Measure of Postoperative Health-Related Quality of Life in Patients Treated for Cushing's Disease.

PubMed

Milian, Monika; Kreitschmann-Andermahr, Ilonka; Siegel, Sonja; Kleist, Bernadette; Führer-Sakel, Dagmar; Honegger, Juergen; Buchfelder, Michael; Psaras, Tsambika

2015-01-01

To evaluate the construct and criterion validity of the Tuebingen Cushing's disease quality of life inventory (Tuebingen CD-25) for application in patients treated for Cushing's disease (CD). A total of 176 patients with adrenocorticotropin hormone-dependent CD (144 of them female, overall mean age 46.1 ± 13.7 years) treated at 3 large tertiary referral centers in Germany were studied. Construct validity was assessed by hypothesis testing (self-perceived symptom reduction assessment) and contrasted groups (patients with vs. without hypercorticolism). For this purpose, already existing data from 55 CD patients was used, representing the hypercortisolemic group. Criterion validity (concurrent validity) was assessed in relation to the Cushing's quality of life questionnaire (CushingQoL), the Short Form 36 health survey (SF-36), and the body mass index (BMI). Patients with self-perceived remarkable symptom reduction had significant lower Tuebingen CD-25 scores (i.e. better health-related quality of life) than patients with self-perceived insufficient symptom reduction (p < 0.05). Similarly, the mean scores of the Tuebingen CD-25 scales were lower in patients without hypercortisolism (total score 27.0 ± 17.2) compared to those with hypercortisolism (total score 45.3 ± 22.1; each p < 0.05), providing evidence for construct validity. Criterion validity was confirmed by the correlations between the Tuebingen CD-25 total score and the CushingQoL (Spearman's coefficient -0.733), as well as all scales of the SF-36 (Spearman's coefficient between -0.447 and -0.700). The analyses presented in this large-sample study provide robust evidence for the construct and criterion validity of the Tuebingen CD-25. © 2015 S. Karger AG, Basel.
The Portuguese formal social support for autonomy and dependence in pain inventory (FSSADI_PAIN): a preliminary validation study.

PubMed

Matos, Marta; Bernardes, Sónia F

2013-09-01

Development and preliminary validation of a Portuguese measure of perceived Formal Social Support for Autonomy and Dependence in Pain (FSSADI_PAIN). One hundred and fifty-one older adults (88.1% women), between 56 and 94 years of age (M = 75.41; SD = 9.11), who attended one of the following institutions--day care centre (33.1%), nursing home (36.4%) and senior university (30.5%)--were recruited for this study. Along with the FSSADI_PAIN, participants filled out the Portuguese versions of the Brief Pain Inventory (Azevedo et al., 2007, Dor, 15, 6) and the Social Support Scale of Medical Outcomes Survey (Pais-Ribeiro & Ponte, 2009, Psicologia, Saúde & Doença, 10, 163). The factorial structure reflected the functions of perceived promotion of (1) dependence and (2) autonomy, showing good internal consistency (α > .70) and sensitivity indices. The FSSADI_PAIN showed good content, discriminant and criterion validity; it differentiated the perceptions of promotion of dependence/autonomy according to individual's pain severity and disability, as well as the type of institution. These preliminary findings suggest that the FSSADI_PAIN is an innovative and promising measure of perceived formal social support adapted to pain-related contexts. © 2012 The British Psychological Society.
Semi-structured Interview Measure of Stigma (SIMS) in psychosis: Assessment of psychometric properties.

PubMed

Wood, Lisa; Burke, Eilish; Byrne, Rory; Enache, Gabriela; Morrison, Anthony P

2016-10-01

Stigma is a significant difficulty for people who experience psychosis. To date, there have been no outcome measures developed to examine stigma exclusively in people with psychosis. The aim of this study was develop and validate a semi-structured interview measure of stigma (SIMS) in psychosis. The SIMS is an eleven item measure of stigma developed in consultation with service users who have experienced psychosis. 79 participants with experience of psychosis were recruited for the purposes of this study. They were administered the SIMS alongside a battery of other relevant outcome measures to examine reliability and validity. A one-factor solution was identified for the SIMS which encompassed all ten rateable items. The measure met all reliability and validity criteria and illustrated good internal consistency, inter-rater reliability, test retest reliability, criterion validity, construct validity, sensitivity to change and had no floor or ceiling effects. The SIMS is a reliable and valid measure of stigma in psychosis. It may be more engaging and acceptable than other stigma measures due to its semi-structured interview format. Crown Copyright © 2016. Published by Elsevier B.V. All rights reserved.
Does the decision in a validation process of a surrogate endpoint change with level of significance of treatment effect? A proposal on validation of surrogate endpoints.

PubMed

Sertdemir, Y; Burgut, R

2009-01-01

In recent years the use of surrogate end points (S) has become an interesting issue. In clinical trials, it is important to get treatment outcomes as early as possible. For this reason there is a need for surrogate endpoints (S) which are measured earlier than the true endpoint (T). However, before a surrogate endpoint can be used it must be validated. For a candidate surrogate endpoint, for example time to recurrence, the validation result may change dramatically between clinical trials. The aim of this study is to show how the validation criterion (R(2)(trial)) proposed by Buyse et al. are influenced by the magnitude of treatment effect with an application using real data. The criterion R(2)(trial) proposed by Buyse et al. (2000) is applied to the four data sets from colon cancer clinical trials (C-01, C-02, C-03 and C-04). Each clinical trial is analyzed separately for treatment effect on survival (true endpoint) and recurrence free survival (surrogate endpoint) and this analysis is done also for each center in each trial. Results are used for standard validation analysis. The centers were grouped by the Wald statistic in 3 equal groups. Validation criteria R(2)(trial) were 0.641 95% CI (0.432-0.782), 0.223 95% CI (0.008-0.503), 0.761 95% CI (0.550-0.872) and 0.560 95% CI (0.404-0.687) for C-01, C-02, C-03 and C-04 respectively. The R(2)(trial) criteria changed by the Wald statistics observed for the centers used in the validation process. Higher the Wald statistic groups are higher the R(2)(trial) values observed. The recurrence free survival is not a good surrogate for overall survival in clinical trials with non significant treatment effects and moderate for significant treatment effects. This shows that the level of significance of treatment effect should be taken into account in validation process of surrogate endpoints.
Analysis of augmented aircraft flying qualities through application of the Neal-Smith criterion

NASA Technical Reports Server (NTRS)

Bailey, R. E.; Smith, R. E.

1981-01-01

The Neal-Smith criterion is examined for possible applications in the evaluation of augmented fighter aircraft flying qualities. Longitudinal and lateral flying qualities are addressed. Based on the application of several longitudinal flying qualities data bases, revisions are proposed to the original criterion. Examples are given which show the revised criterion to be a good discriminator of pitch flying qualities. Initial results of lateral flying qualities evaluation through application of the Neal-Smith criterion are poor. Lateral aircraft configurations whose flying qualities are degraded by roll ratcheting effects map into the Level 1 region of the criterion. A third dimension of the criterion for flying qualities specification is evident. Additional criteria are proposed to incorporate this dimension into the criterion structure for flying qualities analysis.
Validation of the Headache Impact Test (HIT-6) in patients with chronic migraine.

PubMed

Rendas-Baum, Regina; Yang, Min; Varon, Sepideh F; Bloudek, Lisa M; DeGryse, Ronald E; Kosinski, Mark

2014-08-01

The Headache Impact Test (HIT)-6 was developed and has been validated in patients with various types of headache. The objective of this study was to report the psychometric properties of the HIT-6 among patients with chronic migraine. Data came from two international, multicenter, randomized, double-blind, placebo-controlled clinical trials of chronic migraine patients (N = 1,384) undergoing prophylaxis therapy. Confirmatory factor analysis and differential item functioning (DIF) analysis were used to test the latent structure and cross-cultural comparability of the HIT-6. Reliability, construct validity, and responsiveness were assessed. Two sets of criterion groups were used: (1) 28-day headache frequency: <10, 10-14, and ≥15 days; (2) sample quartiles of the total cumulative hours of headache: <140, 140 to <280, 280 to <420, and ≥420 hours. Two sets of responsiveness categories were defined as reduction of <30%, 30% to <50%, or ≥50% in (1) number of headache days and (2) cumulative hours of headache. Measurement invariance tests supported the stability of the HIT-6 latent structure across studies. DIF analysis supported cross-cultural comparability. Good reliability was observed across studies (Cronbach's α: 0.75-0.92; intraclass correlation coefficient: 0.76-0.80). HIT-6 scores correlated strongly (-0.86 to -0.59) with scores of the Migraine-Specific Quality-of-Life Questionnaire. Analysis of variance indicated that HIT-6 scores discriminated across both types of criterion groups (P<0.001), across studies and time points. HIT-6 change scores were significantly higher in magnitude in groups experiencing greater improvement (P<0.001). All measurement properties were consistently verified across the two studies, supporting the validity of the HIT-6 among chronic migraine patients. NCT00156910 and NCT00168428 on www.ClinicalTrials.gov.
Frequency of Binge Eating Episodes in Bulimia Nervosa and Binge Eating Disorder: Diagnostic Considerations

PubMed Central

Wilson, G. Terence; Sysko, Robyn

2013-01-01

Objective In DSM-IV, to be diagnosed with Bulimia Nervosa (BN) or the provisional diagnosis of Binge Eating Disorder (BED), an individual must experience episodes of binge eating is “at least twice a week” on average, for three or six months respectively. The purpose of this review was to examine the validity and utility of the frequency criterion for BN and BED. Method Published studies evaluating the frequency criterion were reviewed. Results Our review found little evidence to support the validity or utility of the DSM-IV frequency criterion of twice a week binge eating; however, the number of studies available for our review was limited. Conclusion A number of options are available for the frequency criterion in DSM-V, and the optimal diagnostic threshold for binge eating remains to be determined. PMID:19610014
The Physical Activity Scale for Individuals with Physical Disabilities: test-retest reliability and comparison with an accelerometer.

PubMed

van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem

2007-01-01

The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects' diagnoses were: stroke, spinal cord injury, whiplash, and neurological-, orthopedic- or back disorders. The PASIPD is a 7-d recall physical activity questionnaire that was completed twice, 1 wk apart. During this week, physical activity was also measured with an Actigraph accelerometer. The test-retest reliability Spearman correlation of the PASIPD was 0.77. The criterion validity Spearman correlation was 0.30 when compared to the accelerometer. The PASIPD had test-retest reliability and criterion validity that is comparable to well established self-report physical activity questionnaires from the general population.
Indirect Measurement of Sexual Orientation: Comparison of the Implicit Relational Assessment Procedure, Viewing Time, and Choice Reaction Time Tasks.

PubMed

Rönspies, Jelena; Schmidt, Alexander F; Melnikova, Anna; Krumova, Rosina; Zolfagari, Asadeh; Banse, Rainer

2015-07-01

The present study was conducted to validate an adaptation of the Implicit Relational Assessment Procedure (IRAP) as an indirect latency-based measure of sexual orientation. Furthermore, reliability and criterion validity of the IRAP were compared to two established indirect measures of sexual orientation: a Choice Reaction Time task (CRT) and a Viewing Time (VT) task. A sample of 87 heterosexual and 35 gay men completed all three indirect measures in an online study. The IRAP and the VT predicted sexual orientation nearly perfectly. Both measures also showed a considerable amount of convergent validity. Reliabilities (internal consistencies) reached satisfactory levels. In contrast, the CRT did not tap into sexual orientation in the present study. In sum, the VT measure performed best, with the IRAP showing only slightly lower reliability and criterion validity, whereas the CRT did not yield any evidence of reliability or criterion validity in the present research. The results were discussed in the light of specific task properties of the indirect latency-based measures (task-relevance vs. task-irrelevance).

The cross-validated AUC for MCP-logistic regression with high-dimensional data.

PubMed

Jiang, Dingfeng; Huang, Jian; Zhang, Ying

2013-10-01

We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selection. The CV-AUC criterion is specifically designed for optimizing the classification performance for binary outcome data. To implement the proposed approach, we derive an efficient coordinate descent algorithm to compute the MCP-logistic regression solution surface. Simulation studies are conducted to evaluate the finite sample performance of the proposed method and its comparison with the existing methods including the Akaike information criterion (AIC), Bayesian information criterion (BIC) or Extended BIC (EBIC). The model selected based on the CV-AUC criterion tends to have a larger predictive AUC and smaller classification error than those with tuning parameters selected using the AIC, BIC or EBIC. We illustrate the application of the MCP-logistic regression with the CV-AUC criterion on three microarray datasets from the studies that attempt to identify genes related to cancers. Our simulation studies and data examples demonstrate that the CV-AUC is an attractive method for tuning parameter selection for penalized methods in high-dimensional logistic regression models.
Financial Decision-making Abilities and Financial Exploitation in Older African Americans: Preliminary Validity Evidence for the Lichtenberg Financial Decision Rating Scale (LFDRS)

PubMed Central

Ficker, Lisa J.; Rahman-Filipiak, Annalise

2015-01-01

This study examines preliminary evidence for the Lichtenberg Financial Decision Rating Scale (LFDRS), a new person-centered approach to assessing capacity to make financial decisions, and its relationship to self-reported cases of financial exploitation in 69 older African Americans. More than one third of individuals reporting financial exploitation also had questionable decisional abilities. Overall, decisional ability score and current decision total were significantly associated with cognitive screening test and financial ability scores, demonstrating good criterion validity. Financially exploited individuals, and non-exploited individuals, showed mean group differences on the Mini Mental State Exam, Financial Situational Awareness, Psychological Vulnerability, Current Decisional Ability, and Susceptibility to undue influence subscales, and Total Lichtenberg Financial Decision Rating Scale Score. Study findings suggest that impaired decisional abilities may render older adults more vulnerable to financial exploitation, and that the LFDRS is a valid tool for measuring both decisional abilities and financial exploitation. PMID:26285038
[Development of a cell phone addiction scale for korean adolescents].

PubMed

Koo, Hyun Young

2009-12-01

This study was done to develop a cell phone addiction scale for Korean adolescents. The process included construction of a conceptual framework, generation of initial items, verification of content validity, selection of secondary items, preliminary study, and extraction of final items. The participants were 577 adolescents in two middle schools and three high schools. Item analysis, factor analysis, criterion related validity, and internal consistency were used to analyze the data. Twenty items were selected for the final scale, and categorized into 3 factors explaining 55.45% of total variance. The factors were labeled as withdrawal/tolerance (7 items), life dysfunction (6 items), and compulsion/persistence (7 items). The scores for the scale were significantly correlated with self-control, impulsiveness, and cell phone use. Cronbach's alpha coefficient for the 20 items was .92. Scale scores identified students as cell phone addicted, heavy users, or average users. The above findings indicate that the cell phone addiction scale has good validity and reliability when used with Korean adolescents.
Comparison of two methods of measuring physical activity in South African older adults.

PubMed

Kolbe-Alexander, Tracy L; Lambert, Estelle V; Harkins, Judith Biletnikoff; Ekelund, Ulf

2006-01-01

The aim of this study was to assess the validity and reliability of the Yale Physical Activity Survey (YPAS) and the short version of the International Physical Activity Questionnaire (IPAQ) in older South African adults. The YPAS includes measures of weekly energy expenditure (EE) for housework, yard work, caregiving, exercise, and recreation. The IPAQ measures total time and EE during vigorous and moderate activity, walking, and sitting. The instruments were administered twice for test-retest reliability (men, n = 52, 68 +/- 5.4 years, and women, n = 70, 66 +/- 5.8 years). Data for criterion validity were obtained from accelerometers. YPAS reliability ranged from r = .44 to.80 for men and r = .59 to .99 for women (p < .0001). IPAQ reliability was lower for men (r = .29 to .76) than for women (r = .46 to .77). Criterion validity of the YPAS was .31 to .54 for men and .26 to .29 for women. The YPAS and short IPAQ had comparable results for reliability and criterion validity.
Reliability and Validity of the Chinese Version of FACIT-AI, a New Tool for Assessing Quality of Life in Patients with Malignant Ascites.

PubMed

Lou, Yanni; Lu, Linghui; Li, Yuan; Liu, Meng; Bredle, Jason M; Jia, Liqun

2015-10-01

The study objective was to determine the reliability and validity of the Chinese version of the Functional Assessment of Chronic Illness Therapy - Ascites Index (FACIT-AI). A forward-backward translation procedure was adopted to develop the Chinese version of the FACIT-AI, which was tested in 69 patients with malignant ascites. Cronbach's α, split-half reliability, and test-retest reliability were used to assess the reliability of the scale. The content validity index was used to assess the content validity, while factor analysis was used for construct validity and correlation analysis was used for criterion validity. The Cronbach's α was 0.772 for the total scale, and the split-half reliability was 0.693. The test-retest correlation was 0.972. The content validity index for the scale was 0.8-1.0. Four factors were extracted by factor analysis, and these contributed 63.51% of the total variance. Item-total correlations ranged from 0.591 to 0.897, and these were correlated with visual analog scale scores (correlation coefficient, 0.889; P<0.01). The Chinese version of the FACIT-AI has good reliability and validity and can be used as a tool to measure quality of life in Chinese patients with malignant ascites.
Long-Term Impact of Valid Case Criterion on Capturing Population-Level Growth under Item Response Theory Equating. Research Report. ETS RR-17-17

ERIC Educational Resources Information Center

Deng, Weiling; Monfils, Lora

2017-01-01

Using simulated data, this study examined the impact of different levels of stringency of the valid case inclusion criterion on item response theory (IRT)-based true score equating over 5 years in the context of K-12 assessment when growth in student achievement is expected. Findings indicate that the use of the most stringent inclusion criterion…
easyCBM Beginning Reading Measures: Grades K-1 Alternate Form Reliability and Criterion Validity with the SAT-10. Technical Report #1403

ERIC Educational Resources Information Center

Wray, Kraig; Lai, Cheng-Fei; Sáez, Leilani; Alonzo, Julie; Tindal, Gerald

2013-01-01

We report the results of an alternate form reliability and criterion validity study of kindergarten and grade 1 (N = 84-199) reading measures from the easyCBM© assessment system and Stanford Early School Achievement Test/Stanford Achievement Test, 10th edition (SESAT/SAT-10) across 5 time points. The alternate form reliabilities ranged from…
[Validity and reliability of Korean version of the Family Management Measure (Korean FaMM) for families with children having chronic illness].

PubMed

Kim, Dong Hee; Im, Yeo Jin

2013-02-01

To develop and test the validity and reliability of the Korean version of the Family Management Measure (Korean FaMM) to assess applicability for families with children having chronic illnesses. The Korean FaMM was articulated through forward-backward translation methods. Internal consistency reliability, construct and criterion validity were calculated using PASW WIN (19.0) and AMOS (20.0). Survey data were collected from 341 mothers of children suffering from chronic disease enrolled in a university hospital in Seoul, South Korea. The Korean version of FaMM showed reliable internal consistency with Cronbach's alpha for the total scale of .69-.91. Factor loadings of the 53 items on the six sub-scales ranged from 0.28-0.84. The model of six subscales for the Korean FaMM was validated by expiratory and confirmatory factor analysis (χ²<.001, RMR<.05, GFI, AGFI, NFI, NNFI>.08). Criterion validity compared to the Parental Stress Index (PSI) showed significant correlation. The findings of this study demonstrate that the Korean FaMM showed satisfactory construct and criterion validity and reliability. It is useful to measure Korean family's management style with their children who have a chronic illness.
Assessment of Lower Limb Muscle Strength and Power Using Hand-Held and Fixed Dynamometry: A Reliability and Validity Study

PubMed Central

Perraton, Luke G.; Bower, Kelly J.; Adair, Brooke; Pua, Yong-Hao; Williams, Gavin P.; McGaw, Rebekah

2015-01-01

Introduction Hand-held dynamometry (HHD) has never previously been used to examine isometric muscle power. Rate of force development (RFD) is often used for muscle power assessment, however no consensus currently exists on the most appropriate method of calculation. The aim of this study was to examine the reliability of different algorithms for RFD calculation and to examine the intra-rater, inter-rater, and inter-device reliability of HHD as well as the concurrent validity of HHD for the assessment of isometric lower limb muscle strength and power. Methods 30 healthy young adults (age: 23±5yrs, male: 15) were assessed on two sessions. Isometric muscle strength and power were measured using peak force and RFD respectively using two HHDs (Lafayette Model-01165 and Hoggan microFET2) and a criterion-reference KinCom dynamometer. Statistical analysis of reliability and validity comprised intraclass correlation coefficients (ICC), Pearson correlations, concordance correlations, standard error of measurement, and minimal detectable change. Results Comparison of RFD methods revealed that a peak 200ms moving window algorithm provided optimal reliability results. Intra-rater, inter-rater, and inter-device reliability analysis of peak force and RFD revealed mostly good to excellent reliability (coefficients ≥ 0.70) for all muscle groups. Concurrent validity analysis showed moderate to excellent relationships between HHD and fixed dynamometry for the hip and knee (ICCs ≥ 0.70) for both peak force and RFD, with mostly poor to good results shown for the ankle muscles (ICCs = 0.31–0.79). Conclusions Hand-held dynamometry has good to excellent reliability and validity for most measures of isometric lower limb strength and power in a healthy population, particularly for proximal muscle groups. To aid implementation we have created freely available software to extract these variables from data stored on the Lafayette device. Future research should examine the reliability and validity of these variables in clinical populations. PMID:26509265
A New Criterion for Prediction of Hot Tearing Susceptibility of Cast Alloys

NASA Astrophysics Data System (ADS)

Nasresfahani, Mohamad Reza; Niroumand, Behzad

2014-08-01

A new criterion for prediction of hot tearing susceptibility of cast alloys is suggested which takes into account the effects of both important mechanical and metallurgical factors and is believed to be less sensitive to the presence of volume defects such as bifilms and inclusions. The criterion was validated by studying the hot tearing tendency of Al-Cu alloy. In conformity with the experimental results, the new criterion predicted reduction of hot tearing tendency with increasing the copper content.
The Myotonometer: Not a Valid Measurement Tool for Active Hamstring Musculotendinous Stiffness.

PubMed

Pamukoff, Derek N; Bell, Sarah E; Ryan, Eric D; Blackburn, J Troy

2016-05-01

Hamstring musculotendinous stiffness (MTS) is associated with lower-extremity injury risk (ie, hamstring strain, anterior cruciate ligament injury) and is commonly assessed using the damped oscillatory technique. However, despite a preponderance of studies that measure MTS reliably in laboratory settings, there are no valid clinical measurement tools. A valid clinical measurement technique is needed to assess MTS and permit identification of individuals at heightened risk of injury and track rehabilitation progress. To determine the validity and reliability of the Myotonometer for measuring active hamstring MTS. Descriptive laboratory study. Laboratory. 33 healthy participants (15 men, age 21.33 ± 2.94 y, height 172.03 ± 16.36 cm, mass 74.21 ± 16.36 kg). Hamstring MTS was assessed using the damped oscillatory technique and the Myotonometer. Intraclass correlations were used to determine the intrasession, intersession, and interrater reliability of the Myotonometer. Criterion validity was assessed via Pearson product-moment correlation between MTS measures obtained from the Myotonometer and from the damped oscillatory technique. The Myotonometer demonstrated good intrasession (ICC3,1 = .807) and interrater reliability (ICC2,k = .830) and moderate intersession reliability (ICC2,k = .693). However, it did not provide a valid measurement of MTS compared with the damped oscillatory technique (r = .346, P = .061). The Myotonometer does not provide a valid measure of active hamstring MTS. Although the Myotonometer does not measure active MTS, it possesses good reliability and portability and could be used clinically to measure tissue compliance, muscle tone, or spasticity associated with multiple musculoskeletal disorders. Future research should focus on portable and clinically applicable tools to measure active hamstring MTS in efforts to prevent and monitor injuries.
Reliability and validity of the Japanese version of the Community Integration Measure for community-dwelling people with schizophrenia.

PubMed

Shioda, Ai; Tadaka, Etsuko; Okochi, Ayako

2017-01-01

Community integration is an essential right for people with schizophrenia that affects their well-being and quality of life, but no valid instrument exists to measure it in Japan. The aim of the present study is to develop and evaluate the reliability and validity of the Japanese version of the Community Integration Measure (CIM) for people with schizophrenia. The Japanese version of the CIM was developed as a self-administered questionnaire based on the original version of the CIM, which was developed by McColl et al. This study of the Japanese CIM had a cross-sectional design. Construct validity was determined using a confirmatory factor analysis (CFA) and data from 291 community-dwelling people with schizophrenia in Japan. Internal consistency was calculated using Cronbach's alpha. The Lubben Social Network Scale (LSNS-6), the Rosenberg Self-Esteem Scale (RSE) and the UCLA Loneliness Scale, version 3 (UCLALS) were administered to assess the criterion-related validity of the Japanese version of the CIM. The participants were 263 people with schizophrenia who provided valid responses. The Cronbach's alpha was 0.87, and CFA identified one domain with ten items that demonstrated the following values: goodness of fit index = 0.924, adjusted goodness of fit index = 0.881, comparative fit index = 0.925, and root mean square error of approximation = 0.085. The correlation coefficients were 0.43 (p < 0.001) with the LSNS-6, 0.42 (p < 0.001) with the RSE, and -0.57 (p < 0.001) with the UCLALS. The Japanese version of the CIM demonstrated adequate reliability and validity for assessing community integration for people with schizophrenia in Japan.
Validity and reliability of Nike + Fuelband for estimating physical activity energy expenditure.

PubMed

Tucker, Wesley J; Bhammar, Dharini M; Sawyer, Brandon J; Buman, Matthew P; Gaesser, Glenn A

2015-01-01

The Nike + Fuelband is a commercially available, wrist-worn accelerometer used to track physical activity energy expenditure (PAEE) during exercise. However, validation studies assessing the accuracy of this device for estimating PAEE are lacking. Therefore, this study examined the validity and reliability of the Nike + Fuelband for estimating PAEE during physical activity in young adults. Secondarily, we compared PAEE estimation of the Nike + Fuelband with the previously validated SenseWear Armband (SWA). Twenty-four participants (n = 24) completed two, 60-min semi-structured routines consisting of sedentary/light-intensity, moderate-intensity, and vigorous-intensity physical activity. Participants wore a Nike + Fuelband and SWA, while oxygen uptake was measured continuously with an Oxycon Mobile (OM) metabolic measurement system (criterion). The Nike + Fuelband (ICC = 0.77) and SWA (ICC = 0.61) both demonstrated moderate to good validity. PAEE estimates provided by the Nike + Fuelband (246 ± 67 kcal) and SWA (238 ± 57 kcal) were not statistically different than OM (243 ± 67 kcal). Both devices also displayed similar mean absolute percent errors for PAEE estimates (Nike + Fuelband = 16 ± 13 %; SWA = 18 ± 18 %). Test-retest reliability for PAEE indicated good stability for Nike + Fuelband (ICC = 0.96) and SWA (ICC = 0.90). The Nike + Fuelband provided valid and reliable estimates of PAEE, that are similar to the previously validated SWA, during a routine that included approximately equal amounts of sedentary/light-, moderate- and vigorous-intensity physical activity.
Development of a Microsoft Excel tool for applying a factor retention criterion of a dimension coefficient to a survey on patient safety culture.

PubMed

Chien, Tsair-Wei; Shao, Yang; Jen, Dong-Hui

2017-10-27

Many quality-of-life studies have been conducted in healthcare settings, but few have used Microsoft Excel to incorporate Cronbach's α with dimension coefficient (DC) for describing a scale's characteristics. To present a computer module that can report a scale's validity, we manipulated datasets to verify a DC that can be used as a factor retention criterion for demonstrating its usefulness in a patient safety culture survey (PSC). Microsoft Excel Visual Basic for Applications was used to design a computer module for simulating 2000 datasets fitting the Rasch rating scale model. The datasets consisted of (i) five dual correlation coefficients (correl. = 0.3, 0.5, 0.7, 0.9, and 1.0) on two latent traits (i.e., true scores) following a normal distribution and responses to their respective 1/3 and 2/3 items in length; (ii) 20 scenarios of item lengths from 5 to 100; and (iii) 20 sample sizes from 50 to 1000. Each item containing 5-point polytomous responses was uniformly distributed in difficulty across a ± 2 logit range. Three methods (i.e., dimension interrelation ≥0.7, Horn's parallel analysis (PA) 95% confidence interval, and individual random eigenvalues) were used for determining one factor to retain. DC refers to the binary classification (1 as one factor and 0 as many factors) used for examining accuracy with the indicators sensitivity, specificity, and area under receiver operating characteristic curve (AUC). The scale's reliability and DC were simultaneously calculated for each simulative dataset. PSC real data were demonstrated with DC to interpret reports of the unit-based construct validity using the author-made MS Excel module. The DC method presented accurate sensitivity (=0.96), specificity (=0.92) with a DC criterion (≥0.70), and AUC (=0.98) that were higher than those of the two PA methods. PA combined with DC yielded good sensitivity (=0.96), specificity (=1.0) with a DC criterion (≥0.70), and AUC (=0.99). Advances in computer technology may enable healthcare users familiar with MS Excel to apply DC as a factor retention criterion for determining a scale's unidimensionality and evaluating a scale's quality.
A new scale for the assessment of performance and capacity of hand function in children with hemiplegic cerebral palsy: reliability and validity studies.

PubMed

Rosa-Rizzotto, M; Visonà Dalla Pozza, L; Corlatti, A; Luparia, A; Marchi, A; Molteni, F; Facchin, P; Pagliano, E; Fedrizzi, E

2014-10-01

In hemiplegic children, the recognition of the activity limitation pattern and the possibility of grading its severity are relevant for clinicians while planning interventions, monitoring results, predicting outcomes. Aim of the study is to examine the reliability and validity of Besta Scale, an instrument used to measure in hemiplegic children from 18 months to 12 years of age both grasp on request (capacity) and spontaneous use of upper limb (performance) in bimanual play activities and in ADL. Psychometric analysis of reliability and of validity of the Besta scale was performed. Outpatient study sample Reliability study: A sample of 39 patients was enrolled. The administration of Besta scale was video-recorded in a standardized manner. All videos were scored by 20 independent raters on subsequent viewing. 3 raters randomly selected from the 20-raters group rescored the same video two years later for intra-rater reliability. Intra and inter-rater reliability were calculated using Intraclass Correlation Coefficient (ICC) and Kendall's coefficient (K), respectively. Internal consistency reliability was assessed using Alpha's Chronbach coefficient. Validity study: a sample of 105 children was assessed 5 times (at t0 and 2, 3, 6 and 12 months later) by 20 independent raters. Each patient underwent at the same time to QUEST and Besta scale administration and assessment. Criterion validity was calculated using rho-Pearson coefficient. Reliability study: The inter-rater reliability calculated with Kendall's coefficient resulted moderate K=0.47. The intra-rater (or test-retest) reliability for 3 raters was excellent (ICC=0.927). The Cronbach's alpha for internal consistency was 0.972. Validity study: Besta scale showed a good criterion validity compared to QUEST increasing by age and severity of impairment. Rho Pearson's correlation coefficient r was 0.81 (P<0.0001). Limitations. Besta scales in infants finds hard to distinguish between mild to moderately impaired hand function. Besta scale scoring system is a valid and reliable tool, utilizable in a clinical setting to monitor evolution of unimanual and bimanual manipulation and to distinguish hand's capacity from performance.
Determination of Air Enthalpy Based on Meteorological Data as an Indicator for Heat Stress Assessment in Occupational Outdoor Environments, a Field Study in IRAN.

PubMed

Heidari, Hamidreza; Golbabaei, Farideh; Shamsipour, Aliakbar; Rahimi Forushani, Abbas; Gaeini, Abbasali

2016-01-01

Heat stress evaluation and timely notification, especially using meteorological data is an important issue attracted attention in recent years. Therefore, this study aimed at answering the following research questions: 1) can enthalpy as a common environmental parameter reported by meteorological agencies be applied accurately for evaluation of thermal condition of outdoor settings, and 2) if so, what is it's the best criterion to detect areas in stress or stress-free situations, separately. Nine climatic regions were selected throughout Iran covering a wide variety of climatic conditions like those, which exist around the world. Three types of parameters including measured (ta, RH, Pa and WBGT), estimated (metabolic rate and cloth thermal insulation), and calculated parameters (enthalpy and effective WBGT) were recorded for 1452 different situations. Enthalpy as a new indicator in this research was compared to WBGT in selected regions. Altogether, a good consistency was obtained between enthalpy and WBGT in selected regions (Kappa value: 0.815). Based on the good ROC curve obtained using MedCal software, the criterion of the values more than 74.24 for the new index was determined to explain heat stress situation for outdoor environments. Because of simplicity in measurement, applicability of the indicator for weather agencies, the consistency observed between enthalpy and a valid as well as accurate index (WBGT), sensor requirements which take only a few seconds to reach equilibrium and so on, enthalpy indicator can be introduced and applied as a good substitute for WBGT for outdoor settings.
Validity, responsiveness, and minimal clinically important difference of EQ-5D-5L in stroke patients undergoing rehabilitation.

PubMed

Chen, Poyu; Lin, Keh-Chung; Liing, Rong-Jiuan; Wu, Ching-Yi; Chen, Chia-Ling; Chang, Ku-Chou

2016-06-01

To examine the criterion validity, responsiveness, and minimal clinically important difference (MCID) of the EuroQoL 5-Dimensions Questionnaire (EQ-5D-5L) and visual analog scale (EQ-VAS) in people receiving rehabilitation after stroke. The EQ-5D-5L, along with four criterion measures-the Medical Research Council scales for muscle strength, the Fugl-Meyer assessment, the functional independence measure, and the Stroke Impact Scale-was administered to 65 patients with stroke before and after 3- to 4-week therapy. Criterion validity was estimated using the Spearman correlation coefficient. Responsiveness was analyzed by the effect size, standardized response mean (SRM), and criterion responsiveness. The MCID was determined by anchor-based and distribution-based approaches. The percentage of patients exceeding the MCID was also reported. Concurrent validity of the EQ-Index was better compared with the EQ-VAS. The EQ-Index has better power for predicting the rehabilitation outcome in the activities of daily living than other motor-related outcome measures. The EQ-Index was moderately responsive to change (SRM = 0.63), whereas the EQ-VAS was only mildly responsive to change. The MCID estimation of the EQ-Index (the percentage of patients exceeding the MCID) was 0.10 (33.8 %) and 0.10 (33.8 %) based on the anchor-based and distribution-based approaches, respectively, and the estimation of EQ-VAS was 8.61 (41.5 %) and 10.82 (32.3 %). The EQ-Index has shown reasonable concurrent validity, limited predictive validity, and acceptable responsiveness for detecting the health-related quality of life in stroke patients undergoing rehabilitation, but not for EQ-VAS. Future research considering different recovery stages after stroke is warranted to validate these estimations.
Measuring personality functioning in older adults: construct validity of the Severity Indices of Personality Functioning - Short Form (SIPP-SF).

PubMed

Rossi, Gina; Debast, Inge; van Alphen, S P J

2017-07-01

The dimensional personality disorders model in the Diagnostic and Statistical Manual (DSM)-5 section III conceptually differentiates impaired personality functioning (criterion A) from the presence of pathological traits (criterion B). This study is the first to specifically address the measurement of criterion A in older adults. Moreover, the convergent/divergent validity of criterion A and criterion B will be compared in younger and older age groups. The Severity Indices of Personality Functioning - Short Form (SIPP-SF) was administered in older (N = 171) and younger adults (N = 210). The factorial structure was analyzed with exploratory structural equation modeling. Differences in convergent/divergent validity between personality functioning (SIPP-SF) and pathological traits (Personality Inventory for DSM-5; Dimensional Assessment of Personality Pathology-Basic Questionnaire) were examined across age groups. Identity Integration, Relational Capacities, Responsibility, Self-Control, and Social Concordance were corroborated as higher order domains. Although the SIPP-SF domains measured unique variation, some high correlations with pathological traits referred to overlapping constructs. Moreover, in older adults, personality functioning was more strongly related to Psychoticism, Disinhibition, Antagonism and Dissocial Behavior compared to younger adults. The SIPP-SF construct validity was demonstrated in terms of a structure of five higher order domains of personality functioning. The instrument is promising as a possible measure of impaired personality functioning in older adults. As such, it is a useful clinical tool to follow up effects of therapy on levels of personality functioning. Moreover, traits were associated with different degrees of personality functioning across age groups.
Criterion-Referenced Testing for College-Level General Education: Some Problems and Recommendations.

ERIC Educational Resources Information Center

Benoist, Howard

1979-01-01

The adoption of a criterion-referenced assessment system and the resulting disadvantages of this form of evaluation for the college general education program are discussed, including problems in identifying assessment validation procedures. (RAO)
Technical Note: Approximate solution of transient drawdown for constant-flux pumping at a partially penetrating well in a radial two-zone confined aquifer

NASA Astrophysics Data System (ADS)

Huang, C.-S.; Yang, S.-Y.; Yeh, H.-D.

2015-03-01

An aquifer consisting of a skin zone and a formation zone is considered as a two-zone aquifer. Existing solutions for the problem of constant-flux pumping (CFP) in a two-zone confined aquifer involve laborious calculation. This study develops a new approximate solution for the problem based on a mathematical model including two steady-state flow equations with different hydraulic parameters for the skin and formation zones. A partially penetrating well may be treated as the Neumann condition with a known flux along the screened part and zero flux along the unscreened part. The aquifer domain is finite with an outer circle boundary treated as the Dirichlet condition. The steady-state drawdown solution of the model is derived by the finite Fourier cosine transform. Then, an approximate transient solution is developed by replacing the radius of the boundary in the steady-state solution with an analytical expression for a dimensionless time-dependent radius of influence. The approximate solution is capable of predicting good temporal drawdown distributions over the whole pumping period except at the early stage. A quantitative criterion for the validity of neglecting the vertical flow component due to a partially penetrating well is also provided. Conventional models considering radial flow without the vertical component for the CFP have good accuracy if satisfying the criterion.

Criterion Validity of the Child's Challenging Behavior Scale, Version 2 (CCBS-2).

PubMed

Bourke-Taylor, Helen M; Cordier, Reinie; Pallant, Julie F

The Child's Challenging Behavior Scale, Version 2 (CCBS-2), measures maternal rating of a child's challenging behaviors that compromise maternal mental health. The CCBS-2, the Child Behavior Checklist (CBCL), and the Strengths and Difficulties Questionnaire (SDQ) were compared in a sample of typically developing young Australian children. Criterion validity was investigated by correlating the CCBS-2 with "gold standard" measures (CBCL and SDQ subscales). Data were collected in a cross-sectional survey of mothers (N = 336) of children ages 3-9 yr. Correlations with the CBCL externalizing subscales demonstrated moderate (ρ = .46) to strong (ρ = .66) correlations. Correlations with the SDQ externalizing behaviors subscales were moderate (ρ = .35) to strong (ρ = .60). The criterion validity established in this study strengthens the psychometric properties that support ongoing development of the CCBS-2 as an efficient tool that may identify children in need of further evaluation. Copyright © 2018 by the American Occupational Therapy Association, Inc.
The Personality Inventory for DSM-5 Short Form (PID-5-SF): psychometric properties and association with big five traits and pathological beliefs in a Norwegian population.

PubMed

Thimm, Jens C; Jordan, Stian; Bach, Bo

2016-12-07

With the publication of the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5), an alternative model for personality disorders based on personality dysfunction and pathological personality traits was introduced. The Personality Inventory for DSM-5 (PID-5) is a 220-item self-report inventory designed to assess the personality traits of this model. Recently, a short 100-item version of the PID-5 (PID-5-SF) has been developed. The aim of this study was to investigate the score reliability and structure of the Norwegian PID-5-SF. Further, criterion validity with the five factor model of personality (FFM) and pathological personality beliefs was examined. A derivation sample of university students (N = 503) completed the PID-5, the Big Five Inventory (BFI), and the Personality Beliefs Questionnaire - Short Form (PBQ-SF), whereas a replication sample of 127 students completed the PID-5-SF along with the aforementioned measures. The short PID-5 showed overall good score reliability and structural validity. The associations with FFM traits and pathological personality beliefs were conceptually coherent and similar for the two forms of the PID-5. The results suggest that the Norwegian PID-5 short form is a reliable and efficient measure of the trait criterion of the alternative model for personality disorders in DSM-5.
The Revised Child Anxiety and Depression Scale 25-Parent Version: Scale Development and Validation in a School-Based and Clinical Sample.

PubMed

Ebesutani, Chad; Korathu-Larson, Priya; Nakamura, Brad J; Higa-McMillan, Charmaine; Chorpita, Bruce

2017-09-01

To help facilitate the dissemination and implementation of evidence-based assessment practices, we examined the psychometric properties of the shortened 25-item version of the Revised Child Anxiety and Depression Scale-parent report (RCADS-25-P), which was based on the same items as the previously published shortened 25-item child version. We used two independent samples of youth-a school sample ( N = 967, Grades 3-12) and clinical sample ( N = 433; 6-18 years)-to examine the factor structure, reliability, and validity of the RCADS-25-P scale scores. Results revealed that the two-factor structure (i.e., depression and broad anxiety factor) fit the data well in both the school and clinical sample. All reliability estimates, including test-retest indices, exceeded benchmark for good reliability. In the school sample, the RCADS-25-P scale scores converged significantly with related criterion measures and diverged with nonrelated criterion measures. In the clinical sample, the RCADS-25-P scale scores successfully discriminated between those with and without target problem diagnoses. In both samples, child-parent agreement indices were in the expected ranges. Normative data were also reported. The RCADS-25-P thus demonstrated robust psychometric properties across both a school and clinical sample as an effective brief screening instrument to assess for depression and anxiety in children and adolescents.
Development and content validity of a screening instrument for gaming addiction in adolescents: the Gaming Addiction Identification Test (GAIT).

PubMed

Vadlin, Sofia; Åslund, Cecilia; Nilsson, Kent W

2015-08-01

This study describes the development of a screening tool for gaming addiction in adolescents - the Gaming Addiction Identification Test (GAIT). Its development was based on the research literature on gaming and addiction. An expert panel comprising professional raters (n = 7), experiential adolescent raters (n = 10), and parent raters (n = 10) estimated the content validity of each item (I-CVI) as well as of the whole scale (S-CVI/Ave), and participated in a cognitive interview about the GAIT scale. The mean scores for both I-CVI and S-CVI/Ave ranged between 0.97 and 0.99 compared with the lowest recommended I-CVI value of 0.78 and the S-CVI/Ave value of 0.90. There were no sex differences and no differences between expert groups regarding ratings in content validity. No differences in the overall evaluation of the scale emerged in the cognitive interviews. Our conclusions were that GAIT showed good content validity in capturing gaming addiction. The GAIT needs further investigation into its psychometric properties of construct validity (convergent and divergent validity) and criterion-related validity, as well as its reliability in both clinical settings and in community settings with adolescents. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
Psychological Flexibility of Nurses in a Cancer Hospital: Preliminary Validation of a Chinese Version of the Work-related Acceptance and Action Questionnaire.

PubMed

Xu, Xianghua; Liu, Xiangyu; Ou, Meijun; Xie, Chanjuan; Chen, Yongyi

2018-01-01

To translate the English work-related acceptance and action questionnaire (WAAQ), make cross-cultural adaptations, and examine its psychometric properties when used by Chinese oncology nurses. After translation, the psychometric properties of the Chinese WAAQ were analyzed among 417 nurses, and content validity was determined by six experts. Item-level content validity index (CVI) values were between 0.83 and 1.00; scale-level CVI/universal agreement (S-CVI/UA) and S-CVI/average were 0.86 and 0.98, respectively, which implicated a good content validity. The correlation of the Chinese WAAQ with AAQ-II ( r s = -0.247, P < 0.001) suggested criterion validity, and those with General Health Questionnaire-12 (-0.250, <0.001) and general self-efficacy scale (0.491, <0.001) and Utrecht work engagement scale (UWES) (0.439, <0.001) suggested convergent validity. Exploratory factor analysis identified a seven-item, one-factor structure of WAAQ. The Chinese version of WAAQ had high internal consistency (Cronbach's α = 0.920), with an item-total correlation coefficient of 0.702-0.828 ( P < 0.05), split-half reliability of 0.933, and test-retest reliability of 0.772. The Chinese WAAQ is a reliable and valid tool for assessing psychological flexibility in Chinese oncology nurses.
Development and psychometric testing of the Cancer Knowledge Scale for Elders.

PubMed

Su, Ching-Ching; Chen, Yuh-Min; Kuo, Bo-Jein

2009-03-01

To develop the Cancer Knowledge Scale for Elders and test its validity and reliability. The number of elders suffering from cancer is increasing. To facilitate cancer prevention behaviours among elders, they shall be educated about cancer-related knowledge. Prior to designing a programme that would respond to the special needs of elders, understanding the cancer-related knowledge within this population was necessary. However, extensive review of the literature revealed a lack of appropriate instruments for measuring cancer-related knowledge. A valid and reliable cancer knowledge scale for elders is necessary. A non-experimental methodological design was used to test the psychometric properties of the Cancer Knowledge Scale for Elders. Item analysis was first performed to screen out items that had low corrected item-total correlation coefficients. Construct validity was examined with a principle component method of exploratory factor analysis. Cancer-related health behaviour was used as the criterion variable to evaluate criterion-related validity. Internal consistency reliability was assessed by the KR-20. Stability was determined by two-week test-retest reliability. The factor analysis yielded a four-factor solution accounting for 49.5% of the variance. For criterion-related validity, cancer knowledge was positively correlated with cancer-related health behaviour (r = 0.78, p < 0.001). The KR-20 coefficients of each factor were 0.85, 0.76, 0.79 and 0.67 and 0.87 for the total scale. Test-retest reliability over a two-week period was 0.83 (p < 0.001). This study provides evidence for content validity, construct validity, criterion-related validity, internal consistency and stability of the Cancer Knowledge Scale for Elders. The results show that this scale is an easy-to-use instrument for elders and has adequate validity and reliability. The scale can be used as an assessment instrument when implementing cancer education programmes for elders. It can also be used to evaluate the effects of education programmes.
Reliability and validity of a tool to measure the severity of tongue thrust in children: the Tongue Thrust Rating Scale.

PubMed

Serel Arslan, S; Demir, N; Karaduman, A A

2017-02-01

This study aimed to develop a scale called Tongue Thrust Rating Scale (TTRS), which categorised tongue thrust in children in terms of its severity during swallowing, and to investigate its validity and reliability. The study describes the developmental phase of the TTRS and presented its content and criterion-based validity and interobserver and intra-observer reliability. For content validation, seven experts assessed the steps in the scale over two Delphi rounds. Two physical therapists evaluated videos of 50 children with cerebral palsy (mean age, 57·9 ± 16·8 months), using the TTRS to test criterion-based validity, interobserver and intra-observer reliability. The Karaduman Chewing Performance Scale (KCPS) and Drooling Severity and Frequency Scale (DSFS) were used for criterion-based validity. All the TTRS steps were deemed necessary. The content validity index was 0·857. A very strong positive correlation was found between two examinations by one physical therapist, which indicated intra-observer reliability (r = 0·938, P < 0·001). A very strong positive correlation was also found between the TTRS scores of two physical therapists, indicating interobserver reliability (r = 0·892, P < 0·001). There was also a strong positive correlation between the TTRS and KCPS (r = 0·724, P < 0·001) and a very strong positive correlation between the TTRS scores and DSFS (r = 0·822 and r = 0·755; P < 0·001). These results demonstrated the criterion-based validity of the TTRS. The TTRS is a valid, reliable and clinically easy-to-use functional instrument to document the severity of tongue thrust in children. © 2016 John Wiley & Sons Ltd.
Turkish Version of Kolcaba's Immobilization Comfort Questionnaire: A Validity and Reliability Study.

PubMed

Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra

2015-12-01

The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
Reliability and Validity of the Korean Version of the Internet Addiction Test among College Students

PubMed Central

Lee, Kounseok; Lee, Hye-Kyung; Gyeong, Hyunsu; Yu, Byeongkwan; Song, Yul-Mai

2013-01-01

We developed a Korean translation of the Internet Addiction Test (KIAT), widely used self-report for internet addiction and tested its reliability and validity in a sample of college students. Two hundred seventy-nine college students at a national university completed the KIAT. Internal consistency and two week test-retest reliability were calculated from the data, and principal component factor analysis was conducted. Participants also completed the Internet Addiction Diagnostic Questionnaire (IADQ), the Korea Internet addiction scale (K-scale), and the Patient Health Questionnaire-9 for the criterion validity. Cronbach's alpha of the whole scale was 0.91, and test-retest reliability was also good (r = 0.73). The IADQ, the K-scale, and depressive symptoms were significantly correlated with the KIAT scores, demonstrating concurrent and convergent validity. The factor analysis extracted four factors (Excessive use, Dependence, Withdrawal, and Avoidance of reality) that accounted for 59% of total variance. The KIAT has outstanding internal consistency and high test-retest reliability. Also, the factor structure and validity data show that the KIAT is comparable to the original version. Thus, the KIAT is a psychometrically sound tool for assessing internet addiction in the Korean-speaking population. PMID:23678270
British isles lupus assessment group 2004 index is valid for assessment of disease activity in systemic lupus erythematosus

PubMed Central

Yee, Chee-Seng; Farewell, Vernon; Isenberg, David A; Rahman, Anisur; Teh, Lee-Suan; Griffiths, Bridget; Bruce, Ian N; Ahmad, Yasmeen; Prabu, Athiveeraramapandian; Akil, Mohammed; McHugh, Neil; D'Cruz, David; Khamashta, Munther A; Maddison, Peter; Gordon, Caroline

2007-01-01

Objective To determine the construct and criterion validity of the British Isles Lupus Assessment Group 2004 (BILAG-2004) index for assessing disease activity in systemic lupus erythematosus (SLE). Methods Patients with SLE were recruited into a multicenter cross-sectional study. Data on SLE disease activity (scores on the BILAG-2004 index, Classic BILAG index, and Systemic Lupus Erythematosus Disease Activity Index 2000 [SLEDAI-2K]), investigations, and therapy were collected. Overall BILAG-2004 and overall Classic BILAG scores were determined by the highest score achieved in any of the individual systems in the respective index. Erythrocyte sedimentation rates (ESRs), C3 levels, C4 levels, anti–double-stranded DNA (anti-dsDNA) levels, and SLEDAI-2K scores were used in the analysis of construct validity, and increase in therapy was used as the criterion for active disease in the analysis of criterion validity. Statistical analyses were performed using ordinal logistic regression for construct validity and logistic regression for criterion validity. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Results Of the 369 patients with SLE, 92.7% were women, 59.9% were white, 18.4% were Afro-Caribbean and 18.4% were South Asian. Their mean ± SD age was 41.6 ± 13.2 years and mean disease duration was 8.8 ± 7.7 years. More than 1 assessment was obtained on 88.6% of the patients, and a total of 1,510 assessments were obtained. Increasing overall scores on the BILAG-2004 index were associated with increasing ESRs, decreasing C3 levels, decreasing C4 levels, elevated anti-dsDNA levels, and increasing SLEDAI-2K scores (all P < 0.01). Increase in therapy was observed more frequently in patients with overall BILAG-2004 scores reflecting higher disease activity. Scores indicating active disease (overall BILAG-2004 scores of A and B) were significantly associated with increase in therapy (odds ratio [OR] 19.3, P < 0.01). The BILAG-2004 and Classic BILAG indices had comparable sensitivity, specificity, PPV, and NPV. Conclusion These findings show that the BILAG-2004 index has construct and criterion validity. PMID:18050213
[Validity and Reliability of Korean Version of the Spiritual Care Competence Scale].

PubMed

Chung, Mi Ja; Park, Youngrye; Eun, Young

2016-12-01

The aim of this study was to examine the validity and reliability of the Korean Version of the Spiritual Care Competence Scale (K-SCCS). A cross-sectional study design was used. The K-SCCS consisted of 26 questions to measure spiritual care competence of nurses. Participants, 228 nurses who had more than 3 years'experience as a nurse, completed the survey. Confirmatory factor analysis was used to examine the construct validity and correlations of K-SCCS and spiritual well-being (SWB) were used to examine the criterion validity of K-SCCS. Cronbach's alpha was used to test internal consistency. The construct and the criterion-related validity of K-SCCS were supported as measures of spiritual care competence. Cronbach's alpha was .95. Factor loadings of the 26 questions ranged from .60 to .96. Construct validity of K-SCCS was verified by confirmatory factor analysis (RMSEA=.08, CFI=.90, NFI=.85). Criterion validity compared to the SWB showed significant correlation (r=.44, p<.001). The findings suggest that K-SCCS serves as an appropriate measure of spiritual care competence with validity and reliability. However, further study is needed to retest the verification of the factor analysis related to factor 2 (professionalisation and improving the quality of spiritual care) and factor 3 (personal support and patient counseling). Therefore, we recommend using the total score without distinguishing subscales.
Effects of positive impression management on the NEO Personality Inventory--Revised in a clinical population.

PubMed

Ballenger, J F; Caldwell-Andrews, A; Baer, R A

2001-06-01

Sixty adults in outpatient psychotherapy completed the NEO Personality Inventory--Revised (NEO PI-R, P. T. Costa & R. R. McCrae, 1992a). Half were instructed to fake good and half were given standard instructions. All completed the Interpersonal Adjective Scale--Revised, Big Five (J. S. Wiggins & P. D. Trapnell, 1997) under standard instructions, and their therapists completed the observer rating form of the NEO Five-Factor Inventory. A comparison group of 30 students completed the NEO PI-R under standard instructions. Standard and fake-good participants obtained significantly different NEO PI-R domain scores. Correlations between the NEO PI-R and criterion measures were significantly lower for faking than for standard patients. Validity scales for the NEO PI-R (J. A. Schinka, B. N. Kinder, & T. Kremer, 1997) were moderately accurate in discriminating faking from standard patients, but were only marginally accurate in discriminating faking patients from students.
Validity, reliability and responsiveness of a short version of the Stroke-Specific Quality of Life Scale in patients receiving rehabilitation.

PubMed

Chen, Hui-fang; Wu, Ching-yi; Lin, Keh-chung; Li, Ming-wei; Yu, Hung-wen

2012-07-01

To examine the measurement properties of a short version of the Stroke-Specific Quality of Life Scale (SS-QoL-12). Self-report survey of patients with mild to moderate upper extremity dysfunction. A total of 126 patients provided 252 observations before and after treatment. The construct validity and reliability was examined using the Rasch model; the concurrent and predictive validity was estimated using Spearman's rank correlation coefficients. Paired t-test and the standardized response mean (SRM) were performed to estimate the responsiveness of the SS-QoL-12. The 2-factor model (psychosocial and physical domains) fit the data better with smaller deviances. All but 1 item showed acceptable fit, and no item biases were detected. The reliability of the subscales and the whole scale ranged from 0.67 to 0.99. The total score showed fair correlations with the criterion measures at pretreatment (ρ = 0.28-0.40) and fair to good correlations at post-treatment (ρ = 0.39-0.54). The subscales had low to fair correlations at pretreatment (ρ = 0.19-0.49) and fair to good correlations at post-treatment (ρ = 0.31-0.56). The total and the subscales had low to good predictions at baseline (ρ = 0.22-0.52). The whole scale and the psychosocial subscale were mildly responsive to change (SRM = 0.22), but the physical subscale was not responsive to change (SRM = 0.08). The SS-QoL-12 has acceptable to good measurement properties, with an advantage of requiring less time to administer than other scales. The use of the subscale and total scores depends on the purpose of research. Future studies should recruit stroke patients with a broad range of dysfunction and use a large sample size to validate the findings.
Development and Validation of Triarchic Psychopathy Scales from the Multidimensional Personality Questionnaire

PubMed Central

Brislin, Sarah J.; Drislane, Laura E.; Smith, Shannon Toney; Edens, John F.; Patrick, Christopher J.

2015-01-01

Psychopathy is conceptualized by the triarchic model as encompassing three distinct phenotypic constructs: boldness, meanness, and disinhibition. In the current study, the Multidimensional Personality Questionnaire (MPQ), a normal-range personality measure, was evaluated for representation of these three constructs. Consensus ratings were used to identify MPQ items most related to each triarchic (Tri) construct. Scale measures were developed from items indicative of each construct, and scores for these scales were evaluated for convergent and discriminant validity in community (N = 176) and incarcerated samples (N = 240). A cross the two samples, MPQ-Tri scale scores demonstrated good internal consistencies and relationships with criterion measures of various types consistent with predictions based on the triarchic model. Findings are discussed in terms of their implications for further investigation of the triarchic model constructs in preexisting datasets that include the MPQ, in particular longitudinal and genetically informative datasets. PMID:25642934
Quality of Life after Brain Injury (QOLIBRI) Overall Scale for patients after aneurysmal subarachnoid hemorrhage.

PubMed

Wong, George Kwok Chu; Lam, Sandy Wai; Ngai, Karine; Wong, Adrian; Mok, Vincent; Poon, Wai Sang

2014-06-01

The Quality of Life after Brain Injury Overall Scale (QOLIBRI-OS) is a recently developed instrument that provides a brief summary measure of health-related quality of life (HRQoL) in domains typically affected by brain injury. This study examined the application of the six item QOLIBRI-OS in patients after aneurysmal subarachnoid hemorrhage (aSAH). Hong Kong Chinese aSAH patients were evaluated prospectively within the chronic phase of 1 year after aSAH in this multi-center observational study. Cronbach's α was 0.88, and correlations were satisfactory for all six items. QOLIBRI-OS demonstrated good criterion validity with other 1 year outcome assessments. In conclusion, QOLIBRI-OS can be used as a brief index for disease-specific HRQoL assessment after aSAH. Further validation in another population of aSAH patients is recommended. Copyright © 2013 Elsevier Ltd. All rights reserved.
Evaluating Maintenance Performance: The Development of Graphic Symbolic Substitutes for Criterion Referenced Job Task Performance Tests for Electronic Maintenance. Final Report.

ERIC Educational Resources Information Center

Shriver, Edgar L.; Foley, John P., Jr.

A battery of criterion referenced Job Task Performance Tests (JTPT) was developed because paper and pencil tests of job knowledge and electronic theory had very poor criterion-related or empirical validity with respect to the ability of electronic maintenance men to perform their job. Although the original JTPT required the use of actual…
Ten Issues in Criterion-Referenced Testing: A Response to Commonly Heard Criticisms.

ERIC Educational Resources Information Center

Curlette, William L.; Stallings, William M.

1979-01-01

The 10 criticisms of criterion-referenced tests addressed in this paper are: the domains tested; pedagogical influence; difficulty of items; cumbersome reports; reliability; arbitrary criteria; local objectives; labeling; predictive validity; and repeated testing. (SJL)
Procedures for Constructing and Using Criterion-Referenced Performance Tests.

ERIC Educational Resources Information Center

Campbell, Clifton P.; Allender, Bill R.

1988-01-01

Criterion-referenced performance tests (CRPT) provide a realistic method for objectively measuring task proficiency against predetermined attainment standards. This article explains the procedures of constructing, validating, and scoring CRPTs and includes a checklist for a welding test. (JOW)
The brief multidimensional students' life satisfaction scale-college version.

PubMed

Zullig, Keith J; Huebner, E Scott; Patton, Jon M; Murray, Karen A

2009-01-01

To investigate the psychometric properties of the BMSLSS-College among 723 college students. Internal consistency estimates explored scale reliability, factor analysis explored construct validity, and known-groups validity was assessed using the National College Youth Risk Behavior Survey and Harvard School of Public Health College Alcohol Study. Criterion-related validity was explored through analyses with the CDC's health-related quality of life scale and a social isolation scale. Acceptable internal consistency reliability, construct, known-groups, and criterion-related validity were established. Findings offer preliminary support for the BMSLSS-C; it could be useful in large-scale research studies, applied screening contexts, and for program evaluation purposes toward achieving Healthy People 2010 objectives.
Validation of cross-cultural child mental health and psychosocial research instruments: adapting the Depression Self-Rating Scale and Child PTSD Symptom Scale in Nepal

PubMed Central

2011-01-01

Background The lack of culturally adapted and validated instruments for child mental health and psychosocial support in low and middle-income countries is a barrier to assessing prevalence of mental health problems, evaluating interventions, and determining program cost-effectiveness. Alternative procedures are needed to validate instruments in these settings. Methods Six criteria are proposed to evaluate cross-cultural validity of child mental health instruments: (i) purpose of instrument, (ii) construct measured, (iii) contents of construct, (iv) local idioms employed, (v) structure of response sets, and (vi) comparison with other measurable phenomena. These criteria are applied to transcultural translation and alternative validation for the Depression Self-Rating Scale (DSRS) and Child PTSD Symptom Scale (CPSS) in Nepal, which recently suffered a decade of war including conscription of child soldiers and widespread displacement of youth. Transcultural translation was conducted with Nepali mental health professionals and six focus groups with children (n = 64) aged 11-15 years old. Because of the lack of child mental health professionals in Nepal, a psychosocial counselor performed an alternative validation procedure using psychosocial functioning as a criterion for intervention. The validation sample was 162 children (11-14 years old). The Kiddie-Schedule for Affective Disorders and Schizophrenia (K-SADS) and Global Assessment of Psychosocial Disability (GAPD) were used to derive indication for treatment as the external criterion. Results The instruments displayed moderate to good psychometric properties: DSRS (area under the curve (AUC) = 0.82, sensitivity = 0.71, specificity = 0.81, cutoff score ≥ 14); CPSS (AUC = 0.77, sensitivity = 0.68, specificity = 0.73, cutoff score ≥ 20). The DSRS items with significant discriminant validity were "having energy to complete daily activities" (DSRS.7), "feeling that life is not worth living" (DSRS.10), and "feeling lonely" (DSRS.15). The CPSS items with significant discriminant validity were nightmares (CPSS.2), flashbacks (CPSS.3), traumatic amnesia (CPSS.8), feelings of a foreshortened future (CPSS.12), and easily irritated at small matters (CPSS.14). Conclusions Transcultural translation and alternative validation feasibly can be performed in low clinical resource settings through task-shifting the validation process to trained mental health paraprofessionals using structured interviews. This process is helpful to evaluate cost-effectiveness of psychosocial interventions. PMID:21816045

Psychometric properties of the World Health Organization quality of life assessment – brief in methadone patients: a validation study in northern Taiwan

PubMed Central

2013-01-01

Background Quality of life (QOL) is an important outcome measure in the treatment of heroin addiction. The Taiwan version of the World Health Organization Quality of Life assessment (WHOQOL-BREF [TW]) has been developed and studied in various groups, but not specifically in a population of injection drug users. The aim of this study was to analyze the psychometric properties of the WHOQOL-BREF (TW) in a sample of injection drug users undergoing methadone maintenance treatment. Methods A total of 553 participants were interviewed and completed the instrument. Item-response distributions, internal consistency, corrected item-domain correlation, criterion-related validity, and construct validity through confirmatory factor analysis were evaluated. Results The frequency distribution of the 4 domains of the WHOQOL-BREF (TW) showed no floor or ceiling effects. The instrument demonstrated adequate internal consistency (Cronbach’s alpha coefficients were higher than 0.7 across the 4 domains) and all items had acceptable correlation with the corresponding domain scores (r = 0.32-0.73). Correlations (p < 0.01) of the 4 domains with the 2 benchmark items assessing overall QOL and general health were supportive of criterion-related validity. Confirmatory factor analysis yielded marginal goodness-of-fit between the 4-domain model and the sample data. Conclusions The hypothesized WHOQOL-BREF measurement model was appropriate for the injection drug users after some adjustments. Despite different patterns found in the confirmatory factor analysis, the findings overall suggest that the WHOQOL-BREF (TW) is a reliable and valid measure of QOL among injection drug users and can be utilized in future treatment outcome studies. The factor structure provided by the study also helps to understand the QOL characteristics of the injection drug users in Taiwan. However, more research is needed to examine its test-retest reliability and sensitivity to changes due to treatment. PMID:24325611
The psychometric properties of an Iranian translation of the Work Ability Index (WAI) questionnaire.

PubMed

Abdolalizadeh, M; Arastoo, A A; Ghsemzadeh, R; Montazeri, A; Ahmadi, K; Azizi, A

2012-09-01

This study was carried out to evaluate the psychometric properties of an Iranian translation of the Work Ability Index (WAI) questionnaire. In this methodological study, nurses and healthcare workers aged 40 years and older who worked in educational hospitals in Ahvaz (236 workers) in 2010, completed the questionnaire and 60 of the workers filled out the WAI questionnaire for the second time to ensure test-retest reliability. Forward-backward method was applied to translate the questionnaire from English into Persian. The psychometric properties of the Iranian translation of the WAI were assessed using the fallowing tests: Internal consistency (to test reliability), test-retest analysis, exploratory factor analysis (construct validity), discriminate validity by comparing the mean WAI score in two groups of the employees that had different levels of sick leave, criterion validity by determining the correlation between the Persian version of short form health survey (SF-36) and WAI score. Cronbach's alpha coefficient was estimated to be 0.79 and it was concluded that the internal consistency was high enough. The intraclass correlation coefficient was recognized to be 0.92. Factor analysis indicated three factors in the structure of the work ability including self-perceived work ability (24.5% of the variance), mental resources (22.23% of the variance), and presence of disease and health related limitation (18.55% of the variance). Statistical tests showed that this questionnaire was capable of discriminating two groups of employees who had different levels of sick leave. Criterion validity analysis showed that this instrument and all dimensions of the Iranian version of SF-36 were correlated significantly. Item correlation corrective for overlap showed the items tests had a good correlation except for one. The finding of the study showed that the Iranian version of the WAI is a reliable and valid measure of work ability and can be used both in research and practical activities.
The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

PubMed

Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

2018-04-12

To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (<5%) with values ranging from 1.7 to 9.5% across measures. Total time (41.63±2.05s) during the Net-Test possessed low CV and significant (p<0.05) correlations with 10m sprint time (1.98±0.12s; CV=4.4%, r=0.72), 20m sprint time (3.38±0.19s; CV=3.9%, r=0.79), 505 Change-of-Direction time (2.47±0.08s; CV=2.0%, r=0.80); and maximum oxygen uptake (46.59±2.58 mLkg -1 min -1 ; CV=4.5%, r=-0.66). The Net-Test possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
A New Multiaxial High-Cycle Fatigue Criterion Based on the Critical Plane for Ductile and Brittle Materials

NASA Astrophysics Data System (ADS)

Wang, Cong; Shang, De-Guang; Wang, Xiao-Wei

2015-02-01

An improved high-cycle multiaxial fatigue criterion based on the critical plane was proposed in this paper. The critical plane was defined as the plane of maximum shear stress (MSS) in the proposed multiaxial fatigue criterion, which is different from the traditional critical plane based on the MSS amplitude. The proposed criterion was extended as a fatigue life prediction model that can be applicable for ductile and brittle materials. The fatigue life prediction model based on the proposed high-cycle multiaxial fatigue criterion was validated with experimental results obtained from the test of 7075-T651 aluminum alloy and some references.
A new responder criterion (relative effect per patient (REPP) > 0.2) externally validated in a large total hip replacement multicenter cohort (EUROHIP).

PubMed

Huber, J; Hüsler, J; Dieppe, P; Günther, K P; Dreinhöfer, K; Judge, A

2016-03-01

To validate a new method to identify responders (relative effect per patient (REPP) >0.2) using the OMERACT-OARSI criteria as gold standard in a large multicentre sample. The REPP ([score before - after treatment]/score before treatment) was calculated for 845 patients of a large multicenter European cohort study for THR. The patients with a REPP >0.2 were defined as responders. The responder rate was compared to the gold standard (OMERACT-OARSI criteria) using receiver operator characteristic (ROC) curve analysis for sensitivity, specificity and percentage of appropriately classified patients. With the criterion REPP>0.2 85.4% of the patients were classified as responders, applying the OARSI-OMERACT criteria 85.7%. The new method had 98.8% sensitivity, 94.2% specificity and 98.1% of the patients were correctly classified compared to the gold standard. The external validation showed a high sensitivity and also specificity of a new criterion to identify a responder compared to the gold standard method. It is simple and has no uncertainties due to a single classification criterion. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.
Bayesian cross-validation for model evaluation and selection, with application to the North American Breeding Bird Survey

USGS Publications Warehouse

Link, William; Sauer, John R.

2016-01-01

The analysis of ecological data has changed in two important ways over the last 15 years. The development and easy availability of Bayesian computational methods has allowed and encouraged the fitting of complex hierarchical models. At the same time, there has been increasing emphasis on acknowledging and accounting for model uncertainty. Unfortunately, the ability to fit complex models has outstripped the development of tools for model selection and model evaluation: familiar model selection tools such as Akaike's information criterion and the deviance information criterion are widely known to be inadequate for hierarchical models. In addition, little attention has been paid to the evaluation of model adequacy in context of hierarchical modeling, i.e., to the evaluation of fit for a single model. In this paper, we describe Bayesian cross-validation, which provides tools for model selection and evaluation. We describe the Bayesian predictive information criterion and a Bayesian approximation to the BPIC known as the Watanabe-Akaike information criterion. We illustrate the use of these tools for model selection, and the use of Bayesian cross-validation as a tool for model evaluation, using three large data sets from the North American Breeding Bird Survey.
Reliability and Criterion Validity of a Novel Clinical Test of Simple and Complex Reaction Time in Athletes1

PubMed Central

Eckner, James T.; Richardson, James K.; Kim, Hogene; Joshi, Monica S.; Oh, Youkeun K.; Ashton-Miller, James A.

2015-01-01

Summary Slowed reaction time (RT) represents both a risk factor for and a consequence of sport concussion. The purpose of this study was to determine the reliability and criterion validity of a novel clinical test of simple and complex RT, called RTclin, in contact sport athletes. Both tasks were adapted from the well-known ruler drop test of RT and involve manually grasping a falling vertical shaft upon its release, with the complex task employing a go/no-go paradigm based on a slight cue. In 46 healthy contact sport athletes (24 males; M = 16.3 yr., SD = 5.0; 22 women: M age= 15.0 yr., SD = 4.0) whose sports included soccer, ice hockey, American football, martial arts, wrestling, and lacrosse, the latency and accuracy of simple and complex RTclin had acceptable test-retest and inter-rater reliabilities and correlated with a computerized criterion standard, the Axon Computerized Cognitive Assessment Tool. Medium to large effect sizes were found. The novel RTclin tests have acceptable reliability and criterion validity for clinical use and hold promise as concussion assessment tools. PMID:26106803
Output-Based Structural Damage Detection by Using Correlation Analysis Together with Transmissibility

PubMed Central

Cao, Hongyou; Liu, Quanmin; Wahab, Magd Abdel

2017-01-01

Output-based structural damage detection is becoming increasingly appealing due to its potential in real engineering applications without any restriction regarding excitation measurements. A new transmissibility-based damage detection approach is presented in this study by combining transmissibility with correlation analysis in order to strengthen its performance in discriminating damaged from undamaged scenarios. From this perspective, damage detection strategies are hereafter established by constructing damage-sensitive indicators from a derived transmissibility. A cantilever beam is numerically analyzed to verify the feasibility of the proposed damage detection procedure, and an ASCE (American Society of Civil Engineers) benchmark is henceforth used in the validation for its application in engineering structures. The results of both studies reveal a good performance of the proposed methodology in identifying damaged states from intact states. The comparison between the proposed indicator and the existing indicator also affirms its applicability in damage detection, which might be adopted in further structural health monitoring systems as a discrimination criterion. This study contributed an alternative criterion for transmissibility-based damage detection in addition to the conventional ones. PMID:28773218
29 CFR 1607.5 - General standards for validity studies.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 29 Labor 4 2010-07-01 2010-07-01 false General standards for validity studies. 1607.5 Section 1607... studies. A. Acceptable types of validity studies. For the purposes of satisfying these guidelines, users may rely upon criterion-related validity studies, content validity studies or construct validity...
Validation of the Intrinsic Spirituality Scale (ISS) with Muslims.

PubMed

Hodge, David R; Zidan, Tarek; Husain, Altaf

2015-12-01

This study validates an existing spirituality measure--the intrinsic spirituality scale (ISS)--for use with Muslims in the United States. A confirmatory factor analysis was conducted with a diverse sample of self-identified Muslims (N = 281). Validity and reliability were assessed along with criterion and concurrent validity. The measurement model fit the data well, normed χ2 = 2.50, CFI = 0.99, RMSEA = 0.07, and SRMR = 0.02. All 6 items that comprise the ISS demonstrated satisfactory levels of validity (λ > .70) and reliability (R2 > .50). The Cronbach's alpha obtained with the present sample was .93. Appropriate correlations with theoretically linked constructs demonstrated criterion and concurrent validity. The results suggest the ISS is a valid measure of spirituality in clinical settings with the rapidly growing Muslim population. The ISS may, for instance, provide an efficient screening tool to identify Muslims that are particularly likely to benefit from spiritually accommodative treatments. (c) 2015 APA, all rights reserved).
The Reliability and Validity of the Self-Reported Drinking Measures in the Army’s Health Risk Appraisal Survey

PubMed Central

Bell, Nicole S.; Williams, Jeffrey O.; Senier, Laura; Strowman, Shelley R.; Amoroso, Paul J.

2007-01-01

Background The reliability and validity of self-reported drinking behaviors from the Army Health Risk Appraisal (HRA) survey are unknown. Methods We compared demographics and health experiences of those who completed the HRA with those who did not (1991–1998). We also evaluated the reliability and validity of eight HRA alcohol-related items, including the CAGE, weekly drinking quantity, and drinking and driving measures. We used Cohen’s κ and Pearson’s r to assess reliability and convergent validity. To assess criterion (predictive) validity, we used proportional hazards and logistical regression models predicting alcohol-related hospitalizations and alcohol-related separations from the Army, respectively. Results A total of 404,966 soldiers completed an HRA. No particular demographic group seems to be over- or underrepresented. Although few respondents skipped alcohol items, those who did tended to be older and of minority race. The alcohol items demonstrate a reasonable degree of reliability, with Cronbach’s α = 0.69 and test-retest reliability associations in the 0.75–0.80 range for most items over 2- to 30-day interims between surveys. The alcohol measures showed good criterion-related validity: those consuming more than 21 drinks per week were at 6 times the risk for subsequent alcohol-related hospitalization versus those who abstained from drinking (hazard ratio, 6.36; 95% confidence interval=5.79, 6.99). Those who said their friends worried about their drinking were almost 5 times more likely to be discharged due to alcoholism (risk ratio, 4.9; 95% confidence interval=4.00, 6.04) and 6 times more likely to experience an alcohol-related hospitalization (hazard ratio, 6.24; 95% confidence interval=5.74, 6.77). Conclusions The Army’s HRA alcohol items seem to elicit reliable and valid responses. Because HRAs contain identifiers, alcohol use can be linked with subsequent health and occupational outcomes, making the HRA a useful epidemiological research tool. Associations between perceived peer opinions of drinking and subsequent problems deserve further exploration. PMID:12766628
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire.

PubMed

Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos

2006-03-04

Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test - retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81-0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test - retest reliability) was found 0.66 (95% CI: 0.62-0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36-0.89). This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting.
Validation of Cost-Effectiveness Criterion for Evaluating Noise Abatement Measures

DOT National Transportation Integrated Search

1999-04-01

This project will provide the Texas Department of Transportation (TxDOT)with information about the effects of the current cost-effectiveness criterion. The project has reviewed (1) the cost-effectiveness criteria used by other states, (2) the noise b...
Assessing educational outcomes in middle childhood: validation of the Teacher Academic Attainment Scale.

PubMed

Johnson, Samantha; Marlow, Neil; Wolke, Dieter

2012-06-01

Assessing educational outcomes in high-risk populations is crucial for defining long-term outcomes. As standardized tests are costly and time-consuming, we assessed the use of the Teacher Academic Attainment Scale (TAAS) as an outcome measure. Three hundred and forty three children in mainstream schools aged 10 to 11 years (144 males, 199 females; 190 extremely preterm and 153 term; mean age 10 y 9 mo, SD 5.5 mo, range 9 y 8 mo-12 y 3 mo) were assessed using the reading and mathematics scales of the criterion standard Wechsler Individual Achievement Test, 2nd (UK) edition (WIAT-II). Class teachers completed the TAAS, a seven-item questionnaire for assessing academic attainment. The TAAS was also completed at 6 years of age for 266 children. Cronbach's alpha 0.95 indicated excellent internal consistency, and the correlation between TAAS scores at 6 and 11 years indicated good test-retest reliability (r=0.77, p<0.001). Significantly higher TAAS scores for term vs preterm children demonstrated discriminative validity. TAAS scores at 6 and 11 years were significantly correlated with WIAT-II reading (r=0.69 and 0.75, p<0.001) and mathematics (r=0.75 and 0.82, p<0.001) scores, demonstrating good predictive and concurrent validity respectively. TAAS scores of <2.5 were good predictors of learning difficulties. The TAAS is a brief, psychometrically sound teacher-report of academic attainment that yields continuous and categorical outcomes. It provides a cost- and time-efficient outcome measure for large-scale studies. © The Authors. Developmental Medicine & Child Neurology © 2012 Mac Keith Press.
Validation study of the SCREENIVF: an instrument to screen women or men on risk for emotional maladjustment before the start of a fertility treatment.

PubMed

Ockhuijsen, Henrietta D L; van Smeden, Maarten; van den Hoogen, Agnes; Boivin, Jacky

2017-06-01

To examine construct and criterion validity of the Dutch SCREENIVF among women and men undergoing a fertility treatment. A prospective longitudinal study nested in a randomized controlled trial. University hospital. Couples, 468 women and 383 men, undergoing an IVF/intracytoplasmic sperm injection (ICSI) treatment in a fertility clinic, completed the SCREENIVF. Construct and criteria validity of the SCREENIVF. The comparative fit index and root mean square error of approximation for women and men show a good fit of the factor model. Across time, the sensitivity for Hospital Anxiety and Depression Scale subscale in women ranged from 61%-98%, specificity 53%-65%, predictive value of a positive test (PVP) 13%-56%, predictive value of a negative test (PVN) 70%-99%. The sensitivity scores for men ranged from 38%-100%, specificity 71%-75%, PVP 9%-27%, PVN 92%-100%. A prediction model revealed that for women 68.7% of the variance in the Hospital Anxiety and Depression Scale on time 1 and 42.5% at time 2 and 38.9% at time 3 was explained by the predictors, the sum score scales of the SCREENIVF. For men, 58.1% of the variance in the Hospital Anxiety and Depression Scale on time 1 and 46.5% at time 2 and 37.3% at time 3 was explained by the predictors, the sum score scales of the SCREENIVF. The SCREENIVF has good construct validity but the concurrent validity is better than the predictive validity. SCREENIVF will be most effectively used in fertility clinics at the start of treatment and should not be used as a predictive tool. Copyright © 2017 American Society for Reproductive Medicine. All rights reserved.
Validity of the inexpensive Stepping Meter in counting steps in free living conditions: a pilot study

PubMed Central

De Cocker, K; Cardon, G; De Bourdeaudhuij, I

2006-01-01

Objectives To evaluate if inexpensive Stepping Meters are valid in counting steps in adults in free living conditions. Methods For six days, 35 healthy volunteers wore a criterion Yamax Digiwalker and five Stepping Meters every day until all 973 pedometers had been tested. Steps were recorded daily, and the differences between counts from the Digiwalker and the Stepping Meter were expressed as a percentage of the valid value of the Digiwalker step counts. The criterion used to determine if a Stepping Meter was valid was a maximum deviation of 10% from the Digiwalker step counts. Results A total of 252 (25.9%) Stepping Meters met the criterion, whereas 74.1% made an overestimation or underestimation of more than 10%. In more than one third (36.6%) of the invalid Stepping Meters, the deviation was greater than 50%. Most (64.8%) of the invalid pedometers overestimated the actual steps taken. Conclusions Inexpensive Stepping Meters cannot be used in community interventions as they will give participants the wrong message. PMID:16790485
Reliability and validity of the Spanish Language Wechsler Adult Intelligence Scale (3rd Edition) in a sample of American, urban, Spanish-speaking Hispanics.

PubMed

Renteria, Laura; Li, Susan Tinsley; Pliskin, Neil H

2008-05-01

The utility of the Spanish WAIS-III was investigated by examining its reliability and validity among 100 Spanish-speaking participants. Results indicated that the internal consistency of the subtests was satisfactory, but inadequate for Letter Number Sequencing. Criterion validity was adequate. Convergent and discriminant validity results were generally similar to the North American normative sample. Paired sample t-tests suggested that the WAIS-III may underestimate ability when compared to the criterion measures that were utilized to assess validity. This study provides support for the use of the Spanish WAIS-III in urban Hispanic populations, but also suggests that caution be used when administering specific subtests, due to the nature of the Latin America alphabet and potential test bias.
Assessing the environmental characteristics of cycling routes to school: a study on the reliability and validity of a Google Street View-based audit.

PubMed

Vanwolleghem, Griet; Van Dyck, Delfien; Ducheyne, Fabian; De Bourdeaudhuij, Ilse; Cardon, Greet

2014-06-10

Google Street View provides a valuable and efficient alternative to observe the physical environment compared to on-site fieldwork. However, studies on the use, reliability and validity of Google Street View in a cycling-to-school context are lacking. We aimed to study the intra-, inter-rater reliability and criterion validity of EGA-Cycling (Environmental Google Street View Based Audit - Cycling to school), a newly developed audit using Google Street View to assess the physical environment along cycling routes to school. Parents (n = 52) of 11-to-12-year old Flemish children, who mostly cycled to school, completed a questionnaire and identified their child's cycling route to school on a street map. Fifty cycling routes of 11-to-12-year olds were identified and physical environmental characteristics along the identified routes were rated with EGA-Cycling (5 subscales; 37 items), based on Google Street View. To assess reliability, two researchers performed the audit. Criterion validity of the audit was examined by comparing the ratings based on Google Street View with ratings through on-site assessments. Intra-rater reliability was high (kappa range 0.47-1.00). Large variations in the inter-rater reliability (kappa range -0.03-1.00) and criterion validity scores (kappa range -0.06-1.00) were reported, with acceptable inter-rater reliability values for 43% of all items and acceptable criterion validity for 54% of all items. EGA-Cycling can be used to assess physical environmental characteristics along cycling routes to school. However, to assess the micro-environment specifically related to cycling, on-site assessments have to be added.
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs.

PubMed

Harvey, Naomi D; Craigon, Peter J; Blythe, Simon A; England, Gary C W; Asher, Lucy

2017-01-01

Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5-8, 8-12 and 5-12 months was high. All scales excepting Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations were found with training outcome, at at-least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs. This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs.
Accuracy of clinical observations of push-off during gait after stroke.

PubMed

McGinley, Jennifer L; Morris, Meg E; Greenwood, Ken M; Goldie, Patricia A; Olney, Sandra J

2006-06-01

To determine the accuracy (criterion-related validity) of real-time clinical observations of push-off in gait after stroke. Criterion-related validity study of gait observations. Rehabilitation hospital in Australia. Eleven participants with stroke and 8 treating physical therapists. Not applicable. Pearson product-moment correlation between physical therapists' observations of push-off during gait and criterion measures of peak ankle power generation from a 3-dimensional motion analysis system. A high correlation was obtained between the observational ratings and the measurements of peak ankle power generation (Pearson r =.98). The standard error of estimation of ankle power generation was .32W/kg. Physical therapists can make accurate real-time clinical observations of push-off during gait following stroke.

Bikeability and methodological issues using the active commuting route environment scale (ACRES) in a metropolitan setting.

PubMed

Wahlgren, Lina; Schantz, Peter

2011-01-17

Route environments can positively influence people's active commuting and thereby contribute to public health. The Active Commuting Route Environment Scale (ACRES) was developed to study active commuters' perceptions of their route environments. However, bicycle commuters represent a small portion of the population in many cities and thus are difficult to study using population-based material. Therefore, the aim of this study is to expand the state of knowledge concerning the criterion-related validity of the ACRES and the representativity using an advertisement-recruited sample. Furthermore, by comparing commuting route environment profiles of inner urban and suburban areas, we provide a novel basis for understanding the relationship between environment and bikeability. Bicycle commuters from Greater Stockholm, Sweden, advertisement- (n = 1379) and street-recruited (n = 93), responded to the ACRES. Traffic planning and environmental experts from the Municipality of Stockholm (n = 24) responded to a modified version of the ACRES. The criterion-related validity assessments were based on whether or not differences between the inner urban and the suburban route environments, as indicated by the experts and by four existing objective measurements, were reflected by differences in perceptions of these environments. Comparisons of ratings between advertisement- and street-recruited participants were used for the assessments of representativity. Finally, ratings of inner urban and suburban route environments were used to evaluate commuting route environment profiles. Differences in ratings of the inner urban and suburban route environments by the advertisement-recruited participants were in accord with the existing objective measurements and corresponded reasonably well with those of the experts. Overall, there was a reasonably good correspondence between the advertisement- and street-recruited participants' ratings. Distinct differences in commuting route environment profiles were noted between the inner urban and suburban areas. Suburban route environments were rated as safer and more stimulating for bicycle-commuting than the inner urban ones. In general, the findings applied to both men and women. The overall results show: considerable criterion-related validity of the ACRES; ratings of advertisement-recruited participants mirroring those of street-recruited participants; and a higher degree of bikeability in the suburban commuting route environments than in the inner urban ones.
Validation of the Brief Confusion Assessment Method for Screening Delirium in Elderly Medical Patients in a German Emergency Department.

PubMed

Baten, Verena; Busch, Hans-Jörg; Busche, Caroline; Schmid, Bonaventura; Heupel-Reuter, Miriam; Perlov, Evgeniy; Brich, Jochen; Klöppel, Stefan

2018-05-08

Delirium is frequent in elderly patients presenting in the emergency department (ED). Despite the severe prognosis, the majority of delirium cases remain undetected by emergency physicians (EPs). At the time of our study there was no valid delirium screening tool available for EDs in German-speaking regions. We aimed to evaluate the brief Confusion Assessment Method (bCAM) for a German ED during the daily work routine. We implemented the bCAM into practice in a German interdisciplinary high-volume ED and evaluated the bCAM's validity in a convenience sample of medical patients aged ≥ 70 years. The bCAM, which assesses four core features of delirium, was performed by EPs during their daily work routine and compared to a criterion standard based on the criteria for delirium as described in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition. Compared to the criterion standard, delirium was found to be present in 46 (16.0%) of the 288 nonsurgical patients enrolled. The bCAM showed 93.8% specificity (95% confidence interval [CI] = 90.0%-96.5%) and 65.2% sensitivity (95% CI = 49.8%-78.7%). Positive and negative likelihood ratios were 10.5 and 0.37, respectively, while the odds ratio was 28.4. Delirium was missed in 10 of 16 cases, since the bCAM did not indicate altered levels of consciousness and disorganized thinking. The level of agreement with the criterion standard increased for patients with low cognitive performance. This was the first study evaluating the bCAM for a German ED and when performed by EPs during routine work. The bCAM showed good specificity, but only moderate sensitivity. Nevertheless, application of the bCAM most likely improves the delirium detection rate in German EDs. However, it should only be applied by trained physicians to maximize diagnostic accuracy and hence improve the bCAM's sensitivity. Future studies should refine the bCAM. © 2018 by the Society for Academic Emergency Medicine.
Visual judgements of steadiness in one-legged stance: reliability and validity.

PubMed

Haupstein, T; Goldie, P

2000-01-01

There is a paucity of information about the validity and reliability of clinicians' visual judgements of steadiness in one-legged stance. Such judgements are used frequently in clinical practice to support decisions about treatment in the fields of neurology, sports medicine, paediatrics and orthopaedics. The aim of the present study was to address the validity and reliability of visual judgements of steadiness in one-legged stance in a group of physiotherapists. A videotape of 20 five-second performances was shown to 14 physiotherapists with median clinical experience of 6.75 years. Validity of visual judgement was established by correlating scores obtained from an 11-point rating scale with criterion scores obtained from a force platform. In addition, partial correlations were used to control for the potential influence of body weight on the relationship between the visual judgements and criterion scores. Inter-observer reliability was quantified between the physiotherapists; intra-observer reliability was quantified between two tests four weeks apart. Mean criterion-related validity was high, regardless of whether body weight was controlled for statistically (Pearson's r = 0.84, 0.83, respectively). The standard error of estimating the criterion score was 3.3 newtons. Inter-observer reliability was high (ICC (2,1) = 0.81 at Test 1 and 0.82 at Test 2). Intra-observer reliability was high (on average ICC (2,1) = 0.88; Pearson's r = 0.90). The standard error of measurement for the 11-point scale was one unit. The finding of higher accuracy of making visual judgements than previously reported may be due to several aspects of design: use of a criterion score derived from the variability of the force signal which is more discriminating than variability of centre of pressure; use of a discriminating visual rating scale; specificity and clear definition of the phenomenon to be rated.
Development of an aerobic capacity prediction model from one-mile run/walk performance in adolescents aged 13-16 years.

PubMed

Burns, Ryan D; Hannon, James C; Brusseau, Timothy A; Eisenman, Patricia A; Shultz, Barry B; Saint-Maurice, Pedro F; Welk, Gregory J; Mahar, Matthew T

2016-01-01

A popular algorithm to predict VO2Peak from the one-mile run/walk test (1MRW) includes body mass index (BMI), which manifests practical issues in school settings. The purpose of this study was to develop an aerobic capacity model from 1MRW in adolescents independent of BMI. Cardiorespiratory endurance data were collected on 90 adolescents aged 13-16 years. The 1MRW was administered on an outside track and a laboratory VO2Peak test was conducted using a maximal treadmill protocol. Multiple linear regression was employed to develop the prediction model. Results yielded the following algorithm: VO2Peak = 7.34 × (1MRW speed in m s(-1)) + 0.23 × (age × sex) + 17.75. The New Model displayed a multiple correlation and prediction error of R = 0.81, standard error of the estimate = 4.78 ml kg(-1) · min(-1), with measured VO2Peak and good criterion-referenced (CR) agreement into FITNESSGRAM's Healthy Fitness Zone (Kappa = 0.62; percentage agreement = 84.4%; Φ = 0.62). The New Model was validated using k-fold cross-validation and showed homoscedastic residuals across the range of predicted scores. The omission of BMI did not compromise accuracy of the model. In conclusion, the New Model displayed good predictive accuracy and good CR agreement with measured VO2Peak in adolescents aged 13-16 years.
Assessing local instrument reliability and validity: a field-based example from northern Uganda.

PubMed

Betancourt, Theresa S; Bass, Judith; Borisova, Ivelina; Neugebauer, Richard; Speelman, Liesbeth; Onyango, Grace; Bolton, Paul

2009-08-01

This paper presents an approach for evaluating the reliability and validity of mental health measures in non-Western field settings. We describe this approach using the example of our development of the Acholi psychosocial assessment instrument (APAI), which is designed to assess depression-like (two tam, par and kumu), anxiety-like (ma lwor) and conduct problems (kwo maraco) among war-affected adolescents in northern Uganda. To examine the criterion validity of this measure in the absence of a traditional gold standard, we derived local syndrome terms from qualitative data and used self reports of these syndromes by indigenous people as a reference point for determining caseness. Reliability was examined using standard test-retest and inter-rater methods. Each of the subscale scores for the depression-like syndromes exhibited strong internal reliability ranging from alpha = 0.84-0.87. Internal reliability was good for anxiety (0.70), conduct problems (0.83), and the pro-social attitudes and behaviors (0.70) subscales. Combined inter-rater reliability and test-retest reliability were good for most subscales except for the conduct problem scale and prosocial scales. The pattern of significant mean differences in the corresponding APAI problem scale score between self-reported cases vs. noncases on local syndrome terms was confirmed in the data for all of the three depression-like syndromes, but not for the anxiety-like syndrome ma lwor or the conduct problem kwo maraco.
The Pelvic Organ Prolapse/Urinary Incontinence Sexual Questionnaire (PISQ-12): validation of the Dutch version.

PubMed

't Hoen, Lisette A; Utomo, Elaine; Steensma, Anneke B; Blok, Bertil F M; Korfage, Ida J

2015-09-01

To establish the reliability and validity of the Dutch version of the Pelvic Organ Prolapse/Urinary Incontinence Sexual Questionnaire (PISQ-12) in women with pelvic floor dysfunction. The PISQ-12 was translated into Dutch following a standardized translation process. A group of 124 women involved in a heterosexual relationship who had had symptoms of urinary incontinence, fecal incontinence and/or pelvic organ prolapse for at least 3 months were eligible for inclusion. A reference group was used for assessment of discriminative ability. Data were analyzed for internal consistency, reproducibility, construct validity, responsiveness, and interpretability. An alteration was made to item 12 and was corrected for during the analysis. The patient group comprised 70 of the 124 eligible women, and the reference group comprised 208 women from a panel representative of the Dutch female population. The Dutch PISQ-12 showed an adequate internal consistency with a Cronbach's alpha of 0.57 - 0.69, increasing with correction for item 12 to 0.69 - 0.75, for the reference and patient group, respectively. Scores in the patient group were lower (32.6 ± 6.9) than in the reference group (36.3 ± 4.8; p = 0.0001), indicating a lower sexual function in the patient group and good discriminative ability. Reproducibility was excellent with an intraclass correlation coefficient for agreement of 0.93 (0.88 - 0.96). A positive correlation was found with the Short Form-12 Health Survey (SF-12) measure representing good criterion validity. Due to the small number of patients who had received treatment at the 6-month follow-up, no significant responsiveness could be established. This study showed that the Dutch version of the PISQ-12 has good validity and reliability. The PISQ-12 will enable Dutch physicians to evaluate sexual dysfunction in women with pelvic floor disorders.
Study of image reconstruction for terahertz indirect holography with quasi-optics receiver.

PubMed

Gao, Xiang; Li, Chao; Fang, Guangyou

2013-06-01

In this paper, an indirect holographic image reconstruction algorithm was studied for terahertz imaging with a quasi-optics receiver. Based on the combination of the reciprocity principle and modified quasi-optics theory, analytical expressions of the received spatial power distribution and its spectrum are obtained for the interference pattern of target wave and reference wave. These results clearly give the quantitative relationship between imaging quality and the parameters of a Gaussian beam, which provides a good criterion for terahertz quasi-optics transceivers design in terahertz off-axis holographic imagers. To validate the effectiveness of the proposed analysis method, some imaging results with a 0.3 THz prototype system are shown based on electromagnetic simulation.
Validation of the SCOFF questionnaire for screening of eating disorders among Mexican university students.

PubMed

Sanchez-Armass, Omar; Raffaelli, Marcela; Andrade, Flavia Cristina Drumond; Wiley, Angela R; Noyola, Aida Nacielli Morales; Arguelles, Alejandra Cepeda; Aradillas-Garcia, Celia

2017-03-01

To evaluate the criterion validity and diagnostic utility of the SCOFF, a brief eating disorder (ED) screening instrument, in a Mexican sample. The study was conducted in two phases in 2012. Phase I involved the administration of self-report measures [the SCOFF and the Eating Disorder Inventory-2, (EDI-2)] to 1057 students aged 17-56 years (M age = 21.0, SD = 3.4; 67 % female) from three colleges at the Universidad Autónoma de San Luis Potosí, Mexico. In Phase II, a random subsample of these students (n = 104) participated in the eating disorder examination, a structured interview that yields ED diagnoses. Analyses were conducted to evaluate the SCOFF's criterion validity by examining (a) correlations between scores on the SCOFF and the EDI-2 and (b) the SCOFF's ability to differentiate diagnosed ED cases and non-cases. EDI-2 subscales showed high correlations with the SCOFF scores proving initial evidence of criterion validity. A score of two points on the SCOFF optimized the sensitivity (78 %) and specificity (84 %). With this cutoff, the SCOFF correctly classified over half the cases (PPV = 58 %) and screened out the majority of non-cases (NPV = 93 %) providing further evidence of criterion validity. Analyses were repeated separately for men and women, yielding gender-specific information on the SCOFF's performance. Taken as a whole, results indicated that the SCOFF can be a useful tool for identifying Mexican university students who are at risk of eating disorders.
Criterion validity of the International Physical Activity Questionnaire Short Form (IPAQ-SF) for use in patients with rheumatoid arthritis: comparison with the SenseWear Armband.

PubMed

Tierney, M; Fraser, A; Kennedy, N

2015-06-01

The International Physical Activity Questionnaire Short Form (IPAQ-SF) is a self-report questionnaire commonly used in patients with rheumatoid arthritis (RA) to measure physical activity. However, despite its frequent use in patients with RA, its validity has not been ascertained in this population. The aim of this study was to examine the criterion validity of energy expenditure from physical activity recorded with the IPAQ-SF in patients with RA compared with the objective criterion measure, the SenseWear Armband (SWA) which has been validated previously in this population. Cross-sectional criterion validation study. Regional hospital outpatient setting. Twenty-two patients with RA attending outpatient rheumatology clinics. Subjects wore an SWA for 7 full consecutive days and completed the IPAQ-SF. Energy expenditure from physical activity recorded by the SWA and the IPAQ-SF. Energy expenditure from physical activity recorded by the IPAQ-SF and the SWA showed a small, non-significant correlation (r=0.407, P=0.60). The IPAQ-SF underestimated energy expenditure from physical activity by 41% compared with the SWA. This was corroborated using Bland and Altman plots, as the IPAQ-SF was found to overestimate energy expenditure from physical activity in nine of the 22 individuals, and underestimate energy expenditure from physical activity in the remaining 13 individuals. The IPAQ-SF has limited use as an accurate and absolute measure for estimating energy expenditure from physical activity in patients with RA. Copyright © 2014 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Psychometric properties of the Chinese version of the Menopause-Specific Quality-of-Life questionnaire.

PubMed

Nie, Guangning; Yang, Hongyan; Liu, Jian; Zhao, ChunMei; Wang, Xiaoyun

2017-05-01

The Menopause-Specific Quality-of-Life (MENQOL) questionnaire was developed as a specific tool to measure the health-related quality-of-life of postmenopausal women. Thus far, the Chinese version questionnaire has not been subjected to psychometric assessment with a large sample. This study aims to evaluate the validity and reliability of the Chinese version of the MENQOL specific to postmenopausal women in China. A total of 1,137 menopausal symptomatic and 491 menopausal asymptomatic women from eight cities in China were recruited using a convenience sampling method. Psychometric properties were evaluated by descriptive statistics, validity, and reliability. Reliability was assessed for each subscale of the MENQOL through internal consistency reliability with Cronbach's α and intersubscale correlations. Item-domain correlations, principal components analysis (PCA), and confirmatory factor analysis were performed to determine construct validity. t tests were used to compare the differences between the menopausal symptomatic and asymptomatic women and to evaluate the discriminate validity. Pearson correlation coefficients were calculated between MENQOL scores and the Kupperman index to assess criterion-related validity. The most common symptoms in Chinese menopausal symptomatic women were "experiencing poor memory" (94.4%), "feeling tired or worn out" (93.8%), "aching in muscle and joints" (89.4%), "low backache" (86.9%), "decrease in physical strength" (86.6%), "aches in back of neck or head" (86.2%), "difficulty sleeping" (83.6%), "accomplishing less than I used to" (83.4%), "feeling a lack of energy" (83.3%), "change in your sexual desire" (81%), and "hot flash" (80.7%) among others. The symptoms of "increased facial hair" were rarely seen (9.9%). The vasomotor domain, as well as psychosocial, physical, and sexual domains showed high reliability (Cronbach's α 0.84, 0.87, 0.89, and 0.86, respectively). Item-domain correlation analysis showed that all items correlated more strongly with their own domains than with other domains. In the PCA, after deleting the "increased facial hair" item, items in the vasomotor, sexual, and psychosocial subscales loaded on their respective domains by and large, and items in the physical subscale divided into two factors. The PCA revealed a latent structure of the Chinese version of MENQOL nearly identical to the original MENQOL domains. The confirmatory factor analysis demonstrated that the questionnaire fits well with a four-domain model. The MENQOL can discriminate between menopausal symptomatic women with asymptomatic women as it showed good discriminate validity. Criterion-related validity was confirmed by a significant correlation between MENQOL scores and the Kupperman index. This study showed that Chinese version of MENQOL has good psychometric properties and would be suitable to measure the health-related quality-of-life of Chinese menopausal women except for item 21 (increased facial hair).
A Controlled Evaluation of the Distress Criterion for Binge Eating Disorder

PubMed Central

Grilo, Carlos M.; White, Marney A.

2012-01-01

Objective Research has examined various aspects of the validity of the research criteria for binge eating disorder (BED) but has yet to evaluate the utility of criterion C “marked distress about binge eating.” This study examined the significance of the marked distress criterion for BED using two complementary comparisons groups. Method A total of 1075 community volunteers completed a battery of self-report instruments as part of an internet study. Analyses compared body mass index (BMI), eating-disorder psychopathology, and depressive levels in four groups: 97 participants with BED except for the distress criterion (BED-ND), 221 participants with BED including the distress criterion (BED), 79 participants with bulimia nervosa (BN), and 489 obese participants without binge-eating or purging (NBPO). Parallel analyses compared these study groups using the broadened frequency criterion (i.e., once-weekly for binge/purge behaviors) proposed for DSM-5 and the DSM-IV twice-weekly frequency criterion. Results The BED group had significantly greater eating-disorder psychopathology and depressive levels than the BED-ND group. The BED group, but not the BED-ND group, had significantly greater eating-disorder psychopathology than the NBPO comparison group. The BN group had significantly greater eating-disorder psychopathology and depressive levels than all three other groups. The group differences existed even after controlling for depression levels, BMI, and demographic variables, although some differences between the BN and BED groups were attenuated when controlling for depression levels. Conclusions These findings provide support for the validity of the “marked distress” criterion for the diagnosis of BED. PMID:21707133
Validation of the English version of the UNESP-Botucatu multidimensional composite pain scale for assessing postoperative pain in cats

PubMed Central

2013-01-01

Background A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Results Of the three domains identified by factor analysis, the internal consistency was excellent for ‘pain expression’ and ‘psychomotor change’ (0.86 and 0.87) but not for ‘physiological variables’ (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and ‘gold standard’ evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0–30 points) with 96.5% sensitivity and 99.5% specificity. Conclusions The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy. PMID:23867090
Validation of the English version of the UNESP-Botucatu multidimensional composite pain scale for assessing postoperative pain in cats.

PubMed

Brondani, Juliana T; Mama, Khursheed R; Luna, Stelio P L; Wright, Bonnie D; Niyom, Sirirat; Ambrosio, Jennifer; Vogel, Pamela R; Padovani, Carlos R

2013-07-17

A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Of the three domains identified by factor analysis, the internal consistency was excellent for 'pain expression' and 'psychomotor change' (0.86 and 0.87) but not for 'physiological variables' (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and 'gold standard' evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0-30 points) with 96.5% sensitivity and 99.5% specificity. The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy.
On optimal soft-decision demodulation

NASA Technical Reports Server (NTRS)

Lee, L. N.

1975-01-01

Wozencraft and Kennedy have suggested that the appropriate demodulator criterion of goodness is the cut-off rate of the discrete memoryless channel created by the modulation system; the criterion of goodness adopted in this note is the symmetric cut-off rate which differs from the former criterion only in that the signals are assumed equally likely. Massey's necessary condition for optimal demodulation of binary signals is generalized to M-ary signals. It is shown that the optimal demodulator decision regions in likelihood space are bounded by hyperplanes. An iterative method is formulated for finding these optimal decision regions from an initial good quess. For additive white Gaussian noise, the corresponding optimal decision regions in signal space are bounded by hypersurfaces with hyperplane asymptotes; these asymptotes themselves bound the decision regions of a demodulator which, in several examples, is shown to be virtually optimal. In many cases, the necessary condition for demodulator optimality is also sufficient, but a counter example to its general sufficiency is given.
A Note on Economic Content and Test Validity.

ERIC Educational Resources Information Center

Soper, John C.; Brenneke, Judith Staley

1987-01-01

Offers practical tips on how teachers can determine whether classroom tests are actually measuring what they are designed to measure. Discusses criterion-related validity, construct validity, and content validity. Demonstrates how to determine the degree of content validity a particular test may have for a particular course or unit. (Author/DH)
The Dula dangerous driving index in China: an investigation of reliability and validity.

PubMed

Qu, Weina; Ge, Yan; Jiang, Caihong; Du, Feng; Zhang, Kan

2014-03-01

The aim of this study was to translate the Dula Dangerous Driving Index (DDDI) into Chinese and to verify its reliability and validity. A total of 246 drivers completed the Chinese version of the DDDI and the Driver Behavior Questionnaire (DBQ). Specific sociodemographic variables and traffic violations were also measured. A confirmatory factor analysis confirmed the internal structure of the DDDI, and the four-factor model was supported in China. Measures of convergent and criterion validity demonstrated that the Chinese DDDI was valid. Its convergent validity was supported by its positive relationship with the DBQ, and its criterion validity was tested using its relationship with self-reported accident involvement and traffic violations. Finally, score comparisons between different demographic groups revealed significant differences, thereby linking age and driving years to dangerous driving. Copyright © 2013 Elsevier Ltd. All rights reserved.
Standards for Evaluating Criterion-Referenced Tests.

ERIC Educational Resources Information Center

Walker, Clinton B.

Standards for evaluating criterion-referenced tests are presented. Twenty-one standards, grouped in three categories, are discussed. Category one is defined as measurement properties and is comprised of conceptual validity, including description of the domain, test item agreement with objectives, and item representativeness of the objectives; and…
Empirical extensions of the lasso penalty to reduce the false discovery rate in high-dimensional Cox regression models.

PubMed

Ternès, Nils; Rotolo, Federico; Michiels, Stefan

2016-07-10

Correct selection of prognostic biomarkers among multiple candidates is becoming increasingly challenging as the dimensionality of biological data becomes higher. Therefore, minimizing the false discovery rate (FDR) is of primary importance, while a low false negative rate (FNR) is a complementary measure. The lasso is a popular selection method in Cox regression, but its results depend heavily on the penalty parameter λ. Usually, λ is chosen using maximum cross-validated log-likelihood (max-cvl). However, this method has often a very high FDR. We review methods for a more conservative choice of λ. We propose an empirical extension of the cvl by adding a penalization term, which trades off between the goodness-of-fit and the parsimony of the model, leading to the selection of fewer biomarkers and, as we show, to the reduction of the FDR without large increase in FNR. We conducted a simulation study considering null and moderately sparse alternative scenarios and compared our approach with the standard lasso and 10 other competitors: Akaike information criterion (AIC), corrected AIC, Bayesian information criterion (BIC), extended BIC, Hannan and Quinn information criterion (HQIC), risk information criterion (RIC), one-standard-error rule, adaptive lasso, stability selection, and percentile lasso. Our extension achieved the best compromise across all the scenarios between a reduction of the FDR and a limited raise of the FNR, followed by the AIC, the RIC, and the adaptive lasso, which performed well in some settings. We illustrate the methods using gene expression data of 523 breast cancer patients. In conclusion, we propose to apply our extension to the lasso whenever a stringent FDR with a limited FNR is targeted. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Factor structure and diagnostic efficiency of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, criteria for avoidant personality disorder in Hispanic men and women with substance use disorders.

PubMed

Becker, Daniel F; Añez, Luis Miguel; Paris, Manuel; Bedregal, Luis; Grilo, Carlos M

2009-01-01

This study examined the internal consistency, factor structure, and diagnostic efficiency of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV), criteria for avoidant personality disorder (AVPD) and the extent to which these metrics may be affected by sex. Subjects were 130 monolingual Hispanic adults (90 men, 40 women) who had been admitted to a specialty clinic that provides psychiatric and substance abuse services to Spanish-speaking patients. All were reliably assessed with the Spanish-Language Version of the Diagnostic Interview for DSM-IV Personality Disorders. The AVPD diagnosis was determined by the best-estimate method. After evaluating internal consistency of the AVPD criterion set, an exploratory factor analysis was performed using principal components extraction. Afterward, diagnostic efficiency indices were calculated for all AVPD criteria. Subsequent analyses examined men and women separately. For the overall group, internal consistency of AVPD criteria was good. Exploratory factor analysis revealed a 1-factor solution (accounting for 70% of the variance), supporting the unidimensionality of the AVPD criterion set. The best inclusion criterion was "reluctance to take risks," whereas "interpersonally inhibited" was the best exclusion criterion and the best predictor overall. When men and women were examined separately, similar results were obtained for both internal consistency and factor structure, with slight variations noted between sexes in the patterning of diagnostic efficiency indices. These psychometric findings, which were similar for men and women, support the construct validity of the DSM-IV criteria for AVPD and may also have implications for the treatment of this particular clinical population.
The HIV Medication Taking Self-Efficacy Scale: Psychometric Evaluation

PubMed Central

Erlen, Judith A.; Cha, EunSeok; Kim, Kevin H.; Caruthers, Donna; Sereika, Susan M.

2010-01-01

Aim This paper is a report of an examination of the psychometric properties of the HIV Medication Taking Self-efficacy Scale. Background Self-efficacy is a critically important component of strategies to improve HIV medication-taking; however, valid and reliable tools for assessing HIV medication-taking self-efficacy are limited. Method We used a cross-sectional, correlational design. Between 2003 and 2007, 326 participants were recruited from sites in Pennsylvania and Ohio in the United States of America. Six self-report questionnaires administered at baseline and 12 weeks later during “Improving Adherence to Antiretroviral Therapy” were used to examine the variables of interest. Means and variances, reliability, criterion, and construct validity of the HIV Medication Taking Self-efficacy Scale were assessed. Findings Participants reported high self-confidence in their ability to carry out specific medication-related tasks (mean=8.31) and in the medication’s ability to effect good outcomes (mean=8.56). The HIV Medication Taking Self-efficacy Scale and subscales showed excellent reliability (α = .93 ~ .94). Criterion validity was well-established by examining the relationships between the HIV Medication Taking Self-efficacy Scale and selected physiological and psychological factors, and self-reported medication adherence (r = −.20 ~ .58). A two-factor model with a correlation between self-efficacy belief and outcome expectancy fitted the data well (model χ2 = 3871.95, df = 325, p<001; CFA =.96; RMSEA =.046). Conclusion The HIV Medication Taking Self-efficacy Scale is a psychometrically sound measure of medication-taking self-efficacy for use by researchers and clinicians with people with HIV. The findings offer insight into the development of interventions to promote self-efficacy and medication adherence in persons with HIV. PMID:20722799

Validation of a skinfold based index for tracking proportional changes in lean mass

PubMed Central

Slater, G J; Duthie, G M; Pyne, D B; Hopkins, W G

2006-01-01

Background The lean mass index (LMI) is a new empirical measure that tracks within‐subject proportional changes in body mass adjusted for changes in skinfold thickness. Objective To compare the ability of the LMI and other skinfold derived measures of lean mass to monitor changes in lean mass. Methods 20 elite rugby union players undertook full anthropometric profiles on two occasions 10 weeks apart to calculate the LMI and five skinfold based measures of lean mass. Hydrodensitometry, deuterium dilution, and dual energy x ray absorptiometry provided a criterion choice, four compartment (4C) measure of lean mass for validation purposes. Regression based measures of validity, derived for within‐subject proportional changes through log transformation, included correlation coefficients and standard errors of the estimate. Results The correlation between change scores for the LMI and 4C lean mass was moderate (0.37, 90% confidence interval −0.01 to 0.66) and similar to the correlations for the other practical measures of lean mass (range 0.26 to 0.42). Standard errors of the estimate for the practical measures were in the range of 2.8–2.9%. The LMI correctly identified the direction of change in 4C lean mass for 14 of the 20 athletes, compared with 11 to 13 for the other practical measures of lean mass. Conclusions The LMI is probably as good as other skinfold based measures for tracking lean mass and is theoretically more appropriate. Given the impracticality of the 4C criterion measure for routine field use, the LMI may offer a convenient alternative for monitoring physique changes, provided its utility is established under various conditions. PMID:16505075
Reliability, Validity, and Sensitivity of a Novel Smartphone-Based Eccentric Hamstring Strength Test in Professional Football Players.

PubMed

Lee, Justin W Y; Cai, Ming-Jing; Yung, Patrick S H; Chan, Kai-Ming

2018-05-01

To evaluate the test-retest reliability, sensitivity, and concurrent validity of a smartphone-based method for assessing eccentric hamstring strength among male professional football players. A total of 25 healthy male professional football players performed the Chinese University of Hong Kong (CUHK) Nordic break-point test, hamstring fatigue protocol, and isokinetic hamstring strength test. The CUHK Nordic break-point test is based on a Nordic hamstring exercise. The Nordic break-point angle was defined as the maximum point where the participant could no longer support the weight of his body against gravity. The criterion for the sensitivity test was the presprinting and postsprinting difference of the Nordic break-point angle with a hamstring fatigue protocol. The hamstring fatigue protocol consists of 12 repetitions of the 30-m sprint with 30-s recoveries between sprints. Hamstring peak torque of the isokinetic hamstring strength test was used as the criterion for validity. A high test-retest reliability (intraclass correlation coefficient = .94; 95% confidence interval, .82-.98) was found in the Nordic break-point angle measurements. The Nordic break-point angle significantly correlated with isokinetic hamstring peak torques at eccentric action of 30°/s (r = .88, r 2 = .77, P < .001). The minimal detectable difference was 8.03°. The sensitivity of the measure was good enough that a significance difference (effect size = 0.70, P < .001) was found between presprinting and postsprinting values. The CUHK Nordic break-point test is a simple, portable, quick smartphone-based method to provide reliable and accurate eccentric hamstring strength measures among male professional football players.
Utility of ultrasound for body fat assessment: validity and reliability compared to a multicompartment criterion.

PubMed

Smith-Ryan, Abbie E; Blue, Malia N M; Trexler, Eric T; Hirsch, Katie R

2018-03-01

Measurement of body composition to assess health risk and prevention is expanding. Accurate portable techniques are needed to facilitate use in clinical settings. This study evaluated the accuracy and repeatability of a portable ultrasound (US) in comparison with a four-compartment criterion for per cent body fat (%Fat) in overweight/obese adults. Fifty-one participants (mean ± SD; age: 37·2 ± 11·3 years; BMI: 31·6 ± 5·2 kg m -2 ) were measured for %Fat using US (GE Logiq-e) and skinfolds. A subset of 36 participants completed a second day of the same measurements, to determine reliability. US and skinfold %Fat were calculated using the seven-site Jackson-Pollock equation. The Wang 4C model was used as the criterion method for %Fat. Compared to a gold standard criterion, US %Fat (36·4 ± 11·8%; P = 0·001; standard error of estimate [SEE] = 3·5%) was significantly higher than the criterion (33·0 ± 8·0%), but not different than skinfolds (35·3 ± 5·9%; P = 0·836; SEE = 4·5%). US resulted in good reliability, with no significant differences from Day 1 (39·95 ± 15·37%) to Day 2 (40·01 ± 15·42%). Relative consistency was 0·96, and standard error of measure was 0·94%. Although US overpredicted %Fat compared to the criterion, a moderate SEE for US is suggestive of a practical assessment tool in overweight individuals. %Fat differences reported from these field-based techniques are less than reported by other single-measurement laboratory methods and therefore may have utility in a clinical setting. This technique may also accurately track changes. © 2016 Scandinavian Society of Clinical Physiology and Nuclear Medicine. Published by John Wiley & Sons Ltd.
[The psychometric properties of the Turkish version of Myocardial Infarction Dimensional Assessment Scale (MIDAS)].

PubMed

Yılmaz, Emel; Eser, Erhan; Şekuri, Cevad; Kültürsay, Hakan

2011-08-01

The purpose of this study was to describe the psychometric properties of the Myocardial Infarction Dimensional Assessment Scale (MIDAS). This is a methodological cultural adaptation study. The MIDAS consists of 35-items covering seven domains: physical activity, insecurity, emotional reaction, dependency, diet, concerns over medication, and side effects which are rated on a five-point Likert scale from 1: never to 5:always. The highest score of MIDAS is 100.Quality of life (QOL) decreases as the score of scale increases. Overall 185 myocardial infarction (MI) patients were enrolled in this study. Cronbach alpha was used for the reliability analysis. The criterion validity, structural validity, and sensitivity analysis approach was used for validity analysis. New York Heart Association (NYHA) and the Canadian Cardiovascular Society Functional Classifications (CCSFC) for testing the criterion validity; SF-36 for construct validity testing of the Turkish version of the MIDAS were used. The range of Cronbach alpha values is 0.79-0.90 for seven domains of the scale. No problematic items were observed for the entire scale. Medication related domains of the MIDAS showed considerable floor effects (35.7%-22.7%). Confirmatory Factor analysis indicators [Comparative Fit Index (CFI) =0.95 and Root Mean Square Error of Approximation (RMSEA) =0.075] supported the construct validity of MIDAS. Convergent validity of the MIDAS was confirmed with correlation of SF-36 scale where appropriate. Criterion validity results was also satisfactory by comparing different stages of the NYHA and the CCSFC (p<0.05). Overall results revealed that Turkish version of the MIDAS is a reliable and valid instrument.
Cross-cultural adaptation and validation of the Ankle Osteoarthritis Scale for use in French-speaking populations.

PubMed

Angers, Magalie; Svotelis, Amy; Balg, Frederic; Allard, Jean-Pascal

2016-04-01

The Ankle Osteoarthritis Scale (AOS) is a self-administered score specific for ankle osteoarthritis (OA) with excellent reliability and strong construct and criterion validity. Many recent randomized multicentre trials have used the AOS, and the involvement of the French-speaking population is limited by the absence of a French version. Our goal was to develop a French version and validate the psychometric properties to assure equivalence to the original English version. Translation was performed according to American Association of Orthopaedic Surgeons (AAOS) 2000 guidelines for cross-cultural adaptation. Similar to the validation process of the English AOS, we evaluated the psychometric properties of the French version (AOS-Fr): criterion validity (AOS-Fr v. Western Ontario and McMaster Universities Arthritis Index [WOMAC] and SF-36 scores), construct validity (AOS-Fr correlation to single heel-lift test), and reliability (AOS-Fr test-retest). Sixty healthy individuals tested a prefinal version of the AOS-Fr for comprehension, leading to modifications and a final version that was approved by C. Saltzman, author of the AOS. We then recruited patients with ankle OA for evaluation of the AOS-Fr psychometric properties. Twenty-eight patients with ankle OA participated in the evaluation. The AOS-Fr showed strong criterion validity (AOS:WOMAC r = 0.709 and AOS:SF-36 r = -0.654) and construct validity (r = 0.664) and proved to be reliable (test-retest intraclass correlation coefficient = 0.922). The AOS-Fr is a reliable and valid score equivalent to the English version in terms of psychometric properties, thus is available for use in multicentre trials.
An Elasto-Plastic Damage Model for Rocks Based on a New Nonlinear Strength Criterion

NASA Astrophysics Data System (ADS)

Huang, Jingqi; Zhao, Mi; Du, Xiuli; Dai, Feng; Ma, Chao; Liu, Jingbo

2018-05-01

The strength and deformation characteristics of rocks are the most important mechanical properties for rock engineering constructions. A new nonlinear strength criterion is developed for rocks by combining the Hoek-Brown (HB) criterion and the nonlinear unified strength criterion (NUSC). The proposed criterion takes account of the intermediate principal stress effect against HB criterion, as well as being nonlinear in the meridian plane against NUSC. Only three parameters are required to be determined by experiments, including the two HB parameters σ c and m i . The failure surface of the proposed criterion is continuous, smooth and convex. The proposed criterion fits the true triaxial test data well and performs better than the other three existing criteria. Then, by introducing the Geological Strength Index, the proposed criterion is extended to rock masses and predicts the test data well. Finally, based on the proposed criterion, a triaxial elasto-plastic damage model for intact rock is developed. The plastic part is based on the effective stress, whose yield function is developed by the proposed criterion. For the damage part, the evolution function is assumed to have an exponential form. The performance of the constitutive model shows good agreement with the results of experimental tests.
[Construction of the Time Management Scale and examination of the influence of time management on psychological stress response].

PubMed

Imura, Tomoya; Takamura, Masahiro; Okazaki, Yoshihiro; Tokunaga, Satoko

2016-10-01

We developed a scale to measure time management and assessed its reliability and validity. We then used this scale to examine the impact of time management on psychological stress response. In Study 1-1, we developed the scale and assessed its internal consistency and criterion-related validity. Findings from a factor analysis revealed three elements of time management, “time estimation,” “time utilization,” and “taking each moment as it comes.” In Study 1-2, we assessed the scale’s test-retest reliability. In Study 1-3, we assessed the validity of the constructed scale. The results indicate that the time management scale has good reliability and validity. In Study 2, we performed a covariance structural analysis to verify our model that hypothesized that time management influences perceived control of time and psychological stress response, and perceived control of time influences psychological stress response. The results showed that time estimation increases the perceived control of time, which in turn decreases stress response. However, we also found that taking each moment as it comes reduces perceived control of time, which in turn increases stress response.
Psychometric Properties of the Work Well Index: A Short Questionnaire for Work-Related Stress.

PubMed

Mauss, Daniel; Li, Jian; Angerer, Peter

2017-02-01

The aim of this study was to test the psychometric properties of a short questionnaire for work-related stress entitled Work Well index (WWi) and its interaction with different variables of self-reported health. An online survey was conducted in a sample of 1,218 employees (51% female) in four countries of an international insurance company. Internal consistency reliability, factorial validity, convergent validity and criterion validity of the 10-item WWi were analyzed. Good internal consistency reliability of the WWi was obtained (Cronbach's α coefficient = 0.85). Confirmatory factor analysis showed a satisfactory model fit of the data (AGFI = 0.92). The WWi was highly correlated to conceptually close constructs such as demand-control, effort-reward imbalance and workplace social capital (p < 0.001). Moreover, the 10-item WWi was significantly (p < 0.001) associated with elevated risk of self-rated health, absenteeism, presenteeism and depression (odds ratio 1.63, 1.36, 2.08, 2.95, respectively). We conclude that this short questionnaire is a reliable and valid instrument measuring psychosocial stress at work. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Psychometric properties of the WHOQOL-BREF in an Iranian adult sample.

PubMed

Yousefy, A R; Usefy, A R; Ghassemi, Gh R; Sarrafzadegan, N; Mallik, S; Baghaei, A M; Rabiei, K

2010-04-01

To evaluate discriminant validity, reliability, internal consistency, and dimensional structure of the World Health Organization Quality of Life-BREF (WHOQOL-BREF) in a heterogeneous Iranian population. A clustered randomized sample of 2,956 healthy with 2,936 unhealthy rural and urban inhabitants aged 30 and above from two dissimilar Iranian provinces during 2006 completed the Persian version of the WHOQOL-BREF. We performed descriptive and analytical analysis including t-student, correlation matrix, Cronbach's Alpha, and factor analysis with principal components method and Varimax rotation with SPSS.15. The mean age of the participants was 42.2 +/- 12.1 years and the mean years of education was 9.3 +/- 3.8. The Iranian version of the WHOQOL-BREF domain scores demonstrated good internal consistency, criterion validity, and discriminant validity. The physical health domain contributed most in overall quality of life, while the environment domain made the least contribution. Factor analysis provided evidence for construct validity for four-factor model of the instrument. The scores of all domains discriminated between healthy persons and the patients. The WHOQOL-BREF has adequate psychometric properties and is, therefore, an adequate measure for assessing quality of life at the domain level in an adult Iranian population.
Lifesource XL-18 pedometer for measuring steps under controlled and free-living conditions.

PubMed

Liu, Sam; Brooks, Dina; Thomas, Scott; Eysenbach, Gunther; Nolan, Robert Peter

2015-01-01

The primary aim was to examine the criterion and construct validity and test-retest reliability of the Lifesource XL-18 pedometer (A&D Medical, Toronto, ON, Canada) for measuring steps under controlled and free-living activities. The influence of body mass index, waist size and walking speed on the criterion validity of XL-18 was also explored. Forty adults (35-74 years) performed a 6-min walk test in the controlled condition, and the criterion validity of XL-18 was assessed by comparing it to steps counted manually. Thirty-five adults participated in the free-living condition and the construct validity of XL-18 was assessed by comparing it to Yamax SW-200 (YAMAX Health & Sports, Inc., San Antonio, TX, USA). During the controlled condition, XL-18 did not significantly differ from criterion (P > 0.05) and no systematic error was found using Bland-Altman analysis. The accuracy of XL-18 decreased with slower walking speed (P = 0.001). During the free-living condition, Bland-Altman analysis revealed that XL-18 overestimated daily steps by 327 ± 118 than Yamax (P = 0.004). However, the absolute percent error (APE) (6.5 ± 0.58%) was still within an acceptable range. XL-18 did not differ statistically between pant pockets. XL-18 is suitable for measuring steps in controlled and free-living conditions. However, caution may be required when interpreting the steps recorded under slower speeds and free-living conditions.
Development of a gambling addictive behavior scale for adolescents in Korea.

PubMed

Park, Hyun Sook; Jung, Sun Young

2012-12-01

This study was conducted to develop a gambling addictive behavior scale for adolescents. The process involved construction of a conceptual framework, initial item search, verification of content validity, selection of secondary items, and extraction of final items. The participants were 299 adolescents from two middle schools and four high schools. Item analysis, factor analysis, criterion validity, internal consistency, and ROC curve were used to analyze the data. For the final scale, 25 items were selected, and categorized into 4 factors which accounted for 54.9% of the total variance. The factors were labeled as loss of control, life dysfunction from gambling addiction, gambling experience, and social dysfunction from problem gambling. The scores for the scale were significantly correlated with addictive personality, irrational gambling belief, and adolescent's gambling addictive behavior. Cronbach's alpha coefficient for the 25 items was .94. Scale scores identified adolescents as being in a problem gambling group, a non-problem gambling group, and a non-gambling group by the ROC curve. The above findings indicate that the gambling addictive behavior scale has good validity and reliability and can be used with adolescents in Korea.
Psychometric evaluation of the Dutch version of the Subjective Opiate Withdrawal Scale (SOWS).

PubMed

Dijkstra, Boukje A G; Krabbe, Paul F M; Riezebos, Truus G M; van der Staak, Cees P F; De Jong, Cor A J

2007-01-01

To evaluate the psychometric properties of the Dutch version of the 16-item Subjective Opiate Withdrawal Scale (SOWS). The SOWS measures withdrawal symptoms at the time of assessment. The Dutch SOWS was repeatedly administered to a sample of 272 opioid-dependent inpatients of four addiction treatment centers during rapid detoxification with or without general anesthesia. Examination of the psychometric properties of the SOWS included exploratory factor analysis, internal consistency, test-retest reliability, and criterion validity. Exploratory factor analysis of the SOWS revealed a general pattern of four factors with three items not always clustered in the same factors at different points of measurement. After excluding these items from factor analysis four factors were identified during detoxification (temperature dysregulation, tractus locomotorius, tractus gastro-intestinalis and facial disinhibition). The 13-item SOWS shows high internal consistency and test-retest reliability and good validity at different stages of withdrawal. The 13-item SOWS is a reliable and valid instrument to assess opioid withdrawal during rapid detoxification. Three items were deleted because their content does not correspond directly with opioid withdrawal symptoms. Copyright (c) 2007 S. Karger AG, Basel.
[Assessing work-related stress: an Italian adaptation of the HSE Management Standards Work-Related Stress Indicator Tool].

PubMed

Marcatto, Francesco; D'Errico, Giuseppe; Di Blas, Lisa; Ferrante, Donatella

2011-01-01

The aim of this paper is to present a preliminary validation of an Italian adaptation of the HSE Management Standards Work-Related Stress Indicator Tool (IT), an instrument for assessing work-related stress at the organizational level, originally developed in Britain by the Health and Safety Executive. A scale that assesses the physical work environment has been added to the original version of the IT. 190 employees of the University of Trieste have been enrolled in the study. A confirmatory analysis showed a satisfactory fit of the eight-factors structure of the instrument. Further psychometric analysis showed adequate internal consistency of the IT scales and good criterion validity, as evidenced by the correlations with self-perception of stress, work satisfaction and motivation. In conclusion, the Indicator Tool proved to be a valid and reliable instrument for the assessment of work-related stress at the organizational level, and it is also compatible with the instructions provided by the Ministry of Labour and Social Policy (Circular letter 18/11/2010).
Reliability and validity of 12-item Short-Form health survey (SF-12) for the health status of Chinese community elderly population in Xujiahui district of Shanghai.

PubMed

Shou, Juan; Ren, Limin; Wang, Haitang; Yan, Fei; Cao, Xiaoyun; Wang, Hui; Wang, Zhiliang; Zhu, Shanzhu; Liu, Yao

2016-04-01

The 12-item Short-Form Health Survey (SF-12) is the abridged practical version of SF-36. This cross-sectional study was aimed to assess the reliability and validity of SF-12 for the health status of Chinese community elderly population. The Chinese community elderly people in Xujiahui district of Shanghai were investigated. The internal consistency reliability was assessed using Cronbach's alpha and split-half reliability coefficients. Construct validity was analyzed using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Spearman's correlation coefficient (ρ) was used for the evaluation of criterion, convergent, and discriminant validity with Spearman's ρ ≥ 0.4 as satisfactory. Comparisons of the SF-12 summary scores among populations that differed in demographics were performed for discriminant validity. Total 1343 individuals aged ≥60 and <85 years old (response rate: 91.3 %) were analyzed. The Cronbach's α value (0.910) and the split-half reliability coefficient (0.812) reflected satisfactory internal consistency reliability of SF-12. EFA extracted a two-factor model (physical and mental health). About 60.7 % of the total variance was explained by the two factors. CFA showed that the two-factor solution provided a good fit to the data. Good convergent validity and discriminant validity of SF-12 were proved by the correction analyses (Spearman's ρ > 0.4) and the comparisons of the SF-12 summary scores among populations (P < 0.05). SF-12 summary scores were significantly correlated with the SF-36 summary scores (Spearman's ρ > 0.4, P < 0.05). In conclusion, SF-12 had satisfactory reliability and validity in measuring health status of Chinese community elderly population in Xujiahui district of Shanghai.
Technical Note: Approximate solution of transient drawdown for constant-flux pumping at a partially penetrating well in a radial two-zone confined aquifer

NASA Astrophysics Data System (ADS)

Huang, C.-S.; Yang, S.-Y.; Yeh, H.-D.

2015-06-01

An aquifer consisting of a skin zone and a formation zone is considered as a two-zone aquifer. Existing solutions for the problem of constant-flux pumping in a two-zone confined aquifer involve laborious calculation. This study develops a new approximate solution for the problem based on a mathematical model describing steady-state radial and vertical flows in a two-zone aquifer. Hydraulic parameters in these two zones can be different but are assumed homogeneous in each zone. A partially penetrating well may be treated as the Neumann condition with a known flux along the screened part and zero flux along the unscreened part. The aquifer domain is finite with an outer circle boundary treated as the Dirichlet condition. The steady-state drawdown solution of the model is derived by the finite Fourier cosine transform. Then, an approximate transient solution is developed by replacing the radius of the aquifer domain in the steady-state solution with an analytical expression for a dimensionless time-dependent radius of influence. The approximate solution is capable of predicting good temporal drawdown distributions over the whole pumping period except at the early stage. A quantitative criterion for the validity of neglecting the vertical flow due to a partially penetrating well is also provided. Conventional models considering radial flow without the vertical component for the constant-flux pumping have good accuracy if satisfying the criterion.
Psychological Flexibility of Nurses in a Cancer Hospital: Preliminary Validation of a Chinese Version of the Work-related Acceptance and Action Questionnaire

PubMed Central

Xu, Xianghua; Liu, Xiangyu; Ou, Meijun; Xie, Chanjuan; Chen, Yongyi

2018-01-01

Objective: To translate the English work-related acceptance and action questionnaire (WAAQ), make cross-cultural adaptations, and examine its psychometric properties when used by Chinese oncology nurses. Methods: After translation, the psychometric properties of the Chinese WAAQ were analyzed among 417 nurses, and content validity was determined by six experts. Results: Item-level content validity index (CVI) values were between 0.83 and 1.00; scale-level CVI/universal agreement (S-CVI/UA) and S-CVI/average were 0.86 and 0.98, respectively, which implicated a good content validity. The correlation of the Chinese WAAQ with AAQ-II (rs = −0.247, P < 0.001) suggested criterion validity, and those with General Health Questionnaire-12 (−0.250, <0.001) and general self-efficacy scale (0.491, <0.001) and Utrecht work engagement scale (UWES) (0.439, <0.001) suggested convergent validity. Exploratory factor analysis identified a seven-item, one-factor structure of WAAQ. The Chinese version of WAAQ had high internal consistency (Cronbach's α = 0.920), with an item-total correlation coefficient of 0.702–0.828 (P < 0.05), split-half reliability of 0.933, and test-retest reliability of 0.772. Conclusions: The Chinese WAAQ is a reliable and valid tool for assessing psychological flexibility in Chinese oncology nurses. PMID:29379839
Development and evaluation of the Andhra Pradesh Children and Parent Study Physical Activity Questionnaire (APCAPS-PAQ): a cross-sectional study.

PubMed

Matsuzaki, Mika; Sullivan, Ruth; Ekelund, Ulf; Krishna, K V Radha; Kulkarni, Bharati; Collier, Tim; Ben-Shlomo, Yoav; Kinra, Sanjay; Kuper, Hannah

2016-01-19

There is limited availability of context-specific physical activity questionnaires in low and middle income countries. The aim of this study was to develop and examine the validity of a new Indian physical activity questionnaire, the Andhra Pradesh Children and Parent Study Physical Activity Questionnaire (APCAPS-PAQ). The current study was conducted with the cohort from the Hyderabad DXA Study (n = 2321), recruited in 2009-2010. Criterion validity (n = 245) was examined by comparing the APCAPS-PAQ to a combined heart rate and motion sensor worn for 8 days. Construct validity (n = 2321) was assessed with linear regression, comparing APCAPS-PAQ against BMI, percent body fat, and pulse rate. The APCAPS-PAQ criterion validity was variable depending on the PA intensity groups (ρ = 0.26, 0.07, 0.39; к = 0.14, 0.04, 0.16 for sedentary, light, moderate/vigorous physical activity (MVPA) respectively). Sedentary and light intensity activities from the questionnaire were underestimated when compared to the criterion data while MVPA in APCAPS-PAQ was overestimated. Higher time spent in sedentary activity in APCAPS-PAQ was associated with higher BMI and percent body fat, suggesting construct validity. The APCAPS-PAQ validity is comparable to other physical activity questionnaires. This tool is able to assess sedentary behavior, moderate/vigorous activity and physical activity energy expenditure on a group level with reasonable validity. This new questionnaire may be used for ranking individuals according to their sedentary time and physical activity in southern India.
Validation of the German version of the Burn Specific Health Scale-Brief (BSHS-B).

PubMed

Müller, Astrid; Smits, Dirk; Jasper, Stefanie; Berg, Lea; Claes, Laurence; Ipaktchi, Ramin; Vogt, Peter M; de Zwaan, Martina

2015-09-01

The Burn Specific Health Scale-Brief (BSHS-B) is recognized as a valid self-rating scale to evaluate quality of life after burn. To validate the translated German version of the BSHS-B. One hundred and forty one burn survivors (65.2% men) with a mean age of 49.62 years (SD=15.16) and a mean duration after burn of 45.01 months (SD=26.18) answered the BSHS-B. Factor structure was tested by using confirmatory factor analysis, reliability (internal consistency) of the scales was determined by means of Cronbach's α. Construct validity was explored through correlations between the BSHS-B and the Short-Form 8 Health Survey (SF-8). In addition, the know-groups technique was used to determine to which degree the BSHS-B discriminates between patients with low and high burn severity based on the abbreviated burn severity index (ABSI). The Hospital Anxiety and Depression Scale (HADS) was used to examine criterion validity. The nine BSHS-B subscales showed good internal consistency. A second-order confirmatory factor analysis revealed the following main components: (1) Affect and Relationship, (2) Function and (3) Skin Involvement. The second-order factors were positively correlated with the SF-8 and negatively correlated with symptoms of anxiety and depression. Patients with low ABSI scored higher on all three BSHS-B domains than those with high ABSI. The results indicate good psychometric properties of the German BSHS-B. Further studies are needed to investigate the utility of the questionnaire in clinical routine practice, evaluation of burn management programs, and burn-specific research. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.
Validation of a Chinese version of the stress overload scale-short and its use as a screening tool for mental health status.

PubMed

Duan, Wenjie; Mu, Wenlong

2018-02-01

Although stress emerges when environmental demands exceed personal resources, existing measurement methods for stress focus only on one aspect. The newly-developed Short Stress Overload Scale (SOS-S) assesses the extent of stress by assessing both event load (i.e., environmental demands) and personal vulnerability (i.e., personal resources). The present study was designed to evaluate the psychometric properties of the Chinese version of Stress Overload Scale-Short (SOS-SC), and further examine its roles in screening mental health status. A total of 1364 participants were recruited from communities and colleges for scale validation. Reliabilities were good throughout the subsamples (ω > 0.80). Confirmatory factor analysis indicated the acceptable goodness-of-fit for the two-factor correlated model (Sample 1: 560 community residents). Multi-group confirmatory factor analysis confirmed measurement invariance across community residents (Sample 1) and college students (Sample 2 and Sample 3). Criterion validity and convergent validity were established (Sample 2: 554 college students). Latent moderated structural equations demonstrated that the relationship between SOS-SC and depression is moderated by social support (Sample 2), further validating the SOS-SC. In addition, the SOS-SC effectively screened individuals in a population at different levels of mental health status (i.e., "at risk" vs. "at low risk" for depression symptoms and/or wellbeing). The SOS-SC exhibits acceptable psychometric properties in the Chinese context. That said, the two aspects of stress can be differentiated by the Chinese context, therefore, the SOS-SC can be used to measure stress and screen mental health status among the Chinese population, and monitor and evaluate health-promoting interventions.
Reliability, Validity, and Classification Accuracy of the DSM-5 Diagnostic Criteria for Gambling Disorder and Comparison to DSM-IV.

PubMed

Stinchfield, Randy; McCready, John; Turner, Nigel E; Jimenez-Murcia, Susana; Petry, Nancy M; Grant, Jon; Welte, John; Chapman, Heather; Winters, Ken C

2016-09-01

The DSM-5 was published in 2013 and it included two substantive revisions for gambling disorder (GD). These changes are the reduction in the threshold from five to four criteria and elimination of the illegal activities criterion. The purpose of this study was to twofold. First, to assess the reliability, validity and classification accuracy of the DSM-5 diagnostic criteria for GD. Second, to compare the DSM-5-DSM-IV on reliability, validity, and classification accuracy, including an examination of the effect of the elimination of the illegal acts criterion on diagnostic accuracy. To compare DSM-5 and DSM-IV, eight datasets from three different countries (Canada, USA, and Spain; total N = 3247) were used. All datasets were based on similar research methods. Participants were recruited from outpatient gambling treatment services to represent the group with a GD and from the community to represent the group without a GD. All participants were administered a standardized measure of diagnostic criteria. The DSM-5 yielded satisfactory reliability, validity and classification accuracy. In comparing the DSM-5 to the DSM-IV, most comparisons of reliability, validity and classification accuracy showed more similarities than differences. There was evidence of modest improvements in classification accuracy for DSM-5 over DSM-IV, particularly in reduction of false negative errors. This reduction in false negative errors was largely a function of lowering the cut score from five to four and this revision is an improvement over DSM-IV. From a statistical standpoint, eliminating the illegal acts criterion did not make a significant impact on diagnostic accuracy. From a clinical standpoint, illegal acts can still be addressed in the context of the DSM-5 criterion of lying to others.

Criterion validity and accuracy of global positioning satellite and data logging devices for wheelchair tennis court movement

PubMed Central

Sindall, Paul; Lenton, John P.; Whytock, Katie; Tolfrey, Keith; Oyster, Michelle L.; Cooper, Rory A.; Goosey-Tolfrey, Victoria L.

2013-01-01

Purpose To compare the criterion validity and accuracy of a 1 Hz non-differential global positioning system (GPS) and data logger device (DL) for the measurement of wheelchair tennis court movement variables. Methods Initial validation of the DL device was performed. GPS and DL were fitted to the wheelchair and used to record distance (m) and speed (m/second) during (a) tennis field (b) linear track, and (c) match-play test scenarios. Fifteen participants were monitored at the Wheelchair British Tennis Open. Results Data logging validation showed underestimations for distance in right (DLR) and left (DLL) logging devices at speeds >2.5 m/second. In tennis-field tests, GPS underestimated distance in five drills. DLL was lower than both (a) criterion and (b) DLR in drills moving forward. Reversing drill direction showed that DLR was lower than (a) criterion and (b) DLL. GPS values for distance and average speed for match play were significantly lower than equivalent values obtained by DL (distance: 2816 (844) vs. 3952 (1109) m, P = 0.0001; average speed: 0.7 (0.2) vs. 1.0 (0.2) m/second, P = 0.0001). Higher peak speeds were observed in DL (3.4 (0.4) vs. 3.1 (0.5) m/second, P = 0.004) during tennis match play. Conclusions Sampling frequencies of 1 Hz are too low to accurately measure distance and speed during wheelchair tennis. GPS units with a higher sampling rate should be advocated in further studies. Modifications to existing DL devices may be required to increase measurement precision. Further research into the validity of movement devices during match play will further inform the demands and movement patterns associated with wheelchair tennis. PMID:23820154
Montreal-Toulouse Language Assessment Battery: evidence of criterion validity from patients with aphasia.

PubMed

Pagliarin, Karina Carlesso; Ortiz, Karin Zazo; Barreto, Simone dos Santos; Pimenta Parente, Maria Alice de Mattos; Nespoulous, Jean-Luc; Joanette, Yves; Fonseca, Rochele Paz

2015-10-15

The Montreal-Toulouse Language Assessment Battery - Brazilian version (MTL-BR) provides a general description of language processing and related components in adults with brain injury. The present study aimed at verifying the criterion-related validity of the Montreal-Toulouse Language Assessment Battery - Brazilian version (MTL-BR) by assessing its ability to discriminate between individuals with unilateral brain damage with and without aphasia. The investigation was carried out in a Brazilian community-based sample of 104 adults, divided into four groups: 26 participants with left hemisphere damage (LHD) with aphasia, 25 participants with right hemisphere damage (RHD), 28 with LHD non-aphasic, and 25 healthy adults. There were significant differences between patients with aphasia and the other groups on most total and subtotal scores on MTL-BR tasks. The results showed strong criterion-related validity evidence for the MTL-BR Battery, and provided important information regarding hemispheric specialization and interhemispheric cooperation. Future research is required to search for additional evidence of sensitivity, specificity and validity of the MTL-BR in samples with different types of aphasia and degrees of language impairment. Copyright © 2015 Elsevier B.V. All rights reserved.
15 CFR 8b.20 - Admission and recruitment.

Code of Federal Regulations, 2014 CFR

2014-01-01

... AGAINST THE HANDICAPPED IN FEDERALLY ASSISTED PROGRAMS OPERATED BY THE DEPARTMENT OF COMMERCE Post... proportion of handicapped individuals who may be admitted; and (2) May not make use of any test or criterion... handicapped individuals unless: (i) The test or criterion, as used by the recipient, has been validated as a...
Procedures for Empirical Determination of En-Route Criterion Levels.

ERIC Educational Resources Information Center

Moncrief, Michael H.

En-route Criterion Levels (ECLs) are defined as decision rules for predicting pupil readiness to advance through an instructional sequence. This study investigated the validity of present ELCs in an individualized mathematics program and tested procedures for empirically determining optimal ECLs. Retest scores and subsequent progress were…
15 CFR 8b.20 - Admission and recruitment.

Code of Federal Regulations, 2011 CFR

2011-01-01

... AGAINST THE HANDICAPPED IN FEDERALLY ASSISTED PROGRAMS OPERATED BY THE DEPARTMENT OF COMMERCE Post... proportion of handicapped individuals who may be admitted; and (2) May not make use of any test or criterion... handicapped individuals unless: (i) The test or criterion, as used by the recipient, has been validated as a...
15 CFR 8b.20 - Admission and recruitment.

Code of Federal Regulations, 2012 CFR

2012-01-01

... AGAINST THE HANDICAPPED IN FEDERALLY ASSISTED PROGRAMS OPERATED BY THE DEPARTMENT OF COMMERCE Post... proportion of handicapped individuals who may be admitted; and (2) May not make use of any test or criterion... handicapped individuals unless: (i) The test or criterion, as used by the recipient, has been validated as a...
15 CFR 8b.20 - Admission and recruitment.

Code of Federal Regulations, 2010 CFR

2010-01-01

... AGAINST THE HANDICAPPED IN FEDERALLY ASSISTED PROGRAMS OPERATED BY THE DEPARTMENT OF COMMERCE Post... proportion of handicapped individuals who may be admitted; and (2) May not make use of any test or criterion... handicapped individuals unless: (i) The test or criterion, as used by the recipient, has been validated as a...
15 CFR 8b.20 - Admission and recruitment.

Code of Federal Regulations, 2013 CFR

2013-01-01

... AGAINST THE HANDICAPPED IN FEDERALLY ASSISTED PROGRAMS OPERATED BY THE DEPARTMENT OF COMMERCE Post... proportion of handicapped individuals who may be admitted; and (2) May not make use of any test or criterion... handicapped individuals unless: (i) The test or criterion, as used by the recipient, has been validated as a...
Development and psychometric properties of an informant assessment scale of theory of mind for adults with traumatic brain injury.

PubMed

Zhang, Dengke; Pang, Yanxia; Cai, Weixiong; Fazio, Rachel L; Ge, Jianrong; Su, Qiaorong; Xu, Shuiqin; Pan, Yinan; Chen, Sanmei; Zhang, Hongwei

2016-08-01

Impairment of theory of mind (ToM) is a common phenomenon following traumatic brain injury (TBI) that has clear effects on patients' social functioning. A growing body of research has focused on this area, and several methods have been developed to assess ToM deficiency. Although an informant assessment scale would be useful for examining individuals with TBI, very few studies have adopted this approach. The purpose of the present study was to develop an informant assessment scale of ToM for adults with traumatic brain injury (IASToM-aTBI) and to test its reliability and validity with 196 adults with TBI and 80 normal adults. A 44-item scale was developed following a literature review, interviews with patient informants, consultations with experts, item analysis, and exploratory factor analysis (EFA). The following three common factors were extracted: social interaction, understanding of beliefs, and understanding of emotions. The psychometric analyses indicate that the scale has good internal consistency reliability, split-half reliability, test-retest reliability, inter-rater reliability, structural validity, discriminate validity and criterion validity. These results provide preliminary evidence that supports the reliability and validity of the IASToM-aTBI as a ToM assessment tool for adults with TBI.
Assessment of mutual understanding of physician patient encounters: development and validation of a Mutual Understanding Scale (MUS) in a multicultural general practice setting.

PubMed

Harmsen, J A M; Bernsen, R M D; Meeuwesen, L; Pinto, D; Bruijnzeels, M A

2005-11-01

Mutual understanding between physician and patient is essential for good quality of care; however, both parties have different views on health complaints and treatment. This study aimed to develop and validate a measure of mutual understanding (MU) in a multicultural setting. The study included 986 patients from 38 general practices. GPs completed a questionnaire and patients were interviewed after the consultation. To assess mutual understanding the answers from GP and patient to questions about different consultation aspects were compared. An expert panel, using nominal group technique, developed criteria for mutual understanding on consultation aspects and secondly, established a ranking to combine all aspects into an overall consultation judgement. Regarding construct validity, patients' ethnicity, age and language proficiency were the most important predictors for MU. Regarding criterion validity, all GP-related criteria (the GPs perception of his ability to explain to the patient, the patient's ability to explain to the GP, and the patient's understanding of consultation aspects), were well-related to MU. The same can be said of patient's consultation satisfaction and feeling that the GP was considerate. We conclude that the Mutual Understanding Scale is regarded a reliable and valid measure to be used in large-scale quantitative studies.
Adolescent Domain Screening Inventory-Short Form: Development and Initial Validation

ERIC Educational Resources Information Center

Corrigan, Matthew J.

2017-01-01

This study sought to develop a short version of the ADSI, and investigate its psychometric properties. Methods: This is a secondary analysis. Analysis to determine the Cronbach's Alpha, correlations to determine concurrent criterion validity and known instrument validity and a logistic regression to determine predictive validity were conducted.…
Psychometric properties of the Adverse Childhood Experiences Abuse Short Form (ACE-ASF) among Romanian high school students.

PubMed

Meinck, Franziska; Cosma, Alina Paula; Mikton, Christopher; Baban, Adriana

2017-10-01

Child abuse is a major public health problem. In order to establish the prevalence of abuse exposure among children, measures need to be age-appropriate, sensitive, reliable and valid. This study aimed to investigate the psychometric properties of the Adverse Childhood Experiences Questionnaire Abuse Short Form (ACE-ASF). The ACE-ASF is an 8-item, retrospective self-report questionnaire measuring lifetime physical, emotional and sexual abuse. Data from a nationally representative sample of 15-year-old, school-going adolescents (n=1733, 55.5% female) from the Romanian Health Behavior in School-Based Children Study 2014 (HBSC) were analyzed. The factorial structure of the ACE-ASF was tested with Exploratory Factor Analysis (EFA) and confirmed using Confirmatory Factor Analysis (CFA). Measurement invariance was examined across sex, and internal reliability and concurrent criterion validity were established. Violence exposure was high: 39.7% physical, 32.2% emotional and 13.1% sexual abuse. EFA established a two-factor structure: physical/emotional abuse and sexual abuse. CFA confirmed this model fitted the data well [χ2(df)=60.526(19); RMSEA=0.036; CFI/TLI=0.990/0.986]. Metric invariance was supported across sexes. Internal consistency was good (0.83) for the sexual abuse scale and poor (0.57) for the physical/emotional abuse scale. Concurrent criterion validity confirmed hypothesized relationships between childhood abuse and health-related quality of life, life satisfaction, self-perceived health, bullying victimization and perpetration, externalizing and internalizing behaviors, and multiple health complaints. Results support the ACE-ASF as a valid measure of physical, emotional and sexual abuse in school-aged adolescents. However, the ACE-ASF combines spanking with other types of physical abuse when this should be assessed separately instead. Future research is needed to replicate findings in different youth populations and across age groups. Copyright © 2017 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Continuum Mechanics at the Atomic Scale.

DTIC Science & Technology

1977-01-01

an infinite hoop stress at the tip of the crack (Figure 9 ). Because of this singularity a perfectly good criterion of brittle fracture, the maximum...for brittle fracture, we will arrive at the Griffith criterion with the extra benefit that the Griffith constant is now fully determined. As a result...crack tip. From (5.9) it now follows that 2 2 2toZ - [a/2 C (v)] t = C (5.10) 0c Alas, this is the Griffith fracture criterion for brittle fracture with
The Arthroscopic Surgical Skill Evaluation Tool (ASSET).

PubMed

Koehler, Ryan J; Amsdell, Simon; Arendt, Elizabeth A; Bisson, Leslie J; Braman, Jonathan P; Bramen, Jonathan P; Butler, Aaron; Cosgarea, Andrew J; Harner, Christopher D; Garrett, William E; Olson, Tyson; Warme, Winston J; Nicandri, Gregg T

2013-06-01

Surgeries employing arthroscopic techniques are among the most commonly performed in orthopaedic clinical practice; however, valid and reliable methods of assessing the arthroscopic skill of orthopaedic surgeons are lacking. The Arthroscopic Surgery Skill Evaluation Tool (ASSET) will demonstrate content validity, concurrent criterion-oriented validity, and reliability when used to assess the technical ability of surgeons performing diagnostic knee arthroscopic surgery on cadaveric specimens. Cross-sectional study; Level of evidence, 3. Content validity was determined by a group of 7 experts using the Delphi method. Intra-articular performance of a right and left diagnostic knee arthroscopic procedure was recorded for 28 residents and 2 sports medicine fellowship-trained attending surgeons. Surgeon performance was assessed by 2 blinded raters using the ASSET. Concurrent criterion-oriented validity, interrater reliability, and test-retest reliability were evaluated. Content validity: The content development group identified 8 arthroscopic skill domains to evaluate using the ASSET. Concurrent criterion-oriented validity: Significant differences in the total ASSET score (P < .05) between novice, intermediate, and advanced experience groups were identified. Interrater reliability: The ASSET scores assigned by each rater were strongly correlated (r = 0.91, P < .01), and the intraclass correlation coefficient between raters for the total ASSET score was 0.90. Test-retest reliability: There was a significant correlation between ASSET scores for both procedures attempted by each surgeon (r = 0.79, P < .01). The ASSET appears to be a useful, valid, and reliable method for assessing surgeon performance of diagnostic knee arthroscopic surgery in cadaveric specimens. Studies are ongoing to determine its generalizability to other procedures as well as to the live operating room and other simulated environments.
Construction and Validation of the Perceived Opportunity to Craft Scale.

PubMed

van Wingerden, Jessica; Niks, Irene M W

2017-01-01

We developed and validated a scale to measure employees' perceived opportunity to craft (POC) in two separate studies conducted in the Netherlands (total N = 2329). POC is defined as employees' perception of their opportunity to craft their job. In Study 1, the perceived opportunity to craft scale (POCS) was developed and tested for its factor structure and reliability in an explorative way. Study 2 consisted of confirmatory analyses of the factor structure and reliability of the scale as well as examination of the discriminant and criterion-related validity of the POCS. The results indicated that the scale consists of one dimension and could be reliably measured with five items. Evidence was found for the discriminant validity of the POCS. The scale also showed criterion-related validity when correlated with job crafting (+), job resources (autonomy +; opportunities for professional development +), work engagement (+), and the inactive construct cynicism (-). We discuss the implications of these findings for theory and practice.
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs

PubMed Central

Craigon, Peter J.; Blythe, Simon A.; England, Gary C. W.; Asher, Lucy

2017-01-01

Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5–8, 8–12 and 5–12 months was high. All scales excepting Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations were found with training outcome, at at-least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs. This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs. PMID:28614347
Measuring physical activity in young people with cerebral palsy: validity and reliability of the ActivPAL™ monitor.

PubMed

Bania, Theofani

2014-09-01

We determined the criterion validity and the retest reliability of the ΑctivPAL™ monitor in young people with diplegic cerebral palsy (CP). Activity monitor data were compared with the criterion of video recording for 10 participants. For the retest reliability, activity monitor data were collected from 24 participants on two occasions. Participants had to have diplegic CP and be between 14 and 22 years of age. They also had to be of Gross Motor Function Classification System level II or III. Outcomes were time spent in standing, number of steps (physical activity) and time spent in sitting (sedentary behaviour). For criterion validity, coefficients of determination were all high (r(2) ≥ 0.96), and limits of group agreement were relatively narrow, but limits of agreement for individuals were narrow only for number of steps (≥5.5%). Relative reliability was high for number of steps (intraclass correlation coefficient = 0.87) and moderate for time spent in sitting and lying, and time spent in standing (intraclass correlation coefficients = 0.60-0.66). For groups, changes of up to 7% could be due to measurement error with 95% confidence, but for individuals, changes as high as 68% could be due to measurement error. The results support the criterion validity and the retest reliability of the ActivPAL™ to measure physical activity and sedentary behaviour in groups of young people with diplegic CP but not in individuals. Copyright © 2014 John Wiley & Sons, Ltd.
Measuring the learning capacity of organisations: development and factor analysis of the Questionnaire for Learning Organizations.

PubMed

Oudejans, S C C; Schippers, G M; Schramade, M H; Koeter, M W J; van den Brink, W

2011-04-01

To investigate internal consistency and factor structure of a questionnaire measuring learning capacity based on Senge's theory of the five disciplines of a learning organisation: Personal Mastery, Mental Models, Shared Vision, Team Learning, and Systems Thinking. Cross-sectional study. Substance-abuse treatment centres (SATCs) in The Netherlands. A total of 293 SATC employees from outpatient and inpatient treatment departments, financial and human resources departments. Psychometric properties of the Questionnaire for Learning Organizations (QLO), including factor structure, internal consistency, and interscale correlations. A five-factor model representing the five disciplines of Senge showed good fit. The scales for Personal Mastery, Shared Vision and Team Learning had good internal consistency, but the scales for Systems Thinking and Mental Models had low internal consistency. The proposed five-factor structure was confirmed in the QLO, which makes it a promising instrument to assess learning capacity in teams. The Systems Thinking and the Mental Models scales have to be revised. Future research should be aimed at testing criterion and discriminatory validity.
A reliable and valid questionnaire was developed to measure computer vision syndrome at the workplace.

PubMed

Seguí, María del Mar; Cabrero-García, Julio; Crespo, Ana; Verdú, José; Ronda, Elena

2015-06-01

To design and validate a questionnaire to measure visual symptoms related to exposure to computers in the workplace. Our computer vision syndrome questionnaire (CVS-Q) was based on a literature review and validated through discussion with experts and performance of a pretest, pilot test, and retest. Content validity was evaluated by occupational health, optometry, and ophthalmology experts. Rasch analysis was used in the psychometric evaluation of the questionnaire. Criterion validity was determined by calculating the sensitivity and specificity, receiver operator characteristic curve, and cutoff point. Test-retest repeatability was tested using the intraclass correlation coefficient (ICC) and concordance by Cohen's kappa (κ). The CVS-Q was developed with wide consensus among experts and was well accepted by the target group. It assesses the frequency and intensity of 16 symptoms using a single rating scale (symptom severity) that fits the Rasch rating scale model well. The questionnaire has sensitivity and specificity over 70% and achieved good test-retest repeatability both for the scores obtained [ICC = 0.802; 95% confidence interval (CI): 0.673, 0.884] and CVS classification (κ = 0.612; 95% CI: 0.384, 0.839). The CVS-Q has acceptable psychometric properties, making it a valid and reliable tool to control the visual health of computer workers, and can potentially be used in clinical trials and outcome research. Copyright © 2015 Elsevier Inc. All rights reserved.
Reliability and Validity of the Work and Well-Being Inventory (WBI) for Employees.

PubMed

Vendrig, A A; Schaafsma, F G

2018-06-01

Purpose The purpose of this study is to measure the psychometric properties of the Work and Wellbeing Inventory (WBI) (in Dutch: VAR-2), a screening tool that is used within occupational health care and rehabilitation. Our research question focused on the reliability and validity of this inventory. Methods Over the years seven different samples of workers, patients and sick listed workers varying in size between 89 and 912 participants (total: 2514), were used to measure the test-retest reliability, the internal consistency, the construct and concurrent validity, and the criterion and predictive validity. Results The 13 scales displayed good internal consistency and test-retest reliability. The constructive validity of the WBI could clearly be demonstrated in both patients and healthy workers. Confirmative factor analyses revealed a CFI >.90 for all scales. The depression scale predicted future work absenteeism (>6 weeks) because of a common mental disorder in healthy workers. The job strain scale and the illness behavior scale predicted long term absenteeism (>3 months) in workers with short-term absenteeism. The illness behavior scale moderately predicted return to work in rehab patients attending an intensive multidisciplinary program. Conclusions The WBI is a valid and reliable tool for occupational health practitioners to screen for risk factors for prolonged or future sickness absence. With this tool they will have reliable indications for further advice and interventions to restore the work ability.

Reliability and Validity of the Behavioral Addiction Measure for Video Gaming.

PubMed

Sanders, James L; Williams, Robert J

2016-01-01

Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.
Age-Neutrality of a Brief Assessment of the Section III Alternative Model for Personality Disorders in Older Adults.

PubMed

Debast, Inge; Rossi, Gina; van Alphen, S P J

2018-04-01

The alternative model for personality disorders in the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders ( DSM-5) is considered an important step toward a possibly better conceptualization of personality pathology in older adulthood, by the introduction of levels of personality functioning (Criterion A) and trait dimensions (Criterion B). Our main aim was to examine age-neutrality of the Short Form of the Severity Indices of Personality Problems (SIPP-SF; Criterion A) and Personality Inventory for DSM-5-Brief Form (PID-5-BF; Criterion B). Differential item functioning (DIF) analyses and more specifically the impact on scale level through differential test functioning (DTF) analyses made clear that the SIPP-SF was more age-neutral (6% DIF, only one of four domains showed DTF) than the PID-5-BF (25% DIF, all four tested domains had DTF) in a community sample of older and younger adults. Age differences in convergent validity also point in the direction of differences in underlying constructs. Concurrent and criterion validity in geriatric psychiatry inpatients suggest that both the SIPP-SF scales measuring levels of personality functioning (especially self-functioning) and the PID-5-BF might be useful screening measures in older adults despite age-neutrality not being confirmed.
Food and Nutrition (Intermediate). Performance Objectives and Criterion-Referenced Test Items.

ERIC Educational Resources Information Center

Missouri Univ., Columbia. Instructional Materials Lab.

This document contains competencies and criterion-referenced test items for the Intermediate Food and Nutrition semester course in Missouri that were derived from the duties and tasks of the Missouri homemaker and identified and validated by home economics teachers and subject matter specialists. The guide is designed to assist home economics…
Multi-Informant Assessment of Temperament in Children with Externalizing Behavior Problems

ERIC Educational Resources Information Center

Copeland, William; Landry, Kerry; Stanger, Catherine; Hudziak, James J.

2004-01-01

We examined the criterion validity of parent and self-report versions of the Junior Temperament and Character Inventory (JTCI) in children with high levels of externalizing problems. The sample included 412 children (206 participants and 206 siblings) participating in a family study of attention and aggressive behavior problems. Criterion validity…
The Validity of the Instructional Reading Level.

ERIC Educational Resources Information Center

Powell, William R.

Presented is a critical inquiry about the product of the informal reading inventory (IRI) and about some of the elements used in the process of determining that product. Recent developments on this topic are briefly reviewed. Questions are raised concerning what is a suitable criterion level for word recognition. The original criterion of 95…
Estimation of median growth curves for children up two years old based on biresponse local linear estimator

NASA Astrophysics Data System (ADS)

Chamidah, Nur; Rifada, Marisa

2016-03-01

There is significant of the coeficient correlation between weight and height of the children. Therefore, the simultaneous model estimation is better than partial single response approach. In this study we investigate the pattern of sex difference in growth curve of children from birth up to two years of age in Surabaya, Indonesia based on biresponse model. The data was collected in a longitudinal representative sample of the Surabaya population of healthy children that consists of two response variables i.e. weight (kg) and height (cm). While a predictor variable is age (month). Based on generalized cross validation criterion, the modeling result based on biresponse model by using local linear estimator for boy and girl growth curve gives optimal bandwidth i.e 1.41 and 1.56 and the determination coefficient (R2) i.e. 99.99% and 99.98%,.respectively. Both boy and girl curves satisfy the goodness of fit criterion i.e..the determination coefficient tends to one. Also, there is difference pattern of growth curve between boy and girl. The boy median growth curves is higher than those of girl curve.
Experimental investigation of shaping disturbance observer design for motion control of precision mechatronic stages with resonances

NASA Astrophysics Data System (ADS)

Yang, Jin; Hu, Chuxiong; Zhu, Yu; Wang, Ze; Zhang, Ming

2017-08-01

In this paper, shaping disturbance observer (SDOB) is investigated for precision mechatronic stages with middle-frequency zero/pole type resonance to achieve good motion control performance in practical manufacturing situations. Compared with traditional standard disturbance observer (DOB), in SDOB a pole-zero cancellation based shaping filter is cascaded to the mechatronic stage plant to meet the challenge of motion control performance deterioration caused by actual resonance. Noting that pole-zero cancellation is inevitably imperfect and the controller may even consequently become unstable in practice, frequency domain stability analysis is conducted to find out how each parameter of the shaping filter affects the control stability. Moreover, the robust design criterion of the shaping filter, and the design procedure of SDOB, are both proposed to guide the actual design and facilitate practical implementation. The SDOB with the proposed design criterion is applied to a linear motor driven stage and a voice motor driven stage, respectively. Experimental results consistently validate the effectiveness nature of the proposed SDOB scheme in practical mechatronics motion applications. The proposed SDOB design actually could be an effective unit in the controller design for motion stages of mechanical manufacture equipments.
Tribological Behavior and the Mild–Severe Wear Transition of Mg97Zn1Y2 Alloy with a LPSO Structure Phase

PubMed Central

Sun, Wei; Xuan, Xihua; Li, Liang; An, Jian

2018-01-01

Dry friction and wear tests were performed on as-cast Mg97Zn1Y2 alloy using a pin-on-disc configuration. Coefficients of friction and wear rates were measured as a function of applied load at sliding speeds of 0.2, 0.8 and 3.0 m/s. The wear mechanisms were identified in the mild and severe wear regimes by means of morphological observation and composition analysis of worn surfaces using scanning electron microscope (SEM) and energy dispersive X-ray spectrometer (EDS). Analyses of microstructure and hardness changes in subsurfaces verified the microstructure transformation from the deformed to the dynamically recrystallized, and properties changed from the strain hardening to dynamic crystallization (DRX) softening before and after the mild–severe wear transition. The mild–severe wear transition can be determined by a proposed contact surface DRX temperature criterion, from which the critical DRX temperatures at different sliding speeds are calculated using DRX dynamics; hence transition loads can also be calculated using a transition load model. The calculated transition loads are in good agreement with the measured ones, demonstrating the validity and applicability of the contact surface DRX temperature criterion. PMID:29584692
Considerations Underlying the Use of Mixed Group Validation

ERIC Educational Resources Information Center

Jewsbury, Paul A.; Bowden, Stephen C.

2013-01-01

Mixed Group Validation (MGV) is an approach for estimating the diagnostic accuracy of tests. MGV is a promising alternative to the more commonly used Known Groups Validation (KGV) approach for estimating diagnostic accuracy. The advantage of MGV lies in the fact that the approach does not require a perfect external validity criterion or gold…
Psychometric properties of the Social Interaction Anxiety Scale and separation criterion between Spanish youths with and without subtypes of social anxiety.

PubMed

Zubeidat, Ihab; Salinas, José María; Sierra, Juan Carlos; Fernández-Parra, Antonio

2007-01-01

In this study, we analyzed the reliability and validity of the Social Interaction Anxiety Scale (SIAS) and propose a separation criterion between youths with specific and generalized social anxiety and youths without social anxiety. A sample of 1012 Spanish youths attending school completed the SIAS, the Liebowitz Social Anxiety Scale, the Social Avoidance and Distress Scale, the Fear of Negative Evaluation Scale, the Youth Self-Report for Ages 11-18 and the Minnesota Multiphasic Personality Inventory-Adolescent. The factor analysis suggests the existence of three factors in the SIAS, the first two of which explain most of the variance of the construct assessed. Internal consistency is adequate in the first two factors. The SIAS features an adequate theoretical validity with the scores of different variables related to social interaction. Analysis of the criterion scores yields three groups pertaining to three clearly differentiated clusters. In the third cluster, two of social anxiety groups - specific and generalized - have been identified by means of a quantitative separation criterion.
Wechsler Adult Intelligence Scale-Fourth Edition (WAIS-IV) processing speed scores as measures of noncredible responding: The third generation of embedded performance validity indicators.

PubMed

Erdodi, Laszlo A; Abeare, Christopher A; Lichtenstein, Jonathan D; Tyson, Bradley T; Kucharski, Brittany; Zuccato, Brandon G; Roth, Robert M

2017-02-01

Research suggests that select processing speed measures can also serve as embedded validity indicators (EVIs). The present study examined the diagnostic utility of Wechsler Adult Intelligence Scale-Fourth Edition (WAIS-IV) subtests as EVIs in a mixed clinical sample of 205 patients medically referred for neuropsychological assessment (53.3% female, mean age = 45.1). Classification accuracy was calculated against 3 composite measures of performance validity as criterion variables. A PSI ≤79 produced a good combination of sensitivity (.23-.56) and specificity (.92-.98). A Coding scaled score ≤5 resulted in good specificity (.94-1.00), but low and variable sensitivity (.04-.28). A Symbol Search scaled score ≤6 achieved a good balance between sensitivity (.38-.64) and specificity (.88-.93). A Coding-Symbol Search scaled score difference ≥5 produced adequate specificity (.89-.91) but consistently low sensitivity (.08-.12). A 2-tailed cutoff on the Coding/Symbol Search raw score ratio (≤1.41 or ≥3.57) produced acceptable specificity (.87-.93), but low sensitivity (.15-.24). Failing ≥2 of these EVIs produced variable specificity (.81-.93) and sensitivity (.31-.59). Failing ≥3 of these EVIs stabilized specificity (.89-.94) at a small cost to sensitivity (.23-.53). Results suggest that processing speed based EVIs have the potential to provide a cost-effective and expedient method for evaluating the validity of cognitive data. Given their generally low and variable sensitivity, however, they should not be used in isolation to determine the credibility of a given response set. They also produced unacceptably high rates of false positive errors in patients with moderate-to-severe head injury. Combining evidence from multiple EVIs has the potential to improve overall classification accuracy. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
I-CAN: the classification and prediction of support needs.

PubMed

Arnold, Samuel R C; Riches, Vivienne C; Stancliffe, Roger J

2014-03-01

Since 1992, the diagnosis and classification of intellectual disability has been dependent upon three constructs: intelligence, adaptive behaviour and support needs (Luckasson et al. 1992. Mental Retardation: Definition, Classification and Systems of Support. American Association on Intellectual and Developmental Disability, Washington, DC). While the methods and instruments to measure intelligence and adaptive behaviour are well established and generally accepted, the measurement and classification of support needs is still in its infancy. This article explores the measurement and classification of support needs. A study is presented comparing scores on the ICF (WHO, 2001) based I-CAN v4.2 support needs assessment and planning tool with expert clinical judgment using a proposed classification of support needs. A logical classification algorithm was developed and validated on a separate sample. Good internal consistency (range 0.73-0.91, N = 186) and criterion validity (κ = 0.94, n = 49) were found. Further advances in our understanding and measurement of support needs could change the way we assess, describe and classify disability. © 2013 John Wiley & Sons Ltd.
Development and Validation of the Adolescent Psychological Need Support in Exercise Questionnaire.

PubMed

Emm-Collison, Lydia G; Standage, Martyn; Gillison, Fiona B

2016-10-01

Grounded within self-determination theory (SDT; Deci & Ryan, 2000; Ryan & Deci, in press), three studies were conducted to develop and psychometrically test a measure of adolescents' perceptions of psychological need support for exercise (viz., for autonomy, competence, and relatedness): the Adolescent Psychological Need Support in Exercise Questionnaire (APNSEQ). In Study 1, 34 items were developed in collaboration with an expert panel. Through categorical confirmatory factor analysis and item response theory, responses from 433 adolescents were used to identify the best fitting and performing items in Study 2. Here, a three-factor nine-item measure showed good fit to the data. In Study 3, responses from an independent sample of 373 adolescents provided further evidence for the nine-item solution as well as for internal consistency, criterion validity, and invariance across gender and social agent (friends, family, and physical education teacher). The APNSEQ was supported as a measure of adolescents' perceptions of psychological need support within the context of exercise.
Psychometric properties of the Florence CyberBullying-CyberVictimization Scales.

PubMed

Palladino, Benedetta Emanuela; Nocentini, Annalaura; Menesini, Ersilia

2015-02-01

The present study tried to answer the research need for empirically validated and theoretically based instruments to assess cyberbullying and cybervictimization. The psychometric properties of the Florence CyberBullying-CyberVictimization Scales (FCBVSs) were analyzed in a sample of 1,142 adolescents (Mage=15.18 years; SD=1.12 years; 54.5% male). For both cybervictimization and cyberbullying, results support a gender invariant model involving 14 items and four factors covering four types of behaviors (written-verbal, visual, impersonation, and exclusion). The second-order confirmatory factor analysis confirmed that a "global," second-order measure of cyberbullying and cybervictimization fits the data well. Overall, the scales showed good validity (construct, concurrent, and convergent) and reliability (internal consistency and test-retest). In addition, using the global key question measure as a criterion, ROC analyses, determining the ability of a test to discriminate between groups, allowed us to identify cutoff points to classify respondents as involved/not involved starting from the continuum measure derived from the scales.
Evaluations of the psychometric properties of the Recovery-Stress Questionnaire for Athletes among a sample of young French table tennis players.

PubMed

Martinent, Guillaume; Decret, Jean-Claude; Isoard-Gautheur, Sandrine; Filaire, Edith; Ferrand, Claude

2014-04-01

This study used confirmatory factor analyses (CFAs) among a sample of young French table tennis players to test: (a) original 19-factor structure, (b) 14-factor structure recently suggested in literature, and (c) hierarchical factor structure of the Recovery-Stress Questionnaire for Athletes (RESTQ-Sport). 148 table tennis players completed the RESTQ-Sport and other self-report questionnaires between one to five occasions with a delay of 1 mo. between each completion. Results of CFAs showed: (a) evidence for relative superiority of the original model in comparison to an alternative model recently proposed in literature, (b) a good fit of the data for the 67-item 17-factor model of the RESTQ-Sport, and (c) an acceptable fit of the data for the hierarchical model of the RESTQ-Sport. Correlations between RESTQ-Sport subscales and burnout and motivation subscales also provided evidence for criterion-related validity of the RESTQ-Sport. This study provided support for reliability and validity of the RESTQ-Sport.
The List of Threatening Experiences: the reliability and validity of a brief life events questionnaire.

PubMed

Brugha, T S; Cragg, D

1990-07-01

During the 23 years since the original work of Holmes & Rahe, research into stressful life events on human subjects has tended towards the development of longer and more complex inventories. The List of Threatening Experiences (LTE) of Brugha et al., by virtue of its brevity, overcomes difficulties of clinical application. In a study of 50 psychiatric patients and informants, the questionnaire version of the list (LTE-Q) was shown to have high test-retest reliability, and good agreement with informant information. Concurrent validity, based on the criterion of independently rated adversity derived from a semistructured life events interview, making use of the Life Events and Difficulties Scales (LEDS) method developed by Brown & Harris, showed both high specificity and sensitivity. The LTE-Q is particularly recommended for use in psychiatric, psychological and social studies in which other intervening variables such as social support, coping, and cognitive variables are of interest, and resources do not allow for the use of extensive interview measures of stress.
Assessment of performance validity in the Stroop Color and Word Test in mild traumatic brain injury patients: a criterion-groups validation design.

PubMed

Guise, Brian J; Thompson, Matthew D; Greve, Kevin W; Bianchini, Kevin J; West, Laura

2014-03-01

The current study assessed performance validity on the Stroop Color and Word Test (Stroop) in mild traumatic brain injury (TBI) using criterion-groups validation. The sample consisted of 77 patients with a reported history of mild TBI. Data from 42 moderate-severe TBI and 75 non-head-injured patients with other clinical diagnoses were also examined. TBI patients were categorized on the basis of Slick, Sherman, and Iverson (1999) criteria for malingered neurocognitive dysfunction (MND). Classification accuracy is reported for three indicators (Word, Color, and Color-Word residual raw scores) from the Stroop across a range of injury severities. With false-positive rates set at approximately 5%, sensitivity was as high as 29%. The clinical implications of these findings are discussed. © 2012 The British Psychological Society.
Validity and reliability of sleep time questionnaires in children and adolescents: A systematic review and meta-analysis.

PubMed

Nascimento-Ferreira, Marcus V; Collese, Tatiana S; de Moraes, Augusto César F; Rendo-Urteaga, Tara; Moreno, Luis A; Carvalho, Heráclito B

2016-12-01

Sleep duration has been associated with several health outcomes in children and adolescents. As an extensive number of questionnaires are currently used to investigate sleep schedule or sleep time, we performed a systematic review of criterion validation of sleep time questionnaires for children and adolescents, considering accelerometers as the reference method. We found a strong correlation between questionnaires and accelerometers for weeknights and a moderate correlation for weekend nights. When considering only studies performing a reliability assessment of the used questionnaires, a significant increase in the correlations for both weeknights and weekend nights was observed. In conclusion, moderate to strong criterion validity of sleep time questionnaires was observed; however, the reliability assessment of the questionnaires showed strong validation performance. Copyright © 2015 Elsevier Ltd. All rights reserved.
Validity, responsiveness, minimal detectable change, and minimal clinically important change of the Pediatric Motor Activity Log in children with cerebral palsy.

PubMed

Lin, Keh-chung; Chen, Hui-fang; Chen, Chia-ling; Wang, Tien-ni; Wu, Ching-yi; Hsieh, Yu-wei; Wu, Li-ling

2012-01-01

This study examined criterion-related validity and clinimetric properties of the Pediatric Motor Activity Log (PMAL) in children with cerebral palsy. Study participants were 41 children (age range: 28-113 months) and their parents. Criterion-related validity was evaluated by the associations between the PMAL and criterion measures at baseline and posttreatment, including the self-care, mobility, and cognition subscale, the total performance of the Functional Independence Measure in children (WeeFIM), and the grasping and visual-motor integration of the Peabody Developmental Motor Scales. Pearson correlation coefficients were calculated. Responsiveness was examined using the paired t test and the standardized response mean, the minimal detectable change was captured at the 90% confidence level, and the minimal clinically important change was estimated using anchor-based and distribution-based approaches. The PMAL-QOM showed fair concurrent validity at pretreatment and posttreatment and predictive validity, whereas the PMAL-AOU had fair concurrent validity at posttreatment only. The PMAL-AOU and PMAL-QOM were both markedly responsive to change after treatment. Improvement of at least 0.67 points on the PMAL-AOU and 0.66 points on the PMAL-QOM can be considered as a true change, not measurement error. A mean change has to exceed the range of 0.39-0.94 on the PMAL-AOU and the range of 0.38-0.74 on the PMAL-QOM to be regarded as clinically important change. Copyright © 2011 Elsevier Ltd. All rights reserved.
Construct and Criterion Validity of the PedsQL™ 4.0 Instrument (Pediatric Quality of Life Inventory) in Colombia.

PubMed

Amaya-Arias, Ana Carolina; Alzate, Juan Pablo; Eslava-Schmalbach, Javier H

2017-01-01

This study aimed at determining the validity of the Pediatric Quality of Life Inventory 4.0 (PedsQL™ 4.0) for the measurement of health-related quality of life (HRQOL) in Colombian children. Validation study of measurement instruments. The PedsQL™ 4.0 was applied by convenience sampling to 375 pairs of children and adolescents between the ages of 5 and 17 and to their parents-caregivers, as well as to 125 parents-caregivers of children between the ages of 2 and 4 in five cities of Colombia (Bogota, Medellin, Cali, Barranquilla and Bucaramanga). Construct validity was assessed through the use of exploratory and confirmatory factor analysis, and criterion validity was assessed by correlations between the PedsQL™ 4.0 and the KIDSCREEN-27. The instrument was applied to 375 children (ages 5-18) and 125 parents of children between the ages of 2 and 4. Factor analysis revealed four factors considered suitable for the sample in both the child and parent reports, whereas Bartlett's test of sphericity showed inter-correlation between variables. Scale and subscales showed proper indicators of internal consistency. It is recommended not to include or review some of the items in the Colombian version of the scale. The Spanish version for Colombia of the PedsQL™ 4.0 displays suitable indicators of criterion and construct validity, therefore becoming a valuable tool for measuring HRQOL in children in our country. Some modifications are recommended for the Colombian version of the scale.

A new self-report inventory of dyslexia for students: criterion and construct validity.

PubMed

Tamboer, Peter; Vorst, Harrie C M

2015-02-01

The validity of a Dutch self-report inventory of dyslexia was ascertained in two samples of students. Six biographical questions, 20 general language statements and 56 specific language statements were based on dyslexia as a multi-dimensional deficit. Dyslexia and non-dyslexia were assessed with two criteria: identification with test results (Sample 1) and classification using biographical information (both samples). Using discriminant analyses, these criteria were predicted with various groups of statements. All together, 11 discriminant functions were used to estimate classification accuracy of the inventory. In Sample 1, 15 statements predicted the test criterion with classification accuracy of 98%, and 18 statements predicted the biographical criterion with classification accuracy of 97%. In Sample 2, 16 statements predicted the biographical criterion with classification accuracy of 94%. Estimations of positive and negative predictive value were 89% and 99%. Items of various discriminant functions were factor analysed to find characteristic difficulties of students with dyslexia, resulting in a five-factor structure in Sample 1 and a four-factor structure in Sample 2. Answer bias was investigated with measures of internal consistency reliability. Less than 20 self-report items are sufficient to accurately classify students with and without dyslexia. This supports the usefulness of self-assessment of dyslexia as a valid alternative to diagnostic test batteries. Copyright © 2015 John Wiley & Sons, Ltd.
Criterion Validity of Measures of Perceived Relative Harm of E-Cigarettes and Smokeless Tobacco Compared to Cigarettes

PubMed Central

Persoskie, Alexander; Nguyen, Anh B.; Kaufman, Annette R.; Tworek, Cindy

2017-01-01

Beliefs about the relative harmfulness of one product compared to another (perceived relative harm) are central to research and regulation concerning tobacco and nicotine-containing products, but techniques for measuring such beliefs vary widely. We compared the validity of direct and indirect measures of perceived harm of e-cigarettes and smokeless tobacco (SLT) compared to cigarettes. On direct measures, participants explicitly compare the harmfulness of each product. On indirect measures, participants rate the harmfulness of each product separately, and ratings are compared. The U.S. Health Information National Trends Survey (HINTS-FDA-2015; N=3738) included direct measures of perceived harm of e-cigarettes and SLT compared to cigarettes. Indirect measures were created by comparing ratings of harm from e-cigarettes, SLT, and cigarettes on 3-point scales. Logistic regressions tested validity by assessing whether direct and indirect measures were associated with criterion variables including: ever-trying e-cigarettes, ever-trying snus, and SLT use status. Compared to the indirect measures, the direct measures of harm were more consistently associated with criterion variables. On direct measures, 26% of adults rated e-cigarettes as less harmful than cigarettes, and 11% rated SLT as less harmful than cigarettes. Direct measures appear to provide valid information about individuals’ harm beliefs, which may be used to inform research and tobacco control policy. Further validation research is encouraged. PMID:28073035
Determination of the criterion-related validity of hip joint angle test for estimating hamstring flexibility using a contemporary statistical approach.

PubMed

Sainz de Baranda, Pilar; Rodríguez-Iniesta, María; Ayala, Francisco; Santonja, Fernando; Cejudo, Antonio

2014-07-01

To examine the criterion-related validity of the horizontal hip joint angle (H-HJA) test and vertical hip joint angle (V-HJA) test for estimating hamstring flexibility measured through the passive straight-leg raise (PSLR) test using contemporary statistical measures. Validity study. Controlled laboratory environment. One hundred thirty-eight professional trampoline gymnasts (61 women and 77 men). Hamstring flexibility. Each participant performed 2 trials of H-HJA, V-HJA, and PSLR tests in a randomized order. The criterion-related validity of H-HJA and V-HJA tests was measured through the estimation equation, typical error of the estimate (TEEST), validity correlation (β), and their respective confidence limits. The findings from this study suggest that although H-HJA and V-HJA tests showed moderate to high validity scores for estimating hamstring flexibility (standardized TEEST = 0.63; β = 0.80), the TEEST statistic reported for both tests was not narrow enough for clinical purposes (H-HJA = 10.3 degrees; V-HJA = 9.5 degrees). Subsequently, the predicted likely thresholds for the true values that were generated were too wide (H-HJA = predicted value ± 13.2 degrees; V-HJA = predicted value ± 12.2 degrees). The results suggest that although the HJA test showed moderate to high validity scores for estimating hamstring flexibility, the prediction intervals between the HJA and PSLR tests are not strong enough to suggest that clinicians and sport medicine practitioners should use the HJA and PSLR tests interchangeably as gold standard measurement tools to evaluate and detect short hamstring muscle flexibility.
Estimating activity energy expenditure: how valid are physical activity questionnaires?

PubMed

Neilson, Heather K; Robson, Paula J; Friedenreich, Christine M; Csizmadi, Ilona

2008-02-01

Activity energy expenditure (AEE) is the modifiable component of total energy expenditure (TEE) derived from all activities, both volitional and nonvolitional. Because AEE may affect health, there is interest in its estimation in free-living people. Physical activity questionnaires (PAQs) could be a feasible approach to AEE estimation in large populations, but it is unclear whether or not any PAQ is valid for this purpose. Our aim was to explore the validity of existing PAQs for estimating usual AEE in adults, using doubly labeled water (DLW) as a criterion measure. We reviewed 20 publications that described PAQ-to-DLW comparisons, summarized study design factors, and appraised criterion validity using mean differences (AEE(PAQ) - AEE(DLW), or TEE(PAQ) - TEE(DLW)), 95% limits of agreement, and correlation coefficients (AEE(PAQ) versus AEE(DLW) or TEE(PAQ) versus TEE(DLW)). Only 2 of 23 PAQs assessed most types of activity over the past year and indicated acceptable criterion validity, with mean differences (TEE(PAQ) - TEE(DLW)) of 10% and 2% and correlation coefficients of 0.62 and 0.63, respectively. At the group level, neither overreporting nor underreporting was more prevalent across studies. We speculate that, aside from reporting error, discrepancies between PAQ and DLW estimates may be partly attributable to 1) PAQs not including key activities related to AEE, 2) PAQs and DLW ascertaining different time periods, or 3) inaccurate assignment of metabolic equivalents to self-reported activities. Small sample sizes, use of correlation coefficients, and limited information on individual validity were problematic. Future research should address these issues to clarify the true validity of PAQs for estimating AEE.
Validation of the peak bilirubin criterion for outcome after partial hepatectomy.

PubMed

van Mierlo, Kim M C; Lodewick, Toine M; Dhar, Dipok K; van Woerden, Victor; Kurstjens, Ralph; Schaap, Frank G; van Dam, Ronald M; Vyas, Soumil; Malagó, Massimo; Dejong, Cornelis H C; Olde Damink, Steven W M

2016-10-01

Postoperative liver failure (PLF) is a dreaded complication after partial hepatectomy. The peak bilirubin criterion (>7.0 mg/dL or ≥120 μmol/L) is used to define PLF. This study aimed to validate the peak bilirubin criterion as postoperative risk indicator for 90-day liver-related mortality. Characteristics of 956 consecutive patients who underwent partial hepatectomy at the Maastricht University Medical Centre or Royal Free London between 2005 and 2012 were analyzed by uni- and multivariable analyses with odds ratios (OR) and 95% confidence intervals (95%CI). Thirty-five patients (3.7%) met the postoperative peak bilirubin criterion at median day 19 with a median bilirubin level of 183 [121-588] μmol/L. Sensitivity and specificity for liver-related mortality after major hepatectomy were 41.2% and 94.6%, respectively. The positive predictive value was 22.6%. Predictors of liver-related mortality were the peak bilirubin criterion (p < 0.001, OR = 15.9 [95%CI 5.2-48.7]), moderate-severe steatosis and fibrosis (p = 0.013, OR = 8.5 [95%CI 1.6-46.6]), ASA 3-4 (p = 0.047, OR = 3.0 [95%CI 1.0-8.8]) and age (p = 0.044, OR = 1.1 [95%CI 1.0-1.1]). The peak bilirubin criterion has a low sensitivity and positive predictive value for 90-day liver-related mortality after major hepatectomy. Copyright © 2016 International Hepato-Pancreato-Biliary Association Inc. Published by Elsevier Ltd. All rights reserved.
A comparison of two patient classification instruments in an acute care hospital.

PubMed

Seago, Jean Ann

2002-05-01

Patient classification systems are alternately praised and vilified by staff nurses, nurse managers, and nurse executives. Most nurses agree that substantial resources are used to create or find, implement, manage, and maintain the systems, and that the predictive ability of the instruments is intermittent. The purpose of this study is to compare the predictive validity of two types of patient classification instruments commonly used in acute care hospitals in California. Acute care hospitals in California are required by both the Joint Commission on Accreditation of Healthcare Organizations and California Title 22 to have a reliable and valid patient classification system (PCS). The two general types of systems commonly used are the summative task type PCS and the critical incident or criterion type PCS. There is little to assist nurse executives in deciding which type of PCS to choose. There is modest research demonstrating the validity and reliability of different PCSs but no published data comparing the predictive validity of the different types of systems. The unit of analysis is one patient shift called the study shift. The study shift is defined as the first day shift after the patient has been in the hospital for a full 24 hours. Data were collected using medical record review only. Both types, criterion and summative, of PCS data collection instruments were completed for all patients at both collection points. Each patient had a before and after score for each type of instrument. Three hundred forty-nine medical records for inpatients meeting the inclusion criteria were examined. The average patient age was 76 years, the average length of stay was 6.6 days with an average of 6.7 secondary diagnoses recorded. Fifty-five percent of the sample was female and the most common primary diagnosis was CHF, followed by COPD, CVA, and pneumonia. There was a difference in mean summative predictor score and the mean summative actual score of 1.57 points with the predictor score higher (P =.001; CI =.62--2.5). For the criterion instrument, 68.4% of the predictor criterion scores were in category 2 compared to 65.5% of the actual criterion scores. The criterion predictor agreed with the criterion actual score 45% of the time for category 1 patients, 87.3% of the time for category 2 patients, 77.1% of the time for category 3 patients and 72.7% of the time for category 4 patients, with an overall agreement between predictor and actual criterion scores of 79.9% (Kappa P <.001, indicating agreement is not by chance). The most significant finding of this study is that there are virtually no differences in the predictive ability of summative versus criterion patient classification instruments. Using the same patients, both types of instruments predicted the actual score over 78% of the time.
Identifying dyspepsia in the Greek population: translation and validation of a questionnaire

PubMed Central

Anastasiou, Foteini; Antonakis, Nikos; Chaireti, Georgia; Theodorakis, Pavlos N; Lionis, Christos

2006-01-01

Background Studies on clinical issues, including diagnostic strategies, are considered to be the core content of general practice research. The use of standardised instruments is regarded as an important component for the development of Primary Health Care research capacity. Demand for epidemiological cross-cultural comparisons in the international setting and the use of common instruments and definitions valid to each culture is bigger than ever. Dyspepsia is a common complaint in primary practice but little is known with respect to its incidence in Greece. There are some references about the Helicobacter Pylori infection in patients with functional dyspepsia or gastric ulcer in Greece but there is no specific instrument for the identification of dyspepsia. This paper reports on the validation and translation into Greek, of an English questionnaire for the identification of dyspepsia in the general population and discusses several possibilities of its use in the Greek primary care. Methods The selected English postal questionnaire for the identification of people with dyspepsia in the general population consists of 30 items and was developed in 1995. The translation and cultural adaptation of the questionnaire has been performed according to international standards. For the validation of the instrument the internal consistency of the items was established using the alpha coefficient of Chronbach, the reproducibility (test – retest reliability) was measured by kappa correlation coefficient and the criterion validity was calculated against the diagnosis of the patients' records using also kappa correlation coefficient. Results The final Greek version of the postal questionnaire for the identification of dyspepsia in the general population was reliably translated. The internal consistency of the questionnaire was good, Chronbach's alpha was found to be 0.88 (95% CI: 0.81–0.93), suggesting that all items were appropriate to measure. Kappa coefficient for reproducibility (test – retest reliability) was found 0.66 (95% CI: 0.62–0.71), whereas the kappa analysis for criterion validity was 0.63 (95% CI: 0.36–0.89). Conclusion This study indicates that the Greek translation is comparable with the English-language version in terms of validity and reliability, and is suitable for epidemiological research within the Greek primary health care setting. PMID:16515708
Adults' past-day recall of sedentary time: reliability, validity, and responsiveness.

PubMed

Clark, Bronwyn K; Winkler, Elisabeth; Healy, Genevieve N; Gardiner, Paul G; Dunstan, David W; Owen, Neville; Reeves, Marina M

2013-06-01

Past-day recall rather than recall of past week or a usual/typical day may improve the validity of self-reported sedentary time measures. This study examined the test-retest reliability, criterion validity, and responsiveness of the seven-item questionnaire, Past-day Adults' Sedentary Time (PAST). Participants (breast cancer survivors, n = 90, age = 33-75 yr, body mass index = 25-40 kg·m) in a 6-month randomized controlled trial of a lifestyle-based weight loss intervention completed the interviewer-administered PAST questionnaire about time spent sitting/lying on the previous day for work, transport, television viewing, nonwork computer use, reading, hobbies, and other purposes (summed for total sedentary time). The instrument was administered at baseline, 7 d later for test-retest reliability (n = 86), and at follow-up. ActivPAL3-assessed sit/lie time in bouts of ≥5 min during waking hours on the recall day was used as the validity criterion measure at both baseline (n = 72) and follow-up (n = 68). Analyses included intraclass correlation coefficients, Pearson's correlations (r), and Bland-Altman plots and responsiveness index. The PAST had fair to good test-retest reliability (intraclass correlation coefficient = 0.50, 95% confidence interval [CI] = 0.32-0.64). At baseline, the correlation between PAST and activPAL sit/lie time was r = 0.57 (95% CI = 0.39-0.71). The mean difference between PAST at baseline and retest was -25 min (5.2%), 95% limits of agreement = -5.9 to 5.0 h, and the activPAL sit/lie time was -9 min (1.8%), 95% limits of agreement = -4.9 to 4.6 h. The PAST showed small but significant responsiveness (-0.44, 95% CI = -0.92 to -0.04); responsiveness of activPAL sit/lie time was not significant. The PAST questionnaire provided an easy-to-administer measure of sedentary time in this sample. Validity and reliability findings compare favorably with other sedentary time questionnaires. Past-day recall of sedentary time shows promise for use in future health behavior, epidemiological, and population surveillance studies.
Landing flying qualities evaluation criteria for augmented aircraft

NASA Technical Reports Server (NTRS)

Radford, R. C.; Smith, R.; Bailey, R.

1980-01-01

The criteria evaluated were: Calspan Neal-Smith; Onstott (Northrop Time Domain); McDonnell-Douglas Equivalent System Approach; R. H. Smith Criterion. Each criterion was applied to the same set of longitudinal approach and landing flying qualities data. A revised version of the Neal-Smith criterion which is applicable to the landing task was developed and tested against other landing flying qualities data. Results indicated that both the revised Neal-Smith criterion and the Equivalent System Approach are good discriminators of pitch landing flying qualities; Neal-Smith has particular merit as a design guide, while the Equivalent System Approach is well suited for development of appropriate military specification requirements applicable to highly augmented aircraft.
Empirical Validation of Reading Proficiency Guidelines

ERIC Educational Resources Information Center

Clifford, Ray; Cox, Troy L.

2013-01-01

The validation of ability scales describing multidimensional skills is always challenging, but not impossible. This study applies a multistage, criterion-referenced approach that uses a framework of aligned texts and reading tasks to explore the validity of the ACTFL and related reading proficiency guidelines. Rasch measurement and statistical…
Psychometric properties of the Chinese version of the Menopause-Specific Quality-of-Life questionnaire

PubMed Central

Nie, Guangning; Yang, Hongyan; Liu, Jian; Zhao, ChunMei; Wang, Xiaoyun

2017-01-01

Abstract Objective: The Menopause-Specific Quality-of-Life (MENQOL) questionnaire was developed as a specific tool to measure the health-related quality-of-life of postmenopausal women. Thus far, the Chinese version questionnaire has not been subjected to psychometric assessment with a large sample. This study aims to evaluate the validity and reliability of the Chinese version of the MENQOL specific to postmenopausal women in China. Methods: A total of 1,137 menopausal symptomatic and 491 menopausal asymptomatic women from eight cities in China were recruited using a convenience sampling method. Psychometric properties were evaluated by descriptive statistics, validity, and reliability. Reliability was assessed for each subscale of the MENQOL through internal consistency reliability with Cronbach's α and intersubscale correlations. Item-domain correlations, principal components analysis (PCA), and confirmatory factor analysis were performed to determine construct validity. t tests were used to compare the differences between the menopausal symptomatic and asymptomatic women and to evaluate the discriminate validity. Pearson correlation coefficients were calculated between MENQOL scores and the Kupperman index to assess criterion-related validity. Results: The most common symptoms in Chinese menopausal symptomatic women were “experiencing poor memory” (94.4%), “feeling tired or worn out” (93.8%), “aching in muscle and joints” (89.4%), “low backache” (86.9%), “decrease in physical strength” (86.6%), “aches in back of neck or head” (86.2%), “difficulty sleeping” (83.6%), “accomplishing less than I used to” (83.4%), “feeling a lack of energy” (83.3%), “change in your sexual desire” (81%), and “hot flash” (80.7%) among others. The symptoms of “increased facial hair” were rarely seen (9.9%). The vasomotor domain, as well as psychosocial, physical, and sexual domains showed high reliability (Cronbach's α 0.84, 0.87, 0.89, and 0.86, respectively). Item-domain correlation analysis showed that all items correlated more strongly with their own domains than with other domains. In the PCA, after deleting the “increased facial hair” item, items in the vasomotor, sexual, and psychosocial subscales loaded on their respective domains by and large, and items in the physical subscale divided into two factors. The PCA revealed a latent structure of the Chinese version of MENQOL nearly identical to the original MENQOL domains. The confirmatory factor analysis demonstrated that the questionnaire fits well with a four-domain model. The MENQOL can discriminate between menopausal symptomatic women with asymptomatic women as it showed good discriminate validity. Criterion-related validity was confirmed by a significant correlation between MENQOL scores and the Kupperman index. Conclusions: This study showed that Chinese version of MENQOL has good psychometric properties and would be suitable to measure the health-related quality-of-life of Chinese menopausal women except for item 21 (increased facial hair). PMID:27922934
The Arthroscopic Surgical Skill Evaluation Tool (ASSET)

PubMed Central

Koehler, Ryan J.; Amsdell, Simon; Arendt, Elizabeth A; Bisson, Leslie J; Braman, Jonathan P; Butler, Aaron; Cosgarea, Andrew J; Harner, Christopher D; Garrett, William E; Olson, Tyson; Warme, Winston J.; Nicandri, Gregg T.

2014-01-01

Background Surgeries employing arthroscopic techniques are among the most commonly performed in orthopaedic clinical practice however, valid and reliable methods of assessing the arthroscopic skill of orthopaedic surgeons are lacking. Hypothesis The Arthroscopic Surgery Skill Evaluation Tool (ASSET) will demonstrate content validity, concurrent criterion-oriented validity, and reliability, when used to assess the technical ability of surgeons performing diagnostic knee arthroscopy on cadaveric specimens. Study Design Cross-sectional study; Level of evidence, 3 Methods Content validity was determined by a group of seven experts using a Delphi process. Intra-articular performance of a right and left diagnostic knee arthroscopy was recorded for twenty-eight residents and two sports medicine fellowship trained attending surgeons. Subject performance was assessed by two blinded raters using the ASSET. Concurrent criterion-oriented validity, inter-rater reliability, and test-retest reliability were evaluated. Results Content validity: The content development group identified 8 arthroscopic skill domains to evaluate using the ASSET. Concurrent criterion-oriented validity: Significant differences in total ASSET score (p<0.05) between novice, intermediate, and advanced experience groups were identified. Inter-rater reliability: The ASSET scores assigned by each rater were strongly correlated (r=0.91, p <0.01) and the intra-class correlation coefficient between raters for the total ASSET score was 0.90. Test-retest reliability: there was a significant correlation between ASSET scores for both procedures attempted by each individual (r = 0.79, p<0.01). Conclusion The ASSET appears to be a useful, valid, and reliable method for assessing surgeon performance of diagnostic knee arthroscopy in cadaveric specimens. Studies are ongoing to determine its generalizability to other procedures as well as to the live OR and other simulated environments. PMID:23548808
An evaluation of the Psychache Scale on an offender population.

PubMed

Mills, Jeremy F; Green, Kate; Reddon, John R

2005-10-01

This study examined the generalizability of a self-report measure of psychache to an offender population. The factor structure, construct validity, and criterion validity of the Psychache Scale was assessed on 136 male prison inmates. The results showed the Psychache Scale has a single underlying factor structure and to be strongly associated with measures of depression and hopelessness and moderately associated with psychiatric symptoms and the criterion variable of a history of prior suicide attempts. The variables of depression, hopelessness, and psychiatric symptoms all contributed unique variance to psychache. Discussion centers on psychache's theoretical application to the prediction of suicide.
Reliability and Validity of the Musculoskeletal Tumor Society Scoring System for the Upper Extremity in Japanese Patients.

PubMed

Uehara, Kosuke; Ogura, Koichi; Akiyama, Toru; Shinoda, Yusuke; Iwata, Shintaro; Kobayashi, Eisuke; Tanzawa, Yoshikazu; Yonemoto, Tsukasa; Kawano, Hirotaka; Kawai, Akira

2017-09-01

The Musculoskeletal Tumor Society (MSTS) scoring system developed in 1993 is a widely used disease-specific evaluation tool for assessment of physical function in patients with musculoskeletal tumors; however, only a few studies have confirmed its reliability and validity. The aim of this study was to validate the MSTS scoring system for the upper extremity (MSTS-UE) in Japanese patients with musculoskeletal tumors for use by others in research. Does the MSTS-UE have: (1) sufficient reliability and internal consistency; (2) adequate construct validity; and (3) reasonable criterion validity in comparison to the Toronto Extremity Salvage Score (TESS) or SF-36? Reliability was performed using test-retest analysis, and internal consistency was evaluated with Cronbach's alpha coefficient. Construct validity was evaluated using a scree plot to confirm the construct number and the Akaike information criterion network. Criterion validity was evaluated by comparing the MSTS-UE with the TESS and SF-36. The test-retest reliability with intraclass correlation coefficient (0.95; 95% CI, 0.91-0.97) was excellent, and internal consistency with Cronbach's α (0.7; 95% CI, 0.53-0.81) was acceptable. There were no ceiling and floor effects. The Akaike Information Criterion network showed that lifting ability, pain, and dexterity played central roles among the components. The MSTS-UE showed substantial correlation with the TESS scoring scale (r = 0.75; p < 0.001) and fair correlation with the SF-36 physical component summary (r = 0.37; p = 0.007). Although the MSTS-UE showed slight correlation with the SF-36 mental component summary, the emotional acceptance component of the MSTS-UE showed fair correlation (r = 0.29; p = 0.039). We can conclude that the MSTS is not an adequate measure of general health-related quality of life; however, this system was designed mainly to be a simple measure of function in a single extremity. To evaluate the mental state of patients with musculoskeletal tumors in the upper extremity, further study is needed.
The Queensland high risk foot form (QHRFF) – is it a reliable and valid clinical research tool for foot disease?

PubMed Central

2014-01-01

Background Foot disease complications, such as foot ulcers and infection, contribute to considerable morbidity and mortality. These complications are typically precipitated by “high-risk factors”, such as peripheral neuropathy and peripheral arterial disease. High-risk factors are more prevalent in specific “at risk” populations such as diabetes, kidney disease and cardiovascular disease. To the best of the authors’ knowledge a tool capturing multiple high-risk factors and foot disease complications in multiple at risk populations has yet to be tested. This study aimed to develop and test the validity and reliability of a Queensland High Risk Foot Form (QHRFF) tool. Methods The study was conducted in two phases. Phase one developed a QHRFF using an existing diabetes foot disease tool, literature searches, stakeholder groups and expert panel. Phase two tested the QHRFF for validity and reliability. Four clinicians, representing different levels of expertise, were recruited to test validity and reliability. Three cohorts of patients were recruited; one tested criterion measure reliability (n = 32), another tested criterion validity and inter-rater reliability (n = 43), and another tested intra-rater reliability (n = 19). Validity was determined using sensitivity, specificity and positive predictive values (PPV). Reliability was determined using Kappa, weighted Kappa and intra-class correlation (ICC) statistics. Results A QHRFF tool containing 46 items across seven domains was developed. Criterion measure reliability of at least moderate categories of agreement (Kappa > 0.4; ICC > 0.75) was seen in 91% (29 of 32) tested items. Criterion validity of at least moderate categories (PPV > 0.7) was seen in 83% (60 of 72) tested items. Inter- and intra-rater reliability of at least moderate categories (Kappa > 0.4; ICC > 0.75) was seen in 88% (84 of 96) and 87% (20 of 23) tested items respectively. Conclusions The QHRFF had acceptable validity and reliability across the majority of items; particularly items identifying relevant co-morbidities, high-risk factors and foot disease complications. Recommendations have been made to improve or remove identified weaker items for future QHRFF versions. Overall, the QHRFF possesses suitable practicality, validity and reliability to assess and capture relevant foot disease items across multiple at risk populations. PMID:24468080
Physical employment standards for U.K. fire and rescue service personnel.

PubMed

Blacker, S D; Rayson, M P; Wilkinson, D M; Carter, J M; Nevill, A M; Richmond, V L

2016-01-01

Evidence-based physical employment standards are vital for recruiting, training and maintaining the operational effectiveness of personnel in physically demanding occupations. (i) Develop criterion tests for in-service physical assessment, which simulate the role-related physical demands of UK fire and rescue service (UK FRS) personnel. (ii) Develop practical physical selection tests for FRS applicants. (iii) Evaluate the validity of the selection tests to predict criterion test performance. Stage 1: we conducted a physical demands analysis involving seven workshops and an expert panel to document the key physical tasks required of UK FRS personnel and to develop 'criterion' and 'selection' tests. Stage 2: we measured the performance of 137 trainee and 50 trained UK FRS personnel on selection, criterion and 'field' measures of aerobic power, strength and body size. Statistical models were developed to predict criterion test performance. Stage 3: matter experts derived minimum performance standards. We developed single person simulations of the key physical tasks required of UK FRS personnel as criterion and selection tests (rural fire, domestic fire, ladder lift, ladder extension, ladder climb, pump assembly, enclosed space search). Selection tests were marginally stronger predictors of criterion test performance (r = 0.88-0.94, 95% Limits of Agreement [LoA] 7.6-14.0%) than field test scores (r = 0.84-0.94, 95% LoA 8.0-19.8%) and offered greater face and content validity and more practical implementation. This study outlines the development of role-related, gender-free physical employment tests for the UK FRS, which conform to equal opportunities law. © The Author 2015. Published by Oxford University Press on behalf of the Society of Occupational Medicine. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Detecting Symptom Exaggeration in Combat Veterans Using the MMPI-2 Symptom Validity Scales: A Mixed Group Validation

ERIC Educational Resources Information Center

Tolin, David F.; Steenkamp, Maria M.; Marx, Brian P.; Litz, Brett T.

2010-01-01

Although validity scales of the Minnesota Multiphasic Personality Inventory-2 (MMPI-2; J. N. Butcher, W. G. Dahlstrom, J. R. Graham, A. Tellegen, & B. Kaemmer, 1989) have proven useful in the detection of symptom exaggeration in criterion-group validation (CGV) studies, usually comparing instructed feigners with known patient groups, the…
Further Validation of the IDAS: Evidence of Convergent, Discriminant, Criterion, and Incremental Validity

ERIC Educational Resources Information Center

Watson, David; O'Hara, Michael W.; Chmielewski, Michael; McDade-Montez, Elizabeth A.; Koffel, Erin; Naragon, Kristin; Stuart, Scott

2008-01-01

The authors explicated the validity of the Inventory of Depression and Anxiety Symptoms (IDAS; D. Watson et al., 2007) in 2 samples (306 college students and 605 psychiatric patients). The IDAS scales showed strong convergent validity in relation to parallel interview-based scores on the Clinician Rating version of the IDAS; the mean convergent…
Using Item Data for Evaluating Criterion Reference Measures with an Empirical Investigation of Index Consistency.

ERIC Educational Resources Information Center

Meredith, Keith E.; Sabers, Darrell L.

Data required for evaluating a Criterion Referenced Measurement (CRM) is described with a matrix. The information within the matrix consists of the "pass-fail" decisions of two CRMs. By differentially defining these two CRMs, different concepts of reliability and validity can be examined. Indices suggested for analyzing the matrix are listed with…
The Development of a Criterion Instrument for Counselor Selection.

ERIC Educational Resources Information Center

Remer, Rory; Sease, William

A measure of potential performance as a counselor is needed as an adjunct to the information presently employed in selection decisions. This article deals with one possible method of development of such a potential performance criterion and the steps taken, to date, in the attempt to validate it. It includes: the overall effectiveness of the…

Development of a Criterion-Referenced, Performance-Based Assessment of Reading Comprehension in a Whole Literacy Program.

ERIC Educational Resources Information Center

Tibbetts, Katherine A.; And Others

This paper describes the development of a criterion-referenced, performance-based measure of third grade reading comprehension. The primary purpose of the assessment is to contribute unique and valid information for use in the formative evaluation of a whole literacy program. A secondary purpose is to supplement other program efforts to…
Emotion Regulation among School-Age Children: The Development and Validation of a New Criterion Q-Sort Scale.

ERIC Educational Resources Information Center

Shields, Ann; Cicchetti, Dante

1997-01-01

Two studies examined psychometric properties of a new criterion Q-sort for children's emotion regulation and autonomy. Multitrait-multimethod matrix and factor analyses indicated impressive convergence among the emotion regulation Q-scale and established affect regulation measures. The new scale was not discriminable from measures of related…
The Predictive Validity of the Minnesota Reading Assessment for Students in Postsecondary Vocational Education Programs.

ERIC Educational Resources Information Center

Brown, James M.; Chang, Gerald

1982-01-01

The predictive validity of the Minnesota Reading Assessment (MRA) when used to project potential performance of postsecondary vocational-technical education students was examined. Findings confirmed the MRA to be a valid predictor, although the error in prediction varied between the criterion variables. (Author/GK)
Standards Performance Continuum: Development and Validation of a Measure of Effective Pedagogy.

ERIC Educational Resources Information Center

Doherty, R. William; Hilberg, R. Soleste; Epaloose, Georgia; Tharp, Roland G.

2002-01-01

Describes the development and validation of the Standards Performance Continuum (SPC) for assessing teacher performance of the Standards for Effective Pedagogy. Three studies involving Florida, California, and New Mexico public school teachers provided evidence of inter-rater reliability, concurrent validity, and criterion-related validity…
The Reliability and Validity of the Coopersmith Self-Esteem Inventory-Form B.

ERIC Educational Resources Information Center

Chiu, Lian-Hwang

1985-01-01

The purpose of this study was to determine the test-retest reliability and concurrent validity of the short form (Form B) of the Coopersmith Self-Esteem Inventory. Criterion measures for validity included: (1) sociometric measures; (2) teacher's popularity ranking; and, (3) self-esteem rating. (Author/LMO)
Current Concerns in Validity Theory.

ERIC Educational Resources Information Center

Kane, Michael

Validity is concerned with the clarification and justification of the intended interpretations and uses of observed scores. It has not been easy to formulate a general methodology set of principles for validation, but progress has been made, especially as the field has moved from relatively limited criterion-related models to sophisticated…
The Development and Validation of a Life Experience Inventory for the Identification of Creative Electrical Engineers.

ERIC Educational Resources Information Center

Michael, William B.; Colson, Kenneth R.

1979-01-01

The construction and validation of the Life Experience Inventory (LEI) for the identification of creative electrical engineers are described. Using the number of patents held or pending as a criterion measure, the LEI was found to have high concurrent validity. (JKS)
Validation of the Lollipop Test: A Diagnostic Screening Test of School Readiness.

ERIC Educational Resources Information Center

Chew, Alex L.; Morris, John D.

1984-01-01

The validity of the Lollipop Test: A Diagnostic Screening Test of School Readiness was examined using the Metropolitan Readiness Test (MRT), Level I, Form Q, as the criterion. Appreciable concurrent validity was found across test batteries. Implications for school readiness screening are discussed. (Author/BS)
Concurrent Validity of the TONI-3

ERIC Educational Resources Information Center

Banks, Sandra H.; Franzen, Michael D.

2010-01-01

The literature pertaining to intelligence assessment reveals an ongoing discussion about the areas of intelligence captured by nonverbal tests. To date, few studies have investigated the criterion validity of the Test of Nonverbal Intelligence, Third Edition (TONI-3). The present study investigates the concurrent validity of the TONI-3 in a sample…
Bikeability and methodological issues using the active commuting route environment scale (ACRES) in a metropolitan setting

PubMed Central

2011-01-01

Background Route environments can positively influence people's active commuting and thereby contribute to public health. The Active Commuting Route Environment Scale (ACRES) was developed to study active commuters' perceptions of their route environments. However, bicycle commuters represent a small portion of the population in many cities and thus are difficult to study using population-based material. Therefore, the aim of this study is to expand the state of knowledge concerning the criterion-related validity of the ACRES and the representativity using an advertisement-recruited sample. Furthermore, by comparing commuting route environment profiles of inner urban and suburban areas, we provide a novel basis for understanding the relationship between environment and bikeability. Methods Bicycle commuters from Greater Stockholm, Sweden, advertisement- (n = 1379) and street-recruited (n = 93), responded to the ACRES. Traffic planning and environmental experts from the Municipality of Stockholm (n = 24) responded to a modified version of the ACRES. The criterion-related validity assessments were based on whether or not differences between the inner urban and the suburban route environments, as indicated by the experts and by four existing objective measurements, were reflected by differences in perceptions of these environments. Comparisons of ratings between advertisement- and street-recruited participants were used for the assessments of representativity. Finally, ratings of inner urban and suburban route environments were used to evaluate commuting route environment profiles. Results Differences in ratings of the inner urban and suburban route environments by the advertisement-recruited participants were in accord with the existing objective measurements and corresponded reasonably well with those of the experts. Overall, there was a reasonably good correspondence between the advertisement- and street-recruited participants' ratings. Distinct differences in commuting route environment profiles were noted between the inner urban and suburban areas. Suburban route environments were rated as safer and more stimulating for bicycle-commuting than the inner urban ones. In general, the findings applied to both men and women. Conclusions The overall results show: considerable criterion-related validity of the ACRES; ratings of advertisement-recruited participants mirroring those of street-recruited participants; and a higher degree of bikeability in the suburban commuting route environments than in the inner urban ones. PMID:21241470
Instruments to assess self-care among healthy children: A systematic review of measurement properties.

PubMed

Urpí-Fernández, Ana-María; Zabaleta-Del-Olmo, Edurne; Montes-Hidalgo, Javier; Tomás-Sábado, Joaquín; Roldán-Merino, Juan-Francisco; Lluch-Canut, María-Teresa

2017-12-01

To identify, critically appraise and summarize the measurement properties of instruments to assess self-care in healthy children. Assessing self-care is a proper consideration for nursing practice and nursing research. No systematic review summarizes instruments of measurement validated in healthy children. Psychometric review in accordance with the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) panel. MEDLINE, CINAHL, PsycINFO, Web of Science and Open Grey were searched from their inception to December 2016. Validation studies with a healthy child population were included. Search was not restricted by language. Two reviewers independently assessed the methodological quality of included studies using the COSMIN checklist. Eleven studies were included in the review assessing the measurement properties of ten instruments. There was a maximum of two studies per instrument. None of the studies evaluated the properties of test-retest reliability, measurement error, criterion validity and responsiveness. Internal consistency and structural validity were rated as "excellent" or "good" in four studies. Four studies were rated as "excellent" in content validity. Cross-cultural validity was rated as "poor" in the two studies (three instruments) which cultural adaptation was carried out. The evidence available does not allow firm conclusions about the instruments identified in terms of reliability and validity. Future research should focus on generate evidence about a wider range of measurement properties of these instruments using a rigorous methodology, as well as instrument testing on different countries and child population. © 2017 John Wiley & Sons Ltd.
Validation of scores of use of inhalation devices: valoration of errors *

PubMed Central

Zambelli-Simões, Letícia; Martins, Maria Cleusa; Possari, Juliana Carneiro da Cunha; Carvalho, Greice Borges; Coelho, Ana Carla Carvalho; Cipriano, Sonia Lucena; de Carvalho-Pinto, Regina Maria; Cukier, Alberto; Stelmach, Rafael

2015-01-01

Abstract Objective: To validate two scores quantifying the ability of patients to use metered dose inhalers (MDIs) or dry powder inhalers (DPIs); to identify the most common errors made during their use; and to identify the patients in need of an educational program for the use of these devices. Methods: This study was conducted in three phases: validation of the reliability of the inhaler technique scores; validation of the contents of the two scores using a convenience sample; and testing for criterion validation and discriminant validation of these instruments in patients who met the inclusion criteria. Results: The convenience sample comprised 16 patients. Interobserver disagreement was found in 19% and 25% of the DPI and MDI scores, respectively. After expert analysis on the subject, the scores were modified and were applied in 72 patients. The most relevant difficulty encountered during the use of both types of devices was the maintenance of total lung capacity after a deep inhalation. The degree of correlation of the scores by observer was 0.97 (p < 0.0001). There was good interobserver agreement in the classification of patients as able/not able to use a DPI (50%/50% and 52%/58%; p < 0.01) and an MDI (49%/51% and 54%/46%; p < 0.05). Conclusions: The validated scores allow the identification and correction of inhaler technique errors during consultations and, as a result, improvement in the management of inhalation devices. PMID:26398751
Validity and Reliability of a Wearable Inertial Sensor to Measure Velocity and Power in the Back Squat and Bench Press.

PubMed

Orange, Samuel T; Metcalfe, James W; Liefeith, Andreas; Marshall, Phil; Madden, Leigh A; Fewster, Connor R; Vince, Rebecca V

2018-05-08

Orange, ST, Metcalfe, JW, Liefeith, A, Marshall, P, Madden, LA, Fewster, CR, and Vince, RV. Validity and reliability of a wearable inertial sensor to measure velocity and power in the back squat and bench press. J Strength Cond Res XX(X): 000-000, 2018-This study examined the validity and reliability of a wearable inertial sensor to measure velocity and power in the free-weight back squat and bench press. Twenty-nine youth rugby league players (18 ± 1 years) completed 2 test-retest sessions for the back squat followed by 2 test-retest sessions for the bench press. Repetitions were performed at 20, 40, 60, 80, and 90% of 1 repetition maximum (1RM) with mean velocity, peak velocity, mean power (MP), and peak power (PP) simultaneously measured using an inertial sensor (PUSH) and a linear position transducer (GymAware PowerTool). The PUSH demonstrated good validity (Pearson's product-moment correlation coefficient [r]) and reliability (intraclass correlation coefficient [ICC]) only for measurements of MP (r = 0.91; ICC = 0.83) and PP (r = 0.90; ICC = 0.80) at 20% of 1RM in the back squat. However, it may be more appropriate for athletes to jump off the ground with this load to optimize power output. Further research should therefore evaluate the usability of inertial sensors in the jump squat exercise. In the bench press, good validity and reliability were evident only for the measurement of MP at 40% of 1RM (r = 0.89; ICC = 0.83). The PUSH was unable to provide a valid and reliable estimate of any other criterion variable in either exercise. Practitioners must be cognizant of the measurement error when using inertial sensor technology to quantify velocity and power during resistance training, particularly with loads other than 20% of 1RM in the back squat and 40% of 1RM in the bench press.
Analysis of Emotion Regulation in Spanish Adolescents: Validation of the Emotion Regulation Questionnaire

PubMed Central

Gómez-Ortiz, Olga; Romera, Eva M.; Ortega-Ruiz, Rosario; Cabello, Rosario; Fernández-Berrocal, Pablo

2016-01-01

Emotion regulation (ER) is a basic psychological process that has been broadly linked to psychosocial adjustment. Due to its relationship with psychosocial adjustment, a significant number of instruments have been developed to assess emotion regulation in a reliable and valid manner. Among these, the Emotion Regulation Questionnaire (ERQ; Gross and John, 2003) is one of the most widely used, having shown good psychometric properties with adult samples from different cultures. Studies of validation in children and adolescents are, however, scarce and have only been developed for the Australian and Portuguese populations. The aim of this study was to validate the Spanish version of the ERQ for use in adolescents and determine possible differences according to the gender and age of young people. The sample consisted of 2060 adolescents (52.1% boys). Exploratory and Confirmatory factor analysis (EFA and CFA), multi-group analysis and Two-way multivariate analysis of variance (MANOVA) were performed and the percentiles calculated. The results of the AFE and CFA corroborated the existence of two factors related to the emotion regulation strategies of cognitive reappraisal and expressive suppression, showing acceptable internal consistency and test-retest reliability. Both factors also showed good criterion validity with personality traits, self-esteem, and social anxiety. Differences in cognitive reappraisal were found with regard to age, with younger students exhibiting the greatest mastery of this strategy. Gender differences were observed regarding the expressive suppression strategy, with boys being more likely to use this strategy than girls. A gender-age interaction effect was also observed, revealing that the use of the expressive suppression strategy did not vary by age in girls, and was more widely used by boys aged 12–14 years than those aged 15–16 years. However, we found evidence of measurement invariance across sex and age groups. The results suggest that the ERQ is a valid and reliable instrument that can be used to evaluate emotion regulation strategies in adolescents. PMID:26779076
A meta-analysis of an implicit measure of personality functioning: the Mutuality of Autonomy Scale.

PubMed

Graceffo, Robert A; Mihura, Joni L; Meyer, Gregory J

2014-01-01

The Mutuality of Autonomy scale (MA) is a Rorschach variable designed to capture the degree to which individuals mentally represent self and other as mutually autonomous versus pathologically destructive (Urist, 1977). Discussions of the MA's validity found in articles and chapters usually claim good support, which we evaluated by a systematic review and meta-analysis of its construct validity. Overall, in a random effects analysis across 24 samples (N = 1,801) and 91 effect sizes, the MA scale was found to maintain a relationship of r =.20, 95% CI [.16,.25], with relevant validity criteria. We hypothesized that MA summary scores that aggregate more MA response-level data would maintain the strongest relationship with relevant validity criteria. Results supported this hypothesis (aggregated scoring method: r =.24, k = 57, S = 24; nonaggregated scoring methods: r =.15, k = 34, S = 10; p =.039, 2-tailed). Across 7 exploratory moderator analyses, only 1 (criterion method) produced significant results. Criteria derived from the Thematic Apperception Test produced smaller effects than clinician ratings, diagnostic differentiation, and self-attributed characteristics; criteria derived from observer reports produced smaller effects than clinician ratings and self-attributed characteristics. Implications of the study's findings are discussed in terms of both research and clinical work.
Vietnamese Version of Diabetes Self-Management Instrument: Development and Psychometric Testing.

PubMed

Dao-Tran, Tiet-Hanh; Anderson, Debra J; Chang, Anne M; Seib, Charrlotte; Hurst, Cameron

2017-04-01

Self-management plays a vital role in diabetes management for adults with type 2 diabetes (T2DM). While there are many people with T2DM in Vietnam, clinical understanding of diabetes self-management (DSM) in this context is limited due to the lack of a valid measurement instrument. Translation and back-translation processes were used to translate the Diabetes Self-Management Instrument (DSMI) into Vietnamese. Then, translation equivalence, face validity, construct validity, and internal consistency were assessed in a sample of 198 Vietnamese adults with T2DM. The Cronbach's alpha of the V-DSMI was .92, with a number of significant inter-item correlations. The Vietnamese version of the Diabetes Self-Management Instrument (V-DSMI) retained the meaning of the original English version, and the language of the V-DSMI was clearly understandable to adults with T2DM in Vietnam. Confirmatory factor analysis supported the goodness of fit between the data and the previously identified factor structure. These results indicated that the V-DSMI is acceptable for use with Vietnamese adults with T2DM in further practice and research. However, future studies would be beneficial to determine the test-retest reliability and criterion validity of the V-DSMI. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Can generic paediatric mortality scores calculated 4 hours after admission be used as inclusion criteria for clinical trials?

PubMed Central

Leteurtre, Stéphane; Leclerc, Francis; Wirth, Jessica; Noizet, Odile; Magnenant, Eric; Sadik, Ahmed; Fourier, Catherine; Cremer, Robin

2004-01-01

Introduction Two generic paediatric mortality scoring systems have been validated in the paediatric intensive care unit (PICU). Paediatric RISk of Mortality (PRISM) requires an observation period of 24 hours, and PRISM III measures severity at two time points (at 12 hours and 24 hours) after admission, which represents a limitation for clinical trials that require earlier inclusion. The Paediatric Index of Mortality (PIM) is calculated 1 hour after admission but does not take into account the stabilization period following admission. To avoid these limitations, we chose to conduct assessments 4 hours after PICU admission. The aim of the present study was to validate PRISM, PRISM III and PIM at the time points for which they were developed, and to compare their accuracy in predicting mortality at those times with their accuracy at 4 hours. Methods All children admitted from June 1998 to May 2000 in one tertiary PICU were prospectively included. Data were collected to generate scores and predictions using PRISM, PRISM III and PIM. Results There were 802 consecutive admissions with 80 deaths. For the time points for which the scores were developed, observed and predicted mortality rates were significantly different for the three scores (P < 0.01) whereas all exhibited good discrimination (area under the receiver operating characteristic curve ≥0.83). At 4 hours after admission only the PIM had good calibration (P = 0.44), but all three scores exhibited good discrimination (area under the receiver operating characteristic curve ≥0.82). Conclusions Among the three scores calculated at 4 hours after admission, all had good discriminatory capacity but only the PIM score was well calibrated. Further studies are required before the PIM score at 4 hours can be used as an inclusion criterion in clinical trials. PMID:15312217
A New Z Score Curve of the Coronary Arterial Internal Diameter Using the Lambda-Mu-Sigma Method in a Pediatric Population.

PubMed

Kobayashi, Tohru; Fuse, Shigeto; Sakamoto, Naoko; Mikami, Masashi; Ogawa, Shunichi; Hamaoka, Kenji; Arakaki, Yoshio; Nakamura, Tsuneyuki; Nagasawa, Hiroyuki; Kato, Taichi; Jibiki, Toshiaki; Iwashima, Satoru; Yamakawa, Masaru; Ohkubo, Takashi; Shimoyama, Shinya; Aso, Kentaro; Sato, Seiichi; Saji, Tsutomu

2016-08-01

Several coronary artery Z score models have been developed. However, a Z score model derived by the lambda-mu-sigma (LMS) method has not been established. Echocardiographic measurements of the proximal right coronary artery, left main coronary artery, proximal left anterior descending coronary artery, and proximal left circumflex artery were prospectively collected in 3,851 healthy children ≤18 years of age and divided into developmental and validation data sets. In the developmental data set, smooth curves were fitted for each coronary artery using linear, logarithmic, square-root, and LMS methods for both sexes. The relative goodness of fit of these models was compared using the Bayesian information criterion. The best-fitting model was tested for reproducibility using the validation data set. The goodness of fit of the selected model was visually compared with that of the previously reported regression models using a Q-Q plot. Because the internal diameter of each coronary artery was not similar between sexes, sex-specific Z score models were developed. The LMS model with body surface area as the independent variable showed the best goodness of fit; therefore, the internal diameter of each coronary artery was transformed into a sex-specific Z score on the basis of body surface area using the LMS method. In the validation data set, a Q-Q plot of each model indicated that the distribution of Z scores in the LMS models was closer to the normal distribution compared with previously reported regression models. Finally, the final models for each coronary artery in both sexes were developed using the developmental and validation data sets. A Microsoft Excel-based Z score calculator was also created, which is freely available online (http://raise.umin.jp/zsp/calculator/). Novel LMS models with which to estimate the sex-specific Z score of each internal coronary artery diameter were generated and validated using a large pediatric population. Copyright © 2016 American Society of Echocardiography. Published by Elsevier Inc. All rights reserved.
An examination of the Psychopathic Personality Inventory's nomological network: a meta-analytic review.

PubMed

Miller, Joshua D; Lynam, Donald R

2012-07-01

Since its publication, the Psychopathic Personality Inventory and its revision (Lilienfeld & Andrews, 1996; Lilienfeld & Widows, 2005) have become increasingly popular such that it is now among the most frequently used self-report inventories for the assessment of psychopathy. The current meta-analysis examined the relations between the two PPI factors (factor 1: Fearless Dominance; factor 2: Self-Centered Impulsivity), as well as their relations with other validated measures of psychopathy, internalizing and externalizing forms of psychopathology, general personality traits, and antisocial personality disorder symptoms. Across 61 samples reported in 49 publications, we found support for the convergent and criterion validity of both PPI factor 2 and the PPI total score. Much weaker validation was found for PPI factor 1, which manifested limited convergent validity and a pattern of correlations with central criterion variables that was inconsistent with many conceptualizations of psychopathy. PsycINFO Database Record (c) 2012 APA, all rights reserved.
Measuring violence risk and outcomes among Mexican American adolescent females.

PubMed

Cervantes, Richard C; Duenas, Norma; Valdez, Avelardo; Kaplan, Charles

2006-01-01

Central to the development of culturally competent violence prevention programs for Hispanic youth is the development of psychometrically sound violence risk and outcome measures for this population. A study was conducted to determine the psychometric properties of two commonly used violence measures, in this case for Mexican American adolescent females. The Conflict Tactics Scales (CTS2) and the Past Feelings and Acts of Violence Scale (PFAV) were analyzed to examine their interitem reliability, criterion validity, and discriminant validity. A sample of 150 low-risk and 150 high-risk adolescent females was studied. Discriminant validity was indicated by the perpetrator negotiation scale and by the victim psychological aggression and sexual coercion scales of the CTS2 and the PFAV. Analysis indicates that the CTS2 scales and the PFAV demonstrate adequate reliability, whereas strong criterion validity was evidenced by eight of the CTS2 scales and the PFAV.

Validating the Center for Epidemiological Studies Depression Scale for Children in Rwanda

PubMed Central

Betancourt, Theresa; Scorza, Pamela; Meyers-Ohki, Sarah; Mushashi, Christina; Kayiteshonga, Yvonne; Binagwaho, Agnes; Stulac, Sara; Beardslee, William R.

2017-01-01

Objective We assessed the validity of the Center for Epidemiological Studies Depression Scale for Children (CES-DC) as a screen for depression in Rwandan children and adolescents. Although the CES-DC is widely used for depression screening in high-income countries, its validity in low-income and culturally diverse settings, including sub-Saharan Africa, is unknown. Method The CES-DC was selected based on alignment with local expressions of depression-like problems in Rwandan children and adolescents. To examine criterion validity, we compared CES-DC scores to depression diagnoses on a structured diagnostic interview, the Mini International Neuropsychiatric Interview for Children (MINI KID), in a sample of 367 Rwandan children and adolescents aged 10 through 17 years. Caregiver and child or adolescent self-reports endorsing the presence of local depression-like problems agahinda kenshi (persistent sorrow) and kwiheba (severe hopelessness) were also examined for agreement with MINI KID diagnosis. Results The CES-DC exhibited good internal reliability (α = .86) and test-retest reliability (r = .85). The area under the receiver operating characteristic curve for the CES-DC was 0.825 when compared to MINI KID diagnoses, indicating a strong ability to distinguish between depressed and nondepressed children and adolescents in Rwanda. A cut point of ≥ 30 corresponded with a sensitivity of 81.9% and a specificity of 71.9% in this referred sample. MINI KID diagnosis was well aligned with local expressions of depression-like problems. Conclusion The CES-DC demonstrates good psychometric properties for clinical screening and evaluation in Rwanda, and should be considered for use in this and other low-resource settings. Population samples are needed to determine a generalizable cut point in nonreferred samples. PMID:23200285
The Work Disability Functional Assessment Battery (WD-FAB): Feasibility and Psychometric Properties

PubMed Central

Meterko, Mark; Marfeo, Elizabeth E.; McDonough, Christine M.; Jette, Alan M.; Ni, Pengsheng; Bogusz, Kara; Rasch, Elizabeth K; Brandt, Diane E.; Chan, Leighton

2015-01-01

Objectives To assess the feasibility and psychometric properties of eight scales covering two domains of the newly developed Work Disability Functional Assessment Battery (WD-FAB): physical function (PF) and behavioral health (BH) function. Design Cross-sectional. Setting Community. Participants Adults unable to work due to a physical (n=497) or mental (n=476) disability. Interventions None. Main Outcome Measures Each disability group responded to a survey consisting of the relevant WD-FAB scales and existing measures of established validity. The WD-FAB scales were evaluated with regard to data quality (score distribution; percent “I don’t know” responses), efficiency of administration (number of items required to achieve reliability criterion; time required to complete the scale) by computerized adaptive testing (CAT), and measurement accuracy as tested by person fit. Construct validity was assessed by examining both convergent and discriminant correlations between the WD-FAB scales and scores on same-domain and cross-domain established measures. Results Data quality was good and CAT efficiency was high across both WD-FAB domains. Measurement accuracy was very good for the PF scales; BH scales demonstrated more variability. Construct validity correlations, both convergent and divergent, between all WD-FAB scales and established measures were in the expected direction and range of magnitude. Conclusions The data quality, CAT efficacy, person fit and construct validity of the WD-FAB scales were well supported and suggest that the WD-FAB could be used to assess physical and behavioral health function related to work disability. Variation in scale performance suggests the need for future work on item replenishment and refinement, particularly regarding the Self-Efficacy scale. PMID:25528263
Quality of life in oncological patients with oropharyngeal dysphagia: validity and reliability of the Dutch version of the MD Anderson Dysphagia Inventory and the Deglutition Handicap Index.

PubMed

Speyer, Renée; Heijnen, Bas J; Baijens, Laura W; Vrijenhoef, Femke H; Otters, Elsemieke F; Roodenburg, Nel; Bogaardt, Hans C

2011-12-01

Quality of life is an important outcome measurement in objectifying the current health status or therapy effects in patients with oropharyngeal dysphagia. In this study, the validity and reliability of the Dutch version of the Deglutition Handicap Index (DHI) and the MD Anderson Dysphagia Inventory (MDADI) have been determined for oncological patients with oropharyngeal dysphagia. At Maastricht University Medical Center, 76 consecutive patients were selected and asked to fill in three questionnaires on quality of life related to oropharyngeal dysphagia (the SWAL-QOL, the MDADI, and the DHI) as well as a simple one-item visual analog Dysphagia Severity Scale. None of the quality-of-life questionnaires showed any floor or ceiling effect. The test-retest reliability of the MDADI and the Dysphagia Severity Scale proved to be good. The test-retest reliability of the DHI could not be determined because of insufficient data, but the intraclass correlation coefficients were rather high. The internal consistency proved to be good. However, confirmatory factor analysis could not distinguish the underlying constructs as defined by the subscales per questionnaire. When assessing criterion validity, both the MDADI and the DHI showed satisfactory associations with the SWAL-QOL (reference or gold standard) after having removed the less relevant subscales of the SWAL-QOL. In conclusion, when assessing the validity and reliability of the Dutch version of the DHI or the MDADI, not all psychometric properties have been adequately met. In general, because of difficulties in the interpretation of study results when using questionnaires lacking sufficient psychometric quality, it is recommended that researchers strive to use questionnaires with the most optimal psychometric properties.
Criterion-related validity of the Test of Children's Speech sentence intelligibility measure for children with cerebral palsy and dysarthria.

PubMed

Hodge, Megan; Gotzke, Carrie Lynne

2014-08-01

To evaluate the criterion-related validity of the TOCS+ sentence measure (TOCS+, Hodge, Daniels & Gotzke, 2009 ) for children with dysarthria and CP by comparing intelligibility and rate scores obtained concurrently from the TOCS+ and from a conversational sample. Twenty children (3 to 10 years old) diagnosed with spastic cerebral palsy (CP) participated. Nineteen children also had a confirmed diagnosis of dysarthria. Children's intelligibility and speaking rate scores obtained from the TOCS+, which uses imitation of sets of randomly selected items ranging from 2-7 words (80 words in total) and from a contiguous 100-word conversational speech were compared. Mean intelligibility scores were 46.5% (SD = 26.4%) and 50.9% (SD = 19.1%) and mean rates in words per minute (WPM) were 90.2 (SD = 22.3) and 94.1 (SD = 25.6), respectively, for the TOCS+ and conversational samples. No significant differences were found between the two conditions for intelligibility or rate scores. Strong correlations were found between the TOCS+ and conversational samples for intelligibility (r = 0.86; p < 0.001) and WPM (r = 0.77; p < 0.001), supporting the criterion validity of the TOCS+ sentence task as a time efficient procedure for measuring intelligibility and rate in children with CP, with and without confirmed dysarthria. The results support the criterion validity of the TOCS+ sentence task as a time efficient procedure for measuring intelligibility and rate in children with CP, with and without confirmed dysarthria. Children varied in their relative performance on the two speaking tasks, reflecting the complexity of factors that influence intelligibility and rate scores.
Military Families' Perceptions of Neighborhood Characteristics Affecting Reintegration: Development of an Aggregate Measure.

PubMed

Beehler, Sarah; Ahern, Jennifer; Balmer, Brandi; Kuhlman, Jennifer

2017-01-01

This pilot study evaluated the validity and reliability of an Experience of Neighborhood (EON) measure developed to assess neighborhood characteristics that shape reintegration opportunities for returning service members and their families. A total of 91 post-9/11 veterans and spouses completed a survey administered at the Minnesota State Fair. Participants self-reported on their reintegration status (veterans), social functioning (spouses), social support, and mental health. EON factor structure, internal consistency reliability, and validity (discriminant, content, criterion) were analyzed. The EON measure showed adequate reliability, discriminant validity, and content validity. More work is needed to assess criterion validity because EON scores were not correlated with scores on a Census-based index used to measure quality of military neighborhoods. The EON may be useful in assessing broad local factors influencing health among returning veterans and spouses. More research is needed to understand geographic variation in neighborhood conditions and how those affect reintegration and mental health for military families.
Validation of a Portuguese version of the Information Needs in Cardiac Rehabilitation (INCR) scale in Brazil.

PubMed

Ghisi, Gabriela Lima de Melo; Dos Santos, Rafaella Zulianello; Bonin, Christiani Batista Decker; Roussenq, Suellen; Grace, Sherry L; Oh, Paul; Benetti, Magnus

2014-01-01

To translate, culturally adapt and psychometrically validate the Information Needs in Cardiac Rehabilitation (INCR) tool to Portuguese. The identification of information needs is considered the first step to improve knowledge that ultimately could improve health outcomes. The Portuguese version generated was tested in 300 cardiac rehabilitation patients (CR) (34% women; mean age = 61.3 ± 2.1 years old). Test-retest reliability was assessed using intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and the criterion validity was assessed with regard to patients' education and duration in CR. All 9 subscales were considered internally consistent (á > 0.7). Significant differences between mean total needs and educational level (p < 0.05) and duration in CR (p = 0.03) supported criterion validity. The overall mean (4.6 ± 0.4), as well as the means of the 9 subscales were high (emergency/safety was the greatest need). The Portuguese INCR was demonstrated to have sufficient reliability, consistency and validity. Copyright © 2014 Elsevier Inc. All rights reserved.
Military Families’ Perceptions of Neighborhood Characteristics Affecting Reintegration: Development of an Aggregate Measure

PubMed Central

Beehler, Sarah; Ahern, Jennifer; Balmer, Brandi; Kuhlman, Jennifer

2017-01-01

This pilot study evaluated the validity and reliability of an Experience of Neighborhood (EON) measure developed to assess neighborhood characteristics that shape reintegration opportunities for returning service members and their families. A total of 91 post-9/11 veterans and spouses completed a survey administered at the Minnesota State Fair. Participants self-reported on their reintegration status (veterans), social functioning (spouses), social support, and mental health. EON factor structure, internal consistency reliability, and validity (discriminant, content, criterion) were analyzed. The EON measure showed adequate reliability, discriminant validity, and content validity. More work is needed to assess criterion validity because EON scores were not correlated with scores on a Census-based index used to measure quality of military neighborhoods. The EON may be useful in assessing broad local factors influencing health among returning veterans and spouses. More research is needed to understand geographic variation in neighborhood conditions and how those affect reintegration and mental health for military families. PMID:28936370
Monitoring sedentary patterns in office employees: validity of an m-health tool (Walk@Work-App) for occupational health.

PubMed

Bort-Roig, Judit; Puig-Ribera, Anna; Contreras, Ruth S; Chirveches-Pérez, Emilia; Martori, Joan C; Gilson, Nicholas D; McKenna, Jim

2017-09-15

This study validated the Walk@Work-Application (W@W-App) for measuring occupational sitting and stepping. The W@W-App was installed on the smartphones of office-based employees (n=17; 10 women; 26±3 years). A prescribed 1-hour laboratory protocol plus two continuous hours of occupational free-living activities were performed. Intra-class correlation coefficients (ICC) compared mean differences of sitting time and step count measurements between the W@W-App and criterion measures (ActivPAL3TM and SW200Yamax Digi-Walker). During the protocol, agreement between self-paced walking (ICC=0.85) and active working tasks step counts (ICC=0.80) was good. The smallest median difference was for sitting time (1.5seconds). During free-living conditions, sitting time (ICC=0.99) and stepping (ICC=0.92) showed excellent agreement, with a difference of 0.5minutes and 18 steps respectively. The W@W-App provided valid measures for monitoring occupational sedentary patterns in real life conditions; a key issue for increasing awareness and changing occupational sedentariness. Copyright © 2017 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
Selecting the optimum number of partial least squares components for the calibration of attenuated total reflectance-mid-infrared spectra of undesigned kerosene samples.

PubMed

Gómez-Carracedo, M P; Andrade, J M; Rutledge, D N; Faber, N M

2007-03-07

Selecting the correct dimensionality is critical for obtaining partial least squares (PLS) regression models with good predictive ability. Although calibration and validation sets are best established using experimental designs, industrial laboratories cannot afford such an approach. Typically, samples are collected in an (formally) undesigned way, spread over time and their measurements are included in routine measurement processes. This makes it hard to evaluate PLS model dimensionality. In this paper, classical criteria (leave-one-out cross-validation and adjusted Wold's criterion) are compared to recently proposed alternatives (smoothed PLS-PoLiSh and a randomization test) to seek out the optimum dimensionality of PLS models. Kerosene (jet fuel) samples were measured by attenuated total reflectance-mid-IR spectrometry and their spectra where used to predict eight important properties determined using reference methods that are time-consuming and prone to analytical errors. The alternative methods were shown to give reliable dimensionality predictions when compared to external validation. By contrast, the simpler methods seemed to be largely affected by the largest changes in the modeling capabilities of the first components.
The Multiple Sclerosis Self-Management Scale

PubMed Central

Ghahari, Setareh; Khoshbin, Lana S.

2014-01-01

Background: The Multiple Sclerosis Self-Management Scale (MSSM) is currently the only measure that was developed specifically to address self-management among individuals with multiple sclerosis (MS). While good internal consistency (α = 0.85) and construct validity have been demonstrated, other psychometric properties have not been established. This study was undertaken to evaluate the criterion validity, test-retest reliability, and face validity of the MSSM. Methods: Thirty-one individuals with MS who met the inclusion criteria were recruited to complete a series of questionnaires at two time points. At Time 1, participants completed the MSSM and two generic self-management tools—the Partners in Health (PIH-12) and the Health Education Impact Questionnaire (heiQ)—as well as a short questionnaire to capture participants' opinions about the MSSM. At Time 2, approximately 2 weeks after Time 1, participants completed the MSSM again. Results: The available MSSM factors showed moderate to high correlations with both PIH-12 and heiQ and were deemed to have satisfactory test-retest reliability. Face validity pointed to areas of the MSSM that need to be revised in future work. As indicated by the participants, some dimensions of MS self-management are missing in the MSSM and some items such as medication are redundant. Conclusions: This study provides evidence for the reliability and validity of the MSSM; however, further changes are required for both researchers and clinicians to use the tool meaningfully in practice. PMID:25061429
Development and validation of an Overreporting Scale for the Personality Inventory for DSM-5 (PID-5).

PubMed

Sellbom, Martin; Dhillon, Sonya; Bagby, R Michael

2018-05-01

Our aim in the current study was to develop a validity scale for the Personality Inventory for DSM-5 (PID-5) to detect noncredible overreported responding. To this end, we used a rare symptoms approach and identified extreme response options on PID-5 items that were infrequently endorsed by students in 3 different university samples (N = 1,370) and in a psychiatric patient sample (N = 194). The resulting 10-item scale (the PID-5-ORS) produced adequate-to-good estimates of internal reliability and was significantly correlated with the Minnesota Multiphasic Personality Inventory-2 Restructued Form (MMPI-2-RF) overreporting validity scales, providing evidence of concurrent validity. The criterion validity of the PID-5-ORS was demonstrated in an analog simulation design study. More specifically, university students instructed to overreport (n = 80) scored substantially higher on the PID-5-ORS relative to both a group of genuine psychiatric patients and students instructed to complete the PID-5 under standard (honest) instructions (n = 161); the effect size magnitudes associated with these differences were large. Classification accuracy analyses further revealed that high scores on the PID-5-ORS were associated with high specificity (and thus, low rates of false positive classifications) in differentiating overreporters from genuine patients, with sensitivity being somewhat weaker. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Psychometric support of the school climate measure in a large, diverse sample of adolescents: a replication and extension.

PubMed

Zullig, Keith J; Collins, Rani; Ghani, Nadia; Patton, Jon M; Scott Huebner, E; Ajamie, Jean

2014-02-01

The School Climate Measure (SCM) was developed and validated in 2010 in response to a dearth of psychometrically sound school climate instruments. This study sought to further validate the SCM on a large, diverse sample of Arizona public school adolescents (N = 20,953). Four SCM domains (positive student-teacher relationships, academic support, order and discipline, and physical environment) were available for the analysis. Confirmatory factor analysis and structural equation modeling were established to construct validity, and criterion-related validity was assessed via selected Youth Risk Behavior Survey (YRBS) school safety items and self-reported grade (GPA) point average. Analyses confirmed the 4 SCM school climate domains explained approximately 63% of the variance (factor loading range .45-.92). Structural equation models fit the data well χ(2) = 14,325 (df = 293, p < .001), comparative fit index (CFI) = .951, Tuker-Lewis index (TLI) = .952, root mean square error of approximation (RMSEA) = .05). The goodness-of-fit index was .940. Coefficient alphas ranged from .82 to .93. Analyses of variance with post hoc comparisons suggested the SCM domains related in hypothesized directions with the school safety items and GPA. Additional evidence supports the validity and reliability of the SCM. Measures, such as the SCM, can facilitate data-driven decisions and may be incorporated into evidenced-based processes designed to improve student outcomes. © 2014, American School Health Association.
Reliability and Validity Study of the Chamorro Assisted Gait Scale for People with Sprained Ankles, Walking with Forearm Crutches

PubMed Central

Ridao-Fernández, Carmen; Ojeda, Joaquín; Benítez-Lugo, Marisa; Sevillano, José Luis

2016-01-01

Objective The aim of this study was to design and validate a functional assessment scale for assisted gait with forearm crutches (Chamorro Assisted Gait Scale—CHAGS) and to assess its reliability in people with sprained ankles. Design Thirty subjects who suffered from sprained ankle (anterior talofibular ligament first and second degree) were included in the study. A modified Delphi technique was used to obtain the content validity. The selected items were: pelvic and scapular girdle dissociation(1), deviation of Center of Gravity(2), crutch inclination(3), steps rhythm(4), symmetry of step length(5), cross support(6), simultaneous support of foot and crutch(7), forearm off(8), facing forward(9) and fluency(10). Two raters twice visualized the gait of the sample subjects which were recorded. The criterion-related validity was determined by correlation between CHAGS and Coding of eight criteria of qualitative gait analysis (Viel Coding). Internal consistency and inter and intra-rater reliability were also tested. Results CHAGS obtained a high and negative correlation with Viel Coding. We obtained a good internal consistency and the intra-class correlation coefficients oscillated between 0.97 and 0.99, while the minimal detectable changes were acceptable. Conclusion CHAGS scale is a valid and reliable tool for assessing assisted gait with crutches in people with sprained ankles to perform partial relief of lower limbs. PMID:27168236
Construct and Criterion Validity of the PedsQL™ 4.0 Instrument (Pediatric Quality of Life Inventory) in Colombia

PubMed Central

Amaya-Arias, Ana Carolina; Alzate, Juan Pablo; Eslava-Schmalbach, Javier H

2017-01-01

Background: This study aimed at determining the validity of the Pediatric Quality of Life Inventory 4.0 (PedsQL™ 4.0) for the measurement of health-related quality of life (HRQOL) in Colombian children. Methods: Validation study of measurement instruments. The PedsQL™ 4.0 was applied by convenience sampling to 375 pairs of children and adolescents between the ages of 5 and 17 and to their parents-caregivers, as well as to 125 parents-caregivers of children between the ages of 2 and 4 in five cities of Colombia (Bogota, Medellin, Cali, Barranquilla and Bucaramanga). Construct validity was assessed through the use of exploratory and confirmatory factor analysis, and criterion validity was assessed by correlations between the PedsQL™ 4.0 and the KIDSCREEN-27. Results: The instrument was applied to 375 children (ages 5–18) and 125 parents of children between the ages of 2 and 4. Factor analysis revealed four factors considered suitable for the sample in both the child and parent reports, whereas Bartlett's test of sphericity showed inter-correlation between variables. Scale and subscales showed proper indicators of internal consistency. It is recommended not to include or review some of the items in the Colombian version of the scale. Conclusions: The Spanish version for Colombia of the PedsQL™ 4.0 displays suitable indicators of criterion and construct validity, therefore becoming a valuable tool for measuring HRQOL in children in our country. Some modifications are recommended for the Colombian version of the scale. PMID:28900536
Development and validation of criterion-referenced clinically relevant fitness standards for maintaining physical independence in later years.

PubMed

Rikli, Roberta E; Jones, C Jessie

2013-04-01

To develop and validate criterion-referenced fitness standards for older adults that predict the level of capacity needed for maintaining physical independence into later life. The proposed standards were developed for use with a previously validated test battery for older adults-the Senior Fitness Test (Rikli, R. E., & Jones, C. J. (2001). Development and validation of a functional fitness test for community--residing older adults. Journal of Aging and Physical Activity, 6, 127-159; Rikli, R. E., & Jones, C. J. (1999a). Senior fitness test manual. Champaign, IL: Human Kinetics.). A criterion measure to assess physical independence was identified. Next, scores from a subset of 2,140 "moderate-functioning" older adults from a larger cross-sectional database, together with findings from longitudinal research on physical capacity and aging, were used as the basis for proposing fitness standards (performance cut points) associated with having the ability to function independently. Validity and reliability analyses were conducted to test the standards for their accuracy and consistency as predictors of physical independence. Performance standards are presented for men and women ages 60-94 indicating the level of fitness associated with remaining physically independent until late in life. Reliability and validity indicators for the standards ranged between .79 and .97. The proposed standards provide easy-to-use, previously unavailable methods for evaluating physical capacity in older adults relative to that associated with physical independence. Most importantly, the standards can be used in planning interventions that target specific areas of weakness, thus reducing risk for premature loss of mobility and independence.
Community validation of the IDEA study cognitive screen in rural Tanzania.

PubMed

Gray, William K; Paddick, Stella Maria; Collingwood, Cecilia; Kisoli, Aloyce; Mbowe, Godfrey; Mkenda, Sarah; Lissu, Carolyn; Rogathi, Jane; Kissima, John; Walker, Richard W; Mushi, Declare; Chaote, Paul; Ogunniyi, Adesola; Dotchin, Catherine L

2016-11-01

The dementia diagnosis gap in sub-Saharan Africa (SSA) is large, partly because of difficulties in screening for cognitive impairment in the community. As part of the Identification and Intervention for Dementia in Elderly Africans (IDEA) study, we aimed to validate the IDEA cognitive screen in a community-based sample in rural Tanzania METHODS: Study participants were recruited from people who attended screening days held in villages within the rural Hai district of Tanzania. Criterion validity was assessed against the gold standard clinical dementia diagnosis using DSM-IV criteria. Construct validity was assessed against, age, education, sex and grip strength and instrumental activities of daily living (IADLs). Internal consistency and floor and ceiling effects were also examined. During community screening, the IDEA cognitive screen had high criterion validity, with an area under the receiver operating characteristic curve of 0.855 (95% CI 0.794 to 0.915). Higher scores on the screen were significantly correlated with lower age, male sex, having attended school, better grip strength and improved performance in activities of daily living. Factor analysis revealed a single factor with an eigenvalue greater than one, although internal consistency was only moderate (Cronbach's alpha = 0.534). The IDEA cognitive screen had high criterion and construct validity and is suitable for use as a cognitive screening instrument in a community setting in SSA. Only moderate internal consistency may partly reflect the multi-domain nature of dementia as diagnosed clinically. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Calculation of strained BaTiO3 with different exchange correlation functionals examined with criterion by Ginzburg-Landau theory, uncovering expressions by crystallographic parameters

NASA Astrophysics Data System (ADS)

Watanabe, Yukio

2018-05-01

In the calculations of tetragonal BaTiO3, some exchange-correlation (XC) energy functionals such as local density approximation (LDA) have shown good agreement with experiments at room temperature (RT), e.g., spontaneous polarization (PS), and superiority compared with other XC functionals. This is due to the error compensation of the RT effect and, hence, will be ineffective in the heavily strained case such as domain boundaries. Here, ferroelectrics under large strain at RT are approximated as those at 0 K because the strain effect surpasses the RT effects. To find effective XC energy functionals for strained BaTiO3, we propose a new comparison, i.e., a criterion. This criterion is the properties at 0 K given by the Ginzburg-Landau (GL) theory because GL theory is a thermodynamic description of experiments working under the same symmetry-constraints as ab initio calculations. With this criterion, we examine LDA, generalized gradient approximations (GGA), meta-GGA, meta-GGA + local correlation potential (U), and hybrid functionals, which reveals the high accuracy of some XC functionals superior to XC functionals that have been regarded as accurate. This result is examined directly by the calculations of homogenously strained tetragonal BaTiO3, confirming the validity of the new criterion. In addition, the data points of theoretical PS vs. certain crystallographic parameters calculated with different XC functionals are found to lie on a single curve, despite their wide variations. Regarding these theoretical data points as corresponding to the experimental results, analytical expressions of the local PS using crystallographic parameters are uncovered. These expressions show the primary origin of BaTiO3 ferroelectricity as oxygen displacements. Elastic compliance and electrostrictive coefficients are estimated. For the comparison of strained results, we show that the effective critical temperature TC under strain <-0.01 is >1000 K from an approximate method combining ab initio results with GL theory. In addition, in a definite manner, the present results show much more enhanced ferroelectricity at large strain than the previous reports.
Validity of Various Methods for Determining Velocity, Force, and Power in the Back Squat.

PubMed

Banyard, Harry G; Nosaka, Ken; Sato, Kimitake; Haff, G Gregory

2017-10-01

To examine the validity of 2 kinematic systems for assessing mean velocity (MV), peak velocity (PV), mean force (MF), peak force (PF), mean power (MP), and peak power (PP) during the full-depth free-weight back squat performed with maximal concentric effort. Ten strength-trained men (26.1 ± 3.0 y, 1.81 ± 0.07 m, 82.0 ± 10.6 kg) performed three 1-repetition-maximum (1RM) trials on 3 separate days, encompassing lifts performed at 6 relative intensities including 20%, 40%, 60%, 80%, 90%, and 100% of 1RM. Each repetition was simultaneously recorded by a PUSH band and commercial linear position transducer (LPT) (GymAware [GYM]) and compared with measurements collected by a laboratory-based testing device consisting of 4 LPTs and a force plate. Trials 2 and 3 were used for validity analyses. Combining all 120 repetitions indicated that the GYM was highly valid for assessing all criterion variables while the PUSH was only highly valid for estimations of PF (r = .94, CV = 5.4%, ES = 0.28, SEE = 135.5 N). At each relative intensity, the GYM was highly valid for assessing all criterion variables except for PP at 20% (ES = 0.81) and 40% (ES = 0.67) of 1RM. Moreover, the PUSH was only able to accurately estimate PF across all relative intensities (r = .92-.98, CV = 4.0-8.3%, ES = 0.04-0.26, SEE = 79.8-213.1 N). PUSH accuracy for determining MV, PV, MF, MP, and PP across all 6 relative intensities was questionable for the back squat, yet the GYM was highly valid at assessing all criterion variables, with some caution given to estimations of MP and PP performed at lighter loads.
Development of a new instrument for determining the level of chewing function in children.

PubMed

Serel Arslan, S; Demir, N; Barak Dolgun, A; Karaduman, A A

2016-07-01

This study aimed to develop a chewing performance scale that classifies chewing from normal to severely impaired and to investigate its validity and reliability. The study included the developmental phase and reported the content, structural, criterion validity, interobserver and intra-observer reliability of the chewing performance scale, which was called the Karaduman Chewing Performance Scale (KCPS). A dysphagia literature review, other questionnaires and clinical experiences were used in the developmental phase. Seven experts assessed the steps for content validity over two Delphi rounds. To test structural, criterion validity, interobserver and intra-observer reliability, two swallowing therapists evaluated chewing videos of 144 children (Group I: 61 healthy children without chewing disorders, mean age of 42·38 ± 9·36 months; Group II: 83 children with cerebral palsy who have chewing disorders, mean age of 39·09 ± 22·95 months) using KCPS. The Behavioral Pediatrics Feeding Assessment Scale (BPFAS) was used for criterion validity. The KCPS steps arranged between 0-4 were found to be necessary. The content validity index was 0·885. The KCPS levels were found to be different between groups I and II (χ(2) = 123·286, P < 0·001). A moderately strong positive correlation was found between the KCPS and the subscales of the BPFAS (r = 0·444-0·773, P < 0·001). An excellent positive correlation was detected between two swallowing therapists and between two examinations of one swallowing therapist (r = 0·962, P < 0·001; r = 0·990, P < 0·001, respectively). The KCPS is a valid, reliable, quick and clinically easy-to-use functional instrument for determining the level of chewing function in children. © 2016 John Wiley & Sons Ltd.
Reliability and validity of the Bowel Function Index for evaluating opioid-induced constipation: translation, cultural adaptation and validation of the Portuguese version (BFI-P).

PubMed

Dueñas, María; Mendonça, Liliane; Sampaio, Rute; Gouvinhas, Cláudia; Oliveira, Daniela; Castro-Lopes, José Manuel; Azevedo, Luís Filipe

2017-03-01

The Bowel Function Index (BFI) is a simple and sound bowel function and opioid-induced constipation (OIC) screening tool. We aimed to develop the translation and cultural adaptation of this measure (BFI-P) and to assess its reliability and validity for the Portuguese language and a chronic pain population. The BFI-P was created after a process including translation, back translation and cultural adaptation. Participants (n = 226) were recruited in a chronic pain clinic and were assessed at baseline and after one week. Internal consistency, test-retest reliability, responsiveness, construct (convergent and known groups) and factorial validity were assessed. Test-retest reliability had an intra-class correlation of 0.605 for BFI mean score. Internal consistency of BFI had Cronbach's alpha of 0.865. The construct validity of BFI-P was shown to be excellent and the exploratory factor analysis confirmed its unidimensional structure. The responsiveness of BFI-P was excellent, with a suggested 17-19 point and 8-12 point change in score constituting a clinically relevant change in constipation for patients with and without previous constipation, respectively. This study had some limitations, namely, the criterion validity of BFI-P was not directly assessed; and the absence of a direct criterion for OIC precluded the assessment of the criterion based responsiveness of BFI-P. Nevertheless, BFI may importantly contribute to better OIC screening and its Portuguese version (BFI-P) has been shown to have excellent reliability, internal consistency, validity and responsiveness. Further suggestions regarding statistically and clinically important change cut-offs for this instrument are presented.

Is Echinococcus intermedius a valid species?

USDA-ARS?s Scientific Manuscript database

Medical and veterinary sciences require scientific names to discriminate pathogenic organisms in our living environment. Various species concepts have been proposed for metazoan animals. There are, however, constant controversies over their validity because of lack of a common criterion to define ...
Minimizing false positive error with multiple performance validity tests: response to Bilder, Sugar, and Hellemann (2014 this issue).

PubMed

Larrabee, Glenn J

2014-01-01

Bilder, Sugar, and Hellemann (2014 this issue) contend that empirical support is lacking for use of multiple performance validity tests (PVTs) in evaluation of the individual case, differing from the conclusions of Davis and Millis (2014), and Larrabee (2014), who found no substantial increase in false positive rates using a criterion of failure of ≥ 2 PVTs and/or Symptom Validity Tests (SVTs) out of multiple tests administered. Reconsideration of data presented in Larrabee (2014) supports a criterion of ≥ 2 out of up to 7 PVTs/SVTs, as keeping false positive rates close to and in most cases below 10% in cases with bona fide neurologic, psychiatric, and developmental disorders. Strategies to minimize risk of false positive error are discussed, including (1) adjusting individual PVT cutoffs or criterion for number of PVTs failed, for examinees who have clinical histories placing them at risk for false positive identification (e.g., severe TBI, schizophrenia), (2) using the history of the individual case to rule out conditions known to result in false positive errors, (3) using normal performance in domains mimicked by PVTs to show that sufficient native ability exists for valid performance on the PVT(s) that have been failed, and (4) recognizing that as the number of PVTs/SVTs failed increases, the likelihood of valid clinical presentation decreases, with a corresponding increase in the likelihood of invalid test performance and symptom report.
Measurement of academic entitlement.

PubMed

Miller, Brian K

2013-10-01

Members of Generation Y, or Millennials, have been accused of being lazy, whiny, pampered, and entitled, particularly in the college classroom. Using an equity theory framework, eight items from a measure of work entitlement were adapted to measure academic entitlement in a university setting in three independent samples. In Study 1 (n = 229), confirmatory factor analyses indicated good model fit to a unidimensional structure for the data. In Study 2 (n = 200), the questionnaire predicted unique variance in university satisfaction beyond two more general measures of dispositional entitlement. In Study 3 (n = 161), the measure predicted unique variance in perceptions of grade fairness beyond that which was predicted by another measure of academic entitlement. This analysis provides evidence of discriminant, convergent, incremental, concurrent criterion-related, and construct validity for the Academic Equity Preference Questionnaire.
The development of the Adolescent Nervios Scale: preliminary findings.

PubMed

Livanis, Andrew; Tryon, Georgiana Shick

2010-01-01

This paper details the construction of a scale to measure the culture-bound syndrome of nervios in Latino early adolescents, ages 11 to 14. Informed by nervios literature and experts, we developed the 31-item Adolescent Nervios Scale (ANS) with items comprised of symptoms representing various psychiatric conditions common to Western culture. In contrast to 277 non-Latino early adolescents who responded to the items as representing disparate constructs, 307 Latino early adolescents responded to ANS items in a unitary fashion. For Latino early adolescents, the ANS demonstrated good internal consistency and stability as well as concurrent, discriminative, and criterion-based validity. The results support the measurement of nervios and its relationship to the school performance and adjustment of Latino youth. (PsycINFO Database Record (c) 2009 APA, all rights reserved).
Indonesian teacher engagement index: a rasch model analysis

NASA Astrophysics Data System (ADS)

Sasmoko; Abbas, B. S.; Indrianti, Y.; Widhoyoko, S. A.

2018-01-01

The research aimed to calibrate Indonesian Teacher Engagement Index (ITEI) using instrument with RASCH MODEL. The respondents were 672 teachers of elementary, junior high, high school and vocational school. The number of items planned was 165 items with the initial reliability of 0.98. The ITEI scale uses Likert Scale (1 to 4) which was converted from ordinal scale to Equal Interval Scale. RASCH MODEL analysis was done by selecting based on Outfit Mean Square (MNSQ) between 0.5-1.5 as a good item, and measuring Point Measure Correlation (Pt Mean Corr) with the criterion of 0.4-0.85. Moderate Outfit Z-Standard (ZSTD) was ignored because the sample was >500. Conclusions: ITEI is valid with 30 items and reliability of 0.97, and less engage teachers significantly at α <0.05.
Further evidence for a broader concept of somatization disorder using the somatic symptom index.

PubMed

Hiller, W; Rief, W; Fichter, M M

1995-01-01

Somatization syndromes were defined in a sample of 102 psychosomatic inpatients according to the restrictive criteria of DSM-III-R somatization disorder and the broader diagnostic concept of the Somatic Symptom Index (SSI). Both groups showed a qualitatively similar pattern of psychopathological comorbidity and had elevated scores on measures of depression, hypochondriasis, and anxiety. A good discrimination between mild and severe forms of somatization was found by using the SSI criterion. SSI use accounted for a substantial amount of comorbidity variance, with rates of 15%-20% for depression, 16% for hypochondriasis, and 13% for anxiety. The results provide further evidence for the validity of the SSI concept, which reflects the clinical relevance of somatization in addition to the narrow definition of somatization disorder.
Criterion-Related Validity of Two Curriculum-Based Measures of Mathematical Skill in Relation to Reading Comprehension in Secondary Students

ERIC Educational Resources Information Center

Anselmo, Giancarlo A.; Yarbrough, Jamie L.; Kovaleski, Joseph F.; Tran, Vi N.

2017-01-01

This study analyzed the relationship between benchmark scores from two curriculum-based measurement probes in mathematics (M-CBM) and student performance on a state-mandated high-stakes test. Participants were 298 students enrolled in grades 7 and 8 in a rural southeastern school. Specifically, we calculated the criterion-related and predictive…
The Information a Test Provides on an Ability Parameter. Research Report. ETS RR-07-18

ERIC Educational Resources Information Center

Haberman, Shelby J.

2007-01-01

In item-response theory, if a latent-structure model has an ability variable, then elementary information theory may be employed to provide a criterion for evaluation of the information the test provides concerning ability. This criterion may be considered even in cases in which the latent-structure model is not valid, although interpretation of…
Teachers' Grade Assignment and the Predictive Validity of Criterion-Referenced Grades

ERIC Educational Resources Information Center

Thorsen, Cecilia; Cliffordson, Christina

2012-01-01

Research has found that grades are the most valid instruments for predicting educational success. Why grades have better predictive validity than, for example, standardized tests is not yet fully understood. One possible explanation is that grades reflect not only subject-specific knowledge and skills but also individual differences in other…
Toward a Process-Focused Model of Test Score Validity: Improving Psychological Assessment in Science and Practice

ERIC Educational Resources Information Center

Bornstein, Robert F.

2011-01-01

Although definitions of validity have evolved considerably since L. J. Cronbach and P. E. Meehl's classic (1955) review, contemporary validity research continues to emphasize correlational analyses assessing predictor-criterion relationships, with most outcome criteria being self-reports. The present article describes an alternative way of…
Mobile Phone Use in a Developing Country: A Malaysian Empirical Study

ERIC Educational Resources Information Center

Yeow, Paul H. P.; Yen Yuen, Yee; Connolly, Regina

2008-01-01

This study examined the factors that influence consumer satisfaction with mobile telephone use in Malaysia. The validity of the study's constructs, criterion, and content was confirmed. Construct validity was verified through the factor analysis with a total variance of 73.72 percent explained by all six independent factors. Content validity was…
The Trait Emotional Intelligence Questionnaire: Internal Structure, Convergent, Criterion, and Incremental Validity in an Italian Sample

ERIC Educational Resources Information Center

Andrei, Federica; Smith, Martin M.; Surcinelli, Paola; Baldaro, Bruno; Saklofske, Donald H.

2016-01-01

This study investigated the structure and validity of the Italian translation of the Trait Emotional Intelligence Questionnaire. Data were self-reported from 227 participants. Confirmatory factor analysis supported the four-factor structure of the scale. Hierarchical regressions also demonstrated its incremental validity beyond demographics, the…
The Physical Education and School Sport Environment Inventory: Preliminary Validation and Reliability

ERIC Educational Resources Information Center

Fairclough, Stuart J.; Hilland, Toni A.; Vinson, Don; Stratton, Gareth

2012-01-01

The study purpose was to assess preliminary validity and reliability of the Physical Education and School Sport Environment Inventory (PESSEI), which was designed to audit physical education (PE) and school sport spaces and resources. PE teachers from eight English secondary schools completed the PESSEI. Criterion validity was assessed by…
Eating Disorder Diagnostic Scale: Additional Evidence of Reliability and Validity

ERIC Educational Resources Information Center

Stice, Eric; Fisher, Melissa; Martinez, Erin

2004-01-01

The authors conducted 4 studies investigating the reliability and validity of the Eating Disorder Diagnostic Scale (HDDS; E. Stice, C. F. Telch, & S. L. Rizvi, 2000), a brief self-report measure for diagnosing anorexia nervosa, bulimia nervosa, and binge eating disorder. Study 1 found that the HDDS showed criterion validity with interview-based…
Evaluation of the Gratitude Questionnaire in a Chinese Sample of Adults: Factorial Validity, Criterion-Related Validity, and Measurement Invariance Across Sex

PubMed Central

Kong, Feng; You, Xuqun; Zhao, Jingjing

2017-01-01

The Gratitude Questionnaire (GQ; McCullough et al., 2002) is one of the most widely used instruments to assess dispositional gratitude. The purpose of this study was to validate a Chinese version of the GQ by examining internal consistency, factor structure, convergent validity, and measurement invariance across sex. A total of 1151 Chinese adults were recruited to complete the GQ, Positive Affect and Negative Affect Scales, and Satisfaction with Life Scale. Confirmatory factor analysis indicated that the original unidimensional model fitted well, which is in accordance with the findings in Western populations. Furthermore, the GQ had satisfactory composite reliability and criterion-related validity with measures of life satisfaction and affective well-being. Evidence of configural, metric and scalar invariance across sex was obtained. Tests of the latent mean differences found females had higher latent mean scores than males. These findings suggest that the Chinese version of GQ is a reliable and valid tool for measuring dispositional gratitude and can generally be utilized across sex in the Chinese context. PMID:28919873
Evaluation of the Gratitude Questionnaire in a Chinese Sample of Adults: Factorial Validity, Criterion-Related Validity, and Measurement Invariance Across Sex.

PubMed

Kong, Feng; You, Xuqun; Zhao, Jingjing

2017-01-01

The Gratitude Questionnaire (GQ; McCullough et al., 2002) is one of the most widely used instruments to assess dispositional gratitude. The purpose of this study was to validate a Chinese version of the GQ by examining internal consistency, factor structure, convergent validity, and measurement invariance across sex. A total of 1151 Chinese adults were recruited to complete the GQ, Positive Affect and Negative Affect Scales, and Satisfaction with Life Scale. Confirmatory factor analysis indicated that the original unidimensional model fitted well, which is in accordance with the findings in Western populations. Furthermore, the GQ had satisfactory composite reliability and criterion-related validity with measures of life satisfaction and affective well-being. Evidence of configural, metric and scalar invariance across sex was obtained. Tests of the latent mean differences found females had higher latent mean scores than males. These findings suggest that the Chinese version of GQ is a reliable and valid tool for measuring dispositional gratitude and can generally be utilized across sex in the Chinese context.
Revision of the criterion to avoid electron heating during laser aided plasma diagnostics (LAPD)

NASA Astrophysics Data System (ADS)

Carbone, E. A. D.; Palomares, J. M.; Hübner, S.; Iordanova, E.; van der Mullen, J. J. A. M.

2012-01-01

A criterion is given for the laser fluency (in J/m2) such that, when satisfied, disturbance of the plasma by the laser is avoided. This criterion accounts for laser heating of the electron gas intermediated by electron-ion (ei) and electron-atom (ea) interactions. The first heating mechanism is well known and was extensively dealt with in the past. The second is often overlooked but of importance for plasmas of low degree of ionization. It is especially important for cold atmospheric plasmas, plasmas that nowadays stand in the focus of attention. The new criterion, based on the concerted action of both ei and ea interactions is validated by Thomson scattering experiments performed on four different plasmas.
Users’ Support as a Social Resource in Educational Services: Construct Validity and Measurement Invariance of the User-Initiated Support Scale (UISS)

PubMed Central

Loera, Barbara; Martini, Mara; Viotti, Sara; Converso, Daniela

2016-01-01

Social support is an important resource for reducing the risks of stress and burnout at work. It seems to be particularly helpful for educational and social professionals. The constant and intense relationships with users that characterize this kind of service can be very demanding, increasing stress and leading to burnout. While significant attention has been paid to supervisors and colleagues in the literature, users have rarely been considered as possible sources of social support. The only exception is the Zimmermann et al.’s (2011) research, focused on customer support as a resource for workers’ well-being. This paper proposes the validation of the customer-initiated support scale developed by Zimmermann et al. (2011), translated into Italian and focused on educational services users (children’s parents), to measure the user support perceived by workers: the User-Initiated Support Scale (UISS). In Study 1 (105 teachers), which specifically involved educators and kindergarten teachers, the items and scale properties were preliminarily examined using descriptive analyses and exploratory factor analysis (EFA). In Study 2 (304 teachers), the construct and criterion validity and scale dimensionality were analyzed using confirmatory factor analysis (CFA). In Study 3 (304 teachers from Study 2 and 296 educators), measurement invariance (MI) was tested. The EFA results from Study 1 showed a one-factor solution (explained variance, 67.2%). The scale showed good internal coherence (alpha = 0.88). The CFA in Study 2 validated the one-factor solution (comparative fit index = 0.987; standardized root mean square residual = 0.054). Bivariate correlations confirmed construct validity; the UISS was positively associated (convergent) with user gratitude, and not associated (divergent) with disproportionate customer expectations. Regarding the criterion validity test, the UISS was strongly correlated with burnout and job satisfaction. The analysis of MI performed on the Study 3 data confirmed the equality of the parameters of the covariance structure model between the two samples of kindergarten teachers and educators. This research study offers a useful version of a tool for measuring a crucial, but often ignored, protective resource for all professionals working directly with people (patients, students, and service users) that can represent important sources of well-being, directly or indirectly lessening the negative impacts of job demands. PMID:27602008
Users' Support as a Social Resource in Educational Services: Construct Validity and Measurement Invariance of the User-Initiated Support Scale (UISS).

PubMed

Loera, Barbara; Martini, Mara; Viotti, Sara; Converso, Daniela

2016-01-01

Social support is an important resource for reducing the risks of stress and burnout at work. It seems to be particularly helpful for educational and social professionals. The constant and intense relationships with users that characterize this kind of service can be very demanding, increasing stress and leading to burnout. While significant attention has been paid to supervisors and colleagues in the literature, users have rarely been considered as possible sources of social support. The only exception is the Zimmermann et al.'s (2011) research, focused on customer support as a resource for workers' well-being. This paper proposes the validation of the customer-initiated support scale developed by Zimmermann et al. (2011), translated into Italian and focused on educational services users (children's parents), to measure the user support perceived by workers: the User-Initiated Support Scale (UISS). In Study 1 (105 teachers), which specifically involved educators and kindergarten teachers, the items and scale properties were preliminarily examined using descriptive analyses and exploratory factor analysis (EFA). In Study 2 (304 teachers), the construct and criterion validity and scale dimensionality were analyzed using confirmatory factor analysis (CFA). In Study 3 (304 teachers from Study 2 and 296 educators), measurement invariance (MI) was tested. The EFA results from Study 1 showed a one-factor solution (explained variance, 67.2%). The scale showed good internal coherence (alpha = 0.88). The CFA in Study 2 validated the one-factor solution (comparative fit index = 0.987; standardized root mean square residual = 0.054). Bivariate correlations confirmed construct validity; the UISS was positively associated (convergent) with user gratitude, and not associated (divergent) with disproportionate customer expectations. Regarding the criterion validity test, the UISS was strongly correlated with burnout and job satisfaction. The analysis of MI performed on the Study 3 data confirmed the equality of the parameters of the covariance structure model between the two samples of kindergarten teachers and educators. This research study offers a useful version of a tool for measuring a crucial, but often ignored, protective resource for all professionals working directly with people (patients, students, and service users) that can represent important sources of well-being, directly or indirectly lessening the negative impacts of job demands.
Development and validation of anthropometric equations to estimate appendicular muscle mass in elderly women.

PubMed

Pereira, Piettra Moura Galvão; da Silva, Giselma Alcântara; Santos, Gilberto Moreira; Petroski, Edio Luiz; Geraldes, Amandio Aristides Rihan

2013-07-02

This study aimed to examine the cross validity of two anthropometric equations commonly used and propose simple anthropometric equations to estimate appendicular muscle mass (AMM) in elderly women. Among 234 physically active and functionally independent elderly women, 101 (60 to 89 years) were selected through simple drawing to compose the study sample. The paired t test and the Pearson correlation coefficient were used to perform cross-validation and concordance was verified by intraclass correction coefficient (ICC) and by the Bland and Altman technique. To propose predictive models, multiple linear regression analysis, anthropometric measures of body mass (BM), height, girth, skinfolds, body mass index (BMI) were used, and muscle perimeters were included in the analysis as independent variables. Dual-Energy X-ray Absorptiometry (AMMDXA) was used as criterion measurement. The sample power calculations were carried out by Post Hoc Compute Achieved Power. Sample power values from 0.88 to 0.91 were observed. When compared, the two equations tested differed significantly from the AMMDXA (p <0.001 and p = 0.001). Ten population / specific anthropometric equations were developed to estimate AMM, among them, three equations achieved all validation criteria used: AMM (E2) = 4.150 +0.251 [bodymass (BM)] - 0.411 [bodymass index (BMI)] + 0.011 [Right forearm perimeter (PANTd) 2]; AMM (E3) = 4.087 + 0.255 (BM) - 0.371 (BMI) + 0.011 (PANTd) 2 - 0.035 [thigh skinfold (DCCO)]; MMA (E6) = 2.855 + 0.298 (BM) + 0.019 (Age) - 0,082 [hip circumference (PQUAD)] + 0.400 (PANTd) - 0.332 (BMI). The equations estimated the criterion method (p = 0.056 p = 0.158), and explained from 0.69% to 0.74% of variations observed in AMMDXA with low standard errors of the estimate (1.36 to 1.55 kg) and high concordance (ICC between 0,90 and 0.91 and concordance limits from -2,93 to 2,33 kg). The equations tested were not valid for use in physically active and functionally independent elderly women. The simple anthropometric equations developed in this study showed good practical applicability and high validity to estimate AMM in elderly women.

Development and validation of anthropometric equations to estimate appendicular muscle mass in elderly women

PubMed Central

2013-01-01

Objective This study aimed to examine the cross validity of two anthropometric equations commonly used and propose simple anthropometric equations to estimate appendicular muscle mass (AMM) in elderly women. Methods Among 234 physically active and functionally independent elderly women, 101 (60 to 89 years) were selected through simple drawing to compose the study sample. The paired t test and the Pearson correlation coefficient were used to perform cross-validation and concordance was verified by intraclass correction coefficient (ICC) and by the Bland and Altman technique. To propose predictive models, multiple linear regression analysis, anthropometric measures of body mass (BM), height, girth, skinfolds, body mass index (BMI) were used, and muscle perimeters were included in the analysis as independent variables. Dual-Energy X-ray Absorptiometry (AMMDXA) was used as criterion measurement. The sample power calculations were carried out by Post Hoc Compute Achieved Power. Sample power values from 0.88 to 0.91 were observed. Results When compared, the two equations tested differed significantly from the AMMDXA (p <0.001 and p = 0.001). Ten population / specific anthropometric equations were developed to estimate AMM, among them, three equations achieved all validation criteria used: AMM (E2) = 4.150 +0.251 [bodymass (BM)] - 0.411 [bodymass index (BMI)] + 0.011 [Right forearm perimeter (PANTd) 2]; AMM (E3) = 4.087 + 0.255 (BM) - 0.371 (BMI) + 0.011 (PANTd) 2 - 0.035 [thigh skinfold (DCCO)]; MMA (E6) = 2.855 + 0.298 (BM) + 0.019 (Age) - 0,082 [hip circumference (PQUAD)] + 0.400 (PANTd) - 0.332 (BMI). The equations estimated the criterion method (p = 0.056 p = 0.158), and explained from 0.69% to 0.74% of variations observed in AMMDXA with low standard errors of the estimate (1.36 to 1.55 kg) and high concordance (ICC between 0,90 and 0.91 and concordance limits from -2,93 to 2,33 kg). Conclusion The equations tested were not valid for use in physically active and functionally independent elderly women. The simple anthropometric equations developed in this study showed good practical applicability and high validity to estimate AMM in elderly women. PMID:23815948
Deconstructing the Brain Disconnection–Brain Death Analogy and Clarifying the Rationale for the Neurological Criterion of Death

PubMed Central

Moschella, Melissa

2016-01-01

This article explains the problems with Alan Shewmon’s critique of brain death as a valid sign of human death, beginning with a critical examination of his analogy between brain death and severe spinal cord injury. The article then goes on to assess his broader argument against the necessity of the brain for adult human organismal integration, arguing that he fails to translate correctly from biological to metaphysical claims. Finally, on the basis of a deeper metaphysical analysis, I offer a revised rationale for the validity of the neurological criterion of human death. PMID:27095749
[Criterion and Construct Validity in Nursing Diagnosis "Sedentary Lifestyle" in People over 50 Years Old].

PubMed

Guirao-Goris, Silamani J; Ferrer Ferrandis, Esperanza; Montejano Lozoya, Raimunda

2016-02-18

The aim of the study is to identify the construct and criterion validity of the nursing diagnosis label Sedentary Lifestyle. A cross-sectional study in a nursing consultation in primary health care was conducted. Participants were all people that was attended for one year over 50 who voluntarily wish to participate (n=85) in the study. Objective weekly physical activity was measured in METs with an Accelerometer, objective measure of performance was measured by gait speed EPESE Battery (both measures that were used as the gold standard), and physical activity questionnaires (RAPA), the COOP-WONCA physical fitness chart. Spearman correlation coefficients, mean comparison tests and analysis of sensitivity and specificity were used as statistical analysis. The diagnosis "Sedentary Lifestyle" showed a positive correlation between its manifestations and physical activity measured in METs (r=0.39) and EPESE gait speed (r=0.35). The diagnosis showed a sensitivity of 85.1% and a specificity of 65.2% and showed ability to discriminate active people from those that are not using METs as a measure of physical activity (t=-4.4). The diagnosis "Sedentary Lifestyle" shows criterion and construct validity.
[Criterion Validity of the German Version of the CES-D in the General Population].

PubMed

Jahn, Rebecca; Baumgartner, Josef S; van den Nest, Miriam; Friedrich, Fabian; Alexandrowicz, Rainer W; Wancata, Johannes

2018-04-17

The "Center of Epidemiologic Studies - Depression scale" (CES-D) is a well-known screening tool for depression. Until now the criterion validity of the German version of the CES-D was not investigated in a sample of the adult general population. 508 study participants of the Austrian general population completed the CES-D. ICD-10 diagnoses were established by using the Schedules for Clinical Assessment in Neuropsychiatry (SCAN). Receiver Operating Characteristics (ROC) analysis was conducted. Possible gender differences were explored. Overall discriminating performance of the CES-D was sufficient (ROC-AUC 0,836). Using the traditional cut-off values of 15/16 and 21/22 respectively the sensitivity was 43.2 % and 32.4 %, respectively. The cut-off value developed on the basis of our sample was 9/10 with a sensitivity of 81.1 % und a specificity of 74.3 %. There were no significant gender differences. This is the first study investigating the criterion validity of the German version of the CES-D in the general population. The optimal cut-off values yielded sufficient sensitivity and specificity, comparable to the values of other screening tools. © Georg Thieme Verlag KG Stuttgart · New York.
[Development and validity of workplace bullying in nursing-type inventory (WPBN-TI)].

PubMed

Lee, Younju; Lee, Mihyoung

2014-04-01

The purpose of this study was to develop an instrument to assess bullying of nurses, and test the validity and reliability of the instrument. The initial thirty items of WPBN-TI were identified through a review of the literature on types bullying related to nursing and in-depth interviews with 14 nurses who experienced bullying at work. Sixteen items were developed through 2 content validity tests by 9 experts and 10 nurses. The final WPBN-TI instrument was evaluated by 458 nurses from five general hospitals in the Incheon metropolitan area. SPSS 18.0 program was used to assess the instrument based on internal consistency reliability, construct validity, and criterion validity. WPBN-TI consisted of 16 items with three distinct factors (verbal and nonverbal bullying, work-related bullying, and external threats), which explained 60.3% of the total variance. The convergent validity and determinant validity for WPBN-TI were 100.0%, 89.7%, respectively. Known-groups validity of WPBN-TI was proven through the mean difference between subjective perception of bullying. The satisfied criterion validity for WPBN-TI was more than .70. The reliability of WPBN-TI was Cronbach's α of .91. WPBN-TI with high validity and reliability is suitable to determine types of bullying in nursing workplace.
Reliability and validity of a visual analogue scale used by owners to measure chronic pain attributable to osteoarthritis in their dogs.

PubMed

Hielm-Björkman, Anna K; Kapatkin, Amy S; Rita, Hannu J

2011-05-01

To assess validity and reliability for a visual analogue scale (VAS) used by owners to measure chronic pain in their osteoarthritic dogs. 68, 61, and 34 owners who completed a questionnaire. Owners answered questionnaires at 5 time points. Criterion validity of the VAS was evaluated for all dogs in the intended-to-treat population by correlating scores for the VAS with scores for the validated Helsinki Chronic Pain Index (HCPI) and a relative quality-of-life scale. Intraclass correlation was used to assess repeatability of the pain VAS at 2 baseline evaluations. To determine sensitivity to change and face validity of the VAS, 2 blinded, randomized control groups (17 dogs receiving carprofen and 17 receiving a placebo) were analyzed over time. Significant correlations existed between the VAS score and the quality-of-life scale and HCPI scores. Intraclass coefficient (r = 0.72; 95% confidence interval, 0.57 to 0.82) for the VAS indicated good repeatability. In the carprofen and placebo groups, there was poor correlation between the 2 pain evaluation methods (VAS and HCPI items) at the baseline evaluation, but the correlation improved in the carprofen group over time. No correlation was detected for the placebo group over time. Although valid and reliable, the pain VAS was a poor tool for untrained owners because of poor face validity (ie, owners could not recognize their dogs' behavior as signs of pain). Only after owners had seen pain diminish and then return (after starting and discontinuing NSAID use) did the VAS have face validity.
Measurement properties of tools measuring mental health knowledge: a systematic review.

PubMed

Wei, Yifeng; McGrath, Patrick J; Hayden, Jill; Kutcher, Stan

2016-08-23

Mental health literacy has received great attention recently to improve mental health knowledge, decrease stigma and enhance help-seeking behaviors. We conducted a systematic review to critically appraise the qualities of studies evaluating the measurement properties of mental health knowledge tools and the quality of included measurement properties. We searched PubMed, PsycINFO, EMBASE, CINAHL, the Cochrane Library, and ERIC for studies addressing psychometrics of mental health knowledge tools and published in English. We applied the COSMIN checklist to assess the methodological quality of each study as "excellent", "good", "fair", or "indeterminate". We ranked the level of evidence of the overall quality of each measurement property across studies as "strong", "moderate", "limited", "conflicting", or "unknown". We identified 16 mental health knowledge tools in 17 studies, addressing reliability, validity, responsiveness or measurement errors. The methodological quality of included studies ranged from "poor" to "excellent" including 6 studies addressing the content validity, internal consistency or structural validity demonstrating "excellent" quality. We found strong evidence of the content validity or internal consistency of 6 tools; moderate evidence of the internal consistency, the content validity or the reliability of 8 tools; and limited evidence of the reliability, the structural validity, the criterion validity, or the construct validity of 12 tools. Both the methodological qualities of included studies and the overall evidence of measurement properties are mixed. Based on the current evidence, we recommend that researchers consider using tools with measurement properties of strong or moderate evidence that also reached the threshold for positive ratings according to COSMIN checklist.
[Reliability and validity of depression scales of Chinese version: a systematic review].

PubMed

Sun, X Y; Li, Y X; Yu, C Q; Li, L M

2017-01-10

Objective: Through systematically reviewing the reliability and validity of depression scales of Chinese version in adults in China to evaluate the psychometric properties of depression scales for different groups. Methods: Eligible studies published before 6 May 2016 were retrieved from the following database: CNKI, Wanfang, PubMed and Embase. The HSROC model of the diagnostic test accuracy (DTA) for Meta-analysis was used to calculate the pooled sensitivity and specificity of the PHQ-9. Results: A total of 44 papers evaluating the performance of depression scales were included. Results showed that the reliability and validity of the common depression scales were eligible, including the Beck depression inventory (BDI), the Hamilton depression scale (HAMD), the center epidemiological studies depression scale (CES-D), the patient health questionnaire (PHQ) and the Geriatric depression scale (GDS). The Cronbach' s coefficient of most tools were larger than 0.8, while the test-retest reliability and split-half reliability were larger than 0.7, indicating good internal consistency and stability. The criterion validity, convergent validity, discrimination validity and screening validity were acceptable though different cut-off points were recommended by different studies. The pooled sensitivity of the 11 studies evaluating PHQ-9 was 0.88 (95 %CI : 0.85-0.91) while the pooled specificity was 0.89 (95 %CI : 0.82-0.94), which demonstrated the applicability of PHQ-9 in screening depression. Conclusion: The reliability and validity of different depression scales of Chinese version are acceptable. The characteristics of different tools and study population should be taken into consideration when choosing a specific scale.
Translation and linguistic validation of the Persian version of the Bristol Female Lower Urinary Tract Symptoms instrument.

PubMed

Pourmomeny, Abbas Ali; Rezaeian, Zahra Sadat; Soltanmohamadi, Mahsa

2017-09-01

The aim of this study was to evaluate the psychometric properties of the Persian version of the International Consultation on Incontinence Modular Questionnaire for Female Lower Urinary Tract Symptoms (ICIQ-FLUTS) in patients with urinary tract dysfunction. After gaining permission from the International Consultation on Incontinence Modular Questionnaire (ICIQ) advisory board, the English Female Lower Urinary Tract Symptoms (FLUTS) questionnaire was translated into Persian and then translated back into English. One hundred fourteen women with pelvic floor dysfunction were asked to complete the Persian FLUTS and International Consultation on Incontinence Modular Questionnaire Overactive Bladder Questionnaire (ICIQ-OAB). The Persian FLUTS questionnaire was also readministered to 20 patients 2 weeks after their initial visit. Study data were analyzed using SPSS V16.0. To validate the translated questionnaire, we assayed content/face validity, internal consistency/reliability, and construct validity. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and the intraclass correlation coefficient (ICC) respectively. The mean age of the patients was 48.8 years old, 84% were married, and 59% had at least one Caesarean. Except for very few missing data, there is no any ambiguity in the Persian version of the FLUTS questionnaire. The Cronbach's alpha was 0.83, indicating a high internal consistency. Concerning criterion validity, correlation between the Persian FLUTS and the OAB was 0.77 (p < 0.001). The initial testing of the Persian version of the FLUTS questionnaire demonstrates good internal consistency, content validity, and reliability.
Responsiveness and predictive validity of the tablet-based symbol digit modalities test in patients with stroke.

PubMed

Hsiao, Pei-Chi; Yu, Wan-Hui; Lee, Shih-Chieh; Chen, Mei-Hsiang; Hsieh, Ching-Lin

2018-06-14

The responsiveness and predictive validity of the Tablet-based Symbol Digit Modalities Test (T-SDMT) are unknown, which limits the utility of the T-SDMT in both clinical and research settings. The purpose of this study was to examine the responsiveness and predictive validity of the T-SDMT in inpatients with stroke. A follow-up, repeated-assessments design. One rehabilitation unit at a local medical center. A total of 50 inpatients receiving rehabilitation completed T-SDMT assessments at admission to and discharge from a rehabilitation ward. The median follow-up period was 14 days. The Barthel index (BI) was assessed at discharge and was used as the criterion of the predictive validity. The mean changes in the T-SDMT scores between admission and discharge were statistically significant (paired t-test = 3.46, p = 0.001). The T-SDMT scores showed a nearly moderate standardized response mean (0.49). A moderate association (Pearson's r = 0.47) was found between the scores of the T-SDMT at admission and those of the BI at discharge, indicating good predictive validity of the T-SDMT. Our results support the responsiveness and predictive validity of the T-SDMT in patients with stroke receiving rehabilitation in hospitals. This study provides empirical evidence supporting the use of the T-SDMT as an outcome measure for assessing processingspeed in inpatients with stroke. The scores of the T-SDMT could be used to predict basic activities of daily living function in inpatients with stroke.
[Design and validation of the scales for the assessment of the psychological impact of past life events: the role of ruminative thought and personal growth].

PubMed

Fernández-Fernández, Virginia; Márquez-González, María; Losada-Baltar, Andrés; García, Pablo E; Romero-Moreno, Rosa

2013-01-01

Older people's emotional distress is often related to rumination processes focused on past vital events occurred during their lives. The specific coping strategies displayed to face those events may contribute to explain older adults' current well-being: they can perceive that they have obtained personal growth after those events and/or they can show a tendency to have intrusive thoughts about those events. This paper describes the development and analysis of the psychometric properties of the Scales for the Assessment of the Psychological Impact of Past Life Events (SAPIPLE): the past life events-occurrence scale (LE-O), ruminative thought scale (LE-R) and personal growth scale (LE-PG). Participants were 393 community dwelling elderly (mean age=71.5 years old; SD=6.9). In addition to the SAPIPLE scales, depressive symptomatology, anxiety, psychological well-being, life satisfaction, physical function and vitality have been assessed. The inter-rater agreement's analysis suggests the presence of two factors in the LE-O: positive and negative vital events. Confirmatory Factor Analysis (CFA) supported this two-dimensional structure for both the LE-R and the LE-PG. Good internal consistency indexes have been obtained for each scale and subscale, as well as good criterion and concurrent validity indexes. Both ruminative thoughts about past life events and personal growth following those events are related to older adults' current well-being. The SAPIPLE presents good psychometric properties that justify its use for elderly people. Copyright © 2012 SEGG. Published by Elsevier Espana. All rights reserved.
A treatment schedule of conventional physical therapy provided to enhance upper limb sensorimotor recovery after stroke: expert criterion validity and intra-rater reliability.

PubMed

Donaldson, Catherine; Tallis, Raymond C; Pomeroy, Valerie M

2009-06-01

Inadequate description of treatment hampers progress in stroke rehabilitation. To develop a valid, reliable, standardised treatment schedule of conventional physical therapy provided for the paretic upper limb after stroke. Eleven neurophysiotherapists participated in the established methodology: semi-structured interviews, focus groups and piloting a draft treatment schedule in clinical practice. Different physiotherapists (n=13) used the treatment schedule to record treatment given to stroke patients with mild, moderate and severe upper limb paresis. Rating of adequacy of the treatment schedule was made using a visual analogue scale (0 to 100mm). Mean (95% confidence interval) visual analogue scores were calculated (expert criterion validity). For intra-rater reliability, each physiotherapist observed a video tape of their treatment and immediately completed a treatment schedule recording form on two separate occasions, 4 to 6 weeks apart. The Kappa statistic was calculated for intra-rater reliability. The treatment schedule consists of a one-page A4 recording form and a user booklet, detailing 50 treatment activities. Expert criterion validity was 79 (95% confidence interval 74 to 84). Intra-rater Kappa was 0.81 (P<0.001). This treatment schedule can be used to document conventional physical therapy in subsequent clinical trials in the geographical area of its development. Further work is needed to investigate generalisability beyond this geographical area.
[Reliability and validity of Meaningful Life Measure-Chinese Revised in Chinese college students].

PubMed

Xiao, Rong; Lai, Qiao-Zhen; Yang, Jia-Ping

2016-04-20

To test the reliability and validity of Meaningful Life Measure-Chinese Revised (MLM-CR) in Chinese college students. A total of 1035 college students were evaluated with MLM-CR, Satisfaction with Life Scale (SWLS), Purpose in Life (PIL) and Patient Health Questionnaire-2 (PHQ-2), and 120 of the students were examined with PIL-SF twice. All the items in MLM-CR had good discrimination indexes (r=0.753-0.838, P<0.001). Confirmatory factor analysis confirmed the hypothesized five-factor model of MLM-CR (Χ 2 /df=3.4, GFI=0.946, AGFI=0.924, RMR=0.069, NFI=0.953, CFI=0.966, RMSEA=0.048). The total internal consistency reliability of MLM-CR was 0.942, and the alpha coefficients of the 5 dimensions ranged from 0.782 to 0.877; the total split-half reliability was 0.920, and the split-half reliability of the 5 dimensions ranged from 0.752 to 0.830; the total test-retest reliability was 0.871, and the test-retest reliability of the 5 dimensions ranged from 0.783 to 0.805. The criterion validity of MLM-CR in correlation with SWLS, PIL and PHQ-2 was 0.66, 0.755 and -0.388, respectively (P<0.01). The Average score of MLM-CR of the college students was 5.20∓0.90, and the scores were significantly higher in female students than in the male students (P<0.001). MLM-CR has good psychometric properties for application in comprehensive evaluation of personal meaning in life.
Psychometric goodness of the Mini Sleep Questionnaire.

PubMed

Natale, Vincenzo; Fabbri, Marco; Tonetti, Lorenzo; Martoni, Monica

2014-07-01

The current study was conducted to evaluate the psychometric properties and analyze the convergent validity of the Italian version of the Mini Sleep Questionnaire (MSQ). In addition, it was aimed to put forward cut-off values to be used in screening protocols. The MSQ was administered to 1830 participants (age range 18-87 years), of whom 1208 also completed the Sleep Disorder Questionnaire (age range 18-87 years). A subgroup of 187 (age range 18-71 years) participants was randomly chosen to test the test-retest reliability. A complete psychometric evaluation was performed on the MSQ. To study the validity of the tool, the Sleep Disorder Questionnaire was used as an external criterion to validate the MSQ. Using the Youden index, we calculated the cut-off values that performed best. Finally, we created receiver-operator curves to test the accuracy of each cut-off value identified. For the MSQ, Cronbach's alpha score was 0.77 while homogeneity was 0.26. Factorial analyses confirmed the presence of two dimensions: sleep (Cronbach's alpha 0.75; homogeneity 0.37) and wake (Cronbach's alpha 0.75; homogeneity 0.44). For each dimension, a cut-off value was identified (>16 and >14, respectively). Both cut-off values obtained an area under the curve higher than 0.80. Psychometric evaluation of the MSQ was satisfactory. The cut-off values analyzed in the present study showed good performance. On the whole, the results of this study suggest that the MSQ can be a useful screening tool. © 2014 The Authors. Psychiatry and Clinical Neurosciences © 2014 Japanese Society of Psychiatry and Neurology.
The Association Between Self-Rated Fitness and Cardiorespiratory Fitness in Adults.

PubMed

Jensen, Karina Gregersen; Rosthøj, Susanne; Linneberg, Allan; Aadahl, Mette

2018-06-01

To assess criterion validity of a single item question on self-rated physical fitness against objectively measured cardiorespiratory fitness. From the Health2008 study 749 men and women between 30 and 60 years of age rated their fitness as excellent, very good, good, fair or poor. Cardiorespiratory fitness was estimated with the watt-max test. Agreement between self-rated and objectively measured physical fitness was assessed by Cohen's weighted kappa coefficient. Correlation was determined by Goodman & Kruskal's gamma correlation coefficient. All analyses were stratified according to gender. Data from 323 men and 426 women were analysed. There was a slight agreement between self-rated and objectively measured fitness in men (weighted kappa: 0.18, [95%CI: 0.13;0.23]) and a fair agreement in women (weighted kappa: 0.27, [95%CI: 0.22;0.32]). In both genders, self-rated fitness was positively correlated with objectively measured fitness (moderate correlation; gamma correlation coefficient for men: 0.63 [95%CI: 0.54;0.72] and women: 0.67 [95%CI: 0.59;0.75]). There was a slight to fair agreement and moderate, positive correlations between self-rated physical fitness and watt-max estimated cardiorespiratory fitness. Hence, a single-item question on physical fitness may be a cost-effective method of assessing fitness in large population studies, but is not valid for individual assessments. © Georg Thieme Verlag KG Stuttgart · New York.
The Healthy Lifestyle and Personal Control Questionnaire (HLPCQ): a novel tool for assessing self-empowerment through a constellation of daily activities.

PubMed

Darviri, Christina; Alexopoulos, Evangelos C; Artemiadis, Artemios K; Tigani, Xanthi; Kraniotou, Christina; Darvyri, Panagiota; Chrousos, George P

2014-09-24

The main goal of stress management and health promotion programs is to improve health by empowering people to take control over their lives. Daily health-related lifestyle choices are integral targets of these interventions and critical to evaluating their efficacy. To date, concepts such as self-efficacy, self-control and empowerment are assessed by tools that only partially address daily lifestyle choices. The aim of this study is to validate a novel measurement tool, the Healthy Lifestyle and Personal Control Questionnaire (HLPCQ), which aims to assess the concept of empowerment through a constellation of daily activities. Therefore, we performed principal component analysis (PCA) of 26 items that were derived from the qualitative data of several stress management programs conducted by our research team. The PCA resulted in the following five-factor solution: 1) Dietary Healthy Choices, 2) Dietary Harm Avoidance, 3) Daily Routine, 4) Organized Physical Exercise and 5) Social and Mental Balance. All subscales showed satisfactory internal consistency and variance, relative to theoretical score ranges. Subscale scores and the total score were significantly correlated with perceived stress and health locus of control, implying good criterion validity. Associations with sociodemographic data and other variables, such as sleep quality and health assessments, were also found. The HLPCQ is a good tool for assessing the efficacy of future health-promoting interventions to improve individuals' lifestyle and wellbeing.
Patient Assessment of Constipation Quality of Life Questionnaire: Translation, Cultural Adaptation, Reliability, and Validity of the Persian Version.

PubMed

Nikjooy, Afsaneh; Jafari, Hassan; Saba, Maryam A; Ebrahimi, Naghmeh; Mirzaei, Rezvan

2018-05-01

The Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire is the most validated and the most specific tool for measuring the quality of life of patients with constipation. Over 120 million people live in countries whose official language is Persian. There is no reported Persian version of the PAC-QOL questionnaire yet. The aim of this study was to translate and culturally adapt the PAC-QOL questionnaire and to assess its reliability and validity among Persian patients with chronic constipation. Following the translation and cultural adaptation of the PAC-QOL questionnaire to Persian, 100 patients (mean±SD age=40.51±13.67) with constipation were recruited for validity measurement and 20 patients were re-examined for reliability. Content validity was assessed based on the opinions of an expert committee and the floor/ceiling effect. Construct validity was evaluated according to the hypothesis test. The SF-36 questionnaire was used for concurrent criterion validity, intra-class correlation coefficient for reliability, and Cronbach's alpha for internal consistency. The content validity of the PAC-QOL questionnaire was proven, and there was no floor/ceiling effect. Construct validity also was confirmed based on the hypothesis test. The overall Cronbach's alpha of the PAC-QOL questionnaire was 0.92 (range=0.72-0.92), and the overall intra-class correlation coefficient of the questionnaire was 0.88 (range=0.69-0.87). The correlation between the SF-36 and PAC-QOL questionnaires was moderate. The Persian version of the PAC-QOL questionnaire demonstrated good validity and reliability properties in chronic constipation. Accordingly, Persian researchers and clinicians can benefit from this questionnaire in further research and assessment of treatment outcomes.
Concordance and Discordance of Self-Rated and Researcher-Measured Successful Aging: Subtypes and Associated Factors.

PubMed

Gu, Danan; Feng, Qiushi; Sautter, Jessica M; Yang, Fang; Ma, Lei; Zhen, Zhihong

2017-03-01

To investigate subtypes of successful aging (SA) based on concordance and discordance between self-rated and researcher-defined measures and their associations with demographic, psychosocial, and life satisfaction factors. We used multinomial logistic regression models to analyze 2013 cross-sectional survey data from 1,962 persons aged 65 and older in Shanghai that measured self-rated successful aging (SSA) with a single global assessment and researcher-defined successful aging (RSA) with a cumulative deficit index reflecting physical, physiological, cognitive, psychological, and social engagement domains. We generated four subtypes based on these two dichotomous variables: nonsuccessful aging (non-SA; meeting neither the criterion of RSA nor the criterion of SSA), RSA-only (meeting the criterion of RSA-only but not the criterion of SSA), SSA-only (meeting the criterion of SSA-only but not the criterion of RSA), and both-successful aging (both-SA; meeting both criteria of RSA and SSA). In the sample, 32% were nonsuccessful agers, 7% RSA-only, 34% SSA-only, and 27% successful agers. Female gender and older age were associated with lower likelihood of RSA-only and both-SA relative to non-SA, but with greater likelihood of SSA-only. Good socioeconomic conditions and social networks were associated with greater likelihood of SSA-only and both-SA relative to non-SA or RSA-only. Satisfaction with life domains was robustly and positively associated with good successful aging outcomes. Researcher-defined successful aging and self-rated successful aging are different measures with distinct social correlates. Subtypes of concordance and discordance provide a more holistic biopsychosocial conceptualization of successful aging. © The Author 2016. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Evaluation of objectivity, reliability and criterion validity of the key indicator method for manual handling operations (KIM-MHO), draft 2007.

PubMed

Klußmann, André; Gebhardt, Hansjürgen; Rieger, Monika; Liebers, Falk; Steinberg, Ulf

2012-01-01

Upper extremity musculoskeletal symptoms and disorders are common in the working population. The economic and social impact of such disorders is considerable. Long-time, dynamic repetitive exposure of the hand-arm system during manual handling operations (MHO) alone or in combination with static and postural effort are recognised as causes of musculoskeletal symptoms and disorders. The assessment of these manual work tasks is crucial to estimate health risks of exposed employees. For these work tasks, a new method for the assessment of the working conditions was developed and a validation study was performed. The results suggest satisfying criterion validity and moderate objectivity of the KIM-MHO draft 2007. The method was modified and evaluated again. It is planned to release a new version of KIM-MHO in spring 2012.
Reliability and Validity of the Professional Counseling Performance Evaluation

ERIC Educational Resources Information Center

Shepherd, J. Brad; Britton, Paula J.; Kress, Victoria E.

2008-01-01

The definition and measurement of counsellor trainee competency is an issue that has received increased attention yet lacks quantitative study. This research evaluates item responses, scale reliability and intercorrelations, interrater agreement, and criterion-related validity of the Professional Performance Fitness Evaluation/Professional…

Some links on this page may take you to non-federal websites. Their policies may differ from this site.