Validating MEDIQUAL Constructs
NASA Astrophysics Data System (ADS)
Lee, Sang-Gun; Min, Jae H.
In this paper, we validate MEDIQUAL constructs through the different media users in help desk service. In previous research, only two end-users' constructs were used: assurance and responsiveness. In this paper, we extend MEDIQUAL constructs to include reliability, empathy, assurance, tangibles, and responsiveness, which are based on the SERVQUAL theory. The results suggest that: 1) five MEDIQUAL constructs are validated through the factor analysis. That is, importance of the constructs have relatively high correlations between measures of the same construct using different methods and low correlations between measures of the constructs that are expected to differ; and 2) five MEDIQUAL constructs are statistically significant on media users' satisfaction in help desk service by regression analysis.
The Modified Cognitive Constructions Coding System: Reliability and Validity Assessments
ERIC Educational Resources Information Center
Moran, Galia S.; Diamond, Gary M.
2006-01-01
The cognitive constructions coding system (CCCS) was designed for coding client's expressed problem constructions on four dimensions: intrapersonal-interpersonal, internal-external, responsible-not responsible, and linear-circular. This study introduces, and examines the reliability and validity of, a modified version of the CCCS--a version that…
Sirota, Miroslav; Juanchich, Marie
2018-03-27
The Cognitive Reflection Test, measuring intuition inhibition and cognitive reflection, has become extremely popular because it reliably predicts reasoning performance, decision-making, and beliefs. Across studies, the response format of CRT items sometimes differs, based on the assumed construct equivalence of tests with open-ended versus multiple-choice items (the equivalence hypothesis). Evidence and theoretical reasons, however, suggest that the cognitive processes measured by these response formats and their associated performances might differ (the nonequivalence hypothesis). We tested the two hypotheses experimentally by assessing the performance in tests with different response formats and by comparing their predictive and construct validity. In a between-subjects experiment (n = 452), participants answered stem-equivalent CRT items in an open-ended, a two-option, or a four-option response format and then completed tasks on belief bias, denominator neglect, and paranormal beliefs (benchmark indicators of predictive validity), as well as on actively open-minded thinking and numeracy (benchmark indicators of construct validity). We found no significant differences between the three response formats in the numbers of correct responses, the numbers of intuitive responses (with the exception of the two-option version, which had a higher number than the other tests), and the correlational patterns of the indicators of predictive and construct validity. All three test versions were similarly reliable, but the multiple-choice formats were completed more quickly. We speculate that the specific nature of the CRT items helps build construct equivalence among the different response formats. We recommend using the validated multiple-choice version of the CRT presented here, particularly the four-option CRT, for practical and methodological reasons. Supplementary materials and data are available at https://osf.io/mzhyc/ .
Construction and Validation of a Professional Suitability Scale for Social Work Practice
ERIC Educational Resources Information Center
Tam, Dora M. Y.; Coleman, Heather
2009-01-01
This article reports on the construction and validation of a professional suitability scale, designed for assessing students' suitability for social work practice. Data were collected from 188 field supervisors who provided usable questionnaires, representing a response rate of 74%. Construct validation by exploratory factor analysis identified a…
ERIC Educational Resources Information Center
Jackson, Allen W.; Morrow, James R., Jr.; Bowles, Heather R.; FitzGerald, Shannon J.; Blair, Steven N.
2007-01-01
Valid measurement of physical activity is important for studying the risks for morbidity and mortality. The purpose of this study was to examine evidence of construct validity of two similar single-response items assessing physical activity via self-report. Both items are based on the stages of change model. The sample was 687 participants (men =…
Johnston, Marie; Dixon, Diane; Hart, Jo; Glidewell, Liz; Schröder, Carin; Pollard, Beth
2014-05-01
In studies involving theoretical constructs, it is important that measures have good content validity and that there is not contamination of measures by content from other constructs. While reliability and construct validity are routinely reported, to date, there has not been a satisfactory, transparent, and systematic method of assessing and reporting content validity. In this paper, we describe a methodology of discriminant content validity (DCV) and illustrate its application in three studies. Discriminant content validity involves six steps: construct definition, item selection, judge identification, judgement format, single-sample test of content validity, and assessment of discriminant items. In three studies, these steps were applied to a measure of illness perceptions (IPQ-R) and control cognitions. The IPQ-R performed well with most items being purely related to their target construct, although timeline and consequences had small problems. By contrast, the study of control cognitions identified problems in measuring constructs independently. In the final study, direct estimation response formats for theory of planned behaviour constructs were found to have as good DCV as Likert format. The DCV method allowed quantitative assessment of each item and can therefore inform the content validity of the measures assessed. The methods can be applied to assess content validity before or after collecting data to select the appropriate items to measure theoretical constructs. Further, the data reported for each item in Appendix S1 can be used in item or measure selection. Statement of contribution What is already known on this subject? There are agreed methods of assessing and reporting construct validity of measures of theoretical constructs, but not their content validity. Content validity is rarely reported in a systematic and transparent manner. What does this study add? The paper proposes discriminant content validity (DCV), a systematic and transparent method of assessing and reporting whether items assess the intended theoretical construct and only that construct. In three studies, DCV was applied to measures of illness perceptions, control cognitions, and theory of planned behaviour response formats. Appendix S1 gives content validity indices for each item of each questionnaire investigated. Discriminant content validity is ideally applied while the measure is being developed, before using to measure the construct(s), but can also be applied after using a measure. © 2014 The British Psychological Society.
Construct Validity and Reliability of College Students' Responses to the Reasons for Smoking Scale
ERIC Educational Resources Information Center
Fiala, Kelly Ann; D'Abundo, Michelle Lee; Marinaro, Laura Marie
2010-01-01
When utilizing self-assessments to determine motives for health behaviors, it is essential that the resulting data demonstrate sound psychometric properties. The purpose of this research was to assess the reliability and construct validity of college students' responses to the Reasons for Smoking Scale (RFS). Confirmatory factor analyses and…
Rater Cognition: Implications for Validity
ERIC Educational Resources Information Center
Bejar, Issac I.
2012-01-01
The scoring process is critical in the validation of tests that rely on constructed responses. Documenting that readers carry out the scoring in ways consistent with the construct and measurement goals is an important aspect of score validity. In this article, rater cognition is approached as a source of support for a validity argument for scores…
ERIC Educational Resources Information Center
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen
2010-01-01
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Validation of the breast evaluation questionnaire for breast hypertrophy and breast reduction.
Lewin, Richard; Elander, Anna; Lundberg, Jonas; Hansson, Emma; Thorarinsson, Andri; Claudelin, Malin; Bladh, Helena; Lidén, Mattias
2018-06-13
There is a lack of published, validated questionnaires for evaluating psychosocial morbidity in patients with breast hypertrophy undergoing breast reduction surgery. To validate the breast evaluation questionnaire (BEQ), originally developed for the assessment of breast augmentation patients, for the assessment of psychosocial morbidity in patients with breast hypertrophy undergoing breast reduction surgery. Validation study Subjects: Women with macromastia Methods: The validation of the BEQ, adapted to breast reduction, was performed in several steps. Content validity, reliability, construct validity and responsiveness were assessed. The original version was adjusted according to the results for content validity and resulted in item reduction and a modified BEQ (mBEQ) that was then assessed for reliability, construct validity and responsiveness. Internal and external validation was performed for the modified BEQ. Convergent validity was tested against Breast-Q (reduction) and discriminate validity was tested against the SF-36. Known-groups validation revealed significant differences between the normal population and patients undergoing breast reduction surgery. The BEQ showed good reliability by test-re-test analysis and high responsiveness. The modified BEQ may be reliable, valid and responsive instrument for assessing women who undergo breast reduction.
Substance versus style: a new look at social desirability in motivating contexts.
Smith, D Brent; Ellingson, Jill E
2002-04-01
Although there is an emerging consensus that social desirability does not meaningfully affect criterion-related validity, several researchers have reaffirmed the argument that social desirability degrades the construct validity of personality measures. Yet, most research demonstrating the adverse consequences of faking for construct validity uses a fake-good instruction set. The consequence of such a manipulation is to exacerbate the effects of response distortion beyond what would be expected under realistic circumstances (e.g., an applicant setting). The research reported in this article was designed to assess these issues by using real-world contexts not influenced by artificial instructions. Results suggest that response distortion has little impact on the construct validity of personality measures used in selection contexts.
Percent Grammatical Responses as a General Outcome Measure: Initial Validity
ERIC Educational Resources Information Center
Eisenberg, Sarita L.; Guo, Ling-Yu
2018-01-01
Purpose: This report investigated the validity of using percent grammatical responses (PGR) as a measure for assessing grammaticality. To establish construct validity, we computed the correlation of PGR with another measure of grammar skills and with an unrelated skill area. To establish concurrent validity for PGR, we computed the correlation of…
ERIC Educational Resources Information Center
Eleje, Lydia I.; Esomonu, Nkechi P. M.
2018-01-01
A Test to measure achievement in quantitative economics among secondary school students was developed and validated in this study. The test is made up 20 multiple choice test items constructed based on quantitative economics sub-skills. Six research questions guided the study. Preliminary validation was done by two experienced teachers in…
Abma, Femke I; van der Klink, Jac J L; Bültmann, Ute
2013-03-01
The promotion of a sustainable, healthy and productive working life attracts more and more attention. Recently the Work Role Functioning Questionnaire (WRFQ) has been cross-culturally translated and adapted to Dutch. This questionnaire aims to measure the health-related work functioning of workers with health problems. The aim of this study is to evaluate the reliability, validity (including five new items) and responsiveness of the WRFQ 2.0 in the working population. A longitudinal study was conducted among workers. The reliability (internal consistency, test-retest reliability, measurement error), validity (structural validity-factor analysis, construct validity by means of hypotheses testing) and responsiveness of the WRFQ 2.0 were evaluated. A total of N = 553 workers completed the survey. The final WRFQ 2.0 has four subscales and showed very good internal consistency, moderate test-retest reliability, good construct validity and moderate responsiveness in the working population. The WRFQ was able to distinguish between groups with different levels of mental health, physical health, fatigue and need for recovery. A moderate correlation was found between WRFQ and related constructs respectively work ability and work productivity. A weak relationship was found with general self-rated health, work engagement and work involvement. The WRFQ 2.0 is a reliable and valid instrument to measure health-related work functioning in the working population. Further validation in larger samples is recommended, especially for test-retest reliability, responsiveness and the questionnaire's ability to predict the future course of health-related work functioning.
A Study on the Impact of Fatigue on Human Raters When Scoring Speaking Responses
ERIC Educational Resources Information Center
Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming
2014-01-01
The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…
ERIC Educational Resources Information Center
Thomas, Michael L.; Lanyon, Richard I.; Millsap, Roger E.
2009-01-01
The use of criterion group validation is hindered by the difficulty of classifying individuals on latent constructs. Latent class analysis (LCA) is a method that can be used for determining the validity of scales meant to assess latent constructs without such a priori classifications. The authors used this method to examine the ability of the L…
ERIC Educational Resources Information Center
Detrick, Paul; Chibnall, John T.
2008-01-01
Interpretation of positive response distortion (socially desirable responding) in employment evaluations is an important validity issue. This study of police officer applicants examined the construct validity of the Paulhus Deception Scales (PDS)--Moralistic Bias (MB; exaggerated adjustment/agreeableness) and Egoistic Bias (EB; exaggerated…
Assessing Construct Validity Using Multidimensional Item Response Theory.
ERIC Educational Resources Information Center
Ackerman, Terry A.
The concept of a user-specified validity sector is discussed. The idea of the validity sector combines the work of M. D. Reckase (1986) and R. Shealy and W. Stout (1991). Reckase developed a methodology to represent an item in a multidimensional latent space as a vector. Item vectors are computed using multidimensional item response theory item…
Evidence of Construct Validity in Published Achievement Tests.
ERIC Educational Resources Information Center
Nolet, Victor; Tindal, Gerald
Valid interpretation of test scores is the shared responsibility of the test designer and the test user. Test publishers must provide evidence of the validity of the decisions their tests are intended to support, while test users are responsible for analyzing this evidence and subsequently using the test in the manner indicated by the publisher.…
The Trunk Impairment Scale - modified to ordinal scales in the Norwegian version.
Gjelsvik, Bente; Breivik, Kyrre; Verheyden, Geert; Smedal, Tori; Hofstad, Håkon; Strand, Liv Inger
2012-01-01
To translate the Trunk Impairment Scale (TIS), a measure of trunk control in patients after stroke, into Norwegian (TIS-NV), and to explore its construct validity, internal consistency, intertester and test-retest reliability. TIS was translated according to international guidelines. The validity study was performed on data from 201 patients with acute stroke. Fifty patients with stroke and acquired brain injury were recruited to examine intertester and test-retest reliability. Construct validity was analyzed with exploratory and confirmatory factor analysis and item response theory, internal consistency with Cronbach's alpha test, and intertester and test-retest reliability with kappa and intraclass correlation coefficient tests. The back-translated version of TIS-NV was validated by the original developer. The subscale Static sitting balance was removed. By combining items from the subscales Dynamic sitting balance and Coordination, six ordinal superitems (testlets) were constructed. The TIS-NV was renamed the modified TIS-NV (TIS-modNV). After modifications the TIS-modNV fitted well to a locally dependent unidimensional item response theory model. It demonstrated good construct validity, excellent internal consistency, and high intertester and test-retest reliability for the total score. This study supports that the TIS-modNV is a valid and reliable scale for use in clinical practice and research.
Illustrating a Mixed-Method Approach for Validating Culturally Specific Constructs
ERIC Educational Resources Information Center
Hitchcock, J.H.; Nastasi, B.K.; Dai, D.Y.; Newman, J.; Jayasena, A.; Bernstein-Moore, R.; Sarkar, S.; Varjas, K.
2005-01-01
The purpose of this article is to illustrate a mixed-method approach (i.e., combining qualitative and quantitative methods) for advancing the study of construct validation in cross-cultural research. The article offers a detailed illustration of the approach using the responses 612 Sri Lankan adolescents provided to an ethnographic survey. Such…
Heald, Alison E; Fudman, Edward J; Anklesaria, Pervin; Mease, Philip J
2010-05-01
To assess the validity, responsiveness, and reliability of single-joint outcome measures for determining target joint (TJ) response in patients with inflammatory arthritis. Patient-reported outcomes (PRO), consisting of responses to single questions about TJ global status on a 100-mm visual analog scale (VAS; TJ global score), function on a 100-mm VAS (TJ function score), and pain on a 5-point Likert scale (TJ pain score) were piloted in 66 inflammatory arthritis subjects in a phase 1/2 clinical study of an intraarticular gene transfer agent and compared to physical examination measures (TJ swelling, TJ tenderness) and validated function questionnaires (Disabilities of the Arm, Shoulder and Hand scale, Rheumatoid Arthritis Outcome Score, and the Health Assessment Questionnaire). Construct validity was assessed by evaluating the correlation between the single-joint outcome measures and validated function questionnaires using Spearman's rank correlation. Responsiveness or sensitivity to change was assessed through calculating effect size and standardized response means (SRM). Reliability of physical examination measures was assessed by determining interobserver agreement. The single-joint PRO were highly correlated with each other and correlated well with validated functional measures. The TJ global score exhibited modest effect size and modest SRM that correlated well with the patient's assessment of response on a 100-mm VAS. Physical examination measures exhibited high interrater reliability, but correlated less well with validated functional measures and the patient's assessment of response. Single-joint PRO, particularly the TJ global score, are simple to administer and demonstrate construct validity and responsiveness in patients with inflammatory arthritis. (ClinicalTrials.gov identifier NCT00126724).
Personality Constructs and Measures
ERIC Educational Resources Information Center
Teglasi, Hedwig; Simcox, April G.; Kim, Na-Young
2007-01-01
A psychological construct, such as personality, is an abstraction that is not directly seen but inferred through observed regularities in cognitive, affective, and behavioral responses in various settings. Two assumptions give meaning to the idea of construct validity. First, constructs represent real phenomena that exist apart from the potential…
Individualism: a valid and important dimension of cultural differences between nations.
Schimmack, Ulrich; Oishi, Shigehiro; Diener, Ed
2005-01-01
Oyserman, Coon, and Kemmelmeier's (2002) meta-analysis suggested problems in the measurement of individualism and collectivism. Studies using Hofstede's individualism scores show little convergent validity with more recent measures of individualism and collectivism. We propose that the lack of convergent validity is due to national differences in response styles. Whereas Hofstede statistically controlled for response styles, Oyserman et al.'s meta-analysis relied on uncorrected ratings. Data from an international student survey demonstrated convergent validity between Hofstede's individualism dimension and horizontal individualism when response styles were statistically controlled, whereas uncorrected scores correlated highly with the individualism scores in Oyserman et al.'s meta-analysis. Uncorrected horizontal individualism scores and meta-analytic individualism scores did not correlate significantly with nations' development, whereas corrected horizontal individualism scores and Hofstede's individualism dimension were significantly correlated with development. This pattern of results suggests that individualism is a valid construct for cross-cultural comparisons, but that the measurement of this construct needs improvement.
Construct Validity in TOEFL iBT Speaking Tasks: Insights from Natural Language Processing
ERIC Educational Resources Information Center
Kyle, Kristopher; Crossley, Scott A.; McNamara, Danielle S.
2016-01-01
This study explores the construct validity of speaking tasks included in the TOEFL iBT (e.g., integrated and independent speaking tasks). Specifically, advanced natural language processing (NLP) tools, MANOVA difference statistics, and discriminant function analyses (DFA) are used to assess the degree to which and in what ways responses to these…
Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling
2017-12-01
Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ 2 /df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, p<0.001). The newly developed DRSES has proven good reliability and validity. It could therefore be used as an assessment tool to evaluate self-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.
Validation of Automated Scoring of Science Assessments
ERIC Educational Resources Information Center
Liu, Ou Lydia; Rios, Joseph A.; Heilman, Michael; Gerard, Libby; Linn, Marcia C.
2016-01-01
Constructed response items can both measure the coherence of student ideas and serve as reflective experiences to strengthen instruction. We report on new automated scoring technologies that can reduce the cost and complexity of scoring constructed-response items. This study explored the accuracy of c-rater-ML, an automated scoring engine…
Kohonen Self-Organizing Maps in Validity Maintenance for Automated Scoring of Constructed Response.
ERIC Educational Resources Information Center
Williamson, David M.; Bejar, Isaac I.
As the automated scoring of constructed responses reaches operational status, monitoring the scoring process becomes a primary concern, particularly if automated scoring is intended to operate completely unassisted by humans. Using actual candidate selections from the Architectural Registration Examination (n=326), this study uses Kohonen…
Validation of the Mayo Hip Score: construct validity, reliability and responsiveness to change.
Singh, Jasvinder A; Schleck, Cathy; Harmsen, W Scott; Lewallen, David G
2016-01-19
Previous studies have provided the initial evidence for construct validity and test-retest reliability of the Mayo Hip Score. Instruments used for Total Hip Arthroplasty (THA) outcomes assessment should be valid, reliable and responsive to change. Our main objective was to examine the responsiveness to change, association with subsequent revision and the construct validity of the Mayo hip score. Discriminant ability was assessed by calculating effect size (ES), standardized response mean (SRM) and Guyatt's responsiveness index (GRI). Minimal clinically important difference (MCII) and moderate improvement thresholds were calculated. We assessed construct validity by examining association of scores with preoperative patient characteristics and correlation with Harris hip score, and assessed association of scores with the risk of subsequent revision. Five thousand three hundred seven provided baseline data; of those with baseline data, 2,278 and 2,089 (39%) provided 2- and 5-year data, respectively. Large ES, SRM and GRI ranging 2.66-2.78, 2.42-2.61 and 1.67-1.88 were noted for Mayo hip scores with THA, respectively. The MCII and moderate improvement thresholds were 22.4-22.7 and 39.4-40.5 respectively. Hazard ratios of revision surgery were higher with lower final score or less improvement in Mayo hip score at 2-years and borderline significant/non-significant at 5-years, respectively: (1) score ≤55 with hazard ratios of 2.24 (95% CI, 1.45, 3.46; p = 0.0003) and 1.70 (95% CI, 1.00, 2.92; p = 0.05) of implant revision subsequently, compared to 72-80 points; (2) no improvement or worsening score with hazard ratios 3.94 (95% CI, 1.50, 10.30; p = 0.005) and 2.72 (95% CI, 0.85,8.70; p = 0.09), compared to improvement >50-points. Mayo hip score had significant positive correlation with younger age, male gender, lower BMI, lower ASA class and lower Deyo-Charlson index (p ≤ 0.003 for each) and with Harris hip scores (p < 0.001). Mayo Hip Score is valid, sensitive to change and associated with future risk of revision surgery in patients with primary THA.
The reliability and validity of the SF-8 with a conflict-affected population in northern Uganda.
Roberts, Bayard; Browne, John; Ocaka, Kaducu Felix; Oyok, Thomas; Sondorp, Egbert
2008-12-02
The SF-8 is a health-related quality of life instrument that could provide a useful means of assessing general physical and mental health amongst populations affected by conflict. The purpose of this study was to test the validity and reliability of the SF-8 with a conflict-affected population in northern Uganda. A cross-sectional multi-staged, random cluster survey was conducted with 1206 adults in camps for internally displaced persons in Gulu and Amuru districts of northern Uganda. Data quality was assessed by analysing the number of incomplete responses to SF-8 items. Response distribution was analysed using aggregate endorsement frequency. Test-retest reliability was assessed in a separate smaller survey using the intraclass correlation test. Construct validity was measured using principal component analysis, and the Pearson Correlation test for item-summary score correlation and inter-instrument correlations. Known groups validity was assessed using a two sample t-test to evaluates the ability of the SF-8 to discriminate between groups known to have, and not have, physical and mental health problems. The SF-8 showed excellent data quality. It showed acceptable item response distribution based upon analysis of aggregate endorsement frequencies. Test-retest showed a good intraclass correlation of 0.61 for PCS and 0.68 for MCS. The principal component analysis indicated strong construct validity and concurred with the results of the validity tests by the SF-8 developers. The SF-8 also showed strong construct validity between the 8 items and PCS and MCS summary score, moderate inter-instrument validity, and strong known groups validity. This study provides evidence on the reliability and validity of the SF-8 amongst IDPs in northern Uganda.
The reliability and validity of the SF-8 with a conflict-affected population in northern Uganda
Roberts, Bayard; Browne, John; Ocaka, Kaducu Felix; Oyok, Thomas; Sondorp, Egbert
2008-01-01
Background The SF-8 is a health-related quality of life instrument that could provide a useful means of assessing general physical and mental health amongst populations affected by conflict. The purpose of this study was to test the validity and reliability of the SF-8 with a conflict-affected population in northern Uganda. Methods A cross-sectional multi-staged, random cluster survey was conducted with 1206 adults in camps for internally displaced persons in Gulu and Amuru districts of northern Uganda. Data quality was assessed by analysing the number of incomplete responses to SF-8 items. Response distribution was analysed using aggregate endorsement frequency. Test-retest reliability was assessed in a separate smaller survey using the intraclass correlation test. Construct validity was measured using principal component analysis, and the Pearson Correlation test for item-summary score correlation and inter-instrument correlations. Known groups validity was assessed using a two sample t-test to evaluates the ability of the SF-8 to discriminate between groups known to have, and not have, physical and mental health problems. Results The SF-8 showed excellent data quality. It showed acceptable item response distribution based upon analysis of aggregate endorsement frequencies. Test-retest showed a good intraclass correlation of 0.61 for PCS and 0.68 for MCS. The principal component analysis indicated strong construct validity and concurred with the results of the validity tests by the SF-8 developers. The SF-8 also showed strong construct validity between the 8 items and PCS and MCS summary score, moderate inter-instrument validity, and strong known groups validity. Conclusion This study provides evidence on the reliability and validity of the SF-8 amongst IDPs in northern Uganda. PMID:19055716
On Validity Theory and Test Validation
ERIC Educational Resources Information Center
Sireci, Stephen G.
2007-01-01
Lissitz and Samuelsen (2007) propose a new framework for conceptualizing test validity that separates analysis of test properties from analysis of the construct measured. In response, the author of this article reviews fundamental characteristics of test validity, drawing largely from seminal writings as well as from the accepted standards. He…
Imura, Tomoya; Takamura, Masahiro; Okazaki, Yoshihiro; Tokunaga, Satoko
2016-10-01
We developed a scale to measure time management and assessed its reliability and validity. We then used this scale to examine the impact of time management on psychological stress response. In Study 1-1, we developed the scale and assessed its internal consistency and criterion-related validity. Findings from a factor analysis revealed three elements of time management, “time estimation,” “time utilization,” and “taking each moment as it comes.” In Study 1-2, we assessed the scale’s test-retest reliability. In Study 1-3, we assessed the validity of the constructed scale. The results indicate that the time management scale has good reliability and validity. In Study 2, we performed a covariance structural analysis to verify our model that hypothesized that time management influences perceived control of time and psychological stress response, and perceived control of time influences psychological stress response. The results showed that time estimation increases the perceived control of time, which in turn decreases stress response. However, we also found that taking each moment as it comes reduces perceived control of time, which in turn increases stress response.
Moen, Vegard Pihl; Drageset, Jorunn; Eide, Geir Egil; Klokkerud, Mari; Gjesdal, Sturla
2017-02-01
The World Health Organization Disability Assessment Schedule (WHODAS) 2.0 is a generic instrument to assess disability covering six domains. The purpose of this study was to investigate the potential of the instrument for monitoring disability in specialized somatic rehabilitation by testing reliability, construct validity and responsiveness of WHODAS 2.0, Norwegian version, among patients with various health conditions. For taxonomy, terminology and definitions, the Consensus-based Standards for the Selection of Health Measurement Instruments were followed. Reproducibility was investigated by the intra-class correlation coefficient (ICC) in a randomly selected sample. Internal consistency was assessed by Cronbach's alpha. Construct validity was evaluated by correlations between WHODAS 2.0 and the Medical Outcomes Study 36-item Short Form, and fit of the hypothesized structure using confirmatory factor analysis (CFA). Responsiveness was evaluated in another randomly selected sample by testing a priori formulated hypotheses. Nine hundred seventy patients were included in the study. Reproducibility and responsiveness were evaluated in 53 and 104 patients, respectively. The ICC for the WHODAS 2.0 domains ranged from 0.63 to 0.84 and was 0.87 for total score. Cronbach's alpha for domains ranged from 0.75 to 0.94 and was 0.93 for total score. For construct validity, 6 of 12 expected correlations were confirmed and CFA did not achieve satisfactory fit indices. For responsiveness, 3 of 8 hypotheses were confirmed. The Norwegian version of WHODAS 2.0 showed moderate to satisfactory reliability and moderate validity in rehabilitation patients. However, the present study indicated possible limitations in terms of responsiveness.
ERIC Educational Resources Information Center
Wu, Pei-Chen; Huang, Tsai-Wei
2010-01-01
This study was to apply the mixed Rasch model to investigate person heterogeneity of Beck Depression Inventory-II-Chinese version (BDI-II-C) and its effects on dimensionality and construct validity. Person heterogeneity was reflected by two latent classes that differ qualitatively. Additionally, person heterogeneity adversely affected the…
ERIC Educational Resources Information Center
Egelhoff, Kelsey; Lane, Alison E.
2013-01-01
The Auditory Behavior Questionnaire (ABQ) evaluates abnormal behavioral responses to auditory stimulation in children with Autism Spectrum Disorder (ASD). This study reports preliminary reliability, construct validity and standardization of the ABQ. Parents of children with ASD aged 7-21 years (n = 165) completed the ABQ on-line. Cronbach's alpha…
Shaw, Amanda M; Rogge, Ronald D
2016-02-01
This study took a critical look at the construct of sexual quality. The 65 items of four well-validated self-report measures of sexual satisfaction (the Index of Sexual Satisfaction [ISS], Hudson, Harrison, & Crosscup, 1981; the Global Measure of Sexual Satisfaction [GMSEX], Lawrance & Byers, 1995; the Pinney Sexual Satisfaction Inventory [PSSI], Pinney, Gerrard, & Denney, 1987; the Young Sexual Satisfaction Scale [YSSS], Young, Denny, Luquis, & Young, 1998) and an additional 74 potential sexual quality items were given to 3060 online participants. Using Item Response Theory (IRT), we demonstrated that the ISS, YSSS, and PSSI scales provided suboptimal levels of precision in assessing sexual quality, particularly given the length of those scales. Exploratory factor analyses, IRT, differential item functioning analyses, and longitudinal responsiveness analyses were used to develop and evaluate the Quality of Sex Inventory. Results suggested that, in comparison to existing scales, the QSI (1) offers investigators and clinicians more theoretically focused scales, (2) distinguishes sexual satisfaction from sexual dissatisfaction, and (3) offers greater precision and power for detecting differences with (4) comparably high levels of responsiveness for detecting change over time despite being notably shorter than most of the existing scales. The QSI-satisfaction subscales demonstrated strong convergent validity with other measures of sexual satisfaction and excellent construct validity with anchor scales from the nomological net surrounding that construct, suggesting that they continue to assess the same theoretical construct as prior scales. Implications for research are discussed.
Measurement Properties of Two Innovative Item Formats in a Computer-Based Test
ERIC Educational Resources Information Center
Wan, Lei; Henly, George A.
2012-01-01
Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…
The Academic Resilience Scale (ARS-30): A New Multidimensional Construct Measure.
Cassidy, Simon
2016-01-01
Resilience is a psychological construct observed in some individuals that accounts for success despite adversity. Resilience reflects the ability to bounce back, to beat the odds and is considered an asset in human characteristic terms. Academic resilience contextualizes the resilience construct and reflects an increased likelihood of educational success despite adversity. The paper provides an account of the development of a new multidimensional construct measure of academic resilience. The 30 item Academic Resilience Scale (ARS-30) explores process-as opposed to outcome-aspects of resilience, providing a measure of academic resilience based on students' specific adaptive cognitive-affective and behavioral responses to academic adversity. Findings from the study involving a sample of undergraduate students ( N = 532) demonstrate that the ARS-30 has good internal reliability and construct validity. It is suggested that a measure such as the ARS-30, which is based on adaptive responses, aligns more closely with the conceptualisation of resilience and provides a valid construct measure of academic resilience relevant for research and practice in university student populations.
Reinders, Marcel E; Blankenstein, Annette H; Knol, Dirk L; de Vet, Henrica C W; van Marwijk, Harm W J
2009-08-01
A focus on the communicator competency is considered to be an important requirement to help physicians to acquire consultation skills. A feedback questionnaire, in which patients assess consultation skills might be a useful learning tool. An existing questionnaire on patient perception of patient-centeredness (PPPC) was adapted to cover the 'communicator' items in the competency profile. We assessed the face and content validity, the construct validity and the internal consistency of this new patient feedback on consultation skills (PFC) questionnaire. We assessed the face validity of the PFC by interviewing patients and general practice trainees (GPTs) during the developmental process. The content validity was determined by experts (n=10). First-year GPTs (23) collected 222 PFCs, from which the data were used to assess the construct validity (factor analysis), internal consistency, response rates and ceiling effects. The PFC adequately covers the corresponding 'communicator' competency (face and content validity). Factor analysis showed a one-dimensional construct. The internal consistency was high (Cronbach's alpha 0.89). For the single items, the response rate varied from 89.2% to 100%; the maximum score (ceiling effect) varied from 45.5% to 89.2%. The PFC appears to be a valid, internally consistent instrument. The PFC may be a valuable learning tool with which GPTs, other physicians and medical students can acquire feedback from patients regarding their consultation skills.
[Design and validation of a questionnaire for psychosocial nursing diagnosis in Primary Care].
Brito-Brito, Pedro Ruymán; Rodríguez-Álvarez, Cristobalina; Sierra-López, Antonio; Rodríguez-Gómez, José Ángel; Aguirre-Jaime, Armando
2012-01-01
To develop a valid, reliable and easy-to-use questionnaire for a psychosocial nursing diagnosis. The study was performed in two phases: first phase, questionnaire design and construction; second phase, validity and reliability tests. A bank of items was constructed using the NANDA classification as a theoretical framework. Each item was assigned a Likert scale or dichotomous response. The combination of responses to the items constituted the diagnostic rules to assign up to 28 labels. A group of experts carried out the validity test for content. Other validated scales were used as reference standards for the criterion validity tests. Forty-five nurses provided the questionnaire to the patients on three separate occasions over a period of three weeks, and the other validated scales only once to 188 randomly selected patients in Primary Care centres in Tenerife (Spain). Validity tests for construct confirmed the six dimensions of the questionnaire with 91% of total variance explained. Validity tests for criterion showed a specificity of 66%-100%, and showed high correlations with the reference scales when the questionnaire was assigning nursing diagnoses. Reliability tests showed agreement of 56%-91% (P<.001), and a 93% internal consistency. The Questionnaire for Psychosocial Nursing Diagnosis was called CdePS, and included 61 items. The CdePS is a valid, reliable and easy-to-use tool in Primary Care centres to improve the assigning of a psychosocial nursing diagnosis. Copyright © 2011 Elsevier España, S.L. All rights reserved.
Negahban, Hossein; Mohtasebi, Elham; Goharpey, Shahin
2015-01-01
The aim of this methodological study was to cross-culturally translate the Shoulder Activity Scale (SAS) into the Persian and determine its clinimetric properties including reliability, validity, and responsiveness in patients with shoulder disorders. Persian version of the SAS was obtained after standard forward-backward translation. Three questionnaires were completed by the respondents: SAS, shoulder pain and disability index (SPADI), and Short-Form 36 Health Survey (SF-36). The patients completed the SAS, 1 week after the first visit to evaluate the test-retest reliability. Construct validity was evaluated by examining the associations between the scores on the SAS and the scores obtained from the SPADI, SF-36, and age of the patients. To assess responsiveness, data were collected in the first visit and then again after 4 weeks physiotherapy intervention. Test-retest reliability and internal consistency were assessed using Intra-class Correlation Coefficient (ICC) and Cronbach's alpha, respectively. To evaluate construct validity, Spearman's rank correlation was used. The ability of the SAS to detect changes was evaluated by the receiver-operating characteristics method. No problem or language difficulties were reported during translation process. Test-retest reliability of the SAS was excellent with an ICC of 0.98. Also, the marginal Cronbach's alpha level of 0.64 was obtained. The correlation between the SAS and the SPADI was low, proving divergent validity, whereas the correlations between the SAS and the SF-36/age were moderate proving convergent validity. A marginally acceptable responsiveness was achieved for the Persian SAS. The study provides some evidences to support the test-retest reliability, internal consistency, construct validity, and responsiveness of the Persian version of the SAS in patients with shoulder disorders. Therefore, it seems that this instrument is a useful measure of shoulder activity level in research setting and clinical practice. The shoulder activity scale (SAS) is a reliable, valid, and responsive measure of shoulder activity level in Persian-speaking patients with different shoulder disorders. The results on clinimetric properties of the Persian SAS are comparable with its original, English version. Persian version of the SAS can be used in "clinical" and "research" settings of patients with shoulder disorders.
Measuring Standards in Primary English: The Validity of PIRLS--A Response to Mary Hilton
ERIC Educational Resources Information Center
Whetton, Chris; Twist, Liz; Sainsbury, Marian
2007-01-01
Hilton (2006) criticises the PIRLS (Progress in International Reading Literacy Study) tests and the survey conduct, raising questions about the validity of international surveys of reading. Her criticisms fall into four broad areas: cultural validity, methodological issues, construct validity and the survey in England. However, her criticisms are…
Construction and validation of forms: systematization of the care of people under hemodialysis.
Arreguy-Sena, Cristina; Marques, Tais de Oliveira; Souza, Luciene Carnevale de; Alvarenga-Martins, Nathália; Krempser, Paula; Braga, Luciene Muniz; Parreira, Pedro Miguel Dos Santos Dinis
2018-01-01
create and validate forms to subsidize the systematization of nursing care with people on hemodialysis. institutional case study to support the systematization of assistance from the construction of forms for data collection, diagnoses, interventions and nursing results, using cross-mapping, Risner's reasoning, Neuman's theory, taxonomies of diagnoses, interventions and nursing results with application in clinical practice and validation by focal group with specialist nurses. 18 people on hemodialysis and 7 nurses participated. Consensus content of form matter with specialist nurses in the area (Crombach 0.86). The papers captured 43 diagnoses, 26 interventions and 78 nursing results depicting human responses in their singularities. the validated forms fill a gap by enabling the capture of human responses from people on hemodialysis and by subsidizing the planning of nursing care on a scientific basis.
Heald, S L; Riddle, D L; Lamb, R L
1997-10-01
The purposes of this study were (1) to assess the construct validity of the Shoulder Pain and Disability Index (SPADI) and (2) to determine whether the SPADI is more responsive than the Sickness Impact Profile (SIP), a generic health status measure. The sample consisted of 94 patients who were diagnosed with a shoulder problem and referred to six outpatient physical therapy clinics. Clinically meaningful change was determined by use of an ordinal rating scale designed to determine whether the patient's shoulder function was improved, the same, or worse following treatment. Spearman rho correlations were calculated for the initial visit SPADI and SIP scores. The standardized response mean (SRM) was used to measure responsiveness for the patients who were judged to be improved. One-tailed paired t tests (alpha = .01) were used to determine whether differences existed among SRM values. Correlations between the SPADI and SIP scores ranged from r = .01 to r = .57. The SRM value was higher for the SPADI total score (SRM = 1.38) than for the SIP total score (SRM = 0.79). Most correlations between SPADI and SIP scores provided support for the construct validity of the SPADI. The SPADI does not appear to strongly reflect occupational and recreational disability and is more responsive than the SIP.
ERIC Educational Resources Information Center
Polat, Ozgul; Dagal, Asude B.
2013-01-01
This study is aimed at developing a scale (Parents' Evaluation of Responsible Behaviors of 5-6 Year Old Children) for measuring parents' evaluation of their 5-6 year-old children's responsible behaviors. The construct validity of the scale was tested by Factor Analysis. Factor analysis determined that the scale can be clustered under 10 factors.…
ERIC Educational Resources Information Center
Fuchs, Douglas; Compton, Donald L.; Fuchs, Lynn S.; Bouton, Bobette; Caffrey, Erin
2011-01-01
The purpose of this study was to examine the construct and predictive validity of a dynamic assessment (DA) of decoding learning. Students (N = 318) were assessed in the fall of first grade on an array of instruments that were given in hopes of forecasting responsiveness to reading instruction. These instruments included DA as well as…
A Validity-Based Approach to Quality Control and Assurance of Automated Scoring
ERIC Educational Resources Information Center
Bejar, Isaac I.
2011-01-01
Automated scoring of constructed responses is already operational in several testing programmes. However, as the methodology matures and the demand for the utilisation of constructed responses increases, the volume of automated scoring is likely to increase at a fast pace. Quality assurance and control of the scoring process will likely be more…
Measuring Constructs in Family Science: How Can Item Response Theory Improve Precision and Validity?
ERIC Educational Resources Information Center
Gordon, Rachel A.
2015-01-01
This article provides family scientists with an understanding of contemporary measurement perspectives and the ways in which item response theory (IRT) can be used to develop measures with desired evidence of precision and validity for research uses. The article offers a nontechnical introduction to some key features of IRT, including its…
ERIC Educational Resources Information Center
Hopfenbeck, Therese N.; Maul, Andrew
2011-01-01
The aim of this study was to investigate response-process based evidence for the validity of the Programme for International Student Assessment's (PISA) self-report questionnaire scales as measures of specific psychological constructs, with a focus on scales meant to measure inclination toward specific learning strategies. Cognitive interviews (N…
Singh, Amika S; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Vik, Froydis N; van Lippevelde, Wendy; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; van der Sluijs, Maria; Terwee, Caroline; Brug, Johannes
2012-08-13
Insight in parental energy balance-related behaviours, their determinants and parenting practices are important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10-12 year old children. We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10-12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement.All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 29 items, construct validity was moderate for 24 and poor for 5 items. The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel
2017-06-15
Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.
Mercier, Catherine; Roche, Sylvain; Gaillard, Ségolène; Kassai, Behrouz; Arzimanoglou, Alexis; Herbillon, Vania; Roy, Pascal; Rheims, Sylvain
2016-05-01
Attention deficit hyperactivity disorder (ADHD) is a well-known comorbidity in children with epilepsy. In English-speaking countries, the scores of the original ADHD-rating scale IV are currently used as main outcomes in various clinical trials in children with epilepsy. In French-speaking countries, several French versions are in use though none has been fully validated yet. We sought here for a partial validation of a French version of the ADHD-RS IV regarding construct validity, internal consistency (i.e., scale reliability), item reliability, and responsiveness in a group of French children with ADHD and epilepsy. The study involved 167 children aged 6-15years in 10 French neuropediatric units. The factorial structure and item reliability were assessed with a confirmatory factorial analysis for ordered categorical variables. The dimensions' internal consistency was assessed with Guttman's lambda 6 coefficient. The responsiveness was assessed by the change in score under methylphenidate and in comparison with a control group. The results confirmed the original two-dimensional factorial structure (inattention, hyperactivity/impulsivity) and showed a satisfactory reliability of most items, a good dimension internal consistency, and a good responsiveness of the total score and the two subscores. The studied French version of the ADHD-RS IV is thus validated regarding construct validity, reliability, and responsiveness. It can now be used in French-speaking countries in clinical trials of treatments involving children with ADHD and epilepsy. The full validation requires further investigations. Copyright © 2016 Elsevier Inc. All rights reserved.
Toward a Measure of Accountability in Nursing: A Three-Stage Validation Study.
Drach-Zahavy, Anat; Leonenko, Marina; Srulovici, Einav
2018-06-04
To develop and psychometrically evaluate a three-dimensional questionnaire suitable for evaluating personal and organizational accountability in nurses. Accountability is defined as a three-dimensional value, directing professionals to take responsibility for their decisions and actions, to be willing to explain them (transparency) and to be judged according to society's accepted values (answerability). Despite the relatively clear definition, measurement of accountability lags well behind. Existing self-report questionnaires do not fully capture the complexity of the concept; nor do they capture the different sources of accountability (e.g., personal accountability, organizational accountability). A three-stage measure development. Data were collected during 2015-2016. In Phase 1, an initial database of items (N = 74) was developed, based on literature review and qualitative study, establishing face and content validity. In Phase 2, the face, content, construct and criterion-related validity of the initial questionnaires (19 items for personal and organizational accountability questionnaire) was established with a sample of 229 nurses. In Phase 3, the final questionnaires (19 items each) were validated with a new sample of 329 nurses and established construct validity. The final version of the instruments comprised 19 items, suitable for assessing personal and organizational accountability. The questionnaire referred to the dimensions of responsibility, transparency and answerability. The findings established the instrument's content, construct and criterion-related validity, as well as good internal reliability. The questionnaire portrays accountability in nursing, by capturing nurses' subjective perceptions of accountability dimensions (responsibility, transparency, answerability), as demonstrated by personal and organizational values. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Crins, Martine H. P.; Roorda, Leo D.; Smits, Niels; de Vet, Henrica C. W.; Westhovens, Rene; Cella, David; Cook, Karon F.; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B.
2015-01-01
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach’s alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach’s alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed. PMID:26214178
Crins, Martine H P; Roorda, Leo D; Smits, Niels; de Vet, Henrica C W; Westhovens, Rene; Cella, David; Cook, Karon F; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B
2015-01-01
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach's alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach's alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed.
Construct validation of an interactive digital algorithm for ostomy care.
Beitz, Janice M; Gerlach, Mary A; Schafer, Vickie
2014-01-01
The purpose of this study was to evaluate construct validity for a previously face and content validated Ostomy Algorithm using digital real-life clinical scenarios. A cross-sectional, mixed-methods Web-based survey design study was conducted. Two hundred ninety-seven English-speaking RNs completed the study; participants practiced in both acute care and postacute settings, with 1 expert ostomy nurse (WOC nurse) and 2 nonexpert nurses. Following written consent, respondents answered demographic questions and completed a brief algorithm tutorial. Participants were then presented with 7 ostomy-related digital scenarios consisting of real-life photos and pertinent clinical information. Respondents used the 11 assessment components of the digital algorithm to choose management options. Participant written comments about the scenarios and the research process were collected. The mean overall percentage of correct responses was 84.23%. Mean percentage of correct responses for respondents with a self-reported basic ostomy knowledge was 87.7%; for those with a self-reported intermediate ostomy knowledge was 85.88% and those who were self-reported experts in ostomy care achieved 82.77% correct response rate. Five respondents reported having no prior ostomy care knowledge at screening and achieved an overall 45.71% correct response rate. No negative comments regarding the algorithm were recorded by participants. The new standardized Ostomy Algorithm remains the only face, content, and construct validated digital clinical decision instrument currently available. Further research on application at the bedside while tracking patient outcomes is warranted.
Translation and validation of the German version of the Bournemouth Questionnaire for Neck Pain.
Soklic, Marina; Peterson, Cynthia; Humphreys, B Kim
2012-01-25
Clinical outcome measures are important tools to monitor patient improvement during treatment as well as to document changes for research purposes. The short-form Bournemouth questionnaire for neck pain patients (BQN) was developed from the biopsychosocial model and measures pain, disability, cognitive and affective domains. It has been shown to be a valid and reliable outcome measure in English, French and Dutch and more sensitive to change compared to other questionnaires. The purpose of this study was to translate and validate a German version of the Bournemouth questionnaire for neck pain patients. German translation and back translation into English of the BQN was done independently by four persons and overseen by an expert committee. Face validity of the German BQN was tested on 30 neck pain patients in a single chiropractic practice. Test-retest reliability was evaluated on 31 medical students and chiropractors before and after a lecture. The German BQN was then assessed on 102 first time neck pain patients at two chiropractic practices for internal consistency, external construct validity, external longitudinal construct validity and sensitivity to change compared to the German versions of the Neck Disability Index (NDI) and the Neck Pain and Disability Scale (NPAD). Face validity testing lead to minor changes to the German BQN. The Intraclass Correlation Coefficient for the test-retest reliability was 0.99. The internal consistency was strong for all 7 items of the BQN with Cronbach α's of .79 and .80 for the pre and post-treatment total scores. External construct validity and external longitudinal construct validity using Pearson's correlation coefficient showed statistically significant correlations for all 7 scales of the BQN with the other questionnaires. The German BQN showed greater responsiveness compared to the other questionnaires for all scales. The German BQN is a valid and reliable outcome measure that has been successfully translated and culturally adapted. It is shorter, easier to use, and more responsive to change than the NDI and NPAD.
Cross-cultural construct validity study of professionalism of Vietnamese medical students.
Nhan, Vo Thanh; Violato, Claudio; Le An, Pham; Beran, Tanya N
2014-01-01
Although many studies have made efforts to define and assess medical professionalism, few have addressed issues of construct validity. The purpose of this article is to explore further construct validity of medical professionalism employing exploratory and confirmatory factor analysis. The 32-item instrument by the American Board of Internal Medicine (ABIM) was adapted to assess the perceptions on medical professionalism of Vietnamese medical students. A sample of 1,196 (487 first-year, 341 third-year, 368 sixth-year) medical students participated voluntarily in the completion of the instrument. The data were randomly divided into three samples to assess the construct validity of medical professionalism by empirically deriving and confirming a model of professionalism. Exploratory and confirmatory factor analytic techniques resulted in a six-factor well-fitting model with a comparative fit index of .963 and root mean square error approximation of .029, 90% confidence interval [016, .039]: integrity, social responsibility, professional practice habits, ensuring quality care, altruism, and self-awareness. Social responsibility was perceived least important, and self-awareness was perceived most important by Vietnamese medical students. These constructs of medical professionalism were relatively similar with those found in Taiwanese medical students and the ABIM definitions but with some Vietnamese cultural differences. Although the results confirm that medical professionalism is a somewhat culturally sensitive construct, it nonetheless has many elements of medical professionalism that are universal. Future research should be conducted to test the generalizability of our six-factor model of professionalism with various samples (e.g., residents, physicians), cultures, and language groups.
The Academic Resilience Scale (ARS-30): A New Multidimensional Construct Measure
Cassidy, Simon
2016-01-01
Resilience is a psychological construct observed in some individuals that accounts for success despite adversity. Resilience reflects the ability to bounce back, to beat the odds and is considered an asset in human characteristic terms. Academic resilience contextualizes the resilience construct and reflects an increased likelihood of educational success despite adversity. The paper provides an account of the development of a new multidimensional construct measure of academic resilience. The 30 item Academic Resilience Scale (ARS-30) explores process—as opposed to outcome—aspects of resilience, providing a measure of academic resilience based on students’ specific adaptive cognitive-affective and behavioral responses to academic adversity. Findings from the study involving a sample of undergraduate students (N = 532) demonstrate that the ARS-30 has good internal reliability and construct validity. It is suggested that a measure such as the ARS-30, which is based on adaptive responses, aligns more closely with the conceptualisation of resilience and provides a valid construct measure of academic resilience relevant for research and practice in university student populations. PMID:27917137
Singh, Amika S; Vik, Froydis N; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Verloigne, Maïté; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; Martens, Marloes; Brug, Johannes
2011-12-09
Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
2011-01-01
Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Sörman, Karolina; Nilsonne, Gustav; Howner, Katarina; Tamm, Sandra; Caman, Shilan; Wang, Hui-Xin; Ingvar, Martin; Edens, John F; Gustavsson, Petter; Lilienfeld, Scott O; Petrovic, Predrag; Fischer, Håkan; Kristiansson, Marianne
2016-01-01
Cross-cultural investigation of psychopathy measures is important for clarifying the nomological network surrounding the psychopathy construct. The Psychopathic Personality Inventory-Revised (PPI-R) is one of the most extensively researched self-report measures of psychopathic traits in adults. To date however, it has been examined primarily in North American criminal or student samples. To address this gap in the literature, we examined PPI-R's reliability, construct validity and factor structure in non-criminal individuals (N = 227) in Sweden, using a multimethod approach including psychophysiological correlates of empathy for pain. PPI-R construct validity was investigated in subgroups of participants by exploring its degree of overlap with (i) the Psychopathy Checklist: Screening Version (PCL:SV), (ii) self-rated empathy and behavioral and physiological responses in an experiment on empathy for pain, and (iii) additional self-report measures of alexithymia and trait anxiety. The PPI-R total score was significantly associated with PCL:SV total and factor scores. The PPI-R Coldheartedness scale demonstrated significant negative associations with all empathy subscales and with rated unpleasantness and skin conductance responses in the empathy experiment. The PPI-R higher order Self-Centered Impulsivity and Fearless Dominance dimensions were associated with trait anxiety in opposite directions (positively and negatively, respectively). Overall, the results demonstrated solid reliability (test-retest and internal consistency) and promising but somewhat mixed construct validity for the Swedish translation of the PPI-R.
Sörman, Karolina; Nilsonne, Gustav; Howner, Katarina; Tamm, Sandra; Caman, Shilan; Wang, Hui-Xin; Ingvar, Martin; Edens, John F.; Gustavsson, Petter; Lilienfeld, Scott O; Petrovic, Predrag; Fischer, Håkan; Kristiansson, Marianne
2016-01-01
Cross-cultural investigation of psychopathy measures is important for clarifying the nomological network surrounding the psychopathy construct. The Psychopathic Personality Inventory-Revised (PPI-R) is one of the most extensively researched self-report measures of psychopathic traits in adults. To date however, it has been examined primarily in North American criminal or student samples. To address this gap in the literature, we examined PPI-R’s reliability, construct validity and factor structure in non-criminal individuals (N = 227) in Sweden, using a multimethod approach including psychophysiological correlates of empathy for pain. PPI-R construct validity was investigated in subgroups of participants by exploring its degree of overlap with (i) the Psychopathy Checklist: Screening Version (PCL:SV), (ii) self-rated empathy and behavioral and physiological responses in an experiment on empathy for pain, and (iii) additional self-report measures of alexithymia and trait anxiety. The PPI-R total score was significantly associated with PCL:SV total and factor scores. The PPI-R Coldheartedness scale demonstrated significant negative associations with all empathy subscales and with rated unpleasantness and skin conductance responses in the empathy experiment. The PPI-R higher order Self-Centered Impulsivity and Fearless Dominance dimensions were associated with trait anxiety in opposite directions (positively and negatively, respectively). Overall, the results demonstrated solid reliability (test-retest and internal consistency) and promising but somewhat mixed construct validity for the Swedish translation of the PPI-R. PMID:27300292
Koller, Ingrid; Levenson, Michael R.; Glück, Judith
2017-01-01
The valid measurement of latent constructs is crucial for psychological research. Here, we present a mixed-methods procedure for improving the precision of construct definitions, determining the content validity of items, evaluating the representativeness of items for the target construct, generating test items, and analyzing items on a theoretical basis. To illustrate the mixed-methods content-scaling-structure (CSS) procedure, we analyze the Adult Self-Transcendence Inventory, a self-report measure of wisdom (ASTI, Levenson et al., 2005). A content-validity analysis of the ASTI items was used as the basis of psychometric analyses using multidimensional item response models (N = 1215). We found that the new procedure produced important suggestions concerning five subdimensions of the ASTI that were not identifiable using exploratory methods. The study shows that the application of the suggested procedure leads to a deeper understanding of latent constructs. It also demonstrates the advantages of theory-based item analysis. PMID:28270777
Signatures of Quality Teaching for Indigenous Students
ERIC Educational Resources Information Center
Boon, Helen J.; Lewthwaite, Brian E.
2016-01-01
This paper presents findings from the validation of a survey instrument constructed in response to what Indigenous parents/carers and students believe constitutes culturally responsive pedagogies that positively influence Indigenous student learning. Characteristics of culturally responsive pedagogies established through interviews with Australian…
Psychometric Evaluation of the Ford Insomnia Response to Stress Test (FIRST) in Early Pregnancy.
Gelaye, Bizu; Zhong, Qiu-Yue; Barrios, Yasmin V; Redline, Susan; Drake, Christopher L; Williams, Michelle A
2016-04-15
To evaluate the construct validity and factor structure of the Spanish-language version of the Ford Insomnia Response to Stress Test questionnaire (FIRST-S) when used in early pregnancy. A cohort of 647 women were interviewed at ≤ 16 weeks of gestation to collect information regarding lifestyle, demographic, and sleep characteristics. The factorial structure of the FIRST-S was tested through exploratory and confirmatory factor analyses (EFA and CFA). Internal consistency and construct validity were also assessed by evaluating the association between the FIRST-S with symptoms of depression, anxiety, and sleep quality. Item response theory (IRT) analyses were conducted to complement classical test theory (CTT) analytic approaches. The mean score of the FIRST-S was 13.8 (range: 9-33). The results of the EFA showed that the FIRST-S contained a one-factor solution that accounted for 69.8% of the variance. The FIRST-S items showed good internal consistency (Cronbach α = 0.81). CFA results corroborated the one-factor structure finding from the EFA; and yielded measures indicating goodness of fit (comparative fit index of 0.902) and accuracy (root mean square error of approximation of 0.057). The FIRST-S had good construct validity as demonstrated by statistically significant associations of FIRST-S scores with sleep quality, antepartum depression and anxiety symptoms. Finally, results from IRT analyses suggested excellent item infit and outfit measures. The FIRST-S was found to have good construct validity and internal consistency for assessing vulnerability to insomnia during early pregnancy. © 2016 American Academy of Sleep Medicine.
Singh, Varun Pratap; Singh, Rajkumar
2014-03-01
The aim of this study was to develop a reliable and valid Nepali version of the Psychosocial Impact of Dental Aesthetic Questionnaire (PIDAQ). Cross-sectional descriptive validation study. B.P. Koirala Institute of Health Sciences, Dharan, Nepal. A rigorous translation process including conceptual and semantic evaluation, translation, back translation and pre-testing was carried out. Two hundred and fifty-two undergraduates, including equal numbers of males and females with an age ranging from 18 to 29 years (mean age: 22·33±2·114 years), participated in this study. Reliability was assessed by Cronbach's alpha coefficient and the coefficient of correlation was used to assess correlation between items and test-retest reliability. The construct validity was tested by factorial analysis. Convergent construct validity was tested by comparison of PIDAQ scores with the aesthetic component of the index of orthodontic treatment needs (IOTN-AC) and perception of occlusion scale (POS), respectively. Discriminant construct validity was assessed by differences in score for those who demand treatment and those who did not. The response rate was 100%. One hundred and twenty-three individuals had a demand for orthodontic treatment. The Nepali PIDAQ had excellent reliability with Cronbach's alpha of 0·945, corrected item correlation between 0·525 and 0·790 and overall test-retest reliability of 0·978. The construct validity was good with formation of a new sub-domain 'Dental self-consciousness'. The scale had good correlation with IOTN-AC and POS fulfilling convergent construct validity. The discriminant construct validity was proved by significant differences in scores for subjects with demand and without demand for treatment. To conclude, Nepali version of PIDAQ has good psychometric properties and can be used effectively in this population group for further research.
[Validation of the German version of the Oxford Elbow Score : A cross-sectional study].
Marquardt, J; Schöttker-Königer, T; Schäfer, A
2016-08-01
Elbow complaints are complex problems leading to severe consequences for affected people and the healthcare system. The German version of the Oxford Elbow Score (OES) is the first German-speaking instrument that specifically measures elbow complaints from the patient's perspective and changes of their health status. The aim of this study is the validation of the German version of the OES. In this context the internal consistency and the construct validity were investigated. 59 patients with elbow complaints completed the German version of the OES, the DASH and the SF-36 in a cross-sectional study. The internal consistency was calculated with Cronbach's alpha coefficients. Spearman's correlation coefficients were used to confirm construct validity. Cronbach's alpha for pain, function and psychological subscales was 0.88, 0.81 and 0.90, respectively. The whole questionnaire presents a Cronbach's alpha value of 0.93. Convergent construct validity was confirmed with correlation coefficients containing values of -0.84, -0.77 and -0.82 compared to DASH and values ranging from 0.41 to 0.80 compared with the physical domains of the SF-36. The divergent construct validity presented values ranging from 0.07 to 0.20 with the SF-36 domains of "general health perception" and "mental health". The German OES is an internal consistent instrument with good convergent and divergent construct validity. Other aspects of the validity, the reliability and the responsiveness should be confirmed through further studies.
Questionnaire to assess patient satisfaction with pharmaceutical care in Spanish language.
Traverso, María Luz; Salamano, Mercedes; Botta, Carina; Colautti, Marisel; Palchik, Valeria; Pérez, Beatriz
2007-08-01
To develop and validate a questionnaire, in Spanish, for assessing patient satisfaction with pharmaceutical care received in community pharmacies. Selection and translation of questionnaire's items; definition of response scale and demographic questions. Evaluation of face and content validity, feasibility, factor structure, reliability and construct validity. Forty-one community pharmacies of the province of Santa Fe. Argentina. Questionnaire administered to patients receiving pharmaceutical care or traditional pharmacy services. Pilot test to assess feasibility. Factor analysis used principal components and varimax rotation. Reliability established using internal consistency with Cronbach's alpha. Construct validity determined with extreme group method. A self-administered questionnaire with 27 items, 5-point Likert response scale and demographic questions was designed considering multidimensional structure of patient satisfaction. Questionnaire evaluates cumulative experience of patients with comprehensive pharmaceutical care practice in community pharmacies. Two hundred and seventy-four complete questionnaires were obtained. Factor analysis resulted in three factors: Managing therapy, Interpersonal relationship and General satisfaction, with a cumulative variance of 62.51%. Cronbach's alpha for the whole questionnaire was 0.96, and 0.95, 0.88 and 0.76 for the three factors, respectively. Mann-Whitney test for construct validity did not showed significant differences between pharmacies that provide pharmaceutical care and those that do not, however, 23 items showed significant differences between the two groups of pharmacies. The questionnaire developed can be a reliable and valid instrument to assess patient satisfaction with pharmaceutical care in community pharmacies in Spanish. Further research is needed to deepen the validation process.
Stinson, Jennifer N; Jibb, Lindsay A; Nguyen, Cynthia; Nathan, Paul C; Maloney, Anne Marie; Dupuis, L Lee; Gerstle, J Ted; Hopyan, Sevan; Alman, Benjamin A; Strahlendorf, Caron; Portwine, Carol; Johnston, Donna L
2015-12-01
We evaluated the construct validity (including responsiveness), reliability, and feasibility of the Pain Squad multidimensional smartphone-based pain assessment application (app) in children and adolescents with cancer, using 2 descriptive studies with repeated measures. Participants (8-18 years) undergoing cancer treatment were drawn from 4 pediatric cancer centers. In study 1, 92 participants self-reported their level of pain twice daily for 2 weeks using the Pain Squad app to assess app construct validity and reliability. In study 2, 14 participants recorded their level of pain twice a day for 1 week before and 2 weeks after cancer-related surgery to determine app responsiveness. Participants in both studies completed multiple measures to determine the construct validity and feasibility of the Pain Squad app. Correlations between average weekly pain ratings on the Pain Squad app and recalled least, average, and worst weekly pain were moderate to high (0.43-0.68). Correlations with health-related quality of life and pain coping (measured with PedsQL Inventory 4.0, PedsQL Cancer Module, and Pain Coping Questionnaire) were -0.46 to 0.29. The app showed excellent internal consistency (α = 0.96). Pain ratings changed because of surgery with large effect sizes between baseline and the first week postsurgery (>0.85) and small effect sizes between baseline and the second week postsurgery (0.13-0.32). These findings provide evidence of the construct validity, reliability, and feasibility of the Pain Squad app in children and adolescents with cancer. Use of real-time data capture approaches should be considered in future studies of childhood cancer pain. A video accompanying this abstract is available online as Supplemental Digital Content at http://links.lww.com/PAIN/A169.
Validation Study of a Gatekeeping Attitude Index for Social Work Education
ERIC Educational Resources Information Center
Tam, Dora M. Y.; Coleman, Heather
2011-01-01
This article reports on a study designed to validate the Gatekeeping Attitude Index, a 14-item Likert scaling index. The authors collected data from a convenience sample of social work field instructors (N = 188) with a response rate of 74.0%. Construct validation by exploratory factor analysis identified a 2-factor solution on the index after…
ERIC Educational Resources Information Center
Bingham, Shereen G.; Burleson, Brant R.
1996-01-01
Develops and assesses the validity of a self-report measure of sexual harassment proclivities in men. Demonstrates the validity of the scale by its moderate correlations with attitude measures relevant to sexual harassment, its nonsignificant correlation with the need to provide socially desirable responses, and by showing that potential victims…
Li, Xiaochuan; Bai, Xuedong; Wu, Yaohong; Ruan, Dike
2016-03-15
To construct and validate a model to predict responsible nerve roots in lumbar degenerative disease with diagnostic doubt (DD). From January 2009-January 2013, 163 patients with DD were assigned to the construction (n = 106) or validation sample (n = 57) according to different admission times to hospital. Outcome was assessed according to the Japanese Orthopedic Association (JOA) recovery rate as excellent, good, fair, and poor. The first two results were considered as effective clinical outcome (ECO). Baseline patient and clinical characteristics were considered as secondary variables. A multivariate logistic regression model was used to construct a model with the ECO as a dependent variable and other factors as explanatory variables. The odds ratios (ORs) of each risk factor were adjusted and transformed into a scoring system. Area under the curve (AUC) was calculated and validated in both internal and external samples. Moreover, calibration plot and predictive ability of this scoring system were also tested for further validation. Patients with DD with ECOs in both construction and validation models were around 76 % (76.4 and 75.5 % respectively). more preoperative visual analog pain scale (VAS) score (OR = 1.56, p < 0.01), stenosis levels of L4/5 or L5/S1 (OR = 1.44, p = 0.04), stenosis locations with neuroforamen (OR = 1.95, p = 0.01), neurological deficit (OR = 1.62, p = 0.01), and more VAS improvement of selective nerve route block (SNRB) (OR = 3.42, p = 0.02). the internal area under the curve (AUC) was 0.85, and the external AUC was 0.72, with a good calibration plot of prediction accuracy. Besides, the predictive ability of ECOs was not different from the actual results (p = 0.532). We have constructed and validated a predictive model for confirming responsible nerve roots in patients with DD. The associated risk factors were preoperative VAS score, stenosis levels of L4/5 or L5/S1, stenosis locations with neuroforamen, neurological deficit, and VAS improvement of SNRB. A tool such as this is beneficial in the preoperative counseling of patients, shared surgical decision making, and ultimately improving safety in spine surgery.
Construct Validity of the Societal Outreach Scale (SOS).
Fike, David S; Denton, Jason; Walk, Matt; Kish, Jennifer; Gorman, Ira
2018-04-01
The American Physical Therapy Association (APTA) has been working toward a vision of increasing professional focus on societal-level health. However, performance of social responsibility and related behaviors by physical therapists remain relatively poorly integrated into practice. Promoting a focus on societal outreach is necessary for all health care professionals to impact the health of their communities. The objective was to document the validity of the 14-item Societal Outreach Scale (SOS) for use with practicing physical therapists. This study used a cross-sectional survey. The SOS was transmitted via email to all therapists who were licensed and practicing in 10 states in the United States that were purposefully selected to assure a broad representation. A sample of 2612 usable responses was received. Factor analysis was applied to assess construct validity of the instrument. Of alternate models, a 3-factor model best demonstrated goodness of fit with the sample data according to conventional indices (standardized root mean squared residual = .03, comparative fit index .96, root mean square error of approximation = .06). The 3 factors measured by the SOS were labeled Societal-Level Health Advocacy, Community Engagement/Social Integration, and Political Engagement. Internal consistency reliability was 0.7 for all factors. The 3-factor SOS demonstrated acceptable validity and reliability. Though the sample included a broad representation of physical therapists, this was a single cross-sectional study. Additional confirmatory factor analysis, reliability testing, and word refinement of the tool are warranted. Given the construct validity and reliability of the 3-factor SOS, it is recommended for use as a validated instrument to measure physical therapists' performance of social responsibility and related behaviors.
Confirmatory factor analysis of the Infant Feeding Styles Questionnaire in Latino families.
Wood, Charles T; Perreira, Krista M; Perrin, Eliana M; Yin, H Shonna; Rothman, Russell L; Sanders, Lee M; Delamater, Alan M; Bentley, Margaret E; Bronaugh, Andrea B; Thompson, Amanda L
2016-05-01
Parent feeding practices affect risk of obesity in children. Latino children are at higher risk of obesity than the general population, yet valid measure of feeding practices, one of which is the Infant Feeding Styles Questionnaire (IFSQ), have not been formally validated in Spanish. To validate the IFSQ among Latino families, we conducted confirmatory factor analysis of pressuring, restrictive, and responsive feeding constructs from the IFSQ. The IFSQ was administered at the 12-month visit in the Greenlight study, a multi-center cluster randomized trial to prevent obesity. Parents were included if they were of Latino origin (n = 303) and completed an English or Spanish language modified IFSQ (without the indulgence construct). Scores from nine sub-constructs of the IFSQ were compared between English and Spanish language versions. We tested reliability with Cronbach's alpha coefficients and performed confirmatory factor analysis to examine factor loadings and goodness of fit characteristics, modifying constructs to achieve best fit. Of 303 parents completing the IFSQ, 84% were born outside the US, and 74% completed the IFSQ in Spanish. Reliability coefficients ranged from 0.28 to 0.61 for the laissez-faire sub-constructs and from 0.58 to 0.83 for the pressuring, restrictive, and responsive sub-constructs. Results for all coefficients were similar between participants responding to an English and Spanish version of the IFSQ. Goodness of fit indices ranged from CFI 0.82-1 and RMSEA 0.00-0.31, and the model performed best in pressuring-soothing (CFI 1.0, RMSEA 0.00) and restrictive-amount (CFI 0.98, RMSEA 0.1) sub-constructs. In a sample of Latino families, pressuring, restrictive, and responsive constructs performed well. The modified IFSQ in both English and Spanish-speaking Latino families may be used to assess parenting behaviors related to early obesity risk in this at-risk population. Copyright © 2016 Elsevier Ltd. All rights reserved.
Confirmatory Factor Analysis of the Infant Feeding Styles Questionnaire in Latino Families
Wood, Charles T.; Perreira, Krista M.; Perrin, Eliana M.; Yin, H. Shonna; Rothman, Russell L.; Sanders, Lee M.; Delamater, Alan M.; Bentley, Margaret E.; Bronaugh, Andrea B.; Thompson, Amanda L.
2016-01-01
Background Parent feeding practices affect risk of obesity in children. Latino children are at higher risk of obesity than the general population, yet valid measure of feeding practices, one of which is the Infant Feeding Styles Questionnaire (IFSQ), have not been formally validated in Spanish. Objective To validate the IFSQ among Latino families, we conducted confirmatory factor analysis of pressuring, restrictive, and responsive feeding constructs from the IFSQ. Design/Methods The IFSQ was administered at the 12-month visit in the Greenlight study, a multi-center cluster randomized trial to prevent obesity. Parents were included if they were of Latino origin (n=303) and completed an English or Spanish language modified IFSQ (without the indulgence construct). Scores from nine sub-constructs of the IFSQ were compared between English and Spanish language versions. We tested reliability with Cronbach’s alpha coefficients and performed confirmatory factor analysis to examine factor loadings and goodness of fit characteristics, modifying constructs to achieve best fit. Results Of 303 parents completing the IFSQ, 84% were born outside the US, and 74% completed the IFSQ in Spanish. Reliability coefficients ranged from 0.28–0.61 for the laissez-faire sub-constructs and from 0.58–0.83 for the pressuring, restrictive, and responsive sub-constructs. Results for all coefficients were similar between participants responding to an English and Spanish version of the IFSQ. Goodness of fit indices ranged from CFI 0.82–1 and RMSEA 0.00–0.31, and the model performed best in pressuring-soothing (CFI 1.0, RMSEA 0.00) and restrictive-amount (CFI 0.98, RMSEA 0.1) sub-constructs. Conclusions In a sample of Latino families, pressuring, restrictive, and responsive constructs performed well. The modified IFSQ in both English and Spanish-speaking Latino families may be used to assess parenting behaviors related to early obesity risk in this at-risk population. PMID:26876910
Measuring Constructs in Family Science: How Can Item Response Theory Improve Precision and Validity?
Gordon, Rachel A.
2014-01-01
This article provides family scientists with an understanding of contemporary measurement perspectives and the ways in which item response theory (IRT) can be used to develop measures with desired evidence of precision and validity for research uses. The article offers a nontechnical introduction to some key features of IRT, including its orientation toward locating items along an underlying dimension and toward estimating precision of measurement for persons with different levels of that same construct. It also offers a didactic example of how the approach can be used to refine conceptualization and operationalization of constructs in the family sciences, using data from the National Longitudinal Survey of Youth 1979 (n = 2,732). Three basic models are considered: (a) the Rasch and (b) two-parameter logistic models for dichotomous items and (c) the Rating Scale Model for multicategory items. Throughout, the author highlights the potential for researchers to elevate measurement to a level on par with theorizing and testing about relationships among constructs. PMID:25663714
The Work Instability Scale for Rheumatoid Arthritis (RA-WIS): Does it work in osteoarthritis?
Tang, Kenneth; Beaton, Dorcas E; Lacaille, Diane; Gignac, Monique A M; Zhang, Wei; Anis, Aslam H; Bombardier, Claire
2010-09-01
To validate the 23-item Work Instability Scale for Rheumatoid Arthritis (RA-WIS) for use in osteoarthritis (OA) using both classical test theory and item response theory approaches. Baseline and 12-month follow-up data were collected from workers with OA recruited from community and clinical settings (n = 130). Fit of RA-WIS data to the Rasch model was evaluated by item- and person-fit statistics (size of residual, chi-sq), assessments of differential item functioning, and tests of unidimensionality and local independence. Internal consistency was assessed by KR-20. Convergent construct validity (Spearman r, known-groups) was evaluated against theoretical constructs that assess impact of health on work. Responsiveness to global indicators of change was assessed by standardized response means (SRM) and area under the receiver operating characteristic curves. Data structure of the RA-WIS showed adequate fit to the Rasch model (chi-sq = 83.2, P = 0.03) after addressing local dependency in three item pairs by creating testlets. High internal consistency (KR-20 = 0.93) and convergent validity with work-oriented constructs (|r| = 0.55-0.77) were evident. The RA-WIS correlated most strongly with the concept of illness intrusiveness (r = 0.77) and was highly responsive to changes (SRM = 1.05 [deterioration]; -0.78 [improvement]). Although developed for RA, the RA-WIS is psychometrically sound for OA and demonstrates interval-level property.
Kutlay, Sehim; Kuçukdeveci, Ayse A; Elhan, Atilla H; Yavuzer, Gunes; Tennant, Alan
2007-02-28
Assessment of cognitive impairment with a valid cognitive screening tool is essential in neurorehabilitation. The aim of this study was to test the reliability and validity of the Turkish-adapted version of the Middlesex Elderly Assessment of Mental State (MEAMS) among acquired brain injury patients in Turkey. Some 155 patients with acquired brain injury admitted for rehabilitation were assessed by the adapted version of MEAMS at admission and discharge. Reliability was tested by internal consistency, intra-class correlation coefficient (ICC) and person separation index; internal construct validity by Rasch analysis; external construct validity by associations with physical and cognitive disability (FIM); and responsiveness by Effect Size. Reliability was found to be good with Cronbach's alpha of 0.82 at both admission and discharge; and likewise an ICC of 0.80. Person separation index was 0.813. Internal construct validity was good by fit of the data to the Rasch model (mean item fit -0.178; SD 1.019). Items were substantially free of differential item functioning. External construct validity was confirmed by expected associations with physical and cognitive disability. Effect size was 0.42 compared with 0.22 for cognitive FIM. The reliability and validity of the Turkish version of MEAMS as a cognitive impairment screening tool in acquired brain injury has been demonstrated.
The Social Physique Anxiety Scale: construct validity in adolescent females.
McAuley, E; Burman, G
1993-09-01
Hart, Leary, and Rejeski have developed the Social Physique Anxiety Scale (SPA), a measure of the anxiety experienced in response to having one's physique evaluated by other people. The present study cross-validated the psychometric properties of this measure in a sample (N = 236) of adolescent competitive female gymnasts. Employing structural equation modeling, the proposed unidimensional factor structure of the SPA was supported, although some questions regarding the robustness of the fit are raised. Construct validity was demonstrated by significant inverse relationships between aspects of physical efficacy (perceived physical ability and physical self-presentation confidence) and degree of social physique anxiety. These findings are discussed in terms of possible alternative factor structures and integration of social anxiety and other psychosocial constructs to better understand physical activity behavior.
Valente, Ana Rita S; Hall, Andreia; Alvelos, Helena; Leahy, Margaret; Jesus, Luis M T
2018-04-12
The appropriate use of language in context depends on the speaker's pragmatic language competencies. A coding system was used to develop a specific and adult-focused self-administered questionnaire to adults who stutter and adults who do not stutter, The Assessment of Language Use in Social Contexts for Adults, with three categories: precursors, basic exchanges, and extended literal/non-literal discourse. This paper presents the content validity, item analysis, reliability coefficients and evidences of construct validity of the instrument. Content validity analysis was based on a two-stage process: first, 11 pragmatic questionnaires were assessed to identify items that probe each pragmatic competency and to create the first version of the instrument; second, items were assessed qualitatively by an expert panel composed by adults who stutter and controls, and quantitatively and qualitatively by an expert panel composed by clinicians. A pilot study was conducted with five adults who stutter and five controls to analyse items and calculate reliability. Construct validity evidences were obtained using the hypothesized relationships method and factor analysis with 28 adults who stutter and 28 controls. Concerning content validity, the questionnaires assessed up to 13 pragmatic competencies. Qualitative and quantitative analysis revealed ambiguities in items construction. Disagreement between experts was solved through item modification. The pilot study showed that the instrument presented internal consistency and temporal stability. Significant differences between adults who stutter and controls and different response profiles revealed the instrument's underlying construct. The instrument is reliable and presented evidences of construct validity.
The Utrecht questionnaire (U-CEP) measuring knowledge on clinical epidemiology proved to be valid.
Kortekaas, Marlous F; Bartelink, Marie-Louise E L; de Groot, Esther; Korving, Helen; de Wit, Niek J; Grobbee, Diederick E; Hoes, Arno W
2017-02-01
Knowledge on clinical epidemiology is crucial to practice evidence-based medicine. We describe the development and validation of the Utrecht questionnaire on knowledge on Clinical epidemiology for Evidence-based Practice (U-CEP); an assessment tool to be used in the training of clinicians. The U-CEP was developed in two formats: two sets of 25 questions and a combined set of 50. The validation was performed among postgraduate general practice (GP) trainees, hospital trainees, GP supervisors, and experts. Internal consistency, internal reliability (item-total correlation), item discrimination index, item difficulty, content validity, construct validity, responsiveness, test-retest reliability, and feasibility were assessed. The questionnaire was externally validated. Internal consistency was good with a Cronbach alpha of 0.8. The median item-total correlation and mean item discrimination index were satisfactory. Both sets were perceived as relevant to clinical practice. Construct validity was good. Both sets were responsive but failed on test-retest reliability. One set took 24 minutes and the other 33 minutes to complete, on average. External GP trainees had comparable results. The U-CEP is a valid questionnaire to assess knowledge on clinical epidemiology, which is a prerequisite for practicing evidence-based medicine in daily clinical practice. Copyright © 2016 Elsevier Inc. All rights reserved.
Systematic Development and Validation of a Theory-Based Questionnaire to Assess Toddler Feeding12
Hurley, Kristen M.; Pepper, M. Reese; Candelaria, Margo; Wang, Yan; Caulfield, Laura E.; Latta, Laura; Hager, Erin R.; Black, Maureen M.
2013-01-01
This paper describes the development and validation of a 27-item caregiver-reported questionnaire on toddler feeding. The development of the Toddler Feeding Behavior Questionnaire was based on a theory of interactive feeding that incorporates caregivers’ responses to concerns about their children’s dietary intake, appetite, size, and behaviors rather than relying exclusively on caregiver actions. Content validity included review by an expert panel (n = 7) and testing in a pilot sample (n = 105) of low-income mothers of toddlers. Construct validity and reliability were assessed among a second sample of low-income mothers of predominately African-American (70%) toddlers aged 12–32 mo (n = 297) participating in the baseline evaluation of a toddler overweight prevention study. Internal consistency (Cronbach’s α: 0.64–0.87) and test-retest (0.57–0.88) reliability were acceptable for most constructs. Exploratory and confirmatory factor analyses revealed 5 theoretically derived constructs of feeding: responsive, forceful/pressuring, restrictive, indulgent, and uninvolved (root mean square error of approximation = 0.047, comparative fit index = 0.90, standardized root mean square residual = 0.06). Statistically significant (P < 0.05) convergent validity results further validated the scale, confirming established relations between feeding behaviors, toddler overweight status, perceived toddler fussiness, and maternal mental health. The Toddler Feeding Behavior Questionnaire adds to the field by providing a brief instrument that can be administered in 5 min to examine how caregiver-reported feeding behaviors relate to toddler health and behavior. PMID:24068792
Systematic development and validation of a theory-based questionnaire to assess toddler feeding.
Hurley, Kristen M; Pepper, M Reese; Candelaria, Margo; Wang, Yan; Caulfield, Laura E; Latta, Laura; Hager, Erin R; Black, Maureen M
2013-12-01
This paper describes the development and validation of a 27-item caregiver-reported questionnaire on toddler feeding. The development of the Toddler Feeding Behavior Questionnaire was based on a theory of interactive feeding that incorporates caregivers' responses to concerns about their children's dietary intake, appetite, size, and behaviors rather than relying exclusively on caregiver actions. Content validity included review by an expert panel (n = 7) and testing in a pilot sample (n = 105) of low-income mothers of toddlers. Construct validity and reliability were assessed among a second sample of low-income mothers of predominately African-American (70%) toddlers aged 12-32 mo (n = 297) participating in the baseline evaluation of a toddler overweight prevention study. Internal consistency (Cronbach's α: 0.64-0.87) and test-retest (0.57-0.88) reliability were acceptable for most constructs. Exploratory and confirmatory factor analyses revealed 5 theoretically derived constructs of feeding: responsive, forceful/pressuring, restrictive, indulgent, and uninvolved (root mean square error of approximation = 0.047, comparative fit index = 0.90, standardized root mean square residual = 0.06). Statistically significant (P < 0.05) convergent validity results further validated the scale, confirming established relations between feeding behaviors, toddler overweight status, perceived toddler fussiness, and maternal mental health. The Toddler Feeding Behavior Questionnaire adds to the field by providing a brief instrument that can be administered in 5 min to examine how caregiver-reported feeding behaviors relate to toddler health and behavior.
Calibration of the Dutch-Flemish PROMIS Pain Behavior item bank in patients with chronic pain.
Crins, M H P; Roorda, L D; Smits, N; de Vet, H C W; Westhovens, R; Cella, D; Cook, K F; Revicki, D; van Leeuwen, J; Boers, M; Dekker, J; Terwee, C B
2016-02-01
The aims of the current study were to calibrate the item parameters of the Dutch-Flemish PROMIS Pain Behavior item bank using a sample of Dutch patients with chronic pain and to evaluate cross-cultural validity between the Dutch-Flemish and the US PROMIS Pain Behavior item banks. Furthermore, reliability and construct validity of the Dutch-Flemish PROMIS Pain Behavior item bank were evaluated. The 39 items in the bank were completed by 1042 Dutch patients with chronic pain. To evaluate unidimensionality, a one-factor confirmatory factor analysis (CFA) was performed. A graded response model (GRM) was used to calibrate the items. To evaluate cross-cultural validity, Differential item functioning (DIF) for language (Dutch vs. English) was evaluated. Reliability of the item bank was also examined and construct validity was studied using several legacy instruments, e.g. the Roland Morris Disability Questionnaire. CFA supported the unidimensionality of the Dutch-Flemish PROMIS Pain Behavior item bank (CFI = 0.960, TLI = 0.958), the data also fit the GRM, and demonstrated good coverage across the pain behavior construct (threshold parameters range: -3.42 to 3.54). Analysis showed good cross-cultural validity (only six DIF items), reliability (Cronbach's α = 0.95) and construct validity (all correlations ≥0.53). The Dutch-Flemish PROMIS Pain Behavior item bank was found to have good cross-cultural validity, reliability and construct validity. The development of the Dutch-Flemish PROMIS Pain Behavior item bank will serve as the basis for Dutch-Flemish PROMIS short forms and computer adaptive testing (CAT). © 2015 European Pain Federation - EFIC®
[Validity and reliability of the Culture of Quality Health Services questionnaire in Mexico].
Herrera-Kiengelher, L; Zepeda-Zaragoza, J; Austria-Corrales, F; Vázquez-Zarate, V M
2013-01-01
Patient Safety is a major public health problem worldwide and is responsibility of all those involved in health care. Establishing a Safety Culture has proved to be a factor that favors the integration of work teams, communication and construction of clear procedures in various organizations. Promote a culture of safety depends on several factors, such as organization, work unit and staff. Objective assessment of these factors will help to identify areas for improvement and establish strategic lines of action. [corrected] To adapt, validate and calibrate the questionnaire Culture of Quality in Health Services (CQHS) in Mexican population. A cross with a stratified representative sample of 522 health workers. The questionnaire was translated and adapted from Singer's. Content was validated by experts, internal consistency, confirmatory factorial validity and item calibration with Samejima's Graded Response Model. Convergent and divergent construct validity was confirmed from the CQHS, item calibration showed that the questionnaire is able to discriminate between patients and represent different levels of the hypothesized dimensions with greater accuracy and lower standard error. The CQHS is a valid and reliable instrument to assess patient safety culture in hospitals in Mexico. Copyright © 2013 SECA. Published by Elsevier Espana. All rights reserved.
Development and validation of the Measure of Indigenous Racism Experiences (MIRE)
Paradies, Yin C; Cunningham, Joan
2008-01-01
Background In recent decades there has been increasing evidence of a relationship between self-reported racism and health. Although a plethora of instruments to measure racism have been developed, very few have been described conceptually or psychometrically Furthermore, this research field has been limited by a dearth of instruments that examine reactions/responses to racism and by a restricted focus on African American populations. Methods In response to these limitations, the 31-item Measure of Indigenous Racism Experiences (MIRE) was developed to assess self-reported racism for Indigenous Australians. This paper describes the development of the MIRE together with an opportunistic examination of its content, construct and convergent validity in a population health study involving 312 Indigenous Australians. Results Focus group research supported the content validity of the MIRE, and inter-item/scale correlations suggested good construct validity. A good fit with a priori conceptual dimensions was demonstrated in factor analysis, and convergence with a separate item on discrimination was satisfactory. Conclusion The MIRE has considerable utility as an instrument that can assess multiple facets of racism together with responses/reactions to racism among indigenous populations and, potentially, among other ethnic/racial groups. PMID:18426602
Dima, Alexandra Lelia; Schulz, Peter Johannes
2017-01-01
Background The eHealth Literacy Scale (eHEALS) is a tool to assess consumers’ comfort and skills in using information technologies for health. Although evidence exists of reliability and construct validity of the scale, less agreement exists on structural validity. Objective The aim of this study was to validate the Italian version of the eHealth Literacy Scale (I-eHEALS) in a community sample with a focus on its structural validity, by applying psychometric techniques that account for item difficulty. Methods Two Web-based surveys were conducted among a total of 296 people living in the Italian-speaking region of Switzerland (Ticino). After examining the latent variables underlying the observed variables of the Italian scale via principal component analysis (PCA), fit indices for two alternative models were calculated using confirmatory factor analysis (CFA). The scale structure was examined via parametric and nonparametric item response theory (IRT) analyses accounting for differences between items regarding the proportion of answers indicating high ability. Convergent validity was assessed by correlations with theoretically related constructs. Results CFA showed a suboptimal model fit for both models. IRT analyses confirmed all items measure a single dimension as intended. Reliability and construct validity of the final scale were also confirmed. The contrasting results of factor analysis (FA) and IRT analyses highlight the importance of considering differences in item difficulty when examining health literacy scales. Conclusions The findings support the reliability and validity of the translated scale and its use for assessing Italian-speaking consumers’ eHealth literacy. PMID:28400356
Chin, Weng Yee; Choi, Edmond P H; Chan, Kit T Y; Wong, Carlos K H
2015-01-01
The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald's omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for "somatic", "depressed affect", "positive affect" and "interpersonal problems" were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value <0.01 and all effect size statistics >0.2). The CES-D was externally responsive, with the AUC>0.7. The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four-factor and bi-factor structure, the CES-D is supported for cross-cultural comparisons of depression in multi-center studies.
The development of an instrument to assess chemistry perceptions
NASA Astrophysics Data System (ADS)
Wells, Raymond R.
The instrument, developed in this study, attempted to correct the deficiencies of previous instruments. Statements of belief and opinion can be validly included under the construct of chemistry perceptions. Further, statements that might be better characterized as science attitudes, math attitudes, or attitudes toward a specific course or program were not included. Eliminating statements of math anxiety and test anxiety insured that responses to statements of anxiety were perceptions of anxiety solely related to chemistry. The results of the expert judges' responses to the Validation of Proposed Perception Statements forms were detailed to establish construct and content validity. The nature of Likert scale construction and calculation of internal consistency also supported the validity of the instrument. A pilot Chemistry Perception Questionnaire (CPQ) was then constructed based on agreement of the appropriate subscale and mean importance of the perception statements. The pilot CPQ results were subjected to an item analysis based on three sets of statistics: the frequency of each response and the percentage of respondents making each response for each perception statement, the mean and standard deviations for each item, and the item discrimination index which correlated the item scores with the subscale scores. With no zero or negative correlations to the subscale scores, it was not necessary to replace any of the perception statements contained in the pilot instrument. Therefore, the piloted Chemistry Perception Questionnaire became the final instrument. Factor analysis confirmed the multidimensionality of the instrument. The instrument was administered twice with a separation interval of approximately one month in order to perform a test-retest reliability analysis. One hundred and forty-one pairs were matched and results detailed. The correlation between forms, for the total instrument, was 0.9342. The mean coefficient alpha, for the total instrument, was 0.9495. With test-retest correlations and alphas exceeding 0.70 for all seven subscales and the total instrument, it was determined that the Chemistry Perception Questionnaire instrument achieved reasonably high reliability estimations.
Ruch, Willibald; Heintz, Sonja
2017-01-01
How strongly does humor (i.e., the construct-relevant content) in the Humor Styles Questionnaire (HSQ; Martin et al., 2003) determine the responses to this measure (i.e., construct validity)? Also, how much does humor influence the relationships of the four HSQ scales, namely affiliative, self-enhancing, aggressive, and self-defeating, with personality traits and subjective well-being (i.e., criterion validity)? The present paper answers these two questions by experimentally manipulating the 32 items of the HSQ to only (or mostly) contain humor (i.e., construct-relevant content) or to substitute the humor content with non-humorous alternatives (i.e., only assessing construct-irrelevant context). Study 1 (N = 187) showed that the HSQ affiliative scale was mainly determined by humor, self-enhancing and aggressive were determined by both humor and non-humorous context, and self-defeating was primarily determined by the context. This suggests that humor is not the primary source of the variance in three of the HQS scales, thereby limiting their construct validity. Study 2 (N = 261) showed that the relationships of the HSQ scales to the Big Five personality traits and subjective well-being (positive affect, negative affect, and life satisfaction) were consistently reduced (personality) or vanished (subjective well-being) when the non-humorous contexts in the HSQ items were controlled for. For the HSQ self-defeating scale, the pattern of relationships to personality was also altered, supporting an positive rather than a negative view of the humor in this humor style. The present findings thus call for a reevaluation of the role that humor plays in the HSQ (construct validity) and in the relationships to personality and well-being (criterion validity). PMID:28473794
ERIC Educational Resources Information Center
Farley, Frank H.; And Others
Two studies were reported which attempted to estimate the stability and construct validity of human salivary response as a measure of individual differences (IDs) in physiological arousal. Twenty-second base line estimates and 20-second response levels to four drops of lemon juice were measured, with the former value being removed from the latter…
Bajada, Stefan; Mohanty, Khitish
2016-06-01
The Majeed scoring system is a disease-specific outcome measure that was originally designed to assess pelvic injuries. The aim of this study was to determine the psychometric properties of the Majeed scoring system for chronic sacroiliac joint pain. Internal consistency, content validity, criterion validity, construct validity and responsiveness to change was assessed prospectively for the Majeed scoring system in a cohort of 60 patients diagnosed with sacroiliac joint pain. This diagnosis was confirmed with CT-guided sacroiliac joint anaesthetic block. The overall Majeed score showed acceptable internal consistency (Cronbach alpha = 0.63). Similarly, it showed acceptable floor (0 %) and ceiling (0 %) effects. On the other hand, the domains of pain, work, sitting and sexual intercourse had high (>30 %) floor effects. Significant correlation with the physical component of the Short Form-36 (p = 0.005) and Oswestry disability index (p ≤ 0.001) was found indicating acceptable criterion validity. The overall Majeed score showed acceptable construct validity with all five developed hypotheses showing significance (p ≤ 0.05). The overall Majeed score showed acceptable responsiveness to change with a large (≥0.80) effect size and standardized response mean. Overall the Majeed scoring system demonstrated acceptable psychometric properties for outcome assessment in chronic sacroiliac joint pain. Thus, its use in this condition is adequate. However, some domains demonstrated suboptimal performance indicating that improvement might be achieved with the development of an outcome measure specific for sacroiliac joint dysfunction and degeneration.
Richtering, Sarah S; Morris, Rebecca; Soh, Sze-Ee; Barker, Anna; Bampi, Fiona; Neubeck, Lis; Coorey, Genevieve; Mulley, John; Chalmers, John; Usherwood, Tim; Peiris, David; Chow, Clara K; Redfern, Julie
2017-01-01
Electronic health (eHealth) strategies are evolving making it important to have valid scales to assess eHealth and health literacy. Item response theory methods, such as the Rasch measurement model, are increasingly used for the psychometric evaluation of scales. This paper aims to examine the internal construct validity of an eHealth and health literacy scale using Rasch analysis in a population with moderate to high cardiovascular disease risk. The first 397 participants of the CONNECT study completed the electronic health Literacy Scale (eHEALS) and the Health Literacy Questionnaire (HLQ). Overall Rasch model fit as well as five key psychometric properties were analysed: unidimensionality, response thresholds, targeting, differential item functioning and internal consistency. The eHEALS had good overall model fit (χ2 = 54.8, p = 0.06), ordered response thresholds, reasonable targeting and good internal consistency (person separation index (PSI) 0.90). It did, however, appear to measure two constructs of eHealth literacy. The HLQ subscales (except subscale 5) did not fit the Rasch model (χ2: 18.18-60.60, p: 0.00-0.58) and had suboptimal targeting for most subscales. Subscales 6 to 9 displayed disordered thresholds indicating participants had difficulty distinguishing between response options. All subscales did, nonetheless, demonstrate moderate to good internal consistency (PSI: 0.62-0.82). Rasch analyses demonstrated that the eHEALS has good measures of internal construct validity although it appears to capture different aspects of eHealth literacy (e.g. using eHealth and understanding eHealth). Whilst further studies are required to confirm this finding, it may be necessary for these constructs of the eHEALS to be scored separately. The nine HLQ subscales were shown to measure a single construct of health literacy. However, participants' scores may not represent their actual level of ability, as distinction between response categories was unclear for the last four subscales. Reducing the response categories of these subscales may improve the ability of the HLQ to distinguish between different levels of health literacy.
Dueñas, María; Mendonça, Liliane; Sampaio, Rute; Gouvinhas, Cláudia; Oliveira, Daniela; Castro-Lopes, José Manuel; Azevedo, Luís Filipe
2017-03-01
The Bowel Function Index (BFI) is a simple and sound bowel function and opioid-induced constipation (OIC) screening tool. We aimed to develop the translation and cultural adaptation of this measure (BFI-P) and to assess its reliability and validity for the Portuguese language and a chronic pain population. The BFI-P was created after a process including translation, back translation and cultural adaptation. Participants (n = 226) were recruited in a chronic pain clinic and were assessed at baseline and after one week. Internal consistency, test-retest reliability, responsiveness, construct (convergent and known groups) and factorial validity were assessed. Test-retest reliability had an intra-class correlation of 0.605 for BFI mean score. Internal consistency of BFI had Cronbach's alpha of 0.865. The construct validity of BFI-P was shown to be excellent and the exploratory factor analysis confirmed its unidimensional structure. The responsiveness of BFI-P was excellent, with a suggested 17-19 point and 8-12 point change in score constituting a clinically relevant change in constipation for patients with and without previous constipation, respectively. This study had some limitations, namely, the criterion validity of BFI-P was not directly assessed; and the absence of a direct criterion for OIC precluded the assessment of the criterion based responsiveness of BFI-P. Nevertheless, BFI may importantly contribute to better OIC screening and its Portuguese version (BFI-P) has been shown to have excellent reliability, internal consistency, validity and responsiveness. Further suggestions regarding statistically and clinically important change cut-offs for this instrument are presented.
Dhillon, Sonya; Bagby, R Michael; Kushner, Shauna C; Burchett, Danielle
2017-04-01
The Personality Inventory for DSM-5 (PID-5) is a 220-item self-report instrument that assesses the alternative model of personality psychopathology in Section III (Emerging Measures and Models) of DSM-5 . Despite its relatively recent introduction, the PID-5 has generated an impressive accumulation of studies examining its psychometric properties, and the instrument is also already widely and frequently used in research studies. Although the PID-5 is psychometrically sound overall, reviews of this instrument express concern that this scale does not possess validity scales to detect invalidating levels of response bias, such as underreporting and overreporting. McGee Ng et al. (2016), using a "known-groups" (partial) criterion design, demonstrated that both underreporting and overreporting grossly affect mean scores on PID-5 scales. In the current investigation, we replicate these findings using an analog simulation design. An important extension to this replication study was the finding that the construct validity of the PID-5 was also significantly compromised by response bias, with statistically significant attenuation noted in validity coefficients of the PID-5 domain scales with scales from other instruments measuring congruent constructs. This attenuation was found for underreporting and overreporting bias. We believe there is a need to develop validity scales to screen for data-distorting response bias in research contexts and in clinical assessments where response bias is likely or otherwise suspected. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Construct validity of the Korean Women's Abuse Intolerance Scale.
Choi, Myunghan; Phillips, Linda R; Figueredo, Aurelio José; Insel, Katheleen; Min, Sung-Kil
2008-01-01
Domestic violence against married women has persisted throughout Korean history. However, very little empirical research has been conducted in Korea about domestic violence, its causes, or women's responses. To develop and test psychometrically the Korean Women's Abuse Intolerance Scale (KWAIS) to measure women's propensity or desire to leave abusive husbands in Korea. The first phase of the investigation involved qualitative research to explore the themes of women's responses to domestic violence and the development of the instrument. The second phase was a preliminary study conducted to examine women's responses to domestic violence. In the third phase, construct validity of the scale was established, using a sample of 184 married women living in Korea. The KWAIS adequately supported the underlying theory of women's responses to domestic violence, demonstrating strong content validity, high internal consistency (Cronbach's alpha of .98), and criterion-related validity evidenced by significant correlations that supported hypotheses among abuse intolerance and abuse (r = .69), traditional family ideology (r = -.78), marital satisfaction (r = -.85), attitude toward power ascription (r = .63), and collectivism (r = -.88). Factor analysis yielded a four-factor structure, explaining 78.4% of the common variance. Factor loadings ranged from .65 to .93. The findings for the psychometric properties of the KWAIS established its potential as a research instrument in measuring Korean women's propensity or desire to leave abusive husbands. Future studies need to focus on determining the predictive validity of the KWAIS and evaluating cross-cultural differences in women's propensity or desire to leave abusive husbands.
The Nature of Science Instrument-Elementary (NOSI-E): the end of the road?
Peoples, Shelagh M; O'Dwyer, Laura M
2014-01-01
This research continues prior work published in this journal (Peoples, O'Dwyer, Shields and Wang, 2013). The first paper described the scale development, psychometric analyses and part-validation of a theoretically-grounded Rasch-based instrument, the Nature of Science Instrument-Elementary (NOSI-E). The NOSI-E was designed to measure elementary students' understanding of the Nature of Science (NOS). In the first paper, evidence was provided for three of the six validity aspects (content, substantive and generalizability) needed to support the construct validity of the NOSI-E. The research described in this paper examines two additional validity aspects (structural and external). The purpose of this study was to determine which of three competing internal models provides reliable, interpretable, and responsive measures of students' understanding of NOS. One postulate is that the NOS construct is unidimensional;. alternatively, the NOS construct is composed of five independent unidimensional constructs (the consecutive approach). Lastly, the NOS construct is multidimensional and composed of five inter-related but separate dimensions. The vast body of evidence supported the claim that the NOS construct is multidimensional. Measures from the multidimensional model were positively related to student science achievement and students' perceptions of their classroom environment; this provided supporting evidence for the external validity aspect of the NOS construct. As US science education moves toward students learning science through engaging in authentic scientific practices and building learning progressions (NRC, 2012), it will be important to assess whether this new approach to teaching science is effective, and the NOSI-E may be used as a measure of the impact of this reform.
Xiao, Han; Peng, Minjin; Yan, Hong; Gao, Mengting; Li, Jingjing; Yu, Bin; Wu, Hanbo; Li, Shiyue
2016-01-01
Further advancement in schistosomiasis prevention requires new tools to assess protective motivation, and promote innovative intervention program. This study aimed to develop and evaluate an instrument developed based on the Protection Motivation Theory (PMT) to predict protective behavior intention against schistosomiasis among adolescents in China. We developed the Schistosomiasis PMT Scale based on two appraisal pathways of protective motivation- threat appraisal pathway and coping appraisal pathway. Data from a large sample of middle school students ( n = 2238, 51 % male, mean age 13.13 ± 1.10) recruited in Hubei, China was used to evaluated the validity and reliability of the scale. The final scale contains 18 items with seven sub-constructs. Cronbach's Alpha coefficients for the entire instrument was 0.76, and for the seven sub-constructs of severity, vulnerability, intrinsic reward, extrinsic reward, response efficacy, self-efficacy and response cost was 0.56, 0.82, 0.75, 0.80, 0.90, 0.72 and 0.70, respectively. The construct validity analysis revealed that the one level 7 sub-constructs model fitted data well (GFI = 0.98, CFI = 0.98, RMSEA = 0.03, Chi-sq/df = 3.90, p < 0.001). Predictive validity showed that both the PMT instrument score and the 7 sub-construct scores were significantly correlated with the intention engaged in protective behavior against schistosomiasis ( p < 0.05). This study provides a reliable and valid tool to measure protective motivation in schistosomiasis prevention control. Further studies are needed to develop more effective intervention programs for schistosomiasis prevention.
Tomita, Machiko R; Saharan, Sumandeep; Rajendran, Sheela; Nochajski, Susan M; Schweitzer, Jo A
2014-01-01
OBJECTIVE. To identify psychometric properties of the Home Safety Self-Assessment Tool (HSSAT) to prevent falls in community-dwelling older adults. METHOD. We tested content validity, test-retest reliability, interrater reliability, construct validity, convergent and discriminant validity, and responsiveness to change. RESULTS. The content validity index was .98, the intraclass correlation coefficient for test-retest reliability was .97, and the interrater reliability was .89. The difference on identified risk factors between the use and nonuse of the HSSAT was significant (p = .005). Convergent validity with the Centers for Disease Control and Prevention Home Safety Checklist was high (r = .65), and discriminant validity with fear of falling was very low (r = .10). The responsiveness to change was moderate (standardized response mean = 0.57). CONCLUSION. The HSSAT is a reliable and valid instrument to identify fall risks in a home environment, and the HSSAT booklet is effective as educational material leading to improvement in home safety. Copyright © 2014 by the American Occupational Therapy Association, Inc.
Lin, Chung-Ying; Pakpour, Amir H
2017-02-01
The problems of mood disorders are critical in people with epilepsy. Therefore, there is a need to validate a useful tool for the population. The Hospital Anxiety and Depression Scale (HADS) has been used on the population, and showed that it is a satisfactory screening tool. However, more evidence on its construct validity is needed. A total of 1041 people with epilepsy were recruited in this study, and each completed the HADS. Confirmatory factor analysis (CFA) and Rasch analysis were used to understand the construct validity of the HADS. In addition, internal consistency was tested using Cronbachs' α, person separation reliability, and item separation reliability. Ordering of the response descriptors and the differential item functioning (DIF) were examined using the Rasch models. The HADS showed that 55.3% of our participants had anxiety; 56.0% had depression based on its cutoffs. CFA and Rasch analyses both showed the satisfactory construct validity of the HADS; the internal consistency was also acceptable (α=0.82 in anxiety and 0.79 in depression; person separation reliability=0.82 in anxiety and 0.73 in depression; item separation reliability=0.98 in anxiety and 0.91 in depression). The difficulties of the four-point Likert scale used in the HADS were monotonically increased, which indicates no disordering response categories. No DIF items across male and female patients and across types of epilepsy were displayed in the HADS. The HADS has promising psychometric properties on construct validity in people with epilepsy. Moreover, the additive item score is supported for calculating the cutoff. Copyright © 2016 British Epilepsy Association. Published by Elsevier Ltd. All rights reserved.
Development and validation of instrument for ergonomic evaluation of tablet arm chairs
Tirloni, Adriana Seára; dos Reis, Diogo Cunha; Bornia, Antonio Cezar; de Andrade, Dalton Francisco; Borgatto, Adriano Ferreti; Moro, Antônio Renato Pereira
2016-01-01
The purpose of this study was to develop and validate an evaluation instrument for tablet arm chairs based on ergonomic requirements, focused on user perceptions and using Item Response Theory (IRT). This exploratory study involved 1,633 participants (university students and professors) in four steps: a pilot study (n=26), semantic validation (n=430), content validation (n=11) and construct validation (n=1,166). Samejima's graded response model was applied to validate the instrument. The results showed that all the steps (theoretical and practical) of the instrument's development and validation processes were successful and that the group of remaining items (n=45) had a high consistency (0.95). This instrument can be used in the furniture industry by engineers and product designers and in the purchasing process of tablet arm chairs for schools, universities and auditoriums. PMID:28337099
Development and Initial Validation of the Adolescent Responses to Body Dissatisfaction Measure
ERIC Educational Resources Information Center
Maxwell, Melissa A.; Cole, David A.
2012-01-01
One community sample (N = 607) of youths generated self-reported responses to body dissatisfaction, from which the Adolescent Responses to Body Dissatisfaction (ARBD) inventory was constructed. A 2nd, similar sample (N = 830) completed this measure as well as measures of coping, body dissatisfaction, body mass index, depressive symptoms, and…
Diviani, Nicola; Dima, Alexandra Lelia; Schulz, Peter Johannes
2017-04-11
The eHealth Literacy Scale (eHEALS) is a tool to assess consumers' comfort and skills in using information technologies for health. Although evidence exists of reliability and construct validity of the scale, less agreement exists on structural validity. The aim of this study was to validate the Italian version of the eHealth Literacy Scale (I-eHEALS) in a community sample with a focus on its structural validity, by applying psychometric techniques that account for item difficulty. Two Web-based surveys were conducted among a total of 296 people living in the Italian-speaking region of Switzerland (Ticino). After examining the latent variables underlying the observed variables of the Italian scale via principal component analysis (PCA), fit indices for two alternative models were calculated using confirmatory factor analysis (CFA). The scale structure was examined via parametric and nonparametric item response theory (IRT) analyses accounting for differences between items regarding the proportion of answers indicating high ability. Convergent validity was assessed by correlations with theoretically related constructs. CFA showed a suboptimal model fit for both models. IRT analyses confirmed all items measure a single dimension as intended. Reliability and construct validity of the final scale were also confirmed. The contrasting results of factor analysis (FA) and IRT analyses highlight the importance of considering differences in item difficulty when examining health literacy scales. The findings support the reliability and validity of the translated scale and its use for assessing Italian-speaking consumers' eHealth literacy. ©Nicola Diviani, Alexandra Lelia Dima, Peter Johannes Schulz. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 11.04.2017.
Collins, N J; Prinsen, C A C; Christensen, R; Bartels, E M; Terwee, C B; Roos, E M
2016-08-01
To conduct a systematic review and meta-analysis to synthesize evidence regarding measurement properties of the Knee injury and Osteoarthritis Outcome Score (KOOS). A comprehensive literature search identified 37 eligible papers evaluating KOOS measurement properties in participants with knee injuries and/or osteoarthritis (OA). Methodological quality was evaluated using the COSMIN checklist. Where possible, meta-analysis of extracted data was conducted for all studies and stratified by age and knee condition; otherwise narrative synthesis was performed. KOOS has adequate internal consistency, test-retest reliability and construct validity in young and old adults with knee injuries and/or OA. The ADL subscale has better content validity for older patients and Sport/Rec for younger patients with knee injuries, while the Pain subscale is more relevant for painful knee conditions. The five-factor structure of the original KOOS is unclear. There is some evidence that the KOOS subscales demonstrate sufficient unidimensionality, but this requires confirmation. Although measurement error requires further evaluation, the minimal detectable change for KOOS subscales ranges from 14.3 to 19.6 for younger individuals, and ≥20 for older individuals. Evidence of responsiveness comes from larger effect sizes following surgical (especially total knee replacement) than non-surgical interventions. KOOS demonstrates adequate content validity, internal consistency, test-retest reliability, construct validity and responsiveness for age- and condition-relevant subscales. Structural validity, cross-cultural validity and measurement error require further evaluation, as well as construct validity of KOOS Physical function Short form. Suggested order of subscales for different knee conditions can be applied in hierarchical testing of endpoints in clinical trials. PROSPERO (CRD42011001603). Copyright © 2016 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Coolahan, Kathleen; McWayne, Christine; Fantuzzo, John; Grim, Suzanne
2002-01-01
Examined the construct and concurrent validity of the Parenting Behavior Questionnaire-Head Start (PBQ-HS) with low-income African-American families with preschoolers, and whether parenting styles differed by caregiver characteristics. Derived Active-Responsive, Active-Restrictive, and Passive-Permissive parenting dimensions; the last differed…
A Second Dystopia in Education: Validity Issues in Authentic Assessment Practices
ERIC Educational Resources Information Center
Hathcoat, John D.; Penn, Jeremy D.; Barnes, Laura L.; Comer, Johnathan C.
2016-01-01
Authentic assessments used in response to accountability demands in higher education face at least two threats to validity. First, a lack of interchangeability between assessment tasks introduces bias when using aggregate-based scores at an institutional level. Second, reliance on written products to capture constructs such as critical thinking…
Ghio, Daniela; Thomson, Wendy; Calam, Rachel; Ulph, Fiona; Baildam, Eileen M; Hyrich, Kimme; Cordingley, Lis
2018-02-01
To investigate the suitability of the revised Illness Perception Questionnaire (IPQ-R) for use with adolescents with a long-term pain condition and to validate a new questionnaire for use with this age group. A three-phase mixed-methods study. Phase 1 comprised in-depth qualitative analyses of audio-recorded cognitive interviews with 20 adolescents with juvenile idiopathic arthritis who were answering IPQ-R items. Transcripts were coded using framework analysis. A content analysis of their intended responses to individual items was also conducted. In Phase 2, a new questionnaire was developed and its linguistic and face validity were assessed with 18 adolescents without long-term conditions. In Phase 3, the construct validity of the new questionnaire was assessed with 240 adolescents with juvenile idiopathic arthritis. A subset of 43 adolescents completed the questionnaire a second time to assess test-retest reliability. All participants were aged 11-16 years. Participants described both conceptual and response format difficulties when answering IPQ-R items. In response, the Pain Perception Questionnaire for Young People (PPQ-YP) was designed which incorporated significant modifications to both wording and response formats when compared with the IPQ-R. A principal component analysis of the PPQ-YP identified ten constructs in the new questionnaire. Emotional representations were separated into two constructs, responsive and anticipatory emotions. The PPQ-YP showed high test-retest reliability. Symptom beliefs appear to be more salient to adolescents with a long-term pain condition than beliefs about the illness as a whole. A new questionnaire to assess pain beliefs of adolescents was designed. Further validation work may be needed to assess its suitability for use with other pain conditions. Statement of contribution What is already known on this subject? Versions of the adult Revised Illness Perception Questionnaire (IPQ-R) have been adapted for adolescents and children by changing item wording; however, research to assess the degree to which the underlying IPQ-R constructs are relevant to adolescents with a long-term condition had not been performed. What the present study adds? In adolescents, beliefs about symptoms of their condition are more salient than beliefs about the illness as a whole. Question response formats for children and young people need to take account of age-specific abilities. A new questionnaire has been designed for adolescents with pain. It is theoretically congruent with the CS-SRM. © 2017 The Authors. British Journal of Health Psychology published by John Wiley & Sons Ltd on behalf of British Psychological Society.
Development of the Parent Responses to School Functioning Questionnaire.
Barber Garcia, Brittany N; Gray, Laura S; Simons, Laura E; Logan, Deirdre E
2017-10-01
Parents play an important role in supporting school functioning in youth with chronic pain, but no validated tools exists to assess parental responses to child and adolescent pain behaviors in the school context. Such a tool would be useful in identifying targets of change to reduce pain-related school impairment. The goal of this study was to develop and preliminarily validate the Parent Responses to School Functioning Questionnaire (PRSF), a parent self-report measure of this construct. After initial expert review and pilot testing, the measure was administered to 418 parents of children (ages 6-17 years) seen for initial multidisciplinary chronic pain clinic evaluation. The final 16-item PRSF showed evidence of good internal consistency (α = .82) and 2-week test-retest reliability (intraclass correlation coefficient = .87). Criterion validity was demonstrated by significant correlations with school absence rates and overall school functioning, and construct validity was demonstrated by correlations with general parental responses to pain. Three subscales emerged capturing parents' personal distress, parents' level of distrust of the school, and parents' expectations and behaviors related to their child's management of challenging school situations. These results provide preliminary support for the PRSF as a psychometrically sound tool to assess parents' responses to child pain in the school setting. The 16-item PRSF measures parental responses to their child's chronic pain in the school context. The clinically useful measure can inform interventions aimed reducing functional disability in children with chronic pain by enhancing parents' ability to respond adaptively to child pain behaviors. Copyright © 2017 American Pain Society. Published by Elsevier Inc. All rights reserved.
Hassett, Afton L; Li, Tracy; Buyske, Steven; Savage, Shantal V; Gignac, Monique A M
2008-05-01
To consider the feasibility of assessing multiple facets of independence in rheumatoid arthritis (RA) using a measure developed from existing items and examining its face validity, construct validity and responsiveness to change. The ATTAIN (Abatacept Trial in Treatment of Anti-tumor necrosis factor [TNF] Inadequate responders) database was used. Patients with RA were randomized 2:1, abatacept (n = 258) and placebo (n = 133). A multi-faceted scale to measure physical and psychosocial independence was constructed using items from the Health Assessment Questionnaire (HAQ) and Short Form 36 Health Survey (SF-36). Questions assessing activity limitations and need for outside caregiver help were also examined. Interviews with 20 RA patients assessed face validity. Item Response Theory analysis yielded two traits - 'Psychosocial Independence', derived from the number of days with activity limitations plus the Role Emotional, Social Functioning and Role Physical subscale items from the SF-36; and 'Physical Independence', derived from 15 HAQ items assessing need for help from another. The two traits showed no significant differential item functioning for age or gender and demonstrated good face validity. Changes over 169 days on Psychosocial Independence were greater (mean 0.46 units, 95% confidence interval [CI]: 0.17-0.75) for the abatacept group than for placebo (p = 0.002). Changes in Physical Independence were greater (mean 0.59 units, 95% CI: 0.35-0.82) for the abatacept group than for placebo (p < 0.001). The multi-faceted assessment of independence in RA based on items from commonly used instruments is feasible suggesting promise for evaluating independence in future clinical trials. This approach demonstrated good face and construct validity and responsiveness in RA patients who had previously failed anti-TNF therapy. However, we caution against an interpretation that these data suggest that abatacept improves independence because the component parts of this assessment came from instruments used in the ATTAIN trial where data had been previously analyzed.
Desai, Arti D; Zhou, Chuan; Stanford, Susan; Haaland, Wren; Varni, James W; Mangione-Smith, Rita M
2014-12-01
Validated patient-reported outcomes responsive to clinical change are needed to evaluate the effectiveness of quality improvement interventions. To evaluate responsiveness, construct validity, and predictive validity of the Pediatric Quality of Life Inventory (PedsQL) 4.0 Generic Core Scales in the pediatric inpatient setting. Prospective, cohort study of parents and caregivers of patients 1 month to 18 years old (n = 4637) and patients 13 to 18 years old (n = 359) admitted to Seattle Children's Hospital between October 1, 2011, and December 31, 2013. Of 7184 eligible participants invited to complete the survey, 4637 (64.5%) completed the PedsQL on admission, and of these 2694 (58.1%) completed the follow-up survey 2 to 8 weeks after discharge. Responsiveness was assessed by calculating improvement scores (difference between follow-up and admission scores). Construct validity was examined by comparing the mean improvement scores for known groups differing by medical complexity. Predictive validity was assessed using Poisson regression to examine associations among admission scores, prolonged length of stay (≥3 days), and 30-day readmissions or emergency department (ED) return visits. Similar models examined the association between improvement scores and risk for 30-day readmissions or ED return visits. The mean (SD) PedsQL improvement scores (scale, 0-100) were 22.1 (22.7) for total, 29.4 (32.4) for physical, and 17.1 (21.0) for psychosocial. The mean PedsQL total improvement scores were lower for patients with medically complex conditions compared with patients without chronic conditions (13.7 [95% CI, 11.6-15.8] vs. 24.1 [95% CI, 22.4-25.7], P < .001). A 10-point decrement in the PedsQL total admission score below the established community-based mean was associated with an increase in risk for prolonged length of stay (15% [95% CI, 13%-17%]), 30-day readmissions (8% [95% CI, 3%-14%]), and ED return visits (13% [95% CI, 6%-20%]). A 5-point decrement in the PedsQL total improvement score below the study sample mean improvement score was associated with an increase in risk for 30-day readmissions or ED return visits (9% [95% CI, -1% to 19%]). The PedsQL demonstrated responsiveness, construct validity, and predictive validity in hospitalized pediatric patients. The PedsQL may be a useful patient-reported outcome for hospital-based clinical effectiveness research.
Moving to Capture Children's Attention: Developing a Methodology for Measuring Visuomotor Attention.
Hill, Liam J B; Coats, Rachel O; Mushtaq, Faisal; Williams, Justin H G; Aucott, Lorna S; Mon-Williams, Mark
2016-01-01
Attention underpins many activities integral to a child's development. However, methodological limitations currently make large-scale assessment of children's attentional skill impractical, costly and lacking in ecological validity. Consequently we developed a measure of 'Visual Motor Attention' (VMA)-a construct defined as the ability to sustain and adapt visuomotor behaviour in response to task-relevant visual information. In a series of experiments, we evaluated the capability of our method to measure attentional processes and their contributions in guiding visuomotor behaviour. Experiment 1 established the method's core features (ability to track stimuli moving on a tablet-computer screen with a hand-held stylus) and demonstrated its sensitivity to principled manipulations in adults' attentional load. Experiment 2 standardised a format suitable for use with children and showed construct validity by capturing developmental changes in executive attention processes. Experiment 3 tested the hypothesis that children with and without coordination difficulties would show qualitatively different response patterns, finding an interaction between the cognitive and motor factors underpinning responses. Experiment 4 identified associations between VMA performance and existing standardised attention assessments and thereby confirmed convergent validity. These results establish a novel approach to measuring childhood attention that can produce meaningful functional assessments that capture how attention operates in an ecologically valid context (i.e. attention's specific contribution to visuomanual action).
Development and validation of the coping with terror scale.
Stein, Nathan R; Schorr, Yonit; Litz, Brett T; King, Lynda A; King, Daniel W; Solomon, Zahava; Horesh, Danny
2013-10-01
Terrorism creates lingering anxiety about future attacks. In prior terror research, the conceptualization and measurement of coping behaviors were constrained by the use of existing coping scales that index reactions to daily hassles and demands. The authors created and validated the Coping with Terror Scale to fill the measurement gap. The authors emphasized content validity, leveraging the knowledge of terror experts and groups of Israelis. A multistep approach involved construct definition and item generation, trimming and refining the measure, exploring the factor structure underlying item responses, and garnering evidence for reliability and validity. The final scale comprised six factors that were generally consistent with the authors' original construct specifications. Scores on items linked to these factors demonstrate good reliability and validity. Future studies using the Coping with Terror Scale with other populations facing terrorist threats are needed to test its ability to predict resilience, functional impairment, and psychological distress.
Shrestha, Bidhan; Niraula, Surya Raj; Parajuli, Prakash K; Suwal, Pramita; Singh, Raj Kumar
2018-06-01
To assess the reliability and to validate the translated Nepalese version of the Oral Health Impact Profile (OHIP-EDENT-N) in Nepalese edentulous subjects. The international guidelines for translation and cross-cultural adaption of OHIP-EDENT were followed, and a Nepalese version of the questionnaire was adapted for this study. Eighty-eight completely edentulous subjects were then selected for the study and completed their responses for the questionnaire. The reliability of the OHIP-EDENT-N was evaluated using internal consistency. Validity was assessed as construct and convergent validity. Construct validity was determined using exploratory factor analysis (EFA). The correlation between OHIP-EDENT-N subscale scores and the global question was investigated to test the convergent validity. Cronbach's alpha for the total score of OHIP-EDENT-N was 0.78. Construct validity was assessed by factor analysis: 70.196% of the variance was accountable to five factors extracted from the factor analysis. Factor loadings above 0.40 were noted for all items. In terms of convergent validity, significant correlations could be established between OHIP-EDENT-N and global questions. This study has been able to establish the reliability and validity of the OHIP-EDENT-N, and OHIP-EDENT-N can be a considered a reliable tool to assess the oral health related quality of life in the Nepalese edentulous population. © 2016 by the American College of Prosthodontists.
Merolla, Giovanni; Corona, Katia; Zanoli, Gustavo; Cerciello, Simone; Giannotti, Stefano; Porcellini, Giuseppe
2017-12-01
The Kerlan-Jobe Orthopaedic Clinic (KJOC) Shoulder and Elbow score is a reliable and sensitive tool to measure the performance of overhead athletes. The purpose of this study was to carry out a cross-cultural adaptation and validation of the KJOC questionnaire in Italian and to assess its reliability, validity, and responsiveness. Ninety professional athletes with a painful shoulder were included in this study and were assigned to the "injury group" (n = 32) or the "overuse group" (n = 58); 65 were managed conservatively and 25 were treated by arthroscopic surgery. To assess the reliability of the KJOC score, patients were asked to fill in the questionnaire at baseline and after 2 weeks. To test the construct validity, KJOC scores were compared to those obtained with the Italian version of the Disabilities of the Arm, Shoulder, and Hand (DASH) scale, and with the DASH sports/performing arts module. To test KJOC score responsiveness, the follow-up KJOC scores of the participants treated conservatively were compared to those of the patients treated by arthroscopic surgery. Statistical analysis demonstrated that the KJOC questionnaire is reliable in terms of the single items and the overall score (ICC 0.95-0.99); that it has high construct validity (r s = -0.697; p < 0.01); and that it is responsive to clinical differences in shoulder function (p < 0.0001). The Italian version of the KJOC Shoulder and Elbow score performed in a similar way to the English version and demonstrated good validity, reliability, and responsiveness after conservative and surgical treatment. II.
Strober, Bruce; Zhao, Yang; Tran, Mary Helen; Gnanasakthy, Ari; Nyirady, Judit; Papavassilis, Charis; Nelson, Lauren M; McLeod, Lori D; Mordin, Margaret; Gottlieb, Alice B; Elewski, Boni E; Lebwohl, Mark
2016-03-01
This analysis aimed to confirm the reliability, validity, and responsiveness of the Psoriasis Symptom Diary (PSD) using data from two Phase III studies in patients with moderate to severe chronic plaque psoriasis. Data from two randomized, double-blind, double-dummy, placebo-controlled, multicenter Phase III studies (n = 820) assessing the efficacy and safety of secukinumab were used. The PSD (24-h recall; 0-10 numeric rating scale) was electronically administered each evening. Test-retest reliability was determined using intraclass correlations. Construct validity hypotheses were evaluated via correlations with the Psoriasis Area and Severity Index (PASI), Investigator's Global Assessment (IGA), Dermatology Life Quality Index (DLQI), EuroQoL 5-Dimension Health Status Questionnaire, and Patient Global Impression of Change (PGIC). Discriminating ability and responsiveness were evaluated by estimating mean differences and effect sizes between known groups (using the PASI and IGA). Phase II-derived, anchor-based PGIC thresholds and cumulative distribution function (CDF) plots described meaningful change. Items on the PSD yielded high intraclass coefficients (>0.90). Correlations were in the anticipated direction and by week 12 were moderate to strong (0.41-0.73) in magnitude, demonstrating construct validity. Average PSD item scores differed predictably and significantly between known groups. Responsiveness effect size estimates were moderate to large (0.6-1.5), and CDF plots showed the percentage of responders to be consistently higher in treatment than in placebo arms across the range of change in PSD scores. The PSD is reliable, valid, and responsive, and represents a valid tool to enhance treatment decisions in patients with moderate to severe plaque psoriasis. © 2015 The International Society of Dermatology.
Park, Dong-Jin; Kang, Ji-Hyoun; Lee, Jeong-Won; Lee, Kyung-Eun; Wen, Lihui; Kim, Tae-Jong; Park, Yong-Wook; Nam, Tai-Seung; Kim, Myung-Sun; Lee, Shin-Seok
2013-07-01
The aim of this study was to assess and validate the Korean version of the Boston Carpal Tunnel Questionnaire (K-BCTQ) in patients with carpal tunnel syndrome (CTS). After translation and cultural adaptation of the BCTQ to a Korean version, the K-BCTQ was administered to 54 patients with CTS; it was administered again after 2 weeks to assess reliability. Additionally, we administered K-DASH and EQ-5D to assess construct-validity. In a prospective study of responsiveness to clinical change, 29 of 54 patients were treated by ultrasonography-guided local corticosteroid injection therapy. The internal consistency of the K-BCTQ was high (Cronbach's alpha: 0.915) and the intra-class correlation coefficients were 0.931 for the symptom severity scale (P<0.001) and 0.844 for the functional severity scale (P<0.001). The construct-validity between the symptom severity scale and the K-DASH, and between the functional severity scale and the K-DASH were significantly correlated (both P<0.001). Clinical improvement was noted in 29 patients with injection therapy. The effect size of symptom severity was 0.67, and that of functional severity was 0.58. In conclusion, the K-BCTQ shows good reliability, construct-validity, and acceptable responsiveness after local corticosteroid injection therapy (Clinical trial number, KCT0000050).
Jenerette, Coretta; Dixon, Jane
2010-10-01
Ethnic and cultural norms influence an individual's assertiveness. In health care, assertiveness may play an important role in health outcomes, especially for predominantly minority populations, such as adults with sickle cell disease. Therefore, it is important to develop measures to accurately assess assertiveness. It is also important to reduce response burden of lengthy instruments while retaining instrument reliability and validity. The purpose of this article is to describe development of a shorter version of the Simple Rathus Assertiveness Schedule (SRAS). Data from a cross-sectional descriptive study of adults with sickle cell disease were used to construct a short form of the SRAS, guided by stepwise regression analysis. The 19-item Simple Rathus Assertiveness Scale-Short Form (SRAS-SF) had acceptable reliability (α = .81) and construct validity and was highly correlated with the SRAS (r = .98, p = .01). The SRAS-SF reduces response burden, while maintaining reliability and validity.
ERIC Educational Resources Information Center
Sullivan, Elizabeth A.; Abramowitz, Carolyn S.; Lopez, Mabel; Kosson, David S.
2006-01-01
The utility of the psychopathy construct in predicting laboratory deficits, criminal behavior, response to intervention, and recidivism has been well documented in European American populations. However, less is known about the manifestation and correlates of psychopathy in Latino and African American populations. The present study examined the…
NASA Astrophysics Data System (ADS)
Peoples, Shelagh
The purpose of this study was to determine which of three competing models will provide, reliable, interpretable, and responsive measures of elementary students' understanding of the nature of science (NOS). The Nature of Science Instrument-Elementary (NOSI-E), a 28-item Rasch-based instrument, was used to assess students' NOS understanding. The NOS construct was conceptualized using five construct dimensions (Empirical, Inventive, Theory-laden, Certainty and Socially & Culturally Embedded). The competing models represent three internal models for the NOS construct. One postulate is that the NOS construct is unidimensional where one latent construct explains the relationship between the 28 items of the NOSI-E. Alternatively, the NOS construct is composed of five independent unidimensional constructs (the consecutive approach). Lastly, the NOS construct is multidimensional and composed of five inter-related but separate dimensions. A validity argument was developed that hypothesized that the internal structure of the NOS construct is best represented by the multidimensional Rasch model. Four sets of analyses were performed in which the three representations were compared. These analyses addressed five validity aspects (content, substantive, generalizability, structural and external) of construct validity. The vast body of evidence supported the claim that the NOS construct is composed of five separate but inter-related dimensions that is best represented by the multidimensional Rasch model. The results of the multidimensional analyses indicated that the items of the five subscales were of excellent technical quality, exhibited no differential item functioning (based on gender), had an item hierarchy that conformed to theoretical expectations; and together formed subscales of reasonable reliability (> 0.7 on each subscale) that were responsive to change in the construct. Theory-laden scores from the multidimensional model predicted students' science achievement with scores from all five NOS dimensions significantly predicting students' perceptions of the constructivist nature of their classroom learning environment. The NOSI-E instrument is a theoretically grounded scale that can measure elementary students' NOS understanding and appears suitable for use in science education research.
2015-01-01
Background The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. Methods and Results The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald’s omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for “somatic”, “depressed affect”, “positive affect” and “interpersonal problems” were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value <0.01 and all effect size statistics >0.2). The CES-D was externally responsive, with the AUC>0.7. Conclusions The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four-factor and bi-factor structure, the CES-D is supported for cross-cultural comparisons of depression in multi-center studies. PMID:26252739
Longitudinal Construct Validity of Brief Symptom Inventory Subscales in Schizophrenia
ERIC Educational Resources Information Center
Long, Jeffrey D.; Harring, Jeffrey R.; Brekke, John S.; Test, Mary Ann; Greenberg, Jan
2007-01-01
Longitudinal validity of Brief Symptom Inventory subscales was examined in a sample (N = 318) with schizophrenia-related illness measured at baseline and every 6 months for 3 years. Nonlinear factor analysis of items was used to test graded response models (GRMs) for subscales in isolation. The models varied in their within-time and between-times…
Structural Validity of the Professional Development Profile of the LoTi Digital-Age Survey
ERIC Educational Resources Information Center
Mehta, Vandhana; Hull, Darrell M.
2013-01-01
The present study was used to examine the structural construct validity of the Professional Development Profile of the LoTi Digital-Age Survey, a measure of teacher instructional practices with technology in the classroom. Teacher responses ("N" = 2,840) from across the United States were used to assess factor structure of the instrument…
Hift, Richard J
2014-11-28
Written assessments fall into two classes: constructed-response or open-ended questions, such as the essay and a number of variants of the short-answer question, and selected-response or closed-ended questions; typically in the form of multiple-choice. It is widely believed that constructed response written questions test higher order cognitive processes in a manner that multiple-choice questions cannot, and consequently have higher validity. An extensive review of the literature suggests that in summative assessment neither premise is evidence-based. Well-structured open-ended and multiple-choice questions appear equivalent in their ability to assess higher cognitive functions, and performance in multiple-choice assessments may correlate more highly than the open-ended format with competence demonstrated in clinical practice following graduation. Studies of construct validity suggest that both formats measure essentially the same dimension, at least in mathematics, the physical sciences, biology and medicine. The persistence of the open-ended format in summative assessment may be due to the intuitive appeal of the belief that synthesising an answer to an open-ended question must be both more cognitively taxing and similar to actual experience than is selecting a correct response. I suggest that cognitive-constructivist learning theory would predict that a well-constructed context-rich multiple-choice item represents a complex problem-solving exercise which activates a sequence of cognitive processes which closely parallel those required in clinical practice, hence explaining the high validity of the multiple-choice format. The evidence does not support the proposition that the open-ended assessment format is superior to the multiple-choice format, at least in exit-level summative assessment, in terms of either its ability to test higher-order cognitive functioning or its validity. This is explicable using a theory of mental models, which might predict that the multiple-choice format will have higher validity, a statement for which some empiric support exists. Given the superior reliability and cost-effectiveness of the multiple-choice format consideration should be given to phasing out open-ended format questions in summative assessment. Whether the same applies to non-exit-level assessment and formative assessment is a question which remains to be answered; particularly in terms of the educational effect of testing, an area which deserves intensive study.
Chew, Keng Sheng; Kueh, Yee Cheng; Abdul Aziz, Adlihafizi
2017-03-21
Despite their importance on diagnostic accuracy, there is a paucity of literature on questionnaire tools to assess clinicians' awareness toward cognitive errors. A validation study was conducted to develop a questionnaire tool to evaluate the Clinician's Awareness Towards Cognitive Errors (CATChES) in clinical decision making. This questionnaire is divided into two parts. Part A is to evaluate the clinicians' awareness towards cognitive errors in clinical decision making while Part B is to evaluate their perception towards specific cognitive errors. Content validation for both parts was first determined followed by construct validation for Part A. Construct validation for Part B was not determined as the responses were set in a dichotomous format. For content validation, all items in both Part A and Part B were rated as "excellent" in terms of their relevance in clinical settings. For construct validation using exploratory factor analysis (EFA) for Part A, a two-factor model with total variance extraction of 60% was determined. Two items were deleted. Then, the EFA was repeated showing that all factor loadings are above the cut-off value of >0.5. The Cronbach's alpha for both factors are above 0.6. The CATChES questionnaire tool is a valid questionnaire tool aimed to evaluate the awareness among clinicians toward cognitive errors in clinical decision making.
Adaptability: Conceptual and Empirical Perspectives on Responses to Change, Novelty and Uncertainty
ERIC Educational Resources Information Center
Martin, Andrew J.; Nejad, Harry; Colmar, Susan; Liem, Gregory Arief D.
2012-01-01
Adaptability is proposed as individuals' capacity to constructively regulate psycho-behavioral functions in response to new, changing, and/or uncertain circumstances, conditions and situations. The present investigation explored the internal and external validity of an hypothesised adaptability scale. The sample comprised 2,731 high school…
Cappelleri, Joseph C.; Lundy, J. Jason; Hays, Ron D.
2014-01-01
Introduction The U.S. Food and Drug Administration’s patient-reported outcome (PRO) guidance document defines content validity as “the extent to which the instrument measures the concept of interest” (FDA, 2009, p. 12). “Construct validity is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity” (Strauss & Smith, 2009, p. 7). Hence both qualitative and quantitative information are essential in evaluating the validity of measures. Methods We review classical test theory and item response theory approaches to evaluating PRO measures including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized “difficulty” (severity) order of items is represented by observed responses. Conclusion Classical test theory and item response theory can be useful in providing a quantitative assessment of items and scales during the content validity phase of patient-reported outcome measures. Depending on the particular type of measure and the specific circumstances, either one or both approaches should be considered to help maximize the content validity of PRO measures. PMID:24811753
Vuorenmaa, M; Halme, N; Åstedt-Kurki, P; Kaunonen, M; Perälä, M-L
2014-07-01
The Family Empowerment Scale (FES) is a widely used instrument which measures the parents' own sense of their empowerment at the level of the family, service system and community. It was originally developed for parents of children with emotional disabilities. The aims of this study were to evaluate the validity and reliability of the Finnish FES and to examine its responsiveness in measuring the empowerment of parents with small children. The English FES was translated into Finnish using back translation and modified so as to be generic and convenient for all families. The construct, convergent, discriminant and concurrent validities, reliability and responsiveness of the Finnish FES were examined. Participants (n = 955) were the parents of children aged 0-9 years who had been selected using stratified random sampling. Confirmatory factor analysis proved that the Finnish FES had three subscales based on the original FES. Convergent and discriminant validities confirmed and supported the same construct. The relationship between parents' participation and empowerment was tested for concurrent validity. As in previous FES studies, the participating parents were more empowered, which supported the concurrent validity. The reliability of the Finnish FES proved acceptable for both parents. The Finnish FES could also discriminate the responses of the parents. Participation in the activities organized by the family service system influenced parents' perceptions of empowerment more than did their background characteristics. The Finnish FES is a valid and reliable instrument and it is suitable for measuring the empowerment of parents. However, it is necessary to consider how the FES would identify in the best way the parents who perhaps need some help. © 2013 John Wiley & Sons Ltd.
Construct Validity of the Emotional Eating Scale Adapted for Children and Adolescents
Vannucci, Anna; Tanofsky-Kraff, Marian; Shomaker, Lauren B.; Ranzenhofer, Lisa M.; Matheson, Brittany E.; Cassidy, Omni L.; Zocca, Jaclyn M.; Kozlosky, Merel; Yanovski, Susan Z.; Yanovski, Jack A.
2012-01-01
Background Emotional eating, defined as eating in response to a range of negative emotions, is common in youth. Yet, there are few easily administered and well-validated methods to assess emotional eating in pediatric populations. Objective The current study tested the construct validity of the Emotional Eating Scale Adapted for Children and Adolescents (EES-C) by examining its relationship to observed emotional eating at laboratory test meals. Method One hundred fifty-one youth (8-18 years) participated in two multi-item lunch buffet meals on separate days. They ate ad libitum after being instructed to “eat as much as you would at a normal meal” or to “let yourself go and eat as much as you want.” State negative affect was assessed immediately prior to each meal. The EES-C was completed three months, on average, prior to the first test meal. Results Among youth with high EES-C total scores, but not low EES-C scores, higher pre-meal state negative affect was related to greater total energy intake at both meals, with and without the inclusion of age, race, sex, and BMI-z as covariates (ps < 0.03). Discussion The EES-C demonstrates good construct validity for children and adolescents’ observed energy intake across laboratory test meals designed to capture both normal and disinhibited eating. Future research is required to evaluate the construct validity of the EES-C in the natural environment and the predictive validity of the EES-C longitudinally. PMID:22124451
Hill, Bridget; Pallant, Julie; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea
2016-12-01
To evaluate the internal construct validity and dimensionality of a new patient-reported outcome measure for people with traumatic brachial plexus injury (BPI) based on the International Classification of Functioning, Disability and Health definition of activity. Cross-sectional study. Outpatient clinics. Adults (age range, 18-82y) with a traumatic BPI (N=106). There were 106 people with BPI who completed a 51-item 5-response questionnaire. Responses were analyzed in 4 phases (missing responses, item correlations, exploratory factor analysis, and Rasch analysis) to evaluate the properties of fit to the Rasch model, threshold response, local dependency, dimensionality, differential item functioning, and targeting. Not applicable, as this study addresses the development of an outcome measure. Six items were deleted for missing responses, and 10 were deleted for high interitem correlations >.81. The remaining 35 items, while demonstrating fit to the Rasch model, showed evidence of local dependency and multidimensionality. Items were divided into 3 subscales: dressing and grooming (8 items), arm and hand (17 items), and no hand (6 items). All 3 subscales demonstrated fit to the model with no local dependency, minimal disordered thresholds, no unidimensionality or differential item functioning for age, time postinjury, or self-selected dominance. Subscales were combined into 3 subtests and demonstrated fit to the model, no misfit, and unidimensionality, allowing calculation of a summary score. This preliminary analysis supports the internal construct validity of the Brachial Assessment Tool, a unidimensional targeted 4-response patient-reported outcome measure designed to solely assess activity after traumatic BPI regardless of level of injury, age at recruitment, premorbid limb dominance, and time postinjury. Further examination is required to determine test-retest reliability and responsiveness. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Using the Rasch Measurement Model in Psychometric Analysis of the Family Effectiveness Measure
McCreary, Linda L.; Conrad, Karen M.; Conrad, Kendon J.; Scott, Christy K; Funk, Rodney R.; Dennis, Michael L.
2013-01-01
Background Valid assessment of family functioning can play a vital role in optimizing client outcomes. Because family functioning is influenced by family structure, socioeconomic context, and culture, existing measures of family functioning--primarily developed with nuclear, middle class European American families--may not be valid assessments of families in diverse populations. The Family Effectiveness Measure was developed to address this limitation. Objectives To test the Family Effectiveness Measure with data from a primarily low-income African American convenience sample, using the Rasch measurement model. Method A sample of 607 adult women completed the measure. Rasch analysis was used to assess unidimensionality, response category functioning, item fit, person reliability, differential item functioning by race and parental status, and item hierarchy. Criterion-related validity was tested using correlations with five other variables related to family functioning. Results The Family Effectiveness Measure measures two separate constructs: The effective family functioning construct was a psychometrically sound measure of the target construct that was more efficient due to the deletion of 22 items. The ineffective family functioning construct consisted of 16 of those deleted items but was not as strong psychometrically. Items in both constructs evidenced no differential item functioning by race. Criterion-related validity was supported for both. Discussion In contrast to the prevailing conceptualization that family functioning is a single construct, assessed by positively and negatively worded items, use of the Rasch analysis suggested the existence of two constructs. While the effective family functioning is a strong and efficient measure of family functioning, the ineffective family functioning will require additional item development and psychometric testing. PMID:23636342
Arimura, Tatsuyuki; Hosoi, Masako; Tsukiyama, Yoshihiro; Yoshida, Toshiyuki; Fujiwara, Daiki; Tanaka, Masanori; Tamura, Ryuichi; Nakashima, Yasunori; Sudo, Nobuyuki; Kubo, Chiharu
2012-04-01
The present study aimed to develop a Japanese version of the Short-Form McGill Pain Questionnaire (SF-MPQ-J) that focuses on cross-culturally equivalence to the original English version and to test its reliability and validity. Cross-sectional design. In study 1, SF-MPQ was translated and adapted into Japanese. It included construction of response scales equivalent to the original using a variation of the Thurstone method of equal-appearing intervals. A total of 147 undergraduate students and 44 pain patients participated in the development of the Japanese response scales. To measure the equivalence of pain descriptors, 62 pain patients in four diagnostic groups were asked to choose pain descriptors that described their pain. In study 2, chronic pain patients (N=126) completed the SF-MPQ-J, the Long-Form McGill Pain Questionnaire Japanese version (LF-MPQ-J), and the 11-point numerical rating scale of pain intensity. Correlation analysis examined the construct validity of the SF-MPQ-J. The results from study 1 were used to develop SF-MPQ-J, which is linguistically equivalent to the original questionnaire. Response scales from SF-MPQ-J represented the original scale values. All pain descriptors, except one, were used by >33% in at least one of the four diagnostic groups. Study 2 exhibited adequate internal consistency and test-retest reliability, with the construct validity of SF-MPQ-J comparable to the original. These findings suggested that SF-MPQ-J is reliable, valid, and cross-culturally equivalent to the original questionnaire. Researchers might consider using this scale in multicenter, multi-ethnical trials or cross-cultural studies that include Japanese-speaking patients. Wiley Periodicals, Inc.
Llamas-Ramos, Inés; Llamas-Ramos, Rocío; Buz, José; Cortés-Rodríguez, María; Martín-Nogueras, Ana María
2018-06-01
The Memorial Symptom Assessment Scale (MSAS) is a self-rating instrument for the assessment of symptom distress in cancer patients. The Spanish version of the MSAS has recently been validated. However, we lack evidence of the internal construct validity of the shorter versions (short form [MSAS-SF] and condensed form [CMSAS]). In addition, rigorous testing of these scales with modern psychometric methods is needed. The aim of this study was to evaluate the internal construct validity and reliability of the Spanish versions of the MSAS-SF and CMSAS in oncology outpatients using Rasch analysis. Data from a convenience sample of oncology outpatients receiving chemotherapy (n = 306; mean age 60 years; 63% women) at a university hospital were analyzed. The Rasch unidimensional measurement model was used to examine response category functioning, item hierarchy, targeting, unidimensionality, reliability, and differential item functioning by age, gender, and marital status. The response category structure of the symptom distress items was improved by collapsing two categories. The scales were adequately targeted to the study patients, showed overall Rasch model fit (mean Infit MnSq ranged from 0.98 to 1.05), met criteria for unidimensionality, and the reliability of scores was good (person reliability > 0.80), except for the CMSAS prevalence scale. Only four items showed differential item functioning. The present study demonstrated that the Spanish versions of the MSAS-SF and CMSAS have adequate psychometric properties to evaluate symptom distress in oncology outpatients. Additional studies of the CMSAS are recommended. Copyright © 2018 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Scheerhagen, Marisja; van Stel, Henk F.; Birnie, Erwin; Franx, Arie; Bonsel, Gouke J.
2015-01-01
Background Maternity care is an integrated care process, which consists of different services, involves different professionals and covers different time windows. To measure performance of maternity care based on clients' experiences, we developed and validated a questionnaire. Methods and Findings We used the 8-domain WHO Responsiveness model, and previous materials to develop a self-report questionnaire. A dual study design was used for development and validation. Content validity of the ReproQ-version-0 was determined through structured interviews with 11 pregnant women (≥28 weeks), 10 women who recently had given birth (≤12 weeks), and 19 maternity care professionals. Structured interviews established the domain relevance to the women; all items were separately commented on. All Responsiveness domains were judged relevant, with Dignity and Communication ranking highest. Main missing topic was the assigned expertise of the health professional. After first adaptation, construct validity of the ReproQ-version-1 was determined through a web-based survey. Respondents were approached by maternity care organizations with different levels of integration of services of midwives and obstetricians. We sent questionnaires to 605 third trimester pregnant women (response 65%), and 810 women 6 weeks after delivery (response 55%). Construct validity was based on: response patterns; exploratory factor analysis; association of the overall score with a Visual Analogue Scale (VAS), known group comparisons. Median overall ReproQ score was 3.70 (range 1–4) showing good responsiveness. The exploratory factor analysis supported the assumed domain structure and suggested several adaptations. Correlation of the VAS rating and overall ReproQ score (antepartum, postpartum) supported validity (r = 0.56; 0.59, p<0.001 Spearman's correlation coefficient). Pre-stated group comparisons confirmed the expected difference following a good vs. adverse birth outcome. Fully integrated organizations performed slightly better (median = 3.78) than less integrated organizations (median = 3.63; p<0.001). Participation rate of women with a low educational level and/or a non-western origin was low. Conclusions The ReproQ appears suitable for assessing quality of maternity care from the clients' perspective. Recruitment of disadvantaged groups requires additional non-digital approaches. PMID:25671310
Comins, J D; Krogsgaard, M R; Kreiner, S; Brodersen, J
2013-10-01
The benefit of anterior cruciate ligament (ACL) reconstruction has been questioned based on patient-reported outcome measures (PROMs). Valid interpretation of such results requires confirmation of the psychometric properties of the PROM. Rasch analysis is the gold standard for validation of PROMs, yet PROMs used for ACL reconstruction have not been validated using Rasch analysis. We used Rasch analysis to investigate the psychometric properties of the Knee Numeric-Entity Evaluation Score (KNEES-ACL), a newly developed PROM for patients treated for ACL deficiency. Two-hundred forty-two patients pre- and post-ACL reconstruction completed the pilot PROM. Rasch models were used to assess the psychometric properties (e.g., unidimensionality, local response dependency, and differential item functioning). Forty-one items distributed across seven unidimensional constructs measuring impairment, functional limitations, and psychosocial consequences were confirmed to fit Rasch models. Fourteen items were removed because of statistical lack of fit and inadequate face validity. Local response dependency and differential item functioning were identified and adjusted. The KNEES-ACL is the first Rasch-validated condition-specific PROM constructed for patients with ACL deficiency and patients with ACL reconstruction. Thus, this instrument can be used for within- and between-group comparisons. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Translation and Validation of the Dutch Western Ontario Meniscal Evaluation Tool.
van der Wal, Robert J P; Heemskerk, Bastiaan T J; van Arkel, Ewoud R A; Mokkink, Lidwine B; Thomassen, Bregje J W
2017-05-01
The aim of this study was to translate the Western Ontario Meniscal Evaluation Tool (WOMET) into Dutch, to evaluate the content validity, construct validity, reliability, and responsiveness, and to determine the minimal important change (MIC) of the Dutch version. The WOMET was translated into Dutch according to a standardized forward-backward translation protocol. A total of 86 patients (51 males, 35 females, median age 52 years [interquartile range, 43-60 years]) with isolated meniscal pathology were included. The WOMET was completed three times; at baseline, around 2 weeks, and after 3 months from the baseline. Knee injury and osteoarthritis outcome score, International Knee Documentation Committee subjective knee form, short-form 36, and an anchor question were also answered. There were good results for content validity (floor and ceiling effects [< 15%]), construct validity (79% of the predefined hypotheses were confirmed), internal consistency (Cronbach α = 0.87, 0.79, and 0.86 for each subscale score), test-retest reliability (intraclass correlation coefficient = 0.78 for total WOMET score), and responsiveness (79% of the predefined hypotheses were confirmed). The smallest detectable change and MIC for the Dutch WOMET are 20.5 and 14.7, respectively. The Dutch version of the WOMET is valid and reliable for assessing health-related quality of life in patients with meniscal pathology. Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.
Mars, Godelief M J; van Eijk, Jacques Th M; Post, Marcel W M; Proot, Ireen M; Mesters, Ilse; Kempen, Gertrudis I J M
2014-08-01
To develop and test the Maastricht Personal Autonomy Questionnaire (MPAQ), an instrument measuring personal autonomy of older adults with a chronic physical illness in accordance with their experience of autonomy. Achievement of personal autonomy is conceptualized as correspondence between the way people's lives are actually arranged and the way people want to arrange their lives. A field test was conducted in three waves (n = 412, n = 125 and n = 244) among a random sample of people older than 59 years with either chronic obstructive pulmonary disease or diabetes mellitus. Construct validity, reproducibility and responsiveness were evaluated. The MPAQ entailing 16 items consists of three scales: degree of (personal) autonomy, working on autonomy and dilemmas. Construct validity was largely supported by confirmatory factor analysis and correlations between the MPAQ and other instruments. Intraclass correlation coefficients ranged from 0.61 to 0.80 and SRDsgroup from 0.10 to 0.13. Mean change was larger (0.54) than was SRDgroup (0.11) in patients who had deteriorated, but smaller in patients who had improved (0.07). The MPAQ has good content and construct validity and moderate reproducibility. Responsiveness is weak, although better for deterioration than for improvement.
van der Lee, J H; Beckerman, H; Knol, D L; de Vet, H C W; Bouter, L M
2004-06-01
The Motor Activity Log (MAL) is a semistructured interview for hemiparetic stroke patients to assess the use of their paretic arm and hand (amount of use [AOU]) and quality of movement [QOM]) during activities of daily living. Scores range from 0 to 5. The following clinimetric properties of the MAL were quantified: internal consistency (Cronbach alpha), test-retest agreement (Bland and Altman method), cross-sectional construct validity (correlation between AOU and QOM and with the Action Research Arm [ARA] test), longitudinal construct validity (correlation of change on the MAL during the intervention with a global change rating [GCR] and with change on the ARA), and responsiveness (effect size). Two baseline measurements 2 weeks apart and 1 follow-up measurement immediately after 2 weeks of intensive exercise therapy either with or without immobilization of the unimpaired arm (forced use) were performed in 56 chronic stroke patients. Internal consistency was high (AOU: alpha=0.88; QOM: alpha=0.91). The limits of agreement were -0.70 to 0.85 and -0.61 to 0.71 for AOU and QOM, respectively. The correlation with the ARA score (Spearman rho) was 0.63 (AOU and QOM). However, the improvement on the MAL during the intervention was only weakly related to the GCR and to the improvement on the ARA, Spearman rho was between 0.16 and 0.22. The responsiveness ratio was 1.9 (AOU) and 2.0 (QOM). The MAL is internally consistent and relatively stable in chronic stroke patients not undergoing an intervention. The cross-sectional construct validity of the MAL is reasonable, but the results raise doubt about its longitudinal construct validity.
Moving to Capture Children’s Attention: Developing a Methodology for Measuring Visuomotor Attention
Coats, Rachel O.; Mushtaq, Faisal; Williams, Justin H. G.; Aucott, Lorna S.; Mon-Williams, Mark
2016-01-01
Attention underpins many activities integral to a child’s development. However, methodological limitations currently make large-scale assessment of children’s attentional skill impractical, costly and lacking in ecological validity. Consequently we developed a measure of ‘Visual Motor Attention’ (VMA)—a construct defined as the ability to sustain and adapt visuomotor behaviour in response to task-relevant visual information. In a series of experiments, we evaluated the capability of our method to measure attentional processes and their contributions in guiding visuomotor behaviour. Experiment 1 established the method’s core features (ability to track stimuli moving on a tablet-computer screen with a hand-held stylus) and demonstrated its sensitivity to principled manipulations in adults’ attentional load. Experiment 2 standardised a format suitable for use with children and showed construct validity by capturing developmental changes in executive attention processes. Experiment 3 tested the hypothesis that children with and without coordination difficulties would show qualitatively different response patterns, finding an interaction between the cognitive and motor factors underpinning responses. Experiment 4 identified associations between VMA performance and existing standardised attention assessments and thereby confirmed convergent validity. These results establish a novel approach to measuring childhood attention that can produce meaningful functional assessments that capture how attention operates in an ecologically valid context (i.e. attention's specific contribution to visuomanual action). PMID:27434198
Berings, Marjolein G M C; Poell, Rob F; Simons, P Robert-Jan; van Veldhoven, Marc J P M
2007-06-01
This paper is a report of a study to develop and test the psychometric properties of the On-the-job Learning Style Questionnaire for the Nursing Profession. Although numerous questionnaires measuring learning styles have been developed, none are suitable for working environments. Existing instruments do not meet the requirements for use in workplace settings and tend to ignore the influence of different learning situations. The questionnaire was constructed using a situation-response design, measuring learning activities in different on-the-job learning situations. Content validity was ensured by basing the questionnaire on interview studies. The questionnaire was distributed to 912 Registered Nurses working in different departments of 13 general hospitals in the Netherlands at the end of 2005. The response rate was 41% (372 questionnaires). The internal factor structure of the questionnaire was partly based on the learning activities in which nurses participate and partly on the learning situation in which they are performed. The internal consistency was good. The situation-response design of the questionnaire demonstrated its added value. Construct validity was estimated using intercorrelations between the scales, and criterion validity was estimated based on the relationships of the scales with perceived professional competence. The On-the-job Learning Styles Questionnaire for the Nursing Profession is well suited to describing nurses' learning styles in on-the-job settings and has satisfactory psychometric properties.
[Evaluation of Suicide Risk Levels in Hospitals: Validity and Reliability Tests].
Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen
2018-05-01
Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m = .54, p < .01). Within the scope of construct validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Fermont, Jilles M; Blazeby, Jane M; Rogers, Chris A; Wordsworth, Sarah
2017-01-01
Bariatric surgery is considered an effective treatment for individuals with severe and complex obesity. Besides reducing weight and improving obesity related comorbidities such as diabetes, bariatric surgery could improve patients' health-related quality of life. However, the frequently used instrument to measure quality of life, the EQ-5D has not been validated for use in bariatric surgery, which is a major limitation to its use in this clinical context. Our study undertook a psychometric validation of the 5 level EQ-5D (EQ-5D-5L) using clinical trial data to measure health-related quality of life in patients with severe and complex obesity undergoing bariatric surgery. Health-related quality of life was assessed at baseline (before randomisation) and six months later in 189 patients in a randomised controlled trial of bariatric surgery. Patients completed two generic health-related quality of life instruments, the EQ-5D-5L and SF-12, which were used together for the validation using data from all patients in the trial as the trial is ongoing. Psychometric analyses included construct and criterion validity and responsiveness to change. Of the 189 validation patients, 141 (75%) were female, the median age was 49 years old (range 23-70 years) and body mass index ranged from 33-70 kg/m2. For construct validity, there were significant improvements in the distribution of responses in all EQ-5D dimensions between baseline and 6 months after randomisation. For criterion validity, the highest degree of correlation was between the EQ-5D pain/discomfort and SF-12 bodily pain domain. For responsiveness the EQ-5D and SF-12 showed statistically significant improvements in health-related quality of life between baseline and 6 months after randomisation. The EQ-5D-5L is a valid generic measure for measuring health-related quality of life in bariatric surgery patients.
Huber, Adam M; Feldman, Brian M; Rennebohm, Robert M; Hicks, Jeanne E; Lindsley, Carol B; Perez, Maria D; Zemel, Lawrence S; Wallace, Carol A; Ballinger, Susan H; Passo, Murray H; Reed, Ann M; Summers, Ronald M; White, Patience H; Katona, Ildy M; Miller, Frederick W; Lachenbruch, Peter A; Rider, Lisa G
2004-05-01
To examine the measurement characteristics of the Childhood Myositis Assessment Scale (CMAS) in children with juvenile idiopathic inflammatory myopathy (juvenile IIM), and to obtain preliminary data on the clinical significance of CMAS scores. One hundred eight children with juvenile IIM were evaluated on 2 occasions, 7-9 months apart, using various measures of physical function, strength, and disease activity. Interrater reliability, construct validity, and responsiveness of the CMAS were examined. The minimum clinically important difference (MID) and CMAS scores corresponding to various degrees of physical disability were estimated. The intraclass correlation coefficient for 26 patients assessed by 2 examiners was 0.89, indicating very good interrater reliability. The CMAS score correlated highly with the Childhood Health Assessment Questionnaire (C-HAQ) score and with findings on manual muscle testing (MMT) (r(s) = -0.73 and 0.73, respectively) and moderately with physician-assessed global disease activity and skin activity, parent-assessed global disease severity, and muscle magnetic resonance imaging (r(s) = -0.44 to -0.61), thereby demonstrating good construct validity. The standardized response mean was 0.81 (95% confidence interval 0.53, 1.09) in patients with at least 0.8 cm improvement on a 10-cm visual analog scale for physician-assessed global disease activity, indicating strong responsiveness. In bivariate regression models predicting physician-assessed global disease activity, MMT remained significant in models containing the CMAS (P = 0.03) while the C-HAQ did not (P = 0.4). Estimates of the MID ranged from 1.5 to 3.0 points on a 0-52-point scale. CMAS scores corresponding to no, mild, mild-to-moderate, and moderate physical disability, respectively, were 48, 45, 39, and 30. The CMAS exhibits good reliability, construct validity, and responsiveness, and is therefore a valid instrument for the assessment of physical function, muscle strength, and endurance in children with juvenile IIM. Preliminary data on MID and corresponding levels of disability should aid in the clinical interpretation of CMAS scores when assessing patients with juvenile IIM.
Hopwood, Tanya L; Schutte, Nicola S; Loi, Natasha M
2017-09-01
Two studies, with a total of 707 participants, developed and examined the reliability and validity of a measure for anticipatory traumatic reaction (ATR), a novel construct describing a form of distress that may occur in response to threat-related media reports and discussions. Exploratory and confirmatory factor analysis resulted in a scale comprising three subscales: feelings related to future threat; preparatory thoughts and actions; and disruption to daily activities. Internal consistency was .93 for the overall ATR scale. The ATR scale demonstrated convergent validity through associations with negative affect, depression, anxiety, stress, neuroticism, and repetitive negative thinking. The scale showed discriminant validity in relationships to Big Five characteristics. The ATR scale had some overlap with a measure of posttraumatic stress disorder, but also showed substantial separate variance. This research provides preliminary evidence for the novel construct of ATR as well as a measure of the construct. The ATR scale will allow researchers to further investigate anticipatory traumatic reaction in the fields of trauma, clinical practice, and social psychology.
A signal detection-item response theory model for evaluating neuropsychological measures.
Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Risbrough, Victoria B; Baker, Dewleen G
2018-02-05
Models from signal detection theory are commonly used to score neuropsychological test data, especially tests of recognition memory. Here we show that certain item response theory models can be formulated as signal detection theory models, thus linking two complementary but distinct methodologies. We then use the approach to evaluate the validity (construct representation) of commonly used research measures, demonstrate the impact of conditional error on neuropsychological outcomes, and evaluate measurement bias. Signal detection-item response theory (SD-IRT) models were fitted to recognition memory data for words, faces, and objects. The sample consisted of U.S. Infantry Marines and Navy Corpsmen participating in the Marine Resiliency Study. Data comprised item responses to the Penn Face Memory Test (PFMT; N = 1,338), Penn Word Memory Test (PWMT; N = 1,331), and Visual Object Learning Test (VOLT; N = 1,249), and self-report of past head injury with loss of consciousness. SD-IRT models adequately fitted recognition memory item data across all modalities. Error varied systematically with ability estimates, and distributions of residuals from the regression of memory discrimination onto self-report of past head injury were positively skewed towards regions of larger measurement error. Analyses of differential item functioning revealed little evidence of systematic bias by level of education. SD-IRT models benefit from the measurement rigor of item response theory-which permits the modeling of item difficulty and examinee ability-and from signal detection theory-which provides an interpretive framework encompassing the experimentally validated constructs of memory discrimination and response bias. We used this approach to validate the construct representation of commonly used research measures and to demonstrate how nonoptimized item parameters can lead to erroneous conclusions when interpreting neuropsychological test data. Future work might include the development of computerized adaptive tests and integration with mixture and random-effects models.
Korzeniowski, Przemyslaw; Brown, Daniel C; Sodergren, Mikael H; Barrow, Alastair; Bello, Fernando
2017-02-01
The goal of this study was to establish face, content, and construct validity of NOViSE-the first force-feedback enabled virtual reality (VR) simulator for natural orifice transluminal endoscopic surgery (NOTES). Fourteen surgeons and surgical trainees performed 3 simulated hybrid transgastric cholecystectomies using a flexible endoscope on NOViSE. Four of them were classified as "NOTES experts" who had independently performed 10 or more simulated or human NOTES procedures. Seven participants were classified as "Novices" and 3 as "Gastroenterologists" with no or minimal NOTES experience. A standardized 5-point Likert-type scale questionnaire was administered to assess the face and content validity. NOViSE showed good overall face and content validity. In 14 out of 15 statements pertaining to face validity (graphical appearance, endoscope and tissue behavior, overall realism), ≥50% of responses were "agree" or "strongly agree." In terms of content validity, 85.7% of participants agreed or strongly agreed that NOViSE is a useful training tool for NOTES and 71.4% that they would recommend it to others. Construct validity was established by comparing a number of performance metrics such as task completion times, path lengths, applied forces, and so on. NOViSE demonstrated early signs of construct validity. Experts were faster and used a shorter endoscopic path length than novices in all but one task. The results indicate that NOViSE authentically recreates a transgastric hybrid cholecystectomy and sets promising foundations for the further development of a VR training curriculum for NOTES without compromising patient safety or requiring expensive animal facilities.
Bourion-Bédès, Stéphanie; Schwan, Raymund; Epstein, Jonathan; Laprevote, Vincent; Bédès, Alex; Bonnet, Jean-Louis; Baumann, Cédric
2015-02-01
The study aimed to examine the construct validity and reliability of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF) according to both classical test and item response theories. The psychometric properties of the French version of this instrument were investigated in a cross-sectional, multicenter study. A total of 124 outpatients with a substance dependence diagnosis participated in the study. Psychometric evaluation included descriptive analysis, internal consistency, test-retest reliability, and validity. The dimensionality of the instrument was explored using a combination of the classical test, confirmatory factor analysis (CFA), and an item response theory analysis, the Person Separation Index (PSI), in a complementary manner. The results of the Q-LES-Q-SF revealed that the questionnaire was easy to administer and the acceptability was good. The internal consistency and the test-retest reliability were 0.9 and 0.88, respectively. All items were significantly correlated with the total score and the SF-12 used in the study. The CFA with one factor model was good, and for the unidimensional construct, the PSI was found to be 0.902. The French version of the Q-LES-Q-SF yielded valid and reliable clinical assessments of the quality of life for future research and clinical practice involving French substance abusers. In response to recent questioning regarding the unidimensionality or bidimensionality of the instrument and according to the underlying theoretical unidimensional construct used for its development, this study suggests the Q-LES-Q-SF as a one-dimension questionnaire in French QoL studies.
Aspesberro, François; Fesinmeyer, Megan D; Zhou, Chuan; Zimmerman, Jerry J; Mangione-Smith, Rita
2016-06-01
To assess the construct validity and the responsiveness of the Pediatric Quality of Life Inventory 4.0 Generic Core Scales and Infant Scales in the medical-surgical (PICU) and cardiac PICU. Prospective cohort study of 367 inpatients admitted either to the PICU or the cardiac ICU at Seattle Children's Hospital from January 2012 to June 2013. Parent/caregiver and child (≥ 8 yr old, developmentally appropriate, and critical illness resolved) Pediatric Quality of Life Inventory scores were obtained within 24 hours of PICU/cardiac ICU discharge and subsequently at 4-12 weeks following hospital discharge. Of the 491 eligible participants invited to participate, 367 (74.7% response rate) completed the Pediatric Quality of Life Inventory survey at ICU discharge, and of these, 263 (71.7% follow-up response rate) completed the follow-up survey 4-12 weeks after hospital discharge. Responsiveness was assessed by calculating improvement scores (difference between follow-up and ICU discharge scores, Δ Pediatric Quality of Life Inventory). Construct validity was examined by comparing mean improvement scores for known groups differing by medical complexity. At follow-up, [INCREMENT] Pediatric Quality of Life Inventory scores were as follows (mean ± SD): physical domain, 34.8 ± 32.0; and psychosocial domain, 23.1 ± 23.5. Patients with complex chronic or noncomplex chronic disease had physical functioning improvement scores that were 17.4 points (95% CI, -28.3 to -6.5; p < 0.001) and 19.5 points (95% CI, -30.4 to -8.5; p < 0.002) lower than children with no chronic illness, respectively. Patients with complex chronic disease exhibited psychosocial improvement scores that were 9.6 points (95% CI, -18.4 to -0.8; p < 0.033) lower than patients without chronic disease. Patients with noncomplex chronic disease had similar psychosocial improvement scores when compared with patients without chronic disease. As a measure of health-related quality of live, Pediatric Quality of Life Inventory demonstrated responsiveness and construct validity in a broad population of critically ill children. This measure represents a patient-centered clinically meaningful patient-or-parent-reported outcome measure for pediatric research assessing the clinical effectiveness of PICU/cardiac ICU interventions. When using health-related quality of life recovery as an outcome measure to assess clinical effectiveness in the PICU/cardiac ICU setting, measuring and controlling for the level of medical complexity is important in order to understand the true impact of clinical interventions.
Lugo, Josefina; Nápoles, Misleidy; Pérez, Inés; Ordaz, Niurka; Luzardo, Mario; Fernández, Leticia
2014-01-01
INTRODUCTION Although modern technology has extended the survival of breast cancer patients, treatment's adverse effects impact their health-related quality of life. Currently, no instrument exists capable of identifying the range of problems affecting breast cancer patients receiving radiotherapy in Cuba's socioeconomic and cultural context. OBJECTIVES Construct and validate an instrument to measure the effects of breast cancer and radiotherapy on health-related quality of life in Cuban patients. METHODS The study was conducted at the Oncology and Radiobiology Institute, Havana, Cuba, from January 2010 through December 2011. Inclusion criteria were: adult female, histological diagnosis of breast cancer, treated with ambulatory radiotherapy, and written informed consent; patients unable to communicate orally or in writing, or who had neurologic or psychiatric conditions were excluded. Development phase: focus groups guided by a list of questions were carried out with 50 women. The patients reported 61 problems affecting their health-related quality-of-life. A nominal group (six oncologists and two nurses) identified the same problems. A syntactic analysis of the information was performed to create items for study and measurement scales. Content validity was determined by a nominal group of seven experts using professional judgment. Another 20 patients were selected to evaluate face validity. Validation phase: the instrument was applied to 230 patients at three different points: before radiotherapy, at the end of radiotherapy and four weeks after radiotherapy was concluded. Reliability, construct validity, discriminant validity, predictive validity, interpretability and response burden were evaluated. RESULTS The final instrument developed had 33 items distributed in 4 domains: physical functioning, psychological functioning, social and family relationships, and physical and emotional adverse effects of disease and treatment. There were two discrete items: perceived general health and perceived health-related quality of life. Content validity and face validity were assessed as acceptable, by experts and patients respectively. Homogeneity, construct validity, and discriminant validity were satisfactory. The best results were obtained with test-retest reliability, predictive validity, and interpretability; the low rate of unanswered questions indicated that the instrument did not produce excessive patient response burden. CONCLUSION The new instrument fulfilled the requirements for measuring impact of breast cancer and of radiotherapy on health-related quality of life in these Cuban patients, validating its usefulness for inclusion in clinical trial protocols.
van der Kooy, Jacoba; Valentine, Nicole B; Birnie, Erwin; Vujkovic, Marijana; de Graaf, Johanna P; Denktaş, Semiha; Steegers, Eric A P; Bonsel, Gouke J
2014-12-03
The concept of responsiveness, introduced by the World Health Organization (WHO), addresses non-clinical aspects of health service quality that are relevant regardless of provider, country, health system or health condition. Responsiveness refers to "aspects related to the way individuals are treated and the environment in which they are treated" during health system interactions. This paper assesses the psychometric properties of a newly developed responsiveness questionnaire dedicated to evaluating maternal experiences of perinatal care services, called the Responsiveness in Perinatal and Obstetric Health Care Questionnaire (ReproQ), using the eight-domain WHO concept. The ReproQ was developed between October 2009 and February 2010 by adapting the WHO Responsiveness Questionnaire items to the perinatal care context. The psychometric properties of feasibility, construct validity, and discriminative validity were empirically assessed in a sample of Dutch women two weeks post partum. A total of 171 women consented to participation. Feasibility: the interviews lasted between 20 and 40 minutes and the overall missing rate was 8%. Construct validity: mean Cronbach's alphas for the antenatal, birth and postpartum phase were: 0.73 (range 0.57-0.82), 0.84 (range 0.66-0.92), and 0.87 (range 0.62-0.95) respectively. The item-own scale correlations within all phases were considerably higher than most of the item-other scale correlations. Within the antenatal care, birth care and post partum phases, the eight factors explained 69%, 69%, and 76% of variance respectively. Discriminative validity: overall responsiveness mean sum scores were higher for women whose children were not admitted. This confirmed the hypothesis that dissatisfaction with health outcomes is transferred to their judgement on responsiveness of the perinatal services. The ReproQ interview-based questionnaire demonstrated satisfactory psychometric properties to describe the quality of perinatal care in the Netherlands, with the potential to discriminate between different levels of quality of care. In view of the relatively small sample, further testing and research is recommended.
McAuley, Tara; White, Desirée
2010-01-01
The present study addressed three related aims: (1) to replicate and extend previous work regarding the non-unitary nature of processing speed, response inhibition, and working memory during development, (2) to quantify the rate at which processing speed, response inhibition, and working memory develop and the extent to which the development of these latter abilities reflect general changes in processing speed, and (3) to evaluate whether commonly used tasks of processing speed, response inhibition, and working memory are valid and reliable when used with a developmentally diverse group. To address these aims, a latent variables approach was used to analyze data from 147 participants 6 to 24 years of age. Results showed that processing speed, response inhibition, and working memory were separable abilities and that the extent of this separability was stable cross the age range of participants. All three constructs improved as a function of age; however, only the effect of age on working memory remained significant after processing speed was controlled. The psychometric properties of tasks used to assess the constructs were age invariant, thus validating their use in studies of executive development. PMID:20888572
Richter, Tobias; Schroeder, Sascha; Wöhrmann, Britta
2009-03-01
In social cognition, knowledge-based validation of information is usually regarded as relying on strategic and resource-demanding processes. Research on language comprehension, in contrast, suggests that validation processes are involved in the construction of a referential representation of the communicated information. This view implies that individuals can use their knowledge to validate incoming information in a routine and efficient manner. Consistent with this idea, Experiments 1 and 2 demonstrated that individuals are able to reject false assertions efficiently when they have validity-relevant beliefs. Validation processes were carried out routinely even when individuals were put under additional cognitive load during comprehension. Experiment 3 demonstrated that the rejection of false information occurs automatically and interferes with affirmative responses in a nonsemantic task (epistemic Stroop effect). Experiment 4 also revealed complementary interference effects of true information with negative responses in a nonsemantic task. These results suggest the existence of fast and efficient validation processes that protect mental representations from being contaminated by false and inaccurate information.
Ó Ciardha, Caoilte; Attard-Johnson, Janice; Bindemann, Markus
2018-04-01
Latency-based measures of sexual interest require additional evidence of validity, as do newer pupil dilation approaches. A total of 102 community men completed six latency-based measures of sexual interest. Pupillary responses were recorded during three of these tasks and in an additional task where no participant response was required. For adult stimuli, there was a high degree of intercorrelation between measures, suggesting that tasks may be measuring the same underlying construct (convergent validity). In addition to being correlated with one another, measures also predicted participants' self-reported sexual interest, demonstrating concurrent validity (i.e., the ability of a task to predict a more validated, simultaneously recorded, measure). Latency-based and pupillometric approaches also showed preliminary evidence of concurrent validity in predicting both self-reported interest in child molestation and viewing pornographic material containing children. Taken together, the study findings build on the evidence base for the validity of latency-based and pupillometric measures of sexual interest.
ERIC Educational Resources Information Center
Teo, Timothy
2010-01-01
Purpose: The purpose of this paper is to assess the cross-cultural validity of the technology acceptance measure for pre-service teachers (TAMPST) on a Malaysian sample. Design/methodology/approach: A total of 193 pre-service teachers from a Malaysian university completed a survey questionnaire measuring their responses to five constructs in the…
Owolabi, Mayowa O
2014-01-01
Teacher's attitude domain, a pivotal aspect of clinical teaching, is missing in the Stanford Faculty Development Program Questionnaire (SFDPQ), the most widely used student-based assessment method of clinical teaching skills. This study was conducted to develop and validate the teacher's attitude domain and evaluate the validity and internal consistency reliability of the augmented SFDPQ. Items generated for the new domain included teacher's enthusiasm, sobriety, humility, thoroughness, empathy, and accessibility. The study involved 20 resident doctors assessed once by 64 medical students using the augmented SFDPQ. Construct validity was explored using correlation among the different domains and a global rating scale. Factor analysis was performed. The response rate was 94%. The new domain had a Cronbach's alpha of 0.89, with 1-factor solution explaining 57.1% of its variance. It showed the strongest correlation to the global rating scale (rho = 0.71). The augmented SFDPQ, which had a Cronbach's alpha of 0.93, correlated better (rho = 0.72, p < 0.00001) to the global rating scale than the original SFDPQ (rho = 0.67, p < 0.00001). The new teacher's attitude domain exhibited good internal consistency and construct and factorial validity. It enhanced the content and construct validity of the SFDPQ. The validated construct of the augmented SFDPQ is recommended for design and evaluation of basic and continuing clinical teaching programs. © 2014 The Alliance for Continuing Education in the Health Professions, the Society for Academic Continuing Medical Education, and the Council on Continuing Medical Education, Association for Hospital Medical Education.
Internal consistency and validity of a new physical workload questionnaire
Bot, S; Terwee, C; van der Windt, D A W M; Feleus, A; Bierma-Zeinstra, S; Knol, D; Bouter, L; Dekker, J
2004-01-01
Aims: To examine the dimensionality, internal consistency, and construct validity of a new physical workload questionnaire in employees with musculoskeletal complaints. Methods: Factor analysis was applied to the responses in three study populations with musculoskeletal disorders (n = 406, 300, and 557) on 26 items related to physical workload. The internal consistency of the resulting subscales was examined. It was hypothesised that physical workload would vary among different occupational groups. The occupations of all subjects were classified into four groups on the basis of expected workload (heavy physical load; long lasting postures and repetitive movements; both; no physical load). Construct validity of the subscales created was tested by comparing the subscale scores among these occupational groups. Results: The pattern of the factor loadings of items was almost identical for the three study populations. Two interpretable factors were found: items related to heavy physical workload loaded highly on the first factor, and items related to static postures or repetitive work loaded highly on the second factor. The first constructed subscale "heavy physical work" had a Cronbach's α of 0.92 to 0.93 and the second subscale "long lasting postures and repetitive movements", of 0.86 to 0.87. Six of eight hypotheses regarding the construct validity of the subscales were confirmed. Conclusions: The results support the internal structure, internal consistency, and validity of the new physical workload questionnaire. Testing this questionnaire in non-symptomatic employees and comparing its performance with objective assessments of physical workload are important next steps in the validation process. PMID:15550603
Kiltz, Uta; van der Heijde, Désirée; Boonen, Annelies; Akkoc, Nurullah; Bautista-Molano, Wilson; Burgos-Vargas, Ruben; Wei, James Cheng-Chung; Chiowchanwisawakit, Praveena; Dougados, Maxime; Duruoz, M Tuncay; Elzorkany, Bassel Kamal; Gaydukova, Inna; Gensler, Lianne S; Gilio, Michele; Grazio, Simeon; Gu, Jieruo; Inman, Robert D; Kim, Tae-Jong; Navarro-Compan, Victoria; Marzo-Ortega, Helena; Ozgocmen, Salih; Pimentel Dos Santos, Fernando; Schirmer, Michael; Stebbings, Simon; Van den Bosch, Filip E; van Tubergen, Astrid; Braun, Juergen
2018-06-01
To evaluate construct validity, interpretability, reliability and responsiveness as well as determination of cut-off points for good and poor health within the original English version and the 18 translations of the disease-specific Assessment of Spondyloarthritis international Society Health Index (ASAS HI) in 23 countries worldwide in patients with spondyloarthritis (SpA). A representative sample of patients with SpA fulfilling the ASAS classification criteria for axial (axSpA) or peripheral SpA was used. The construct validity of the ASAS HI was tested using Spearman correlation with several standard health outcomes for axSpA. Test-retest reliability was assessed by intraclass correlation coefficients (ICCs) in patients with stable disease (interval 4-7 days). In patients who required an escalation of therapy because of high disease activity, responsiveness was tested after 2-24weeks using standardised response mean (SRM). Among the 1548 patients, 64.9% were men, with a mean (SD) age 42.0 (13.4) years. Construct validity ranged from low (age: 0.10) to high (Bath AnkylosingSpondylitisFunctioning Index: 0.71). Internal consistency was high (Cronbach's α of 0.93). The reliability among 578 patients was good (ICC=0.87 (95% CI 0.84 to 0.89)). Responsiveness among 246 patients was moderate-large (SRM=-0.44 for non-steroidal anti-inflammatory drugs, -0.69 for conventional synthetic disease-modifying antirheumatic drug and -0.85 for tumour necrosis factor inhibitor). The smallest detectable change was 3.0. Values ≤5.0 have balanced specificity to distinguish good health as opposed to moderate health, and values ≥12.0 are specific to represent poor health as opposed to moderate health. The ASAS HI proved to be valid, reliable and responsive. It can be used to evaluate the impact of SpA and its treatment on functioning and health. Furthermore, comparison of disease impact between populations is possible. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
The Rorschach texture response: a construct validation study using attachment theory.
Cassella, Michael J; Viglione, Donald J
2009-11-01
Using attachment theory, in this research, we explored the construct validity of the Rorschach (Exner, 1974) Texture (T) response as a measure of interpersonal closeness and contact. A total of 40 men and 39 women completed the Rorschach and 2 attachment inventories. Their romantic partners also completed an informant version of the attachment measures. Attachment styles were measured by factor scores involving both self-report and partner report. Results indicate that attachment theory, as a broad conceptual framework, is associated with T. Specifically, T = 1 is most closely associated with a secure attachment style, T > 1 with aspects of the preoccupied style, and T = 0 with aspects of the avoidant style and an absence of secure attachment. Needs for closeness and contact associated with T can be couched within an adult attachment theory, but in this study, we did not test for problematic aspects of insecure attachment. Gender is a complicating factor and deserves more study.
[Construct validity of a Portuguese version of the Female Sexual Function Index].
Pacagnella, Rodolfo de Carvalho; Martinez, Edson Zangiacomi; Vieira, Elisabeth Meloni
2009-11-01
This study aimed to evaluate the construct validity of a version of the Female Sexual Function Index (FSFI). The version was inserted into a multidimensional questionnaire and applied face-to-face to 235 sterilized women. Non-response rate was 1.7%. The sample proved to be highly homogeneous, with low income and education. Cronbach's alpha of 0.948 indicated high internal consistency. Factor analysis showed that the instrument was measuring four factors: desire/arousal, lubrication, orgasm/satisfaction, and pain. This may reflect characteristics of the instrument itself, female sexual response, or even the sample's cultural peculiarities, which can affect understanding of the questions or concepts with which the instrument deals. This study emphasizes the need for further research in the general population to determine the psychometric properties of the FSFI, such as its factor composition and definition of appropriate cut-off points for the Brazilian population.
Sung, Vivian W.; Griffith, James W.; Rogers, Rebecca G.; Raker, Christina A.; Clark, Melissa A.
2016-01-01
Purpose Current patient-reported outcomes for female urinary incontinence (UI) are limited by their inability to be tailored. Our objective is to describe the development and field-testing of 7 item banks designed to measure domains identified as important UI in females (UIf). We also describe the calibration and validation properties of the UIf-item banks, which allow for more efficient computerized-adaptive testing (CAT) in the future. METHODS The UIf-measures included 168 items covering 7 domains: Stress UI (SUI), Overactive Bladder (OAB), Urinary Frequency, Physical, Social and Emotional Health Impact, and Adaptation. Items underwent rigorous qualitative development and psychometric testing across 2 sites. Items were calibrated using item response theory and evaluated for internal consistency, construct validity and responsiveness. RESULTS 750 women (249 SUI, 249 OAB, and 252 mixed UI) participated. Mean age was 55±14 years ,23% were Hispanic, 80% white. In addition to face and content validity, the measures demonstrated good internal consistency (coefficient alpha 0.92-0.98) and unidimensionality. There was evidence for construct validity with moderate to strong correlations with the UDI (r’s ≥ 0.6) and IIQ (r’s = ≥ 0.6) scales. The measures were responsive to change for SUI treatment (paired t-test p <.001, ES range=1.3 to 2.9; SRM range=1.3 to 2.5) and OAB treatment (paired t-test p <.05 for all domains except Social Health Impact and Adaptation, ES range=.3 to 1.5, SRM range=0.4 to 1.0). The measures were responsive based on concurrent changes with the UDI and IIQ (p < 0.05). CAT versions were developed and pilot tested. CONCLUSIONS The UIf-item banks demonstrate good psychometric characteristics and are a sufficiently valid set of customizable tools for measuring UI symptoms and life impact. PMID:26732514
ERIC Educational Resources Information Center
Edwards, Patrick W.; Donaldson, Mary Ann
1989-01-01
A study of the construction and factor validity of the Response to Child Incest Questionnaire, a self-report instrument for assessing commonly reported symptoms of adult survivors of incest, is reported. The instrument's usefulness as a pre- and post-treatment measure and further research needs are discussed. (MSE)
Eye-Tracking as a Measure of Responsiveness to Joint Attention in Infants at Risk for Autism
ERIC Educational Resources Information Center
Navab, Anahita; Gillespie-Lynch, Kristen; Johnson, Scott P.; Sigman, Marian; Hutman, Ted
2012-01-01
Reduced responsiveness to joint attention (RJA), as assessed by the Early Social Communication Scales (ESCS), is predictive of both subsequent language difficulties and autism diagnosis. Eye-tracking measurement of RJA is a promising prognostic tool because it is highly precise and standardized. However, the construct validity of eye-tracking…
Drake, David; Kennedy, Rodney; Wallace, Eric
2017-12-01
Researchers and practitioners working in sports medicine and science require valid tests to determine the effectiveness of interventions and enhance understanding of mechanisms underpinning adaptation. Such decision making is influenced by the supportive evidence describing the validity of tests within current research. The objective of this study is to review the validity of lower body isometric multi-joint tests ability to assess muscular strength and determine the current level of supporting evidence. Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines were followed in a systematic fashion to search, assess and synthesize existing literature on this topic. Electronic databases such as Web of Science, CINAHL and PubMed were searched up to 18 March 2015. Potential inclusions were screened against eligibility criteria relating to types of test, measurement instrument, properties of validity assessed and population group and were required to be published in English. The Consensus-based Standards for the Selection of health Measurement Instruments (COSMIN) checklist was used to assess methodological quality and measurement property rating of included studies. Studies rated as fair or better in methodological quality were included in the best evidence synthesis. Fifty-nine studies met the eligibility criteria for quality appraisal. The ten studies that rated fair or better in methodological quality were included in the best evidence synthesis. The most frequently investigated lower body isometric multi-joint tests for validity were the isometric mid-thigh pull and isometric squat. The validity of each of these tests was strong in terms of reliability and construct validity. The evidence for responsiveness of tests was found to be moderate for the isometric squat test and unknown for the isometric mid-thigh pull. No tests using the isometric leg press met the criteria for inclusion in the best evidence synthesis. Researchers and practitioners can use the isometric squat and isometric mid-thigh pull with confidence in terms of reliability and construct validity. Further work to investigate other validity components such as criterion validity, smallest detectable change and responsiveness to resistance exercise interventions may be beneficial to the current level of evidence.
Cross-cultural adaptation of the German version of the spinal stenosis measure.
Wertli, Maria M; Steurer, Johann; Wildi, Lukas M; Held, Ulrike
2014-06-01
To validate the German version of the spinal stenosis measure (SSM), a disease-specific questionnaire assessing symptom severity, physical function, and satisfaction with treatment in patients with lumbar spinal stenosis. After translation, cross-cultural adaptation, and pilot testing, we assessed internal consistency, test-retest reliability, construct validity, and responsiveness of the SSM subscales. Data from a large Swiss multi-center prospective cohort study were used. Reference scales for the assessment of construct validity and responsiveness were the numeric rating scale, pain thermometer, and the Roland Morris Disability Questionnaire. One hundred and eight consecutive patients were included in this validation study, recruited from five different centers. Cronbach's alpha was above 0.8 for all three subscales of the SSM. The objectivity of the SSM was assessed using a partial credit approach. The model showed a good global fit to the data. Of the 108 patients 78 participated in the test-retest procedure. The ICC values were above 0.8 for all three subscales of the SSM. Correlations with reference scales were above 0.7 for the symptom and function subscales. For satisfaction subscale, it was 0.66 or above. Clinically meaningful changes of the reference scales over time were associated with significantly more improvement in all three SSM subscales (p < 0.001). Conclusion: The proposed version of the SSM showed very good measurement properties and can be considered validated for use in the German language.
Cross-cultural adaptation and validation of the Turkish version of Oxford hip score.
Tuğay, Baki Umut; Tuğay, Nazan; Güney, Hande; Hazar, Zeynep; Yüksel, İnci; Atilla, Bülent
2015-06-01
The purpose of this study was to translate the Oxford hip score (OHS) into Turkish and to evaluate the psychometric properties by testing the internal consistency, reproducibility, construct validity, and responsiveness in patients with hip osteoarthritis (OA). Oxford hip score was translated and culturally adapted according to the guidelines in the literature. Seventy patients (mean age 61.45 ± 9.29 years) with hip osteoarthritis participated in the study. Patients completed the Turkish Oxford hip score (OHS-TR), the Short-Form 36 (SF-36), and Western Ontario and McMaster Universities Index (WOMAC). Internal consistency was tested using Cronbach's α coefficient. Patients completed OHS-TR questionnaire twice in 7 days for determining the reproducibility. Correlation between the total results of both tests was determined by the Pearson correlation coefficient and intraclass correlation coefficient (ICC). Validity was assessed by calculating the Pearson correlation coefficient between the OHS-TR and WOMAC and SF-36 scores. Floor and ceiling effects were analyzed. The internal consistency was high (Cronbach's α 0.93). The construct validity showed a significant correlation between the OHS-TR and WOMAC and related SF-36 domains (p < 0.001). The ICC's ranged between 0.80 and 0.99. There was no floor or ceiling effect in total OHS-TR score. The OHS-TR questionnaire is valid, reliable, and responsive for the Turkish-speaking patients with hip OA.
Hashimoto, Hideki; Sase, Takeshi; Arai, Yasuhisa; Maruyama, Toru; Isobe, Keijirou; Shouno, Yasuhiro
2007-02-15
A cross-sectional observational study to determine the response distribution, internal consistency, and construct, concurrent, and discriminative validities of The Scoliosis Research Society-22 (SRS-22) Patient Questionnaire translated into Japanese as compared with the other language versions. To validate the Japanese version of SRS22. The SRS-22 was translated into several languages but yet not into Japanese. The Japanese SRS-22 and Medical Outcomes Study Short Form 36 were simultaneously administered to 114 adolescent idiopathic scoliosis patients. Exploratory factor analysis revealed a 4-factor structure, though several items were not loaded as theoretically expected. The originally constructed Japanese SRS-22 subscales and the English version showed similar response distribution. Internal consistency was fair but lower than that of the English version. The concurrent validity of the translated version, except for the self-image subscale, was supported using Medical Outcomes Study Short Form 36 subscales as a reference. The function scale differed significantly by curve angle magnitude and treatment status. The self-image score was the highest in patients under observation when curve angle was < 40 degrees, while postsurgical patients marked the highest scores when the angle > or = 40 degrees, respectively. The Japanese SRS-22 is valid and may be useful for clinical evaluation of Japanese scoliosis patients, though the self-image subscale may need further assessment.
Validity and cultural equivalence of the standard Greene Climacteric Scale in Hong Kong.
Chen, Run Qiu; Davis, Susan R; Wong, Chit Ming; Lam, Tai Hing
2010-01-01
The aim of this study was to translate the standard Greene Climacteric Scale (GCS) and a urogenital symptom scale into colloquial Chinese (Hong Kong) and test their validity and reliability in Hong Kong Chinese women. The scales were translated with standard techniques, and cross-cultural construct validity, internal consistency, test-retest reliability, and responsiveness were tested on samples of women aged 40 to 60 years recruited from the community. A total of 611 women, with mean (SD) age of 48.9 (5.3) years, provided completed scales for the study. Confirmatory factor analysis demonstrated construct validity of the translated standard GCS. The items were found to have good homogeneity in measuring the scale concepts (Cronbach alpha > 0.7). But the three-item urogenital scale had poor internal consistency (Cronbach alpha = 0.43), and a combination of this scale with the standard GCS resulted in a reduced model fit to the data. Test-retest reliability for the GCS was good on women recruited for a retest (n = 52). The translated GCS was found to be responsive to change over time (effect size, 0.59; n = 19). The Chinese (Hong Kong) version of the standard GCS is a valid and cultural-equivalent instrument. Our data do not support inclusion of the urogenital scale to the standard GCS. Measurement of urogenital symptoms is subject to further study.
Cross-cultural adaptation and validation of Persian Achilles tendon Total Rupture Score.
Ansari, Noureddin Nakhostin; Naghdi, Soofia; Hasanvand, Sahar; Fakhari, Zahra; Kordi, Ramin; Nilsson-Helander, Katarina
2016-04-01
To cross-culturally adapt the Achilles tendon Total Rupture Score (ATRS) to Persian language and to preliminary evaluate the reliability and validity of a Persian ATRS. A cross-sectional and prospective cohort study was conducted to translate and cross-culturally adapt the ATRS to Persian language (ATRS-Persian) following steps described in guidelines. Thirty patients with total Achilles tendon rupture and 30 healthy subjects participated in this study. Psychometric properties of floor/ceiling effects (responsiveness), internal consistency reliability, test-retest reliability, standard error of measurement (SEM), smallest detectable change (SDC), construct validity, and discriminant validity were tested. Factor analysis was performed to determine the ATRS-Persian structure. There were no floor or ceiling effects that indicate the content and responsiveness of ATRS-Persian. Internal consistency was high (Cronbach's α 0.95). Item-total correlations exceeded acceptable standard of 0.3 for the all items (0.58-0.95). The test-retest reliability was excellent [(ICC)agreement 0.98]. SEM and SDC were 3.57 and 9.9, respectively. Construct validity was supported by a significant correlation between the ATRS-Persian total score and the Persian Foot and Ankle Outcome Score (PFAOS) total score and PFAOS subscales (r = 0.55-0.83). The ATRS-Persian significantly discriminated between patients and healthy subjects. Explanatory factor analysis revealed 1 component. The ATRS was cross-culturally adapted to Persian and demonstrated to be a reliable and valid instrument to measure functional outcomes in Persian patients with Achilles tendon rupture. II.
Singer, Sara; Meterko, Mark; Baker, Laurence; Gaba, David; Falwell, Alyson; Rosen, Amy
2007-10-01
To describe the development of an instrument for assessing workforce perceptions of hospital safety culture and to assess its reliability and validity. Primary data collected between March 2004 and May 2005. Personnel from 105 U.S. hospitals completed a 38-item paper and pencil survey. We received 21,496 completed questionnaires, representing a 51 percent response rate. Based on review of existing safety climate surveys, we developed a list of key topics pertinent to maintaining a culture of safety in high-reliability organizations. We developed a draft questionnaire to address these topics and pilot tested it in four preliminary studies of hospital personnel. We modified the questionnaire based on experience and respondent feedback, and distributed the revised version to 42,249 hospital workers. We randomly divided respondents into derivation and validation samples. We applied exploratory factor analysis to responses in the derivation sample. We used those results to create scales in the validation sample, which we subjected to multitrait analysis (MTA). We identified nine constructs, three organizational factors, two unit factors, three individual factors, and one additional factor. Constructs demonstrated substantial convergent and discriminant validity in the MTA. Cronbach's alpha coefficients ranged from 0.50 to 0.89. It is possible to measure key salient features of hospital safety climate using a valid and reliable 38-item survey and appropriate hospital sample sizes. This instrument may be used in further studies to better understand the impact of safety climate on patient safety outcomes.
Burgers, Paul T P W; Poolman, Rudolf W; Van Bakel, Theodorus M J; Tuinebreijer, Wim E; Zielinski, Stephanie M; Bhandari, Mohit; Patka, Peter; Van Lieshout, Esther M M
2015-05-06
The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) has been extensively evaluated in groups of patients with osteoarthritis, yet not in patients with a femoral neck fracture. This study aimed to determine the reliability, construct validity, and responsiveness of the WOMAC compared with the Short Form-12 (SF-12) and the EuroQol 5D (EQ-5D) questionnaires for the assessment of elderly patients with a femoral neck fracture. Reliability was tested by assessing the Cronbach alpha. Construct validity was determined with the Pearson correlation coefficient. Change scores were calculated from ten weeks to twelve months of follow-up. Standardized response means and floor and ceiling effects were determined. Analyses were performed to compare the results for patients less than eighty years old with those for patients eighty years of age or older. The mean WOMAC total score was 89 points before the fracture in the younger patients and increased from 70 points at ten weeks to 81 points at two years postoperatively. In the older age group, these scores were 86, 75, and 78 points. The mean WOMAC pain scores before the fracture and at ten weeks and two years postoperatively were 92, 76, and 87 points, respectively, in the younger age group and 92, 84, and 93 points in the older age group. Function scores were 89, 68, and 79 points for the younger age group and 84, 71, and 73 points for the older age group. The Cronbach alpha for pain, stiffness, function, and the total scale ranged from 0.83 to 0.98 for the younger age group and from 0.79 to 0.97 for the older age group. Construct validity was good, with 82% and 79% of predefined hypotheses confirmed in the younger and older age groups, respectively. Responsiveness was moderate. No floor effects were found. Moderate to large ceiling effects were found for pain and stiffness scales at ten weeks and twelve months in younger patients (18% to 36%) and in the older age group (38% to 53%). The WOMAC showed good reliability, construct validity, and responsiveness in both age groups of elderly patients with a femoral neck fracture who had been physically and mentally fit before the fracture. The instrument is suitable for use in future clinical studies in these populations. The results are based on two clinical trials. The questionnaires used concern pure, clinically relevant issues (ability to walk, climb stairs, etc.). Moreover, the results can be used for future research comparing clinical outcomes (or treatments) for populations with a femoral neck fracture. Copyright © 2015 by The Journal of Bone and Joint Surgery, Incorporated.
O'Sullivan, Elizabeth J; Rasmussen, Kathleen M
2017-12-01
The breastfeeding surveillance tool in the United States, the National Immunization Survey, considers the maternal-infant dyad to be breastfeeding for as long as the infant consumes human milk (HM). However, many infants consume at least some HM from a bottle, which can lead to health outcomes different from those for at-the-breast feeding. Our aim was to develop a construct-valid questionnaire that categorizes infants by nutrition source, that is, own mother's HM, another mother's HM, infant formula, or other and feeding mode, that is, at the breast or from a bottle, and test the reliability of this questionnaire. The Questionnaire on Infant Feeding was developed through a literature review and modified based on qualitative research. Construct validity was assessed through cognitive interviews and a test-retest reliability study was conducted among mothers who completed the questionnaire twice, 1 month apart. Cognitive interviews were conducted with ten mothers from upstate New York between September and December 2014. A test-retest reliability study was conducted among 44 mothers from across the United States between March and May 2015. Equivalence of questions with continuous responses about the timing of starting and stopping various behaviors and the agreement between responses to questions with categorical responses on the two questionnaires completed 1 month apart. Reliability was assessed using paired-equivalence tests for questions about the timing of starting and stopping behaviors and weighted Cohen's κ for questions about the frequency and intensity of behaviors. Reliability of the Questionnaire on Infant Feeding was moderately high among mothers of infants aged 19 to 35 months, with most questions about the timing of starting and stopping behaviors equivalent to within 1 month. Weighted Cohen's κ for categorical questions indicated substantial agreement. The Questionnaire on Infant Feeding is a construct-valid tool to measure duration, intensity, and mode of infant HM consumption and duration of maternal HM production that is reliable within 19 to 35 months postpartum. Criterion-validity testing of these questions will improve the utility of the Questionnaire on Infant Feeding as a surveillance tool. Copyright © 2017 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
Validation of source approval of HMA surface mix aggregate : research summary.
DOT National Transportation Integrated Search
2016-04-01
Pavement surfaces must maintain an adequate level of friction in order to provide a : safe surface for vehicles. The Maryland State Highway Administration (SHA) is : responsible for ensuring that flexible pavement construction, using hot mix asphalt ...
Construct Validity: Advances in Theory and Methodology
Strauss, Milton E.; Smith, Gregory T.
2008-01-01
Measures of psychological constructs are validated by testing whether they relate to measures of other constructs as specified by theory. Each test of relations between measures reflects on the validity of both the measures and the theory driving the test. Construct validation concerns the simultaneous process of measure and theory validation. In this chapter, we review the recent history of validation efforts in clinical psychological science that has led to this perspective, and we review five recent advances in validation theory and methodology of importance for clinical researchers. These are: the emergence of nonjustificationist philosophy of science; an increasing appreciation for theory and the need for informative tests of construct validity; valid construct representation in experimental psychopathology; the need to avoid representing multidimensional constructs with a single score; and the emergence of effective new statistical tools for the evaluation of convergent and discriminant validity. PMID:19086835
Van de Velde, Dominique; Coorevits, Pascal; Sabbe, Lode; De Baets, Stijn; Bracke, Piet; Van Hove, Geert; Josephsson, Staffan; Ilsbroukx, Stephan; Vanderstraeten, Guy
2017-03-01
To examine the internal consistency, test-retest reliability, construct validity, discriminant validity and responsiveness of the Ghent Participation Scale. Cross-sectional study with a test-retest sample. Six outpatient rehabilitation centres in Belgium. A total of 365 outpatients from eight diagnostic groups. The Ghent Participation Scale, the Impact on Participation and Autonomy, the Utrecht Scale for Evaluation of Rehabilitation-Participation and the Medical outcome study Short Form SF-36. The Ghent Participation Scale was found to have good internal consistency (Cronbach's α between 0.75 and 0.83). At item level, the test-retest reliability was good; weighted kappas ranged between 0.57 and 0.88. On the dimension level intraclass correlation coefficients ranged between 0.80 and 0.90. Evidence for construct validity came from high correlations between the subscales of the Ghent Participation Scale and four subscales of the Impact on Participation and Autonomy (range, r = -0.71 to -0.87) and two subscales of the Utrecht Scale for Evaluation of Rehabilitation-Participation (range, r = 0.54 to 0.72). Standardized response mean ranged between 0.23 and 0.68 and the area under the curve ranged between 68% and 88%. The Ghent Participation Scale appears to be a valid and reliable method of assessing participation irrespective of the respondent's health condition. The Ghent Participation Scale is responsive and is able to detect changes over time.
Tong, W W; Wang, W; Xu, W D
2016-08-15
The Western Ontario Meniscal Evaluation Tool (WOMET) is a questionnaire designed to evaluate the health-related quality of life (HRQOL) of patients with meniscal pathology. Our study aims to culturally adapt and validate the WOMET into a Chinese version. We translated the WOMET into Chinese. Then, a total of 121 patients with meniscal pathology were invited to participate in this study. To assess the test-retest reliability, the Chinese version WOMET was completed twice at 7-day intervals by the participants. The construct validity was assessed using Pearson's correlation coefficient or Spearman's correlation to test for correlations among the Chinese version WOMET and the eight domains of Short Form-36 (SF-36), the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), and the International Knee Documentation Committee (IKDC) score. Responsiveness was tested by comparison of the preoperative and postoperative scores of the Chinese version WOMET. The test-retest reliability of the overall scale and different domains were all found to be excellent. The Cronbach's α was 0.90. The Chinese version WOMET correlated well with other questionnaires which suggested good construct validity. We observed no ceiling and floor effects of the Chinese version WOMET. We also found good responsiveness for the effect size, and the standardized response mean values were 0.86 and 1.11. The Chinese version of the WOMET appears to be reliable and valid in evaluating patients with meniscal pathology.
[Measurement properties of self-report questionnaires published in Korean nursing journals].
Lee, Eun-Hyun; Kim, Chun-Ja; Kim, Eun Jung; Chae, Hyun-Ju; Cho, Soo-Yeon
2013-02-01
The purpose of this study was to evaluate measurement properties of self-report questionnaires for studies published in Korean nursing journals. Of 424 Korean nursing articles initially identified, 168 articles met the inclusion criteria. The methodological quality of the measurements used in the studies and interpretability were assessed using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. It consists of items on internal consistency, reliability, measurement error, content validity, construct validity including structural validity, hypothesis testing, cross-cultural validity, and criterion validity, and responsiveness. For each item of the COSMIN checklist, measurement properties are rated on a four-point scale: excellent, good, fair, and poor. Each measurement property is scored with worst score counts. All articles used the classical test theory for measurement properties. Internal consistency (72.6%), construct validity (56.5%), and content validity (38.2%) were most frequently reported properties being rated as 'excellent' by COSMIN checklist, whereas other measurement properties were rarely reported. A systematic review of measurement properties including interpretability of most instruments warrants further research and nursing-focused checklists assessing measurement properties should be developed to facilitate intervention outcomes across Korean studies.
Hamamoto, Yosuke; Ito, Hiromu; Furu, Moritoshi; Ishikawa, Masahiro; Azukizawa, Masayuki; Kuriyama, Shinichi; Nakamura, Shinichiro; Matsuda, Shuichi
2015-09-01
The purposes of this study were to translate the new Knee Society Score (KSS) into Japanese and to evaluate the construct and content validity, test-retest reliability, and internal consistency of the Japanese version of the new KSS. The Japanese version of the KSS was developed according to cross-cultural guidelines by using the "translation-back translation" method to ensure content validity. KSS data were then obtained from patients who had undergone total knee arthroplasty (TKA). The psychometric properties evaluated were as follows: for feasibility, response rate, and floor and ceiling effects; for construct validity, internal consistency using Cronbach's alpha, and correlations with quality of life. Construct validity was evaluated by using Spearman's correlation coefficient to quantify the correlation between the KSS and the Japanese version of the Oxford 12-item Knee Score or Short Form 36 Health Survey (SF-36) questionnaires. The Japanese version of the KSS was sent to 93 consecutive osteoarthritic patients who underwent primary TKA in our institution. Fifty-five patients completed the questionnaires and were included in this study. Neither a floor nor ceiling effect was observed. The reliability proved excellent in the majority of domains, with intraclass correlation coefficients of 0.65-0.88. Internal consistency, assessed by Cronbach's alpha, was good to excellent for all domains (0.78-0.94). All of the four domains of the KSS correlated significantly with the Oxford 12-item Knee Score. The activity and satisfaction domains of the KSS correlated significantly with all and the majority of subscales of the SF-36, respectively, whereas symptoms and expectation domains showed significant correlations only with bodily pain and vitality subscales and with the physical function, bodily pain, and vitality subscales, respectively. The Japanese version of the new KSS is a valid, reliable, and responsive instrument to capture subjective aspects of the functional symptoms and abilities of patients who undergo TKA.
Haugum, Mona; Iversen, Hilde Hestad; Bjertnaes, Oyvind; Lindahl, Anne Karin
2017-02-20
Patient experiences are an important aspect of health care quality, but there is a lack of validated instruments for their measurement in the substance dependence literature. A new questionnaire to measure inpatients' experiences of interdisciplinary treatment for substance dependence has been developed in Norway. The aim of this study was to psychometrically test the new questionnaire, using data from a national survey in 2013. The questionnaire was developed based on a literature review, qualitative interviews with patients, expert group discussions and pretesting. Data were collected in a national survey covering all residential facilities with inpatients in treatment for substance dependence in 2013. Data quality and psychometric properties were assessed, including ceiling effects, item missing, exploratory factor analysis, and tests of internal consistency reliability, test-retest reliability and construct validity. The sample included 978 inpatients present at 98 residential institutions. After correcting for excluded patients (n = 175), the response rate was 91.4%. 28 out of 33 items had less than 20.5% of missing data or replies in the "not applicable" category. All but one item met the ceiling effect criterion of less than 50.0% of the responses in the most favorable category. Exploratory factor analysis resulted in three scales: "treatment and personnel", "milieu" and "outcome". All scales showed satisfactory internal consistency reliability (Cronbach's alpha ranged from 0.75-0.91) and test-retest reliability (ICC ranged from 0.82-0.85). 17 of 18 significant associations between single variables and the scales supported construct validity of the PEQ-ITSD. The content validity of the PEQ-ITSD was secured by a literature review, consultations with an expert group and qualitative interviews with patients. The PEQ-ITSD was used in a national survey in Norway in 2013 and psychometric testing showed that the instrument had satisfactory internal consistency reliability and construct validity.
Neijenhuijs, Koen I; Jansen, Femke; Aaronson, Neil K; Brédart, Anne; Groenvold, Mogens; Holzner, Bernhard; Terwee, Caroline B; Cuijpers, Pim; Verdonck-de Leeuw, Irma M
2018-05-07
The EORTC IN-PATSAT32 is a patient-reported outcome measure (PROM) to assess cancer patients' satisfaction with in-patient health care. The aim of this study was to investigate whether the initial good measurement properties of the IN-PATSAT32 are confirmed in new studies. Within the scope of a larger systematic review study (Prospero ID 42017057237), a systematic search was performed of Embase, Medline, PsycINFO, and Web of Science for studies that investigated measurement properties of the IN-PATSAT32 up to July 2017. Study quality was assessed, data were extracted, and synthesized according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) methodology. Nine studies were included in this review. The evidence on reliability and construct validity were rated as sufficient and of the quality of the evidence as moderate. The evidence on structural validity was rated as insufficient and of low quality. The evidence on internal consistency was indeterminate. Measurement error, responsiveness, criterion validity, and cross-cultural validity were not reported in the included studies. Measurement error could be calculated for two studies and was judged indeterminate. In summary, the IN-PATSAT32 performs as expected with respect to reliability and construct validity. No firm conclusions can be made yet whether the IN-PATSAT32 also performs as well with respect to structural validity and internal consistency. Further research on these measurement properties of the PROM is therefore needed as well as on measurement error, responsiveness, criterion validity, and cross-cultural validity. For future studies, it is recommended to take the COSMIN methodology into account.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reidunsdatter, Randi J.; Lund, Jo-Asmund; Department of Oncology, St. Olavs University Hospital, Trondheim
Purpose: Several treatment options are available for patients with prostate cancer. Applicable and valid self-assessment instruments for assessing health-related quality of life (HRQOL) are of paramount importance. The aim of this study was to explore the validity and responsiveness of the intestinal part of the prostate cancer-specific questionnaire QUFW94. Methods and Materials: The content of the intestinal part of QUFW94 was examined by evaluation of experienced clinicians and reviewing the literature. The psychometric properties and responsiveness were assessed by analyzing HRQOL data from the randomized study Scandinavian Prostate Cancer Group 7 (SPCG)/Swedish Association for Urological Oncology 3 (SFUO). Subscales weremore » constructed by means of exploratory factor analyses. Internal consistency was assessed by Cronbach's alpha. Responsiveness was investigated by comparing baseline scores with the 4-year posttreatment follow-up. Results: The content validity was found acceptable, but some amendments were proposed. The factor analyses revealed two symptom scales. The first scale comprised five items regarding general stool problems, frequency, incontinence, need to plan toilet visits, and daily activity. Cronbach's alpha at 0.83 indicated acceptable homogeneity. The second scale was less consistent with a Cronbach's alpha at 0.55. The overall responsiveness was found to be very satisfactory. Conclusion: Two scales were identified in the bowel dimension of the QUFW94; the first one had good internal consistency. The responsiveness was excellent, and some modifications are suggested to strengthen the content validity.« less
Saunders, Ruth P.; McIver, Kerry L.; Dowda, Marsha; Pate, Russell R.
2013-01-01
Objective Scales used to measure selected social-cognitive beliefs and motives for physical activity were tested among boys and girls. Methods Covariance modeling was applied to responses obtained from large multi-ethnic samples of students in the fifth and sixth grades. Results Theoretically and statistically sound models were developed, supporting the factorial validity of the scales in all groups. Multi-group longitudinal invariance was confirmed between boys and girls, overweight and normal weight students, and non-Hispanic black and white children. The construct validity of the scales was supported by hypothesized convergent and discriminant relationships within a measurement model that included correlations with physical activity (MET • min/day) measured by an accelerometer. Conclusions Scores from the scales provide valid assessments of selected beliefs and motives that are putative mediators of change in physical activity among boys and girls, as they begin the understudied transition from the fifth grade into middle school, when physical activity naturally declines. PMID:23459310
Dishman, Rod K; Saunders, Ruth P; McIver, Kerry L; Dowda, Marsha; Pate, Russell R
2013-06-01
Scales used to measure selected social-cognitive beliefs and motives for physical activity were tested among boys and girls. Covariance modeling was applied to responses obtained from large multi-ethnic samples of students in the fifth and sixth grades. Theoretically and statistically sound models were developed, supporting the factorial validity of the scales in all groups. Multi-group longitudinal invariance was confirmed between boys and girls, overweight and normal weight students, and non-Hispanic black and white children. The construct validity of the scales was supported by hypothesized convergent and discriminant relationships within a measurement model that included correlations with physical activity (MET • min/day) measured by an accelerometer. Scores from the scales provide valid assessments of selected beliefs and motives that are putative mediators of change in physical activity among boys and girls, as they begin the understudied transition from the fifth grade into middle school, when physical activity naturally declines.
The Fear of Positive Evaluation Scale: assessing a proposed cognitive component of social anxiety.
Weeks, Justin W; Heimberg, Richard G; Rodebaugh, Thomas L
2008-01-01
Cognitive-behavioral models propose that fear of negative evaluation is the core feature of social anxiety disorder. However, it may be that fear of evaluation in general is important in social anxiety, including fears of positive as well as negative evaluation. To test this hypothesis, we developed the Fear of Positive Evaluation Scale (FPES) and conducted analyses to examine the psychometric properties of the FPES, as well as test hypotheses regarding the construct of fear of positive evaluation (FPE). Responses from a large (n = 1711) undergraduate sample were utilized. The reliability, construct validity, and factorial validity of the FPES were examined; the distinction of FPE from fear of negative evaluation was evaluated utilizing confirmatory factor analysis; and the ability of FPE to predict social interaction anxiety above and beyond fear of negative evaluation was assessed. Results provide preliminary support for the psychometric properties of the FPES and the validity of the construct of FPE. The implications of FPE with respect to the study and treatment of social anxiety disorder are discussed.
Factors affecting the implementation of green specifications in construction.
Lam, Patrick T I; Chan, Edwin H W; Poon, C S; Chau, C K; Chun, K P
2010-01-01
Green specifications constitute one of the important elements in green construction. New sustainability requirements and changing priorities in construction management have spurred the emerging green specifications to a faster pace of development. A cross-sectional survey has been conducted in Hong Kong in 2007 to identify principal factors leading to the success of preparing green specifications. Based on extensive construction management literature, 20 variables concerning sustainable construction were summarized. Using the Mann-Whitney U-test, the subtle differences between stakeholders in specifying construction work have been detected even with the high consistency of the responses among the groups. Moreover, five independent factors for successful specification of green construction have been categorized by factor analysis. They are related to (1) green technology and techniques, (2) reliability and quality of specification, (3) leadership and responsibility, (4) stakeholder involvement, and (5) guide and benchmarking systems. Whilst the first and fourth factors are generally more important, different stakeholder groups have different emphases. The results of the survey have been validated against established principles. 2009 Elsevier Ltd. All rights reserved.
Lohse, Barbara; Satter, Ellyn; Arnold, Kristen
2014-04-01
Accurate early assessment and targeted intervention with problematic parent/child feeding dynamics is critical for the prevention and treatment of child obesity. The division of responsibility in feeding (sDOR), articulated by the Satter Feeding Dynamics Model (fdSatter), has been demonstrated clinically as an effective approach to reduce child feeding problems, including those leading to obesity. Lack of a tested instrument to examine adherence to fdSatter stimulated initial construction of the Satter Feeding Dynamics Inventory (fdSI). The aim of this project was to refine the item pool to establish translational validity, making the fdSI suitable for advanced psychometric analysis. Cognitive interviews (n = 80) with caregivers of varied socioeconomic strata informed revisions that demonstrated face and content validity. fdSI responses were mapped to interviews using an iterative, multi-phase thematic approach to provide an instrument ready for construct validation. fdSI development required five interview phases over 32 months: Foundational; Refinement; Transitional; Assurance; and Launching. Each phase was associated with item reduction and revision. Thirteen items were removed from the 38-item Foundational phase and seven were revised in the Refinement phase. Revisions, deletions, and additions prompted by Transitional and Assurance phase interviews resulted in the 15-item Launching phase fdSI. Only one Foundational phase item was carried through all development phases, emphasizing the need to test for item comprehension and interpretation before psychometric analyses. Psychometric studies of item pools without encrypted meanings will facilitate progress toward a tool that accurately detects adherence to sDOR. Ability to measure sDOR will facilitate focus on feeding behaviors associated with reduced risk of childhood obesity.
Psychometric evaluation of the Swedish version of Rosenberg's self-esteem scale.
Eklund, Mona; Bäckström, Martin; Hansson, Lars
2018-04-01
The widely used Rosenberg's self-esteem scale (RSES) has not been evaluated for psychometric properties in Sweden. This study aimed at analyzing its factor structure, internal consistency, criterion, convergent and discriminant validity, sensitivity to change, and whether a four-graded Likert-type response scale increased its reliability and validity compared to a yes/no response scale. People with mental illness participating in intervention studies to (1) promote everyday life balance (N = 223) or (2) remedy self-stigma (N = 103) were included. Both samples completed the RSES and questionnaires addressing quality of life and sociodemographic data. Sample 1 also completed instruments chosen to assess convergent and discriminant validity: self-mastery (convergent validity), level of functioning and occupational engagement (discriminant validity). Confirmatory factor analysis (CFA), structural equation modeling, and conventional inferential statistics were used. Based on both samples, the Swedish RSES formed one factor and exhibited high internal consistency (>0.90). The two response scales were equivalent. Criterion validity in relation to quality of life was demonstrated. RSES could distinguish between women and men (women scoring lower) and between diagnostic groups (people with depression scoring lower). Correlations >0.5 with variables chosen to reflect convergent validity and around 0.2 with variables used to address discriminant validity further highlighted the construct validity of RSES. The instrument also showed sensitivity to change. The Swedish RSES exhibited a one-component factor structure and showed good psychometric properties in terms of good internal consistency, criterion, convergent and discriminant validity, and sensitivity to change. The yes/no and the four-graded Likert-type response scales worked equivalently.
Kimball, A B; Naegeli, A N; Edson-Heredia, E; Lin, C-Y; Gaich, C; Nikaï, E; Wyrwich, K; Yosipovitch, G
2016-07-01
Itching is a profoundly distressing symptom for many patients with psoriasis, but it has not been rigorously studied using validated tools for this condition. This study investigated the psychometric properties of the Itch Numeric Rating Scale (Itch NRS), a single-item patient-reported outcome (PRO) measuring the worst itching severity due to psoriasis in the past 24 h. Using disease-specific clinician-rated and PRO data from one phase II and three phase III randomized clinical studies of subjects with moderate-to-severe plaque psoriasis, the Itch NRS was evaluated for test-retest reliability, construct validity and responsiveness. A responder definition was explored using anchor- and distribution-based methods. Test-retest reliability analyses supported the reproducibility of the measure (intraclass correlation coefficient range 0·71-0·74). To support the construct validity of the Itch NRS, large cross-sectional correlations with the Dermatology Life Quality Index (DLQI) Symptoms and Feelings domain (r ≥ 0·60 at baseline and r ≥ 0·80 at week 12) supported a priori hypotheses, while large correlations (r ≥ 0·71) between changes in Itch NRS scores and changes in DLQI Symptoms and Feelings domain scores from baseline to week 12 established responsiveness. A 4-point change was optimal for demonstrating a level of clinically meaningful improvement in itch severity after 12 weeks of treatment, which corresponds with marked clinical improvements in plaque psoriasis. The Itch NRS demonstrated sufficient reliability, validity and responsiveness, and appropriate interpretation standards for evaluating change over time in itch severity among patients with moderate-to-severe plaque psoriasis when validated using clinical trial data for this condition. © 2016 The Authors. British Journal of Dermatology published by John Wiley & Sons Ltd on behalf of British Association of Dermatologists.
Flosadottir, Vala; Roos, Ewa M; Ageberg, Eva
2017-09-01
The Activity Rating Scale (ARS) for disorders of the knee evaluates the level of activity by the frequency of participation in 4 separate activities with high demands on knee function, with a score ranging from 0 (none) to 16 (pivoting activities 4 times/wk). To translate and cross-culturally adapt the ARS into Swedish and to assess measurement properties of the Swedish version of the ARS. Cohort study (diagnosis); Level of evidence, 2. The COSMIN guidelines were followed. Participants (N = 100 [55 women]; mean age, 27 years) who were undergoing rehabilitation for a knee injury completed the ARS twice for test-retest reliability. The Knee injury and Osteoarthritis Outcome Score (KOOS), Tegner Activity Scale (TAS), and modernized Saltin-Grimby Physical Activity Level Scale (SGPALS) were administered at baseline to validate the ARS. Construct validity and responsiveness of the ARS were evaluated by testing predefined hypotheses regarding correlations between the ARS, KOOS, TAS, and SGPALS. The Cronbach alpha, intraclass correlation coefficients, absolute reliability, standard error of measurement, smallest detectable change, and Spearman rank-order correlation coefficients were calculated. The ARS showed good internal consistency (α ≈ 0.96), good test-retest reliability (intraclass correlation coefficient >0.9), and no systematic bias between measurements. The standard error of measurement was less than 2 points, and the smallest detectable change was less than 1 point at the group level and less than 5 points at the individual level. More than 75% of the hypotheses were confirmed, indicating good construct validity and good responsiveness of the ARS. The Swedish version of the ARS is valid, reliable, and responsive for evaluating the level of activity based on the frequency of participation in high-demand knee sports activities in young adults with a knee injury.
Meikle, Mary B; Henry, James A; Griest, Susan E; Stewart, Barbara J; Abrams, Harvey B; McArdle, Rachel; Myers, Paula J; Newman, Craig W; Sandridge, Sharon; Turk, Dennis C; Folmer, Robert L; Frederick, Eric J; House, John W; Jacobson, Gary P; Kinney, Sam E; Martin, William H; Nagler, Stephen M; Reich, Gloria E; Searchfield, Grant; Sweetow, Robert; Vernon, Jack A
2012-01-01
Chronic subjective tinnitus is a prevalent condition that causes significant distress to millions of Americans. Effective tinnitus treatments are urgently needed, but evaluating them is hampered by the lack of standardized measures that are validated for both intake assessment and evaluation of treatment outcomes. This work was designed to develop a new self-report questionnaire, the Tinnitus Functional Index (TFI), that would have documented validity both for scaling the severity and negative impact of tinnitus for use in intake assessment and for measuring treatment-related changes in tinnitus (responsiveness) and that would provide comprehensive coverage of multiple tinnitus severity domains. To use preexisting knowledge concerning tinnitus-related problems, an Item Selection Panel (17 expert judges) surveyed the content (175 items) of nine widely used tinnitus questionnaires. From those items, the Panel identified 13 separate domains of tinnitus distress and selected 70 items most likely to be responsive to treatment effects. Eliminating redundant items while retaining good content validity and adding new items to achieve the recommended minimum of 3 to 4 items per domain yielded 43 items, which were then used for constructing TFI Prototype 1.Prototype 1 was tested at five clinics. The 326 participants included consecutive patients receiving tinnitus treatment who provided informed consent-constituting a convenience sample. Construct validity of Prototype 1 as an outcome measure was evaluated by measuring responsiveness of the overall scale and its individual items at 3 and 6 mo follow-up with 65 and 42 participants, respectively. Using a predetermined list of criteria, the 30 best-functioning items were selected for constructing TFI Prototype 2.Prototype 2 was tested at four clinics with 347 participants, including 155 and 86 who provided 3 and 6 mo follow-up data, respectively. Analyses were the same as for Prototype 1. Results were used to select the 25 best-functioning items for the final TFI. Both prototypes and the final TFI displayed strong measurement properties, with few missing data, high validity for scaling of tinnitus severity, and good reliability. All TFI versions exhibited the same eight factors characterizing tinnitus severity and negative impact. Responsiveness, evaluated by computing effect sizes for responses at follow-up, was satisfactory in all TFI versions.In the final TFI, Cronbach's alpha was 0.97 and test-retest reliability 0.78. Convergent validity (r = 0.86 with Tinnitus Handicap Inventory [THI]; r = 0.75 with Visual Analog Scale [VAS]) and discriminant validity (r = 0.56 with Beck Depression Inventory-Primary Care [BDI-PC]) were good. The final TFI was successful at detecting improvement from the initial clinic visit to 3 mo with moderate to large effect sizes and from initial to 6 mo with large effect sizes. Effect sizes for the TFI were generally larger than those obtained for the VAS and THI. After careful evaluation, a 13-point reduction was considered a preliminary criterion for meaningful reduction in TFI outcome scores. The TFI should be useful in both clinical and research settings because of its responsiveness to treatment-related change, validity for scaling the overall severity of tinnitus, and comprehensive coverage of multiple domains of tinnitus severity.
Construct Validity of the Nepalese School Leaving English Reading Test
ERIC Educational Resources Information Center
Dawadi, Saraswati; Shrestha, Prithvi N.
2018-01-01
There has been a steady interest in investigating the validity of language tests in the last decades. Despite numerous studies on construct validity in language testing, there are not many studies examining the construct validity of a reading test. This paper reports on a study that explored the construct validity of the English reading test in…
Almeida, Tatiana Magalhães de; Cola, Paula Cristina; Pernambuco, Leandro de Araújo; Magalhães, Hipólito Virgílio; Magnoni, Carlos Daniel; Silva, Roberta Gonçalves da
2017-08-17
The aim of the present study was to identify the evidence of validity based on the content and response process of the Rastreamento de Disfagia Orofaríngea no Acidente Vascular Encefálico (RADAVE; "Screening Tool for Oropharyngeal Dysphagia in Stroke"). The criteria used to elaborate the questions were based on a literature review. A group of judges consisting of 19 different health professionals evaluated the relevance and representativeness of the questions, and the results were analyzed using the Content Validity Index. In order to evidence validity based on the response processes, 23 health professionals administered the screening tool and analyzed the questions using a structured scale and cognitive interview. The RADAVE structured to be applied in two stages. The first version consisted of 18 questions in stage I and 11 questions in stage II. Eight questions in stage I and four in stage II did not reach the minimum Content Validity Index, requiring reformulation by the authors. The cognitive interview demonstrated some misconceptions. New adjustments were made and the final version was produced with 12 questions in stage I and six questions in stage II. It was possible to develop a screening tool for dysphagia in stroke with adequate evidence of validity based on content and response processes. Both validity evidences obtained so far allowed to adjust the screening tool in relation to its construct. The next studies will analyze the other evidences of validity and the measures of accuracy.
Klos, D S; Paddock, J R
1978-12-01
Three criteria for assessing relationship status were proposed: self-disclosure despite the risk of parental disapproval; openness to critical feedback from parents; constructive confrontation when angry with parents. These concepts were operationalized as narratives of nine interpersonal dilemmas, to which late adolescents responded by indicating "What would you do if you were in this situation?" Reliable example-anchored scales were constructed from the responses of one sample of college students and then cross-validated with two other samples. Social class had a significant but small effect on the relationship status scores; but age and sex of adolescent and sex of parent did not. The patterns of correlations of the Relationship Status Scales among themselves and with the Parent-Child Relations Questionnaire, the College Self-Expression Scale, the Fear of Negative Evaluation Scale, and Hogan's Empathy Scale were interpreted as evidence of construct validity.
Development of a scale to measure patients' trust in health insurers.
Zheng, Beiyao; Hall, Mark A; Dugan, Elizabeth; Kidd, Kristin E; Levine, Douglas
2002-02-01
To develop a scale to measure patients' trust in health insurers, including public and private insurers and both indemnity and managed care. A scale was developed based on our conceptual model of insurer trust. The scale was analyzed for its factor structure, internal consistency, construct validity, and other psychometric properties. The scale was developed and validated on a random national sample (n = 410) of subjects with any type of insurance and further validated and used in a regional random sample of members of an HMO in North Carolina (n = 1152). Factor analysis was used to uncover the underlying dimensions of the scale. Internal consistency was assessed by Cronbach's alpha. Construct validity was established by Pearson or Spearman correlations and t tests. Data were collected via telephone interviews. The 11-item scale has good internal consistency (alpha = 0.92/ 0.89) and response variability (range = 11-55, M = 36.5/37.0, SD = 7.8/7.0). Insurer trust is a unidimensional construct and is related to trust in physicians, satisfaction with care and with insurer, having enough choice in selecting health insurer, no prior disputes with health insurer, type of insurer, and desire to remain with insurer. Trust in health insurers can be validly and reliably measured. Additional studies are required to learn more about what factors affect insurer trust and whether differences and changes in insurer trust affect actual behaviors and other outcomes of interest.
Assessment of the psychometric properties of the Family Management Measure.
Knafl, Kathleen; Deatrick, Janet A; Gallo, Agatha; Dixon, Jane; Grey, Margaret; Knafl, George; O'Malley, Jean
2011-06-01
This paper reports development of the Family Management Measure (FaMM) of parental perceptions of family management of chronic conditions. By telephone interview, 579 parents of children age 3 to 19 with a chronic condition (349 partnered mothers, 165 partners, 65 single mothers) completed the FaMM and measures of child functional status and behavioral problems and family functioning. Analyses addressed reliability, factor structure, and construct validity. Exploratory factor analysis yielded six scales: Child's Daily Life, Condition Management Ability, Condition Management Effort, Family Life Difficulty, Parental Mutuality, and View of Condition Impact. Internal consistency reliability ranged from .72 to .91, and test-retest reliability from .71 to .94. Construct validity was supported by significant correlations in hypothesized directions between FaMM scales and established measures. Results support FaMM's; reliability and validity, indicating it performs in a theoretically meaningful way and taps distinct aspects of family response to childhood chronic conditions.
Kashikar-Zuck, Susmita; Carle, Adam; Barnett, Kimberly; Goldschneider, Kenneth R.; Sherry, David D.; Mara, Constance A.; Cunningham, Natoshia; Farrell, Jennifer; Tress, Jenna; DeWitt, Esi Morgan
2015-01-01
The Patient Reported Outcomes Measurement Information System (PROMIS) initiative is a comprehensive strategy by the National Institutes of Health to support the development and validation of precise instruments to assess self-reported health domains across healthy and disease-specific populations. Much progress has been made in instrument development but there remains a gap in the validation of PROMIS measures for pediatric chronic pain. The purpose of this study was to investigate the construct validity and responsiveness to change of seven PROMIS domains for the assessment of children (ages 8-18) with chronic pain – Pain Interference, Fatigue, Anxiety, Depression, Mobility, Upper Extremity Function and Peer Relationships. PROMIS measures were administered at the initial visit and two follow-up visits at an outpatient chronic pain clinic (CPC; N=82) and at an intensive amplified pain day-treatment program (AMP; N= 63). Aim 1 examined construct validity of PROMIS measures by comparing them with corresponding “legacy” measures administered as part of usual care in the CPC sample. Aim 2 examined sensitivity to change in both CPC and AMP samples. Longitudinal growth models showed that PROMIS Pain Interference, Anxiety, Depression, Mobility, Upper Extremity and Peer Relationship measures and legacy instruments generally performed similarly with slightly steeper slopes of improvement in legacy measures. All seven PROMIS domains showed responsiveness to change. Results offered initial support for the validity of PROMIS measures in pediatric chronic pain. Further validation with larger and more diverse pediatric pain samples and additional legacy measures would broaden the scope of use of PROMIS in clinical research. PMID:26447704
Alhajj, Mohammed Nasser; Amran, Abdullah Ghalib; Halboub, Esam; Al-Basmi, Abdulghani Ali; Al-Ghabri, Fawaz Abdullah
2017-07-01
This study aimed at developing the Arabic version of the Orofacial Esthetic Scale (OES-Ar) and to investigate its psychometric properties among Arabic-speaking population with and without esthetic impairments. Translation and cross-cultural adaptation was done according to the standard guidelines. Internal consistency was assessed on 230 participants. For test-retest reliability, 50 subjects with natural teeth were recalled within a period of 2 weeks. Validity of the OES-Ar was tested by construct, convergent, and discriminant validity tests. Responsiveness to esthetic changes was assessed in 60 patients. The results showed excellent internal consistency with Cronbach's alpha value of 0.92 and inter-item correlation average value of 0.60. The ICC values ranged from 0.87 to 0.96 which indicated excellent agreement. Construct validity of the OES-Ar was confirmed to be one-factor structure (one-dimensional). For convergent validity, a significant correlation was found between OES summary score and overall impression of the orofacial esthetic as well as between OES summary score and the summary score of the three questions of the OHIP-49Ar related to esthetic. The discriminant validity test revealed significant differences between different study groups (P<0.001). Responsiveness to treatment was confirmed by significant differences between pre- and post-treatment OES total summary score (P<0.001). The OES-Ar has excellent psychometric properties making it valuable instrument to assess orofacial esthetics in Arabic-speaking patients. Copyright © 2016 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
Singer, Sara; Meterko, Mark; Baker, Laurence; Gaba, David; Falwell, Alyson; Rosen, Amy
2007-01-01
Objective To describe the development of an instrument for assessing workforce perceptions of hospital safety culture and to assess its reliability and validity. Data Sources/Study Setting Primary data collected between March 2004 and May 2005. Personnel from 105 U.S. hospitals completed a 38-item paper and pencil survey. We received 21,496 completed questionnaires, representing a 51 percent response rate. Study Design Based on review of existing safety climate surveys, we developed a list of key topics pertinent to maintaining a culture of safety in high-reliability organizations. We developed a draft questionnaire to address these topics and pilot tested it in four preliminary studies of hospital personnel. We modified the questionnaire based on experience and respondent feedback, and distributed the revised version to 42,249 hospital workers. Data Collection We randomly divided respondents into derivation and validation samples. We applied exploratory factor analysis to responses in the derivation sample. We used those results to create scales in the validation sample, which we subjected to multitrait analysis (MTA). Principal Findings We identified nine constructs, three organizational factors, two unit factors, three individual factors, and one additional factor. Constructs demonstrated substantial convergent and discriminant validity in the MTA. Cronbach's α coefficients ranged from 0.50 to 0.89. Conclusions It is possible to measure key salient features of hospital safety climate using a valid and reliable 38-item survey and appropriate hospital sample sizes. This instrument may be used in further studies to better understand the impact of safety climate on patient safety outcomes. PMID:17850530
Kaiser, Ulrike; Steinmetz, Dorit; Scharnagel, Rüdiger; Jensen, Mark P; Balck, Friedrich; Sabatowski, Rainer
2014-10-14
Since the response of spouses has been proven to be an important reinforcement of pain behaviour and disability it has been addressed in research and therapy. Fordyce suggested pain behaviour and well behaviour be considered in explaining suffering in chronic pain patients. Among existing instruments concerning spouse's responses the aspect of well behaviour has not been examined so far. The SRI (Spouse Response Inventory) tries to consider pain behaviour and well behaviour and appears to be acceptable because of its brevity and close proximity to daily language. The aim of the study is the translation into German, followed by evaluation and validation, of the SRI on a German sample of patients with chronic pain. The study is comprehensively designed: initially, the focus will lie on the translation of the instrument following the guidelines for cross-cultural translation and adaptation and evaluation of the German version according to the source study. Subsequently, a validation referring to predictive, incremental and construct validation will be conducted using instruments based on similar or close but different constructs. Evaluation of the resulting SRI-G (SRI-German) will be conducted on a sample of at least 30 patients with chronic pain attending a comprehensive pain centre. For validation at least 120 patients with chronic headache, back pain, cancer related pain and somatoform pain disorder shall be included, for a total of 480 patients. Separate analyses according to specific pain diagnoses will be performed to ensure psychometric property, interpretability and control of diagnosis of specific limitations. Analyses will include comprehensive investigation of psychometric property of the scale by hierarchical regression analyses, correlation analyses, multivariate analysis of variance and exploratory factor analyses (SPSS). The study protocol was approved by the Ethics Committee of the University of Dresden (EK 335 122008) based on the Helsinki declaration. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Development and Construct Validation of the Interprofessional Attitudes Scale
Norris, Jeffrey; Carpenter, Joan G.; Eaton, Jacqueline; Guo, Jia-Wen; Lassche, Madeline; Pett, Marjorie A.; Blumenthal, Donald K.
2015-01-01
Purpose Training of health professionals requires development of interprofessional competencies and assessment of these competencies. No validated tools exist to assess all four competency domains described in the 2011 Core Competencies for Interprofessional Collaborative Practice (the IPEC Report). The purpose of this study was to develop and validate a scale based on the IPEC competency domains that assesses interprofessional attitudes of students in the health professions. Method In 2012, a survey tool was developed and administered to 1,549 students from the University of Utah Health Science Center, an academic health center composed of four schools and colleges (Health, Medicine, Nursing, and Pharmacy). Exploratory and confirmatory factor analyses (EFA and CFA) were performed to validate the assessment tool, eliminate redundant questions, and to identify subscales. Results The EFA and CFA focused on aligning subscales with IPEC core competencies, and demonstrating good construct validity and internal consistency reliability. A response rate of 45% (n = 701) was obtained. Responses with complete data (n=678) were randomly split into two datasets which were independently analyzed using EFA and CFA. The EFA produced a 27-item scale, with five subscales (Cronbach’s alpha coefficients: 0.62 to 0.92). CFA indicated the content of the five subscales was consistent with the EFA model. Conclusions The Interprofessional Attitudes Scale (IPAS) is a novel tool that, compared to previous tools, better reflects current trends in interprofessional competencies. The IPAS should be useful to health sciences educational institutions and others training people to work collaboratively in interprofessional teams. PMID:25993280
Rasch-built Overall Disability Scale for Multifocal motor neuropathy (MMN-RODS(©) ).
Vanhoutte, Els K; Faber, Catharina G; van Nes, Sonja I; Cats, Elisabeth A; Van der Pol, W-Ludo; Gorson, Kenneth C; van Doorn, Pieter A; Cornblath, David R; van den Berg, Leonard H; Merkies, Ingemar S J
2015-09-01
Clinical trials in multifocal motor neuropathy (MMN) have often used ordinal-based measures that may not accurately capture changes. We aimed to construct a disability interval outcome measure specifically for MMN using the Rasch model and to examine its clinimetric properties. A total of 146 preliminary activity and participation items were assessed twice (reliability studies) in 96 clinically stable MMN patients. These patients also assessed the ordinal-based overall disability sum score (construct, sample-dependent validity). The final Rasch-built overall disability scale for MMN (MMN-RODS(©) ) was serially applied in 26 patients with newly diagnosed or relapsing MMN, treated with intravenous immunoglobulin (IVIg) (1-year follow-up; responsiveness study). The magnitude of change for each patient was calculated using the minimum clinically important difference technique related to the individually obtained standard errors. A total of 121 items not fulfilling Rasch requirements were removed. The final 25-item MMN-RODS(©) fulfilled all Rasch model's expectations and showed acceptable reliability and validity including good discriminatory capacity. Most serially examined patients improved, but its magnitude was low, reflecting poor responsiveness. The constructed MMN-RODS(©) is a disease-specific, interval measure to detect activity limitations in patients with MMN and overcomes the shortcomings of ordinal scales. However, future clinimetric studies are needed to improve the MMN-RODS(©) 's responsiveness by longer observations and/or more rigorous treatment regimens. © 2015 Peripheral Nerve Society.
ERIC Educational Resources Information Center
Tassé, Marc J.; Schalock, Robert L.; Thissen, David; Balboni, Giulia; Bersani, Henry, Jr.; Borthwick-Duffy, Sharon A.; Spreat, Scott; Widaman, Keith F.; Zhang, Dalun; Navas, Patricia
2016-01-01
The Diagnostic Adaptive Behavior Scale (DABS) was developed using item response theory (IRT) methods and was constructed to provide the most precise and valid adaptive behavior information at or near the cutoff point of making a decision regarding a diagnosis of intellectual disability. The DABS initial item pool consisted of 260 items. Using IRT…
Tabbakh, Tamara; Freeland-Graves, Jeanne
2016-08-01
The home environment is an important setting for the development of weight status in adolescence. At present a limited number of valid and reliable tools are available to evaluate the weight-related comprehensive home environment of this population. The goal of this research was to develop the Multidimensional Home Environment Scale which measures multiple components of the home. It includes psychological, social, and environmental domains from the perspective of an adolescent and the mother. Items were generated based on a literature review and then assessed for content validity by an expert panel and focus group in the target population. Internal consistency reliability was determined using Cronbach's α. Principal components analysis with varimax rotation was employed for assessment of construct validity. Temporal stability was evaluated using paired sample t-tests and bivariate correlations between responses at two different times, 1-2weeks apart. Associations between adolescent and mother responses were utilized for convergent validity. The final versions contained 32-items for adolescents and 36-items for mothers; these were administered to 218 adolescents and mothers. The subscales on the questionnaires exhibited high construct validity, internal consistency reliability (adolescent: α=0.82, mother: α=0.83) and test-retest reliability (adolescent: r=0.90, p<0.01; mother: r=0.91, p<0.01). Total home environment scores were computed, with greater scores reflecting a better health environment. These results verify the utility of the MHES as a valid and reliable instrument. This promising tool can be utilized to capture the comprehensive home environment of young adolescents (11-14years old). Copyright © 2016 Elsevier Ltd. All rights reserved.
Measuring psychological flexibility in medical students and residents: a psychometric analysis
Palladino, Christie L.; Ange, Brittany; Richardson, Deborah S.; Casillas, Rhonda; Decker, Matt; Gillies, Ralph A.; House, Amy; Rollock, Michael; Salazar, William H.; Waller, Jennifer L.; Zeidan, Ronnie; Stepleman, Lara
2013-01-01
Purpose Psychological flexibility involves mindful awareness of our thoughts and feelings without allowing them to prohibit acting consistently with our values and may have important implications for patient-centered clinical care. Although psychological flexibility appears quite relevant to the training and development of health care providers, prior research has not evaluated measures of psychological flexibility in medical learners. Therefore, we investigated the validity of our learners’ responses to three measures related to psychological flexibility. Methods Fourth-year medical students and residents (n=275) completed three measures of overlapping aspects of psychological flexibility: (1) Acceptance and Action Questionnaire-II (AAQ-II); (2) Cognitive Fusion Questionnaire (CFQ); and (3) Mindful Attention and Awareness Questionnaire (MAAS). We evaluated five aspects of construct validity: content, response process, internal structure, relationship with other variables, and consequences. Results We found good internal consistency for responses on the AAQ (α=0.93), MAAS (α=0.92), and CFQ (α=0.95). Factor analyses demonstrated a reasonable fit to previously published factor structures. As expected, scores on all three measures were moderately correlated with one another and with a measure of life satisfaction (p<0.01). Conclusion Our findings provide preliminary evidence supporting validity of the psychological flexibility construct in a medical education sample. As psychological flexibility is a central concept underlying self-awareness, this work may have important implications for clinical training and practice. PMID:23948496
An empirical look at the Defense Mechanism Test (DMT): reliability and construct validity.
Ekehammar, Bo; Zuber, Irena; Konstenius, Marja-Liisa
2005-07-01
Although the Defense Mechanism Test (DMT) has been in use for almost half a century, there are still quite contradictory views about whether it is a reliable instrument, and if so, what it really measures. Thus, based on data from 39 female students, we first examined DMT inter-coder reliability by analyzing the agreement among trained judges in their coding of the same DMT protocols. Second, we constructed a "parallel" photographic picture that retained all structural characteristic of the original and analyzed DMT parallel-test reliability. Third, we examined the construct validity of the DMT by (a) employing three self-report defense-mechanism inventories and analyzing the intercorrelations between DMT defense scores and corresponding defenses in these instruments, (b) studying the relationships between DMT responses and scores on trait and state anxiety, and (c) relating DMT-defense scores to measures of self-esteem. The main results showed that the DMT can be coded with high reliability by trained coders, that the parallel-test reliability is unsatisfactory compared to traditional psychometric standards, that there is a certain generalizability in the number of perceptual distortions that people display from one picture to another, and that the construct validation provided meager empirical evidence for the conclusion that the DMT measures what it purports to measure, that is, psychological defense mechanisms.
Vieira, Maria Aparecida; Ohara, Conceição Vieira da Silva; de Domenico, Edvane Birelo Lopes
2016-01-01
Abstract Objective: to construct an instrument for the assessment of graduates of undergraduate nursing courses and to validate this instrument through the consensus of specialists. Method: methodological study. In order to elaborate the instrument, documental analysis and a literature review were undertaken. Validation took place through use of the Delphi Conference, between September 2012 and September 2013, in which 36 specialists from Brazilian Nursing participated. In order to analyze reliability, the Cronbach alpha coefficient, the item/total correlation, and the Pearson correlation coefficient were calculated. Results: the instrument was constructed with the participation of specialist nurses representing all regions of Brazil, with experience in lecturing and research. The first Delphi round led to changes in the first instrument, which was restructured and submitted to another round, with a response rate of 94.44%. In the second round, the instrument was validated with a Cronbach alpha of 0.75. Conclusion: the final instrument possessed three dimensions related to the characterization of the graduate, insertion in the job market, and evaluation of the professional training process. This instrument may be used across the territory of Brazil as it is based on the curricular guidelines and contributes to the process of regulation of the quality of the undergraduate courses in Nursing. PMID:27305184
A process-oriented measure of habit strength for moderate-to-vigorous physical activity
Grove, J. Robert; Zillich, Irja; Medic, Nikola
2014-01-01
Purpose: Habitual action is an important aspect of health behaviour, but the relevance of various habit strength indicators continues to be debated. This study focused specifically on moderate-to-vigorous physical activity (MVPA) and evaluated the construct validity of a framework emphasizing patterned action, stimulus-response bonding, automaticity, and negative consequences for nonperformance as indicators of habit strength for this form of exercise. Methods: Upper-level undergraduates (N = 124) provided demographic information and responded to questionnaire items assessing historical MVPA involvement, current MVPA involvement, and the four proposed habit strength dimensions. Factor analyses were used to examine the latent structure of the habit strength indicators, and the model's construct validity was evaluated via an examination of relationships with repetition history and current behaviour. Results: At a measurement level, findings indicated that the proposed four-component model possessed psychometric integrity as a coherent set of factors. Criterion-related validity was also demonstrated via significant changes in three of the four factors as a function of past involvement in MVPA and significant correlations with the frequency, duration, and intensity of current MVPA. Conclusions: These findings support the construct validity of this exercise habit strength model and suggest that it could provide a template for future research on how MVPA habits are developed and maintained. PMID:25750789
Choi, Bongsam
2018-01-01
[Purpose] This study aimed to cross-cultural adapt and validate the Korean version of an physical activity measure (K-PAM) for community-dwelling elderly. [Subjects and Methods] One hundred and thirty eight community-dwelling elderlies, 32 males and 106 female, participated in the study. All participants were asked to fill out a fifty-one item questionnaire measuring perceived difficulty in the activities of daily living (ADL) for the elderly. One-parameter model of item response theory (Rasch analysis) was applied to determine the construct validity and to inspect item-level psychometric properties of 51 ADL items of the K-PAM. [Results] Person separation reliability (analogous to Cronbach's alpha) for internal consistency was ranging 0.93 to 0.94. A total of 16 items was misfit to the Rasch model. After misfit item deletion, 35 ADL items of the K-PAM were placed in an empirically meaningful hierarchy from easy to hard. The item-person map analysis delineated that the item difficulty was well matched for the elderlies with moderate and low ability except for high ceilings. [Conclusion] Cross-cultural adapted K-PAM was shown to be sufficient for establishing construct validity and stable psychometric properties confirmed by person separation reliability and fit statistics.
A process-oriented measure of habit strength for moderate-to-vigorous physical activity.
Grove, J Robert; Zillich, Irja; Medic, Nikola
2014-01-01
Purpose : Habitual action is an important aspect of health behaviour, but the relevance of various habit strength indicators continues to be debated. This study focused specifically on moderate-to-vigorous physical activity (MVPA) and evaluated the construct validity of a framework emphasizing patterned action, stimulus-response bonding, automaticity, and negative consequences for nonperformance as indicators of habit strength for this form of exercise. Methods : Upper-level undergraduates ( N = 124) provided demographic information and responded to questionnaire items assessing historical MVPA involvement, current MVPA involvement, and the four proposed habit strength dimensions. Factor analyses were used to examine the latent structure of the habit strength indicators, and the model's construct validity was evaluated via an examination of relationships with repetition history and current behaviour. Results : At a measurement level, findings indicated that the proposed four-component model possessed psychometric integrity as a coherent set of factors. Criterion-related validity was also demonstrated via significant changes in three of the four factors as a function of past involvement in MVPA and significant correlations with the frequency, duration, and intensity of current MVPA. Conclusions : These findings support the construct validity of this exercise habit strength model and suggest that it could provide a template for future research on how MVPA habits are developed and maintained.
de Almeida Vieira Monteiro, Ana Paula Teixeira; Fernandes, Alexandre Bastos
2016-05-17
Cultural competence is an essential component in rendering effective and culturally responsive services to culturally and ethnically diverse clients. Still, great difficulty exists in assessing the cultural competence of mental health nurses. There are no Portuguese validated measurement instruments to assess cultural competence in mental health nurses. This paper reports a study testing the reliability and validity of the Portuguese version of the Multicultural Mental Health Awareness Scale-MMHAS in a sample of Portuguese nurses. Following a standard forward/backward translation into Portuguese, the adapted version of MMHAS, along with a sociodemographic questionnaire, were applied to a sample of 306 Portuguese nurses (299 males, 77 females; ages 21-68 years, M = 35.43, SD = 9.85 years). A psychometric research design was used with content and construct validity and reliability. Reliability was assessed using internal consistency and item-total correlations. Construct validity was determined using factor analysis. The factor analysis confirmed that the Portuguese version of MMHAS has a three-factor structure of multicultural competencies (Awareness, Knowledge, and Skills) explaining 59.51% of the total variance. Strong content validity and reliability correlations were demonstrated. The Portuguese version of MMHAS has a strong internal consistency, with a Cronbach's alpha of 0.958 for the total scale. The results supported the construct validity and reliability of the Portuguese version of MMHAS, proving that is a reliable and valid measure of multicultural counselling competencies in mental health nursing. The MMHAS Portuguese version can be used to evaluate the effectiveness of multicultural competency training programs in Portuguese-speaking mental health nurses. The scale can also be a useful in future studies of multicultural competencies in Portuguese-speaking nurses.
Sexual Harassment DEOCS 4.1 Construct Validity Summary
2017-08-01
These items were modified to provide additional clarity regarding chain of command actions and response in the final survey . ** These items were...modified to provide additional clarity regarding indivduals from the respondent’s workplace in the final survey . 4 Conclusion The revised sexual
Assessing Creative Problem-Solving with Automated Text Grading
ERIC Educational Resources Information Center
Wang, Hao-Chuan; Chang, Chun-Yen; Li, Tsai-Yen
2008-01-01
The work aims to improve the assessment of creative problem-solving in science education by employing language technologies and computational-statistical machine learning methods to grade students' natural language responses automatically. To evaluate constructs like creative problem-solving with validity, open-ended questions that elicit…
Cheng, Alice; Humayun, Aiza; Schwartz, Zvi
2016-01-01
Abstract The addition of porosity to the traditionally used solid titanium metal implants has been suggested to more closely mimic the natural mechanical properties of bone and increase osseointegration in dental and orthopedic implants. The objective of this study was to evaluate cellular response to three-dimensional (3D) porous Ti-6Al-4V constructs fabricated by additive manufacturing using laser sintering with low porosity (LP), medium porosity (MP), and high porosity (HP) with low resolution (LR) and high resolution (HR) based on a computed tomography scan of human trabecular bone. After surface processing, construct porosity ranged from 41.0% to 76.1%, but all possessed micro-/nanoscale surface roughness and similar surface chemistry containing mostly Ti, O, and C. Biological responses (osteoblast differentiation, maturation, and local factor production) by MG63 osteoblast-like cells and normal human osteoblasts favored 3D than two-dimensional (2D) solid constructs. First, MG63 cells were used to assess differences in cell response to 2D compared to LR and HR porous 3D constructs. MG63 cells were sensitive to porosity resolution and exhibited increased osteocalcin (OCN), vascular endothelial growth factor (VEGF), osteoprotegerin (OPG), and bone morphogenetic protein 2 (BMP2) on HR 3D constructs than on 2D and LR 3D constructs. MG63 cells also exhibited porosity-dependent responses on HR constructs, with up to a 6.9-fold increase in factor production on LP-HR and MP-HR constructs than on HP-HR constructs. NHOsts were then used to validate biological response on HR constructs. NHOsts exhibited decreased DNA content and alkaline phosphatase activity and up to a 2.9-fold increase in OCN, OPG, VEGF, BMP2, and BMP4 on 3D HR constructs than on 2D controls. These results indicate that osteoblasts prefer a 3D architecture than a 2D surface and that osteoblasts are sensitive to the resolution of trabecular detail and porosity parameters of laser-sintered 3D Ti-6Al-4V constructs. PMID:28804735
Cheng, Alice; Humayun, Aiza; Boyan, Barbara D; Schwartz, Zvi
2016-03-01
The addition of porosity to the traditionally used solid titanium metal implants has been suggested to more closely mimic the natural mechanical properties of bone and increase osseointegration in dental and orthopedic implants. The objective of this study was to evaluate cellular response to three-dimensional (3D) porous Ti-6Al-4V constructs fabricated by additive manufacturing using laser sintering with low porosity (LP), medium porosity (MP), and high porosity (HP) with low resolution (LR) and high resolution (HR) based on a computed tomography scan of human trabecular bone. After surface processing, construct porosity ranged from 41.0% to 76.1%, but all possessed micro-/nanoscale surface roughness and similar surface chemistry containing mostly Ti, O, and C. Biological responses (osteoblast differentiation, maturation, and local factor production) by MG63 osteoblast-like cells and normal human osteoblasts favored 3D than two-dimensional (2D) solid constructs. First, MG63 cells were used to assess differences in cell response to 2D compared to LR and HR porous 3D constructs. MG63 cells were sensitive to porosity resolution and exhibited increased osteocalcin (OCN), vascular endothelial growth factor (VEGF), osteoprotegerin (OPG), and bone morphogenetic protein 2 (BMP2) on HR 3D constructs than on 2D and LR 3D constructs. MG63 cells also exhibited porosity-dependent responses on HR constructs, with up to a 6.9-fold increase in factor production on LP-HR and MP-HR constructs than on HP-HR constructs. NHOsts were then used to validate biological response on HR constructs. NHOsts exhibited decreased DNA content and alkaline phosphatase activity and up to a 2.9-fold increase in OCN, OPG, VEGF, BMP2, and BMP4 on 3D HR constructs than on 2D controls. These results indicate that osteoblasts prefer a 3D architecture than a 2D surface and that osteoblasts are sensitive to the resolution of trabecular detail and porosity parameters of laser-sintered 3D Ti-6Al-4V constructs.
Ethical responsibilities in nursing: research findings and issues.
Cassidy, V R
1991-01-01
Discussions in the literature assert that nurses are becoming increasingly cognizant of their ethical responsibilities, but that they are often ill prepared to participate in ethical decision making. A review of selected research literature from 1970 to 1987 was undertaken to validate these assertions. A total of 12 studies related to ethical responsibilities was identified in the review; all studies were published between 1980 and 1987. The majority of studies were at the descriptive and exploratory levels and employed Kohlberg's cognitive theory of moral development as their conceptual framework. Significant findings related to educational level and ethical responsibilities were consistent across studies. Findings related to age and clinical experience were mixed; the effects of economic level, religion-religiosity, ethnicity, and other variables on ethical responsibilities were not significant. Issues raised in the light of the existing research include the use of Kohlberg's theory as a conceptual orientation in nursing groups and limited data on the reliability and validity of instruments used in measuring ethical constructs. Recommendations for future research on ethical responsibilities include the validation of Kohlberg's theory for nursing investigations, exploration of other frameworks for developing a multidimensional view of ethical responsibilities, and the use of qualitative research designs.
Kim, Jin Goo; Lee, Joong Yub; Seo, Seung Suk; Choi, Choong Hyeok; Lee, Myung Chul
2013-01-01
Purpose To perform a cross-cultural adaptation and to test the measurement properties of the Korean version of International Knee Documentation Committee (K-IKDC) Subjective Knee Form. Materials and Methods According to the guidelines for cross-cultural adaptation, translation and backward translation of the English version of the IKDC Subjective Knee Form were performed. After translation into the Korean version, 150 patients who had knee-related problems were asked to complete the K-IKDC, Lysholm score, and Short Form-36 (SF-36). Of these patients, 126 were retested 2 weeks later to evaluate test-retest reliability, and 104 were recruited 3 months later to evaluate responsiveness. Construct validity was analyzed by investigating the correlation with Lysholm score and SF-36; content validity was also evaluated. Standardized mean response was calculated for evaluating responsiveness. Results The test-retest reliability proved excellent with a high value for the intraclass correlation coefficient (r=0.94). The internal consistency was strong (Cronbach's α=0.91). Good content validity with absence of floor not ceiling effects and good convergent and divergent validity were observed. Moderate responsiveness was shown (standardized mean response=0.689). Conclusions The K-IKDC demonstrated good measurement properties. We suggest that this instrument is an excellent evaluation instrument that can be used for Korean patients with knee-related injuries. PMID:24032098
Creasy, John M; Midya, Abhishek; Chakraborty, Jayasree; Adams, Lauryn B; Gomes, Camilla; Gonen, Mithat; Seastedt, Kenneth P; Sutton, Elizabeth J; Cercek, Andrea; Kemeny, Nancy E; Shia, Jinru; Balachandran, Vinod P; Kingham, T Peter; Allen, Peter J; DeMatteo, Ronald P; Jarnagin, William R; D'Angelica, Michael I; Do, Richard K G; Simpson, Amber L
2018-06-19
This study investigates whether quantitative image analysis of pretreatment CT scans can predict volumetric response to chemotherapy for patients with colorectal liver metastases (CRLM). Patients treated with chemotherapy for CRLM (hepatic artery infusion (HAI) combined with systemic or systemic alone) were included in the study. Patients were imaged at baseline and approximately 8 weeks after treatment. Response was measured as the percentage change in tumour volume from baseline. Quantitative imaging features were derived from the index hepatic tumour on pretreatment CT, and features statistically significant on univariate analysis were included in a linear regression model to predict volumetric response. The regression model was constructed from 70% of data, while 30% were reserved for testing. Test data were input into the trained model. Model performance was evaluated with mean absolute prediction error (MAPE) and R 2 . Clinicopatholologic factors were assessed for correlation with response. 157 patients were included, split into training (n = 110) and validation (n = 47) sets. MAPE from the multivariate linear regression model was 16.5% (R 2 = 0.774) and 21.5% in the training and validation sets, respectively. Stratified by HAI utilisation, MAPE in the validation set was 19.6% for HAI and 25.1% for systemic chemotherapy alone. Clinical factors associated with differences in median tumour response were treatment strategy, systemic chemotherapy regimen, age and KRAS mutation status (p < 0.05). Quantitative imaging features extracted from pretreatment CT are promising predictors of volumetric response to chemotherapy in patients with CRLM. Pretreatment predictors of response have the potential to better select patients for specific therapies. • Colorectal liver metastases (CRLM) are downsized with chemotherapy but predicting the patients that will respond to chemotherapy is currently not possible. • Heterogeneity and enhancement patterns of CRLM can be measured with quantitative imaging. • Prediction model constructed that predicts volumetric response with 20% error suggesting that quantitative imaging holds promise to better select patients for specific treatments.
St-Pierre, Corinne; Desmeules, François; Dionne, Clermont E; Frémont, Pierre; MacDermid, Joy C; Roy, Jean-Sébastien
2016-01-01
To conduct a systematic review of the psychometric properties (reliability, validity and responsiveness) of self-report questionnaires used to assess symptoms and functional limitations of individuals with rotator cuff (RC) disorders. A systematic search in three databases (Cinahl, Medline and Embase) was conducted. Data extraction and critical methodological appraisal were performed independently by three raters using structured tools, and agreement was achieved by consensus. A descriptive synthesis was performed. One-hundred and twenty articles reporting on 11 questionnaires were included. All questionnaires were highly reliable and responsive to change, and showed construct validity; seven questionnaires also shown known-group validity. The minimal detectable change ranged from 6.4% to 20.8% of total score; only two questionnaires (American Shoulder and Elbow Surgeon questionnaire [ASES] and Upper Limb Functional Index [ULFI]) had a measurement error below 10% of global score. Minimal clinically important differences were established for eight questionnaires, and ranged from 8% to 20% of total score. Overall, included questionnaires showed acceptable psychometric properties for individuals with RC disorders. The ASES and ULFI have the smallest absolute error of measurement, while the Western Ontario RC Index is one of the most responsive questionnaires for individuals suffering from RC disorders. All included questionnaires are reliable, valid and responsive for the evaluation of individuals with RC disorders. As all included questionnaires showed good psychometric properties for the targeted population, the choice should be made according to the purpose of the evaluation and to the construct being evaluated by the questionnaire. The WORC, a RC-specific questionnaire, appeared to be more responsive. It should therefore be used to evaluate change in time. If the evaluation is time-limited, shorter questionnaires or short versions should be considered (such as Quick DASH or SST).
Cappelleri, Joseph C; Jason Lundy, J; Hays, Ron D
2014-05-01
The US Food and Drug Administration's guidance for industry document on patient-reported outcomes (PRO) defines content validity as "the extent to which the instrument measures the concept of interest" (FDA, 2009, p. 12). According to Strauss and Smith (2009), construct validity "is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity" (p. 7). Hence, both qualitative and quantitative information are essential in evaluating the validity of measures. We review classical test theory and item response theory (IRT) approaches to evaluating PRO measures, including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized "difficulty" (severity) order of items is represented by observed responses. If a researcher has few qualitative data and wants to get preliminary information about the content validity of the instrument, then descriptive assessments using classical test theory should be the first step. As the sample size grows during subsequent stages of instrument development, confidence in the numerical estimates from Rasch and other IRT models (as well as those of classical test theory) would also grow. Classical test theory and IRT can be useful in providing a quantitative assessment of items and scales during the content-validity phase of PRO-measure development. Depending on the particular type of measure and the specific circumstances, the classical test theory and/or the IRT should be considered to help maximize the content validity of PRO measures. Copyright © 2014 Elsevier HS Journals, Inc. All rights reserved.
The measurement of threat orientations.
Thompson, Suzanne C; Schlehofer, Michèle M; Bovin, Michelle J
2006-01-01
To develop measures of 3 threat orientations that affect responses to health behavior messages. In Study 1, college students (N = 47) completed items assessing threat orientations and health behaviors. In Study 2, college students and community adults (N = 110) completed the threat orientation items and measures of convergent and discriminant validity. In Study 1, the control-based, denial-based, and heightened-sensitivity-based threat orientation scales demonstrated good internal consistency and correlated with engagement in health behaviors. In Study 2, the convergent and discriminant validity of the 3 measures was established. The 3 scales have good internal reliability and construct validity.
Trakman, Gina Louise; Forsyth, Adrienne; Hoye, Russell; Belski, Regina
2018-01-01
The Nutrition for Sport Knowledge Questionnaire (NSKQ) is an 89-item, valid and reliable measure of sports nutrition knowledge (SNK). It takes 25 min to complete and has been subject to low completion and response rates. The aim of this study was to develop an abridged version of the NSKQ (A-NSKQ) and compare response rates, completion rates and NK scores of the NSKQ and A-NSKQ. Rasch analysis was used for the questionnaire validation. The sample ( n = 181) was the same sample that was used in the validation of the full-length NSKQ. Construct validity was assessed using the known-group comparisons method. Temporal stability was assessed using the test-retest reliability method. NK assessment was cross-sectional; responses were collected electronically from members of one non-elite Australian football (AF) and netball club, using Qualtrics Software (Qualtrics, Provo, UT). Validation - The A-NSKQ has 37 items that assess general ( n = 17) and sports ( n = 20) nutrition knowledge (NK). Both sections are unidimensional (Perc5% = 2.84% [general] and 3.41% [sport]). Both sections fit the Rasch Model (overall-interaction statistic mean (SD) = - 0.15 ± 0.96 [general] and 0.22 ± 1.11 [sport]; overall-person interaction statistic mean (SD) = - 0.11 ± 0.61 [general] and 0.08 ± 0.73 [sport]; Chi-Square probability = 0.308 [general] and 0.283 [sport]). Test-retest reliability was confirmed ( r = 0.8, P < 0.001 [general] and r = 0.7, P < 0.001 [sport]). Construct validity was demonstrated (nutrition students = 77% versus non-nutrition students = 60%, P < 0.001 [general] and nutrition students = 60% versus non-nutrition students = 40%, P < 0.001 [sport]. Assessment of NK - 177 usable survey responses from were returned. Response rates were low (7%) but completion rates were high (85%). NK scores on the A-NSKQ (46%) are comparable to results obtained in similar cohorts on the NSKQ (49%). The A-NSKQ took on average 12 min to complete, which is around half the time taken to complete the NSKQ (25 min). The A-NSKQ is a valid and reliable, brief questionnaire designed to assess general NK (GNK) and SNK.
Developing Multiple Choice Tests: Tips & Techniques
ERIC Educational Resources Information Center
McCowan, Richard J.
1999-01-01
Item writing is a major responsibility of trainers. Too often, qualified staff who prepare lessons carefully and teach conscientiously use inadequate tests that do not validly reflect the true level of trainee achievement. This monograph describes techniques for constructing multiple-choice items that measure student performance accurately. It…
The Revised Perceived Environmental Control Measure: A Review and Analysis.
ERIC Educational Resources Information Center
Smith-Sebasto, N. J.
1992-01-01
A study reveals the need for extensive refinement of the Revised Perceived Environmental Control Measure purported in the past to be a reliable and valid instrument to measure the relationship between the psychological construct, "locus of control," and environmental action or environmentally responsible behavior. (MCO)
Measuring Food Use in School-Aged Children.
ERIC Educational Resources Information Center
Randall, Elizabeth
1991-01-01
Ideal measurement of food use in school-aged children possesses validity, reliability, interpretability, and feasibility. The article discusses dietary records and recalls and measures of patterns in food use. Issues to consider when constructing children's food frequency are food list, time intervals, response set, context of questioning, and…
Dimensionality of the California Preschool Social Competency Scale.
ERIC Educational Resources Information Center
Flint, David L.; And Others
1980-01-01
Structure and construct validity of the California Preschool Social Competency Scale was investigated. Five factors were interpreted: considerateness; extraversion; task orientation; verbal facility; and response to the unfamiliar. The first three were found to be similar to the three dimensions of the Classroom Behavior Inventory. (Author/BW)
Paschoalin-Maurin, Tatiana; Dos Anjos-Garcia, Tayllon; Falconi-Sobrinho, Luiz Luciano; de Freitas, Renato Leonardo; Coimbra, Jade Pissamiglio Cysne; Laure, Carlos Júlio; Coimbra, Norberto Cysne
2018-01-15
Using an innovative approach to study the neural bases of psychiatric disorders, this study investigated the behavioral, morphological and pharmacological bases of panic attack-induced responses in a prey-versus-coral snake paradigm. Mesocricetus auratus was chronically treated with intraperitoneal administration of the selective serotonin uptake inhibitor paroxetine or the gamma aminobutyric acid (GABA)/benzodiazepine receptor agonist alprazolam at three different doses and were then confronted with a venomous coral snake (Micrurus frontalis, Reptilia, Elapidae). The threatened rodents exhibited defensive attention, flat back approaches, defensive immobility, and escape defensive responses in the presence of the venomous snake, followed by increases in Fos protein in limbic structure neurons. Chronic administration of both paroxetine and alprazolam decreased these responses with morphological correlates between the panicolytic effect of both drugs administered at the highest dose and decreases in Fos protein-immunolabeled perikarya found in the amygdaloid complex, hypothalamus and periaqueductal gray matter columns, which are structures that make up the encephalic aversion system. These findings provide face, construct and predictive validities of this new experimental model of anxiety- and panic attack-like behavioral responses displayed by threatened prey confronted with venomous coral snakes. Copyright © 2017 IBRO. Published by Elsevier Ltd. All rights reserved.
Claassens, L; Terwee, C B; Deeg, D J H; Broese van Groenou, M I; Widdershoven, G A M; Huisman, M
2016-04-01
In response to the increased emphasis placed on older people's self-reliance in many welfare societies, we aimed to develop and validate a measurement instrument, assessing perceived control in health care among older adults with care needs. The target group consists of older people who live (semi-)independently and use professional health care, with or without informal care. Phase I (development) of the study consisted of the construction of the instrument based on the input from a variety of stakeholders. Phase II (validation) entailed a quantitative study in a sample of 247 respondents selected from the Longitudinal Aging Study Amsterdam, to assess the instrument's construct validity (structural validity and hypotheses testing) and reliability (internal consistency). The questionnaire consists of 29 items, related to organizing professional care, communication with care professionals, health management in the home situation, planning (more) complex care in the future, and perceived support from the social network. Based on a factor analysis, we identified three subscales: (I.) 'perceived personal control in health care'; (II.) 'anticipated personal control regarding future health care'; and (III.) 'perceived support from the social network,' with internal consistencies varying from Cronbach's α = .71 to .90. Factor I was associated with mastery, self-efficacy, self-esteem (r = .31-.35) and factor III with social loneliness (r = -.42). Factor II correlated less strongly with mastery, self-efficacy, and self-esteem (r < .30). Our questionnaire revealed sufficient construct validity and internal consistency. The instrument provides a basis for further quantitative research regarding control, especially in relation to health care-related outcomes.
Validity and Reliability of a General Nutrition Knowledge Questionnaire for Japanese Adults.
Matsumoto, Mai; Tanaka, Rie; Ikemoto, Shinji
2017-01-01
Nutrition knowledge is necessary for individuals to adopt appropriate dietary habits, and needs to be evaluated before nutrition education is provided. However, there is no tool to assess general nutrition knowledge of adults in Japan. Our aims were to determine the validity and reliability of a general nutrition knowledge questionnaire for Japanese adults. We developed the pilot version of the Japanese general nutrition knowledge questionnaire (JGNKQ) and administered the pilot study to assess content validity and internal reliability to 1,182 Japanese adults aged 18-64 y. The JGNKQ was further modified based on the pilot study and the final version consisted of 5 sections and 147 items. The JGNKQ was administered to female undergraduate Japanese students in their senior year twice in 2015 to assess construct validity and test-retest reliability. Ninety-six students majoring in nutrition and 44 students in other majors who studied at the same university completed the first questionnaire. Seventy-five students completed the questionnaire twice. The responses from the first questionnaire and both questionnaires were used to assess construct validity and test-retest reliability, respectively. The students in nutrition major had significantly higher scores than the students in other majors on all sections of the questionnaire (p=0.000); therefore, the questionnaire had good construct validity. The test-retest reliability correlation coefficient value of overall and each section except "The use of dietary information to make dietary choices" were 0.75, 0.67, 0.67, 0.68 and 0.61, respectively. We suggest that the JGNKQ is an effective tool to assess the nutrition knowledge level of Japanese adults.
Huang, Wenhao; Chapman-Novakofski, Karen M
2017-01-01
Background The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. Objective The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps’ educational quality and technical functionality. Methods Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Results Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no significant change over time (P>.05) for all but skill development (P=.001). Construct reliability was good for items assessing age appropriateness of apps for children, teens, and a general audience. In addition, construct reliability was acceptable for assessing app appropriateness for various target audiences (Cronbach alpha >.70). For the 5 main factors, ICC (1,k) was >.80, with a P value of <.05. When 15 nutrition professionals evaluated one app, ICC (2,15) was .98, with a P value of <.001 for all 7 constructs when the modifiable items were specified for adults seeking weight loss support. Conclusions Our preliminary effort shows that AQEL is a valid, reliable instrument for evaluating nutrition apps’ qualities for clinical interventions by nutrition clinicians, educators, and researchers. Further efforts in validating AQEL in various contexts are needed. PMID:29079554
The Script Concordance Test: a new tool assessing clinical judgement in neurology.
Lubarsky, Stuart; Chalk, Colin; Kazitani, Driss; Gagnon, Robert; Charlin, Bernard
2009-05-01
Clinical judgment, the ability to make appropriate decisions in uncertain situations, is central to neurological practice, but objective measures of clinical judgment in neurology trainees are lacking. The Script Concordance Test (SCT), based on script theory from cognitive psychology, uses authentic clinical scenarios to compare a trainee's judgment skills with those of experts. The SCT has been validated in several medical disciplines, but has not been investigated in neurology. We developed an Internet-based neurology SCT (NSCT) comprising 24 clinical scenarios with three to four questions each. The scenarios were designed to reflect the uncertainty of real-life clinical encounters in adult neurology. The questions explored aspects of the scenario in which several responses might be acceptable; trainees were asked to judge which response they considered to be best. Forty-one PGY1-PGY5 neurology residents and eight medical students from three North American neurology programs (McGill, Calgary, and Mayo Clinic) completed the NSCT. The responses of trainees to each question were compared with the aggregate responses of an expert panel of 16 attending neurologists. The NSCT demonstrated good reliability (Cronbach alpha = 0.79). Neurology residents scored higher than medical students and lower than attending neurologists, supporting the test's construct validity. Furthermore, NSCT scores discriminated between senior (PGY3-5) and junior residents (PGY1-2). Our NSCT is a practical and reliable instrument, and our findings support its construct validity for assessing judgment in neurology trainees. The NSCT has potentially widespread applications as an evaluation tool, both in neurology training and for licensing examinations.
Liang, M H
2000-09-01
Although widely used and reported in research for the evaluation of groups, measures of health status and health-related quality of life have had little application in clinical practice for the assessment of individual patients. One of the principal barriers is the demonstration that these measures add clinically significant information to measures of function or symptoms alone. Here, we review the methods for evaluation of construct validity in longitudinal studies and make recommendations for nomenclature, reporting of study results, and future research agenda. Analytical review. The terms "sensitivity" and "responsiveness" have been used interchangeably, and there are few studies that evaluate the extent to which health status or health-related quality-of life measures capture clinically important changes ("responsiveness"). Current methods of evaluating responsiveness are not standardized or evaluated. Approaches for the assessment of a clinically significant or meaningful change are described; rather than normative information, however, standardized transition questions are proposed. They would be reported routinely and as separate axes of description to capture individual perceptions. Research in methods to assess the subject's evaluation of the importance and magnitude of a measured change are critical if health status and health-related quality-of-life measures are to have an impact on patient care.
Fernández-López, Juan Antonio; Fernández-Fidalgo, María; Martín-Payo, Rubén; Rödel, Andreas
2007-08-01
To evaluate the relationship between Health-Related Quality of Life (HRQL) and stress at work among Primary Care workers, as evidence of the construct validity of the Spanish version (PECVEC) of the profile of quality of life in the chronically ill (PLC) questionnaire. In addition, to check its other psychometric properties. Cross-sectional study. Eighteen primary care centres in Health Area IV, Asturias (Oviedo), Spain, sharing similar socio-demographic conditions. Two hundred and thirty-three primary care nurses and physicians. HRQL was evaluated by the 6 general dimensions of the Spanish version of the PLC. Stress at work was evaluated by the three scales of the Effort-Reward Imbalance (ERI) questionnaire. The construct validity of the PECVEC was assessed by testing the inverse associations of QoL dimensions and job stress ones, when the most important confuser variables were monitored. The non-response rate was low (<3%), and no floor effects and only small ceiling effects were observed. Internal consistency analysis and exploratory and confirmatory factor analysis demonstrated high reliability, factorial validity and convergent/divergent validity of the PECVEC. The PECVEC demonstrates adequate psychometric properties for evaluating HRQL in healthy subjects.
The Effects of Work-Related Values on Communication Between R and D Groups. Part 2
NASA Technical Reports Server (NTRS)
Douds, Charles Fowler
1970-01-01
The methods are presented by which the scales and indicators are combined to provide measures of the variables used in the similarity propositions. These variables are constructs employed to describe the relationships among various phenomena occurring in R and D organizations. The character of these constructs as revealed by the characteristics of the measures derived from response patterns is presented. The reliability and validity of the measures are considered, and test procedures are discussed.
2013-01-01
Background Diarrhea-predominant irritable bowel syndrome (IBS-d) significantly diminishes the health-related quality of life (HRQOL) of patients. Psychological and social impacts are common with many IBS-d patients reporting comorbid depression, anxiety, decreased intimacy, and lost working days. The Irritable Bowel Syndrome Quality of Life (IBS-QOL) questionnaire is a 34-item instrument developed and validated for measurement of HRQOL in non-subtyped IBS patients. The current paper assesses this previously-validated instrument employing data collected from 754 patients who participated in a randomized clinical trial of a novel treatment, eluxadoline, for IBS-d. Methods Psychometric methods common to HRQOL research were employed to evaluate the IBS-QOL. Many of the historical analyses of the IBS-QOL validations were used. Other techniques that extended the original methods were applied where more appropriate for the current dataset. In IBS-d patients, we analyzed the items and substructure of the IBS-QOL via item reduction, factor structure, internal consistency, reproducibility, construct validity, and ability to detect change. Results This study supports the IBS-QOL as a psychometrically valid measure. Factor analyses suggested that IBS-specific QOL as measured by the IBS-QOL is a unidimensional construct. Construct validity was further buttressed by significant correlations between IBS-QOL total scores and related measures of IBS-d severity including the historically-relevant Irritable Bowel Syndrome Adequate Relief (IBS-AR) item and the FDA’s Clinical Responder definition. The IBS-QOL also showed a significant ability to detect change as evidenced by analysis of treatment effects. A minority of the items, unrelated to the IBS-d, performed less well by the standards set by the original authors. Conclusions We established that the IBS-QOL total score is a psychometrically valid measure of HRQOL in IBS-d patients enrolled in this study. Our analyses suggest that the IBS-QOL items demonstrate very good construct validity and ability to detect changes due to treatment effects. Furthermore, our analyses suggest that the IBS-QOL items measure a univariate construct and we believe further modeling of the IBS-QOL from an item response theory (IRT) approach under both non-treatment and treatment conditions would greatly further our understanding as item-based methods could be used to develop a short form. PMID:24330412
Aletaha, Daniel; Martinez-Avila, Jose; Kvien, Tore K; Smolen, Josef S
2012-07-01
The simplified disease activity index (SDAI) and the clinical disease activity index (CDAI) are established instruments to measure disease activity in rheumatoid arthritis (RA). To date, no validated response definitions for the SDAI and CDAI are available. The authors aimed to define minor, moderate and major response criteria for the SDAI. The authors used data from two clinical trials on infliximab versus methotrexate in early (ASPIRE) or established (ATTRACT) RA, and identified the three SDAI cutpoints based on the best agreement (by κ statistics) with the American College of Rheumatology (ACR)20/50/70 responses. Cutpoints were then tested for different aspects of validity in the trial datasets and in a Norwegian disease modifying antirheumatic drug prescription dataset (NOR-DMARD). Based on agreement with the ACR response, the minor, moderate and major responses were identified as SDAI 50%, 70% and 85% improvement. These cutpoints had good face validity concerning the disease activity states achieved by the different response definitions. Construct validity was shown by a clear association of increasing SDAI response categories with increasing levels of functional improvement, achievement of better functional states and lower annual radiographic progression. Across SDAI 50/70/85, the sensitivities regarding a patient-perceived improvement decreased (73%/39%/22%) and the specificities increased (61%/89%/96%) in a meaningful way. Further, the cutpoints discriminated the different treatment arms in ASPIRE and ATTRACT. The same cutpoints were used for the CDAI, with similar results in the validation analyses. These new response criteria expand the usefulness of the SDAI and CDAI for their use as endpoints in clinical trials beyond the definition of disease activity categories.
Validation of new psychosocial factors questionnaires: a Colombian national study.
Villalobos, Gloria H; Vargas, Angélica M; Rondón, Martin A; Felknor, Sarah A
2013-01-01
The study of workers' health problems possibly associated with stressful conditions requires valid and reliable tools for monitoring risk factors. The present study validates two questionnaires to assess psychosocial risk factors for stress-related illnesses within a sample of Colombian workers. The validation process was based on a representative sample survey of 2,360 Colombian employees, aged 18-70 years. Worker response rate was 90%; 46% of the responders were women. Internal consistency was calculated, construct validity was tested with factor analysis and concurrent validity was tested with Spearman correlations. The questionnaires demonstrated adequate reliability (0.88-0.95). Factor analysis confirmed the dimensions proposed in the measurement model. Concurrent validity resulted in significant correlations with stress and health symptoms. "Work and Non-work Psychosocial Factors Questionnaires" were found to be valid and reliable for the assessment of workers' psychosocial factors, and they provide information for research and intervention. Copyright © 2012 Wiley Periodicals, Inc.
Zheng, Jia; Goodyear, Laurie J.
2016-01-01
The development of animal models with construct, face, and predictive validity to accurately model human depression has been a major challenge. One proposed rodent model is the 5 d repeated forced swim stress (5d-RFSS) paradigm, which progressively increases floating during individual swim sessions. The onset and persistence of this floating behavior has been anthropomorphically characterized as a measure of depression. This interpretation has been under debate because a progressive increase in floating over time may reflect an adaptive learned behavioral response promoting survival, and not depression (Molendijk and de Kloet, 2015). To assess construct and face validity, we applied 5d-RFSS to C57BL/6J and BALB/cJ mice, two mouse strains commonly used in neuropsychiatric research, and measured a combination of emotional, homeostatic, and psychomotor symptoms indicative of a depressive-like state. We also compared the efficacy of 5d-RFSS and chronic social defeat stress (CSDS), a validated depression model, to induce a depressive-like state in C57BL/6J mice. In both strains, 5d-RFSS progressively increased floating behavior that persisted for at least 4 weeks. 5d-RFSS did not alter sucrose preference, body weight, appetite, locomotor activity, anxiety-like behavior, or immobility behavior during a tail-suspension test compared with nonstressed controls. In contrast, CSDS altered several of these parameters, suggesting a depressive-like state. Finally, predictive validity was assessed using voluntary wheel running (VWR), a known antidepressant intervention. Four weeks of VWR after 5d-RFSS normalized floating behavior toward nonstressed levels. These observations suggest that 5d-RFSS has no construct or face validity but might have predictive validity to model human depression. PMID:28058270
Mul, Joram D; Zheng, Jia; Goodyear, Laurie J
2016-01-01
The development of animal models with construct, face, and predictive validity to accurately model human depression has been a major challenge. One proposed rodent model is the 5 d repeated forced swim stress (5d-RFSS) paradigm, which progressively increases floating during individual swim sessions. The onset and persistence of this floating behavior has been anthropomorphically characterized as a measure of depression. This interpretation has been under debate because a progressive increase in floating over time may reflect an adaptive learned behavioral response promoting survival, and not depression (Molendijk and de Kloet, 2015). To assess construct and face validity, we applied 5d-RFSS to C57BL/6J and BALB/cJ mice, two mouse strains commonly used in neuropsychiatric research, and measured a combination of emotional, homeostatic, and psychomotor symptoms indicative of a depressive-like state. We also compared the efficacy of 5d-RFSS and chronic social defeat stress (CSDS), a validated depression model, to induce a depressive-like state in C57BL/6J mice. In both strains, 5d-RFSS progressively increased floating behavior that persisted for at least 4 weeks. 5d-RFSS did not alter sucrose preference, body weight, appetite, locomotor activity, anxiety-like behavior, or immobility behavior during a tail-suspension test compared with nonstressed controls. In contrast, CSDS altered several of these parameters, suggesting a depressive-like state. Finally, predictive validity was assessed using voluntary wheel running (VWR), a known antidepressant intervention. Four weeks of VWR after 5d-RFSS normalized floating behavior toward nonstressed levels. These observations suggest that 5d-RFSS has no construct or face validity but might have predictive validity to model human depression.
Hu, Guo-Qing; Rao, Ke-Qin; Sun, Zhen-Qiu
2008-12-01
To develop a capacity questionnaire in public health emergency for Chinese local governments. Literature reviews, conceptual modelling, stake-holder analysis, focus group, interview, and Delphi technique were employed together to develop the questionnaire. Classical test theory and case study were used to assess the reliability and validity. (1) A 2-dimension conceptual model was built. A preparedness and response capacity questionnaire in public health emergency with 10 dimensions and 204 items, was developed. (2) Reliability and validity results. Internal consistency: except for dimension 3 and 8, the Cronbach's alpha coefficient of other dimensions was higher than 0.60. The alpha coefficients of dimension 3 and dimension 8 were 0.59 and 0.39 respectively; Content validity: the questionnaire was recognized by the investigatees; Construct validity: the Spearman correlation coefficients among the 10 dimensions fluctuated around 0.50, ranging from 0.26 to 0.75 (P<0.05); Discrimination validity: comparisons of 10 dimensions among 4 provinces did not show statistical significance using One-way analysis of variance (P>0.05). Criterion-related validity: case study showed significant difference among the 10 dimensions in Beijing between February 2003 (before SARS event) and November 2005 (after SARS event). The preparedness and response capacity questionnaire in public health emergency is a reliable and valid tool, which can be used in all provinces and municipalities in China.
Development and Validation of a Multimedia-based Assessment of Scientific Inquiry Abilities
NASA Astrophysics Data System (ADS)
Kuo, Che-Yu; Wu, Hsin-Kai; Jen, Tsung-Hau; Hsu, Ying-Shao
2015-09-01
The potential of computer-based assessments for capturing complex learning outcomes has been discussed; however, relatively little is understood about how to leverage such potential for summative and accountability purposes. The aim of this study is to develop and validate a multimedia-based assessment of scientific inquiry abilities (MASIA) to cover a more comprehensive construct of inquiry abilities and target secondary school students in different grades while this potential is leveraged. We implemented five steps derived from the construct modeling approach to design MASIA. During the implementation, multiple sources of evidence were collected in the steps of pilot testing and Rasch modeling to support the validity of MASIA. Particularly, through the participation of 1,066 8th and 11th graders, MASIA showed satisfactory psychometric properties to discriminate students with different levels of inquiry abilities in 101 items in 29 tasks when Rasch models were applied. Additionally, the Wright map indicated that MASIA offered accurate information about students' inquiry abilities because of the comparability of the distributions of student abilities and item difficulties. The analysis results also suggested that MASIA offered precise measures of inquiry abilities when the components (questioning, experimenting, analyzing, and explaining) were regarded as a coherent construct. Finally, the increased mean difficulty thresholds of item responses along with three performance levels across all sub-abilities supported the alignment between our scoring rubrics and our inquiry framework. Together with other sources of validity in the pilot testing, the results offered evidence to support the validity of MASIA.
Lo, Barbara Chuen Yee; Zhao, Yue; Kwok, Alice Wai Yee; Chan, Wai; Chan, Calais Kin Yuen
2017-07-01
The present study applied item response theory to examine the psychometric properties of the Asian Adolescent Depression Scale and to construct a short form among 1,084 teenagers recruited from secondary schools in Hong Kong. Findings suggested that some items of the full form reflected higher levels of severity and were more discriminating than others, and the Asian Adolescent Depression Scale was useful in measuring a broad range of depressive severity in community youths. Differential item functioning emerged in several items where females reported higher depressive severity than males. In the short form construction, preliminary validation suggested that, relative to the 20-item full form, our derived short form offered significantly greater diagnostic performance and stronger discriminatory ability in differentiating depressed and nondepressed groups, and simultaneously maintained adequate measurement precision with a reduced response burden in assessing depression in the Asian adolescents. Cultural variance in depressive symptomatology and clinical implications are discussed.
Construction and Validation of Scale of Parenting Style
ERIC Educational Resources Information Center
Abdul Gafor, K.; Kurukkan, Abidha
2014-01-01
This paper describes the development and standardization of a measure of perceived parenting style. The four styles namely authoritative, authoritarian, permissive and negligent proposed by Baumrind (1971) are scaled based on a quadrant of high and low levels of parental responsiveness and control suggested by Maccoby and Martin (1983). The items…
A Framework for Evaluation and Use of Automated Scoring
ERIC Educational Resources Information Center
Williamson, David M.; Xi, Xiaoming; Breyer, F. Jay
2012-01-01
A framework for evaluation and use of automated scoring of constructed-response tasks is provided that entails both evaluation of automated scoring as well as guidelines for implementation and maintenance in the context of constantly evolving technologies. Consideration of validity issues and challenges associated with automated scoring are…
Psychometric Properties of the Revised Mathematics Anxiety Rating Scale
ERIC Educational Resources Information Center
Baloglu, Mustafa; Zelhart, Paul F.
2007-01-01
An exploratory factor analysis and several confirmatory analyses were performed to evaluate the factorial structure of the Revised Mathematics Anxiety Rating Scale (RMARS) through the responses of 805 college students. On 559 students' scores, the instrument's construct validity was tested through a confirmatory factor analysis (CFA) and was found…
ERIC Educational Resources Information Center
Darom, Efraim; Rich, Yisrael
1983-01-01
To measure pupils' inconsistency in replying to a self-report questionnaire regarding attitudes toward school, 6,000 elementary and secondary school students participated in a nation-wide project to construct and validate the Israeli Quality of School Life Questionnaire (IQSL). (PN)
Alocomotino Control Algorithm for Robotic Linkage Systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dohner, Jeffrey L.
This dissertation describes the development of a control algorithm that transitions a robotic linkage system between stabilized states producing responsive locomotion. The developed algorithm is demonstrated using a simple robotic construction consisting of a few links with actuation and sensing at each joint. Numerical and experimental validation is presented.
Assessing Assertion: An Investigation of Construct Validity and Reliability.
ERIC Educational Resources Information Center
Kern, Jeffrey M.; MacDonald, Marian L.
The reliability and meaning of assertiveness tests were explored using 120 female undergraduates. Several self-report inventories (the College Self-Expression Scale, Conflict Resolution Inventory, and a global rating from one to seven) were administered, as were three anxiety measures (Timed Behavior Checklist, response latency, and response…
Preliminary validation of the Yale Food Addiction Scale.
Gearhardt, Ashley N; Corbin, William R; Brownell, Kelly D
2009-04-01
Previous research has found similarities between addiction to psychoactive substances and excessive food consumption. Further exploration is needed to evaluate the concept of "food addiction," as there is currently a lack of psychometrically validated measurement tools in this area. The current study represents a preliminary exploration of the Yale Food Addiction Scale (YFAS), designed to identify those exhibiting signs of addiction towards certain types of foods (e.g., high fat and high sugar). Survey data were collected from 353 respondents from a stratified random sample of young adults. In addition to the YFAS, the survey assessed eating pathology, alcohol consumption and other health behaviors. The YFAS exhibited adequate internal reliability, and showed good convergent validity with measures of similar constructs and good discriminant validity relative to related but dissimilar constructs. Additionally, the YFAS predicted binge-eating behavior above and beyond existing measures of eating pathology, demonstrating incremental validity. The YFAS is a sound tool for identifying eating patterns that are similar to behaviors seen in classic areas of addiction. Further evaluation of the scale is needed, especially due to a low response rate of 24.5% and a non-clinical sample, but confirmation of the reliability and validity of the scale has the potential to facilitate empirical research on the concept of "food addiction".
Development and Initial Validation of the Pain Resilience Scale.
Slepian, P Maxwell; Ankawi, Brett; Himawan, Lina K; France, Christopher R
2016-04-01
Over the past decade, the role of positive psychology in pain experience has gained increasing attention. One such positive factor, identified as resilience, has been defined as the ability to maintain positive emotional and physical functioning despite physical or psychological adversity. Although cross-situational measures of resilience have been shown to be related to pain, it was hypothesized that a pain-specific resilience measure would serve as a stronger predictor of acute pain experience. To test this hypothesis, we conducted a series of studies to develop and validate the Pain Resilience Scale. Study 1 described exploratory and confirmatory factor analyses that support a scale with 2 distinct factors, Cognitive/Affective Positivity and Behavioral Perseverance. Study 2 showed test-retest reliability and construct validity of this new scale, including moderate positive relationships with measures of positive psychological functioning and small to moderate negative relationships with vulnerability measures such as pain catastrophizing. Finally, consistent with our initial hypothesis, study 3 showed that the Pain Resilience Scale is more strongly related to ischemic pain responses than existing measures of general resilience. Together, these studies support the predictive utility of this new pain-specific measure of resilience in the context of acute experimental pain. The Pain Resilience Scale represents a novel measure of Cognitive/Affective Positivity and Behavioral Perseverance during exposure to noxious stimuli. Construct validity is supported by expected relationships with existing pain-coping measures, and predictive validity is shown by individual differences in response to acute experimental pain. Copyright © 2016 American Pain Society. Published by Elsevier Inc. All rights reserved.
Exploring the Validity of the Affect Balance Scale With a Sample of Family Caregivers
Perkinson, Margaret A.; Albert, Steven M.; Luborsky, Mark; Moss, Miriam; Glicksman, Allen
2014-01-01
Open-ended responses of caregiving daughters and daughters-in-law were generated by a modified random probe technique to investigate the construct validity of the two subscales of the Affect Balance Scale (ABS), i.e., the 5-item Positive Affect Scale (PAS) and the 5-item Negative Affect Scale (NAS). A set of criteria were developed to distinguish between responses that did and did not correspond to Bradburn’s assumptions concerning affect. While most responses met at least one of the criteria, very few met all. In exploring the nature of affect, we found that positive affect was based to a large extent on personal accomplishments and the recognition of others. The assessment of negative affect was a more interior, or self-focused process. For a significant subset of the sample, a negative response to a closed-ended PAS or NAS item implied disagreement or discontent with the wording or the implications of the item itself, rather than an absence of affect. Not all of the ABS items were equally valid measures of affect. PMID:8056955
Development of the Perioperative Nursing Data Set.
Kleinbeck, S V
1999-07-01
Nursing practice is a major component of health care. Yet, it remains undervalued and essentially invisible because little data exist to substantiate the influence of nurses on patient outcomes. The research-based Perioperative Nursing Data Set (PNDS), with an easily automated nomenclature capable of describing the specialty practice of perioperative nursing, was designed to fill this gap. Four domains (i.e., safety, physiologic response to surgery, patient and family behavioral response to surgery, health system) form the foundation of the PNDS. Each domain, with accompanying desired outcomes, nursing interventions, and nursing diagnoses, has reliability, content validity, and evidence of construct validity. The purpose of this article is to introduce the conceptual framework, taxonomy, and potential clinical applications of the PNDS.
Development and validation of a Malawian version of the primary care assessment tool.
Dullie, Luckson; Meland, Eivind; Hetlevik, Øystein; Mildestvedt, Thomas; Gjesdal, Sturla
2018-05-16
Malawi does not have validated tools for assessing primary care performance from patients' experience. The aim of this study was to develop a Malawian version of Primary Care Assessment Tool (PCAT-Mw) and to evaluate its reliability and validity in the assessment of the core primary care dimensions from adult patients' perspective in Malawi. A team of experts assessed the South African version of the primary care assessment tool (ZA-PCAT) for face and content validity. The adapted questionnaire underwent forward and backward translation and a pilot study. The tool was then used in an interviewer administered cross-sectional survey in Neno district, Malawi, to test validity and reliability. Exploratory factor analysis was performed on a random half of the sample to evaluate internal consistency, reliability and construct validity of items and scales. The identified constructs were then tested with confirmatory factor analysis. Likert scale assumption testing and descriptive statistics were done on the final factor structure. The PCAT-Mw was further tested for intra-rater and inter-rater reliability. From the responses of 631 patients, a 29-item PCAT-Mw was constructed comprising seven multi-item scales, representing five primary care dimensions (first contact, continuity, comprehensiveness, coordination and community orientation). All the seven scales achieved good internal consistency, item-total correlations and construct validity. Cronbach's alpha coefficient ranged from 0.66 to 0.91. A satisfactory goodness of fit model was achieved (GFI = 0.90, CFI = 0.91, RMSEA = 0.05, PCLOSE = 0.65). The full range of possible scores was observed for all scales. Scaling assumptions tests were achieved for all except the two comprehensiveness scales. Intra-class correlation coefficient (ICC) was 0.90 (n = 44, 95% CI 0.81-0.94, p < 0.001) for intra-rater reliability and 0.84 (n = 42, 95% CI 0.71-0.96, p < 0.001) for inter-rater reliability. Comprehensive metric analyses supported the reliability and validity of PCAT-Mw in assessing the core concepts of primary care from adult patients' experience. This tool could be used for health service research in primary care in Malawi.
[KON-2006--Neurotic Personality Questionnaire].
Aleksandrowicz, Jerzy W; Klasa, Katarzyna; Sobański, Jerzy A; Stolarska, Dorota
2007-01-01
Construction of a questionnaire describing personality traits connected to the occurrence and persistence of neurotic disorders. Responses of 794 patients (before treatment) and 520 persons from the control group on items of the constructed personality questionnaire and the symptom checklist "0". Analyses of subscales reliability and item-scale correlations, test-retest and split-half reliability. Factor analyses estimating internal reliability of the questionnaire. Cross-validation with the KO"0". symptom checklist Psychometric properties of KON-2006 questionnaire indicate that it is consistent and reliable enough. Validity analyses indicate a large probability that the X-KON coefficient informs on personality dysfunctions related to neurotic disorders. The Neurotic Personality Questionnaire KON-2006 may serve to estimate personality traits connected to the occurrence and persistence of neurotic disorders as well as changes resulting from psychotherapy.
Kasitanon, N; Wangkaew, S; Puntana, S; Sukitawut, W; Leong, K P; Louthrenoo, W
2013-03-01
The English version of the Systemic Lupus Erythematosus Quality of Life Questionnaire (SLEQOL) is a validated disease-specific quality of life instrument. The aim of this study was to evaluate the psychometric properties of the Thai version of the SLEQOL (SLEQOL-TH). Two independent translators translated the SLEQOL into Thai. The back translation of this version was performed by two other independent translators. The final version, SLEQOL-TH, was completed after resolving the discrepancies revealed by the back translation. One hundred and nine patients with SLE were enrolled to test the reliability, construct validity, floor and ceiling effects, and sensitivity to the changes of the SLEQOL-TH at six months. The differential item functioning (DIF) between the Thai and English versions was analyzed using the partial gamma. The internal consistency of the SLEQOL-TH was satisfactory with the overall Cronbach's alpha of 0.86. The test-retest reliability of the SLEQOL-TH was acceptable with the intra-class correlation coefficient of 0.86. Low correlations between the SLEQOL-TH and SLEDAI were observed. The total score of the SLEQOL-TH was moderately responsive to changes in quality of life, with a standardized response mean of 0.50. When comparing the SLEQOL-TH from Thai SLE patients with the original SLEQOL version obtained from Singapore SLE patients, 11 out of 40 items showed a moderate to large DIF. The SLEQOL-TH has acceptable psychometric properties and shows construct validity. In comparison with the English version of SLEQOL, there are some items that showed DIF. The applicability of the SLEQOL-TH in real-life clinical practice and clinical trials needs to be determined.
AlHeresh, Rawan; LaValley, Michael P; Coster, Wendy; Keysor, Julie J
2017-06-01
To evaluate construct validity and scoring methods of the world health organization-health and work performance questionnaire (HPQ) for people with arthritis. Construct validity was examined through hypothesis testing using the recommended guidelines of the consensus-based standards for the selection of health measurement instruments (COSMIN). The HPQ using the absolute scoring method showed moderate construct validity as four of the seven hypotheses were met. The HPQ using the relative scoring method had weak construct validity as only one of the seven hypotheses were met. The absolute scoring method for the HPQ is superior in construct validity to the relative scoring method in assessing work performance among people with arthritis and related rheumatic conditions; however, more research is needed to further explore other psychometric properties of the HPQ.
A Short Measure of the Revised Reinforcement Sensitivity Theory - RSQ17.
Čolović, Petar; Smederevac, Snežana; Oljača, Milan; Nikolašević, Željka; Mitrović, Dušanka
2018-04-03
The need for a research and practical tool, such as a short, reliable, and valid personality assessment test, suggests researchers to create shortened versions of original instruments. Reinforcement sensitivity questionnaire (RSQ) was created in line with some basic premises of revised Reinforcement sensitivity theory, which proposes three motivational and emotional systems: Behavioral inhibition system (BIS), responsible for scanning environment for potential threats, Behavioral activation system (BAS), responsible for aproaching behavior, and the Fight/Flight/Freeze system (FFFS), responsible for behavior in the present threat. RSQ comprises five scales: BIS, BAS, Fight, Flight, and Freeze. The aim of this study was to develop a short version of RSQ, which would be beneficial to both research and practical purposes. Item response theory analyses were used for item selection. The study comprised two samples of participants, whereby Sample 1 (N = 837, 34.6% male, aged 18 - 82, M = 31.63, SD = 13.54) served as the derivation sample, while Sample 2 (818 participants, 43.6% male, 18-75 years, M = 29.65, SD = 12.52) served as validation sample. Factorial validity of the short RSQ was examined on both Sample 1 and Sample 2. Convergent and divergent validity of the short RSQ was examined using RST-PQ, Jackson-5, BIS/BAS scales, and Big Five Inventory. The results point to satisfactory internal consistency, factorial validity, and construct validity of the short RSQ, suggesting that it is an adequate measure for research settings or other contexts which require the use of short personality questionnaires.
Validation of Medical Tourism Service Quality Questionnaire (MTSQQ) for Iranian Hospitals.
Qolipour, Mohammad; Torabipour, Amin; Khiavi, Farzad Faraji; Malehi, Amal Saki
2017-03-01
Assessing service quality is one of the basic requirements to develop the medical tourism industry. There is no valid and reliable tool to measure service quality of medical tourism. This study aimed to determine the reliability and validity of a Persian version of medical tourism service quality questionnaire for Iranian hospitals. To validate the medical tourism service quality questionnaire (MTSQQ), a cross-sectional study was conducted on 250 Iraqi patients referred to hospitals in Ahvaz (Iran) from 2015. To design a questionnaire and determine its content validity, the Delphi Technique (3 rounds) with the participation of 20 medical tourism experts was used. Construct validity of the questionnaire was assessed through exploratory and confirmatory factor analysis. Reliability was assessed using Cronbach's alpha coefficient. Data were analyzed by Excel 2007, SPSS version18, and Lisrel l8.0 software. The content validity of the questionnaire with CVI=0.775 was confirmed. According to exploratory factor analysis, the MTSQQ included 31 items and 8 dimensions (tangibility, reliability, responsiveness, assurance, empathy, exchange and travel facilities, technical and infrastructure facilities and safety and security). Construct validity of the questionnaire was confirmed, based on the goodness of fit quantities of model (RMSEA=0.032, CFI= 0.98, GFI=0.88). Cronbach's alpha coefficient was 0.837 and 0.919 for expectation and perception questionnaire. The results of the study showed that the medical tourism SERVQUAL questionnaire with 31 items and 8 dimensions was a valid and reliable tool to measure service quality of medical tourism in Iranian hospitals.
Development and validation of a measure of workplace climate for healthy weight maintenance.
Sliter, Katherine A
2013-07-01
Due to the obesity epidemic, an increasing amount of research is being conducted to better understand the antecedents and consequences of excess employee weight. One construct often of interest to researchers in this area is organizational climate. Unfortunately, a viable measure of climate, as related to employee weight, does not exist. The purpose of this study was to remedy this by developing and validating a concise, psychometrically sound measure of climate for healthy weight. An item pool was developed based on surveys of full-time employees, and a sorting task was used to eliminate ambiguous items. Items were pilot tested by a sample of 338 full-time employees, and the item pool was reduced through item response theory (IRT) and reliability analyses. Finally, the retained 14 items, comprising 3 subscales, were completed by a sample of 360 full-time employees, representing 26 different organizations from across the United States. Multilevel modeling indicated that sufficient variance was explained by group membership to support aggregation, and confirmatory factor analysis (CFA) supported the hypothesized model of 3 subscale factors and an overall climate factor. Nine hypotheses specific to construct validation were tested. Scores on the new scale correlated significantly with individual-level reports of psychological constructs (e.g., health motivation, general leadership support for health) and physiological phenomena (e.g., body mass index [BMI], physical health problems) to which they should theoretically relate, supporting construct validity. Implications for the use of this scale in both applied and research settings are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved.
Ebbeling, Cara B; Wadden, Thomas A; Ludwig, David S
2011-01-01
Background: The circumstances under which the glycemic index (GI) and glycemic load (GL) are derived do not reflect real-world eating behavior. Thus, the ecologic validity of these constructs is incompletely known. Objective: This study examined the relation of dietary intake to glycemic response when foods are consumed under free-living conditions. Design: Participants were 26 overweight or obese adults with type 2 diabetes who participated in a randomized trial of lifestyle modification. The current study includes baseline data, before initiation of the intervention. Participants wore a continuous glucose monitor and simultaneously kept a food diary for 3 d. The dietary variables included GI, GL, and intakes of energy, fat, protein, carbohydrate, sugars, and fiber. The glycemic response variables included AUC, mean and SD of continuous glucose monitoring (CGM) values, percentage of CGM values in euglycemic and hyperglycemic ranges, and mean amplitude of glycemic excursions. Relations between daily dietary intake and glycemic outcomes were examined. Results: Data were available from 41 d of monitoring. Partial correlations, controlled for energy intake, indicated that GI or GL was significantly associated with each glycemic response outcome. In multivariate analyses, dietary GI accounted for 10% to 18% of the variance in each glycemic variable, independent of energy and carbohydrate intakes (P < 0.01). Conclusions: The data support the ecologic validity of the GI and GL constructs in free-living obese adults with type 2 diabetes. GI was the strongest and most consistent independent predictor of glycemic stability and variability. PMID:22071699
Won, Jongsung; Cheng, Jack C P; Lee, Ghang
2016-03-01
Waste generated in construction and demolition processes comprised around 50% of the solid waste in South Korea in 2013. Many cases show that design validation based on building information modeling (BIM) is an effective means to reduce the amount of construction waste since construction waste is mainly generated due to improper design and unexpected changes in the design and construction phases. However, the amount of construction waste that could be avoided by adopting BIM-based design validation has been unknown. This paper aims to estimate the amount of construction waste prevented by a BIM-based design validation process based on the amount of construction waste that might be generated due to design errors. Two project cases in South Korea were studied in this paper, with 381 and 136 design errors detected, respectively during the BIM-based design validation. Each design error was categorized according to its cause and the likelihood of detection before construction. The case studies show that BIM-based design validation could prevent 4.3-15.2% of construction waste that might have been generated without using BIM. Copyright © 2015 Elsevier Ltd. All rights reserved.
dos Anjos, Daniela Brianne Martins; Rodrigues, Roberta Cunha Matheus; Padilha, Kátia Melissa; Pedrosa, Rafaela Batista dos Santos; Gallani, Maria Cecília Bueno Jayme
2016-01-01
ABSTRACT Objective: evaluate the practicality, acceptability and the floor and ceiling effects, estimate the reliability and verify the convergent construct's validity with the instrument called the Heart Valve Disease Impact on daily life (IDCV) of the valve disease in patients with mitral and or aortic heart valve disease. Method: data was obtained from 86 heart valve disease patients through 3 phases: a face to face interview for a socio-demographic and clinic characterization and then other two done through phone calls of the interviewed patients for application of the instrument (test and repeat test). Results: as for the practicality and acceptability, the instrument was applied with an average time of 9,9 minutes and with 110% of responses, respectively. Ceiling and floor effects observed for all domains, especially floor effect. Reliability was tested using the test - repeating pattern to give evidence of temporal stability of the measurement. Significant negative correlations with moderate to strong magnitude were found between the score of the generic question about the impact of the disease and the scores of IDCV, which points to the validity of the instrument convergent construct. Conclusion: the instrument to measure the impact of valve heart disease on the patient's daily life showed evidence of reliability and validity when applied to patients with heart valve disease. PMID:27992024
Chambers, David W
2011-01-01
One of the most extensively studied constructs in dental education is the four-component model of moral behavior proposed by James Rest and the set of instruments for measuring it developed by Rest, Muriel Bebeau, and others. Although significant associations have been identified between the four components Rest proposed (called here Moral Sensitivity, Moral Reasoning, Moral Integrity, and Moral Courage) and dental ethics courses and practitioners with disciplined licenses, there is no single instrument that measures all four components, and existing single component instruments require professional scoring. This article describes the development and validation of a short, self-scoring instrument, the Moral Skills Inventory, that measures all four components. Evidence of face validity, test/retest reliability, and concurrent convergent and divergent predictive validity are demonstrated in three populations: dental students, clinical dental faculty members, and regents and officers of the American College of Dentists. Significant issues remain in developing the Rest four-component model for use in dental education and practice. Specifically, further construct validation research is needed to understand the nature of the components. In particular, it remains undetermined whether moral constructs are characteristics of individuals that drive behavior in specific situations or whether particular patterns of moral behavior learned and used in response to individual circumstances are summarized by researchers and then imputed to practitioners.
Gavett, Brandon E; Horwitz, Julie E
2012-03-01
The serial position effect shows that two interrelated cognitive processes underlie immediate recall of a supraspan word list. The current study used item response theory (IRT) methods to determine whether the serial position effect poses a threat to the construct validity of immediate list recall as a measure of verbal episodic memory. Archival data were obtained from a national sample of 4,212 volunteers aged 28-84 in the Midlife Development in the United States study. Telephone assessment yielded item-level data for a single immediate recall trial of the Rey Auditory Verbal Learning Test (RAVLT). Two parameter logistic IRT procedures were used to estimate item parameters and the Q(1) statistic was used to evaluate item fit. A two-dimensional model better fit the data than a unidimensional model, supporting the notion that list recall is influenced by two underlying cognitive processes. IRT analyses revealed that 4 of the 15 RAVLT items (1, 12, 14, and 15) were misfit (p < .05). Item characteristic curves for items 14 and 15 decreased monotonically, implying an inverse relationship between the ability level and the probability of recall. Elimination of the four misfit items provided better fit to the data and met necessary IRT assumptions. Performance on a supraspan list learning test is influenced by multiple cognitive abilities; failure to account for the serial position of words decreases the construct validity of the test as a measure of episodic memory and may provide misleading results. IRT methods can ameliorate these problems and improve construct validity.
The Clitoral Photoplethysmograph: A Pilot Study Examining Discriminant and Convergent Validity.
Suschinsky, Kelly D; Shelley, Amanda J; Gerritsen, Jeroen; Tuiten, Adriaan; Chivers, Meredith L
2015-12-01
The clitoral photoplethysmograph (CPP) is a relatively new device used to measure changes in clitoral blood volume (CBV); however, its construct validity has not yet been evaluated. To evaluate the discriminant and convergent validity of the CPP. For discriminant validity, CBV responses should differ between sexual and nonsexual emotional films if the CPP accurately assesses clitoral vasocongestion associated with sexual arousal; for convergent validity, CBV responses should significantly correlate with subjective reports of sexual arousal. Twenty women (M age = 21.2 years, SD = 3.4) watched neutral, anxiety-inducing, exhilarating, and sexual (female-male sex) audiovisual stimuli while their genital responses were measured simultaneously using vaginal and clitoral photoplethysmographs and CPPs. Most of these participants continuously reported sexual arousal throughout each stimulus (n = 16), and all reported their sexual and nonsexual affect before and after each stimulus; subjective responses were recorded via button presses using a keypad. Vaginal pulse amplitude (VPA), CBV, and self-reported sexual arousal and nonsexual affect were used as main outcome measures. CBV demonstrated both discriminant and convergent validity. CBV responses were similar to VPA responses and self-reported sexual arousal; all responses differed significantly as a function of stimulus content, with the sexual stimulus eliciting greater relative changes than nonsexual stimuli. CBV, but not VPA, was significantly (negatively) correlated with continuous self-reported sexual arousal during the shorter sexual stimulus. CBV was significantly negatively correlated with VPA for the shorter sexual stimulus. CBV may be a valid measure of women's genital sexual arousal that provides complementary information to VPA and correlates with self-reported sexual arousal. Given our relatively small sample size, and that this is among the first research to use the CPP, the current findings must be replicated. More research using the CPP and other devices is required for a more comprehensive description of women's physiological sexual arousal. © 2015 International Society for Sexual Medicine.
Validation of GC and HPLC systems for residue studies
DOE Office of Scientific and Technical Information (OSTI.GOV)
Williams, M.
1995-12-01
For residue studies, GC and HPLC system performance must be validated prior to and during use. One excellent measure of system performance is the standard curve and associated chromatograms used to construct that curve. The standard curve is a model of system response to an analyte over a specific time period, and is prima facia evidence of system performance beginning at the auto sampler and proceeding through the injector, column, detector, electronics, data-capture device, and printer/plotter. This tool measures the performance of the entire chromatographic system; its power negates most of the benefits associated with costly and time-consuming validation ofmore » individual system components. Other measures of instrument and method validation will be discussed, including quality control charts and experimental designs for method validation.« less
Johnson, Philip L.; Shekhar, Anantha
2013-01-01
Panic disorder (PD) is a severe anxiety disorder characterized by susceptibility to induction of panic attacks by subthreshold interoceptive stimuli such as sodium lactate infusions or hypercapnia induction. Here we review a model of panic vulnerability in rats involving chronic inhibition of GABAergic tone in the dorsomedial/ perifornical hypothalamic (DMH/PeF) region that produces enhanced anxiety and freezing responses in fearful situations, as well as a vulnerability to displaying acute panic-like increases in cardioexcitation, respiration activity and “flight” associated behavior following subthreshold interoceptive stimuli that do not elicit panic responses in control rats. This model of panic vulnerability was developed over 15 years ago and has provided an excellent preclinical model with robust face, predictive and construct validity. The model recapitulates many of the phenotypics features of panic attacks associated with human panic disorder (face validity) including greater sensitivity to panicogenic stimuli demonstrated by sudden onset of anxiety and autonomic activation following an administration of a sub-threshold (i.e., do not usually induce panic in healthy subjects) stimulus such as sodium lactate, CO2, or yohimbine. The construct validity is supported by several key findings; DMH/PeF neurons regulate behavioral and autonomic components of a normal adaptive panic response, as well as being implicated in eliciting panic-like responses in humans. Additionally, Patients with PD have deficits in central GABA activity and pharmacological restoration of central GABA activity prevents panic attacks, consistent with this model. The model’s predictive validity is demonstrated by not only showing panic responses to several panic-inducing agents that elicit panic in patients with PD, but also by the positive therapeutic responses to clinically used agents such as alprazolam and antidepressants that attenuate panic attacks in patients. More importantly, this model has been utilized to discover novel drugs such as group II metabotropic glutamate agonists and a new class of translocator protein enhancers of GABA, both of which subsequently showed anti-panic properties in clinical trials. All of these data suggest that this preparation provides a strong preclinical model of some forms of human panic disorders. PMID:22484112
Hegedish, Omer; Kivilis, Naama; Hoofien, Dan
2015-01-01
The Temporal Memory Sequence Test (TMST) is a new measure of negative response bias (NRB) that was developed to enrich the forced-choice paradigm. The TMST does not resemble the common structure of forced-choice tests and is presented as a temporal recall memory test. The validation sample consisted of 81 participants: 21 healthy control participants, 20 coached simulators, and 40 patients with acquired brain injury (ABI). The TMST had high reliability and significantly high positive correlations with the Test of Memory Malingering and Word Memory Test effort scales. Moreover, the TMST effort scales exhibited high negative correlations with the Glasgow Coma Scale, thus validating the previously reported association between probable malingering and mild traumatic brain injury. A suggested cutoff score yielded acceptable classification rates in the ABI group as well as in the simulator and control groups. The TMST appears to be a promising measure of NRB detection, with respectable rates of reliability and construct and criterion validity.
Coluci, Marina Zambon Orpinelli; Alexandre, Neusa Maria Costa
2014-11-01
The objectives of this study were to develop a questionnaire that evaluates the perception of nursing workers to job factors that may contribute to musculoskeletal symptoms, and to evaluate its psychometric properties. Internationally recommended methodology was followed: construction of domains, items and the instrument as a whole, content validity, and pre-test. Psychometric properties were evaluated among 370 nursing workers. Construct validity was analyzed by the factorial analysis, known-groups technique, and convergent validity. Reliability was assessed through internal consistency and stability. Results indicated satisfactory fit indices during confirmatory factor analysis, significant difference (p < 0.01) between the responses of nursing and office workers, and moderate correlations between the new questionnaire and Numeric Pain Scale, SF-36 and WRFQ. Cronbach's alpha was close to 0.90 and ICC values ranged from 0.64 to 0.76. Therefore, results indicated that the new questionnaire had good psychometric properties for use in studies involving nursing workers. Copyright © 2014 Elsevier Ltd and The Ergonomics Society. All rights reserved.
Instruments Measuring Integrated Care: A Systematic Review of Measurement Properties.
Bautista, Mary Ann C; Nurjono, Milawaty; Lim, Yee Wei; Dessers, Ezra; Vrijhoef, Hubertus Jm
2016-12-01
Policy Points: Investigations on systematic methodologies for measuring integrated care should coincide with the growing interest in this field of research. A systematic review of instruments provides insights into integrated care measurement, including setting the research agenda for validating available instruments and informing the decision to develop new ones. This study is the first systematic review of instruments measuring integrated care with an evidence synthesis of the measurement properties. We found 209 index instruments measuring different constructs related to integrated care; the strength of evidence on the adequacy of the majority of their measurement properties remained largely unassessed. Integrated care is an important strategy for increasing health system performance. Despite its growing significance, detailed evidence on the measurement properties of integrated care instruments remains vague and limited. Our systematic review aims to provide evidence on the state of the art in measuring integrated care. Our comprehensive systematic review framework builds on the Rainbow Model for Integrated Care (RMIC). We searched MEDLINE/PubMed for published articles on the measurement properties of instruments measuring integrated care and identified eligible articles using a standard set of selection criteria. We assessed the methodological quality of every validation study reported using the COSMIN checklist and extracted data on study and instrument characteristics. We also evaluated the measurement properties of each examined instrument per validation study and provided a best evidence synthesis on the adequacy of measurement properties of the index instruments. From the 300 eligible articles, we assessed the methodological quality of 379 validation studies from which we identified 209 index instruments measuring integrated care constructs. The majority of studies reported on instruments measuring constructs related to care integration (33%) and patient-centered care (49%); fewer studies measured care continuity/comprehensive care (15%) and care coordination/case management (3%). We mapped 84% of the measured constructs to the clinical integration domain of the RMIC, with fewer constructs related to the domains of professional (3.7%), organizational (3.4%), and functional (0.5%) integration. Only 8% of the instruments were mapped to a combination of domains; none were mapped exclusively to the system or normative integration domains. The majority of instruments were administered to either patients (60%) or health care providers (20%). Of the measurement properties, responsiveness (4%), measurement error (7%), and criterion (12%) and cross-cultural validity (14%) were less commonly reported. We found <50% of the validation studies to be of good or excellent quality for any of the measurement properties. Only a minority of index instruments showed strong evidence of positive findings for internal consistency (15%), content validity (19%), and structural validity (7%); with moderate evidence of positive findings for internal consistency (14%) and construct validity (14%). Our results suggest that the quality of measurement properties of instruments measuring integrated care is in need of improvement with the less-studied constructs and domains to become part of newly developed instruments. © 2016 Milbank Memorial Fund.
Instruments Measuring Integrated Care: A Systematic Review of Measurement Properties
BAUTISTA, MARY ANN C.; NURJONO, MILAWATY; DESSERS, EZRA; VRIJHOEF, HUBERTUS JM
2016-01-01
Policy Points: Investigations on systematic methodologies for measuring integrated care should coincide with the growing interest in this field of research.A systematic review of instruments provides insights into integrated care measurement, including setting the research agenda for validating available instruments and informing the decision to develop new ones.This study is the first systematic review of instruments measuring integrated care with an evidence synthesis of the measurement properties.We found 209 index instruments measuring different constructs related to integrated care; the strength of evidence on the adequacy of the majority of their measurement properties remained largely unassessed. Context Integrated care is an important strategy for increasing health system performance. Despite its growing significance, detailed evidence on the measurement properties of integrated care instruments remains vague and limited. Our systematic review aims to provide evidence on the state of the art in measuring integrated care. Methods Our comprehensive systematic review framework builds on the Rainbow Model for Integrated Care (RMIC). We searched MEDLINE/PubMed for published articles on the measurement properties of instruments measuring integrated care and identified eligible articles using a standard set of selection criteria. We assessed the methodological quality of every validation study reported using the COSMIN checklist and extracted data on study and instrument characteristics. We also evaluated the measurement properties of each examined instrument per validation study and provided a best evidence synthesis on the adequacy of measurement properties of the index instruments. Findings From the 300 eligible articles, we assessed the methodological quality of 379 validation studies from which we identified 209 index instruments measuring integrated care constructs. The majority of studies reported on instruments measuring constructs related to care integration (33%) and patient‐centered care (49%); fewer studies measured care continuity/comprehensive care (15%) and care coordination/case management (3%). We mapped 84% of the measured constructs to the clinical integration domain of the RMIC, with fewer constructs related to the domains of professional (3.7%), organizational (3.4%), and functional (0.5%) integration. Only 8% of the instruments were mapped to a combination of domains; none were mapped exclusively to the system or normative integration domains. The majority of instruments were administered to either patients (60%) or health care providers (20%). Of the measurement properties, responsiveness (4%), measurement error (7%), and criterion (12%) and cross‐cultural validity (14%) were less commonly reported. We found <50% of the validation studies to be of good or excellent quality for any of the measurement properties. Only a minority of index instruments showed strong evidence of positive findings for internal consistency (15%), content validity (19%), and structural validity (7%); with moderate evidence of positive findings for internal consistency (14%) and construct validity (14%). Conclusions Our results suggest that the quality of measurement properties of instruments measuring integrated care is in need of improvement with the less‐studied constructs and domains to become part of newly developed instruments. PMID:27995711
Asilian-Mahabadi, Hassan; Khosravi, Yahya; Hassanzadeh-Rangi, Narmin; Hajizadeh, Ebrahim; Behzadan, Amir H
2018-02-05
Occupational safety in general, and construction safety in particular, is a complex phenomenon. This study was designed to develop a new valid measure to evaluate factors affecting unsafe behavior in the construction industry. A new questionnaire was generated from qualitative research according to the principles of grounded theory. Key measurement properties (face validity, content validity, construct validity, reliability and discriminative validity) were examined using qualitative and quantitative approaches. The receiver operating characteristic curve was used to estimate the discriminating power and the optimal cutoff score. Construct validity revealed an interpretable 12-factor structure which explained 61.87% of variance. Good internal consistency (Cronbach's α = 0.94) and stability (intra-class correlation coefficient = 0.93) were found for the new instrument. The area under the curve, sensitivity and specificity were 0.80, 0.80 and 0.75, respectively. The new instrument also discriminated safety performance among the construction sites with different workers' accident histories (F = 6.40, p < 0.05). The new instrument appears to be a valid, reliable and sensitive instrument that will contribute to investigating the root causes of workers' unsafe behaviors, thus promoting safety performance in the construction industry.
de Moura, Ionara Holanda; da Silva, Antônia Fabiana Rodrigues; Rocha, Aparecida do Espírito Santo de Holanda; Lima, Luisa Helena de Oliveira; Moreira, Thereza Maria Magalhães; da Silva, Ana Roberta Vilarouca
2017-01-01
ABSTRACT Objective: To develop and validate an educational technology focused on prevention of metabolic syndrome among adolescents. Methods: This was methodological research. Using an integrative review, the available publications on the subject were analyzed. Then, this knowledge was used to describe the theoretical content and, with the help of a graphic designer, the art and layout of the pages were developed. In the third phase, the booklet was evaluated and validated by 21 specialists and 39 adolescents. Data collection included three different questionnaires, according to the focus of evaluation of each group of participants, analyzed for reliability (Cronbach’s Alpha) and agreement by Infraclass Correlation Coefficient. Results: The mean score attributed by technical content experts was 91.7%, and the content validity index, measured by experts responses, was 0.98, showing high reliability and agreement. In addition, the level of agreement of the positive responses given by adolescents was 88.4%. Conclusion: the educational booklet has proved to be a valid and reliable tool to be used for promoting adolescent health. PMID:29020125
Development of the multiple sclerosis (MS) early mobility impairment questionnaire (EMIQ).
Ziemssen, Tjalf; Phillips, Glenn; Shah, Ruchit; Mathias, Adam; Foley, Catherine; Coon, Cheryl; Sen, Rohini; Lee, Andrew; Agarwal, Sonalee
2016-10-01
The Early Mobility Impairment Questionnaire (EMIQ) was developed to facilitate early identification of mobility impairments in multiple sclerosis (MS) patients. We describe the initial development of the EMIQ with a focus on the psychometric evaluation of the questionnaire using classical and item response theory methods. The initial 20-item EMIQ was constructed by clinical specialists and qualitatively tested among people with MS and physicians via cognitive interviews. Data from an observational study was used to make additional updates to the instrument based on exploratory factor analysis (EFA) and item response theory (IRT) analysis, and psychometric analyses were performed to evaluate the reliability and validity of the final instrument's scores and screening properties (i.e., sensitivity and specificity). Based on qualitative interview analyses, a revised 15-item EMIQ was included in the observational study. EFA, IRT and item-to-item correlation analyses revealed redundant items which were removed leading to the final nine-item EMIQ. The nine-item EMIQ performed well with respect to: test-retest reliability (ICC = 0.858); internal consistency (α = 0.893); convergent validity; and known-groups methods for construct validity. A cut-point of 41 on the 0-to-100 scale resulted in sufficient sensitivity and specificity statistics for viably identifying patients with mobility impairment. The EMIQ is a content valid and psychometrically sound instrument for capturing MS patients' experience with mobility impairments in a clinical practice setting. Additional research is suggested to further confirm the EMIQ's screening properties over time.
Validation of Multilevel Constructs: Validation Methods and Empirical Findings for the EDI
ERIC Educational Resources Information Center
Forer, Barry; Zumbo, Bruno D.
2011-01-01
The purposes of this paper are to highlight the foundations of multilevel construct validation, describe two methodological approaches and associated analytic techniques, and then apply these approaches and techniques to the multilevel construct validation of a widely-used school readiness measure called the Early Development Instrument (EDI;…
Amjadi, Sogol S; Maranian, Paul M; Paulus, Harold E; Kaplan, Robert M; Ranganath, Veena K; Furst, Daniel E; Khanna, Puja P; Khanna, Dinesh
2009-06-01
New methodologies allow the scores for the Health Assessment Questionnaire-Disability Index (HAQ-DI) to be translated into preferences/utility scores. We evaluated the construct validity of the HAQ-DI-derived Short Form-6D (SF-6D) score and assessed its responsiveness to change over 6- and 12-month followup periods in patients with early aggressive rheumatoid arthritis (RA). Patients (n=277) participating in an RA observational study completed self-reported measures of symptoms and the HAQ-DI at baseline and at 6 and 12 months. Total Sharp scores, C-reactive protein, and erythrocyte sedimentation rate were assessed along with clinical data. Construct validity was assessed by examining the association between SF-6D score and patient-reported and clinical measures using Spearman correlation coefficients. The responsiveness of SF-6D to change was assessed using patient and physician assessments of the disease as clinical anchors. The magnitude of responsiveness was calculated using SF-6D effect size (ES). Mean SF-6D scores were 0.690, 0.720, and 0.723 at baseline and 6 and 12-month followup, respectively. Baseline patient-reported measures had moderate to high correlations with baseline SF-6D (r=0.43 to 0.52); whereas clinical measures had negligible to low correlations with SF-6D (r=0.001 to 0.32). ES was moderate for the groups that were deemed to have improved (ES 0.63-0.75) but negligible to small for those that did not (ES 0.13-0.46). Our data support the validity and responsiveness of the HAQ-DI derived SF-6D score in an early RA cohort. These results support the use of the HAQ-DI derived SF-6D in RA cohorts and clinical trials lacking preference-based measures.
Cui, Jin; Jia, Zhenyu; Zhi, Xin; Li, Xiaoqun; Zhai, Xiao; Cao, Liehu; Weng, Weizong; Zhang, Jun; Wang, Lin; Chen, Xiao; Su, Jiacan
2017-01-05
The Achilles tendon Total Rupture Score (ATRS), which is originally developed in 2007 in Swedish, is the only patient-reported outcome measure (PROM) for specific outcome assessment of an Achilles tendon rupture.Purpose of this study is to translate and cross-culturally adapt Achilles tendon Total Rupture Score (ATRS) into simplified Chinese, and primarily evaluate the responsiveness, reliability and validity. International recognized guideline which was designed by Beaton was followed to make the translation of ATRS from English into simplified Chinese version (CH-ATRS). A prospective cohort study was carried out for the cross-cultural adaptation. There were 112 participants included into the study. Psychometric properties including floor and ceiling effects, Cronbach's alpha, intraclass correlation coefficient, effect size, standard response mean, and construct validity were tested. The mean scores of CH-ATRS are 57.42 ± 13.70. No sign of floor or ceiling effect was found of CH-ATRS. High level of internal consistency was supported by the value of Cronbach's alpha (0.893). ICC (0.979, 95%CI: 0.984-0.993) was high to indicate the high test-retest reliability. Great responsive ness was proved with the high absolute value of ES and SRM (0.84 and 8.98, respectively). The total CH-ATRS score had very good correlation with physical function and body pain subscales of SF-36 (r = -0.758 and r = -0.694, respectively, p < 0.001), while poor correlation with vitality and role physical subscales of SF-36 (r = -0.033 and r = -0.025, respectively, p ≥ 0.05), which supported construct validity of CH-ATRS. This Chinese version of Achilles tendon Total Rupture Score (CH-ATRS) can be used as a reliable and valid instrument for Achilles tendon rupture assessing in Chinese-speaking population. Level of evidence II.
Flosadottir, Vala; Roos, Ewa M.; Ageberg, Eva
2017-01-01
Background: The Activity Rating Scale (ARS) for disorders of the knee evaluates the level of activity by the frequency of participation in 4 separate activities with high demands on knee function, with a score ranging from 0 (none) to 16 (pivoting activities 4 times/wk). Purpose: To translate and cross-culturally adapt the ARS into Swedish and to assess measurement properties of the Swedish version of the ARS. Study Design: Cohort study (diagnosis); Level of evidence, 2. Methods: The COSMIN guidelines were followed. Participants (N = 100 [55 women]; mean age, 27 years) who were undergoing rehabilitation for a knee injury completed the ARS twice for test-retest reliability. The Knee injury and Osteoarthritis Outcome Score (KOOS), Tegner Activity Scale (TAS), and modernized Saltin-Grimby Physical Activity Level Scale (SGPALS) were administered at baseline to validate the ARS. Construct validity and responsiveness of the ARS were evaluated by testing predefined hypotheses regarding correlations between the ARS, KOOS, TAS, and SGPALS. The Cronbach alpha, intraclass correlation coefficients, absolute reliability, standard error of measurement, smallest detectable change, and Spearman rank-order correlation coefficients were calculated. Results: The ARS showed good internal consistency (α ≈ 0.96), good test-retest reliability (intraclass correlation coefficient >0.9), and no systematic bias between measurements. The standard error of measurement was less than 2 points, and the smallest detectable change was less than 1 point at the group level and less than 5 points at the individual level. More than 75% of the hypotheses were confirmed, indicating good construct validity and good responsiveness of the ARS. Conclusion: The Swedish version of the ARS is valid, reliable, and responsive for evaluating the level of activity based on the frequency of participation in high-demand knee sports activities in young adults with a knee injury. PMID:28979920
Teachers and Technology: Development of an Extended Theory of Planned Behavior
ERIC Educational Resources Information Center
Teo, Timothy; Zhou, Mingming; Noyes, Jan
2016-01-01
This study tests the validity of an extended theory of planned behaviour (TPB) to explain teachers' intention to use technology for teaching and learning. Five hundred and ninety two participants completed a survey questionnaire measuring their responses to eight constructs which form an extended TPB. Using structural equation modelling, the…
Self-Rating and Respondent Anonymity
ERIC Educational Resources Information Center
Goh, Jonathan W. P.; Lee, Ong Kim; Salleh, Hairon
2010-01-01
Background: Most empirical investigations in survey research have been conducted using self-reported or self-evaluated item responses. Such measures are common because they are relatively easy to obtain and are often the only feasible way to assess constructs of interest. In order to improve on the validity of self-reports it has become a common…
Arbab, Dariusch; van Ochten, Johannes H M; Schnurr, Christoph; Bouillon, Bertil; König, Dietmar
2017-12-01
Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures. The intention of this study was to evaluate reliability, validity, responsiveness and minimally important change of the German version of the Hip dysfunction and osteoarthritis outcome score (HOOS). The German HOOS was investigated in 251 consecutive patients before and 6 months after total hip arthroplasty. All patients completed HOOS, Oxford-Hip Score, Short-Form (SF-36) and numeric scales for pain and disability. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German HOOS demonstrated excellent test-retest reliability with intraclass correlation coefficient values > 0.7. Cronbach´s alpha values demonstrated strong internal consistency. As hypothesized, HOOS subscales strongly correlated with corresponding OHS and SF-36 domains. All subscales showed excellent (effect size/standardized response means > 0.8) responsiveness between preoperative assessment and postoperative follow-up. The HOOS and all subdomains showed higher changes than the minimal detectable change which indicates true changes. The German version of the HOOS demonstrated good psychometric properties. It proved to be valid, reliable and responsive to the changes instrument for use in patients with hip osteoarthritis undergoing total hip replacement.
Scarponi, Letizia; de Felicio, Claudia Maria; Sforza, Chiarella; Pimenta Ferreira, Claudia Lucia; Ginocchio, Daniela; Pizzorni, Nicole; Barozzi, Stefania; Mozzanica, Francesco; Schindler, Antonio
2018-05-30
To evaluate the reliability, validity, and responsiveness of the Italian OMES (I-OMES). The study consisted of 3 phases: (1) internal consistency and reliability, (2) validity, and (3) responsiveness analysis. The recruited population included 27 patients with orofacial myofunctional disorders (OMD) and 174 healthy volunteers. Forty-seven subjects, 18 healthy and all recruited patients with OMD were assessed for inter-rater and test-retest reliability analysis. I-OMES and Nordic Orofacial Test - Screening (NOT-S) scores of the patients were correlated for concurrent validity analysis. I-OMES scores from 27 patients with OMD and 27 age- and gender-matched healthy subjects were compared to investigate construct validity. I-OMES scores before and after successful swallowing rehabilitation in patients were compared for responsiveness analysis. Adequate internal consistency (Cronbach α = 0.71) and strong inter-rater and test-retest reliability (intraclass coefficient correlation = 0.97 and 0.98, respectively) were found. I-OMES and NOT-S scores significantly and inversely correlated (r = -0.38). A statistical significance (p < 0.001) was found between the pathological group and the control group for the total I-OMES score. The mean I-OMES score improved from 90 (78-102) to 99 (89-103) after myofunctional rehabilitation (p < 0.001). The I-OMES is a reliable and valid tool to evaluate OMD. © 2018 S. Karger AG, Basel.
Alvarsson, Jonathan; Andersson, Claes; Spjuth, Ola; Larsson, Rolf; Wikberg, Jarl E S
2011-05-20
Compound profiling and drug screening generates large amounts of data and is generally based on microplate assays. Current information systems used for handling this are mainly commercial, closed source, expensive, and heavyweight and there is a need for a flexible lightweight open system for handling plate design, and validation and preparation of data. A Bioclipse plugin consisting of a client part and a relational database was constructed. A multiple-step plate layout point-and-click interface was implemented inside Bioclipse. The system contains a data validation step, where outliers can be removed, and finally a plate report with all relevant calculated data, including dose-response curves. Brunn is capable of handling the data from microplate assays. It can create dose-response curves and calculate IC50 values. Using a system of this sort facilitates work in the laboratory. Being able to reuse already constructed plates and plate layouts by starting out from an earlier step in the plate layout design process saves time and cuts down on error sources.
Vispoel, Walter P; Kim, Han Yi
2014-09-01
[Correction Notice: An Erratum for this article was reported in Vol 26(3) of Psychological Assessment (see record 2014-16017-001). The mean, standard deviation and alpha coefficient originally reported in Table 1 should be 74.317, 10.214 and .802, respectively. The validity coefficients in the last column of Table 4 are affected as well. Correcting this error did not change the substantive interpretations of the results, but did increase the mean, standard deviation, alpha coefficient, and validity coefficients reported for the Honesty subscale in the text and in Tables 1 and 4. The corrected versions of Tables 1 and Table 4 are shown in the erratum.] Item response theory (IRT) models were applied to dichotomous and polytomous scoring of the Self-Deceptive Enhancement and Impression Management subscales of the Balanced Inventory of Desirable Responding (Paulhus, 1991, 1999). Two dichotomous scoring methods reflecting exaggerated endorsement and exaggerated denial of socially desirable behaviors were examined. The 1- and 2-parameter logistic models (1PLM, 2PLM, respectively) were applied to dichotomous responses, and the partial credit model (PCM) and graded response model (GRM) were applied to polytomous responses. For both subscales, the 2PLM fit dichotomous responses better than did the 1PLM, and the GRM fit polytomous responses better than did the PCM. Polytomous GRM and raw scores for both subscales yielded higher test-retest and convergent validity coefficients than did PCM, 1PLM, 2PLM, and dichotomous raw scores. Information plots showed that the GRM provided consistently high measurement precision that was superior to that of all other IRT models over the full range of both construct continuums. Dichotomous scores reflecting exaggerated endorsement of socially desirable behaviors provided noticeably weak precision at low levels of the construct continuums, calling into question the use of such scores for detecting instances of "faking bad." Dichotomous models reflecting exaggerated denial of the same behaviors yielded much better precision at low levels of the constructs, but it was still less precision than that of the GRM. These results support polytomous over dichotomous scoring in general, alternative dichotomous scoring for detecting faking bad, and extension of GRM scoring to situations in which IRT offers additional practical advantages over classical test theory (adaptive testing, equating, linking, scaling, detecting differential item functioning, and so forth). PsycINFO Database Record (c) 2014 APA, all rights reserved.
Development and validation of the Myasthenia Gravis Impairment Index.
Barnett, Carolina; Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M
2016-08-30
We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test-retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated in the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test-retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79-0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79-0.94). The MGII correlated well with comparison measures, with higher correlations with the MG-activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. © 2016 American Academy of Neurology.
Development and validation of the Myasthenia Gravis Impairment Index
Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M.
2016-01-01
Objective: We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. Methods: The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test–retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated in the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. Results: The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test–retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79–0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79–0.94). The MGII correlated well with comparison measures, with higher correlations with the MG–activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. Conclusions: The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. PMID:27402891
2006-10-01
Investigation of Item-Pair Presentation and Construct Validity of the Navy Computer Adaptive Personality Scales ( NCAPS ) Christina M. Underhill, Ph.D...Construct Validity of the Navy Computer Adaptive Personality Scales ( NCAPS ) Christina M. Underhill, Ph.D. Reviewed and Approved by Jacqueline A. Mottern...and Construct Validity of the Navy Computer Adaptive Personality Scales ( NCAPS ) 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 0602236N and 0603236N 6
Hadžibajramović, Emina; Ahlborg, Gunnar; Håkansson, Carita; Lundgren-Nilsson, Åsa; Grimby-Ekman, Anna
2015-12-01
Psychosocial stress at work is one of the most important factors behind increasing sick-leave rates. In addition to work stressors, it is important to account for non-work-related stressors when assessing stress responses. In this study, a modified version of the Stress-Energy Questionnaire (SEQ), the SEQ during leisure time (SEQ-LT) was introduced for assessing the affective stress response during leisure time. The aim of this study was to investigate the internal construct validity of the SEQ-LT. A second aim was to define the cut-off points for the scales, which could indicate high and low levels of leisure-time stress and energy, respectively. Internal construct validity of the SEQ-LT was evaluated using a Rasch analysis. We examined the unidimensionality and other psychometric properties of the scale by the fit to the Rasch model. A criterion-based approach was used for classification into high and low stress/energy levels. The psychometric properties of the stress and energy scales of the SEQ-LT were satisfactory, having accommodated for local dependency. The cut-off point for low stress was proposed to be in the interval between 2.45 and 3.02 on the Rasch metric score; while for high stress, it was between 3.65 and 3.90. The suggested cut-off points for the low and high energy levels were values between 1.73-1.97 and 2.66-3.08, respectively. The stress and energy scale of the SEQ-LT satisfied the measurement criteria defined by the Rasch analysis and it provided a useful tool for non-work-related assessment of stress responses. We provide guidelines on how to interpret the scale values. © 2015 the Nordic Societies of Public Health.
Romaniuk, Madeline; Khawaja, Nigar G
2013-09-25
The 30-item USDI is a self-report measure that assesses depressive symptoms among university students. It consists of three correlated three factors: lethargy, cognitive-emotional and academic motivation. The current research used confirmatory factor analysis to asses construct validity and determine whether the original factor structure would be replicated in a different sample. Psychometric properties were also examined. Participants were 1148 students (mean age 22.84 years, SD=6.85) across all faculties from a large Australian metropolitan university. Students completed a questionnaire comprising of the USDI, the depression anxiety stress scale (DASS) and Life Satisfaction Scale (LSS). The three correlated factor model was shown to be an acceptable fit to the data, indicating sound construct validity. Internal consistency of the scale was also demonstrated to be sound, with high Cronbach alpha values. Temporal stability of the scale was also shown to be strong through test-retest analysis. Finally, concurrent and discriminant validity was examined with correlations between the USDI and DASS subscales as well as the LSS, with sound results further supporting the construct validity of the scale. Cut-off points were also developed to aid total score interpretation. Response rates are unclear. In addition, the representativeness of the sample could be improved potentially through targeted recruitment (i.e. reviewing the online sample statistics during data collection, examining the representativeness trends and addressing particular faculties within the university that were underrepresented). The USDI provides a valid and reliable method of assessing depressive symptoms found among university students. © 2013 Elsevier B.V. All rights reserved.
Towards Establishment of a Rice Stress Response Interactome
Seo, Young-Su; Chern, Mawsheng; Bartley, Laura E.; Han, Muho; Jung, Ki-Hong; Lee, Insuk; Walia, Harkamal; Richter, Todd; Xu, Xia; Cao, Peijian; Bai, Wei; Ramanan, Rajeshwari; Amonpant, Fawn; Arul, Loganathan; Canlas, Patrick E.; Ruan, Randy; Park, Chang-Jin; Chen, Xuewei; Hwang, Sohyun; Jeon, Jong-Seong; Ronald, Pamela C.
2011-01-01
Rice (Oryza sativa) is a staple food for more than half the world and a model for studies of monocotyledonous species, which include cereal crops and candidate bioenergy grasses. A major limitation of crop production is imposed by a suite of abiotic and biotic stresses resulting in 30%–60% yield losses globally each year. To elucidate stress response signaling networks, we constructed an interactome of 100 proteins by yeast two-hybrid (Y2H) assays around key regulators of the rice biotic and abiotic stress responses. We validated the interactome using protein–protein interaction (PPI) assays, co-expression of transcripts, and phenotypic analyses. Using this interactome-guided prediction and phenotype validation, we identified ten novel regulators of stress tolerance, including two from protein classes not previously known to function in stress responses. Several lines of evidence support cross-talk between biotic and abiotic stress responses. The combination of focused interactome and systems analyses described here represents significant progress toward elucidating the molecular basis of traits of agronomic importance. PMID:21533176
NASA Astrophysics Data System (ADS)
Astuti, Sri Rejeki Dwi; Suyanta, LFX, Endang Widjajanti; Rohaeti, Eli
2017-05-01
The demanding of assessment in learning process was impact by policy changes. Nowadays, assessment is not only emphasizing knowledge, but also skills and attitudes. However, in reality there are many obstacles in measuring them. This paper aimed to describe how to develop integrated assessment instrument and to verify instruments' validity such as content validity and construct validity. This instrument development used test development model by McIntire. Development process data was acquired based on development test step. Initial product was observed by three peer reviewer and six expert judgments (two subject matter experts, two evaluation experts and two chemistry teachers) to acquire content validity. This research involved 376 first grade students of two Senior High Schools in Bantul Regency to acquire construct validity. Content validity was analyzed used Aiken's formula. The verifying of construct validity was analyzed by exploratory factor analysis using SPSS ver 16.0. The result show that all constructs in integrated assessment instrument are asserted valid according to content validity and construct validity. Therefore, the integrated assessment instrument is suitable for measuring critical thinking abilities and science process skills of senior high school students on electrolyte solution matter.
Koblitz, Amber R.; Persoskie, Alexander; Ferrer, Rebecca A.; Klein, William M. P.; Dwyer, Laura A.; Park, Elyse R.
2016-01-01
Introduction: Absolute and comparative risk perceptions, worry, perceived severity, perceived benefits, and self-efficacy are important theoretical determinants of tobacco use, but no measures have been validated to ensure the discriminant validity as well as test-retest reliability of these measures in the tobacco context. The purpose of the current study is to examine the reliability and factor structure of a measure assessing smoking-related health cognitions and emotions in a national sample of current and former heavy smokers in the National Lung Screening Trial. Methods: A sub-study of the National Lung Screening Trial assessed current and former smokers’ (age 55–74; N = 4379) self-reported health cognitions and emotions at trial enrollment and at 12-month follow-up. Items were derived from the Health Belief Model and Self-Regulation Model. Results: An exploratory factor analysis of baseline responses revealed a five-factor structure for former smokers (risk perceptions, worry, perceived severity, perceived benefits, and self-efficacy) and a six-factor structure for current smokers, such that absolute risk and comparative risk perceptions emerged as separate factors. A confirmatory factor analysis of 12-month follow-up responses revealed a good fit for the five latent constructs for former smokers and six latent constructs for current smokers. Longitudinal stability of these constructs was also demonstrated. Conclusions: This is the first study to examine tobacco-related health cognition and emotional constructs over time in current and former heavy smokers undergoing lung screening. This study found that the theoretical constructs were stable across time and that the factor structure differed based on smoking status (current vs. former). PMID:25964503
Klassen, Anne; Klaassen, Robert J; Dix, David; Pritchard, Sheila; Yanofsky, Rochelle; Sung, Lillian
2010-08-01
A comprehensive evaluation of the psychometric properties of Care of My Child With Cancer (CMCC) was performed in a sample of 411 parents of children undergoing treatment of cancer at five Canadian pediatric oncology centers. Psychometric tests used to assess data quality, targeting, reliability, and construct validity demonstrated that the CMCC is a scientific sound measure. The CMCC will be helpful for assessing increasing parental responsibility for caregiving tasks associated with cancer care. Copyright 2010 Elsevier Inc. All rights reserved.
A formal approach to validation and verification for knowledge-based control systems
NASA Technical Reports Server (NTRS)
Castore, Glen
1987-01-01
As control systems become more complex in response to desires for greater system flexibility, performance and reliability, the promise is held out that artificial intelligence might provide the means for building such systems. An obstacle to the use of symbolic processing constructs in this domain is the need for verification and validation (V and V) of the systems. Techniques currently in use do not seem appropriate for knowledge-based software. An outline of a formal approach to V and V for knowledge-based control systems is presented.
DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M
2017-10-27
The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no significant change over time (P>.05) for all but skill development (P=.001). Construct reliability was good for items assessing age appropriateness of apps for children, teens, and a general audience. In addition, construct reliability was acceptable for assessing app appropriateness for various target audiences (Cronbach alpha >.70). For the 5 main factors, ICC (1,k) was >.80, with a P value of <.05. When 15 nutrition professionals evaluated one app, ICC (2,15) was .98, with a P value of <.001 for all 7 constructs when the modifiable items were specified for adults seeking weight loss support. Our preliminary effort shows that AQEL is a valid, reliable instrument for evaluating nutrition apps' qualities for clinical interventions by nutrition clinicians, educators, and researchers. Further efforts in validating AQEL in various contexts are needed. ©Kristen Nicole DiFilippo, Wenhao Huang, Karen M. Chapman-Novakofski. Originally published in JMIR Mhealth and Uhealth (http://mhealth.jmir.org), 27.10.2017.
[New questionnaire to assess self-efficacy toward physical activity in children].
Aedo, Angeles; Avila, Héctor
2009-10-01
To design a questionnaire for assessment of self-efficacy toward physical activity in school children, as well as to measure its construct validity, test-retest reliability, and internal consistency. A four-stage multimethod approach was used: (1) bibliographic research followed by exploratory study and the formulation of questions and responses based on a dichotomous scale of 14 items; (2) validation of the content by a panel of experts; (3) application of the preliminary version of the questionnaire to a sample of 900 school-aged children in Mexico City; and (4) determination of the construct validity, test-retest reliability, and internal consistency (Cronbach's alpha). Three factors were identified that explain 64.15% of the variance: the search for positive alternatives to physical activity, ability to deal with possible barriers to exercising, and expectations of skill or competence. The model was validated using the goodness of fit, and the result of 65% less than 0.05 indicated that the estimated factor model fit the data. Cronbach's consistency alpha was 0.733; test-retest reliability was 0.867. The scale designed has adequate reliability and validity. These results are a good indicator of self-efficacy toward physical activity in school children, which is important when developing programs intended to promote such behavior in this age group.
Marsh, Herbert W; Vallerand, Robert J; Lafrenière, Marc-André K; Parker, Philip; Morin, Alexandre J S; Carbonneau, Noémie; Jowett, Sophia; Bureau, Julien S; Fernet, Claude; Guay, Frédéric; Salah Abduljabbar, Adel; Paquet, Yvan
2013-09-01
The passion scale, based on the dualistic model of passion, measures 2 distinct types of passion: Harmonious and obsessive passions are predictive of adaptive and less adaptive outcomes, respectively. In a substantive-methodological synergy, we evaluate the construct validity (factor structure, reliability, convergent and discriminant validity) of Passion Scale responses (N = 3,571). The exploratory structural equation model fit to the data was substantially better than the confirmatory factor analysis solution, and resulted in better differentiated (less correlated) factors. Results from a 13-model taxonomy of measurement invariance supported complete invariance (factor loadings, factor correlations, item uniquenesses, item intercepts, and latent means) over language (French vs. English; the instrument was originally devised in French, then translated into English) and gender. Strong measurement partial invariance over 5 passion activity groups (leisure, sport, social, work, education) indicates that the same set of items is appropriate for assessing passion across a wide variety of activities--a previously untested, implicit assumption that greatly enhances practical utility. Support was found for the convergent and discriminant validity of the harmonious and obsessive passion scales, based on a set of validity correlates: life satisfaction, rumination, conflict, time investment, activity liking and valuation, and perceiving the activity as a passion.
Dritsakis, Giorgos; van Besouw, Rachel M; Kitterick, Pádraig; Verschuur, Carl A
2017-09-18
A music-related quality of life (MuRQoL) questionnaire was developed for the evaluation of music rehabilitation for adult cochlear implant (CI) users. The present studies were aimed at refinement and validation. Twenty-four experts reviewed the MuRQoL items for face validity. A refined version was completed by 147 adult CI users, and psychometric techniques were used for item selection, assessment of reliability, and definition of the factor structure. The same participants completed the Short Form Health Survey for construct validation. MuRQoL responses from 68 CI users were compared with those of a matched group of adults with normal hearing. Eighteen items measuring music perception and engagement and 18 items measuring their importance were selected; they grouped together into 2 domains. The final questionnaire has high internal consistency and repeatability. Significant differences between CI users and adults with normal hearing and a correlation between music engagement and quality of life support construct validity. Scores of music perception and engagement and importance for the 18 items can be combined to assess the impact of music on the quality of life. The MuRQoL questionnaire is a reliable and valid measure of self-reported music perception, engagement, and their importance for adult CI users with potential to guide music aural rehabilitation.
Kanofsky, Konstantin; Lehmeyer, Mona; Schulze, Jutta; Hehl, Reinhard
2016-01-01
Plants recognize pathogens by microbe-associated molecular patterns (MAMPs) and subsequently induce an immune response. The regulation of gene expression during the immune response depends largely on cis-sequences conserved in promoters of MAMP-responsive genes. These cis-sequences can be analyzed by constructing synthetic promoters linked to a reporter gene and by testing these constructs in transient expression systems. Here, the use of the parsley (Petroselinum crispum) protoplast system for analyzing MAMP-responsive synthetic promoters is described. The synthetic promoter consists of four copies of a potential MAMP-responsive cis-sequence cloned upstream of a minimal promoter and the uidA reporter gene. The reporter plasmid contains a second reporter gene, which is constitutively expressed and hence eliminates the requirement of a second plasmid used as a transformation control. The reporter plasmid is transformed into parsley protoplasts that are elicited by the MAMP Pep25. The MAMP responsiveness is validated by comparing the reporter gene activity from MAMP-treated and untreated cells and by normalizing reporter gene activity using the constitutively expressed reporter gene.
Lee, Sun; Bae, Yuna H; Worley, Marcia; Law, Anandi
2017-09-08
Barriers to medication adherence stem from multiple factors. An effective and convenient tool is needed to identify these barriers so that clinicians can provide a tailored, patient-centered consultation with patients. The Modified Drug Adherence Work-up Tool (M-DRAW) was developed as a 13-item checklist questionnaire to identify barriers to medication adherence. The response scale was a 4-point Likert scale of frequency of occurrence (1 = never to 4 = often). The checklist was accompanied by a GUIDE that provided corresponding motivational interview-based intervention strategies for each identified barrier. The current pilot study examined the psychometric properties of the M-DRAW checklist (reliability, responsiveness and discriminant validity) in patients taking one or more prescription medication(s) for chronic conditions. A cross-sectional sample of 26 patients was recruited between December 2015 and March 2016 at an academic medical center pharmacy in Southern California. A priming question that assessed self-reported adherence was used to separate participants into the control group of 17 "adherers" (65.4%), and into the intervention group of nine "unintentional and intentional non-adherers" (34.6%). Comparable baseline characteristics were observed between the two groups. The M-DRAW checklist showed acceptable reliability (13 item; alpha = 0.74) for identifying factors and barriers leading to medication non-adherence. Discriminant validity of the tool and the priming question was established by the four-fold number of barriers to adherence identified within the self-selected intervention group compared to the control group (4.4 versus 1.2 barriers, p < 0.05). The current study did not investigate construct validity due to small sample size and challenges on follow-up with patients. Future testing of the tool will include construct validation.
Umbehr, Martin H; Bachmann, Lucas M; Poyet, Cedric; Hammerer, Peter; Steurer, Johann; Puhan, Milo A; Frei, Anja
2018-02-20
No official German translation exists for the 50-item Expanded Prostate Cancer Index Composite (EPIC), and no minimal important difference (MID) has been established yet. The aim of the study was to translate and validate a German version of the EPIC with cultural adaptation to the different German speaking countries and to establish the MID. We translated and culturally adapted the EPIC into German. For validation, we included a consecutive subsample of 92 patients with localized prostate cancer undergoing radical prostatectomy who participated the Prostate Cancer Outcomes Cohort. Baseline and follow-up assessments took place before and six weeks after prostatectomy in 2010 and 2011. We assessed the EPIC, EORTC QLQ-PR25, Feeling Thermometer, SF-36 and a global rating of health state change variable. We calculated the internal consistency, test-retest reliability, construct validity, responsiveness and MID. For most EPIC domains and subscales, our a priori defined criteria for reliability were fulfilled (construct reliability: Cronbach's alpha 0.7-0.9; test-retest reliability: intraclass-correlation coefficient ≥ 0.7). Cross-sectional and longitudinal correlations between EPIC and EORTC QLQ-PR25 domains ranged from 0.14-0.79, and 0.06-0.5 and 0.08-0.72 for Feeling Thermometer and SF-36, respectively. We established MID values of 10, 4, 12, and 6 for the urinary, bowel, sexual and hormonal domain. The German version of the EPIC is reliable, responsive and valid to measure HRQL in prostate cancer patients and is now available in German language. With the suggested MID we provide interpretation to what extent changes in HRQL are clinically relevant for patients. Hence, study results are of interest beyond German speaking countries.
A structured interview for the DSM-III personality disorders. A preliminary report.
Stangl, D; Pfohl, B; Zimmerman, M; Bowers, W; Corenthal, C
1985-06-01
With few exceptions, published studies fail to indicate that the DSM-III personality disorders can be distinguished from each other with respect to etiology, prognosis, treatment response, or family history. The Structured Interview for the DSM-III Personality Disorders (SIDP) was developed to improve axis II diagnostic reliability, and hence allow validity testing of axis II. Sixty-three subjects were independently rated by two interviewers using the SIDP. The kappa coefficients for interrater agreement reached .70 or higher for histrionic, borderline, and dependent personalities. While it is impossible to separate the validity testing of the SIDP from validity testing of the DSM-III personality criteria themselves, preliminary results from 102 inpatient SIDP interviews suggest some criterion-based validity with respect to standard personality rating scales and some construct validity with respect to the dexamethasone suppression test.
ERIC Educational Resources Information Center
Rossi, Robert Joseph
Methods drawn from four logical theories associated with studies of inductive processes are applied to the assessment and evaluation of experimental episode construct validity. It is shown that this application provides for estimates of episode informativeness with respect to the person examined in terms of the construct and to the construct…
de Boer, A Siebe; Tjioe, Roderik J C; Van der Sijde, Fleur; Meuffels, Duncan E; den Hoed, Pieter T; Van der Vlies, Cornelis H; Tuinebreijer, Wim E; Verhofstad, Michael H J; Van Lieshout, Esther M M
2017-08-03
The American Orthopaedic Foot and Ankle Society (AOFAS) Ankle-Hindfoot Scale is among the most commonly used instruments for measuring outcome of treatment in patients who sustained a complex ankle or hindfoot injury. It consists of a patient-reported and a physician-reported part. A validated, Dutch version of this instrument is currently not available. The aim of this study was to translate the instrument into Dutch and to determine the measurement properties of the AOFAS Ankle-Hindfoot Scale Dutch language version (DLV) in patients with a unilateral ankle fracture. Multicentre (two Dutch hospitals), prospective observational study. In total, 142 patients with a unilateral ankle fracture were included. Ten patients were lost to follow-up. Patients completed the subjective (patient-reported) part of the AOFAS Ankle-Hindfoot Scale-DLV. A physician or trained physician-assistant completed the physician-reported part. For comparison and evaluation of the measuring characteristics, the Foot Function Index and the Short Form-36 were completed by the patient. Descriptive statistics (including floor and ceiling effects), reliability (ie, internal consistency), construct validity, reproducibility (ie, test-retest reliability, agreement and smallest detectable change) and responsiveness were determined. The AOFAS-DLV and its subscales showed good internal consistency (Cronbach's α >0.90). Construct validity and longitudinal validity were proven to be adequate (76.5% of predefined hypotheses were confirmed). Floor effects were not present. Ceiling effects were present from 6 months onwards, as expected. Responsiveness was adequate, with a smallest detectable change of 12.0 points. The AOFAS-DLV is a reliable, valid and responsive measurement instrument for evaluating functional outcome in patients with a unilateral ankle fracture. This implies that the questionnaire is suitable to compare different treatment modalities within this population or to compare outcome across hospitals. The Netherlands Trial Register (NTR5613; 05-jan-2016). © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Determining the Scoring Validity of a Co-Constructed CEFR-Based Rating Scale
ERIC Educational Resources Information Center
Deygers, Bart; Van Gorp, Koen
2015-01-01
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Validation of a condition-specific measure for women having an abnormal screening mammography.
Brodersen, John; Thorsen, Hanne; Kreiner, Svend
2007-01-01
The aim of this study is to assess the validity of a new condition-specific instrument measuring psychosocial consequences of abnormal screening mammography (PCQ-DK33). The draft version of the PCQ-DK33 was completed on two occasions by 184 women who had received an abnormal screening mammography and on one occasion by 240 women who had received a normal screening result. Item Response Theories and Classical Test Theories were used to analyze data. Construct validity, concurrent validity, known group validity, objectivity and reliability were established by item analysis examining the fit between item responses and Rasch models. Six dimensions covering anxiety, behavioral impact, sense of dejection, impact on sleep, breast examination, and sexuality were identified. One item belonging to the dejection dimension had uniform differential item functioning. Two items not fitting the Rasch models were retained because of high face validity. A sick leave item added useful information when measuring side effects and socioeconomic consequences of breast cancer screening. Five "poor items" were identified and should be deleted from the final instrument. Preliminary evidence for a valid and reliable condition-specific measure for women having an abnormal screening mammography was established. The measure includes 27 "good" items measuring different attributes of the same overall latent structure-the psychosocial consequences of abnormal screening mammography.
de la Peña, June Bryan; Dela Peña, Irene Joy; Custodio, Raly James; Botanas, Chrislean Jun; Kim, Hee Jin; Cheong, Jae Hoon
2018-05-01
Attention-deficit/hyperactivity disorder (ADHD) is a common, behavioral, and heterogeneous neurodevelopmental condition characterized by hyperactivity, impulsivity, and inattention. Symptoms of this disorder are managed by treatment with methylphenidate, amphetamine, and/or atomoxetine. The cause of ADHD is unknown, but substantial evidence indicates that this disorder has a significant genetic component. Transgenic animals have become an essential tool in uncovering the genetic factors underlying ADHD. Although they cannot accurately reflect the human condition, they can provide insights into the disorder that cannot be obtained from human studies due to various limitations. An ideal animal model of ADHD must have face (similarity in symptoms), predictive (similarity in response to treatment or medications), and construct (similarity in etiology or underlying pathophysiological mechanism) validity. As the exact etiology of ADHD remains unclear, the construct validity of animal models of ADHD would always be limited. The proposed transgenic animal models of ADHD have substantially increased and diversified over the years. In this paper, we compiled and explored the validity of proposed transgenic animal models of ADHD. Each of the reviewed transgenic animal models has strengths and limitations. Some fulfill most of the validity criteria of an animal model of ADHD and have been extensively used, while there are others that require further validation. Nevertheless, these transgenic animal models of ADHD have provided and will continue to provide valuable insights into the genetic underpinnings of this complex disorder.
Ishii, Hitoshi; Shimatsu, Akira; Okimura, Yasuhiko; Tanaka, Toshiaki; Hizuka, Naomi; Kaji, Hidesuke; Hanew, Kunihiko; Oki, Yutaka; Yamashiro, Sayuri; Takano, Koji; Chihara, Kazuo
2012-01-01
To develop and validate the Adult Hypopituitarism Questionnaire (AHQ) as a disease-specific, self-administered questionnaire for evaluation of quality of life (QOL) in adult patients with hypopituitarism. We developed and validated this new questionnaire, using a standardized procedure which included item development, pilot-testing and psychometric validation. Of the patients who participated in psychometric validation, those whose clinical conditions were judged to be stable were asked to answer the survey questionnaire twice, in order to assess test-retest reliability. Content validity of the initial questionnaire was evaluated via two pilot tests. After these tests, we made minor revisions and finalized the initial version of the questionnaire. The questionnaire was constructed with two domains, one psycho-social and the other physical. For psychometric assessment, analyses were performed on the responses of 192 adult patients with various types of hypopituitarism. The intraclass correlations of the respective domains were 0.91 and 0.95, and the Cronbach's alpha coefficients were 0.96 and 0.95, indicating adequate test-retest reliability and internal consistency for each domain. For known-group validity, patients with hypopituitarism due to hypothalamic disorder showed significantly lower scores in 11 out of 13 sub-domains compared to those who had hypopituitarism due to pituitary disorder. Regarding construct validity, the domain structure was found to be almost the same as that initially hypothesized. Exploratory factor analysis (n = 228) demonstrated that each domain consisted of six and seven sub-domains. The AHQ showed good reliability and validity for evaluating QOL in adult patients with hypopituitarism.
Validation of Medical Tourism Service Quality Questionnaire (MTSQQ) for Iranian Hospitals
Qolipour, Mohammad; Torabipour, Amin; Khiavi, Farzad Faraji; Malehi, Amal Saki
2017-01-01
Introduction Assessing service quality is one of the basic requirements to develop the medical tourism industry. There is no valid and reliable tool to measure service quality of medical tourism. This study aimed to determine the reliability and validity of a Persian version of medical tourism service quality questionnaire for Iranian hospitals. Methods To validate the medical tourism service quality questionnaire (MTSQQ), a cross-sectional study was conducted on 250 Iraqi patients referred to hospitals in Ahvaz (Iran) from 2015. To design a questionnaire and determine its content validity, the Delphi Technique (3 rounds) with the participation of 20 medical tourism experts was used. Construct validity of the questionnaire was assessed through exploratory and confirmatory factor analysis. Reliability was assessed using Cronbach’s alpha coefficient. Data were analyzed by Excel 2007, SPSS version18, and Lisrel l8.0 software. Results The content validity of the questionnaire with CVI=0.775 was confirmed. According to exploratory factor analysis, the MTSQQ included 31 items and 8 dimensions (tangibility, reliability, responsiveness, assurance, empathy, exchange and travel facilities, technical and infrastructure facilities and safety and security). Construct validity of the questionnaire was confirmed, based on the goodness of fit quantities of model (RMSEA=0.032, CFI= 0.98, GFI=0.88). Cronbach’s alpha coefficient was 0.837 and 0.919 for expectation and perception questionnaire. Conclusion The results of the study showed that the medical tourism SERVQUAL questionnaire with 31 items and 8 dimensions was a valid and reliable tool to measure service quality of medical tourism in Iranian hospitals. PMID:28461863
AlMenhali, Entesar Ali; Khalid, Khalizani; Iyanna, Shilpa
2018-01-01
The Environmental Attitudes Inventory (EAI) was developed to evaluate the multidimensional nature of environmental attitudes; however, it is based on a dataset from outside the Arab context. This study reinvestigated the construct validity of the EAI with a new dataset and confirmed the feasibility of applying it in the Arab context. One hundred and forty-eight subjects in Study 1 and 130 in Study 2 provided valid responses. An exploratory factor analysis (EFA) was used to extract a new factor structure in Study 1, and confirmatory factor analysis (CFA) was performed in Study 2. Both studies generated a seven-factor model, and the model fit was discussed for both the studies. Study 2 exhibited satisfactory model fit indices compared to Study 1. Factor loading values of a few items in Study 1 affected the reliability values and average variance extracted values, which demonstrated low discriminant validity. Based on the results of the EFA and CFA, this study showed sufficient model fit and suggested the feasibility of applying the EAI in the Arab context with a good construct validity and internal consistency.
2018-01-01
The Environmental Attitudes Inventory (EAI) was developed to evaluate the multidimensional nature of environmental attitudes; however, it is based on a dataset from outside the Arab context. This study reinvestigated the construct validity of the EAI with a new dataset and confirmed the feasibility of applying it in the Arab context. One hundred and forty-eight subjects in Study 1 and 130 in Study 2 provided valid responses. An exploratory factor analysis (EFA) was used to extract a new factor structure in Study 1, and confirmatory factor analysis (CFA) was performed in Study 2. Both studies generated a seven-factor model, and the model fit was discussed for both the studies. Study 2 exhibited satisfactory model fit indices compared to Study 1. Factor loading values of a few items in Study 1 affected the reliability values and average variance extracted values, which demonstrated low discriminant validity. Based on the results of the EFA and CFA, this study showed sufficient model fit and suggested the feasibility of applying the EAI in the Arab context with a good construct validity and internal consistency. PMID:29758021
Construct Validity of Neuropsychological Tests in Schizophrenia.
ERIC Educational Resources Information Center
Allen, Daniel N.; Aldarondo, Felito; Goldstein, Gerald; Huegel, Stephen G.; Gilbertson, Mark; van Kammen, Daniel P.
1998-01-01
The construct validity of neuropsychological tests in patients with schizophrenia was studied with 39 patients who were evaluated with a battery of six tests assessing attention, memory, and abstract reasoning abilities. Results support the construct validity of the neuropsychological tests in patients with schizophrenia. (SLD)
Pasco, Paul Matthew D; Jamora, Roland Dominic G; Rosales, Raymond L; Diesta, Cid Czarina E; Ng, Arlene R; Teleg, Rosalia A; Go, Criscely L; Lee, Lillian; Fernandez, Hubert H
2017-01-01
X-linked dystonia-parkinsonism(XDP) is a neurodegenerative disorder endemic to the Philippines. A rating scale was developed by the authors under the guidance of the Movement Disorder Society of the Philippines (MDSP) to assess XDP severity and progression, functional impact, and response to treatment in future clinical trials. Our main objective was to validate our new scale, the XDP-MDSP scale. The initial validation process included pragmatic testing to XDP patients followed by a modified Delphi procedure with an international advisory panel of dystonia, parkinsonism and scale development experts. Pearson correlation was used to assess construct validity of our new scale versus the assess construct validity of our new scale versus standard dystonia, parkinsonism, non-motor and functional scales; and also to assess divergent validity against behavioral and cognitive scales. The 37-item XDP-MDSP scale has five parts: I-dystonia, II-parkinsonism, III-non-motor features, IV-ADL, and V-global impression. After initial validation, the scale was administered to 204 XDP patients. Inter-domain correlation for the first four parts was acceptable. The correlation between these domains and the global rating was slightly lower. Correlations between Parts I, II, III, and IV versus standard dystonia, parkinsonism, non-motor and functional scales were acceptable with values ranging from 0.323 to 0.428. For divergent validity, a significant correlation was seen with behavioral scales. No significant correlation was noted with the cognitive scale. The proposed XDP-MDSP scale is internally valid but the global rating subscale may need to be modified or eliminated. While there is convergent validity, divergent validation was successful only on cognitive and not behavioral scales. The frequent co-occurrence of anxiety and depression, and its effect on the motor and functional state, may explain this finding.
Student mathematical imagination instruments: construction, cultural adaptation and validity
NASA Astrophysics Data System (ADS)
Dwijayanti, I.; Budayasa, I. K.; Siswono, T. Y. E.
2018-03-01
Imagination has an important role as the center of sensorimotor activity of the students. The purpose of this research is to construct the instrument of students’ mathematical imagination in understanding concept of algebraic expression. The researcher performs validity using questionnaire and test technique and data analysis using descriptive method. Stages performed include: 1) the construction of the embodiment of the imagination; 2) determine the learning style questionnaire; 3) construct instruments; 4) translate to Indonesian as well as adaptation of learning style questionnaire content to student culture; 5) perform content validation. The results stated that the constructed instrument is valid by content validation and empirical validation so that it can be used with revisions. Content validation involves Indonesian linguists, english linguists and mathematics material experts. Empirical validation is done through a legibility test (10 students) and shows that in general the language used can be understood. In addition, a questionnaire test (86 students) was analyzed using a biserial point correlation technique resulting in 16 valid items with a reliability test using KR 20 with medium reability criteria. While the test instrument test (32 students) to find all items are valid and reliability test using KR 21 with reability is 0,62.
Measuring Work Functioning: Validity of a Weighted Composite Work Functioning Approach.
Boezeman, Edwin J; Sluiter, Judith K; Nieuwenhuijsen, Karen
2015-09-01
To examine the construct validity of a weighted composite work functioning measurement approach. Workers (health-impaired/healthy) (n = 117) completed a composite measure survey that recorded four central work functioning aspects with existing scales: capacity to work, quality of work performance, quantity of work, and recovery from work. Previous derived weights reflecting the relative importance of these aspects of work functioning were used to calculate the composite weighted work functioning score of the workers. Work role functioning, productivity, and quality of life were used for validation. Correlations were calculated and norms applied to examine convergent and divergent construct validity. A t test was conducted and a norm applied to examine discriminative construct validity. Overall the weighted composite work functioning measure demonstrated construct validity. As predicted, the weighted composite score correlated (p < .001) strongly (r > .60) with work role functioning and productivity (convergent construct validity), and moderately (.30 < r < .60) with physical quality of life and less strongly than work role functioning and productivity with mental quality of life (divergent validity). Further, the weighted composite measure detected that health-impaired workers show with a large effect size (Cohen's d > .80) significantly worse work functioning than healthy workers (discriminative validity). The weighted composite work functioning measurement approach takes into account the relative importance of the different work functioning aspects and demonstrated good convergent, fair divergent, and good discriminative construct validity.
Fruit and vegetable shopping practices and social support scales: A validation.
Baranowski, Tom; Missaghian, Mariam; Broadfoot, Alison; Watson, Kathy; Cullen, Karen; Nicklas, Theresa; Fisher, Jennifer; Baranowski, Janice; O'Donnell, Sharon
2006-01-01
To assess the psychometric characteristics of new scales of shopping practices and social support for purchasing fruits and vegetables. Participants were recruited in front of diverse grocery stores. Telephone data collection was done on 2 occasions, separated by 6 weeks. 166 food shoppers with children at home participated. New scales of food shopping practices and social support for purchasing fruits and vegetables were psychometrically analyzed and related to a measure of home fruit or vegetable availability as a test of construct validity. Both classical test and item response theory procedures were used. Correlations related the new measures to home fruit and vegetable availability. Single dimension scales were specified for fruit and vegetable shopping practices (35% of the variance), fruit purchase social support (53% of the variance), and vegetable purchase social support (52% of the variance). Item response theory difficulty estimates varied from -0.64 to 0.73 for fruit and vegetable shopping practices, from -0.55 to 0.33 for fruit purchase social support, and from -0.55 to 0.34 for vegetable social support. Each scale significantly correlated with home fruit and vegetable availability (construct validity), even after controlling for social desirability of response (0.19 for shopping practices, 0.37 for fruit purchasing social support, and 0.28 for vegetable purchasing social support). Person separation reliability was 0.80 for food shopping practices, 0.74 for fruit purchasing social support, and 0.73 for vegetable purchasing social support. The scales performed well. These scales are now available to help better understand fruit and vegetable shopping practices, fruit purchase social support, and vegetable purchase social support.
Psychometric properties of the Mayo Elbow Performance Score.
Celik, Derya
2015-06-01
To translate and culturally adapt the Mayo Elbow Performance Score (MEPS), a widely used instrument for evaluating disability associated with elbow injuries, into Turkish (MEPS-T) and to determine psychometric properties of the translated version. The MEPS was translated into Turkish using published methodological guidelines. The measurement properties of the MEPS-T (construct validity and floor and ceiling effects) were tested in 91 patients with elbow pathology. The reproducibility of the MEPS-T was tested in 59 patients over 7-14 days. The responsiveness of the MEPS-T was tested in a subgroup of 46 patients diagnosed with lateral epicondylitis and who received conservative treatment for 6 weeks. The interclass correlation coefficient (ICC) was used to estimate the test-retest reliability. The construct validity was analyzed with the disabilities of the arm, shoulder and hand (DASH), Visual Analog Scale (VAS) and the Short Form 36 (SF-36). Effect size (ES) was used to assess the responsiveness. The distribution of floor and ceiling effects was determined. The MEPS-T showed very good test-retest reliability (ICC 0.89). The correlation coefficients between the MEPS-T and DASH and VAS were -0.61 and -0.53, respectively (p < 0.001). The highest correlations were between the MEPS-T and the mental component summary (r = 0.47, p = 0.001) and role emotional (r = 0.45, p = 0.001). The MEPS-T ES, 0.50, was moderate (95% CI 0.33-0.62). We observed no ceiling or floor effects. The MEPS-T represents a valid, reliable and moderately responsive instrument for evaluating patients with elbow disease.
The validation of the Supervision of Thesis Questionnaire (STQ).
Henricson, Maria; Fridlund, Bengt; Mårtensson, Jan; Hedberg, Berith
2018-06-01
The supervision process is characterized by differences between the supervisors' and the students' expectations before the start of writing a bachelor thesis as well as after its completion. A review of the literature did not reveal any scientifically tested questionnaire for evaluating nursing students' expectations of the supervision process when writing a bachelor thesis. The aim of the study was to determine the construct validity and internal consistency reliability of a questionnaire for measuring nursing students' expectations of the bachelor thesis supervision process. The study had a developmental and methodological design carried out in four steps including construct validity and internal consistency reliability statistical procedures: construction of the items, assessment of face validity, data collection and data analysis. This study was conducted at a university in southern Sweden, where students on the "Nursing student thesis, 15 ECTS" course were consecutively selected for participation. Of the 512 questionnaires distributed, 327 were returned, a response rate of 64%. Five factors with a total variance of 74% and good communalities, ≥0.64, were extracted from the 10-item STQ. The internal consistency of the 10 items was 0.68. The five factors were labelled: The nature of the supervision process, The supervisor's role as a coach, The students' progression to self-support, The interaction between students and supervisor and supervisor competence. A didactic, useful and secure questionnaire measuring nursing students' expectations of the bachelor thesis supervision process based on three main forms of supervision was created. Copyright © 2018 Elsevier Ltd. All rights reserved.
Reynolds, Nicholas A; Ski, Chantal F; McEvedy, Samantha M; Thompson, David R; Cameron, Jan
2018-02-14
The aim of this study was to psychometrically evaluate the Heart Failure Screening Tool (Heart-FaST) via: (1) examination of internal construct validity; (2) testing of scale function in accordance with design; and (3) recommendation for change/s, if items are not well adjusted, to improve psychometric credential. Self-care is vital to the management of heart failure. The Heart-FaST may provide a prospective assessment of risk, regarding the likelihood that patients with heart failure will engage in self-care. Psychometric validation of the Heart-FaST using Rasch analysis. The Heart-FaST was administered to 135 patients (median age = 68, IQR = 59-78 years; 105 males) enrolled in a multidisciplinary heart failure management program. The Heart-FaST is a nurse-administered tool for screening patients with HF at risk of poor self-care. A Rasch analysis of responses was conducted which tested data against Rasch model expectations, including whether items serve as unbiased, non-redundant indicators of risk and measure a single construct and that rating scales operate as intended. The results showed that data met Rasch model expectations after rescoring or deleting items due to poor discrimination, disordered thresholds, differential item functioning, or response dependence. There was no evidence of multidimensionality which supports the use of total scores from Heart-FaST as indicators of risk. Aggregate scores from this modified screening tool rank heart failure patients according to their "risk of poor self-care" demonstrating that the Heart-FaST items constitute a meaningful scale to identify heart failure patients at risk of poor engagement in heart failure self-care. © 2018 John Wiley & Sons Ltd.
Golder, Vera; Huq, Molla; Franklyn, Kate; Calderone, Alicia; Lateef, Aisha; Lau, Chak Sing; Lee, Alfred Lok Hang; Navarra, Sandra Teresa V; Godfrey, Timothy; Oon, Shereen; Hoi, Alberta Yik Bun; Morand, Eric Francis; Nikpour, Mandana
2017-06-01
To evaluate the construct validity of the Lupus Low Disease Activity State (LLDAS), a treatment target in systemic lupus erythematosus (SLE). Fifty SLE case summaries based on real patients were prepared and assessed independently for meeting the operational definition of LLDAS. Fifty international rheumatologists with expertise in SLE, but with no prior involvement in the LLDAS project, responded to a survey in which they were asked to categorize the disease activity state of each case as remission, low, moderate, or high. Agreement between expert opinion and LLDAS was assessed using Cohen's kappa. Overall agreement between expert opinion and the operational definition of LLDAS was 77.96% (95% CI: 76.34-79.58%), with a Cohen's kappa of 0.57 (95% CI: 0.55-0.61). Of the cases (22 of 50) that fulfilled the operational definition of LLDAS, only 5.34% (59 of 22 × 50) of responses classified the cases as moderate/high activity. Of the cases that did not fulfill the operational definition of LLDAS (28 of 50), 35.14% (492 of 28 × 50) of responses classified the cases as remission/low activity. Common reasons for discordance were assignment to remission/low activity of cases with higher corticosteroid doses than defined in LLDAS (prednisolone ≤ 7.5mg) or with SLEDAI-2K >4 due to serological activity (high anti-dsDNA antibody and/or low complement). LLDAS has good construct validity with high overall agreement between the operational definition of LLDAS and expert opinion. Discordance of results suggests that the operational definition of LLDAS is more stringent than expert opinion at defining a low disease activity state. Copyright © 2017 Elsevier Inc. All rights reserved.
The laval questionnaire: a new instrument to measure quality of life in morbid obesity.
Therrien, Fanny; Marceau, Picard; Turgeon, Nathalie; Biron, Simon; Richard, Denis; Lacasse, Yves
2011-08-15
Our recent review of the literature uncovered eleven obesity-specific quality of life questionnaires, all with incomplete demonstration of their measurement properties. Our objective was to validate a new self-administered questionnaire specific to morbid obesity to be used in clinical trials. The study was carried out at the bariatric surgery clinic of Laval Hospital, Quebec City, Canada. This study followed our description of health-related quality of life in morbid obesity from which we constructed the Laval Questionnaire. Its construct validity and responsiveness were tested by comparing the baseline and changes at 1-year follow-up in 6 domain scores (symptoms, activity/mobility, personal hygiene/clothing, emotions, social interactions, sexual life) with those of questionnaires measuring related constructs (SF-36, Impact of Weight on Quality of Life-Lite, Rosenberg Self-Esteem Scale and Beck Depression Inventory-II). 112 patients (67 who got bariatric surgery, 45 who remained on the waiting list during the study period) participated in this study. The analysis of the discriminative function of the questionnaire showed moderate-to-high correlations between the scores in each domain of our instrument and the corresponding questionnaires. The analysis of its evaluative function showed (1) significant differences in score changes between patients with bariatric surgery and those without, and (2) moderate-to-high correlations between the changes in scores in the new instrument and the changes in the corresponding questionnaires. Most of these correlations met the a priori predictions we had made regarding their direction and magnitude. The Laval Questionnaire is a valid measure of health-related quality of life in patients with morbid obesity and is responsive to treatment-induced changes.
Kalsi-Ryan, Sukhvinder; Beaton, Dorcas; Ahn, Henry; Askes, Heather; Drew, Brian; Curt, Armin; Popovic, Milos R; Wang, Justin; Verrier, Mary C; Fehlings, Michael G
2016-02-01
As spinal cord injury (SCI) trials begin to involve subjects with acute cervical SCI, establishing the property of an upper limb outcome measure to detect change over time is critical for its usefulness in clinical trials. The objectives of this study were to define responsiveness, sensitivity, and minimally detectable difference (MDD) of the Graded Redefined Assessment of Strength, Sensibility, and Prehension (GRASSP). An observational, longitudinal study was conducted. International Standards of Neurological Classification of SCI (ISNCSCI), GRASSP, Capabilities of Upper Extremity Questionnaire (CUE-Q), and Spinal Cord Independence Measure (SCIM) were administered 0-10 days, 1, 3, 6, and 12 months post-injury. Standardized Response Means (SRM) for GRASSP and ISNCSCI measures were calculated. Longitudinal construct validity was calculated using Pearson correlation coefficients. Smallest real difference for all subtests was calculated to define the MDD values for all GRASSP subtests. Longitudinal construct validity demonstrated GRASSP and all external measures to be responsive to neurological change for 1 year post-injury. SRM values for the GRASSP subtests ranged from 0.25 to 0.85 units greater than that for ISNCSCI strength and sensation, SCIM-SS, and CUE-Q. MDD values for GRASSP subtests ranged from 2-5 points. GRASSP demonstrates good responsiveness and excellent sensitivity that is superior to ISNCSCI and SCIM III. MDD values are useful in the evaluation of interventions in both clinical and research settings. The responsiveness and sensitivity of GRASSP make it a valuable condition-specific measure in tetraplegia, where changes in upper limb neurological and functional outcomes are essential for evaluating the efficacy of interventions.
ERIC Educational Resources Information Center
Wang, Shudong; Wang, Ning; Hoadley, David
This study examined the comparability of scores on the National Nurses Aides Assessment Program (NNAAP) test across language and administration condition groups for calibration and validation samples that were randomly drawn from the same population. A sample of 20,568 candidate responses to 1 test form was used. This examination is given in…
An exploration of human territoriality in forest recreation
Harry C. Zinn; Laurlyn K. Harmon; Brijesh Thapa; Deborah L. Kerstetter; Alan R. Graefe
2002-01-01
Previous studies in human territoriality have focused largely on behavior in urban settings. It is only recently that researchers are examining this construct in the context of forest settings. This study was designed to assess the territorial responses of visitors to Bald Eagle State Forest in central Pennsylvania and explore the structure and predictive validity of a...
Psychometric Validation of the Academic Motivation Scale in a Dental Student Sample.
Orsini, Cesar; Binnie, Vivian; Evans, Phillip; Ledezma, Priscilla; Fuentes, Fernando; Villegas, Maria J
2015-08-01
The Academic Motivation Scale is one of the most frequently used instruments to assess academic motivation. It relies on the self-determination theory of human motivation. However, motivation has been understudied in dental education. Therefore, to address the lack of valid instruments to assess academic motivation in dental education and contribute to future research in the field, the aim of this study was to analyze the psychometric properties of this instrument in a sample of dental students. Participants were 989 Chilean undergraduate dental students (86% response rate) who completed a survey containing a Chilean face-valid version of the Spanish Academic Motivation Scale and three other motivation-related instruments to assess the survey's construct and criterion validity. Later, 76 of the students (out of 100 invited) took the survey again to assess its test-retest stability. The instrument's construct validity was supported by the superior goodness of fit of the seven-subscale Academic Motivation Scale over competing models through confirmatory factor analysis and by the expected correlations among its subscales. The concurrent criterion validity was supported by the confirmation of correlations between its subscales and external criteria. Adequate internal consistency and test-retest correlations were also found. The evidence from this study suggests that the Academic Motivation Scale is a preliminarily valid and reliable instrument to assess motivation in the predoctoral dental context. Future research in this area is needed to confirm or refute these results.
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.
Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott
2015-12-01
To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
Podsakoff, Nathan P; Podsakoff, Philip M; Mackenzie, Scott B; Klinger, Ryan L
2013-01-01
Several researchers have persuasively argued that the most important evidence to consider when assessing construct validity is whether variations in the construct of interest cause corresponding variations in the measures of the focal construct. Unfortunately, the literature provides little practical guidance on how researchers can go about testing this. Therefore, the purpose of this article is to describe how researchers can use video techniques to test whether their scales measure what they purport to measure. First, we discuss how researchers can develop valid manipulations of the focal construct that they hope to measure. Next, we explain how to design a study to use this manipulation to test the validity of the scale. Finally, comparing and contrasting traditional and contemporary perspectives on validation, we discuss the advantages and limitations of video-based validation procedures. PsycINFO Database Record (c) 2013 APA, all rights reserved.
[Design and validation of the CSR-Hospital-SP scale to measure corporate social responsibility].
Mira, José Joaquín; Lorenzo, Susana; Navarro, Isabel; Pérez-Jover, Virtudes; Vitaller, Julián
2013-01-01
To design and validate a scale (CSR-Hospital-SP) to determine health professionals' views on the approach of management to corporate social responsibility (CSR) in their hospital. The literature was reviewed to identify the main CSR scales and select the dimensions to be evaluated. The initial version of the scale consisted of 25 items. A convenience sample of a minimum of 224 health professionals working in five public hospitals in five autonomous regions were invited to respond. Floor and ceiling effects, internal consistency, reliability, and construct validity were analyzed. A total of 233 health professionals responded. The CSR-Hospital-SP scale had 20 items grouped into four factors. The item-total correlation was higher than 0.30; all factor loadings were greater than 0.50; 59.57% of the variance was explained; Cronbach's alpha was 0.90; Spearman-Brown's coefficient was 0.82. The CSR-Hospital-SP scale is a tool designed for hospitals that implement accountability mechanisms and promote socially responsible management approaches. Copyright © 2012 SESPAS. Published by Elsevier Espana. All rights reserved.
Construct validity of the individual work performance questionnaire.
Koopmans, Linda; Bernaards, Claire M; Hildebrandt, Vincent H; de Vet, Henrica C W; van der Beek, Allard J
2014-03-01
To examine the construct validity of the Individual Work Performance Questionnaire (IWPQ). A total of 1424 Dutch workers from three occupational sectors (blue, pink, and white collar) participated in the study. First, IWPQ scores were correlated with related constructs (convergent validity). Second, differences between known groups were tested (discriminative validity). First, IWPQ scores correlated weakly to moderately with absolute and relative presenteeism, and work engagement. Second, significant differences in IWPQ scores were observed for workers differing in job satisfaction, and workers differing in health. Overall, the results indicate acceptable construct validity of the IWPQ. Researchers are provided with a reliable and valid instrument to measure individual work performance comprehensively and generically, among workers from different occupational sectors, with and without health problems.
A measurement model for general noise reaction in response to aircraft noise.
Kroesen, Maarten; Schreckenberg, Dirk
2011-01-01
In this paper a measurement model for general noise reaction (GNR) in response to aircraft noise is developed to assess the performance of aircraft noise annoyance and a direct measure of general reaction as indicators of this concept. For this purpose GNR is conceptualized as a superordinate latent construct underlying particular manifestations. This conceptualization is empirically tested through estimation of a second-order factor model. Data from a community survey at Frankfurt Airport are used for this purpose (N=2206). The data fit the hypothesized factor structure well and support the conceptualization of GNR as a superordinate construct. It is concluded that noise annoyance and a direct measure of general reaction to noise capture a large part of the negative feelings and emotions in response to aircraft noise but are unable to capture all relevant variance. The paper concludes with recommendations for the valid measurement of community reaction and several directions for further research.
Development and Construct Validation of an Academic Emotions Scale
ERIC Educational Resources Information Center
Govaerts, Sophie; Gregoire, Jacques
2008-01-01
This article describes the development and two studies on the construct validity of the Academic Emotions Scale (AES). The AES is a French self-report questionnaire assessing six emotions in the context of school learning: enjoyment, hope, pride, anxiety, shame and frustration. Its construct validity was studied through exploratory and…
Construct Validation of the Fairy Tale Test--Standardization Data.
ERIC Educational Resources Information Center
Coulacoglou, Carina
2002-01-01
Studied the construct validity of the Fairy Tale Test (C. Coulacoglu, 1993), a personality projective test for children, in a sample of 800 Greek children aged 8, 10, and 12. Factor analysis led to identification of eight primary factors, and correlations with other measures provide construct validity evidence. (SLD)
Psychometric properties of the Albanian version of the Orofacial Esthetic Scale: OES-ALB.
Bimbashi, Venera; Čelebić, Asja; Staka, Gloria; Hoxha, Flurije; Peršić, Sanja; Petričević, Nikola
2015-08-26
The aim was to adapt the Orofacial Esthetic Scale (OES) and to test psychometric properties of the Albanian language version in the cultural environment of the Republic of Kosovo. The OES questionnaire was translated from the original English version according to the accepted techniques. The reliability (internal consistency), and validity (construct, convergent and discriminative) were tested in 169 subjects, test-retest in 61 dental students (DS), and responsiveness in 51 prosthodontic patients with treatment needs (PPTN). The corrected item correlation coefficients of OES-ALB ranged from 0.686 to 0.909. The inter-item correlation coefficient ranged between 0.572 and 0.919. The Cronbach's alpha was 0.961 and IIC 0.758. Test- retest was confirmed by good ICCs and by no significant differences of the OES scores through the period of 14 days without any orofacial changes (p > 0.05). Construct validity was proved by the presence of one-factor composition that assumed 79.079% of the variance. Convergent validity showed significant correlation between one general question about satisfaction with orofacial esthetics and the OES summary score, as well as between the sum of the 3 OHIP-ALB49 questions related to orofacial aesthetics and the OES summary score. Discriminative validity was confirmed with statistically significant differences between DS, prosthodontic patients without treatment need and PPTN (p < 0.01). Responsiveness was confirmed by a significant increase of OES scores after PPTN patients received new fixed partial or removable dentures (P < 0.001). The results proved excellent psychometric properties of the OES-ALB questionnaire in the Republic of Kosovo.
Jansen, Elena; Mallan, Kimberley M; Nicholson, Jan M; Daniels, Lynne A
2014-06-04
Early feeding practices lay the foundation for children's eating habits and weight gain. Questionnaires are available to assess parental feeding but overlapping and inconsistent items, subscales and terminology limit conceptual clarity and between study comparisons. Our aim was to consolidate a range of existing items into a parsimonious and conceptually robust questionnaire for assessing feeding practices with very young children (<3 years). Data were from 462 mothers and children (age 21-27 months) from the NOURISH trial. Items from five questionnaires and two study-specific items were submitted to a priori item selection, allocation and verification, before theoretically-derived factors were tested using Confirmatory Factor Analysis. Construct validity of the new factors was examined by correlating these with child eating behaviours and weight. Following expert review 10 factors were specified. Of these, 9 factors (40 items) showed acceptable model fit and internal reliability (Cronbach's α: 0.61-0.89). Four factors reflected non-responsive feeding practices: 'Distrust in Appetite', 'Reward for Behaviour', 'Reward for Eating', and 'Persuasive Feeding'. Five factors reflected structure of the meal environment and limits: 'Structured Meal Setting', 'Structured Meal Timing', 'Family Meal Setting', 'Overt Restriction' and 'Covert Restriction'. Feeding practices generally showed the expected pattern of associations with child eating behaviours but none with weight. The Feeding Practices and Structure Questionnaire (FPSQ) provides a new reliable and valid measure of parental feeding practices, specifically maternal responsiveness to children's hunger/satiety signals facilitated by routine and structure in feeding. Further validation in more diverse samples is required.
2013-01-01
Background Yearly formative knowledge testing (also known as progress testing) was shown to have a limited construct-validity and reliability in postgraduate medical education. One way to improve construct-validity and reliability is to improve the authenticity of a test. As easily accessible internet has become inseparably linked to daily clinical practice, we hypothesized that allowing internet access for a limited amount of time during the progress test would improve the perception of authenticity (face-validity) of the test, which would in turn improve the construct-validity and reliability of postgraduate progress testing. Methods Postgraduate trainees taking the yearly knowledge progress test were asked to participate in a study where they could access the internet for 30 minutes at the end of a traditional pen and paper test. Before and after the test they were asked to complete a short questionnaire regarding the face-validity of the test. Results Mean test scores increased significantly for all training years. Trainees indicated that the face-validity of the test improved with internet access and that they would like to continue to have internet access during future testing. Internet access did not improve the construct-validity or reliability of the test. Conclusion Improving the face-validity of postgraduate progress testing, by adding the possibility to search the internet for a limited amount of time, positively influences test performance and face-validity. However, it did not change the reliability or the construct-validity of the test. PMID:24195696
2013-01-01
Background A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Results Of the three domains identified by factor analysis, the internal consistency was excellent for ‘pain expression’ and ‘psychomotor change’ (0.86 and 0.87) but not for ‘physiological variables’ (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and ‘gold standard’ evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0–30 points) with 96.5% sensitivity and 99.5% specificity. Conclusions The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy. PMID:23867090
Brondani, Juliana T; Mama, Khursheed R; Luna, Stelio P L; Wright, Bonnie D; Niyom, Sirirat; Ambrosio, Jennifer; Vogel, Pamela R; Padovani, Carlos R
2013-07-17
A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Of the three domains identified by factor analysis, the internal consistency was excellent for 'pain expression' and 'psychomotor change' (0.86 and 0.87) but not for 'physiological variables' (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and 'gold standard' evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0-30 points) with 96.5% sensitivity and 99.5% specificity. The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy.
Rasmussen, Trine Bernholdt; Berg, Selina Kikkenborg; Dixon, Jane; Moons, Philip; Konradsen, Hanne
2016-12-01
Negative body perception has been reported in a number of patient populations. No instrument in Danish for measuring body image-related concerns has been available. Without such an instrument, understanding of the phenomenon in Danish-speaking populations is limited. The purpose of the study was thus to translate and validate a Danish version of the Body Image Quality of Life Inventory (BIQLI), in order to obtain a valid instrument applicable for healthcare research. The study consisted of two phases: (i) instrument adaptation, including forward and back translation, expert committee comparisons and cognitive interviewing, and (ii) empirical testing of the Danish version (BIQLI-DA) with subsequent psychometric evaluation. Hypothesised correlations to other measures, including body mass index (BMI), Medical Outcome Short Form-8 (SF-8), Patient Health Questionnaire-9 (PHQ-9), General Anxiety Disorder-7 and Symptom Check List-90-Revised (SCL-90-R ® ) were tested. In addition, exploratory factor structure analysis (EFA) and internal consistency on item and scale level were performed. The adapted instrument was found to be semantically sound, yet concerns about face validity did arise through cognitive interviews. Danish college students (n = 189, 65 men, M age = 21.1 years) participated in the piloting of the BIQLI-DA. Convergent construct validity was demonstrated through associations to related constructs. Exploratory factor analysis revealed a potential subscale structure. Finally, results showed a high internal consistency (Cronbach's alpha = 0.92). Support for the validity of the BIQLI-DA might have been strengthened by repeating cognitive interviews after layout alterations, by piloting the instrument on a larger sample. This study demonstrated tentative support for the validity of the Danish Body Image Quality of Life (BIQLI-DA) and found the measure to be reliable in terms of internal consistency. Further exploration of response processes and construct validity is needed. © 2016 Nordic College of Caring Science.
Developing a validation for environmental sustainability
NASA Astrophysics Data System (ADS)
Adewale, Bamgbade Jibril; Mohammed, Kamaruddeen Ahmed; Nawi, Mohd Nasrun Mohd; Aziz, Zulkifli
2016-08-01
One of the agendas for addressing environmental protection in construction is to reduce impacts and make the construction activities more sustainable. This important consideration has generated several research interests within the construction industry, especially considering the construction damaging effects on the ecosystem, such as various forms of environmental pollution, resource depletion and biodiversity loss on a global scale. Using Partial Least Squares-Structural Equation Modeling technique, this study validates environmental sustainability (ES) construct in the context of large construction firms in Malaysia. A cross-sectional survey was carried out where data was collected from Malaysian large construction firms using a structured questionnaire. Results of this study revealed that business innovativeness and new technology are important in determining environmental sustainability (ES) of the Malaysian construction firms. It also established an adequate level of internal consistency reliability, convergent validity and discriminant validity for each of this study's constructs. And based on this result, it could be suggested that the indicators for organisational innovativeness dimensions (business innovativeness and new technology) are useful to measure these constructs in order to study construction firms' tendency to adopt environmental sustainability (ES) in their project execution.
Screening for learning disabilities in young adult career counseling.
Kasler, Jon; Fawcett, Angela
2009-01-01
The Strengths and Weaknesses Academic Profile (SWAP) was constructed in Israel in response to the local need of career counselors for a valid, reliable, comprehensive, parsimonious, and computerized screening device for identifying those likely to be at risk of learning disabilities (LD). The method chosen was self-report. A set of cognitive items was written and divided into seven scales: reading, writing, attention and memory, computation, English as a foreign language (EFL), study skills, and self-image. The screening tool was validated on a research sample in Sheffield, UK, based on comparison of the results obtained from the screening with the results of standardized diagnosis of learning disabilities administered to the respondents. The questionnaire was administered to 39 students, half of them diagnosed for dyslexia and half tested and found to be free of dyslexia. Results indicate that SWAP is a reliable and valid questionnaire, with a classification power of approximately 90%. The questionnaire is now widely used in Israel, where an Internet site has been constructed to administer the questionnaire and provide immediate and direct results.
Construction of a short form of the healthy eating behaviour inventory for the Japanese population.
Shimazaki, Takashi; Iio, Misa; Lee, Ying-Hua; Suzuki, Akiko; Konuma, Kayo; Teshima, Yoko; Takenaka, Koji
2016-09-01
The present study constructed a short form of the Healthy Eating Behavior Inventory suitable for use in the Japanese culture (HEBI-J) and confirmed the scale's preliminary reliability and validity. In Study 1, Japanese adults (N=75) completed a free-response questionnaire about healthy eating behaviours. Thematic analysis was used to identify three factors-balance, pattern, and restriction-associated with healthy eating behaviours and 12 items related to these healthy eating behaviours. In Study 2, Japanese office workers (N=784) completed two questionnaires regarding 12 items of healthy eating behaviours and the dietary stages of change. Confirmatory factor analysis demonstrated that the three-factor model was appropriate (GFI=.94, AGFI=.90, CFI=.90, RMSEA=.08, SRMR=.53). Reliability was confirmed by alpha coefficients and the Spearman-Brown formula. Concurrent validity was confirmed by the relationship between the healthy eating stages. This study demonstrated the preliminary reliability and validity of the HEBI-J. Copyright © 2015 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
Web-Based Assessment of Visual and Visuospatial Symptoms in Parkinson's Disease
Amick, Melissa M.; Miller, Ivy N.; Neargarder, Sandy; Cronin-Golomb, Alice
2012-01-01
Visual and visuospatial dysfunction is prevalent in Parkinson's disease (PD). To promote assessment of these often overlooked symptoms, we adapted the PD Vision Questionnaire for Internet administration. The questionnaire evaluates visual and visuospatial symptoms, impairments in activities of daily living (ADLs), and motor symptoms. PD participants of mild to moderate motor severity (n = 24) and healthy control participants (HC, n = 23) completed the questionnaire in paper and web-based formats. Reliability was assessed by comparing responses across formats. Construct validity was evaluated by reference to performance on measures of vision, visuospatial cognition, ADLs, and motor symptoms. The web-based format showed excellent reliability with respect to the paper format for both groups (all P′s < 0.001; HC completing the visual and visuospatial section only). Demonstrating the construct validity of the web-based questionnaire, self-rated ADL and visual and visuospatial functioning were significantly associated with performance on objective measures of these abilities (all P′s < 0.01). The findings indicate that web-based administration may be a reliable and valid method of assessing visual and visuospatial and ADL functioning in PD. PMID:22530162
ERIC Educational Resources Information Center
Lee, Hee-Sun; Liu, Ou Lydia; Linn, Marcia C.
2011-01-01
This study explores measurement of a construct called knowledge integration in science using multiple-choice and explanation items. We use construct and instructional validity evidence to examine the role multiple-choice and explanation items plays in measuring students' knowledge integration ability. For construct validity, we analyze item…
Validity and Reliability of Psychosocial Factors Related to Breast Cancer Screening.
ERIC Educational Resources Information Center
Zapka, Jane G.; And Others
1991-01-01
The construct validity of hypothesized survey items and data reduction procedures for selected psychosocial constructs frequently used in breast cancer screening research were investigated in telephone interviews with randomly selected samples of 1,184 and 903 women and a sample of 169 Hispanic clinic clients. Validity of the constructs is…
Nelson, Lindsay D.; Patrick, Christopher J.; Bernat, Edward M.
2010-01-01
The externalizing dimension is viewed as a broad dispositional factor underlying risk for numerous disinhibitory disorders. Prior work has documented deficits in event-related brain potential (ERP) responses in individuals prone to externalizing problems. Here, we constructed a direct physiological index of externalizing vulnerability from three ERP indicators and evaluated its validity in relation to criterion measures in two distinct domains: psychometric and physiological. The index was derived from three ERP measures that covaried in their relations with externalizing proneness the error-related negativity and two variants of the P3. Scores on this ERP composite predicted psychometric criterion variables and accounted for externalizing-related variance in P3 response from a separate task. These findings illustrate how a diagnostic construct can be operationalized as a composite (multivariate) psychophysiological variable (phenotype). PMID:20573054
Kyte, Derek; Cockwell, Paul; Marshall, Tom; Gheorghe, Adrian; Keeley, Thomas; Slade, Anita; Calvert, Melanie
2017-01-01
Background Patient-reported outcome measures (PROMs) can provide valuable information which may assist with the care of patients with chronic kidney disease (CKD). However, given the large number of measures available, it is unclear which PROMs are suitable for use in research or clinical practice. To address this we comprehensively evaluated studies that assessed the measurement properties of PROMs in adults with CKD. Methods Four databases were searched; reference list and citation searching of included studies was also conducted. The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist was used to appraise the methodological quality of the included studies and to inform a best evidence synthesis for each PROM. Results The search strategy retrieved 3,702 titles/abstracts. After 288 duplicates were removed, 3,414 abstracts were screened and 71 full-text articles were retrieved for further review. Of these, 24 full-text articles were excluded as they did not meet the eligibility criteria. Following reference list and citation searching, 19 articles were retrieved bringing the total number of papers included in the final analysis to 66. There was strong evidence supporting internal consistency and moderate evidence supporting construct validity for the Kidney Disease Quality of Life-36 (KDQOL-36) in pre-dialysis patients. In the dialysis population, the KDQOL-Short Form (KDQOL-SF) had strong evidence for internal consistency and structural validity and moderate evidence for test-retest reliability and construct validity while the KDQOL-36 had moderate evidence of internal consistency, test-retest reliability and construct validity. The End Stage Renal Disease-Symptom Checklist Transplantation Module (ESRD-SCLTM) demonstrated strong evidence for internal consistency and moderate evidence for test-retest reliability, structural and construct validity in renal transplant recipients. Conclusions We suggest considering the KDQOL-36 for use in pre-dialysis patients; the KDQOL-SF or KDQOL-36 for dialysis patients and the ESRD-SCLTM for use in transplant recipients. However, further research is required to evaluate the measurement error, structural validity, responsiveness and patient acceptability of PROMs used in CKD. PMID:28636678
Reviewing the psychometric properties of contemporary circadian typology measures.
Di Milia, Lee; Adan, Ana; Natale, Vincenzo; Randler, Christoph
2013-12-01
The accurate measurement of circadian typology (CT) is critical because the construct has implications for a number of health disorders. In this review, we focus on the evidence to support the reliability and validity of the more commonly used CT scales: the Morningness-Eveningness Questionnaire (MEQ), reduced Morningness-Eveningness Questionnaire (rMEQ), the Composite Scale of Morningness (CSM), and the Preferences Scale (PS). In addition, we also consider the Munich ChronoType Questionnaire (MCTQ). In terms of reliability, the MEQ, CSM, and PS consistently report high levels of reliability (>0.80), whereas the reliability of the rMEQ is satisfactory. The stability of these scales is sound at follow-up periods up to 13 mos. The MCTQ is not a scale; therefore, its reliability cannot be assessed. Although it is possible to determine the stability of the MCTQ, these data are yet to be reported. Validity must be given equal weight in assessing the measurement properties of CT instruments. Most commonly reported is convergent and construct validity. The MEQ, rMEQ, and CSM are highly correlated and this is to be expected, given that these scales share common items. The level of agreement between the MCTQ and the MEQ is satisfactory, but the correlation between these two constructs decreases in line with the number of "corrections" applied to the MCTQ. The interesting question is whether CT is best represented by a psychological preference for behavior or by using a biomarker such as sleep midpoint. Good-quality subjective and objective data suggest adequate construct validity for each of the CT instruments, but a major limitation of this literature is studies that assess the predictive validity of these instruments. We make a number of recommendations with the aim of advancing science. Future studies need to (1) focus on collecting data from representative samples that consider a number of environmental factors; (2) employ longitudinal designs to allow the predictive validity of CT measures to be assessed and preferably make use of objective data; (3) employ contemporary statistical approaches, including structural equation modeling and item-response models; and (4) provide better information concerning sample selection and a rationale for choosing cutoff points.
Hibbard, S; Tang, P C; Latko, R; Park, J H; Munn, S; Bolz, S; Somerville, A
2000-12-01
Thematic Apperception Test (Murray, 1943) responses of 69 Asian American (hereafter, Asian) and 83 White students were coded for defenses according to the Defense Mechanism Manual (Cramer, 1991b) and studied for differential validity in predicting paper-and-pencil measures of relevant constructs. Three tests for differential validity were used: (a) differences between validity coefficients, (b) interactions between predictor and ethnicity in criterion prediction, and (c) differences between groups in mean prediction errors using a common regression equation. Modest differential validity was found. It was surprising that the DMM scales were slightly stronger predictors of their criteria among Asians than among Whites and when a common predictor was used, desirable criteria were overpredicted for Asians, whereas undesirable ones were overpredicted for Whites. The results were not affected by acculturation level or English vocabulary among the Asians.
Seen Heng, Yeoh; Sidi, Hatta; Nik Jaafar, Nik Ruzyanei; Razali, Rosdinom; Ram, Hari
2013-04-01
This cross-sectional study aimed to determine the construct of the phases of the female sexual response cycle (SRC) among women attending an infertility clinic in a Malaysian tertiary center. The sexual response phases were measured with a validated Malay version of the Female Sexual Function Index (FSFI). The correlation structure of the items of the SRC phases (i.e. desire, arousal, orgasm, satisfaction and pain) was determined using principal component analysis (PCA), with varimax rotation method. The number of factors obtained was decided using Kaiser's criteria. A total of 150 married women with a mean age of 32 years participated in this study. Factor loadings using PCA with varimax rotation divided the sexual domains into three components. The first construct comprised sexual arousal, lubrication and pain (suggesting a mechanical component). The second construct were orgasm and sexual satisfaction (suggesting a physical achievement). Sexual desire, suggesting a psychological component, stood on its own as the third. The findings suggest that three constructs could be identified and in favor of the Basson model (a non-linear concept of SRC) for Malaysian women's sexual functioning. Understanding this would help clinicians to strategize the treatment approach of sexual dysfunction in women with infertility. Copyright © 2013 Wiley Publishing Asia Pty Ltd.
Development and validation of a mass casualty conceptual model.
Culley, Joan M; Effken, Judith A
2010-03-01
To develop and validate a conceptual model that provides a framework for the development and evaluation of information systems for mass casualty events. The model was designed based on extant literature and existing theoretical models. A purposeful sample of 18 experts validated the model. Open-ended questions, as well as a 7-point Likert scale, were used to measure expert consensus on the importance of each construct and its relationship in the model and the usefulness of the model to future research. Computer-mediated applications were used to facilitate a modified Delphi technique through which a panel of experts provided validation for the conceptual model. Rounds of questions continued until consensus was reached, as measured by an interquartile range (no more than 1 scale point for each item); stability (change in the distribution of responses less than 15% between rounds); and percent agreement (70% or greater) for indicator questions. Two rounds of the Delphi process were needed to satisfy the criteria for consensus or stability related to the constructs, relationships, and indicators in the model. The panel reached consensus or sufficient stability to retain all 10 constructs, 9 relationships, and 39 of 44 indicators. Experts viewed the model as useful (mean of 5.3 on a 7-point scale). Validation of the model provides the first step in understanding the context in which mass casualty events take place and identifying variables that impact outcomes of care. This study provides a foundation for understanding the complexity of mass casualty care, the roles that nurses play in mass casualty events, and factors that must be considered in designing and evaluating information-communication systems to support effective triage under these conditions.
Developing and testing the patient-centred innovation questionnaire for hospital nurses.
Huang, Ching-Yuan; Weng, Rhay-Hung; Wu, Tsung-Chin; Lin, Tzu-En; Hsu, Ching-Tai; Hung, Chiu-Hsia; Tsai, Yu-Chen
2018-03-01
Develop the patient-centred innovation questionnaire for hospital nurses and establish its validity and reliability. Patient-centred care has been adopted by health care managers in their efforts to improve health care quality. It is regarded as a core concept for developing innovation. A cross-sectional study was employed to collect data from hospital nurses in Taiwan. This study was divided into two stages: pilot study and main study. In the main study, 596 valid responses were collected. This study adopted reliability analysis, exploratory factor analysis, confirmatory factor analysis and selected nurse innovation scale as a criterion to test criterion-related validity. Five-dimension patient-centred innovation questionnaire was proposed: access and practicability, co-ordination and communication, sharing power and responsibility, care continuity, family and person focus. Each dimension demonstrated a reliability of 0.89-0.98. All dimensions had acceptable convergent and discriminate validity. The patient-centred innovation questionnaire and nurse innovation scale exhibited a significantly positive correlation. Patient-centred innovation questionnaire not only had a good theoretical basis but also had sufficient reliability and construct validity, and criterion-related validity. Patient-centred innovation questionnaire could give a measure for evaluating the implementation of patient-centred care and could be used as a management tool during the process of nurse innovation. © 2017 John Wiley & Sons Ltd.
Arbab, Dariusch; Kuhlmann, Katharina; Ringendahl, Hubert; Bouillon, Bertil; Eysel, Peer; König, Dietmar
2017-06-13
Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopaedic procedures. The intention of this study was to develop and culturally adapt a German version of the Manchester-Oxford Foot Questionnaire (MOXFQ) and to evaluate reliability, validity and responsiveness. According to guidelines forward and backward translation has been performed. The German MOXFQ was investigated in 177 consecutive patients before and 6 months after foot or ankle surgery. All patients completed MOXFQ, Foot and Ankle Outcome Score (FAOS), Short form 36 and numeric scales for pain and disability (NRS). Test-Retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German MOXFQ demonstrated excellent test-retest reliability with ICC values >0.9 Cronbach's alpha (α) values demonstrated strong internal consistency. No floor or ceiling effects were observed. As hypothesized MOXFQ subscales correlated strongly with corresponding FAOS and SF-36 domains. All subscales showed excellent (ES/SRM >0.8) responsiveness between preoperative assessment and postoperative follow-up. The German version of the MOXFQ demonstrated good psychometric properties. It proofed to be a valid and reliable instrument for use in foot and ankle patients. Copyright © 2017 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.
Inspection of Construction Works According to Polish Construction Law
NASA Astrophysics Data System (ADS)
Czemplik, A.
2015-11-01
Construction regulations are still different in many European countries, even though the European Union directives have unified many acts for construction works and construction products in the member countries. The scheme of the construction process presented in the paper could be valid for most countries, despite of detailed regulations of legal systems. The number of construction regulations to be followed in order to get the Construction Permit in Poland is rather big, so the time between the start of the investment process and the day when the Construction Permit is issued could be several months. Only licensed professional engineers can play the role of site managers, site inspectors and designers, registered for the given construction project. Duties and responsibilities (civil liability) of these engineers are strictly defined by regulations. The obligatory inspection of construction works should be executed by the licensed site inspectors. Moreover, the works can be incidentally inspected by Authority, banks, insurance companies or designers. Foreign designers and foreign site engineers in order to be allowed by respective Authority to play official roles on Polish construction sites should present documents proving that they can do the same jobs in their countries as per regulations obligatory there.
ERIC Educational Resources Information Center
Jenkins-Guarnieri, Michael A.; Vaughan, Angela L.; Wright, Stephen L.
2015-01-01
We adapted a work self-determination measure to create the Basic Needs Satisfaction at College Scale. Confirmatory factor analysis and item response theory analyses with data from 525 adults supported a 3-factor model with 13 items most sensitive for lower to middle range levels of the autonomy, competence, and relatedness constructs.
ERIC Educational Resources Information Center
Gardner, Robby Christopher
2009-01-01
The primary objective of this study was to compare the individual teacher interview scores from the Interactive Computer Interview System (ICIS) with their students' responses to "The Steps to Excellence Student Questionnaire". Specifically, the study examined the correlation among the teacher interviews across four themes of the ICIS ("Knowledge…
ERIC Educational Resources Information Center
Edwards, Jeffrey R.; O'Neill, Regina M.
1998-01-01
Confirmatory factor analysis was used to evaluate alternative factor structures, based on previous exploratory factor analyses and coping dimensions derived from the theory of R. Lazarus, for the Ways of Coping Questionnaire (S. Folkman and R. Lazarus, 1988). Results from responses of 654 college graduates provide little support for the factor…
The Reliability and Construct Validity of American College Students' Responses to the WHOQOL-BREF
ERIC Educational Resources Information Center
D'Abundo, Michelle; Orsini, M. M.; Milroy, J. J.; Sidman, C. L.
2011-01-01
The World Health Organization Quality of Life (WHOQOL-100) instrument was developed to assess quality of life from a multi-dimensional perspective. A shorter 26-item version of the instrument was created called the WHOQOL-BREF, which is the focus of this study. Based on previous research, it is unclear if the WHOQOL-BREF instrument is appropriate…
Development of a Measure of Drinking and Driving Expectancies for Youth
ERIC Educational Resources Information Center
McCarthy, Denis M.; Pedersen, Sarah L.; Thompsen, Dana M.; Leuty, Melanie E.
2006-01-01
The present study constructs and provides initial validation for a measure of positive expectancies for drinking and driving for use with adolescents and young adults (PEDD-Y). In Study 1, items were generated through open-ended responses from high school- and college-age youth. Data collected from a 2nd sample of college students (n = 404)…
Vibroacoustic Model Validation for a Curved Honeycomb Composite Panel
NASA Technical Reports Server (NTRS)
Buehrle, Ralph D.; Robinson, Jay H.; Grosveld, Ferdinand W.
2001-01-01
Finite element and boundary element models are developed to investigate the vibroacoustic response of a curved honeycomb composite sidewall panel. Results from vibroacoustic tests conducted in the NASA Langley Structural Acoustic Loads and Transmission facility are used to validate the numerical predictions. The sidewall panel is constructed from a flexible honeycomb core sandwiched between carbon fiber reinforced composite laminate face sheets. This type of construction is being used in the development of an all-composite aircraft fuselage. In contrast to conventional rib-stiffened aircraft fuselage structures, the composite panel has nominally uniform thickness resulting in a uniform distribution of mass and stiffness. Due to differences in the mass and stiffness distribution, the noise transmission mechanisms for the composite panel are expected to be substantially different from those of a conventional rib-stiffened structure. The development of accurate vibroacoustic models will aide in the understanding of the dominant noise transmission mechanisms and enable optimization studies to be performed that will determine the most beneficial noise control treatments. Finite element and boundary element models of the sidewall panel are described. Vibroacoustic response predictions are presented for forced vibration input and the results are compared with experimental data.
Simblett, Sara K; Badham, Rachel; Greening, Kate; Adlam, Anna; Ring, Howard; Bateman, Andrew
2012-01-01
Assessment of everyday problems with executive functioning following acquired brain injury (ABI) is greatly valued by neurorehabilitation services. Reliance on self-report measures alone is problematic within this client group who may experience difficulties with awareness and memory. The construct validity and reliability of independent ratings (i.e., ratings provided by a carer/relative) on the Dysexecutive Questionnaire (DEX-I) was explored in this study. Consistent with the results recently reported on the self-rated version of the DEX (DEX-S; Simblett & Bateman, 2011 ), Rasch analysis completed on 271 responses to the DEX-I revealed that the scale did not fit the Rasch model and did not meet the assumption of unidimensionality, that is, a single underlying construct could not be found for the DEX-I that would allow development of an interval-level measure as a whole. Subscales, based on theoretical conceptualisations of executive functioning (Stuss, 2007 ) previously suggested for the DEX-S, were able to demonstrate fit to the Rasch model and unidimensionality. Reliability of independent responses to these subscales in comparison to self-reported ratings is discussed. These results contribute to a greater understanding of how assessment of executive functioning can be improved.
Burger, Elise; Selles, Ruud; van Nieuwkasteele, Shelly; Bessems, Gert; Pollet, Virginie; Hovius, Steven; van Nieuwenhoven, Christianne
2017-11-04
The purpose of this study is to develop a Dutch version of the Oxford Ankle and Foot Questionnaire for Children (OxAFQ-c) to allow evaluation of pediatric foot care. The OxAFQ-c was translated into Dutch, according to the ISPOR-guidelines. Children with different foot and ankle complaints completed the OxAFQ-c at baseline, after two weeks, and after 4-6 months. Measurement properties were assessed in terms of reliability, responsiveness, and construct validity. Test-retest reliability showed moderate intraclass correlation coefficients. Bland-Altman plots showed wide limits of agreement. After 4-6 months, the group that experienced improvement also showed improved questionnaire outcomes, indicating responsiveness. Moderate correlation between the OxAFQ-c and the Kidscreen and foot-specific VAS-scores were observed, indicating moderate construct validity. The Dutch OxAFQ-c showed moderate to good measurement properties. However, because we observed limited sensitivity to changes and wide limits of agreement in individual patients, we think the questionnaire should only be used in groups. Copyright © 2017 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.
Ishii, Hitoshi; Shimatsu, Akira; Okimura, Yasuhiko; Tanaka, Toshiaki; Hizuka, Naomi; Kaji, Hidesuke; Hanew, Kunihiko; Oki, Yutaka; Yamashiro, Sayuri; Takano, Koji; Chihara, Kazuo
2012-01-01
Objective To develop and validate the Adult Hypopituitarism Questionnaire (AHQ) as a disease-specific, self-administered questionnaire for evaluation of quality of life (QOL) in adult patients with hypopituitarism. Methods We developed and validated this new questionnaire, using a standardized procedure which included item development, pilot-testing and psychometric validation. Of the patients who participated in psychometric validation, those whose clinical conditions were judged to be stable were asked to answer the survey questionnaire twice, in order to assess test-retest reliability. Results Content validity of the initial questionnaire was evaluated via two pilot tests. After these tests, we made minor revisions and finalized the initial version of the questionnaire. The questionnaire was constructed with two domains, one psycho-social and the other physical. For psychometric assessment, analyses were performed on the responses of 192 adult patients with various types of hypopituitarism. The intraclass correlations of the respective domains were 0.91 and 0.95, and the Cronbach’s alpha coefficients were 0.96 and 0.95, indicating adequate test-retest reliability and internal consistency for each domain. For known-group validity, patients with hypopituitarism due to hypothalamic disorder showed significantly lower scores in 11 out of 13 sub-domains compared to those who had hypopituitarism due to pituitary disorder. Regarding construct validity, the domain structure was found to be almost the same as that initially hypothesized. Exploratory factor analysis (n = 228) demonstrated that each domain consisted of six and seven sub-domains. Conclusion The AHQ showed good reliability and validity for evaluating QOL in adult patients with hypopituitarism. PMID:22984490
Development and validation of the simulation-based learning evaluation scale.
Hung, Chang-Chiao; Liu, Hsiu-Chen; Lin, Chun-Chih; Lee, Bih-O
2016-05-01
The instruments that evaluate a student's perception of receiving simulated training are English versions and have not been tested for reliability or validity. The aim of this study was to develop and validate a Chinese version Simulation-Based Learning Evaluation Scale (SBLES). Four stages were conducted to develop and validate the SBLES. First, specific desired competencies were identified according to the National League for Nursing and Taiwan Nursing Accreditation Council core competencies. Next, the initial item pool was comprised of 50 items related to simulation that were drawn from the literature of core competencies. Content validity was established by use of an expert panel. Finally, exploratory factor analysis and confirmatory factor analysis were conducted for construct validity, and Cronbach's coefficient alpha determined the scale's internal consistency reliability. Two hundred and fifty students who had experienced simulation-based learning were invited to participate in this study. Two hundred and twenty-five students completed and returned questionnaires (response rate=90%). Six items were deleted from the initial item pool and one was added after an expert panel review. Exploratory factor analysis with varimax rotation revealed 37 items remaining in five factors which accounted for 67% of the variance. The construct validity of SBLES was substantiated in a confirmatory factor analysis that revealed a good fit of the hypothesized factor structure. The findings tally with the criterion of convergent and discriminant validity. The range of internal consistency for five subscales was .90 to .93. Items were rated on a 5-point scale from 1 (strongly disagree) to 5 (strongly agree). The results of this study indicate that the SBLES is valid and reliable. The authors recommend that the scale could be applied in the nursing school to evaluate the effectiveness of simulation-based learning curricula. Copyright © 2016 Elsevier Ltd. All rights reserved.
Validation of a Spanish version of the Spine Functional Index.
Cuesta-Vargas, Antonio I; Gabel, Charles P
2014-06-27
The Spine Functional Index (SFI) is a recently published, robust and clinimetrically valid patient reported outcome measure. The purpose of this study was the adaptation and validation of a Spanish-version (SFI-Sp) with cultural and linguistic equivalence. A two stage observational study was conducted. The SFI was cross-culturally adapted to Spanish through double forward and backward translation then validated for its psychometric characteristics. Participants (n = 226) with various spine conditions of >12 weeks duration completed the SFI-Sp and a region specific measure: for the back, the Roland Morris Questionnaire (RMQ) and Backache Index (BADIX); for the neck, the Neck Disability Index (NDI); for general health the EQ-5D and SF-12. The full sample was employed to determine internal consistency, concurrent criterion validity by region and health, construct validity and factor structure. A subgroup (n = 51) was used to determine reliability at seven days. The SFI-Sp demonstrated high internal consistency (α = 0.85) and reliability (r = 0.96). The factor structure was one-dimensional and supported construct validity. Criterion specific validity for function was high with the RMQ (r = 0.79), moderate with the BADIX (r = 0.59) and low with the NDI (r = 0.46). For general health it was low with the EQ-5D and inversely correlated (r = -0.42) and fair with the Physical and Mental Components of the SF-12 and inversely correlated (r = -0.56 and r = -0.48), respectively. The study limitations included the lack of longitudinal data regarding other psychometric properties, specifically responsiveness. The SFI-Sp was demonstrated as a valid and reliable spine-regional outcome measure. The psychometric properties were comparable to and supported those of the English-version, however further longitudinal investigations are required.
The medial tibial stress syndrome score: a new patient-reported outcome measure.
Winters, Marinus; Moen, Maarten H; Zimmermann, Wessel O; Lindeboom, Robert; Weir, Adam; Backx, Frank Jg; Bakker, Eric Wp
2016-10-01
At present, there is no validated patient-reported outcome measure (PROM) for patients with medial tibial stress syndrome (MTSS). Our aim was to select and validate previously generated items and create a valid, reliable and responsive PROM for patients with MTSS: the MTSS score. A prospective cohort study was performed in multiple sports medicine, physiotherapy and military facilities in the Netherlands. Participants with MTSS filled out the previously generated items for the MTSS score on 3 occasions. From previously generated items, we selected the best items. We assessed the MTSS score for its validity, reliability and responsiveness. The MTSS score was filled out by 133 participants with MTSS. Factor analysis showed the MTSS score to exhibit a single-factor structure with acceptable internal consistency (α=0.58) and good test-retest reliability (intraclass correlation coefficient=0.81). The MTSS score ranges from 0 to 10 points. The smallest detectable change in our sample was 0.69 at the group level and 4.80 at the individual level. Construct validity analysis showed significant moderate-to-large correlations (r=0.34-0.52, p<0.01). Responsiveness of the MTSS score was confirmed by a significant relation with the global perceived effect scale (β=-0.288, R(2)=0.21, p<0.001). The MTSS score is a valid, reliable and responsive PROM to measure the severity of MTSS. It is designed to evaluate treatment outcomes in clinical studies. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
The Cerebral Palsy Quality of Life for Children (CP QOL-Child): Evidence of Construct Validity
ERIC Educational Resources Information Center
Chen, Kuan-Lin; Wang, Hui-Yi; Tseng, Mei-Hui; Shieh, Jeng-Yi; Lu, Lu; Yao, Kai-Ping Grace; Huang, Chien-Yu
2013-01-01
The Cerebral Palsy Quality of Life for Children (CP QOL-Child) is the first health condition-specific questionnaire designed for measuring QOL in children with cerebral palsy (CP). However, its construct validity has not yet been confirmed by confirmatory factor analysis (CFA). Hence, this study assessed the construct validity of the caregiver…
ERIC Educational Resources Information Center
Gold, Bernadette; Holodynski, Manfred
2015-01-01
The current study describes the development and construct validation of a situational judgment test for assessing the strategic knowledge of classroom management in elementary schools. Classroom scenarios and accompanying courses of action were constructed, of which 17 experts confirmed the content validity. A pilot study and a cross-validation…
Alotaibi, Naser M; Aljadi, Sameera H; Alrowayeh, Hesham N
2016-12-01
To investigate the psychometric properties (reliability, validity and responsiveness) of the DASH-Arabic in a cohort of Arabic patients presenting with various upper extremity conditions. Participants were 139 patients with various upper extremity conditions, who completed the DASH-Arabic at the baseline, 2-5 days later and 30-36 days later. Participants completed demographic data forms, the SF-36 and VAS at baseline, and a Global Rating of Change scale at first and second follow-ups. Cronbach's alpha of the DASH-Arabic was 0.94. Test-retest reliability was excellent with an ICC of 0.97. The SEM was 3.50 and the MDC95 was 9.28. Construct validity of the DASH-Arabic with the SF-36 subscales and VAS scores ranged from r -0.32 to -0.57, all statistically significant (p < 0.001). The effect size (ES) for the DASH-Arabic was 1.39 and its standard response mean was 1.51. The area under the curve was 0.82 (95% CI = 0.72-0.92, p < 0.001). The optimally efficient cutoff for an improvement was found to be a difference of 15 DASH points. The DASH-Arabic is a reliable, valid and responsive upper extremity outcome measure for patients whose primary language is Arabic; it can be used to document patient status and outcomes and support evidence-based practice. Implications for Rehabilitation The DASH-Arabic demonstrated sound psychometric properties of reliability, validity and responsiveness. It is an effective patient status and outcome tool that will support evidence-based practice. This tool is recommended for evaluating upper extremity work-related injuries and tracking therapeutic outcomes.
NASA Astrophysics Data System (ADS)
Setiawan, Jody; Nakazawa, Shoji
2017-10-01
This paper discusses about comparison of seismic response behaviors, seismic performance and seismic loss function of a conventional special moment frame steel structure (SMF) and a special moment frame steel structure with base isolation (BI-SMF). The validation of the proposed simplified estimation method of the maximum deformation of the base isolation system by using the equivalent linearization method and the validation of the design shear force of the superstructure are investigated from results of the nonlinear dynamic response analysis. In recent years, the constructions of steel office buildings with seismic isolation system are proceeding even in Indonesia where the risk of earthquakes is high. Although the design code for the seismic isolation structure has been proposed, there is no actual construction example for special moment frame steel structure with base isolation. Therefore, in this research, the SMF and BI-SMF buildings are designed by Indonesian Building Code which are assumed to be built at Padang City in Indonesia. The material of base isolation system is high damping rubber bearing. Dynamic eigenvalue analysis and nonlinear dynamic response analysis are carried out to show the dynamic characteristics and seismic performance. In addition, the seismic loss function is obtained from damage state probability and repair cost. For the response analysis, simulated ground accelerations, which have the phases of recorded seismic waves (El Centro NS, El Centro EW, Kobe NS and Kobe EW), adapted to the response spectrum prescribed by the Indonesian design code, that has, are used.
Jalenques, I; Auclair, C; Roblin, J; Morand, D; Tourtauchaux, R; May, R; Vaille-Perret, E; Watts, J; Gerbaud, L; De Leo, D
2013-04-01
To cross-culturally adapt a French version of the LEIPAD, a self-administered questionnaire assessing the health-related quality of life (HRQoL) in adults aged 65 years and over living at home, and to evaluate its psychometric properties. After having translated LEIPAD in accordance with guidelines, we studied psychometric properties: reliability and construct validity-factor analysis, relationships between items and scales, internal consistency, concurrent validity with the Medical Outcome Study Short-Form 36 and known-groups validity. The results obtained in a sample of 195 elderly from the general population showed very good acceptability, with response rates superior to 93 %. Exploratory factor analysis extracted eight factors providing a multidimensionality structure with five misclassifications of items in the seven theoretical scales. Good internal consistency (Cronbach's alpha ranging from 0.73 and 0.86) and strong test-retest reliability (ICCs higher than 0.80 for six scales and 0.70 for one) were demonstrated. Concurrent validity with the SF-36 showed small to strong expected correlations. This first evaluation of the French version of LEIPAD's psychometric properties provides evidence in construct validity and reliability. It would allow HRQoL assessment in clinical and common practice, and investigators would be able to take part in national and international research projects.
Mojtabai, Ramin; Corey-Lisle, Patricia K; Ip, Edward Hak-Sing; Kopeykina, Irina; Haeri, Sophia; Cohen, Lisa Janet; Shumaker, Sally
2012-12-30
Investigation of patients' subjective perspective regarding the effectiveness - as opposed to efficacy - of antipsychotic medication has been hampered by a relative shortage of self-report measures of global clinical outcome. This paper presents data supporting the feasibility, inter-item consistency, and construct validity of the Patient Assessment Questionnaire (PAQ)-a self-report measure of psychiatric symptoms, medication side effects and general wellbeing, ultimately intended to assess effectiveness of interventions for schizophrenia-spectrum patients. The original 53-item instrument was developed by a multidisciplinary team which utilized brainstorming sessions for item generation and content analysis, patient focus groups, and expert panel reviews. This instrument and additional validation measures were administered, via Audio Computer-Assisted Self-Interviewing (ACASI), to 300 stable, medicated outpatients diagnosed with schizophrenia or schizoaffective disorder. Item elimination was based on psychometric properties and Item-Response Theory information functions and characteristic curves. Exploratory factor analysis of the resulting 40-item scale yielded a five factor solution. The five subscales (General Distress, Side Effects, Psychotic Symptoms, Cognitive Symptoms, Sleep) showed robust convergent (β's=0.34-0.75, average β=0.49) and discriminant validity. The PAQ demonstrates feasibility, reliability, and construct validity as a self-report measure of multiple domains pertinent to effectiveness. Future research needs to establish the PAQ's sensitivity to change. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Developing evaluation instrument based on CIPP models on the implementation of portfolio assessment
NASA Astrophysics Data System (ADS)
Kurnia, Feni; Rosana, Dadan; Supahar
2017-08-01
This study aimed to develop an evaluation instrument constructed by CIPP model on the implementation of portfolio assessment in science learning. This study used research and development (R & D) method; adapting 4-D by the development of non-test instrument, and the evaluation instrument constructed by CIPP model. CIPP is the abbreviation of Context, Input, Process, and Product. The techniques of data collection were interviews, questionnaires, and observations. Data collection instruments were: 1) the interview guidelines for the analysis of the problems and the needs, 2) questionnaire to see level of accomplishment of portfolio assessment instrument, and 3) observation sheets for teacher and student to dig up responses to the portfolio assessment instrument. The data obtained was quantitative data obtained from several validators. The validators consist of two lecturers as the evaluation experts, two practitioners (science teachers), and three colleagues. This paper shows the results of content validity obtained from the validators and the analysis result of the data obtained by using Aikens' V formula. The results of this study shows that the evaluation instrument based on CIPP models is proper to evaluate the implementation of portfolio assessment instruments. Based on the experts' judgments, practitioners, and colleagues, the Aikens' V coefficient was between 0.86-1,00 which means that it is valid and can be used in the limited trial and operational field trial.
Bell, Cheryl; Johnston, Derek; Allan, Julia; Pollard, Beth; Johnston, Marie
2017-05-01
The Demand-Control (DC) and Effort-Reward Imbalance (ERI) models predict health in a work context. Self-report measures of the four key constructs (demand, control, effort, and reward) have been developed and it is important that these measures have good content validity uncontaminated by content from other constructs. We assessed relevance (whether items reflect the constructs) and representativeness (whether all aspects of the construct are assessed, and all items contribute to that assessment) across the instruments and items. Two studies examined fourteen demand/control items from the Job Content Questionnaire and seventeen effort/reward items from the Effort-Reward Imbalance measure using discriminant content validation and a third study developed new methods to assess instrument representativeness. Both methods use judges' ratings and construct definitions to get transparent quantitative estimates of construct validity. Study 1 used dictionary definitions while studies 2 and 3 used published phrases to define constructs. Overall, 3/5 demand items, 4/9 control items, 1/6 effort items, and 7/11 reward items were uniquely classified to the appropriate theoretical construct and were therefore 'pure' items with discriminant content validity (DCV). All pure items measured a defining phrase. However, both the DC and ERI assessment instruments failed to assess all defining aspects. Finding good discriminant content validity for demand and reward measures means these measures are usable and our quantitative results can guide item selection. By contrast, effort and control measures had limitations (in relevance and representativeness) presenting a challenge to the implementation of the theories. Statement of contribution What is already known on this subject? While the reliability and construct validity of Demand-Control and Effort-Reward-Imbalance (DC and ERI) work stress measures are routinely reported, there has not been adequate investigation of their content validity. This paper investigates their content validity in terms of both relevance and representativeness and provides a model for the investigation of content validity of measures in health psychology more generally. What does this study add? A new application of an existing method, discriminant content validity, and a new method of assessing instrument representativeness. 'Pure' DC and ERI items are identified, as are constructs that are not fully represented by their assessment instruments. The findings are important for studies attempting to distinguish between the main DC and ERI work stress constructs. The quantitative results can be used to guide item selection for future studies. © 2017 The British Psychological Society.
Patient Reported Outcome Measure of Spiritual Care as Delivered by Chaplains.
Snowden, Austyn; Telfer, Iain
2017-01-01
Chaplains are employed by health organizations around the world to support patients in recognizing and addressing their spiritual needs. There is currently no generalizable measure of the impact of these interventions and so the clinical and strategic worth of chaplaincy is difficult to articulate. This article introduces the Scottish PROM, an original five-item patient reported outcome measure constructed specifically to address this gap. It describes the validation process from its conceptual grounding in the spiritual care literature through face and content validity cycles. It shows that the Scottish PROM is internally consistent and unidimensional. Responses to the Scottish PROM show strong convergent validity with responses to the Warwick and Edinburgh Mental Well-Being Scale, a generic well-being scale often used as a proxy for spiritual well-being. In summary, the Scottish PROM is fit for purpose. It measures the outcomes of spiritual care as delivered by chaplains in this study. This novel project introduces an essential and original breakthrough; the possibility of generalizable international chaplaincy research.
Segot-Chicq, Evelyne; Fanchon, Chantal
2013-12-01
The 28-item Menopausal Skin Quality Of Life (MSkinQOL), a previously validated French questionnaire, developed to assess psychological features of menopausal women and to measure the benefits of using cosmetic skincare products was translated and validated to assess a skincare product in the USA. Construct validity, reliability, reproducibility, and responsiveness were assessed with two groups of 100 nonmenopausal (NM) and 100 postmenopausal (PM) women. The group of PM women applied a specially developed skincare product twice daily for 1 month and filled in the same questionnaire after 1 month as well as a general self-assessment questionnaire about the efficacy and cosmetic properties of the product. No ceiling or floor effects were identified. Construct and internal validity was assessed using a multitrait analysis: questionnaire items proved closely correlated, and each dimension covers a different aspect of women answers profile. The three dimensions showed good reliability and stability. Baseline values for social effects of skin appearance, health status, and self-esteem were significantly different between PM and NM volunteers. Values of these three dimensions were significantly improved after 2 weeks of product application, and further improved after 4 weeks. This study shows that a careful translation and a rigorous process of validation lead to a reliable tool adapted to each country to explore and measure quality of life in healthy PM women. © 2013 Wiley Periodicals, Inc.
Taffarel, Marilda Onghero; Luna, Stelio Pacca Loureiro; de Oliveira, Flavia Augusta; Cardoso, Guilherme Schiess; Alonso, Juliana de Moura; Pantoja, Jose Carlos; Brondani, Juliana Tabarelli; Love, Emma; Taylor, Polly; White, Kate; Murrell, Joanna C
2015-04-01
Quantification of pain plays a vital role in the diagnosis and management of pain in animals. In order to refine and validate an acute pain scale for horses a prospective, randomized, blinded study was conducted. Twenty-four client owned adult horses were recruited and allocated to one of four following groups: anaesthesia only (GA); pre-emptive analgesia and anaesthesia (GAA,); anaesthesia, castration and postoperative analgesia (GC); or pre-emptive analgesia, anaesthesia and castration (GCA). One investigator, unaware of the treatment group, assessed all horses at time-points before and after intervention and completed the pain scale. Videos were also obtained at these time-points and were evaluated by a further four blinded evaluators who also completed the scale. The data were used to investigate the relevance, specificity, criterion validity and inter- and intra-observer reliability of each item on the pain scale, and to evaluate construct validity and responsiveness of the scale. Construct validity was demonstrated by the observed differences in scores between the groups, four hours after anaesthetic recovery and before administration of systemic analgesia in the GC group. Inter- and intra-observer reliability for the items was only satisfactory. Subsequently the pain scale was refined, based on results for relevance, specificity and total item correlation. Scale refinement and exclusion of items that did not meet predefined requirements generated a selection of relevant pain behaviours in horses. After further validation for reliability, these may be used to evaluate pain under clinical and experimental conditions.
Mapping a multidimensional emotion in response to television commercials.
Morris, Jon D; Klahr, Nelson J; Shen, Feng; Villegas, Jorge; Wright, Paul; He, Guojun; Liu, Yijun
2009-03-01
Unlike previous emotional studies using functional neuroimaging that have focused on either locating discrete emotions in the brain or linking emotional response to an external behavior, this study investigated brain regions in order to validate a three-dimensional construct--namely pleasure, arousal, and dominance (PAD) of emotion induced by marketing communication. Emotional responses to five television commercials were measured with Advertisement Self-Assessment Manikins (AdSAM) for PAD and with functional magnetic resonance imaging (fMRI) to identify corresponding patterns of brain activation. We found significant differences in the AdSAM scores on the pleasure and arousal rating scales among the stimuli. Using the AdSAM response as a model for the fMRI image analysis, we showed bilateral activations in the inferior frontal gyri and middle temporal gyri associated with the difference on the pleasure dimension, and activations in the right superior temporal gyrus and right middle frontal gyrus associated with the difference on the arousal dimension. These findings suggest a dimensional approach of constructing emotional changes in the brain and provide a better understanding of human behavior in response to advertising stimuli.
Nigg, Claudio R; Motl, Robert W; Horwath, Caroline; Dishman, Rod K
2012-01-01
Objectives Physical activity (PA) research applying the Transtheoretical Model (TTM) to examine group differences and/or change over time requires preliminary evidence of factorial validity and invariance. The current study examined the factorial validity and longitudinal invariance of TTM constructs recently revised for PA. Method Participants from an ethnically diverse sample in Hawaii (N=700) completed questionnaires capturing each TTM construct. Results Factorial validity was confirmed for each construct using confirmatory factor analysis with full-information maximum likelihood. Longitudinal invariance was evidenced across a shorter (3-month) and longer (6-month) time period via nested model comparisons. Conclusions The questionnaires for each validated TTM construct are provided, and can now be generalized across similar subgroups and time points. Further validation of the provided measures is suggested in additional populations and across extended time points. PMID:22778669
The NIH analytical methods and reference materials program for dietary supplements.
Betz, Joseph M; Fisher, Kenneth D; Saldanha, Leila G; Coates, Paul M
2007-09-01
Quality of botanical products is a great uncertainty that consumers, clinicians, regulators, and researchers face. Definitions of quality abound, and include specifications for sanitation, adventitious agents (pesticides, metals, weeds), and content of natural chemicals. Because dietary supplements (DS) are often complex mixtures, they pose analytical challenges and method validation may be difficult. In response to product quality concerns and the need for validated and publicly available methods for DS analysis, the US Congress directed the Office of Dietary Supplements (ODS) at the National Institutes of Health (NIH) to accelerate an ongoing methods validation process, and the Dietary Supplements Methods and Reference Materials Program was created. The program was constructed from stakeholder input and incorporates several federal procurement and granting mechanisms in a coordinated and interlocking framework. The framework facilitates validation of analytical methods, analytical standards, and reference materials.
Moghadam, Manije; Salavati, Mahyar; Sahaf, Robab; Rassouli, Maryam; Moghadam, Mojgan; Kamrani, Ahmad Ali Akbari
2018-03-01
After forward-backward translation, the LSS was administered to 334 Persian speaking, cognitively healthy elderly aged 60 years and over recruited through convenience sampling. To analyze the validity of the model's constructs and the relationships between the constructs, a confirmatory factor analysis followed by PLS analysis was performed. The Construct validity was further investigated by calculating the correlations between the LSS and the "Short Form Health Survey" (SF-36) subscales measuring similar and dissimilar constructs. The LSS was re-administered to 50 participants a month later to assess the reliability. For the eight-factor model of the life satisfaction construct, adequate goodness of fit between the hypothesized model and the model derived from the sample data was attained (positive and statistically significant beta coefficients, good R-squares and acceptable GoF). Construct validity was supported by convergent and discriminant validity, and correlations between the LSS and SF-36 subscales. Minimum Intraclass Correlation Coefficient level of 0.60 was exceeded by all subscales. Minimum level of reliability indices (Cronbach's α, composite reliability and indicator reliability) was exceeded by all subscales. The Persian-version of the Life Satisfaction Scale is a reliable and valid instrument, with psychometric properties which are consistent with the original version.
Characterising the perceived value of mathematics educational apps in preservice teachers
NASA Astrophysics Data System (ADS)
Handal, Boris; Campbell, Chris; Cavanagh, Michael; Petocz, Peter
2016-03-01
This study validated the semantic items of three related scales aimed at characterising the perceived worth of mathematics-education-related mobile applications (apps). The technological pedagogical content knowledge (TPACK) model was used as the conceptual framework for the analysis. Three hundred and seventy-three preservice students studying primary school education from two public and one private Australian universities participated in the study. The respondents examined three different apps using a purposively designed instrument in regard to either their explorative, productive or instructive instructional role. While construct validity could not be established due to a broad range of variability in responses implying a high degree of subjectivity in respondents' judgments, the qualitative analysis was effective in establishing content validity.
Oo, W M; Linklater, J M; Daniel, M; Saarakkala, S; Samuels, J; Conaghan, P G; Keen, H I; Deveza, L A; Hunter, D J
2018-05-01
The aims of this study were to systematically review clinimetrics of commonly assessed ultrasound pathologies in knee, hip and hand osteoarthritis (OA), and to conduct a meta-analysis for each clinimetric. Medline, Embase, and Cochrane Library databases were searched from their inceptions to September 2016. According to the Outcome Measures in Rheumatology (OMERACT) Instrument Selection Algorithm, data extraction focused on ultrasound technical features and performance metrics. Methodological quality was assessed with modified 19-item Downs and Black score and 11-item Quality Appraisal of Diagnostic Reliability (QAREL) score. Separate meta-analyses were performed for clinimetrics: (1) inter-rater/intra-rater reliability; (2) construct validity; (3) criteria validity; and (4) internal/external responsiveness. Statistical Package for the Social Sciences (SPSS), Excel and Comprehensive Meta-analysis were used. Our search identified 1126 records; of these, 100 were eligible, including a total of 8542 patients and 32,373 joints. The average Downs and Black score was 13.01, and average QAREL was 5.93. The stratified meta-analysis was performed only for knee OA, which demonstrated moderate to substantial reliability [minimum kappa > 0.44(0.15,0.74), minimum intraclass correlation coefficient (ICC) > 0.82(0.73-0.89)], weak construct validity against pain (r = 0.12 to 0.27), function (r = 0.15 to 0.23), and blood biomarkers (r = 0.01 to 0.21), but weak to strong correlation with plain radiography (r = 0.13 to 0.60), strong association with Magnetic Resonance Imaging (MRI) [minimum r = 0.60(0.52,0.67)] and strong discrimination against symptomatic patients (OR = 3.08 to 7.46). There was strong criterion validity against cartilage histology [r = 0.66(-0.05,0.93)], and small to moderate internal [standardized mean difference(SMD) = 0.20 to 0.58] and external (r = 0.35 to 0.43) responsiveness to interventions. Ultrasound demonstrated strong criterion validity with cartilage histology, poor to strong correlation with patient findings and MRI, moderate reliability, and low responsiveness to interventions. CRD42016039954. Copyright © 2018 Osteoarthritis Research Society International. All rights reserved.
Deutsch, Judith E; Romney, Wendy; Reynolds, Jan; Manal, Tara Jo
2015-10-08
PTNow.org is an evidence-based, on-line portal created by a professional membership association to promote use of evidence in practice and to help decrease unwarranted variation in practice. The site contains synthesis documents designed to promote efficient clinical reasoning. These documents were written and peer-reviewed by teams of content experts and master clinicians. The purpose of this paper is to report on the content and construct validity as well as usability of the site. Physical therapist participants used clinical summaries (available in 3 formats--as a full summary with hyperlinks, "quick takes" with hyperlinks, and a portable two-page version) on the PTNow.org site to answer knowledge acquisition and clinical reasoning questions related to four patient scenarios. They also responded to questions about ease of use related to website navigation and about format and completeness of information using a 1-5 Likert scale. Responses were coded to reflect how participants used the site and then were summarized descriptively. Preferences for clinical summary format were analyzed using an analysis of variance (ANOVA) and a Dunnett T3 post hoc analysis. Seventeen participants completed the study. Clinical relevance and completeness ratings by experienced clinicians, which were used as the measure of content validity, ranged from 3.1 to 4.6 on a 5 point scale. Construct validity based on the information on the PTNow.org site was supported for knowledge acquisition questions 66 % of the time and for clinical reasoning questions 40 % of the time. Usability ratings for the full clinical summary were 4.6 (1.2); for the quick takes, 3.5 (.98); and for the portable clinical summary, 4.0 (.45). Participants preferred the full clinical summary over the other two formats (F = 5.908, P = 0.007). One hundred percent of the participants stated that they would recommend the PTNow site to their colleagues. Prelimary evidence supported both content validity and construct validity of knowledge acquisition, and partially supported construct validity of clinical reasoning for the clinical summaries on the PTNow.org site. Usability was supported, with users preferring the full clinical summary over the other two formats. Iterative design is ongoing.
Cancer-related Concerns of Spouses of Women with Breast Cancer
Fletcher, Kristin A.; Lewis, Frances Marcus; Haberman, Mel R.
2009-01-01
Objective To describe spouses' reported cancer-related demands attributed to their wife's breast cancer and to test the construct and predictive validity of a brief standardized measure of these demands. Methods Cross-sectional and longitudinal data were obtained from 151 spouses of women newly diagnosed with non-metastatic breast cancer. Descriptive statistics were computed to describe spouses' dominant cancer-related demands and multivariate regression analyses tested the construct and predictive validity of the standardized measure. Results Five categories of spouses' cancer-related demands were identified, such as concerns about: spouses' own functioning; wife's well being and response to treatment; couples' sexual activities; the family's and children's well-being; and the spouses' role in supporting their wives. A 33-item short version of the standardized measure of cancer demands demonstrated construct and predictive validity that was comparable to a 123-item version of the same questionnaire. Greater numbers of illness demands occurred when spouses were more depressed and had less confidence in their ability to manage the impact of the cancer (F=18.08 (3, 103), p<.001). Predictive validity was established by the short form's ability to significantly predict the quality of marital communication and spouses' self-efficacy at a two-month interval. Conclusion The short-version of the standardized measure of cancer-related demands shows promise for future application in clinic settings. Additional testing of the questionnaire is warranted. Spouses' breast cancer-related demands deserve attention by providers. In the absence of assisting them, spouses' illness pressures have deleterious consequences for the quality of marital communication and spouses' self-confidence. PMID:20014184
Rossi, Gina; Debast, Inge; van Alphen, S P J
2017-07-01
The dimensional personality disorders model in the Diagnostic and Statistical Manual (DSM)-5 section III conceptually differentiates impaired personality functioning (criterion A) from the presence of pathological traits (criterion B). This study is the first to specifically address the measurement of criterion A in older adults. Moreover, the convergent/divergent validity of criterion A and criterion B will be compared in younger and older age groups. The Severity Indices of Personality Functioning - Short Form (SIPP-SF) was administered in older (N = 171) and younger adults (N = 210). The factorial structure was analyzed with exploratory structural equation modeling. Differences in convergent/divergent validity between personality functioning (SIPP-SF) and pathological traits (Personality Inventory for DSM-5; Dimensional Assessment of Personality Pathology-Basic Questionnaire) were examined across age groups. Identity Integration, Relational Capacities, Responsibility, Self-Control, and Social Concordance were corroborated as higher order domains. Although the SIPP-SF domains measured unique variation, some high correlations with pathological traits referred to overlapping constructs. Moreover, in older adults, personality functioning was more strongly related to Psychoticism, Disinhibition, Antagonism and Dissocial Behavior compared to younger adults. The SIPP-SF construct validity was demonstrated in terms of a structure of five higher order domains of personality functioning. The instrument is promising as a possible measure of impaired personality functioning in older adults. As such, it is a useful clinical tool to follow up effects of therapy on levels of personality functioning. Moreover, traits were associated with different degrees of personality functioning across age groups.
Wageck, Bruna Borges; de Noronha, Marcos; Lopes, Alexandre Dias; da Cunha, Ronaldo Alves; Takahashi, Ricardo Hisayoshi; Costa, Leonardo Oliveira Pena
2013-03-01
Clinical measurement. To translate, adapt, and test the measurement properties of the Brazilian Portuguese version of the Victorian Institute of Sport Assessment-Patella (VISA-P) questionnaire. It is important to objectively measure symptoms and functional limitations related to patellar tendinopathy using outcome measures that have been validated in the language of the target population. Cross-cultural adaptations are also useful to enhance the understanding of the measurement properties of an assessment tool, regardless of the target language. The VISA-P questionnaire was translated into Brazilian Portuguese, culturally adapted, and titled VISA-P Brazil. It was then administered on 2 occasions with a 24- to 48-hour interval between them, and a third time after a month of physical therapy treatment. The following measurement properties were analyzed: internal consistency, test-retest reliability, agreement, construct validity, floor and ceiling effects, and responsiveness. The VISA-P Brazil had high internal consistency (Cronbach α = .76; if item deleted, Cronbach α = .69-.78), excellent reliability and agreement (intraclass correlation coefficient = 0.91; 95% confidence interval: 0.85, 0.95; standard error of measurement, 5.2 points; minimal detectable change at the 90% confidence level, 12.2 points), and good construct validity (Pearson r = 0.60 compared to Lysholm). No ceiling and floor effects were detected for the VISA-P Brazil, and the responsiveness, based on 32 patients receiving physical therapy intervention for 1 month, demonstrated a large effect size of 0.97 (95% confidence interval: 0.68, 1.25). The VISA-P Brazil is a reproducible and responsive tool and can be used in clinical practice and research to assess the severity of pain and disability of patients with patellar tendinopathy.
Cozzi-Lepri, Alessandro; Prosperi, Mattia C F; Kjær, Jesper; Dunn, David; Paredes, Roger; Sabin, Caroline A; Lundgren, Jens D; Phillips, Andrew N; Pillay, Deenan
2011-01-01
The question of whether a score for a specific antiretroviral (e.g. lopinavir/r in this analysis) that improves prediction of viral load response given by existing expert-based interpretation systems (IS) could be derived from analyzing the correlation between genotypic data and virological response using statistical methods remains largely unanswered. We used the data of the patients from the UK Collaborative HIV Cohort (UK CHIC) Study for whom genotypic data were stored in the UK HIV Drug Resistance Database (UK HDRD) to construct a training/validation dataset of treatment change episodes (TCE). We used the average square error (ASE) on a 10-fold cross-validation and on a test dataset (the EuroSIDA TCE database) to compare the performance of a newly derived lopinavir/r score with that of the 3 most widely used expert-based interpretation rules (ANRS, HIVDB and Rega). Our analysis identified mutations V82A, I54V, K20I and I62V, which were associated with reduced viral response and mutations I15V and V91S which determined lopinavir/r hypersensitivity. All models performed equally well (ASE on test ranging between 1.1 and 1.3, p = 0.34). We fully explored the potential of linear regression to construct a simple predictive model for lopinavir/r-based TCE. Although, the performance of our proposed score was similar to that of already existing IS, previously unrecognized lopinavir/r-associated mutations were identified. The analysis illustrates an approach of validation of expert-based IS that could be used in the future for other antiretrovirals and in other settings outside HIV research.
Construct Validity of Fresh Frozen Human Cadaver as a Training Model in Minimal Access Surgery
Macafee, David; Pranesh, Nagarajan; Horgan, Alan F.
2012-01-01
Background: The construct validity of fresh human cadaver as a training tool has not been established previously. The aims of this study were to investigate the construct validity of fresh frozen human cadaver as a method of training in minimal access surgery and determine if novices can be rapidly trained using this model to a safe level of performance. Methods: Junior surgical trainees, novices (<3 laparoscopic procedure performed) in laparoscopic surgery, performed 10 repetitions of a set of structured laparoscopic tasks on fresh frozen cadavers. Expert laparoscopists (>100 laparoscopic procedures) performed 3 repetitions of identical tasks. Performances were scored using a validated, objective Global Operative Assessment of Laparoscopic Skills scale. Scores for 3 consecutive repetitions were compared between experts and novices to determine construct validity. Furthermore, to determine if the novices reached a safe level, a trimmed mean of the experts score was used to define a benchmark. Mann-Whitney U test was used for construct validity analysis and 1-sample t test to compare performances of the novice group with the benchmark safe score. Results: Ten novices and 2 experts were recruited. Four out of 5 tasks (nondominant to dominant hand transfer; simulated appendicectomy; intracorporeal and extracorporeal knot tying) showed construct validity. Novices’ scores became comparable to benchmark scores between the eighth and tenth repetition. Conclusion: Minimal access surgical training using fresh frozen human cadavers appears to have construct validity. The laparoscopic skills of novices can be accelerated through to a safe level within 8 to 10 repetitions. PMID:23318058
Hojat, Mohammadreza; Spandorfer, John; Isenberg, Gerald A; Vergare, Michael J; Fassihi, Reza; Gonnella, Joseph S
2012-01-01
Despite the emphasis placed on interdisciplinary education and interprofessional collaboration between physicians and pharmacologists, no psychometrically sound instrument is available to measure attitudes toward collaborative relationships. This study was designed to examine psychometrics of an instrument for measuring attitudes toward physician-pharmacist collaborative relationships for administration to students in medical and pharmacy schools and to physicians and pharmacists. The Scale of Attitudes Toward Physician-Pharmacist Collaboration was completed by 210 students at Jefferson Medical College. Factor analysis and correlational methods were used to examine psychometrics of the instrument. Consistent with the conceptual framework of interprofessional collaboration, three underlying constructs, namely "responsibility and accountability;" "shared authority;" and "interdisciplinary education" emerged from the factor analysis of the instrument providing support for its construct validity. The reliability coefficient alpha for the instrument was 0.90. The instrument's criterion-related validity coefficient with scores of a validated instrument (Jefferson Scale of Attitudes Toward Physician-Nurse Collaboration) was 0.70. Findings provide support for the validity and reliability of the instrument for medical students. The instrument has the potential to be used for the evaluation of interdisciplinary education in medical and pharmacy schools, and for the evaluation of patient outcomes resulting from collaborative physician-pharmacist relationships.
Validation of a measure of the Satter eating competence model with low-income females
2011-01-01
Background The purpose of this study was to evaluate the construct validity of a version of the ecSatter Inventory (ecSI), a measure of eating competence (EC), as adapted for use in a low-income (LI) population. Methods Females (n = 507), aged 18 to 45 years, living in households with a history of participating in the Supplemental Nutrition Assistance Program completed a web-based survey that included the ecSI for LI (ecSI/LI) and valid measures of cognitive and affective eating behavior, food preference and practice, and food preparation. Results Most correlations and differences between eating competent and non-eating competent categories and among EC tertiles were compatible with hypothesized relationships. ecSI/LI scores were positively related with self-reported physical activity, food acceptance, fruit and vegetable intake, and food planning/resource management. ecSI/LI scores were negatively associated with body mass index, dissatisfaction with body weight, tendency to overeat in response to external or emotional stimuli, and indices of psychosocial attributes related to disordered eating. Conclusions The ecSI/LI is a valid measure of EC for low-income females and provides a tool for researchers and educators to assess intervention outcomes and further explore the EC construct. PMID:21473765
Validation of a measure of the Satter eating competence model with low-income females.
Krall, Jodi S; Lohse, Barbara
2011-04-07
The purpose of this study was to evaluate the construct validity of a version of the ecSatter Inventory (ecSI), a measure of eating competence (EC), as adapted for use in a low-income (LI) population. Females (n=507), aged 18 to 45 years, living in households with a history of participating in the Supplemental Nutrition Assistance Program completed a web-based survey that included the ecSI for LI (ecSI/LI) and valid measures of cognitive and affective eating behavior, food preference and practice, and food preparation. Most correlations and differences between eating competent and non-eating competent categories and among EC tertiles were compatible with hypothesized relationships. ecSI/LI scores were positively related with self-reported physical activity, food acceptance, fruit and vegetable intake, and food planning/resource management. ecSI/LI scores were negatively associated with body mass index, dissatisfaction with body weight, tendency to overeat in response to external or emotional stimuli, and indices of psychosocial attributes related to disordered eating. The ecSI/LI is a valid measure of EC for low-income females and provides a tool for researchers and educators to assess intervention outcomes and further explore the EC construct. © 2011 Krall and Lohse; licensee BioMed Central Ltd.
La Porta, F; Giordano, A; Caselli, S; Foti, C; Franchignoni, F
2015-12-01
It is unclear whether the BBS is an effective tool for the measurement of early postural control impairments in patients with Parkinson's disease (PD). The aim of this paper was to evaluate BBS' content validity, internal construct validity, reliability and targeting in patients with PD within the Rasch analysis framework. Observational, cross-sectional study. Outpatient Rehabilitation Unit. A sample of 285 outpatients with PD. The content validity of the BBS was assessed using standard linking techniques. The BBS was administered by trained physiotherapists. The data collected then underwent Rasch analysis. Content validity analysis showed a lack of items assessing postural responses to tripping and slips and stability during walking. On Rasch analysis, the BBS failed the requirements of monotonicity, local independence, unidimensionality and invariance. After rescoring 7 items, grouping of locally dependent items into testlets, and deletion of the static sitting balance item because mistargeted and underdiscriminating, the Rasch-modified BBS for PD (BBS-PD) showed adequate internal construct validity (χ(2)24=39.693; P=0.023), including absence of differential item functioning (DIF) across gender and age, and was, as a whole, sufficiently precise for individual person measurement (PSI=0.894). However, the scale was not well targeted to the sample in view of the prevalence of higher scores. This study demonstrated the internal construct validity and reliability of the BBS-PD as a measurement tool for patients with PD within the Rasch analysis framework. However, the lack of items critical to the assessment of postural control impairments typical of PD, affected negatively the targeting, so that a significant percentage of patients was located in the higher ability range of the measurement continuum, where precision of measurement is reduced. These findings suggest that the BBS, even if modified, may not be an effective tool for the measurement of early postural control in patients with PD.
Development and Validation of the Consumer Health Activation Index.
Wolf, Michael S; Smith, Samuel G; Pandit, Anjali U; Condon, David M; Curtis, Laura M; Griffith, James; O'Conor, Rachel; Rush, Steven; Bailey, Stacy C; Kaplan, Gordon; Haufle, Vincent; Martin, David
2018-04-01
Although there has been increasing interest in patient engagement, few measures are publicly available and suitable for patients with limited health literacy. We sought to develop a Consumer Health Activation Index (CHAI) for use among diverse patients. Expert opinion, a systematic literature review, focus groups, and cognitive interviews with patients were used to create and revise a potential set of items. Psychometric testing guided by item response theory was then conducted among 301 English-speaking, community-dwelling adults. This included differential item functioning analyses to evaluate item performance across participant health literacy levels. To determine construct validity, CHAI scores were compared to scales measuring similar personality constructs. Associations between the CHAI and physical and mental health established predictive validity. A second study among 9,478 adults was used to confirm CHAI associations with health outcomes. Exploratory factor analyses revealed a single-factor solution with a 10-item scale. The CHAI showed good internal consistency (alpha = 0.81) and moderate test-retest reliability (ICC = 0.53). Reading grade level was found to be at the 6 th grade. Moderate to strong correlations were found with similar constructs (Multidimensional Health Locus of Control, r = 0.38, P < 0.001; Conscientiousness, r = 0.41, P < 0.001). Predictive validity was demonstrated through associations with functional health status measures (depression, r = -0.28, P < 0.001; anxiety, r = -0.22, P < 0.001; and physical functioning, r = 0.22, P < 0.001). In the validation sample, the CHAI was significantly associated with self-reported physical and mental health ( r = 0.31 and 0.32 respectively; both P < 0.001). The CHAI appears to be a valid, reliable, and easily administered tool that can be used to assess health activation among adults, including those with limited health literacy. Future studies should test the tool in actual use and explore further applications.
Beukinga, Roelof J; Hulshoff, Jan B; van Dijk, Lisanne V; Muijs, Christina T; Burgerhof, Johannes G M; Kats-Ugurlu, Gursah; Slart, Riemer H J A; Slump, Cornelis H; Mul, Véronique E M; Plukker, John Th M
2017-05-01
Adequate prediction of tumor response to neoadjuvant chemoradiotherapy (nCRT) in esophageal cancer (EC) patients is important in a more personalized treatment. The current best clinical method to predict pathologic complete response is SUV max in 18 F-FDG PET/CT imaging. To improve the prediction of response, we constructed a model to predict complete response to nCRT in EC based on pretreatment clinical parameters and 18 F-FDG PET/CT-derived textural features. Methods: From a prospectively maintained single-institution database, we reviewed 97 consecutive patients with locally advanced EC and a pretreatment 18 F-FDG PET/CT scan between 2009 and 2015. All patients were treated with nCRT (carboplatin/paclitaxel/41.4 Gy) followed by esophagectomy. We analyzed clinical, geometric, and pretreatment textural features extracted from both 18 F-FDG PET and CT. The current most accurate prediction model with SUV max as a predictor variable was compared with 6 different response prediction models constructed using least absolute shrinkage and selection operator regularized logistic regression. Internal validation was performed to estimate the model's performances. Pathologic response was defined as complete versus incomplete response (Mandard tumor regression grade system 1 vs. 2-5). Results: Pathologic examination revealed 19 (19.6%) complete and 78 (80.4%) incomplete responders. Least absolute shrinkage and selection operator regularization selected the clinical parameters: histologic type and clinical T stage, the 18 F-FDG PET-derived textural feature long run low gray level emphasis, and the CT-derived textural feature run percentage. Introducing these variables to a logistic regression analysis showed areas under the receiver-operating-characteristic curve (AUCs) of 0.78 compared with 0.58 in the SUV max model. The discrimination slopes were 0.17 compared with 0.01, respectively. After internal validation, the AUCs decreased to 0.74 and 0.54, respectively. Conclusion: The predictive values of the constructed models were superior to the standard method (SUV max ). These results can be considered as an initial step in predicting tumor response to nCRT in locally advanced EC. Further research in refining the predictive value of these models is needed to justify omission of surgery. © 2017 by the Society of Nuclear Medicine and Molecular Imaging.
Validity of peer grading using Calibrated Peer Review in a guided-inquiry, conceptual physics course
NASA Astrophysics Data System (ADS)
Price, Edward; Goldberg, Fred; Robinson, Steve; McKean, Michael
2016-12-01
Constructing and evaluating explanations are important science practices, but in large classes it can be difficult to effectively engage students in these practices and provide feedback. Peer review and grading are scalable instructional approaches that address these concerns, but which raise questions about the validity of the peer grading process. Calibrated Peer Review (CPR) is a web-based system that scaffolds peer evaluation through a "calibration" process where students evaluate sample responses and receive feedback on their evaluations before evaluating their peers. Guided by an activity theory framework, we developed, implemented, and evaluated CPR-based tasks in guided-inquiry, conceptual physics courses for future teachers and general education students. The tasks were developed through iterative testing and revision. Effective tasks had specific and directed prompts and evaluation instructions. Using these tasks, over 350 students at three universities constructed explanations or analyzed physical phenomena, and evaluated their peers' work. By independently assessing students' responses, we evaluated the CPR calibration process and compared students' peer reviews with expert evaluations. On the tasks analyzed, peer scores were equivalent to our independent evaluations. On a written explanation item included on the final exam, students in the courses using CPR outperformed students in similar courses using traditional writing assignments without a peer evaluation element. Our research demonstrates that CPR can be an effective way to explicitly include the science practices of constructing and evaluating explanations into large classes without placing a significant burden on the instructor.
Feletto, Eleonora; Wilson, Laura Kate; Roberts, Alison Sarah; Benrimoj, Shalom Isaac
2011-03-01
Community pharmacy is undergoing transformation with increasing pressure to build its capacity to deliver cognitive pharmaceutical services ("services"). The theoretical framework of organizational flexibility (OF) may be used to assess the capacity of community pharmacy to implement change programs and guide capacity-building initiatives. To test the applicability of an existing scale measuring OF to the industry of community pharmacy in Australia. A mail survey was used to test a preexisting scale measuring OF amended from 28 items to 20 items testing 3 underlying factors of operational, structural, and strategic flexibility in the Australian community pharmacy context. The sample was 2006 randomly-stratified community pharmacies. A confirmatory factor analysis was conducted to assess the validity and reliability of the 1-factor models for each underlying construct and the full measurement model. Responses were received from a total of 395 (19.7%) community pharmacies. The 1-factor models of operational, structural, and strategic flexibility fit the data with appropriate respecification. Overall, the favorable fit of the individual factor constructs suggested that the multiple-factor measurement model should be tested. However, this model did not yield an interpretable response. Operational flexibility covaried negatively to the other factors, whereas structural and strategic flexibility shared covariance. Despite this, the results highlighting the individual factor fit suggest the constructs have application to pharmacy. The individual OF constructs were useful in the development and initial testing of a scale adapted for community pharmacy. When further developed and validated, the scale could be used to identify group of pharmacies that require individualized assistance to build capacity and integrate services and other new endeavors. Copyright © 2011 Elsevier Inc. All rights reserved.
Barzilay, Shira; Yaseen, Zimri S; Hawes, Mariah; Gorman, Bernard; Altman, Rachel; Foster, Adriana; Apter, Alan; Rosenfield, Paul; Galynker, Igor
2018-01-01
Mental health professionals have a pivotal role in suicide prevention. However, they also often have intense emotional responses, or countertransference, during encounters with suicidal patients. Previous studies of the Therapist Response Questionnaire-Suicide Form (TRQ-SF), a brief novel measure aimed at probing a distinct set of suicide-related emotional responses to patients found it to be predictive of near-term suicidal behavior among high suicide-risk inpatients. The purpose of this study was to validate the TRQ-SF in a general outpatient clinic setting. Adult psychiatric outpatients ( N = 346) and their treating mental health professionals ( N = 48) completed self-report assessments following their first clinic meeting. Clinician measures included the TRQ-SF, general emotional states and traits, therapeutic alliance, and assessment of patient suicide risk. Patient suicidal outcomes and symptom severity were assessed at intake and one-month follow-up. Following confirmatory factor analysis of the TRQ-SF, factor scores were examined for relationships with clinician and patient measures and suicidal outcomes. Factor analysis of the TRQ-SF confirmed three dimensions: (1) affiliation, (2) distress, and (3) hope. The three factors also loaded onto a single general factor of negative emotional response toward the patient that demonstrated good internal reliability. The TRQ-SF scores were associated with measures of clinician state anger and anxiety and therapeutic alliance, independently of clinician personality traits after controlling for the state- and patient-specific measures. The total score and three subscales were associated in both concurrent and predictive ways with patient suicidal outcomes, depression severity, and clinicians' judgment of patient suicide risk, but not with global symptom severity, thus indicating specifically suicide-related responses. The TRQ-SF is a brief and reliable measure with a 3-factor structure. It demonstrates construct validity for assessing distinct suicide-related countertransference to psychiatric outpatients. Mental health professionals' emotional responses to their patients are concurrently indicative and prospectively predictive of suicidal thoughts and behaviors. Thus, the TRQ-SF is a useful tool for the study of countertransference in the treatment of suicidal patients and may help clinicians make diagnostic and therapeutic use of their own responses to improve assessment and intervention for individual suicidal patients.
Forrest, Christopher B; Ravens-Sieberer, Ulrike; Devine, Janine; Becker, Brandon D; Teneralli, Rachel; Moon, JeanHee; Carle, Adam; Tucker, Carole A; Bevans, Katherine B
2018-03-01
The purpose of this study is to describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Positive Affect item bank, child-report and parent-proxy editions. The initial item pool comprising 53 items, previously developed using qualitative methods, was administered to 1,874 children 8-17 years old and 909 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and construct validity. A total of 14 items were deleted, because of poor psychometric performance, and an 8-item short form constructed from the remaining 39 items was administered to a national sample of 1,004 children 8-17 years old, and 1,306 parents of children 5-17 years old. The combined sample was used in item response theory (IRT) calibration analyses. The final item bank appeared unidimensional, the items appeared locally independent, and the items were free from differential item functioning. The scales showed excellent reliability and convergent and discriminant validity. Positive affect decreased with children's age and was lower for those with a special health care need. After IRT calibration, we found that 4 and 8 item short forms had a high degree of precision (reliability) across a wide range of the latent trait (>4 SD units). The PROMIS Pediatric Positive Affect item bank and its short forms provide an efficient, precise, and valid assessment of positive affect in children and youth.
Tadić, Valerija; Cooper, Andrew; Cumberland, Phillippa; Lewando-Hundt, Gillian; Rahi, Jugnoo S
2016-01-01
To report piloting and initial validation of the VQoL_CYP, a novel age-appropriate vision-related quality of life (VQoL) instrument for self-reporting by children with visual impairment (VI). Participants were a random patient sample of children with VI aged 10-15 years. 69 patients, drawn from patient databases at Great Ormond Street Hospital and Moorfields Eye Hospital, United Kingdom, participated in piloting of the draft 47-item VQoL instrument, which enabled preliminary item reduction. Subsequent administration of the instrument, alongside functional vision (FV) and generic health-related quality of life (HRQoL) self-report measures, to 101 children with VI comprising a nationally representative sample enabled further item reduction and evaluation of psychometric properties using Rasch analysis. Construct validity was assessed through Pearson correlation coefficients. Item reduction through piloting (8 items removed for skewness and individual item response pattern) and validation (1 item removed for skewness and 3 for misfit in Rasch) produced a 35-item scale, with fit values within acceptable limits, no notable differential item functioning, good measurement precision, ordered response categories and acceptable targeting in Rasch. The VQoL_CYP showed good construct validity, correlating strongly with HRQoL scores, moderately with FV scores but not with acuity. Robust child-appropriate self-report VQoL measures for children with VI are necessary for understanding the broader impacts of living with a visual disability, distinguishing these from limited functioning per se. Future planned use in larger patient samples will allow further psychometric development of the VQoL_CYP as an adjunct to objective outcomes assessment.
2014-01-01
Background Early feeding practices lay the foundation for children’s eating habits and weight gain. Questionnaires are available to assess parental feeding but overlapping and inconsistent items, subscales and terminology limit conceptual clarity and between study comparisons. Our aim was to consolidate a range of existing items into a parsimonious and conceptually robust questionnaire for assessing feeding practices with very young children (<3 years). Methods Data were from 462 mothers and children (age 21–27 months) from the NOURISH trial. Items from five questionnaires and two study-specific items were submitted to a priori item selection, allocation and verification, before theoretically-derived factors were tested using Confirmatory Factor Analysis. Construct validity of the new factors was examined by correlating these with child eating behaviours and weight. Results Following expert review 10 factors were specified. Of these, 9 factors (40 items) showed acceptable model fit and internal reliability (Cronbach’s α: 0.61-0.89). Four factors reflected non-responsive feeding practices: ‘Distrust in Appetite’, ‘Reward for Behaviour’, ‘Reward for Eating’, and ‘Persuasive Feeding’. Five factors reflected structure of the meal environment and limits: ‘Structured Meal Setting’, ‘Structured Meal Timing’, ‘Family Meal Setting’, ‘Overt Restriction’ and ‘Covert Restriction’. Feeding practices generally showed the expected pattern of associations with child eating behaviours but none with weight. Conclusion The Feeding Practices and Structure Questionnaire (FPSQ) provides a new reliable and valid measure of parental feeding practices, specifically maternal responsiveness to children’s hunger/satiety signals facilitated by routine and structure in feeding. Further validation in more diverse samples is required. PMID:24898364
Spertus, John; Jones, Philip; Poler, Sherri; Rocha-Singh, Krishna
2004-02-01
The most common indication for treating patients with peripheral arterial disease is to improve their health status: their symptoms, function, and quality of life. Quantifying health status requires a valid, reproducible, and sensitive disease-specific measure. The Peripheral Artery Questionnaire (PAQ) is a 20-item questionnaire developed to meet this need by quantifying patients' physical limitations, symptoms, social function, treatment satisfaction, and quality of life. Psychometric and clinical properties of the PAQ were evaluated in a prospective cohort study of 44 patients undergoing elective percutaneous peripheral revascularization. To establish reproducibility, 2 assessments were performed 2 weeks apart and before revascularization. The change in scores before and 6 weeks after revascularization were used to determine the instruments' responsiveness and were compared with the Short Form-36 and the Walking Impairment Questionnaire. A series of cross-sectional analyses were performed to establish the construct validity of the PAQ. The 7 domains of the PAQ were internally reliable, with Cronbach alpha = 0.80 to 0.94. The test-retest reliability analyses revealed insignificant mean changes of 0.6 to 2.3 points (P = not significant for all). Conversely, the change after revascularization ranged from 13.7 to 41.9 points (P < or =.001 for all), reflecting substantial sensitivity of the PAQ to clinical improvement. The PAQ Summary Scale was the most sensitive of all scales tested. Construct validity was established by demonstrating correlations with other measures of patient health status. The PAQ is a valid, reliable, and responsive disease-specific measure for patients with peripheral arterial disease. It may prove to be a useful end point in clinical trials and a potential aid in disease management.
Hung, Man; Baumhauer, Judith F; Latt, L Daniel; Saltzman, Charles L; SooHoo, Nelson F; Hunt, Kenneth J
2013-11-01
In 2012, the American Orthopaedic Foot & Ankle Society(®) established a national network for collecting and sharing data on treatment outcomes and improving patient care. One of the network's initiatives is to explore the use of computerized adaptive tests (CATs) for patient-level outcome reporting. We determined whether the CAT from the NIH Patient Reported Outcome Measurement Information System(®) (PROMIS(®)) Physical Function (PF) item bank provides efficient, reliable, valid, precise, and adequately covered point estimates of patients' physical function. After informed consent, 288 patients with a mean age of 51 years (range, 18-81 years) undergoing surgery for common foot and ankle problems completed a web-based questionnaire. Efficiency was determined by time for test administration. Reliability was assessed with person and item reliability estimates. Validity evaluation included content validity from expert review and construct validity measured against the PROMIS(®) Pain CAT and patient responses based on tradeoff perceptions. Precision was assessed by standard error of measurement (SEM) across patients' physical function levels. Instrument coverage was based on a person-item map. Average time of test administration was 47 seconds. Reliability was 0.96 for person and 0.99 for item. Construct validity against the Pain CAT had an r value of -0.657 (p < 0.001). Precision had an SEM of less than 3.3 (equivalent to a Cronbach's alpha of ≥ 0.90) across a broad range of function. Concerning coverage, the ceiling effect was 0.32% and there was no floor effect. The PROMIS(®) PF CAT appears to be an excellent method for measuring outcomes for patients with foot and ankle surgery. Further validation of the PROMIS(®) item banks may ultimately provide a valid and reliable tool for measuring patient-reported outcomes after injuries and treatment.
Talip, Whadi-ah; Steyn, Nelia P; Visser, Marianne; Charlton, Karen E; Temple, Norman
2003-09-01
We wanted to develop and validate a test that assesses the knowledge and practices of health professionals (HPs) with regard to the role of nutrition, physical activity, and smoking cessation (lifestyle modification) in chronic diseases of lifestyle. A descriptive cross-sectional validation study was carried out. The validation design consisted of two phases, namely 1) test planning and development and 2) test evaluation. The study sample consisted of five groups of HPs: dietitians, dietetic interns, general practitioners, medical students, and nurses. The overall response rate was 58%, resulting in a sample size of 186 participants. A test was designed to evaluate the knowledge and practices of HPs. The test was first evaluated by an expert group to ensure content, construct, and face validity. Thereafter, the questionnaire was tested on five groups of HPs to test for criterion validity. Internal consistency was evaluated by Cronbach's alpha. An expert panel ensured content, construct, and face validity of the test. Groups with the most training and exposure to nutrition (dietitians and dietetic interns) had the highest group mean score, ranging from 61% to 88%, whereas those with limited nutrition training (general practitioners, medical students, and nurses) had significantly lower scores, ranging from 26% to 80%. This result demonstrated criterion validity. Internal consistency of the overall test demonstrated a Cronbach's alpha of 0.99. Most HPs identified the mass media as their main source of information on lifestyle modification. These HPs also identified lack of time, lack of patient compliance, and lack of knowledge as barriers that prevent them from providing counseling on lifestyle modification. The results of this study showed that this test instrument identifies groups of health professionals with adequate training (knowledge) in lifestyle modification and those who require further training (knowledge).
Godar, Sean C; Mosher, Laura J; Strathman, Hunter J; Gochi, Andrea M; Jones, Cori M; Fowler, Stephen C; Bortolato, Marco
2016-07-01
The D1CT-7 mouse is one of the best known animal models of Tourette syndrome (TS), featuring spontaneous tic-like behaviours sensitive to standard TS therapies; these characteristics ensure a high face and predictive validity of this model, yet its construct validity remains elusive. To address this issue, we studied the responses of D1CT-7 mice to two critical components of TS pathophysiology: the exacerbation of tic-like behaviours in response to stress and the presence of sensorimotor gating deficits, which are thought to reflect the perceptual alterations causing the tics. D1CT-7 and wild-type (WT) littermates were subjected to a 20 min session of spatial confinement (SC) within an inescapable, 10 cm wide cylindrical enclosure. Changes in plasma corticosterone levels, tic-like behaviours and other spontaneous responses were measured. SC-exposed mice were also tested for the prepulse inhibition (PPI) of the startle response (a sensorimotor gating index) and other TS-related behaviours, including open-field locomotion, novel object exploration and social interaction and compared with non-confined counterparts. SC produced a marked increase in corticosterone concentrations in both D1CT-7 and WT mice. In D1CT-7, but not WT mice, SC exacerbated tic-like and digging behaviours, and triggered PPI deficits and aggressive responses. Conversely, SC did not modify locomotor activity or novel object exploration in D1CT-7 mice. Both tic-like behaviours and PPI impairments in SC-exposed D1CT-7 mice were inhibited by standard TS therapies and D1 dopamine receptor antagonism. These findings collectively support the translational and construct validity of D1CT-7 mice with respect to TS. This article is part of a themed section on Updating Neuropathology and Neuropharmacology of Monoaminergic Systems. To view the other articles in this section visit http://onlinelibrary.wiley.com/doi/10.1111/bph.v173.13/issuetoc. © 2015 The British Pharmacological Society.
Annear, Michael J; Eccleston, Claire E; McInerney, Frances J; Elliott, Kate-Ellen J; Toye, Christine M; Tranter, Bruce K; Robinson, Andrew L
2016-06-01
To compare the psychometric performance of the Dementia Knowledge Assessment Scale (DKAS) and the Alzheimer's Disease Knowledge Scale (ADKS) when administered to a large international cohort before and after online dementia education. Comparative psychometric analysis with pre- and posteducation scale responses. The setting for this research encompassed 7,909 individuals from 124 countries who completed the 9-week Understanding Dementia Massive Open Online Course (MOOC). Volunteer respondents who completed the DKAS and ADKS before (n = 3,649) and after (n = 878) completion of the Understanding Dementia MOOC. Assessment and comparison of the DKAS and ADKS included evaluation of scale development procedures, interscale correlations, response distribution, internal consistency, and construct validity. The DKAS had superior internal consistency, wider response distribution with less ceiling effect, and better discrimination between pre- and posteducation scores and occupational cohorts than the ADKS. The 27-item DKAS is a reliable and preliminarily valid measure of dementia knowledge that is psychometrically and conceptually sound, overcomes limitations of existing instruments, and can be administered to diverse cohorts to measure baseline understanding and knowledge change. © 2016, Copyright the Authors Journal compilation © 2016, The American Geriatrics Society.
Strong, David R; Messer, Karen; Hartman, Sheri J; Conway, Kevin P; Hoffman, Allison C; Pharris-Ciurej, Nikolas; White, Martha; Green, Victoria R; Compton, Wilson M; Pierce, John
2015-07-01
Nicotine dependence (ND) is a key construct that organizes physiological and behavioral symptoms associated with persistent nicotine intake. Measurement of ND has focused primarily on cigarette smokers. Thus, validation of brief instruments that apply to a broad spectrum of tobacco product users is needed. We examined multiple domains of ND in a longitudinal national study of the United States population, the United States National Epidemiological Survey of Alcohol and Related Conditions (NESARC). We used methods based in item response theory to identify and validate increasingly brief measures of ND that included symptoms to assess ND similarly among cigarette, cigar, smokeless, and poly tobacco users. Confirmatory factor analytic models supported a single, primary dimension underlying symptoms of ND across tobacco use groups. Differential Item Functioning (DIF) analysis generated little support for systematic differences in response to symptoms of ND across tobacco use groups. We established significant concurrent and predictive validity of brief 3- and 5-symptom indices for measuring ND. Measuring ND across tobacco use groups with a common set of symptoms facilitates evaluation of tobacco use in an evolving marketplace of tobacco and nicotine products. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
ERIC Educational Resources Information Center
Spurgeon, Shawn L.
2017-01-01
Construct irrelevance (CI) and construct underrepresentation (CU) are 2 major threats to validity, yet they are rarely discussed within the counseling literature. This article provides information about the relevance of these threats to internal validity. An illustrative case example will be provided to assist counselors in understanding these…
Kim, Myoung-Hee; Cho, Young-Shin; Uhm, Wan-Sik; Kim, Sehyun; Bae, Sang-Cheol
2005-06-01
This study aimed to determine the cross-cultural adaptation and validation of the Korean version of the EQ-5D in rheumatic conditions. Translation, back-translation and cognitive debriefing were performed according to the EuroQol group's guidelines. For validity, 508 patients were recruited and administered the EQ-5D, Short-Form 36 and condition-specific measures. Construct validity and sensitivity were evaluated by testing a-priori hypotheses. For reliability, another 57 patients repeated the EQ-5D at 1-week interval, and intra-class correlations (ICC) and kappa statistics were estimated. For responsiveness, another 60 patients repeated it at 12-week interval within the context of clinical trial, and standardized response mean(SRM) were calculated. The cross-cultural adaptation produced no major modifications in the scale. The associations of the EQ-5D with the generic- and condition-specific measures were observed as expected in hypotheses: the higher EQ-5Dindex and EQ-5D(VAS) scores, the better health status by generic- or condition-specific measures, and the better functional class. The ICCs were 0.751 and 0.767, respectively, and kappa ranged from 0.455 to 0.772. The SRM were 0.649 and 0.410, respectively. The Korean EQ-5D exhibits good validity and sensitivity in various rheumatic conditions. Although its reliability and responsiveness were not excellent, it seems acceptable if condition-specific measures are applied together.
Validity of Self-Report Screening Scale for Elder Abuse: Women's Health Australia Study.
ERIC Educational Resources Information Center
Schofield, Margot J.; Mishra, Gita D.
2003-01-01
Examines the reliability and validity of the Vulnerability to Abuse Screening Scale (VASS) for the early identification of elder abuse. Results confirmed the VASS factor structure and construct validity. The Vulnerability and Coercion factors held the strongest face and construct validity for physical and psychological abuse. (Contains 52…
Assessing the stability of human locomotion: a review of current measures
Bruijn, S. M.; Meijer, O. G.; Beek, P. J.; van Dieën, J. H.
2013-01-01
Falling poses a major threat to the steadily growing population of the elderly in modern-day society. A major challenge in the prevention of falls is the identification of individuals who are at risk of falling owing to an unstable gait. At present, several methods are available for estimating gait stability, each with its own advantages and disadvantages. In this paper, we review the currently available measures: the maximum Lyapunov exponent (λS and λL), the maximum Floquet multiplier, variability measures, long-range correlations, extrapolated centre of mass, stabilizing and destabilizing forces, foot placement estimator, gait sensitivity norm and maximum allowable perturbation. We explain what these measures represent and how they are calculated, and we assess their validity, divided up into construct validity, predictive validity in simple models, convergent validity in experimental studies, and predictive validity in observational studies. We conclude that (i) the validity of variability measures and λS is best supported across all levels, (ii) the maximum Floquet multiplier and λL have good construct validity, but negative predictive validity in models, negative convergent validity and (for λL) negative predictive validity in observational studies, (iii) long-range correlations lack construct validity and predictive validity in models and have negative convergent validity, and (iv) measures derived from perturbation experiments have good construct validity, but data are lacking on convergent validity in experimental studies and predictive validity in observational studies. In closing, directions for future research on dynamic gait stability are discussed. PMID:23516062
Nyongesa, Moses K; Sigilai, Antipa; Hassan, Amin S; Thoya, Janet; Odhiambo, Rachael; Van de Vijver, Fons J R; Newton, Charles R J C; Abubakar, Amina
2017-01-01
Despite bearing the largest HIV-related burden, little is known of the Health-Related Quality of Life (HRQoL) among people living with HIV in sub-Saharan Africa. One of the factors contributing to this gap in knowledge is the lack of culturally adapted and validated measures of HRQoL that are relevant for this setting. We set out to adapt the Functional Assessment of HIV Infection (FAHI) Questionnaire, an HIV-specific measure of HRQoL, and evaluate its internal consistency and validity. The three phase mixed-methods study took place in a rural setting at the Kenyan Coast. Phase one involved a scoping review to describe the evidence base of the reliability and validity of FAHI as well as the geographical contexts in which it has been administered. Phase two involved in-depth interviews (n = 38) to explore the content validity, and initial piloting for face validation of the adapted FAHI. Phase three was quantitative (n = 103) and evaluated the internal consistency, convergent and construct validities of the adapted interviewer-administered questionnaire. In the first phase of the study, we identified 16 studies that have used the FAHI. Most (82%) were conducted in North America. Only seven (44%) of the reviewed studies reported on the psychometric properties of the FAHI. In the second phase, most of the participants (37 out of 38) reported satisfaction with word clarity and content coverage whereas 34 (89%) reported satisfaction with relevance of the items, confirming the face validity of the adapted questionnaire during initial piloting. Our participants indicated that HIV impacted on their physical, functional, emotional, and social wellbeing. Their responses overlapped with items in four of the five subscales of the FAHI Questionnaire establishing its content validity. In the third phase, the internal consistency of the scale was found to be satisfactory with subscale Cronbach's α ranging from 0.55 to 0.78. The construct and convergent validity of the tool were supported by acceptable factor loadings for most of the items on the respective sub-scales and confirmation of expected significant correlations of the FAHI subscale scores with scores of a measure of common mental disorders. The adapted interviewer-administered Swahili version of FAHI questionnaire showed initial strong evidence of good psychometric properties with satisfactory internal consistency and acceptable validity (content, face, and convergent validity). It gives impetus for further validation work, especially construct validity, in similar settings before it can be used for research and clinical purposes in the entire East African region.
Coordination of frontline defense mechanisms under severe oxidative stress.
Kaur, Amardeep; Van, Phu T; Busch, Courtney R; Robinson, Courtney K; Pan, Min; Pang, Wyming Lee; Reiss, David J; DiRuggiero, Jocelyne; Baliga, Nitin S
2010-07-01
Complexity of cellular response to oxidative stress (OS) stems from its wide-ranging damage to nucleic acids, proteins, carbohydrates, and lipids. We have constructed a systems model of OS response (OSR) for Halobacterium salinarum NRC-1 in an attempt to understand the architecture of its regulatory network that coordinates this complex response. This has revealed a multi-tiered OS-management program to transcriptionally coordinate three peroxidase/catalase enzymes, two superoxide dismutases, production of rhodopsins, carotenoids and gas vesicles, metal trafficking, and various other aspects of metabolism. Through experimental validation of interactions within the OSR regulatory network, we show that despite their inability to directly sense reactive oxygen species, general transcription factors have an important function in coordinating this response. Remarkably, a significant fraction of this OSR was accurately recapitulated by a model that was earlier constructed from cellular responses to diverse environmental perturbations--this constitutes the general stress response component. Notwithstanding this observation, comparison of the two models has identified the coordination of frontline defense and repair systems by regulatory mechanisms that are triggered uniquely by severe OS and not by other environmental stressors, including sub-inhibitory levels of redox-active metals, extreme changes in oxygen tension, and a sub-lethal dose of gamma rays.
Unnanuntana, Aasis; Ruangsomboon, Pakpoom; Keesukpunt, Worawut
2018-06-01
The 2-minute walk test (2mwt) is a performance-based test that evaluates functional recovery after total knee arthroplasty (TKA). This study evaluated its validity compared with the modified Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Oxford Knee Score (OKS), modified Knee Score, Numerical Pain Rating Scale, and Timed Up and Go test, and its responsiveness in assessing functional recovery in TKA patients. This prospective cohort study included 162 patients undergoing primary TKA between 2013 and 2015. We used patient-reported outcome measures (modified WOMAC, OKS, modified Knee Score, Numerical Pain Rating Scale) and performance-based tests (2mwt and Timed Up and Go test) at baseline and 3, 6, and 12 months postoperatively. The construct validity of 2mwt was determined between the 2mwt distances walked and other outcome measurements. To assess responsiveness, effect size and standardized response mean were analyzed. Minimal clinically important difference of 2mwt at 12 months after TKA was also calculated. All outcome measurements improved significantly from baseline to 3, 6, and 12 months postoperatively. Bivariate analysis revealed mild to moderate associations between the 2mwt and modified WOMAC function subscales, and moderate to strong associations with OKS. Mild to moderate correlations were found for pain and stiffness between 2mwt and other outcome measurements. The effect size and standardized response mean at 12 months were large, with a minimal clinically important difference of 12.7 m. 2mwt is a validated performance-based test with responsiveness properties. Being simple and easy to perform, it can be used routinely in clinical practice to evaluate functional recovery after TKA. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Kolodziejczyk, Julia K; Norman, Gregory J; Rock, Cheryl L; Arredondo, Elva M; Roesch, Scott C; Madanat, Hala; Patrick, Kevin
2016-01-01
This study evaluates the reliability and validity of the strategies for weight management (SWM) measure, a questionnaire that assesses weight management strategies for adults. The SWM includes 20 items that are categorized within the following subscales: (1) energy intake, (2) energy expenditure, (3) self-monitoring, and (4) self-regulation. Baseline and 6-month data were collected from 404 overweight/obese adults (mean age=22±3.8 years, 68% ethnic minority) enrolled in a randomized controlled trial aiming to reduce weight by improving diet and physical activity behaviours. Reliability and validity were assessed for each subscale separately. Cronbach alpha was conducted to assess reliability. Concurrent, construct I (sensitivity to the study treatment condition), and construct II (relationship to the outcomes) validity were assessed using linear regressions with the following outcome measures: weight, self-reported diet, and weekly energy expenditure. All subscales showed strong internal consistency. The strength of the validity evidence depended on subscale and validity type. The strongest validity evidence was concurrent validity of the energy intake and energy expenditure subscales; construct I validity of the energy intake and self-monitoring subscales; and construct II validity of the energy intake, energy expenditure, and self-regulation subscales. Results indicate that the SWM can be used to assess weight management strategies among an ethnically diverse sample of adults as each subscale showed evidence of reliability and select types of validity. As validity is an accumulation of evidence over multiple studies, this study provides initial reliability and validity evidence in one population segment. Copyright © 2015 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
Loeb, Danielle F; Crane, Lori A; Leister, Erin; Bayliss, Elizabeth A; Ludman, Evette; Binswanger, Ingrid A; Kline, Danielle M; Smith, Meredith; deGruy, Frank V; Nease, Donald E; Dickinson, L Miriam
Develop and validate self-efficacy scales for primary care provider (PCP) mental illness management and team-based care participation. We developed three self-efficacy scales: team-based care (TBC), mental illness management (MIM), and chronic medical illness (CMI). We developed the scales using Bandura's Social Cognitive Theory as a guide. The survey instrument included items from previously validated scales on team-based care and mental illness management. We administered a mail survey to 900 randomly selected Colorado physicians. We conducted exploratory principal factor analysis with oblique rotation. We constructed self-efficacy scales and calculated standardized Cronbach's alpha coefficients to test internal consistency. We calculated correlation coefficients between the MIM and TBC scales and previously validated measures related to each scale to evaluate convergent validity. We tested correlations between the TBC and the measures expected to correlate with the MIM scale and vice versa to evaluate discriminant validity. PCPs (n=402, response rate=49%) from diverse practice settings completed surveys. Items grouped into factors as expected. Cronbach's alphas were 0.94, 0.88, and 0.83 for TBC, MIM, and CMI scales respectively. In convergent validity testing, the TBC scale was correlated as predicted with scales assessing communications strategies, attitudes toward teams, and other teamwork indicators (r=0.25 to 0.40, all statistically significant). Likewise, the MIM scale was significantly correlated with several items about knowledge and experience managing mental illness (r=0.24 to 41, all statistically significant). As expected in discriminant validity testing, the TBC scale had only very weak correlations with the mental illness knowledge and experience managing mental illness items (r=0.03 to 0.12). Likewise, the MIM scale was only weakly correlated with measures of team-based care (r=0.09 to.17). This validation study of MIM and TBC self-efficacy scales showed high internal validity and good construct validity. Copyright © 2016 Elsevier Inc. All rights reserved.
Mindfulness: A systematic review of instruments to measure an emergent patientreported outcome (PRO)
Park, Taehwan; Reilly-Spong, Maryanne
2013-01-01
Purpose Mindfulness has emerged as an important health concept based on evidence that mindfulness interventions reduce symptoms and improve health-related quality of life. The objectives of this study were to systematically assess and compare the properties of instruments to measure self-reported mindfulness. Methods Ovid Medline®, CINAHL®, and PsycINFO® were searched through May 2012, and articles were selected if their primary purpose was development or evaluation of the measurement properties (validity, reliability, responsiveness) of a self-report mindfulness scale. Two reviewers independently evaluated the methodological quality of the selected studies using the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) checklist. Discrepancies were discussed with a third reviewer, and scored by consensus. Finally, a level of evidence approach was used to synthesize results and study quality. Results Our search strategy identified a total of 2,588 articles. Forty-six articles, reporting 79 unique studies, met inclusion criteria. Ten instruments quantifying mindfulness as a unidimensional scale (n=5) or as a set of 2 to 5 subscales (n=5) were reviewed. The Mindful Attention Awareness Scale (MAAS) was evaluated by the most studies (n=27), and had positive overall quality ratings for most of the psychometric properties reviewed. The Five Facet Mindfulness Questionnaire (FFMQ) received the highest possible rating (“consistent findings in multiple studies of good methodological quality”) for two properties, internal consistency and construct validation by hypothesis testing. However, none of the instruments had sufficient evidence of content validity. Comprehensiveness of construct coverage had not been assessed; qualitative methods to confirm understanding and relevance were absent. In addition, estimates of test-retest reliability, responsiveness, or measurement error to guide users in protocol development or interpretation of scores were lacking. Conclusions Current mindfulness scales have important conceptual differences, and none can be strongly recommended based solely on superior psychometric properties. Important limitations in the field are the absence of qualitative evaluations and accepted external referents to support construct validity. Investigators need to proceed cautiously before optimizing any mindfulness intervention based on the existing scales. PMID:23539467
Chandler, L S; Terhorst, L; Rogers, J C; Holm, M B
2016-07-01
The purpose of this study was to establish the validity, reliability, stability and sensitivity to change of the family-centred Movement Assessment of Children (MAC) in typically developing infants/toddlers from 2 months (1 month 16 days) to 2 years (24 months 15 days) of age. Assessment of infant/toddler motor development is critical so that infants and toddlers who are at-risk for developmental delay or whose functional motor development is delayed can be monitored and receive therapy to improve their developmental outcomes. Infants/toddlers are thought to be more responsive during the MAC assessment because parents and siblings participate and elicit responses. Two hundred seventy six children and 405 assessments contributed to the establishment of age-related parameters for typically developing infants and toddlers on the MAC. The MAC assesses three core domains of functional movement (head control, upper extremities and hands, pelvis and lower extremities), and generates a core total score. Four explanatory domains serve to alert examiners to factors that may impact atypical development (general observations, special senses, primitive reflexes/reactions, muscle tone). Construct validity of functional motor development was examined using the relationship between incremental increases in scores and increases in participants' ages. Subsamples were used to establish inter-rater reliability, test-retest reliability, stability and sensitivity to change. Construct validity was established and inter-rater reliability ICCs for the core items and core total ranged from 0.83 to 0.99. Percent agreement for the explanatory items ranged from 0.72 to 0.96. Stability within age grouping was consistent from baseline to 6 months post-baseline, and sensitivity to change from baseline to 6 months was significant for all core items and the total score. The MAC has proven to be a well-constructed assessment of infant and toddler functional motor development. It is a family-centred and efficient tool that can be used to assess and follow-up of infants and toddlers from 2 months to 2 years. © 2016 John Wiley & Sons Ltd.
Estuarine Human Activities Modulate the Fate of Changjiang-derived Materials in Adjacent Seas
NASA Astrophysics Data System (ADS)
WU, H.
2017-12-01
Mega constructions have been built in many river estuaries, but their environmental consequences in the adjacent coastal oceans were often overlooked. This issue was addressed with an example of the Changjiang River Estuary, which was recently built with massive navigation and reclamation constructions in recent years. Based on the model validations against cruises data and the numerical scenario experiments, it is shown that the estuarine constructions profoundly affected the fates of riverine materials in an indeed large offshore area. This is because estuarine dynamics are highly sensitive to their bathymetries. Previously, the Three Gorges Dam (TGD) was thought to be responsible for some offshore environmental changes through modulating the river plume extension, but here we show that its influences are secondary. Since the TGD and the mega estuarine constructions were built during the similar period, their influences might be confused.
Shen, Minxue; Hu, Ming; Sun, Zhenqiu
2017-01-01
Objectives To develop and validate brief scales to measure common emotional and behavioural problems among adolescents in the examination-oriented education system and collectivistic culture of China. Setting Middle schools in Hunan province. Participants 5442 middle school students aged 11–19 years were sampled. 4727 valid questionnaires were collected and used for validation of the scales. The final sample included 2408 boys and 2319 girls. Primary and secondary outcome measures The tools were assessed by the item response theory, classical test theory (reliability and construct validity) and differential item functioning. Results Four scales to measure anxiety, depression, study problem and sociality problem were established. Exploratory factor analysis showed that each scale had two solutions. Confirmatory factor analysis showed acceptable to good model fit for each scale. Internal consistency and test–retest reliability of all scales were above 0.7. Item response theory showed that all items had acceptable discrimination parameters and most items had appropriate difficulty parameters. 10 items demonstrated differential item functioning with respect to gender. Conclusions Four brief scales were developed and validated among adolescents in middle schools of China. The scales have good psychometric properties with minor differential item functioning. They can be used in middle school settings, and will help school officials to assess the students’ emotional/behavioural problems. PMID:28062469
Beeckman, D; Defloor, T; Demarré, L; Van Hecke, A; Vanderwee, K
2010-11-01
Pressure ulcers continue to be a significant problem in hospitals, nursing homes and community care settings. Pressure ulcer incidence is widely accepted as an indicator for the quality of care. Negative attitudes towards pressure ulcer prevention may result in suboptimal preventive care. A reliable and valid instrument to assess attitudes towards pressure ulcer prevention is lacking. Development and psychometric evaluation of the Attitude towards Pressure ulcer Prevention instrument (APuP). Prospective psychometric instrument validation study. A literature review was performed to design the instrument. Content validity was evaluated by nine European pressure ulcer experts and five experts in psychometric instrument validation in a double Delphi procedure. A convenience sample of 258 nurses and 291 nursing students from Belgium and The Netherlands participated in order to evaluate construct validity and stability reliability of the instrument. The data were collected between February and May 2008. A factor analysis indicated the construct of a 13 item instrument in a five factor solution: (1) attitude towards personal competency to prevent pressure ulcers (three items); (2) attitude towards the priority of pressure ulcer prevention (three items); (3) attitude towards the impact of pressure ulcers (three items); (4) attitude towards personal responsibility in pressure ulcer prevention (two items); and (5) attitude towards confidence in the effectiveness of prevention (two items). This five factor solution accounted for 61.4% of the variance in responses related to attitudes towards pressure ulcer prevention. All items demonstrated factor loadings over 0.60. The instrument produced similar results during stability testing [ICC=0.88 (95% CI=0.84-0.91, P<0.001)]. For the total instrument, the internal consistency (Cronbachs alpha) was 0.79. The APuP is a psychometrically sound instrument that can be used to effectively assess attitudes towards pressure ulcer prevention in patient care, education, and research. In further research, the association between attitude, knowledge and clinical performance should be explored. Copyright 2010 Elsevier Ltd. All rights reserved.
Yadegarfar, Ghasem; Alinia, Tahereh; Hosseini, Reihane; Hassannejad, Razieh; Fayaz, Mahsa; Sanati, Javad; Sanati, Kave; Harandi, Jalal; Hajnoorozali, Vahid; Baghi, Mahmood-Reza; Mirzavand, Enayat; Majeed, Azeem
2013-02-01
To assess the reliability and validity of the Farsi version of the effort-reward imbalance questionnaire (F-ERIQ) and to examine the responsiveness of the tool to changes over time. A longitudinal study was carried out among 227 male employees of Iran Polyacryl Corporation. The F-ERIQ was developed through a forward-backward translation process that includes three scales of effort, reward and over-commitment (OC). Reliability and internal consistency of the F-ERIQ were assessed by split-half and Cronbach's alpha coefficients. Confirmatory factor analysis, convergent and discriminant validity were conducted to evaluate construct validity. Depressive mood was used as an indicator for exploring criterion validity. The variations in mean scores over time for scales were regarded as measures of the responsiveness to changes. Baseline split-half correlations for effort, reward and OC were 0.53, 0.85 and 0.65, respectively; Cronbach's alpha coefficients improved from 0.61 to 0.70 for effort, 0.85 to 0.88 for reward and 0.67 to 0.72 for OC. All of item-total correlations were higher than 0.23 and item-scales correlations were higher than 0.4. Although Values of Goodness of Fit Index and Adjusted GFI were higher than 0.9 and Root Mean Square Error of Approximation, Root Mean Square Residual and Standardized RMR were lower than 0.05, confirmatory factor analysis only confirmed the construct of the effort and OC. People with higher job stress were at higher risk of depressive mood (at least 3 times more). Overall, the mean score of effort, OC and ERI increase, and the figures decrease for reward among people who experience changes. These findings provide evidence that the F-ERIQ is a reliable and valid instrument for assessing psychosocial stress at work among Farsi-speaking male employees. We propose that F-ERIQ be further evaluated across a variety of jobs and industries.
Gerards, Sanne M P L; Hummel, Karin; Dagnelie, Pieter C; de Vries, Nanne K; Kremers, Stef P J
2013-01-18
Evaluating whether parental challenges and self-efficacy toward managing children's lifestyle behaviors are successfully addressed by interventions requires valid instruments. The Lifestyle Behavior Checklist (LBC) has recently been developed in the Australian context. It consists of two subscales: the Problem scale, which measures parental perceptions of children's behavioral problems related to overweight and obesity, and the Confidence scale, measuring parental self-efficacy in dealing with these problems. The aim of the current study was to systematically translate the questionnaire into Dutch and to evaluate its internal consistency, construct validity and test-retest reliability. The LBC was systematically translated by four experts at Maastricht University. In total, 392 parents of 3-to13-year-old children were invited to fill out two successive online questionnaires with a two-week interval. Of these, 273 parents responded to the first questionnaire (test, response rate = 69.6%), and of the 202 who could be invited for the second questionnaire (retest), 100 responded (response rate = 49.5%). We assessed the questionnaire's internal consistency (Cronbach's α), construct validity (Spearman's Rho correlation tests, using the criterion measures: restrictiveness, nurturance, and psychological control), and test-retest reliability (Spearman's Rho correlation tests). Both scales had high internal consistency (Cronbach's α ≥ 0.90). Spearman correlation coefficients indicated acceptable test-retest reliability for both the Problem scale (rs = 0.74) and the Confidence scale (rs = 0.70). The LBC Problem scale was significantly correlated to all criterion scales (nurturance, restrictiveness, psychological control) in the hypothesized direction, and the LBC Confidence scale was significantly correlated with nurturance and psychological control in the hypothesized direction, but not with restrictiveness. The Dutch translation of the LBC was found to be a reliable and reasonably valid questionnaire to measure parental perceptions of children's weight-related problem behavior and the extent to which parents feel confident to manage these problems.
Optimization of Thermal Neutron Converter in SiC Sensors for Spectral Radiation Measurements
DOE Office of Scientific and Technical Information (OSTI.GOV)
Krolikowski, Igor; Cetnar, Jerzy; Issa, Fatima
2015-07-01
Optimization of the neutron converter in SiC sensors is presented. The sensors are used for spectral radiation measurements of thermal and fast neutrons and optionally gamma ray at elevated temperature in harsh radiation environment. The neutron converter, which is based on 10B, allows to detect thermal neutrons by means of neutron capture reaction. Two construction of the sensors were used to measure radiation in experiments. Sensor responses collected in experiments have been reproduced by the computer tool created by authors, it allows to validate the tool. The tool creates the response matrix function describing the characteristic of the sensors andmore » it was used for detailed analyses of the sensor responses. Obtained results help to optimize the neutron converter in order to increase thermal neutron detection. Several enhanced construction of the sensors, which includes the neutron converter based on {sup 10}B or {sup 6}Li, were proposed. (authors)« less
Construction and application of EST library from Setaria italica in response to dehydration stress.
Zhang, Jinpeng; Liu, Tingsong; Fu, Junjie; Zhu, Yun; Jia, Jinping; Zheng, Jun; Zhao, Yinhe; Zhang, Ying; Wang, Guoying
2007-07-01
Foxtail millet is a gramineous crop with low water requirement. Despite its high water use efficiency, less attention has been paid to the molecular genetics of foxtail millet. This article reports the construction of subtracted cDNA libraries from foxtail millet seedlings under dehydration stress and the expression profile analysis of 1947 UniESTs from the subtracted cDNA libraries by a cDNA microarray. The results showed that 95 and 57 ESTs were upregulated by dehydration stress, respectively, in roots and shoots of seedlings and that 10 and 27 ESTs were downregulated, respectively, in roots and shoots. The expression profile analysis showed that genes induced in foxtail millet roots were different from those in shoots during dehydration stress and that the early response to dehydration stress in foxtail millet roots was the activation of the glycolysis metabolism. Moreover, protein degradation pathway may also play a pivotal role in drought-tolerant responses of foxtail millet. Finally, Northern blot analysis validated well the cDNA microarray data.
Chow, Tan; Wolfe, Edward W; Olson, Beth H
2012-07-01
Manager attitude is influential in female employees' perceptions of workplace breastfeeding support. Currently, no instrument is available to assess manager attitude toward supporting women who wish to combine breastfeeding with work. We developed and piloted an instrument to measure manager attitudes toward workplace breastfeeding support entitled the "Managers' Attitude Toward Breastfeeding Support Questionnaire," an instrument that measures four constructs using 60 items that are rated agree/disagree on a 4-point Likert rating scale. We established the content validity of the Managers' Attitude Toward Breastfeeding Support Questionnaire measures through expert content review (n=22), expert assessment of item fit (n=11), and cognitive interviews (n=8). Data were collected from a purposive sample of 185 front-line managers who had experience supervising female employees, and responses were scaled using the Multidimensional Random Coefficients Multinomial Logit Model. Dimensionality analyses supported the proposed four-construct model. Reliability ranged from 0.75 to 0.86, and correlations between the constructs were moderately strong (0.47 to 0.71). Four items in two constructs exhibited model-to-data misfit and/or a low score-measure correlation. One item was revised and the other three items were retained in the Managers' Attitude Toward Breastfeeding Support Questionnaire. Findings of this study suggest that the Managers' Attitude Toward Breastfeeding Support Questionnaire measures are reliable and valid indicators of manager attitude toward workplace breastfeeding support, and future research should be conducted to establish external validity. The Managers' Attitude Toward Breastfeeding Support Questionnaire could be used to collect data in a standardized manner within and across companies to measure and compare manager attitudes toward supporting breastfeeding. Organizations can subsequently develop targeted strategies to improve support for breastfeeding employees through efforts influencing managerial attitude. Copyright © 2012 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
Olt, Helen; Jirwe, Maria; Gustavsson, Petter; Emami, Azita
2010-01-01
The purpose of this study was to describe the translation, adaption, and psychometric evaluation process in relation to validity and reliability of the Swedish version of the instrument, Inventory for Assessing The Process of Cultural Competence Among Healthcare Professionals-Revised (IAPCC-R) following the translation, adaptation, and psychometric evaluation process. Validity tests were conducted on the response processes (N = 15), the content (N = 7), and the internal structure of the instrument (N = 334). Reliability (alpha = .65 for the total scale varying between -.01 and .65 for the different subscales) was evaluated in terms of internal consistency. Results indicated weak validity and reliability though it is difficult to conclude whether this is related to adaptation issues or the original construction.The testing of the response process identified problems in relation to respondents' conceptualization of cultural competence. The test of the content identified a weak correspondence between the items and the underlying model. In addition, a confirmatory factor analysis did not confirm the proposed structure of the instrument. This study concludes that this instrument is not valid and reliable for use with a Swedish population of practicing nurses or nursing students.
ERIC Educational Resources Information Center
Lowe, Patricia A.; Papanastasiou, Elena C.; DeRuyck, Kimberly A.; Reynolds, Cecil R.
2005-01-01
In this study, the authors investigated the temporal stability and construct validity of the Adult Manifest Anxiety Scale-College Version (AMAS-C; C. R. Reynolds, B. O. Richmond, & P. A. Lowe, 2003b) scores. Results indicated that the AMAS-C scores had adequate to excellent test score stability, and evidence supported the construct validity of the…
VALIDATION OF ADULT OMNI PERCEIVED EXERTION SCALES FOR ELLIPTICAL ERGOMETRY12
MAYS, RYAN J.; GOSS, FREDRIC L.; SCHAFER, MARK A.; KIM, KEVIN H.; NAGLE-STILLEY, ELIZABETH F.; ROBERTSON, ROBERT J.
2012-01-01
Summary This investigation examined the validity of newly developed Adult OMNI Elliptical Ergometer Ratings of Perceived Exertion Scales. Sixty men and women performed a graded exercise test on an elliptical ergometer. Oxygen consumption (VO2), heart rate (HR) and ratings of perceived exertion were recorded each stage from the Borg 15 Category Scale and two different OMNI scales. One scale employed an elliptical ergometer format of the OMNI Picture System of Perceived Exertion. The second scale modified verbal, numerical, and pictorial descriptors at the low end of the response range. Concurrent and construct validity were established by the positive relation between ratings of perceived exertion from each OMNI scale with VO2, HR and Borg Scale ratings of perceived exertion (men, r = .94–.97; women, r = .93–.98). Validity was established for both OMNI scales, indicating either metric can be used to estimate ratings of perceived exertion during partial weight bearing exercise. PMID:21319623
Does the Defining Issues Test measure ethical judgment ability or political position?
Bailey, Charles D
2011-01-01
This article addresses the construct validity of the Defining Issues Test of ethical judgment (DIT/DIT-2). Alleging a political bias in the test, Emler and colleagues (1983, 1998, 1999, 2007), show that conservatives score higher when asked to fake as liberals, implying that they understand the reasoning associated with "higher" moral development but avoid items they see as liberally biased. DIT proponents challenge the internal validity of faking studies, advocating an explained-variance validation. This study takes a new approach: Adult participants complete the DIT-2, then evaluate the raw responses of others to discern political orientation and ethical development. Results show that individuals scoring higher on the DIT-2 rank others' ethical judgment in a way consistent with DIT-2-based rankings. Accuracy at assessing political orientation, however, is low. Results support the DIT-2's validity as a measure of ethical development, not an expression of political position.
Validation of learning style measures: implications for medical education practice.
Chapman, Dane M; Calhoun, Judith G
2006-06-01
It is unclear which learners would most benefit from the more individualised, student-structured, interactive approaches characteristic of problem-based and computer-assisted learning. The validity of learning style measures is uncertain, and there is no unifying learning style construct identified to predict such learners. This study was conducted to validate learning style constructs and to identify the learners most likely to benefit from problem-based and computer-assisted curricula. Using a cross-sectional design, 3 established learning style inventories were administered to 97 post-Year 2 medical students. Cognitive personality was measured by the Group Embedded Figures Test, information processing by the Learning Styles Inventory, and instructional preference by the Learning Preference Inventory. The 11 subscales from the 3 inventories were factor-analysed to identify common learning constructs and to verify construct validity. Concurrent validity was determined by intercorrelations of the 11 subscales. A total of 94 pre-clinical medical students completed all 3 inventories. Five meaningful learning style constructs were derived from the 11 subscales: student- versus teacher-structured learning; concrete versus abstract learning; passive versus active learning; individual versus group learning, and field-dependence versus field-independence. The concurrent validity of 10 of 11 subscales was supported by correlation analysis. Medical students most likely to thrive in a problem-based or computer-assisted learning environment would be expected to score highly on abstract, active and individual learning constructs and would be more field-independent. Learning style measures were validated in a medical student population and learning constructs were established for identifying learners who would most likely benefit from a problem-based or computer-assisted curriculum.
The Physician Values in Practice Scale: Construction and Initial Validation
ERIC Educational Resources Information Center
Hartung, Paul J.; Taber, Brian J.; Richard, George V.
2005-01-01
Measures of values typically appraise the construct globally, across life domains or relative to a broad life domain such as work. We conducted two studies to construct and initially validate an occupation- and context-specific values measure. Study 1, based on a sample of 192 medical students, describes the initial construction and item analysis…
Rohlf, Helena L; Krahé, Barbara
2015-01-01
An observational measure of anger regulation in middle childhood was developed that facilitated the in situ assessment of five maladaptive regulation strategies in response to an anger-eliciting task. 599 children aged 6-10 years (M = 8.12, SD = 0.92) participated in the study. Construct validity of the measure was examined through correlations with parent- and self-reports of anger regulation and anger reactivity. Criterion validity was established through links with teacher-rated aggression and social rejection measured by parent-, teacher-, and self-reports. The observational measure correlated significantly with parent- and self-reports of anger reactivity, whereas it was unrelated to parent- and self-reports of anger regulation. It also made a unique contribution to predicting aggression and social rejection.
Rohlf, Helena L.; Krahé, Barbara
2015-01-01
An observational measure of anger regulation in middle childhood was developed that facilitated the in situ assessment of five maladaptive regulation strategies in response to an anger-eliciting task. 599 children aged 6–10 years (M = 8.12, SD = 0.92) participated in the study. Construct validity of the measure was examined through correlations with parent- and self-reports of anger regulation and anger reactivity. Criterion validity was established through links with teacher-rated aggression and social rejection measured by parent-, teacher-, and self-reports. The observational measure correlated significantly with parent- and self-reports of anger reactivity, whereas it was unrelated to parent- and self-reports of anger regulation. It also made a unique contribution to predicting aggression and social rejection. PMID:25964767
A Psychometric Properties of the Malay-version Police Stress Questionnaire
IRNIZA, Rasdi; EMILIA, Zainal Abidin; MUHAMMAD SALILUDDIN, Suhainizam; NIZAM ISHA, Ahmad Shahrul
2014-01-01
Background: Police Stress Questionnaire (PSQ) was developed to measure police-specific stressors. The present study was the first to have translated the PSQ to Malay. This study aims to test the reliability, construct validity, and component structure of the Malay-version PSQ. Methods: A set of survey consisted of the Malay-version PSQ, General Health Questionnaire (GHQ-12), Job Content Questionnaire (JCQ), Global Stress Questionnaire (GSQ) and General Self-rated Health (GSRH) were distributed to 300 traffic police officers in Kuala Lumpur and all traffic police officers in a few districts of Pahang and Negeri Sembilan. Results: The response rate was 65.5% (N = 262). The reported Cronbach’s alpha coefficient was 0.93 for Operational PSQ (PSQ-Op) and 0.94 for Organisational PSQ (PSQ-Org). Findings indicated that the PSQ had positive construct validity with the GSRH, GSQ, and GHQ. After excluding four factors related to lifestyles, all police-specific stressors were highly loaded (0.50) in one component. Conclusion: It is confirmed that the Malay-version PSQ, excluding the four factors related to lifestyle, was uni-dimensional, reliable, and a valid questionnaire. This study proffers a potentially better instrument for assessing the stressors among Malaysian police. PMID:25977621
Functional outcomes in ICU – what should we be using? – an observational study.
Parry, Selina M; Denehy, Linda; Beach, Lisa J; Berney, Sue; Williamson, Hannah C; Granger, Catherine L
2015-03-29
With growing awareness of the importance of rehabilitation, new measures are being developed specifically for use in the intensive care unit (ICU). There are currently 26 measures reported to assess function in ICU survivors. The Physical Function in Intensive care Test scored (PFIT-s) has established clinimetric properties. It is unknown how other functional measures perform in comparison to the PFIT-s or which functional measure may be the most clinically applicable for use within the ICU. The aims of this study were to determine (1) the criterion validity of the Functional Status Score for the ICU (FSS-ICU), ICU Mobility Scale (IMS) and Short Physical Performance Battery (SPPB) against the PFIT-s; (2) the construct validity of these tests against muscle strength; (3) predictive utility of these tests to predict discharge to home; and (4) the clinical applicability. This was a nested study within an ongoing controlled study and an observational study. Sixty-six individuals were assessed at awakening and ICU discharge. Measures included: PFIT-s, FSS-ICU, IMS and SPPB. Bivariate relationships (Spearman's rank correlation coefficient) and predictive validity (logistic regression) were determined. Responsiveness (effect sizes); floor and ceiling effects; and minimal important differences were calculated. Mean ± SD PFIT-s at awakening was 4.7 ± 2.3 out of 10. On awakening a large positive relationship existed between PFIT-s and the other functional measures: FSS-ICU (rho = 0.87, p < 0.005), IMS (rho = 0.81, p < 0.005) and SPPB (rho = 0.70, p < 0.005). The PFIT-s had excellent construct validity (rho = 0.8, p < 0.005) and FSS-ICU (rho = 0.69, p < 0.005) and IMS (rho = 0.57, p < 0.005) had moderate construct validity with muscle strength. The PFIT-s and FSS-ICU had small floor/ceiling effects <11% at awakening and ICU discharge. The SPPB had a large floor effect at awakening (78%) and ICU discharge (56%). All tests demonstrated responsiveness; however highest effect size was seen in the PFIT-s (Cohen's d = 0.71). There is high criterion validity for other functional measures against the PFIT-s. The PFIT-s and FSS-ICU are promising functional measures and are recommended to measure function within the ICU. Clinicaltrials.gov NCT02214823. Registered 7 August 2014).
Molecular and clinical characterization of IDH associated immune signature in lower-grade gliomas.
Qian, Zenghui; Li, Yiming; Fan, Xing; Zhang, Chuanbao; Wang, Yinyan; Jiang, Tao; Liu, Xing
2018-01-01
Background : Mutations in isocitrate dehydrogenase (IDH) affect the development and prognosis of gliomas. We investigated the role of IDH mutations in the regulation of immune phenotype in lower-grade gliomas (LGGs). Method and patients : A total of 1,008 cases with clinical and IDH mutation data from five cohorts were enrolled. Samples with RNA sequencing data from the Chinese Glioma Genome Atlas (CGGA) were used as training set, whereas RNA data from the Cancer Genome Atlas, Repository for Molecular Brain Neoplasia, GSE16011, and CGGA microarray databases were used for validation. R language tools and bioinformatics analysis were used for gene signature construction and biological function annotation. Results : We found that IDH mutations caused down-regulation of local immune response as among 332 immune system-related genes, 196(59.0%) were differentially expressed according to IDH mutation status. Nearly 70% of those differentially expressed genes exhibited prognostic value in LGGs. An immune response-based gene signature was constructed that distinguished cases with high- or low-risk of unfavorable prognosis and remained an independent prognostic factor in multivariate analyses in both training and validation cohorts. Samples from high-risk cases exhibited elevated expression of genes involved in immune response and NF-κB pathway activation. Furthermore, we found a strong correlation between the risk score and T cells, macrophage-related immune response, and expression of several prominent immune checkpoints. Conclusion : Our results indicated that mutant IDH is highly associated with the regulation of the immune microenvironment in LGGs. The observed immune system gene signature, which was sensitive to IDH mutation status, efficiently predicted patient survival.
An Ethical Issue Scale for Community Pharmacy Setting (EISP): Development and Validation.
Crnjanski, Tatjana; Krajnovic, Dusanka; Tadic, Ivana; Stojkov, Svetlana; Savic, Mirko
2016-04-01
Many problems that arise when providing pharmacy services may contain some ethical components and the aims of this study were to develop and validate a scale that could assess difficulties of ethical issues, as well as the frequency of those occurrences in everyday practice of community pharmacists. Development and validation of the scale was conducted in three phases: (1) generating items for the initial survey instrument after qualitative analysis; (2) defining the design and format of the instrument; (3) validation of the instrument. The constructed Ethical Issue scale for community pharmacy setting has two parts containing the same 16 items for assessing the difficulty and frequency thereof. The results of the 171 completely filled out scales were analyzed (response rate 74.89%). The Cronbach's α value of the part of the instrument that examines difficulties of the ethical situations was 0.83 and for the part of the instrument that examined frequency of the ethical situations was 0.84. Test-retest reliability for both parts of the instrument was satisfactory with all Interclass correlation coefficient (ICC) values above 0.6, (for the part that examines severity ICC = 0.809, for the part that examines frequency ICC = 0.929). The 16-item scale, as a self assessment tool, demonstrated a high degree of content, criterion, and construct validity and test-retest reliability. The results support its use as a research tool to asses difficulty and frequency of ethical issues in community pharmacy setting. The validated scale needs to be further employed on a larger sample of pharmacists.
Pesticide applicators questionnaire content validation: A fuzzy delphi method.
Manakandan, S K; Rosnah, I; Mohd Ridhuan, J; Priya, R
2017-08-01
The most crucial step in forming a set of survey questionnaire is deciding the appropriate items in a construct. Retaining irrelevant items and removing important items will certainly mislead the direction of a particular study. This article demonstrates Fuzzy Delphi method as one of the scientific analysis technique to consolidate consensus agreement within a panel of experts pertaining to each item's appropriateness. This method reduces the ambiguity, diversity, and discrepancy of the opinions among the experts hence enhances the quality of the selected items. The main purpose of this study was to obtain experts' consensus on the suitability of the preselected items on the questionnaire. The panel consists of sixteen experts from the Occupational and Environmental Health Unit of Ministry of Health, Vector-borne Disease Control Unit of Ministry of Health and Occupational and Safety Health Unit of both public and private universities. A set of questionnaires related to noise and chemical exposure were compiled based on the literature search. There was a total of six constructs with 60 items in which three constructs for knowledge, attitude, and practice of noise exposure and three constructs for knowledge, attitude, and practice of chemical exposure. The validation process replicated recent Fuzzy Delphi method that using a concept of Triangular Fuzzy Numbers and Defuzzification process. A 100% response rate was obtained from all the sixteen experts with an average Likert scoring of four to five. Post FDM analysis, the first prerequisite was fulfilled with a threshold value (d) ≤ 0.2, hence all the six constructs were accepted. For the second prerequisite, three items (21%) from noise-attitude construct and four items (40%) from chemical-practice construct had expert consensus lesser than 75%, which giving rise to about 12% from the total items in the questionnaire. The third prerequisite was used to rank the items within the constructs by calculating the average fuzzy numbers. The seven items which did not fulfill the second prerequisite similarly had lower ranks during the analysis, therefore those items were discarded from the final draft. Post FDM analysis, the experts' consensus on the suitability of the pre-selected items on the questionnaire set were obtained, hence it is now ready for further construct validation process.
Steffan, Jean; Olivry, Thierry; Forster, Sophie L; Seewald, Wolfgang
2012-10-01
Hypersensitivity (allergic) dermatitis (HD) is commonly seen in cats, causing pruritus and various patterns of skin lesions, including at least one of the following: head and neck excoriations, self-induced alopecia, eosinophilic plaques and miliary dermatitis. Few studies have evaluated the efficacy of therapeutic interventions for feline HD, and although various scales have been considered, none has been formally validated for the assessment of disease severity and its response to therapy. To design and validate a novel scale (SCORing Feline Allergic Dermatitis; SCORFAD) to assess the value of different criteria used as outcome measures for the treatment of feline HD and to set minimal thresholds for defining the clinical success of tested interventions. One hundred client-owned cats. The SCORFAD scale was designed to include the four most frequently identified lesion types in feline HD (eosinophilic plaque, head and neck excoriations, self-induced alopecia and miliary dermatitis) across 10 body regions. The extent and severity of each lesion type were graded prior to inclusion and after 3 and 6 weeks in a clinical study to compare the efficacy of two doses of ciclosporin with placebo. The SCORFAD scale was found to exhibit satisfactory content, construct, criterion and sensitivity to change. The percentage reduction in SCORFAD from baseline was determined to be the most valid assessment of clinical response. Inter- and intra-observer reliability was not assessed. The SCORFAD scale is proposed for use as a validated tool for the assessment of disease severity and response to therapeutic interventions in clinical trials for feline HD. © 2012 The Authors. Veterinary Dermatology © 2012 ESVD and ACVD.
Developing and validating a nutrition knowledge questionnaire: key methods and considerations.
Trakman, Gina Louise; Forsyth, Adrienne; Hoye, Russell; Belski, Regina
2017-10-01
To outline key statistical considerations and detailed methodologies for the development and evaluation of a valid and reliable nutrition knowledge questionnaire. Literature on questionnaire development in a range of fields was reviewed and a set of evidence-based guidelines specific to the creation of a nutrition knowledge questionnaire have been developed. The recommendations describe key qualitative methods and statistical considerations, and include relevant examples from previous papers and existing nutrition knowledge questionnaires. Where details have been omitted for the sake of brevity, the reader has been directed to suitable references. We recommend an eight-step methodology for nutrition knowledge questionnaire development as follows: (i) definition of the construct and development of a test plan; (ii) generation of the item pool; (iii) choice of the scoring system and response format; (iv) assessment of content validity; (v) assessment of face validity; (vi) purification of the scale using item analysis, including item characteristics, difficulty and discrimination; (vii) evaluation of the scale including its factor structure and internal reliability, or Rasch analysis, including assessment of dimensionality and internal reliability; and (viii) gathering of data to re-examine the questionnaire's properties, assess temporal stability and confirm construct validity. Several of these methods have previously been overlooked. The measurement of nutrition knowledge is an important consideration for individuals working in the nutrition field. Improved methods in the development of nutrition knowledge questionnaires, such as the use of factor analysis or Rasch analysis, will enable more confidence in reported measures of nutrition knowledge.
Goldschmidt, Andrea B.
2017-01-01
Background Binge eating is a marker of weight gain and obesity, and a hallmark feature of eating disorders. Yet, its component constructs—overeating and loss of control (LOC) while eating—are poorly understood and difficult to measure. Objective To critically review the human literature concerning the validity of LOC and overeating across the age and weight spectrum. Data sources English-language articles addressing the face, convergent, discriminant, and predictive validity of LOC and overeating were included. Results LOC and overeating appear to have adequate face validity. Emerging evidence supports the convergent and predictive validity of the LOC construct, given its unique cross-sectional and prospective associations with numerous anthropometric, psychosocial, and eating behavior-related factors. Overeating may be best conceptualized as a marker of excess weight status. Limitations Binge eating constructs, particularly in the context of subjectively large episodes, are challenging to measure reliably. Few studies addressed overeating in the absence of LOC, thereby limiting conclusions about the validity of the overeating construct independent of LOC. Additional studies addressing the discriminant validity of both constructs are warranted. Discussion Suggestions for future weight-related research and for appropriately defining binge eating in the eating disorders diagnostic scheme are presented. PMID:28165655
Devoogdt, Nele; De Groef, An; Hendrickx, Ad; Damstra, Robert; Christiaansen, Anke; Geraerts, Inge; Vervloesem, Nele; Vergote, Ignace; Van Kampen, Marijke
2014-05-01
Patients may develop primary (congenital) or secondary (acquired) lymphedema, causing significant physical and psychosocial problems. To plan treatment for lymphedema and monitor a patient's progress, swelling, and problems in functioning associated with lymphedema development should be assessed at baseline and follow-up. The purpose of this study was to investigate the reliability (test-retest, internal consistency, and measurement variability) and validity (content and construct) of data obtained with the Lymphoedema Functioning, Disability and Health Questionnaire for Lower Limb Lymphoedema (Lymph-ICF-LL). This was a multicenter, cross-sectional study. The Lymph-ICF-LL is a descriptive, evaluative tool containing 28 questions about impairments in function, activity limitations, and participation restrictions in patients with lower limb lymphedema. The questionnaire has 5 domains: physical function, mental function, general tasks/household activities, mobility activities, and life domains/social life. The reliability and validity of the Lymph-ICF-LL were examined in 30 participants with objective lower limb lymphedema. Intraclass correlation coefficients for test-retest reliability ranged from .69 to .94, and Cronbach alpha coefficients for internal consistency ranged from .82 to .97. Measurement variability was acceptable (standard error of measurement=5.9-12.6). Content validity was good because all questions were understandable for 93% of participants, the scoring system (visual analog scale) was clear, and the questionnaire was comprehensive for 90% of participants. Construct validity was good. All hypotheses for assessing convergent validity and divergent validity were accepted. The known-groups validity and responsiveness of the Dutch Lymph-ICF-LL and the cross-cultural validity of the English version of the Lymph-ICF-LL were not investigated. The Lymph-ICF-LL is a Dutch questionnaire with evidence of reliability and validity for assessing impairments in function, activity limitations, and participation restrictions in people with primary or secondary lower limb lymphedema.
Validation of an Arabic version of the Diabetes Treatment Satisfaction Questionnaire in Qatar.
Wilbur, Kerry; Al Hammaq, Abdulla O
2016-03-01
Several instruments evaluate patient-reported outcomes in diabetes mellitus (DM), but almost none are validated for use in Arabic language. The aim of this study is to test the psychometric properties and responsiveness of the Arabic version of the Diabetes Treatment Satisfaction Questionnaire (DTSQs) in Qatar. Ambulatory Arabic speaking DM patients were interviewed at two consecutive time points in Doha, Qatar. The 8-item DTSQs was administered in conjunction with the Medical Outcomes Study 36-Item Short-Form Health Survey (SF-36) and the World Health Organization Quality of Life Measure (WHOQOL-Bref) to assess convergent validity. Reliability was evaluated by internal consistency and item analysis. Construct validity was evaluated using "known groups" comparisons (including gender, insulin use, and HbA1c). Sensitivity of DTSQs scores to the subject's metabolic conditions was determined. One hundred subjects (mean age 50.7) participated. Half (54%) were female. The majority (93%) had Type 2 DM, but 39 (42%) were using insulin. Results revealed satisfactory internal consistency. Metabolic measures (fasting blood glucose and AIC) had significant inverse correlations with DTSQs scores (interview 1, Pearson's r=-0.333 and r=-0.401, respectively, p<0.01). Scale criterion and construct validity were found to be satisfactory. Most sub-dimensions of the SF-36 and WHOQOL-Bref were correlated with the DTSQ, indicating a good concurrent validity. As in prior studies, women demonstrated poorer treatment satisfaction. The Qatar Arabic DTSQs version was found to be a reliable and valid instrument for the assessment of treatment satisfaction in Arabic diabetes mellitus patients in the country. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Older adults' drug benefit beliefs: construct definition and measure development.
Cline, Richard R; Gupta, Kiran; Singh, Reshmi L
2008-03-01
The Medicare Prescription Drug, Improvement and Modernization Act of 2003 provides coverage of outpatient prescription drugs for Medicare beneficiaries. Although much has been learned since the program's implementation, a context within which this information can be understood is lacking. The purpose of this study was to develop a reliable and valid multi-item instrument measuring beliefs about Medicare prescription drug benefits. Survey items were generated using focus group transcripts, other surveys on the Medicare Part "D" program, and past studies of choice and satisfaction in drug insurance programs. Using data from the survey pilot test, item and reliability analyses were used to reduce and refine an initial pool of items. Data then were collected from a cross-sectional, mail survey of older adults living in Minnesota. Data were analyzed using exploratory factor analysis. Summated rating scales then were constructed and assessed further using reliability analyses. Construct validity of summated scales was examined by comparing scale scores across response categories of survey items that collected information on general political attitudes, perceptions of the Medicare Part "D" program, health status, and health care utilization and demographics. The adjusted response rate for the main survey was 55.98% (744/1329). Iterative factor analysis produced 2 interpretable scales. The first, termed "access/equity" (13 items, Cronbach's alpha=0.89) measures beliefs that a Medicare drug benefit should both provide affordable prescription drugs for beneficiaries and do this in a manner that is equitable for all participants. The second, termed "comprehensibility" (6 items, Cronbach's alpha=0.80) assesses beliefs that regulations governing a Medicare drug benefit should be easily understood. Discriminant validity tests suggest that these measures behave in a manner consistent with related research in these areas. Measures of 2 facets of older adults' drug benefit beliefs were developed using a multiple step procedure. Future research could focus on developing a better understanding of other facets of these beliefs and sound methods of measurement.
NASA Astrophysics Data System (ADS)
Schweitzer, Ben; Wilke, Stephen; Khateeb, Siddique; Al-Hallaj, Said
2015-08-01
A lumped (0-D) numerical model has been developed for simulating the thermal response of a lithium-ion battery pack with a phase-change composite (PCC™) thermal management system. A small 10s4p battery pack utilizing PCC material was constructed and subjected to discharge at various C-rates in order to validate the lumped model. The 18650 size Li-ion cells used in the pack were electrically characterized to determine their heat generation, and various PCC materials were thermally characterized to determine their apparent specific heat as a function of temperature. Additionally, a 2-D FEA thermal model was constructed to help understand the magnitude of spatial temperature variation in the pack, and to understand the limitations of the lumped model. Overall, good agreement is seen between experimentally measured pack temperatures and the 0-D model, and the 2-D FEA model predicts minimal spatial temperature variation for PCC-based packs at C-rates of 1C and below.
The Work Disability Functional Assessment Battery (WD-FAB): Feasibility and Psychometric Properties
Meterko, Mark; Marfeo, Elizabeth E.; McDonough, Christine M.; Jette, Alan M.; Ni, Pengsheng; Bogusz, Kara; Rasch, Elizabeth K; Brandt, Diane E.; Chan, Leighton
2015-01-01
Objectives To assess the feasibility and psychometric properties of eight scales covering two domains of the newly developed Work Disability Functional Assessment Battery (WD-FAB): physical function (PF) and behavioral health (BH) function. Design Cross-sectional. Setting Community. Participants Adults unable to work due to a physical (n=497) or mental (n=476) disability. Interventions None. Main Outcome Measures Each disability group responded to a survey consisting of the relevant WD-FAB scales and existing measures of established validity. The WD-FAB scales were evaluated with regard to data quality (score distribution; percent “I don’t know” responses), efficiency of administration (number of items required to achieve reliability criterion; time required to complete the scale) by computerized adaptive testing (CAT), and measurement accuracy as tested by person fit. Construct validity was assessed by examining both convergent and discriminant correlations between the WD-FAB scales and scores on same-domain and cross-domain established measures. Results Data quality was good and CAT efficiency was high across both WD-FAB domains. Measurement accuracy was very good for the PF scales; BH scales demonstrated more variability. Construct validity correlations, both convergent and divergent, between all WD-FAB scales and established measures were in the expected direction and range of magnitude. Conclusions The data quality, CAT efficacy, person fit and construct validity of the WD-FAB scales were well supported and suggest that the WD-FAB could be used to assess physical and behavioral health function related to work disability. Variation in scale performance suggests the need for future work on item replenishment and refinement, particularly regarding the Self-Efficacy scale. PMID:25528263
Sexual Assault Prevention and Response Climate DEOCS 4.1 Construct Validity Summary
2017-08-01
DEOCS, (7) examining variance and descriptive statistics (8) examining the relationship among items/areas to reduce multicollinearity, and (9...selecting items that demonstrate the strongest scale properties. Included is a review of the 4.0 description and items, followed by the proposed...Tables 1 – 7 for the description of each measure and corresponding items. Table 1. DEOCS 4.0 Perceptions of Safety Measure Description
Initial Results from the Variable Intensity Sonic Boom Database
NASA Technical Reports Server (NTRS)
Haering, Edward A., Jr.; Cliatt, Larry J., II; Gabrielson, Thomas; Sparrow, Victor W.; Locey, Lance L.; Bunce, Thomas J.
2008-01-01
43 sonic booms generated (a few were evanescent waves) a) Overpressures of 0.08 to 2.20 lbf/sq ft; b) Rise-times of about 0.7 to 50 ms. Objectives: a) Structural response of a house of modern construction; b) Sonic boom propagation code validation. Approach: a) Measure shockwave directionality; b) Determine effect of height above ground on acoustic level; c) Generate atmospheric turbulence filter functions.
Extremera, Natalio; Fernández-Berrocal, Pablo
2006-02-01
This study investigated the construct validity and reliability of the Spanish Ruminative Responses Scale-Short From, and the Distraction Responses Scale of the Response Styles Questionnaire for a sample of 727 Spanish high school and college students who responded anonymously and voluntarily to a questionnaire (293 men, 434 women; ages 16 to 29 years, M=18.8, SD=3.0). In addition to the above scales, the questionnaire included the Spanish forms of the Beck Depression Inventory, the Trait Anxiety Scale from the State-Trait Anxiety Scale, the Satisfaction with Life Scale, and the Subjective Happiness Scale. The internal consistency of the scales was satisfactory (Cronbach alpha=.86 for the Ruminative Responses Scale and .78 for the Distraction Responses Scale). As expected, scores on the Spanish Ruminative Responses Scale showed positive correlations with those on the Beck Depression Inventory and the Trait Anxiety Scale and negative associations with the Satisfaction with Life Scale and the Subjective Happiness Scale. Conversely, the Spanish Distraction Responses Scale was negatively correlated with the Beck Depression Inventory and positively associated with the Satisfaction with Life Scale and the Subjective Happiness Scale. These results provide evidence of appropriate reliability for research purposes. Furthermore, the correlational analysis supported prior findings that ruminative response and distraction response styles are differentially associated with reported depressed and positive moods.
Validation of the ArthroS virtual reality simulator for arthroscopic skills.
Stunt, J J; Kerkhoffs, G M M J; van Dijk, C N; Tuijthof, G J M
2015-11-01
Virtual reality simulator training has become important for acquiring arthroscopic skills. A new simulator for knee arthroscopy ArthroS™ has been developed. The purpose of this study was to demonstrate face and construct validity, executed according to a protocol used previously to validate arthroscopic simulators. Twenty-seven participants were divided into three groups having different levels of arthroscopic experience. Participants answered questions regarding general information and the outer appearance of the simulator for face validity. Construct validity was assessed with one standardized navigation task. Face validity, educational value and user friendliness were further determined by giving participants three exercises and by asking them to fill out the questionnaire. Construct validity was demonstrated between experts and beginners. Median task times were not significantly different for all repetitions between novices and intermediates, and between intermediates and experts. Median face validity was 8.3 for the outer appearance, 6.5 for the intra-articular joint and 4.7 for surgical instruments. Educational value and user friendliness were perceived as nonsatisfactory, especially because of the lack of tactile feedback. The ArthroS™ demonstrated construct validity between novices and experts, but did not demonstrate full face validity. Future improvements should be mainly focused on the development of tactile feedback. It is necessary that a newly presented simulator is validated to prove it actually contributes to proficiency of skills.
Validity of Sensory Systems as Distinct Constructs
Su, Chia-Ting
2014-01-01
This study investigated the validity of sensory systems as distinct measurable constructs as part of a larger project examining Ayres’s theory of sensory integration. Confirmatory factor analysis (CFA) was conducted to test whether sensory questionnaire items represent distinct sensory system constructs. Data were obtained from clinical records of two age groups, 2- to 5-yr-olds (n = 231) and 6- to 10-yr-olds (n = 223). With each group, we tested several CFA models for goodness of fit with the data. The accepted model was identical for each group and indicated that tactile, vestibular–proprioceptive, visual, and auditory systems form distinct, valid factors that are not age dependent. In contrast, alternative models that grouped items according to sensory processing problems (e.g., over- or underresponsiveness within or across sensory systems) did not yield valid factors. Results indicate that distinct sensory system constructs can be measured validly using questionnaire data. PMID:25184467
Espinoza-Venegas, Maritza; Sanhueza-Alvarado, Olivia; Ramírez-Elizondo, Noé; Sáez-Carrillo, Katia
2015-01-01
OBJECTIVE: The current study aimed to validate the construct and reliability of an emotional intelligence scale. METHOD: The Trait Meta-Mood Scale-24 was applied to 349 nursing students. The process included content validation, which involved expert reviews, pilot testing, measurements of reliability using Cronbach's alpha, and factor analysis to corroborate the validity of the theoretical model's construct. RESULTS: Adequate Cronbach coefficients were obtained for all three dimensions, and factor analysis confirmed the scale's dimensions (perception, comprehension, and regulation). CONCLUSION: The Trait Meta-Mood Scale is a reliable and valid tool to measure the emotional intelligence of nursing students. Its use allows for accurate determinations of individuals' abilities to interpret and manage emotions. At the same time, this new construct is of potential importance for measurements in nursing leadership; educational, organizational, and personal improvements; and the establishment of effective relationships with patients. PMID:25806642
ERIC Educational Resources Information Center
Aebi, Marcel; Plattner, Belinda; Metzke, Christa Winkler; Bessler, Cornelia; Steinhausen, Hans-Christoph
2013-01-01
Background: Different dimensions of oppositional defiant disorder (ODD) have been found as valid predictors of further mental health problems and antisocial behaviors in youth. The present study aimed at testing the construct, concurrent, and predictive validity of ODD dimensions derived from parent- and self-report measures. Method: Confirmatory…
Sørensen, Hans Eibe; Slater, Stanley F
2008-08-01
Atheoretical measure purification may lead to construct deficient measures. The purpose of this paper is to provide a theoretically driven procedure for the development and empirical validation of symmetric component measures of multidimensional constructs. Particular emphasis is placed on establishing a formalized three-step procedure for achieving a posteriori content validity. Then the procedure is applied to development and empirical validation of two symmetrical component measures of market orientation, customer orientation and competitor orientation. Analysis suggests that average variance extracted is particularly critical to reliability in the respecification of multi-indicator measures. In relation to this, the results also identify possible deficiencies in using Cronbach alpha for establishing reliable and valid measures.
Küçükdeveci, Ayse A; Sahin, Hülya; Ataman, Sebnem; Griffiths, Bridget; Tennant, Alan
2004-02-15
Guidelines have been established for cross-cultural adaptation of outcome measures. However, invariance across cultures must also be demonstrated through analysis of Differential Item Functioning (DIF). This is tested in the context of a Turkish adaptation of the Health Assessment Questionnaire (HAQ). Internal construct validity of the adapted HAQ is assessed by Rasch analysis; reliability, by internal consistency and the intraclass correlation coefficient; external construct validity, by association with impairments and American College of Rheumatology functional stages. Cross-cultural validity is tested through DIF by comparison with data from the UK version of the HAQ. The adapted version of the HAQ demonstrated good internal construct validity through fit of the data to the Rasch model (mean item fit 0.205; SD 0.998). Reliability was excellent (alpha = 0.97) and external construct validity was confirmed by expected associations. DIF for culture was found in only 1 item. Cross-cultural validity was found to be sufficient for use in international studies between the UK and Turkey. Future adaptation of instruments should include analysis of DIF at the field testing stage in the adaptation process.
Alterman, Arthur I.; Cacciola, John S.; Ivey, Megan A.; Lynch, Kevin G.
2009-01-01
Objective: This study evaluated the psychometric properties of the 15-item alcohol Short Index of Problems (SIP) instrument and those of a newly constructed 15-item drug Short Index of Problems (SIP-D) instrument in 277 newly entered substance-abuse patients. Method: The SIP is derived from the longer, 50-item Drinker Inventory of Consequences (DrInC), which was designed to assess adverse consequences of alcohol use. The SIP-D was constructed by substituting the term “drug use” for the term “drinking” in each SIP item. A 3-month recall interval was employed. Results: Factor analyses of each of the instruments revealed similar solutions, with only one main factor accounting for the majority of variance. Nonparametric item response theory methods produced the same finding. Internal consistency reliability estimates for the SIP and SIP-D total scores were .98 and .97, respectively. Concurrent validity was demonstrated by examining the correlations of the total scores for each of the instruments with the recent summary indexes of the newly revised Addiction Severity Index (ASI-Version 6): alcohol, drug, medical, economic, legal, family/social, and psychiatric problems. Conclusions: This study is the first to confirm the psychometric validity of the SIP when used as an independent instrument unembedded within the DrInC. The study also supports the use of the SIP-D as a brief measure of adverse consequences of drug use. The findings strongly support the unidimensional structure of both measures. PMID:19261243
Alterman, Arthur I; Cacciola, John S; Ivey, Megan A; Habing, Brian; Lynch, Kevin G
2009-03-01
This study evaluated the psychometric properties of the 15-item alcohol Short Index of Problems (SIP) instrument and those of a newly constructed 15-item drug Short Index of Problems (SIP-D) instrument in 277 newly entered substance-abuse patients. The SIP is derived from the longer, 50-item Drinker Inventory of Consequences (DrInC), which was designed to assess adverse consequences of alcohol use. The SIP-D was constructed by substituting the term "drug use" for the term "drinking" in each SIP item. A 3-month recall interval was employed. Factor analyses of each of the instruments revealed similar solutions, with only one main factor accounting for the majority of variance. Nonparametric item response theory methods produced the same finding. Internal consistency reliability estimates for the SIP and SIP-D total scores were .98 and .97, respectively. Concurrent validity was demonstrated by examining the correlations of the total scores for each of the instruments with the recent summary indexes of the newly revised Addiction Severity Index (ASI-Version 6): alcohol, drug, medical, economic, legal, family/social, and psychiatric problems. This study is the first to confirm the psychometric validity of the SIP when used as an independent instrument unembedded within the DrInC. The study also supports the use of the SIP-D as a brief measure of adverse consequences of drug use. The findings strongly support the unidimensional structure of both measures.
Measurement of latent cognitive abilities involved in concept identification learning.
Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Nock, Matthew K; Naifeh, James A; Heeringa, Steven; Ursano, Robert J; Stein, Murray B
2015-01-01
We used cognitive and psychometric modeling techniques to evaluate the construct validity and measurement precision of latent cognitive abilities measured by a test of concept identification learning: the Penn Conditional Exclusion Test (PCET). Item response theory parameters were embedded within classic associative- and hypothesis-based Markov learning models and were fitted to 35,553 Army soldiers' PCET data from the Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS). Data were consistent with a hypothesis-testing model with multiple latent abilities-abstraction and set shifting. Latent abstraction ability was positively correlated with number of concepts learned, and latent set-shifting ability was negatively correlated with number of perseverative errors, supporting the construct validity of the two parameters. Abstraction was most precisely assessed for participants with abilities ranging from 1.5 standard deviations below the mean to the mean itself. Measurement of set shifting was acceptably precise only for participants making a high number of perseverative errors. The PCET precisely measures latent abstraction ability in the Army STARRS sample, especially within the range of mildly impaired to average ability. This precision pattern is ideal for a test developed to measure cognitive impairment as opposed to cognitive strength. The PCET also measures latent set-shifting ability, but reliable assessment is limited to the impaired range of ability, reflecting that perseverative errors are rare among cognitively healthy adults. Integrating cognitive and psychometric models can provide information about construct validity and measurement precision within a single analytical framework.
Gordon, Kenneth B; Betts, Keith A; Sundaram, Murali; Signorovitch, James E; Li, Junlong; Xie, Meng; Wu, Eric Q; Okun, Martin M
2017-12-01
Most methotrexate-treated psoriasis patients do not achieve a long-term PASI75 (75% reduction from baseline Psoriasis Area and Severity Index score) response. Indications of nonresponse can be apparent after only 4 weeks of treatment. To develop a prediction rule to identify patients unlikely to respond adequately to methotrexate. Patient-level data from CHAMPION (NCT00235820, N = 110) was used to construct a prediction model for week 16 PASI75 by using patient baseline characteristics and week 4 PASI25. A prediction rule was determined on the basis of the sensitivity and specificity and validated in terms of week 16 PASI75 response in an independent validation sample from trial M10-255 (NCT00679731, N = 163). PASI25 achievement at week 4 (odds ratio = 8.917) was highly predictive of response with methotrexate at week 16. Patients with a predicted response probability <30% were recommended to discontinue methotrexate. The rates of week 16 PASI75 response were 65.8% and 21.1% (P < .001) for patients recommended to continue and discontinue methotrexate, respectively. The CHAMPION trial excluded patients previously treated with biologics, and the M10-255 trial had no restrictions. A prediction rule was developed and validated to identify patients unlikely to respond adequately to methotrexate. The rule indicates that 4 weeks of methotrexate might be sufficient to predict long-term response with limited safety risk. Copyright © 2017 American Academy of Dermatology, Inc. Published by Elsevier Inc. All rights reserved.
Developing a database for pedestrians' earthquake emergency evacuation in indoor scenarios.
Zhou, Junxue; Li, Sha; Nie, Gaozhong; Fan, Xiwei; Tan, Jinxian; Li, Huayue; Pang, Xiaoke
2018-01-01
With the booming development of evacuation simulation software, developing an extensive database in indoor scenarios for evacuation models is imperative. In this paper, we conduct a qualitative and quantitative analysis of the collected videotapes and aim to provide a complete and unitary database of pedestrians' earthquake emergency response behaviors in indoor scenarios, including human-environment interactions. Using the qualitative analysis method, we extract keyword groups and keywords that code the response modes of pedestrians and construct a general decision flowchart using chronological organization. Using the quantitative analysis method, we analyze data on the delay time, evacuation speed, evacuation route and emergency exit choices. Furthermore, we study the effect of classroom layout on emergency evacuation. The database for indoor scenarios provides reliable input parameters and allows the construction of real and effective constraints for use in software and mathematical models. The database can also be used to validate the accuracy of evacuation models.
Young, Ian A; Dunning, James; Butts, Raymond; Cleland, Joshua A; Fernández-de-Las-Peñas, César
2018-01-01
Background Self-reported disability and pain intensity are commonly used outcomes in patients with cervicogenic headaches. However, there is a paucity of psychometric evidence to support the use of these self-report outcomes for individuals treated with cervicogenic headaches. Therefore, it is unknown if these measures are reliable, responsive, or result in meaningful clinically important changes in this patient population. Methods A secondary analysis of a randomized clinical trial (n = 110) examining the effects of spinal manipulative therapy with and without exercise in patients with cervicogenic headaches. Reliability, construct validity, responsiveness and thresholds for minimal detectable change and clinically important difference values were calculated for the Neck Disability Index and Numeric Pain Rating Scale. Results The Neck Disability Index exhibited excellent reliability (ICC = 0.92; [95 % CI: 0.46-0.97]), while the Numeric Pain Rating Scale exhibited moderate reliability (ICC = 0.72; [95 % CI: 0.08-0.90]) in the short term. Both instruments also exhibited adequate responsiveness (area under the curve; range = 0.78-0.93) and construct validity ( p < 0.001) in this headache population. Conclusions Both instruments seem well suited as short-term self-report measures for patients with cervicogenic headaches. Clinicians and researchers should expect at least a 2.5-point reduction on the numeric pain rating scale and a 5.5-point reduction on the neck disability index after 4 weeks of intervention to be considered clinically meaningful.
Psychometric properties of the Triarchic Psychopathy Measure: An item response theory approach.
Shou, Yiyun; Sellbom, Martin; Xu, Jing
2018-05-01
There is cumulative evidence for the cross-cultural validity of the Triarchic Psychopathy Measure (TriPM; Patrick, 2010) among non-Western populations. Recent studies using correlational and regression analyses show promising construct validity of the TriPM in Chinese samples. However, little is known about the efficiency of items in TriPM in assessing the proposed latent traits. The current study evaluated the psychometric properties of the Chinese TriPM at the item level using item response theory analyses. It also examined the measurement invariance of the TriPM between the Chinese and the U.S. student samples by applying differential item functioning analyses under the item response theory framework. The results supported the unidimensional nature of the Disinhibition and Meanness scales. Both scales had a greater level of precision in the respective underlying constructs at the positive ends. The two scales, however, had several items that were weakly associated with their respective latent traits in the Chinese student sample. Boldness, on the other hand, was found to be multidimensional, and reflected a more normally distributed range of variation. The examination of measurement bias via differential item functioning analyses revealed that a number of items of the TriPM were not equivalent across the Chinese and the U.S. Some modification and adaptation of items might be considered for improving the precision of the TriPM for Chinese participants. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Langenbucher, J; Sulesund, D; Chung, T; Morgenstern, J
1996-01-01
Illness severity and self-efficacy are two constructs of growing interest as predictors of clinical response in alcoholism. Using alternative measures of illness severity (DSM-IV symptom count, Alcohol Dependence Scale, and Addiction Severity Index) and self-efficacy (brief version of the Situational Confidence Questionnaire) rigorously controlled for theoretically important background variables, we studied their unique contribution to multiple indices of relapse, relapse latency, and use of alternative coping behaviors in a large, heterogeneous clinical sample. The Alcohol Dependence Scale contributed to the prediction of 4 of 5 relapse indicators. The SCQ failed to predict relapse behavior or its precursor, coping response. The findings emphasize the predictive validity of severity of dependence as a course specifier and underline the need for more sensitive and externally valid measures of cognitive processes such as self-efficacy for application in future studies of posttreatment behavior.
Non-Invasive Tension Measurement Devices for Parachute Cordage
NASA Technical Reports Server (NTRS)
Litteken, Douglas A.; Daum, Jared S.
2016-01-01
The need for lightweight and non-intrusive tension measurements has arisen alongside the development of high-fidelity computer models of textile and fluid dynamics. In order to validate these computer models, data must be gathered in the operational environment without altering the design, construction, or performance of the test article. Current measurement device designs rely on severing a cord and breaking the load path to introduce a load cell. These load cells are very reliable, but introduce an area of high stiffness in the load path, directly affecting the structural response, adding excessive weight, and possibly altering the dynamics of the parachute during a test. To capture the required data for analysis validation without affecting the response of the system, non-invasive measurement devices have been developed and tested by NASA. These tension measurement devices offer minimal impact to the mass, form, fit, and function of the test article, while providing reliable, axial tension measurements for parachute cordage.
Lei, Pingguang; Lei, Guanghe; Tian, Jianjun; Zhou, Zengfen; Zhao, Miao; Wan, Chonghua
2014-10-01
This paper is aimed to develop the irritable bowel syndrome (IBS) scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-IBS) by the modular approach and validate it by both classical test theory and generalizability theory. The QLICD-IBS was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interview, and quantitative statistical procedures. One hundred twelve inpatients with IBS were used to provide the data measuring QOL three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability, and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t tests and also G studies and D studies of generalizability theory analysis. Multi-trait scaling analysis, correlation, and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. Test-retest reliability coefficients (Pearson r and intra-class correlation (ICC)) for the overall score and all domains were higher than 0.80; the internal consistency α for all domains at two measurements were higher than 0.70 except for the social domain (0.55 and 0.67, respectively). The overall score and scores for all domains/facets had statistically significant changes after treatments with moderate or higher effect size standardized response mean (SRM) ranging from 0.72 to 1.02 at domain levels. G coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-IBS has good validity, reliability, responsiveness, and some highlights and can be used as the quality of life instrument for patients with IBS.
Validation of the Acoustic Voice Quality Index in the Japanese Language.
Hosokawa, Kiyohito; Barsties, Ben; Iwahashi, Toshihiko; Iwahashi, Mio; Kato, Chieri; Iwaki, Shinobu; Sasai, Hisanori; Miyauchi, Akira; Matsushiro, Naoki; Inohara, Hidenori; Ogawa, Makoto; Maryn, Youri
2017-03-01
The Acoustic Voice Quality Index (AVQI) is a multivariate construct for quantification of overall voice quality based on the analysis of continuous speech and sustained vowel. The stability and validity of the AVQI is well established in several language families. However, the Japanese language has distinct characteristics with respect to several parameters of articulatory and phonatory physiology. The aim of the study was to confirm the criterion-related concurrent validity of AVQI, as well as its responsiveness to change and diagnostic accuracy for voice assessment in the Japanese-speaking population. This is a retrospective study. A total of 336 voice recordings, which included 69 pairs of voice recordings (before and after therapeutic interventions), were eligible for the study. The auditory-perceptual judgment of overall voice quality was evaluated by five experienced raters. The concurrent validity, responsiveness to change, and diagnostic accuracy of the AVQI were estimated. The concurrent validity and responsiveness to change based on the overall voice quality was indicated by high correlation coefficients 0.828 and 0.767, respectively. Receiver operating characteristic analysis revealed an excellent diagnostic accuracy for discrimination between dysphonic and normophonic voices (area under the curve: 0.905). The best threshold level for the AVQI of 3.15 corresponded with a sensitivity of 72.5% and specificity of 95.2%, with the positive and negative likelihood ratios of 15.1 and 0.29, respectively. We demonstrated the validity of the AVQI as a tool for assessment of overall voice quality and that of voice therapy outcomes in the Japanese-speaking population. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Is the Simple Shoulder Test a valid outcome instrument for shoulder arthroplasty?
Hsu, Jason E; Russ, Stacy M; Somerson, Jeremy S; Tang, Anna; Warme, Winston J; Matsen, Frederick A
2017-10-01
The Simple Shoulder Test (SST) is a brief, inexpensive, and widely used patient-reported outcome tool, but it has not been rigorously evaluated for patients having shoulder arthroplasty. The goal of this study was to rigorously evaluate the validity of the SST for outcome assessment in shoulder arthroplasty using a systematic review of the literature and an analysis of its properties in a series of 408 surgical cases. SST scores, 36-Item Short Form Health Survey scores, and satisfaction scores were collected preoperatively and 2 years postoperatively. Responsiveness was assessed by comparing preoperative and 2-year postoperative scores. Criterion validity was determined by correlating the SST with the 36-Item Short Form Health Survey. Construct validity was tested through 5 clinical hypotheses regarding satisfaction, comorbidities, insurance status, previous failed surgery, and narcotic use. Scores after arthroplasty improved from 3.9 ± 2.8 to 10.2 ± 2.3 (P < .001). The change in SST correlated strongly with patient satisfaction (P < .001). The SST had large Cohen's d effect sizes and standardized response means. Criterion validity was supported by significant differences between satisfied and unsatisfied patients, those with more severe and less severe comorbidities, those with workers' compensation or Medicaid and other types of insurance, those with and without previous failed shoulder surgery, and those taking and those not taking narcotic pain medication before surgery (P < .005). These data combined with a systematic review of the literature demonstrate that the SST is a valid and responsive patient-reported outcome measure for assessing the outcomes of shoulder arthroplasty. Copyright © 2017 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Engels, Leopold G J B; Klinkenberg-Knol, Elly C; Carlsson, Jonas; Halling, Katarina
2010-08-17
The Quality of Life in Reflux and Dyspepsia (QOLRAD) questionnaire is one of the best-characterized disease-specific instruments that captures health-related problems and symptom-patterns in patients with gastroesophageal reflux disease (GERD). This paper reports the psychometric validation of a Dutch translation of the QOLRAD questionnaire in gastroenterology outpatients with GERD. Patients completed the QOLRAD questionnaire at visit 1 (baseline), visit 2 (after 2, 4 or 8 weeks of acute treatment with esomeprazole 40 mg once daily), and visit 4 (after 6 months with on-demand esomeprazole 40 mg once daily or continuous esomeprazole 20 mg once daily). Symptoms were assessed at each visit, and patient satisfaction was assessed at visits 2 and 4. Of the 1166 patients entered in the study, 97.3% had moderate or severe heartburn and 55.5% had moderate or severe regurgitation at baseline. At visit 2, symptoms of heartburn and regurgitation were mild or absent in 96.7% and 97.7%, respectively, and 95.3% of patients reported being satisfied with the treatment. The internal consistency and reliability of the QOLRAD questionnaire (range: 0.83-0.92) supported construct validity. Convergent validity was moderate to low. Known-groups validity was confirmed by a negative correlation between the QOLRAD score and clinician-assessed severity of GERD symptoms. Effect sizes (1.15-1.93) and standardized response means (1.17-1.86) showed good responsiveness to change. GERD symptoms had a negative impact on patients' lives. The psychometric characteristics of the Dutch translation of the QOLRAD questionnaire were found to be satisfactory, with good reliability and responsiveness to change, although convergent validity was at best moderate.
ERIC Educational Resources Information Center
Omizo, Michael M.; And Others
1983-01-01
Construct validity data found some support for the California Occupational Preference System constructs when its results were evaluated on a sample of 213 female undergraduates relative to the Vocational Preference Inventory results. (PN)
Measurement Invariance of the UTAUT Constructs in the Caribbean
ERIC Educational Resources Information Center
Thomas, Troy D.; Singh, Lenandlar; Gaffar, Kemuel; Thakur, Dhanaraj; Jackman, Grace-Ann; Thomas, Michael; Gajraj, Roger; Allen, Claudine; Tooma, Keron
2014-01-01
This article employs confirmatory factor analysis to evaluate the factorial validity and the cross-national comparability of the UTAUT constructs with respect to mobile learning in higher education in four Caribbean countries. Except for the measurement of one factor, the UTAUT constructs exhibit adequate reliability and validity. Though full…
de Alwis, Manudul Pahansen; Äng, Björn Olov; Garme, Karl
2017-01-01
Objective High-performance marine craft personnel (HPMCP) are regularly exposed to vibration and repeated shock (VRS) levels exceeding maximum limitations stated by international legislation. Whereas such exposure reportedly is detrimental to health and performance, the epidemiological data necessary to link these adverse effects causally to VRS are not available in the scientific literature, and no suitable tools for acquiring such data exist. This study therefore constructed a questionnaire for longitudinal investigations in HPMCP. Methods A consensus panel defined content domains, identified relevant items and outlined a questionnaire. The relevance and simplicity of the questionnaire’s content were then systematically assessed by expert raters in three consecutive stages, each followed by revisions. An item-level content validity index (I-CVI) was computed as the proportion of experts rating an item as relevant and simple, and a scale-level content validity index (S-CVI/Ave) as the average I-CVI across items. The thresholds for acceptable content validity were 0.78 and 0.90, respectively. Finally, a dynamic web version of the questionnaire was constructed and pilot tested over a 1-month period during a marine exercise in a study population sample of eight subjects, while accelerometers simultaneously quantified VRS exposure. Results Content domains were defined as work exposure, musculoskeletal pain and human performance, and items were selected to reflect these constructs. Ratings from nine experts yielded S-CVI/Ave of 0.97 and 1.00 for relevance and simplicity, respectively, and the pilot test suggested that responses were sensitive to change in acceleration and that the questionnaire, following some adjustments, was feasible for its intended purpose. Conclusions A dynamic web-based questionnaire for longitudinal survey of key variables in HPMCP was constructed. Expert ratings supported that the questionnaire content is relevant, simple and sufficiently comprehensive, and the pilot test suggested that the questionnaire is feasible for longitudinal measurements in the study population. PMID:28729320
Maillé, A R; Koning, C J; Zwinderman, A H; Willems, L N; Dijkman, J H; Kaptein, A A
1997-05-01
Chronic non-specific lung disease (CNSLD) encompasses asthma as well as chronic obstructive pulmonary disease (COPD). Recently in health care, there has been increasing awareness in the functional, psychological and social aspects of the health of patients; their quality of life (QOL). Quality-of-life research addressing CNSLD patients has been rather underdeveloped for a long period of time. Recently, however, the importance of QOL is being increasingly recognized, and several research groups have started to study QOL in CNSLD patients in more detail. This paper describes the construction of a disease-specific QOL instrument for patients with mild to moderately severe CNSLD. Items relating to several domains of QOL were listed, and 171 CNSLD patients in general practice were asked how much of a problem each item had been (assessed on a seven-point Likert scale). After applying an item-selection procedure, a uni-dimensional QOL questionnaire was constructed consisting of 55 items divided into seven domain subscales: breathing problems, physical problems, emotions, situations triggering or enhancing breathing problems, general activities, daily and domestic activities, and social activities, relationships and sexuality. Reliability estimates of the domain subscales of the constructed questionnaire varied from 0.68 to 0.89, and was 0.92 for the QOL for Respiratory Illness Questionnaire (QOL-RIQ) total scale. A first impression of the construct validity of the questionnaire was gained by investigation of the relationship between the QOL domain subscales and several indicators of illness severity, as well as the relative contribution of illness severity variables, background characteristics and symptoms to QOL, using regression analysis. Further research to validate the questionnaire to a greater extent (construct validity, test-retest reliability and responsiveness to change) is currently taking place.
Analysis of the construct of dignity and content validity of the patient dignity inventory
2011-01-01
Background Maintaining dignity, the quality of being worthy of esteem or respect, is considered as a goal of palliative care. The aim of this study was to analyse the construct of personal dignity and to assess the content validity of the Patient Dignity Inventory (PDI) in people with an advance directive in the Netherlands. Methods Data were collected within the framework of an advance directives cohort study. This cohort study is aiming to get a better insight into how decisions are made at the end of life with regard to advance directives in the Netherlands. One half of the cohort (n = 2404) received an open-ended question concerning factors relevant to dignity. Content labels were assigned to issues mentioned in the responses to the open-ended question. The other half of the cohort (n = 2537) received a written questionnaire including the PDI. The relevance and comprehensiveness of the PDI items were assessed with the COSMIN checklist ('COnsensus-based Standards for the selection of health status Measurement INstruments'). Results The majority of the PDI items were found to be relevant for the construct to be measured, the study population, and the purpose of the study but the items were not completely comprehensive. The responses to the open-ended question indicated that communication and care-related aspects were also important for dignity. Conclusions This study demonstrated that the PDI items were relevant for people with an advance directive in the Netherlands. The comprehensiveness of the items can be improved by including items concerning communication and care. PMID:21682924
Analysis of the construct of dignity and content validity of the patient dignity inventory.
Albers, Gwenda; Pasman, H Roeline W; Rurup, Mette L; de Vet, Henrica C W; Onwuteaka-Philipsen, Bregje D
2011-06-19
Maintaining dignity, the quality of being worthy of esteem or respect, is considered as a goal of palliative care. The aim of this study was to analyse the construct of personal dignity and to assess the content validity of the Patient Dignity Inventory (PDI) in people with an advance directive in the Netherlands. Data were collected within the framework of an advance directives cohort study. This cohort study is aiming to get a better insight into how decisions are made at the end of life with regard to advance directives in the Netherlands. One half of the cohort (n = 2404) received an open-ended question concerning factors relevant to dignity. Content labels were assigned to issues mentioned in the responses to the open-ended question. The other half of the cohort (n = 2537) received a written questionnaire including the PDI. The relevance and comprehensiveness of the PDI items were assessed with the COSMIN checklist ('COnsensus-based Standards for the selection of health status Measurement INstruments'). The majority of the PDI items were found to be relevant for the construct to be measured, the study population, and the purpose of the study but the items were not completely comprehensive. The responses to the open-ended question indicated that communication and care-related aspects were also important for dignity. This study demonstrated that the PDI items were relevant for people with an advance directive in the Netherlands. The comprehensiveness of the items can be improved by including items concerning communication and care.
Callousness and affective face processing in adults: Behavioral and brain-potential indicators.
Brislin, Sarah J; Yancey, James R; Perkins, Emily R; Palumbo, Isabella M; Drislane, Laura E; Salekin, Randall T; Fanti, Kostas A; Kimonis, Eva R; Frick, Paul J; Blair, R James R; Patrick, Christopher J
2018-03-01
The investigation of callous-unemotional (CU) traits has been central to contemporary research on child behavior problems, and served as the impetus for inclusion of a specifier for conduct disorder in the latest edition of the official psychiatric diagnostic system. Here, we report results from 2 studies that evaluated the construct validity of callousness as assessed in adults, by testing for affiliated deficits in behavioral and neural processing of fearful faces, as have been shown in youthful samples. We hypothesized that scores on an established measure of callousness would predict reduced recognition accuracy and diminished electocortical reactivity for fearful faces in adult participants. In Study 1, 66 undergraduate participants performed an emotion recognition task in which they viewed affective faces of different types and indicated the emotion expressed by each. In Study 2, electrocortical data were collected from 254 adult twins during viewing of fearful and neutral face stimuli, and scored for event-related response components. Analyses of Study 1 data revealed that higher callousness was associated with decreased recognition accuracy for fearful faces specifically. In Study 2, callousness was associated with reduced amplitude of both N170 and P200 responses to fearful faces. Current findings demonstrate for the first time that callousness in adults is associated with both behavioral and physiological deficits in the processing of fearful faces. These findings support the validity of the CU construct with adults and highlight the possibility of a multidomain measurement framework for continued study of this important clinical construct. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Utility of pedometers for assessing physical activity: construct validity.
Tudor-Locke, Catrine; Williams, Joel E; Reis, Jared P; Pluto, Delores
2004-01-01
Valid assessment of physical activity is necessary to fully understand this important health-related behaviour for research, surveillance, intervention and evaluation purposes. This article is the second in a companion set exploring the validity of pedometer-assessed physical activity. The previous article published in Sports Medicine dealt with convergent validity (i.e. the extent to which an instrument's output is associated with that of other instruments intended to measure the same exposure of interest). The present focus is on construct validity. Construct validity is the extent to which the measurement corresponds with other measures of theoretically-related parameters. Construct validity is typically evaluated by correlational analysis, that is, the magnitude of concordance between two measures (e.g. pedometer-determined steps/day and a theoretically-related parameter such as age, anthropometric measures and fitness). A systematic literature review produced 29 articles published since > or =1980 directly relevant to construct validity of pedometers in relation to age, anthropometric measures and fitness. Reported correlations were combined and a median r-value was computed. Overall, there was a weak inverse relationship (median r = -0.21) between age and pedometer-determined physical activity. A weak inverse relationship was also apparent with both body mass index and percentage overweight (median r = -0.27 and r = -0.22, respectively). Positive relationships regarding indicators of fitness ranged from weak to moderate depending on the fitness measure utilised: 6-minute walk test (median r = 0.69), timed treadmill test (median r = 0.41) and estimated maximum oxygen uptake (median r = 0.22). Studies are warranted to assess the relationship of pedometer-determined physical activity with other important health-related outcomes including blood pressure and physiological parameters such as blood glucose and lipid profiles. The aggregated evidence of convergent validity (presented in the previous companion article) and construct validity herein provides support for considering simple and inexpensive pedometers in both research and practice.
Construct Validation of the Behavior and Instructional Management Scale
ERIC Educational Resources Information Center
Martin, Nancy K.; Sass, Daniel A.
2010-01-01
Beliefs related to classroom management vary among teachers and play an important role in classrooms. Despite the importance of this construct, valid measures have proven difficult to develop. This study evaluated the psychometric properties of the Behavior and Instructional Management Scale (BIMS), a short but valid measure of teachers'…
Development and Construct Validation of the Mentor Behavior Scale
ERIC Educational Resources Information Center
Brodeur, Pascale; Larose, Simon; Tarabulsy, George; Feng, Bei; Forget-Dubois, Nadine
2015-01-01
Researchers suggest that certain supportive behaviors of mentors could increase the benefits of school-based mentoring for youth. However, the literature contains few validated instruments to measure these behaviors. In our present study, we aimed to construct and validate a tool to measure the supportive behaviors of mentors participating in…
Development and validation of the occupational identity scale.
Melgosa, J
1987-12-01
Ego-identity research utilizing Marcia's (1966) identity statuses has been prolific during the past 15 years. The four types of statuses--achievement, moratorium, foreclosure, diffusion--have become part of the ego-identity development theory. The development of a research tool to study further one of the dimensions of ego-identity development (occupational dimension) was perceived as a need. Therefore, items were created utilizing the criteria established by previous research and content validated by a group of experts. These statements were validated by 417 students from six high schools and colleges. Responses were analyzed and measures of construct and concurrent validity were obtained. Also indexes of internal consistency and item discrimination were estimated. Through factor analysis techniques, four factors were identified for the occupational identity statuses. They accounted for 49 per cent of the total variance. Reliability coefficients ranged between 0.70 and 0.87. Concurrent validity coefficients ranged between 0.38 and 0.79, when correlated with a similar instrument. After deletion of those items that did not contribute significantly to the validity of the instrument, a 28-item Occupational Identity Scale was established.
Peterson, Alexander C; Sutherland, Jason M; Liu, Guiping; Crump, R Trafford; Karimuddin, Ahmer A
2018-06-01
The Fecal Incontinence Quality of Life Scale (FIQL) is a commonly used patient-reported outcome measure for fecal incontinence, often used in clinical trials, yet has not been validated in English since its initial development. This study uses modern methods to thoroughly evaluate the psychometric characteristics of the FIQL and its potential for differential functioning by gender. This study analyzed prospectively collected patient-reported outcome data from a sample of patients prior to colorectal surgery. Patients were recruited from 14 general and colorectal surgeons in Vancouver Coastal Health hospitals in Vancouver, Canada. Confirmatory factor analysis was used to assess construct validity. Item response theory was used to evaluate test reliability, describe item-level characteristics, identify local item dependence, and test for differential functioning by gender. 236 patients were included for analysis, with mean age 58 and approximately half female. Factor analysis failed to identify the lifestyle, coping, depression, and embarrassment domains, suggesting lack of construct validity. Items demonstrated low difficulty, indicating that the test has the highest reliability among individuals who have low quality of life. Five items are suggested for removal or replacement. Differential test functioning was minimal. This study has identified specific improvements that can be made to each domain of the Fecal Incontinence Quality of Life Scale and to the instrument overall. Formatting, scoring, and instructions may be simplified, and items with higher difficulty developed. The lifestyle domain can be used as is. The embarrassment domain should be significantly revised before use.
ERIC Educational Resources Information Center
Maiano, Christophe; Begarie, Jerome; Morin, Alexandre J. S.; Garbarino, Jean-Marie; Ninot, Gregory
2010-01-01
The purpose of this study was to test the reliability (i.e. internal consistency and test-retest reliability) and construct validity (i.e. content validity, factor validity, measurement invariance, and latent mean invariance) of the Nutrition and Activity Knowledge Scale (NAKS) in a sample of French adolescents with mild to moderate Intellectual…
Sirimanna, Pramudith; Gladman, Marc A
2017-10-01
Proficiency-based virtual reality (VR) training curricula improve intraoperative performance, but have not been developed for laparoscopic appendicectomy (LA). This study aimed to develop an evidence-based training curriculum for LA. A total of 10 experienced (>50 LAs), eight intermediate (10-30 LAs) and 20 inexperienced (<10 LAs) operators performed guided and unguided LA tasks on a high-fidelity VR simulator using internationally relevant techniques. The ability to differentiate levels of experience (construct validity) was measured using simulator-derived metrics. Learning curves were analysed. Proficiency benchmarks were defined by the performance of the experienced group. Intermediate and experienced participants completed a questionnaire to evaluate the realism (face validity) and relevance (content validity). Of 18 surgeons, 16 (89%) considered the VR model to be visually realistic and 17 (95%) believed that it was representative of actual practice. All 'guided' modules demonstrated construct validity (P < 0.05), with learning curves that plateaued between sessions 6 and 9 (P < 0.01). When comparing inexperienced to intermediates to experienced, the 'unguided' LA module demonstrated construct validity for economy of motion (5.00 versus 7.17 versus 7.84, respectively; P < 0.01) and task time (864.5 s versus 477.2 s versus 352.1 s, respectively, P < 0.01). Construct validity was also confirmed for number of movements, path length and idle time. Validated modules were used for curriculum construction, with proficiency benchmarks used as performance goals. A VR LA model was realistic and representative of actual practice and was validated as a training and assessment tool. Consequently, the first evidence-based internationally applicable training curriculum for LA was constructed, which facilitates skill acquisition to proficiency. © 2017 Royal Australasian College of Surgeons.
Yen, Po-Yin; Sousa, Karen H; Bakken, Suzanne
2014-01-01
Background In a previous study, we developed the Health Information Technology Usability Evaluation Scale (Health-ITUES), which is designed to support customization at the item level. Such customization matches the specific tasks/expectations of a health IT system while retaining comparability at the construct level, and provides evidence of its factorial validity and internal consistency reliability through exploratory factor analysis. Objective In this study, we advanced the development of Health-ITUES to examine its construct validity and predictive validity. Methods The health IT system studied was a web-based communication system that supported nurse staffing and scheduling. Using Health-ITUES, we conducted a cross-sectional study to evaluate users’ perception toward the web-based communication system after system implementation. We examined Health-ITUES's construct validity through first and second order confirmatory factor analysis (CFA), and its predictive validity via structural equation modeling (SEM). Results The sample comprised 541 staff nurses in two healthcare organizations. The CFA (n=165) showed that a general usability factor accounted for 78.1%, 93.4%, 51.0%, and 39.9% of the explained variance in ‘Quality of Work Life’, ‘Perceived Usefulness’, ‘Perceived Ease of Use’, and ‘User Control’, respectively. The SEM (n=541) supported the predictive validity of Health-ITUES, explaining 64% of the variance in intention for system use. Conclusions The results of CFA and SEM provide additional evidence for the construct and predictive validity of Health-ITUES. The customizability of Health-ITUES has the potential to support comparisons at the construct level, while allowing variation at the item level. We also illustrate application of Health-ITUES across stages of system development. PMID:24567081
Luna-Lario, P; Pena, J; Ojeda, N
2017-04-16
To perform an in-depth examination of the construct validity and the ecological validity of the Wechsler Memory Scale-III (WMS-III) and the Spain-Complutense Verbal Learning Test (TAVEC). The sample consists of 106 adults with acquired brain injury who were treated in the Area of Neuropsychology and Neuropsychiatry of the Complejo Hospitalario de Navarra and displayed memory deficit as the main sequela, measured by means of specific memory tests. The construct validity is determined by examining the tasks required in each test over the basic theoretical models, comparing the performance according to the parameters offered by the tests, contrasting the severity indices of each test and analysing their convergence. The external validity is explored through the correlation between the tests and by using regression models. According to the results obtained, both the WMS-III and the TAVEC have construct validity. The TAVEC is more sensitive and captures not only the deficits in mnemonic consolidation, but also in the executive functions involved in memory. The working memory index of the WMS-III is useful for predicting the return to work at two years after the acquired brain injury, but none of the instruments anticipates the disability and dependence at least six months after the injury. We reflect upon the construct validity of the tests and their insufficient capacity to predict functionality when the sequelae become chronic.
Hu, Yinhuan; Zhang, Zixia; Xie, Jinzhu; Wang, Guanping
2017-02-01
The objective of this study is to describe the development of the Outpatient Experience Questionnaire (OPEQ) and to assess the validity and reliability of the scale. Literature review, patient interviews, Delphi method and Cross-sectional validation survey. Six comprehensive public hospitals in China. The survey was carried out on a sample of 600 outpatients. Acceptability of the questionnaire was assessed according to the overall response rate, item non-response rate and the average completion time. Correlation coefficients and confirmatory factor analysis were used to test construct validity. Delphi method was used to assess the content validity of the questionnaire. Cronbach's coefficient alpha and split-half reliability coefficient were used to estimate the internal reliability of the questionnaire. The overall response rate was 97.2% and the item non-response rate ranged from 0% to 0.3%. The mean completion time was 6 min. The Spearman correlations of item-total score ranged from 0.466 to 0.765. The results of confirmatory factor analysis showed that all items had factor loadings above 0.40 and the dimension intercorrelation ranged from 0.449 to 0.773, the goodness of fit of the questionnaire was reasonable. The overall authority grade of expert consultation was 0.80 and Kendall's coefficient of concordance W was 0.186. The Cronbach's coefficients alpha of six dimensions ranged from 0.708 to 0.895, the split-half reliability coefficient (Spearman-Brown coefficient) was 0.969. The OPEQ is a promising instrument covering the most important aspects which influence outpatient experiences of comprehensive public hospital in China. It has good evidence for acceptability, validity and reliability. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
Kim, J K; Lim, H M
2015-02-01
The purpose of this study was to translate and culturally adapt the Carpal Tunnel Questionnaire to produce an equivalent Korean version. A total of 53 patients completed the Korean version of the Carpal Tunnel Questionnaire pre-operatively and 3 months after open carpal tunnel release. All 53 also completed the Korean version of the Disabilities of Arm, Shoulder, and Hand questionnaire pre-operatively and 3 months post-operatively. Reliability was measured by determining the test-retest reliability and internal consistency. Test-retest reliability was assessed using intraclass correlation coefficients and paired t-tests, and internal consistency using Cronbach's alpha coefficients. Pearson correlation analysis was carried out on the Korean version of the Carpal Tunnel Questionnaire scores and the Korean version of the Disabilities of Arm, Shoulder, and Hand scores to assess construct validity. Responsiveness was evaluated using effect sizes and standardized response means. The reliability of the Korean version of the Carpal Tunnel Questionnaire was good. The scores in the Korean version of the Disabilities of Arm, Shoulder, and Hand strongly correlated with the scores in the Korean version of the Carpal Tunnel Questionnaire. Standardized response mean and effect size were both large for the Korean version of the Carpal Tunnel Questionnaire. The study shows that the Korean version of the Carpal Tunnel Questionnaire is a reliable, valid and responsive instrument for measuring outcomes in carpal tunnel syndrome. © The Author(s) 2014.
Weiss, Maureen R; Bolter, Nicole D; Kipp, Lindsay E
2014-09-01
A signature characteristic of positive youth development (PYD) programs is the opportunity to develop life skills, such as social, behavioral, and moral competencies, that can be generalized to domains beyond the immediate activity. Although context-specific instruments are available to assess developmental outcomes, a measure of life skills transfer would enable evaluation of PYD programs in successfully teaching skills that youth report using in other domains. The purpose of our studies was to develop and validate a measure of perceived life skills transfer, based on data collected with The First Tee, a physical activity-based PYD program. In 3 studies, we conducted a series of steps to provide content and construct validity and internal consistency reliability for the Life Skills Transfer Survey (LSTS), a measure of perceived life skills transfer. Study 1 provided content validity for the LSTS that included 8 life skills and 50 items. Study 2 revealed construct validity (structural validity) through a confirmatory factor analysis and convergent validity by correlating scores on the LSTS with scores on an assessment tool that measures a related construct. Study 3 offered additional construct validity by reassessing youth 1 year later and showing that scores during both time periods were invariant in factor pattern, loadings, and variances and covariances. Studies 2 and 3 demonstrated internal consistency reliability of the LSTS. RESULTS from 3 studies provide evidence of content and construct validity and internal consistency reliability for the LSTS, which can be used in evaluation research with youth development programs.
Harding, Gale; Schein, Jeff R; Nelson, Winnie W; Vallow, Sue; Olson, William H; Hewitt, David J; Polomano, Rosemary C
2010-03-01
To describe the development and psychometric evaluation of a questionnaire assessing the ease of use that patients associate with patient-controlled analgesia (PCA) modalities. Qualitative interviews were conducted with patients who had experience with intravenous (IV) PCA for postoperative pain management to generate items relevant to the ease of using PCA modalities. The content validity of the resulting questionnaire was examined through follow-up patient interviews, and an expert panel reviewed the questionnaire. Cognitive debriefing interviews were conducted with patients to determine the clarity and content of the instructions, items, and response scales, and the ease of completing the instrument. Psychometric evaluation was performed with patients who had undergone surgery and received IV PCA for postoperative pain management. Item and scale quality and the internal consistency reliability of the questionnaire were assessed. Construct validity was evaluated by examining the relationship between subscales of the questionnaire with patient-reported outcome measures. Known-groups validity was determined by assessing the instrument's ability to differentiate between patients with versus without an IV PCA problem. A potential limitation of this study was the exclusive sampling of patients who had experience with IV PCA. The Patient Ease-of-Care (EOC) Questionnaire included 23 items in the following subscales: Confidence with Device, Comfort with Device, Movement, Dosing Confidence, Pain Control, Knowledge/Understanding, and Satisfaction. Coefficient alpha reliability estimates were ≥ 0.66 for Overall EOC (includes all subscales except Satisfaction) and all EOC subscales. Construct validity was supported by the moderate relationship between the Pain Control subscale and measures of pain severity and pain interference; additional evidence of construct validity was provided by correlations of the Confidence with Device subscale, the Satisfaction subscale, and Overall EOC with measures of pain severity, pain interference, and satisfaction. Significant mean score differences were reported between participants with and without IV PCA problems for Overall EOC and for the Comfort with Device, Confidence with Device, Movement, Pain Control, and Satisfaction subscales indicating known-groups validity. Results provide evidence for the reliability and validity of the Patient EOC Questionnaire as a measure of the ease of use that patients associate with PCA systems and may be useful for evaluating emerging PCA modalities.
Wong, Carlos K H; Lang, Brian H H; Yu, Hill M S; Lam, Cindy L K
2017-08-01
The aim of this study was to examine the acceptability, validity, and reliability of the EuroQoL Five-Dimension Five-Level (EQ-5D-5L) and Short-Form Six-Dimension (SF-6D) health utility measures in patients with symptomatic benign thyroid nodules. Data from a randomized controlled trial (ClinicalTrials.gov identifier: NCT02398721) of 294 patients with symptomatic benign thyroid nodules were utilized for this psychometric evaluation of health-related quality of life (HR-QOL) measurement. Three HR-QOL questionnaires-the generic 12-item Short Form Health Survey (SF-12v2), EQ-5D-5L, and SF-6D-were interviewer-administered at baseline and 2 weeks afterwards. Responses to SF-6D were transformed to SF-6D utility scores using a Hong Kong population scoring algorithm derived by standard gamble, whereas responses to EQ-5D-5L were mapped onto EQ-5D-3L response via interim mapping algorithms and then converted to EQ-5D-5L utility scores using a Chinese-specific value set. Construct validity was determined by evaluating Spearman correlation between SF-12v2 scores and utility scores. Two-week test-retest reliability was assessed using intra-class correlation coefficient. No significant (>15%) floor and ceiling effects were observed for SF-6D utility scores. The SF-6D utility scores had a moderate Spearman rank correlation with the SF-12v2 domain score providing evidence for adequate construct validity. The SF-6D utility scores showed good test-retest reliability (0.794; range 0.696-0.860). Better reliability was observed in SF-6D utility scores than in EQ-5D-5L utility scores. While the EQ-5D-5L instrument was less reproducible, the SF-6D instrument appeared to be an applicable, valid, and reliable measure in assessing the HR-QOL of Chinese patients with symptomatic benign thyroid nodules. The impact of utility score selection on the effectiveness and cost effectiveness of clinical interventions targeted to these patients needs further exploration. NCT02398721, ClinicalTrials.gov.
Clinimetrics and clinical psychometrics: macro- and micro-analysis.
Tomba, Elena; Bech, Per
2012-01-01
Clinimetrics was introduced three decades ago to specify the domain of clinical markers in clinical medicine (indexes or rating scales). In this perspective, clinical validity is the platform for selecting the various indexes or rating scales (macro-analysis). Psychometric validation of these indexes or rating scales is the measuring aspect (micro-analysis). Clinical judgment analysis by experienced psychiatrists is included in the macro-analysis and the item response theory models are especially preferred in the micro-analysis when using the total score as a sufficient statistic. Clinical assessment tools covering severity of illness scales, prognostic measures, issues of co-morbidity, longitudinal assessments, recovery, stressors, lifestyle, psychological well-being, and illness behavior have been identified. The constructive dialogue in clinimetrics between clinical judgment and psychometric validation procedures is outlined for generating developments of clinical practice in psychiatry. Copyright © 2012 S. Karger AG, Basel.
The Construct Validity of Language Aptitude: A Meta-Analysis
ERIC Educational Resources Information Center
Li, Shaofeng
2016-01-01
A meta-analysis was conducted to examine the construct validity of language aptitude by synthesizing the existing research that has been accumulated over the past five decades. The study aimed to provide a thorough understanding of the construct by aggregating the data reported in the primary research on its correlations with other individual…
ERIC Educational Resources Information Center
Martin, Andrew J.
2009-01-01
From a developmental construct validity perspective, this study examines motivation and engagement across elementary school, high school, and university/college, with particular focus on the Motivation and Engagement Scale (comprising adaptive, impeding/maladaptive, and maladaptive factors). Findings demonstrated developmental construct validity…
Construct Validity of the Anxiety Sensitivity Index-3 in Clinical Samples
ERIC Educational Resources Information Center
Kemper, Christoph J.; Lutz, Johannes; Bahr, Tobias; Ruddel, Heinz; Hock, Michael
2012-01-01
Using two clinical samples of patients, the presented studies examined the construct validity of the recently revised Anxiety Sensitivity Index-3 (ASI-3). Confirmatory factor analyses established a clear three-factor structure that corresponds to the postulated subdivision of the construct into correlated somatic, social, and cognitive components.…
Selection of Marine Corps Drill Instructors
1980-03-01
8 4. ., ey- Construction and Cross-Validation Statistics for Drill Instructor School Performance Success Keys...Race, and School Attrition ........... ............................. ... 15 13. Key- Construction and Cross-Validation Statistics for Drill... constructed form, the Alternation Ranking of Series Drill Instruc- tors. In this form, DIs in a Series are ranked from highest to lowest in terms of their
The development and psychometric validation of the Ethical Awareness Scale.
Milliken, Aimee; Ludlow, Larry; DeSanto-Madeya, Susan; Grace, Pamela
2018-04-19
To develop and psychometrically assess the Ethical Awareness Scale using Rasch measurement principles and a Rasch item response theory model. Critical care nurses must be equipped to provide good (ethical) patient care. This requires ethical awareness, which involves recognizing the ethical implications of all nursing actions. Ethical awareness is imperative in successfully addressing patient needs. Evidence suggests that the ethical import of everyday issues may often go unnoticed by nurses in practice. Assessing nurses' ethical awareness is a necessary first step in preparing nurses to identify and manage ethical issues in the highly dynamic critical care environment. A cross-sectional design was used in two phases of instrument development. Using Rasch principles, an item bank representing nursing actions was developed (33 items). Content validity testing was performed. Eighteen items were selected for face validity testing. Two rounds of operational testing were performed with critical care nurses in Boston between February-April 2017. A Rasch analysis suggests sufficient item invariance across samples and sufficient construct validity. The analysis further demonstrates a progression of items uniformly along a hierarchical continuum; items that match respondent ability levels; response categories that are sufficiently used; and adequate internal consistency. Mean ethical awareness scores were in the low/moderate range. The results suggest the Ethical Awareness Scale is a psychometrically sound, reliable and valid measure of ethical awareness in critical care nurses. © 2018 John Wiley & Sons Ltd.
Development of the Assessment of Belief Conflict in Relationship-14 (ABCR-14).
Kyougoku, Makoto; Teraoka, Mutsumi; Masuda, Noriko; Ooura, Mariko; Abe, Yasushi
2015-01-01
Nurses and other healthcare workers frequently experience belief conflict, one of the most important, new stress-related problems in both academic and clinical fields. In this study, using a sample of 1,683 nursing practitioners, we developed The Assessment of Belief Conflict in Relationship-14 (ABCR-14), a new scale that assesses belief conflict in the healthcare field. Standard psychometric procedures were used to develop and test the scale, including a qualitative framework concept and item-pool development, item reduction, and scale development. We analyzed the psychometric properties of ABCR-14 according to entropy, polyserial correlation coefficient, exploratory factor analysis, confirmatory factor analysis, average variance extracted, Cronbach's alpha, Pearson product-moment correlation coefficient, and multidimensional item response theory (MIRT). The results of the analysis supported a three-factor model consisting of 14 items. The validity and reliability of ABCR-14 was suggested by evidence from high construct validity, structural validity, hypothesis testing, internal consistency reliability, and concurrent validity. The result of the MIRT offered strong support for good item response of item slope parameters and difficulty parameters. However, the ABCR-14 Likert scale might need to be explored from the MIRT point of view. Yet, as mentioned above, there is sufficient evidence to support that ABCR-14 has high validity and reliability. The ABCR-14 demonstrates good psychometric properties for nursing belief conflict. Further studies are recommended to confirm its application in clinical practice.
The Development and Validation of the Indian Family Violence and Control Scale
Kalokhe, Ameeta S.; Stephenson, Rob; Kelley, Mary E.; Dunkle, Kristin L.; Paranjape, Anuradha; Solas, Vikram; Karve, Latika; del Rio, Carlos; Sahay, Seema
2016-01-01
The high prevalence of domestic violence (DV) among married women in India and associated negative health repercussions highlight the need for effective prevention strategies and tools to measure the efficacy of such interventions. Literature supporting differing manifestations of DV by culture underscores the need for a culturally-tailored scale to more effectively measure DV in the Indian context. We therefore aimed to develop and validate such a tool, the Indian Family Violence and Control Scale (IFVCS), through a mixed-methods study. The psychometric development of IFVCS is herein discussed. After field pre-testing and expert review, a 63-item questionnaire was administered to a random sample of 630 married women from May-July 2013 in Pune, India. The item response theory approach for binary data to explore the IFVCS structure suggested that IFVCS is reliable, with the majority of items having high (>0.5) and significant factor loadings. Concurrent validity, assessed by comparing responses to IFVCS with the validated, abridged Conflict Tactics Scale-2, was high (r = 0.899, p<0.001) as was the construct validity, demonstrated by its significant association with several established DV correlates. Therefore, initial assessment of the IFVCS psychometric properties suggests that it is an effective tool for measuring DV among married women in India and speaks to its capacity for enhancing understanding of DV epidemiology and for evaluating the effectiveness of future DV interventions. PMID:26824611
The Development and Validation of the Indian Family Violence and Control Scale.
Kalokhe, Ameeta S; Stephenson, Rob; Kelley, Mary E; Dunkle, Kristin L; Paranjape, Anuradha; Solas, Vikram; Karve, Latika; del Rio, Carlos; Sahay, Seema
2016-01-01
The high prevalence of domestic violence (DV) among married women in India and associated negative health repercussions highlight the need for effective prevention strategies and tools to measure the efficacy of such interventions. Literature supporting differing manifestations of DV by culture underscores the need for a culturally-tailored scale to more effectively measure DV in the Indian context. We therefore aimed to develop and validate such a tool, the Indian Family Violence and Control Scale (IFVCS), through a mixed-methods study. The psychometric development of IFVCS is herein discussed. After field pre-testing and expert review, a 63-item questionnaire was administered to a random sample of 630 married women from May-July 2013 in Pune, India. The item response theory approach for binary data to explore the IFVCS structure suggested that IFVCS is reliable, with the majority of items having high (>0.5) and significant factor loadings. Concurrent validity, assessed by comparing responses to IFVCS with the validated, abridged Conflict Tactics Scale-2, was high (r = 0.899, p<0.001) as was the construct validity, demonstrated by its significant association with several established DV correlates. Therefore, initial assessment of the IFVCS psychometric properties suggests that it is an effective tool for measuring DV among married women in India and speaks to its capacity for enhancing understanding of DV epidemiology and for evaluating the effectiveness of future DV interventions.
The Hip Sports Activity Scale (HSAS) for patients with femoroacetabular impingement.
Naal, Florian D; Miozzari, Hermes H; Kelly, Bryan T; Magennis, Erin M; Leunig, Michael; Noetzli, Hubert P
2013-01-01
To develop and validate a sports activity scale for patients with a diagnosis of femoroacetabular impingement (FAI). A nine level Hip Sports Activity Scale (HSAS) was constructed both in German and English languages. Fifty-nine consecutive patients undergoing surgical treatment for FAI at two centers in Switzerland and in the US completed a questionnaire set consisting of the HSAS, the University of California at Los Angeles (UCLA) activity scale and different hip joint-specific and generic outcome tools. For reliability assessment, the HSAS was completed twice about nine days apart. Evidence of reliability, validity and responsiveness was investigated by classical psychometric analyses. Reliability was excellent for both the German and the English versions with intraclass correlation coefficients of 0.94 and 0.96, respectively. Evidence of convergent validity was supported by moderate to high correlations with the UCLA activity scale and with the joint-specific measures used. Evidence of divergent validity was supported by low correlations with the SF-12 Mental Component Scale and the WOMAC stiffness subscale. The standardised response mean was 0.69. The HSAS is a reliable and valid tool to determine sports levels in patients suffering from FAI. Its use in future studies investigating outcomes in young patients with hip disease can be recommended. Level III, Diagnostic Studies - An independent, masked comparison with an appropriate population of patients, but reference standard not applied to all study patients.
Development and validation of a fatigue assessment scale for U.S. construction workers.
Zhang, Mingzong; Sparer, Emily H; Murphy, Lauren A; Dennerlein, Jack T; Fang, Dongping; Katz, Jeffrey N; Caban-Martinez, Alberto J
2015-02-01
To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Using a two-phased approach, we first identified items (first phase) for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n = 11) and focus groups (three groups with six workers each) with construction workers. The second phase included assessment for the reliability, validity, and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n = 144). Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales ("Lethargy" and "Bodily Ailment"). During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW [0.91], Lethargy [0.86] and Bodily Ailment [0.84]) and acceptable test-retest reliability (Pearson Correlations Coefficients: 0.59-0.68; Intraclass Correlation Coefficients: 0.74-0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. © 2015 Wiley Periodicals, Inc.
Nemoto, Hitoshi; Watson, Deborah; Masuda, Koichi
2015-01-01
Tissue engineering holds great promise for cartilage repair with minimal donor-site morbidity. The in vivo maturation of a tissue-engineered construct can be tested in the subcutaneous tissues of the same species for autografts or of immunocompromised animals for allografts or xenografts. This section describes detailed protocols for the surgical transplantation of a tissue-engineered construct into an animal model to assess construct validity.