Construct Validity of the Anxiety Sensitivity Index-3 in Clinical Samples
ERIC Educational Resources Information Center
Kemper, Christoph J.; Lutz, Johannes; Bahr, Tobias; Ruddel, Heinz; Hock, Michael
2012-01-01
Using two clinical samples of patients, the presented studies examined the construct validity of the recently revised Anxiety Sensitivity Index-3 (ASI-3). Confirmatory factor analyses established a clear three-factor structure that corresponds to the postulated subdivision of the construct into correlated somatic, social, and cognitive components.…
Nambi, S Gopal
2013-01-01
The most common instrument developed to assess the functional status of patients with non-specific low back pain is the Roland-Morris Disability Questionnaire (RMDQ). Clinical and epidemiological research related to low back pain in the Gujarati population would be facilitated by the availability of well-established outcome measures. The aim was to determine the reliability, validity, sensitivity and specificity of the Gujarati version of the RMDQ for use in non-specific chronic low back pain. This was a reliability, validity, sensitivity and specificity study of the Gujarati version of the RMDQ. Thirty outpatients with non-specific chronic low back pain were assessed with the RMDQ. Reliability was assessed using internal consistency and the intra-class correlation coefficient (ICC). Internal construct validity was assessed by Rasch analysis and external construct validity by association with pain and spinal movement. A clinical calculator was used to determine sensitivity and specificity. Internal consistency of the RMDQ was adequate (> 0.65) at both time points, and ICCs were also high at both time points. Internal construct validity of the scale was good, indicating a single underlying construct. Expected associations with pain and spinal movement confirmed external construct validity. At a cut-off point of 0.5, sensitivity and specificity were 80% and 84%, respectively, with a positive predictive value (PPV) of 83.33% and a negative predictive value (NPV) of 80.76%. The questionnaire is at the ordinal level. The RMDQ is a one-dimensional, ordinal measure that works well in the Gujarati population.
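The four accuracy figures in this abstract all derive from a single 2x2 table of test result against reference diagnosis. The sketch below shows that arithmetic in Python. The function name and the counts are hypothetical: the counts were chosen only so that the ratios land close to the reported percentages, and they are not the study's data (the abstract describes thirty patients).

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return sensitivity, specificity, PPV and NPV from 2x2 counts.

    tp/fn: reference-positive cases the test flags / misses.
    tn/fp: reference-negative cases the test clears / flags.
    """
    return {
        "sensitivity": tp / (tp + fn),  # true positives among all reference positives
        "specificity": tn / (tn + fp),  # true negatives among all reference negatives
        "ppv": tp / (tp + fp),          # reference positives among all test positives
        "npv": tn / (tn + fn),          # reference negatives among all test negatives
    }


# Hypothetical counts for illustration only (NOT taken from the study).
metrics = diagnostic_metrics(tp=20, fp=4, fn=5, tn=21)
for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

With these illustrative counts the NPV works out to 21/26, which rounds to 80.77%; the abstract's 80.76% would be consistent with truncation rather than rounding, though the abstract does not say how the figure was obtained.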
Asilian-Mahabadi, Hassan; Khosravi, Yahya; Hassanzadeh-Rangi, Narmin; Hajizadeh, Ebrahim; Behzadan, Amir H
2018-02-05
Occupational safety in general, and construction safety in particular, is a complex phenomenon. This study was designed to develop a new valid measure to evaluate factors affecting unsafe behavior in the construction industry. A new questionnaire was generated from qualitative research according to the principles of grounded theory. Key measurement properties (face validity, content validity, construct validity, reliability and discriminative validity) were examined using qualitative and quantitative approaches. The receiver operating characteristic curve was used to estimate the discriminating power and the optimal cutoff score. Construct validity revealed an interpretable 12-factor structure which explained 61.87% of variance. Good internal consistency (Cronbach's α = 0.94) and stability (intra-class correlation coefficient = 0.93) were found for the new instrument. The area under the curve, sensitivity and specificity were 0.80, 0.80 and 0.75, respectively. The new instrument also discriminated safety performance among the construction sites with different workers' accident histories (F = 6.40, p < 0.05). The new instrument appears to be a valid, reliable and sensitive instrument that will contribute to investigating the root causes of workers' unsafe behaviors, thus promoting safety performance in the construction industry.
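Several records in this listing report internal consistency as Cronbach's α. As a minimal sketch of how that coefficient is computed from a respondents-by-items score matrix, the Python below uses the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores). The function name and the sample responses are hypothetical and unrelated to any study listed here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    items = list(zip(*scores))                           # transpose: one tuple per item
    item_var_sum = sum(variance(col) for col in items)   # sum of sample item variances
    total_var = variance([sum(row) for row in scores])   # sample variance of total scores
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical responses: five respondents, three items.
responses = [
    [3, 4, 3],
    [2, 2, 3],
    [5, 4, 5],
    [1, 2, 1],
    [4, 4, 4],
]
print(round(cronbach_alpha(responses), 3))
```

Alpha rises with the number of items as well as with inter-item correlation, which is one reason a high value alone does not establish that a scale is unidimensional.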
ERIC Educational Resources Information Center
Vujanovic, Anka A.; Arrindell, Willem A.; Bernstein, Amit; Norton, Peter J.; Zvolensky, Michael J.
2007-01-01
The present investigation examined the factor structure, internal consistency, and construct validity of the 16-item Anxiety Sensitivity Index (ASI; Reiss, Peterson, Gursky, & McNally, 1986) in a young adult sample (n = 420) from the Netherlands. Confirmatory factor analysis was used to comparatively evaluate two-factor, three-factor, and…
Extension of the Rejection Sensitivity Construct to the Interpersonal Functioning of Gay Men
ERIC Educational Resources Information Center
Pachankis, John E.; Goldfried, Marvin R.; Ramrattan, Melissa E.
2008-01-01
On the basis of recent evidence suggesting that gay men are particularly likely to fear interpersonal rejection, the authors set out to extend the "rejection sensitivity" construct to the mental health concerns of gay men. After establishing a reliable and valid measure of the gay-related rejection sensitivity construct, the authors use this to…
Argentzell, Elisabeth; Hultqvist, Jenny; Neil, Sandra; Eklund, Mona
2017-10-01
Personal recovery, defined as an individual process towards meaning, is an important target within mental health services. Measuring recovery hence requires reliable and valid measures. The Process of Recovery Questionnaire (QPR) was developed for that purpose. The aim was to develop a Swedish version of the QPR (QPR-Swe) and explore its psychometric properties in terms of factor structure, internal consistency, construct validity and sensitivity to change. A total of 226 participants entered the study. The factor structure was investigated by Principal Component Analysis and Scree plot. Construct validity was addressed in terms of convergent validity against indicators of self-mastery, self-esteem, quality of life and self-rated health. A one-factor solution of QPR-Swe received better support than a two-factor solution. Good internal consistency was indicated, α = 0.92, and construct validity was satisfactory. The QPR-Swe showed preliminary sensitivity to change. The QPR-Swe showed promising initial psychometric properties in terms of internal consistency, convergent validity and sensitivity to change. The QPR-Swe is recommended for use in research and clinical contexts to assess personal recovery among people with mental illness.
Predictive value and construct validity of the work functioning screener-healthcare (WFS-H).
Boezeman, Edwin J; Nieuwenhuijsen, Karen; Sluiter, Judith K
2016-05-25
To test the predictive value and convergent construct validity of a 6-item work functioning screener (WFS-H). Healthcare workers (249 nurses) completed a questionnaire containing the work functioning screener (WFS-H) and a work functioning instrument (NWFQ) measuring the following: cognitive aspects of task execution and general incidents, avoidance behavior, conflicts and irritation with colleagues, impaired contact with patients and their family, and level of energy and motivation. Productivity and mental health were also measured. Negative and positive predictive values, AUC values, and sensitivity and specificity were calculated to examine the predictive value of the screener. Correlation analysis was used to examine the construct validity. The screener had good predictive value, since the results showed that a negative screener score is a strong indicator of work functioning not hindered by mental health problems (negative predictive values: 94%-98%; positive predictive values: 21%-36%; AUC: .64-.82; sensitivity: 42%-76%; specificity: 85%-87%). The screener has good construct validity due to moderate, but significant (p < .001), associations with productivity (r = .51), mental health (r = .48), and distress (r = .47). The screener (WFS-H) had good predictive value and good construct validity. Its score offers occupational health professionals a helpful preliminary insight into the work functioning of healthcare workers.
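The combination reported here, a very high negative predictive value alongside a low positive predictive value, is what Bayes' rule produces when the condition being screened for is uncommon. The sketch below illustrates this in Python. The sensitivity and specificity passed in are illustrative values inside the reported ranges, and the 10% prevalence is an assumption; the abstract does not state the prevalence.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) implied by test accuracy and condition prevalence."""
    tp = sensitivity * prevalence                # proportion true positive
    fn = (1 - sensitivity) * prevalence          # proportion false negative
    tn = specificity * (1 - prevalence)          # proportion true negative
    fp = (1 - specificity) * (1 - prevalence)    # proportion false positive
    return tp / (tp + fp), tn / (tn + fn)


# Illustrative inputs; the prevalence of 0.10 is an assumption, not a reported value.
ppv, npv = predictive_values(sensitivity=0.60, specificity=0.86, prevalence=0.10)
print(f"PPV = {ppv:.1%}, NPV = {npv:.1%}")
```

Under these assumptions most positive screens are false positives even though specificity is fairly high, which fits the abstract's framing of a negative score as the informative result.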
Low back related leg pain: an investigation of construct validity of a new classification system.
Schäfer, Axel G M; Hall, Toby M; Rolke, Roman; Treede, Rolf-Detlef; Lüdtke, Kerstin; Mallwitz, Joachim; Briffa, Kathryn N
2014-01-01
Leg pain is associated with back pain in 25-65% of all cases and classified as somatic referred pain or radicular pain. However, distinction between the two may be difficult as different pathomechanisms may cause similar patterns of pain. Therefore a pathomechanism based classification system was proposed, with four distinct hierarchical and mutually exclusive categories: Neuropathic Sensitization (NS) comprising major features of neuropathic pain with sensory sensitization; Denervation (D) arising from significant axonal compromise; Peripheral Nerve Sensitization (PNS) with marked nerve trunk mechanosensitivity; and Musculoskeletal (M) with pain referred from musculoskeletal structures. To investigate construct validity of the classification system. Construct validity was investigated by determining the relationship of nerve functioning with subgroups of patients and asymptomatic controls. Thus somatosensory profiles of subgroups of patients with low back related leg pain (LBRLP) and healthy controls were determined by a comprehensive quantitative sensory test (QST) protocol. It was hypothesized that subgroups of patients and healthy controls would show differences in QST profiles relating to underlying pathomechanisms. 77 subjects with LBRLP were recruited and classified in one of the four groups. Additionally, 18 age and gender matched asymptomatic controls were measured. QST revealed signs of pain hypersensitivity in group NS and sensory deficits in group D whereas Groups PNS and M showed no significant differences when compared to the asymptomatic group. These findings support construct validity for two of the categories of the new classification system, however further research is warranted to achieve construct validation of the classification system as a whole.
ERIC Educational Resources Information Center
Pike, Gary R.
1989-01-01
A study investigated the appropriateness of the American College Testing Program's College Outcome Measures Program, conducted at the University of Tennessee, Knoxville, by applying the criterion of construct validity. Results indicated that while the test primarily measures individual differences, it is also sensitive to the effects of higher…
Sensitivity of Teacher Value-Added Estimates to Student and Peer Control Variables
ERIC Educational Resources Information Center
Johnson, Matthew T.; Lipscomb, Stephen; Gill, Brian
2015-01-01
Teacher value-added models (VAMs) must isolate teachers' contributions to student achievement to be valid. Well-known VAMs use different specifications, however, leaving policymakers with little clear guidance for constructing a valid model. We examine the sensitivity of teacher value-added estimates under different models based on whether they…
2011-01-01
Background: Early detection of common mental disorders, such as depression and anxiety, among children and adolescents requires the use of validated, culturally sensitive, and developmentally appropriate screening instruments. The Arab region has a high proportion of youth, yet Arabic-language screening instruments for mental disorders among this age group are virtually absent. Methods: We carried out construct and clinical validation of the recently developed Arab Youth Mental Health (AYMH) scale as a screening tool for depression/anxiety. The scale was administered to 10-14-year-old children attending a social service center in Beirut, Lebanon (N = 153). The clinical assessment was conducted by a child and adolescent clinical psychiatrist employing the DSM-IV criteria. We tested the scale's sensitivity, specificity, and internal consistency. Results: Scale scores were generally significantly associated with how participants responded to standard questions on health, mental health, and happiness, indicating good construct validity. The scale exhibited good internal consistency (Cronbach's alpha = 0.86) and specificity (79%). However, it exhibited moderate sensitivity for girls (71%) and poor sensitivity for boys (50%). Conclusions: The AYMH scale is useful as a screening tool for general mental health states and a valid screening instrument for common mental disorders among girls. It is not a valid instrument for detecting depression and anxiety among boys in an Arab culture. PMID:21435213
ERIC Educational Resources Information Center
Karapolat, Hale; Eyigor, Sibel; Kirazli, Yesim; Celebisoy, Nese; Bilgen, Cem; Kirazli, Tayfun
2010-01-01
The aim of this study is to evaluate the internal consistency, test-retest reliability, construct validity, and sensitivity to change of the Activities-specific Balance Confidence Scale (ABC) in people with peripheral vestibular disorder. Thirty-three patients with unilateral peripheral vestibular disease were included in the study. Patients were…
The Coopersmith Self-Esteem Inventory: A Construct Validation Study.
ERIC Educational Resources Information Center
Johnson, Brian W.
1983-01-01
Regression analyses indicated that the Coopersmith Self-Esteem Inventory has convergent validity with regard to the Piers-Harris Children's Self-Concept Scale and the Coopersmith Behavioral Academic Assessment Scale, has discriminant validity with regard to the Children's Social Desirability Scale, is sensitive to differences in achievement level,…
Hauer, Klaus A; Kempen, Gertrudis I J M; Schwenk, Michael; Yardley, Lucy; Beyer, Nina; Todd, Chris; Oster, Peter; Zijlstra, G A Rixt
2011-01-01
Measures of fear of falling have not yet been validated in patients with dementia, leaving a methodological gap that limits research in a population at high risk of falling and fall-related consequences. The objectives of this study are to determine: (1) the validity of the 7-item Short Falls Efficacy Scale International (Short FES-I) in geriatric patients with and without cognitive impairment, and (2) the sensitivity to change of the 10-item Falls Efficacy Scale (FES), the 16-item FES-I and the 7-item Short FES-I in geriatric patients with dementia. Cross-sectional data of community-dwelling older adults and geriatric rehabilitation patients (n = 284) collected during face-to-face interviews were used to determine construct and discriminant validity by testing for differences within variables related to fear of falling. Sensitivity to change was studied in an intervention study including patients with mild to moderate dementia (n = 130) and was quantified by standardized response means (SRMs). The Short FES-I showed excellent construct and discriminant validity in the total group and subsamples according to cognitive status. Sensitivity to change was adequate to good in the FES (range SRM: 0.18-0.77) and FES-I (range SRM: 0.21-0.74), with the Short FES-I showing the highest peak sensitivity to change (range SRM: 0.18-0.91). The Short FES-I is a valid measure to assess fear of falling in frail older adults with and without cognitive impairment, yet it may show floor effects in higher functioning older people. All scales, including the Short FES-I, were sensitive to detecting intervention-induced changes in concerns about falling in geriatric patients with dementia. Copyright © 2010 S. Karger AG, Basel.
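Sensitivity to change is summarized in this abstract by the SRM, conventionally defined as the mean change score divided by the standard deviation of the change scores. The Python sketch below shows that calculation. The function name and the before/after scores are hypothetical and are not drawn from the study.

```python
from statistics import mean, stdev


def standardized_response_mean(before, after):
    """SRM: mean of paired change scores divided by their sample standard deviation."""
    changes = [a - b for b, a in zip(before, after)]
    return mean(changes) / stdev(changes)


# Hypothetical fear-of-falling scores for four patients (lower = less concern).
before = [10, 12, 14, 16]
after = [9, 10, 13, 13]
print(round(standardized_response_mean(before, after), 2))
```

The sign of the SRM depends only on the direction of scoring, so published ranges such as those above are usually reported as absolute values.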
Assessing the validity of commercial and municipal food environment data sets in Vancouver, Canada.
Daepp, Madeleine Ig; Black, Jennifer
2017-10-01
The present study assessed systematic bias and the effects of data set error on the validity of food environment measures in two municipal and two commercial secondary data sets. Sensitivity, positive predictive value (PPV) and concordance were calculated by comparing two municipal and two commercial secondary data sets with ground-truthed data collected within 800 m buffers surrounding twenty-six schools. Logistic regression examined associations of sensitivity and PPV with commercial density and neighbourhood socio-economic deprivation. Kendall's τ estimated correlations between density and proximity of food outlets near schools constructed with secondary data sets v. ground-truthed data. The study was set in Vancouver, Canada, and covered food retailers located within 800 m of twenty-six schools. All data sets scored relatively poorly across validity measures, although, overall, municipal data sets had higher levels of validity than did commercial data sets. Food outlets were more likely to be missing from municipal health inspections lists and commercial data sets in neighbourhoods with higher commercial density. Still, both proximity and density measures constructed from all secondary data sets were highly correlated (Kendall's τ > 0·70) with measures constructed from ground-truthed data. Despite relatively low levels of validity in all secondary data sets examined, food environment measures constructed from secondary data sets remained highly correlated with ground-truthed data. Findings suggest that secondary data sets can be used to measure the food environment, although estimates should be treated with caution in areas with high commercial density.
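Kendall's τ, used here to compare outlet counts from secondary data sets with ground-truthed counts, is a rank correlation built from concordant and discordant pairs. The sketch below implements the simplest variant, τ-a, which makes no correction for ties. Which variant the study used is not stated, so this choice is an assumption, and the outlet counts are hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both measures order the pair the same way
        elif direction < 0:
            discordant += 1      # the measures disagree on the ordering
        # tied pairs count toward the denominator only
    n = len(x)
    return (concordant - discordant) / (n * (n - 1) / 2)


# Hypothetical outlet counts near four schools: secondary data vs ground truth.
secondary = [1, 3, 2, 4]
ground_truth = [1, 2, 3, 4]
print(round(kendall_tau_a(secondary, ground_truth), 3))
```

Because τ depends only on rank order, a data set can miss many outlets everywhere and still correlate highly with ground truth, as long as the omissions preserve which schools have more outlets nearby.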
Translation and validation of the German version of the Bournemouth Questionnaire for Neck Pain.
Soklic, Marina; Peterson, Cynthia; Humphreys, B Kim
2012-01-25
Clinical outcome measures are important tools to monitor patient improvement during treatment as well as to document changes for research purposes. The short-form Bournemouth questionnaire for neck pain patients (BQN) was developed from the biopsychosocial model and measures pain, disability, cognitive and affective domains. It has been shown to be a valid and reliable outcome measure in English, French and Dutch and more sensitive to change compared to other questionnaires. The purpose of this study was to translate and validate a German version of the Bournemouth questionnaire for neck pain patients. German translation and back translation into English of the BQN was done independently by four persons and overseen by an expert committee. Face validity of the German BQN was tested on 30 neck pain patients in a single chiropractic practice. Test-retest reliability was evaluated on 31 medical students and chiropractors before and after a lecture. The German BQN was then assessed on 102 first-time neck pain patients at two chiropractic practices for internal consistency, external construct validity, external longitudinal construct validity and sensitivity to change compared to the German versions of the Neck Disability Index (NDI) and the Neck Pain and Disability Scale (NPAD). Face validity testing led to minor changes to the German BQN. The intraclass correlation coefficient for the test-retest reliability was 0.99. The internal consistency was strong for all 7 items of the BQN, with Cronbach's α values of .79 and .80 for the pre- and post-treatment total scores. External construct validity and external longitudinal construct validity using Pearson's correlation coefficient showed statistically significant correlations for all 7 scales of the BQN with the other questionnaires. The German BQN showed greater responsiveness compared to the other questionnaires for all scales. The German BQN is a valid and reliable outcome measure that has been successfully translated and culturally adapted. It is shorter, easier to use, and more responsive to change than the NDI and NPAD.
Factor structure and construct validity of the Anxiety Sensitivity Index among island Puerto Ricans.
Cintrón, Jennifer A; Carter, Michele M; Suchday, Sonia; Sbrocco, Tracy; Gray, James
2005-01-01
The factor structure and convergent and discriminant validity of the Anxiety Sensitivity Index (ASI) were examined among a sample of 275 island Puerto Ricans. Results from a confirmatory factor analysis (CFA) comparing our data to factor solutions commonly reported as representative of European American and Spanish populations indicated a poor fit. A subsequent exploratory factor analysis (EFA) indicated that a two-factor solution (Factor 1, Anxiety Sensitivity; Factor 2, Emotional Concerns) provided the best fit. Correlations between the ASI and anxiety measures were moderately high providing evidence of convergent validity, while correlations between the ASI and BDI were significantly lower providing evidence of discriminant validity. Scores on all measures were positively correlated with acculturation, suggesting that those who ascribe to more traditional Hispanic culture report elevated anxiety.
Measuring the Sensitivity and Construct Validity of 6 Utility Instruments in 7 Disease Areas.
Richardson, Jeff; Iezzi, Angelo; Khan, Munir A; Chen, Gang; Maxwell, Aimee
2016-02-01
Health services that affect quality of life (QoL) are increasingly evaluated using cost utility analyses (CUA). These commonly employ one of a small number of multiattribute utility instruments (MAUI) to assess the effects of the health service on utility. However, the MAUI differ significantly, and the choice of instrument may alter the outcome of an evaluation. The present article has 2 objectives: 1) to compare the results of 3 measures of the sensitivity of 6 MAUI and the results of 6 tests of construct validity in 7 disease areas and 2) to rank the MAUI by each of the test results in each disease area and by an overall composite index constructed from the tests. Patients and the general public were administered a battery of instruments, which included the 6 MAUI, disease-specific QoL instruments (DSI), and 6 other comparator instruments. In each disease area, instrument sensitivity was measured 3 ways: by the unadjusted mean difference in utility between public and patient groups, by the value of the effect size, and by the correlation between MAUI and DSI scores. Content and convergent validity were tested by comparison of MAUI utilities and scores from the 6 comparator instruments. These included 2 measures of health state preferences, measures of subjective well-being and capabilities, and generic measures of physical and mental QoL derived from the SF-36. The apparent sensitivity of instruments varied significantly with the measurement method and by disease area. Validation test results varied with the comparator instruments. Notwithstanding this variability, the 15D, AQoL-8D, and the SF-6D generally achieved better test results than the QWB and EQ-5D-5L. © The Author(s) 2015.
Assessing the stability of human locomotion: a review of current measures
Bruijn, S. M.; Meijer, O. G.; Beek, P. J.; van Dieën, J. H.
2013-01-01
Falling poses a major threat to the steadily growing population of the elderly in modern-day society. A major challenge in the prevention of falls is the identification of individuals who are at risk of falling owing to an unstable gait. At present, several methods are available for estimating gait stability, each with its own advantages and disadvantages. In this paper, we review the currently available measures: the maximum Lyapunov exponent (λS and λL), the maximum Floquet multiplier, variability measures, long-range correlations, extrapolated centre of mass, stabilizing and destabilizing forces, foot placement estimator, gait sensitivity norm and maximum allowable perturbation. We explain what these measures represent and how they are calculated, and we assess their validity, divided up into construct validity, predictive validity in simple models, convergent validity in experimental studies, and predictive validity in observational studies. We conclude that (i) the validity of variability measures and λS is best supported across all levels, (ii) the maximum Floquet multiplier and λL have good construct validity, but negative predictive validity in models, negative convergent validity and (for λL) negative predictive validity in observational studies, (iii) long-range correlations lack construct validity and predictive validity in models and have negative convergent validity, and (iv) measures derived from perturbation experiments have good construct validity, but data are lacking on convergent validity in experimental studies and predictive validity in observational studies. In closing, directions for future research on dynamic gait stability are discussed. PMID:23516062
van Hooren, Susan; van der Veld, William M.; Hutschemaekers, Giel
2017-01-01
Despite the use of art therapy in clinical practice, its appreciation and reported beneficial results, no instruments are available to measure specific effects of art therapy among patients with personality disorders cluster B/C in multidisciplinary treatment. In the present study, we described the development and psychometric evaluation of the Self-expression and Emotion Regulation in Art Therapy Scale (SERATS). Structural validity (exploratory and confirmatory factor analysis), reliability, construct validity and sensitivity to change were examined using two independent databases (n = 335; n = 34) of patients diagnosed with personality disorders cluster B/C. This resulted in a nine-item effect scale with a single factor with a high internal reliability and high test–retest reliability; it demonstrated discriminant validity and sensitivity to change. In conclusion, the SERATS is brief and content-valid and offers objective and reliable information on self-expression and emotion regulation in art therapy among patients with personality disorders cluster B/C. Although more research on construct validity is needed, the SERATS is a promising tool to be applied as an effect scale and as a monitoring tool during art therapy treatment. © 2017 The Authors. Personality and Mental Health published by John Wiley & Sons Ltd. PMID:28730717
Development and validation of a fatigue assessment scale for U.S. construction workers.
Zhang, Mingzong; Sparer, Emily H; Murphy, Lauren A; Dennerlein, Jack T; Fang, Dongping; Katz, Jeffrey N; Caban-Martinez, Alberto J
2015-02-01
To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Using a two-phased approach, we first identified items (first phase) for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n = 11) and focus groups (three groups with six workers each) with construction workers. The second phase included assessment of the reliability, validity, and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n = 144). Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales ("Lethargy" and "Bodily Ailment"). During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW [0.91], Lethargy [0.86] and Bodily Ailment [0.84]) and acceptable test-retest reliability (Pearson correlation coefficients: 0.59-0.68; intraclass correlation coefficients: 0.74-0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups with arthritis status and different work hours. The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. © 2015 Wiley Periodicals, Inc.
Psychometric properties of the Late-Life Function and Disability Instrument: a systematic review
2014-01-01
Background: The choice of measure for use as a primary outcome in geriatric research is contingent upon the construct of interest and evidence for its psychometric properties. The Late-Life Function and Disability Instrument (LLFDI) has been widely used to assess functional limitations and disability in studies with older adults. The primary aim of this systematic review was to evaluate the current available evidence for the psychometric properties of the LLFDI. Methods: Published studies of any design reporting results based on administration of the original version of the LLFDI in community-dwelling older adults were identified after searches of 9 electronic databases. Data related to construct validity (convergent/divergent and known-groups validity), test-retest reliability and sensitivity to change were extracted. Effect sizes were calculated for within-group changes and summarized graphically. Results: Seventy-one studies including 17,301 older adults met inclusion criteria. Data supporting the convergent/divergent and known-groups validity for both the Function and Disability components were extracted from 30 and 18 studies, respectively. High test-retest reliability was found for the Function component, while results for the Disability component were more variable. Sensitivity to change of the LLFDI was confirmed based on findings from 25 studies. The basic lower extremity subscale and overall summary score of the Function component and limitation dimension of the Disability component were associated with the strongest relative effect sizes. Conclusions: There is extensive evidence to support the construct validity and sensitivity to change of the LLFDI among various clinical populations of community-dwelling older adults. Further work is needed on predictive validity and values for clinically important change. Findings from this review can be used to guide the selection of the most appropriate LLFDI subscale for use as an outcome measure in geriatric research and practice. PMID:24476510
Wood, Louise; Smith, Michael; Miller, Christopher B; O'Carroll, Ronan E
2018-06-19
Vaccinations are important preventative health behaviors. The recently developed Vaccination Attitudes Examination (VAX) Scale aims to measure the reasons behind refusal/hesitancy regarding vaccinations. The aim of this replication study is to conduct an independent test of the newly developed VAX Scale in the UK. We tested (a) internal consistency (Cronbach's α); (b) convergent validity by assessing its relationships with beliefs about medication, medical mistrust, and perceived sensitivity to medicines; and (c) construct validity by testing how well the VAX Scale discriminated between vaccinators and nonvaccinators. A sample of 243 UK adults completed the VAX Scale, the Beliefs About Medicines Questionnaire, the Perceived Sensitivity to Medicines Scale, and the Medical Mistrust Index, in addition to demographics of age, gender, education levels, and social deprivation. Participants were asked (a) whether they received an influenza vaccination in the past year and (b) if they had a young child, whether they had vaccinated the young child against influenza in the past year. The VAX (a) demonstrated high internal consistency (α = .92); (b) was positively correlated with medical mistrust and beliefs about medicines, and less strongly correlated with perceived sensitivity to medicines; and (c) successfully differentiated parental influenza vaccinators from nonvaccinators. The VAX demonstrated good internal consistency, convergent validity, and construct validity in an independent UK sample. It appears to be a useful measure to help us understand the health beliefs that promote or deter vaccination behavior.
Kolodziejczyk, Julia K; Norman, Gregory J; Rock, Cheryl L; Arredondo, Elva M; Roesch, Scott C; Madanat, Hala; Patrick, Kevin
2016-01-01
This study evaluates the reliability and validity of the strategies for weight management (SWM) measure, a questionnaire that assesses weight management strategies for adults. The SWM includes 20 items that are categorized within the following subscales: (1) energy intake, (2) energy expenditure, (3) self-monitoring, and (4) self-regulation. Baseline and 6-month data were collected from 404 overweight/obese adults (mean age = 22 ± 3.8 years, 68% ethnic minority) enrolled in a randomized controlled trial aiming to reduce weight by improving diet and physical activity behaviours. Reliability and validity were assessed for each subscale separately. Cronbach's alpha was calculated to assess reliability. Concurrent, construct I (sensitivity to the study treatment condition), and construct II (relationship to the outcomes) validity were assessed using linear regressions with the following outcome measures: weight, self-reported diet, and weekly energy expenditure. All subscales showed strong internal consistency. The strength of the validity evidence depended on subscale and validity type. The strongest validity evidence was concurrent validity of the energy intake and energy expenditure subscales; construct I validity of the energy intake and self-monitoring subscales; and construct II validity of the energy intake, energy expenditure, and self-regulation subscales. Results indicate that the SWM can be used to assess weight management strategies among an ethnically diverse sample of adults, as each subscale showed evidence of reliability and select types of validity. As validity is an accumulation of evidence over multiple studies, this study provides initial reliability and validity evidence in one population segment. Copyright © 2015 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
Chin, Weng Yee; Choi, Edmond P H; Chan, Kit T Y; Wong, Carlos K H
2015-01-01
The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald's omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for "somatic", "depressed affect", "positive affect" and "interpersonal problems" were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value <0.01 and all effect size statistics >0.2). The CES-D was externally responsive, with the AUC>0.7. 
The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four-factor and bi-factor structure, the CES-D is supported for cross-cultural comparisons of depression in multi-center studies.
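The known-groups AUC criterion mentioned in this abstract (AUC > 0.7) can be read as the probability that a randomly chosen member of the higher-severity group scores above a randomly chosen member of the comparison group. The sketch below computes that quantity by direct pair counting; the scores are invented and the function name `auc_known_groups` is hypothetical.

```python
def auc_known_groups(case_scores, control_scores):
    """AUC as the share of (case, control) pairs where the case scores higher.

    Ties count as half a win, which matches the Mann-Whitney formulation.
    """
    wins = 0.0
    for case in case_scores:
        for control in control_scores:
            if case > control:
                wins += 1.0
            elif case == control:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical scale scores for two known groups.
cases = [20, 25, 30]
controls = [10, 15, 25]
print(round(auc_known_groups(cases, controls), 3))
```

An AUC of 0.5 would mean the scale cannot separate the groups at all; 1.0 would mean every case outscores every control.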
ERIC Educational Resources Information Center
Gabbard, Clinton E.; And Others
1986-01-01
Adaptive Counseling and Therapy (ACT) is an integrative, metatheoretical model for selecting an appropriate therapeutic style based on the task-relevant development maturity of the client. The Counselor Behavior Analysis (CBA) Scale measures the central explanatory construct of ACT theory: counselor adaptability. Three studies designed to assess…
Luna-Lario, P; Pena, J; Ojeda, N
2017-04-16
To perform an in-depth examination of the construct validity and the ecological validity of the Wechsler Memory Scale-III (WMS-III) and the Spain-Complutense Verbal Learning Test (TAVEC). The sample consists of 106 adults with acquired brain injury who were treated in the Area of Neuropsychology and Neuropsychiatry of the Complejo Hospitalario de Navarra and displayed memory deficit as the main sequela, measured by means of specific memory tests. The construct validity is determined by examining the tasks required in each test over the basic theoretical models, comparing the performance according to the parameters offered by the tests, contrasting the severity indices of each test and analysing their convergence. The external validity is explored through the correlation between the tests and by using regression models. According to the results obtained, both the WMS-III and the TAVEC have construct validity. The TAVEC is more sensitive and captures not only the deficits in mnemonic consolidation, but also in the executive functions involved in memory. The working memory index of the WMS-III is useful for predicting the return to work at two years after the acquired brain injury, but none of the instruments anticipates the disability and dependence at least six months after the injury. We reflect upon the construct validity of the tests and their insufficient capacity to predict functionality when the sequelae become chronic.
Guirao-Goris, Silamani J; Ferrer Ferrandis, Esperanza; Montejano Lozoya, Raimunda
2016-02-18
The aim of the study is to determine the construct and criterion validity of the nursing diagnosis label Sedentary Lifestyle. A cross-sectional study was conducted in a nursing consultation in primary health care. Participants were all people over 50 who were attended over the course of one year and who voluntarily agreed to participate in the study (n=85). Objective weekly physical activity was measured in METs with an accelerometer, and objective performance was measured by gait speed from the EPESE battery (both measures were used as the gold standard); physical activity was also assessed with a questionnaire (RAPA) and the COOP-WONCA physical fitness chart. Spearman correlation coefficients, mean comparison tests and analysis of sensitivity and specificity were used for the statistical analysis. The diagnosis "Sedentary Lifestyle" showed a positive correlation between its manifestations and physical activity measured in METs (r=0.39) and EPESE gait speed (r=0.35). The diagnosis showed a sensitivity of 85.1% and a specificity of 65.2%, and was able to discriminate active people from those who are not when METs were used as the measure of physical activity (t=-4.4). The diagnosis "Sedentary Lifestyle" shows criterion and construct validity.
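Sensitivity and specificity figures like those reported here come from a 2 x 2 table that cross-classifies the test result against the gold standard. A minimal sketch follows; the counts are invented for illustration (the abstract does not report the underlying table), and `diagnostic_metrics` is a hypothetical helper name.

```python
def diagnostic_metrics(tp, fn, fp, tn):
    """Sensitivity, specificity, PPV and NPV from 2 x 2 table counts.

    tp/fn: gold-standard positives the test flagged / missed.
    fp/tn: gold-standard negatives the test flagged / cleared.
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Invented counts, not taken from the study.
metrics = diagnostic_metrics(tp=40, fn=10, fp=15, tn=35)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```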
Cheung, Jason Pui Yin; Cheung, Prudence Wing Hang; Wong, Carlos King Ho; Samartzis, Dino; Luk, Keith Dip-Kei; Lam, Cindy Lo Kuen; Cheung, Kenneth Man Chee
2016-12-15
Questionnaire translation and validation. The aim of this study was to translate and cross-culturally adapt the Early Onset Scoliosis-24 item Questionnaire (EOSQ-24) into traditional Chinese, and to assess its validity, reliability, and sensitivity in Southern-Chinese patients diagnosed with early onset scoliosis (EOS). Relying on radiographs alone for assessing treatment outcomes in EOS patients is inadequate. To properly gauge health-related quality of life, a disease-specific instrument that assesses patient quality of life and the burden of primary caregivers is necessary. The EOSQ-24 was created for this purpose, but it has not been adapted to the Chinese language. The translation and cross-cultural adaptation of the original English EOSQ-24 were performed using the method of double forward and single backward translations, followed by a panel review. EOS patients of Southern-Chinese descent were recruited, via convenience sampling from a scoliosis specialty clinic. These patients' parents/caretakers were then administered the traditional Chinese EOSQ-24, Likert Scale regarding the understanding of completed EOSQ-24, and the Child Health Questionnaire Parent Form 50 (CHQ-PF50) (Traditional Chinese). Reliability was analyzed using Cronbach alpha. Construct validity of domains and subdomains was assessed using Spearman correlation test against CHQ-PF50 domains with similar constructs. Sensitivity of the EOSQ-24 scores was determined by performing known group comparisons. A total of 100 EOS patients were recruited. A very good reliability was demonstrated (Cronbach α: 0.896) and internal consistency of all domains was excellent (Cronbach α: 0.829-0.919). Subdomain scores of EOSQ-24 and CHQ-PF50 had significant correlations (P < 0.001), indicating a good construct validity. This is the first psychometric study to translate and adapt the EOSQ-24 questionnaire for Chinese EOS patients and it has been found to have satisfactory validity, reliability, and sensitivity. 
It is a useful disease-specific instrument for assessing patients' quality of life and the burden of caregivers.
Development and Validation of a Fatigue Assessment Scale for U.S. Construction Workers
Zhang, Mingzong; Sparer, Emily H.; Murphy, Lauren A.; Dennerlein, Jack T.; Fang, Dongping; Katz, Jeffrey N.; Caban-Martinez, Alberto J.
2015-01-01
Objective To develop a fatigue assessment scale and test its reliability and validity for commercial construction workers. Methods Using a two-phased approach, we first identified items for the development of a Fatigue Assessment Scale for Construction Workers (FASCW) through review of existing scales in the scientific literature, key informant interviews (n=11) and focus groups (3 groups with 6 workers each) with construction workers. The second phase included assessment of the reliability, validity and sensitivity of the new scale using a repeated-measures study design with a convenience sample of construction workers (n=144). Results Phase one resulted in a 16-item preliminary scale that after factor analysis yielded a final 10-item scale with two sub-scales (“Lethargy” and “Bodily Ailment”). During phase two, the FASCW and its subscales demonstrated satisfactory internal consistency (alpha coefficients were FASCW (0.91), Lethargy (0.86) and Bodily Ailment (0.84)) and acceptable test-retest reliability (Pearson correlation coefficients: 0.59–0.68; intraclass correlation coefficients: 0.74–0.80). Correlation analysis substantiated concurrent and convergent validity. A discriminant analysis demonstrated that the FASCW differentiated between groups defined by arthritis status and by work hours. Conclusions The 10-item FASCW with good reliability and validity is an effective tool for assessing the severity of fatigue among construction workers. PMID:25603944
ERIC Educational Resources Information Center
Kaya, Osman Nafiz; Kilic, Ziya
2004-01-01
Student-centered approach of scoring the concept maps consisted of three elements namely symbol system, individual portfolio and scoring scheme. We scored student-constructed concept maps based on 5 concept map criteria: validity of concepts, adequacy of propositions, significance of cross-links, relevancy of examples, and interconnectedness. With…
[Validation of the UCLA loneliness scale in an elderly population that live alone].
Velarde-Mayol, C; Fragua-Gil, S; García-de-Cecilia, J M
2016-04-01
This article examines the growing social phenomenon of elderly people living alone from 2 points of view: the objective loneliness of living alone and the subjective loneliness of feeling lonely. To validate the UCLA loneliness scale as a tool for the overall measurement of loneliness and to determine the social profile of elderly people living alone. Observational study carried out over 2 years (2012-2013) to identify elderly people living alone; case-control study to validate the UCLA loneliness scale. The sample was taken from 3 surgeries belonging to 2 Primary Care health centres from urban and rural areas. Construct validity, discriminant validity and sensitivity were analysed. Of the elderly population studied, 22.3% live alone, 61.7% of them due to loss of spouse, with a mean age of 70.7 years, and 82.7% are women; 17.3% have no family ties and 63.2% feel lonely. The UCLA loneliness scale has construct validity, with a high correlation between items. The discriminant validity was confirmed in relation to the elderly who do not live alone, with a Cronbach alpha of 0.95, and the scale is sensitive to change. One in 4-5 elderly people lives alone, mainly due to the loss of a spouse. There are 3 times as many women as men who live alone. Two out of 3 experience the feeling of loneliness. The UCLA loneliness scale has proved to be a useful and sensitive tool to measure loneliness in the elderly population. Copyright © 2015 Sociedad Española de Médicos de Atención Primaria (SEMERGEN). Publicado por Elsevier España, S.L.U. All rights reserved.
The Alcohol Sensitivity Questionnaire: Evidence for Construct Validity
Fleming, Kimberly A.; Bartholow, Bruce D.; Hilgard, Joseph B.; McCarthy, Denis M.; O’Neill, Susan E.; Steinley, Douglas; Sher, Kenneth J.
2016-01-01
Background Variability in sensitivity to the acute effects of alcohol is an important risk factor for the development of alcohol use disorder (AUD). The most commonly used retrospective self-report measure of sensitivity, the Self-Rating of the Effects of Alcohol form (SRE), queries a limited number of alcohol effects and relies on respondents’ ability to recall experiences that might have occurred in the distant past. Here, we investigated the construct validity of an alternative measure that queries a larger number of alcohol effects, the Alcohol Sensitivity Questionnaire (ASQ), and compared it to the SRE in predicting momentary subjective responses to an acute dose of alcohol. Method Healthy young adults (N = 423) completed the SRE and the ASQ and then were randomly assigned to consume either alcohol or a placebo beverage (between-subjects manipulation). Stimulation and sedation (Biphasic Alcohol Effects Scale) and subjective intoxication were measured multiple times after drinking. Results Hierarchical linear models showed that the ASQ reliably predicted each of these outcomes following alcohol but not placebo consumption, provided unique prediction beyond that associated with differences in recent alcohol involvement, and was preferred over the SRE (in terms of model fit) in direct model comparisons of stimulation and sedation. Conclusions The ASQ compared favorably with the better-known SRE in predicting increased stimulation and reduced sedation following an acute alcohol challenge. The ASQ appears to be a valid self-report measure of alcohol sensitivity and therefore holds promise for identifying individuals at-risk for AUD and related problems. PMID:27012527
Görtelmeyer, Roman; Schmidt, Jürgen; Suckfüll, Markus; Jastreboff, Pawel; Gebauer, Alexander; Krüger, Hagen; Wittmann, Werner
2011-08-01
To evaluate the reliability, dimensionality, predictive validity, construct validity, and sensitivity to change of the THI-12 total and sub-scales as diagnostic aids to describe and quantify tinnitus-evoked reactions and evaluate treatment efficacy. Explorative analysis of the German tinnitus handicap inventory (THI-12) to assess potential sensitivity to tinnitus therapy in placebo-controlled randomized studies. Correlation analysis, including Cronbach's coefficient α and explorative common factor analysis (EFA), was conducted within and between assessments to demonstrate the construct validity, dimensionality, and factorial structure of the THI-12. N = 618 patients suffering from subjective tinnitus who were to be screened to participate in a randomized, placebo-controlled, 16-week, longitudinal study. The THI-12 can reliably diagnose tinnitus-related impairments and disabilities and assess changes over time. The test-retest coefficient for neighboured visits was r > 0.69, the internal consistency of the THI-12 total score was α ≤ 0.79 and α ≤ 0.89 at subsequent visits. Predictability of THI-12 total score and overall variance increased with successive measurements. The three-factorial structure allowed for evaluation of factors that affect aspects of patients' health-related quality of life. The THI-12, with its three-factorial structure, is a simple, reliable, and valid instrument for the diagnosis and assessment of tinnitus and associated impairment over time.
Greeven, Anja; Spinhoven, Philip; van Balkom, Anton J L M
2009-01-01
This study investigated the psychometric properties of the first clinician-administered semi-structured interview for assessing the severity of hypochondriacal symptoms. The Hypochondriasis Yale-Brown Obsessive-Compulsive Scale (H-YBOCS) consisted of three a priori dimensions: hypochondriacal obsessions, compulsions and avoidance. The 16-item interview was conducted with 112 participants with Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, hypochondriasis. We analysed factor analytic structure, reliability, construct validity and sensitivity to change. Factor analysis supported a three-factor model similar to the a priori dimensions. Internal consistency ranged from satisfactory to good. Inter-rater reliability was excellent. The construct validity was low to moderate. The H-YBOCS was sensitive for measuring changes in symptom severity. The H-YBOCS is a (factorially) valid and coherent interview with a high level of agreement across different raters. The relatively low discriminant validity could be due to co-morbid anxiety and depressive disorders. Overall, the H-YBOCS seems to be a promising contribution to the assessment of hypochondriasis. *The hypochondriasis Y-BOCS is a feasible clinician rated interview to assess the severity of hypochondriacal complaints.
A systematic review of a functional assessment Tool: UCSD Performance-based skill assessment (UPSA).
Becattini-Oliveira, Ana Claudia; Dutra, Douglas de Farias; Spenciere de Oliveira Campos, Bárbara; de Araujo, Verônica Carvalho; Charchat-Fichman, Helenice
2018-05-18
Performance-based assessment instruments have been employed to measure functional capacity in mental disorders. The aim of this systematic review was to identify the psychometric properties of the UCSD Performance-based Skill Assessment (UPSA). A search was conducted following the PRISMA protocol, using 'UPSA' as the keyword on electronic databases, with a date range for articles published from 2001 to 2017. Published studies involving community-dwelling adults were included. Pharmacological and/or clinical interventions involving clinical outcomes and/or institutionalized samples were excluded. Data related to construct validity, test-retest reliability and sensitivity/specificity were extracted, summarized and analyzed according to UPSA versions and psychiatric disorders. Fifty-eight studies including 8782 community-dwelling adults met the selection criteria. Data supporting the construct and known-groups validity were extracted from 41 studies involving schizophrenia and schizoaffective disorders and 17 studies involving other mental illnesses. The UPSA was culturally adapted to 8 different languages and employed in 17 countries. Few studies reported sensitivity and specificity, and the cut-off points could not be generalized. Moderate to strong evidence of construct validity and test-retest reliability was found. Few studies proposed cut-off points. The UPSA showed good psychometric properties in different versions, including those culturally adapted. Copyright © 2018 Elsevier B.V. All rights reserved.
Ockhuijsen, Henrietta D L; van Smeden, Maarten; van den Hoogen, Agnes; Boivin, Jacky
2017-06-01
To examine construct and criterion validity of the Dutch SCREENIVF among women and men undergoing a fertility treatment. A prospective longitudinal study nested in a randomized controlled trial. University hospital. Couples, 468 women and 383 men, undergoing an IVF/intracytoplasmic sperm injection (ICSI) treatment in a fertility clinic, completed the SCREENIVF. Construct and criterion validity of the SCREENIVF. The comparative fit index and root mean square error of approximation for women and men show a good fit of the factor model. Across time, the sensitivity for the Hospital Anxiety and Depression Scale subscale in women ranged from 61%-98%, specificity 53%-65%, predictive value of a positive test (PVP) 13%-56%, predictive value of a negative test (PVN) 70%-99%. The sensitivity scores for men ranged from 38%-100%, specificity 71%-75%, PVP 9%-27%, PVN 92%-100%. A prediction model revealed that for women 68.7% of the variance in the Hospital Anxiety and Depression Scale at time 1, 42.5% at time 2 and 38.9% at time 3 was explained by the predictors, the sum score scales of the SCREENIVF. For men, 58.1% of the variance in the Hospital Anxiety and Depression Scale at time 1, 46.5% at time 2 and 37.3% at time 3 was explained by the predictors, the sum score scales of the SCREENIVF. The SCREENIVF has good construct validity, but the concurrent validity is better than the predictive validity. SCREENIVF will be most effectively used in fertility clinics at the start of treatment and should not be used as a predictive tool. Copyright © 2017 American Society for Reproductive Medicine. All rights reserved.
2015-01-01
Background The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. Methods and Results The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald’s omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for “somatic”, “depressed affect”, “positive affect” and “interpersonal problems” were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value <0.01 and all effect size statistics >0.2). The CES-D was externally responsive, with the AUC>0.7. 
Conclusions The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four-factor and bi-factor structure, the CES-D is supported for cross-cultural comparisons of depression in multi-center studies. PMID:26252739
Cole, Adam G; Kennedy, Ryan David; Chaurasia, Ashok; Leatherdale, Scott T
2017-12-06
Within tobacco prevention programming, it is useful to identify youth that are at risk for experimenting with various tobacco products and e-cigarettes. The susceptibility to smoking construct is a simple method to identify never-smoking students that are less committed to remaining smoke-free. However, the predictive validity of this construct has not been tested within the Canadian context or for the use of other tobacco products and e-cigarettes. This study used a large, longitudinal sample of secondary school students that reported never using tobacco cigarettes and non-current use of alternative tobacco products or e-cigarettes at baseline in Ontario, Canada. The sensitivity, specificity, and positive and negative predictive values of the susceptibility construct for predicting tobacco cigarette, e-cigarette, cigarillo or little cigar, cigar, hookah, and smokeless tobacco use one and two years after baseline measurement were calculated. At baseline, 29.4% of the sample was susceptible to future tobacco product or e-cigarette use. The sensitivity of the construct ranged from 43.2% (smokeless tobacco) to 59.5% (tobacco cigarettes), the specificity ranged from 70.9% (smokeless tobacco) to 75.9% (tobacco cigarettes), and the positive predictive value ranged from 2.6% (smokeless tobacco) to 32.2% (tobacco cigarettes). Similar values were calculated for each measure of the susceptibility construct. A significant number of youth that did not currently use tobacco products or e-cigarettes at baseline reported using tobacco products and e-cigarettes over a two-year follow-up period. The predictive validity of the susceptibility construct was high and the construct can be used to predict other tobacco product and e-cigarette use among youth. This study presents the predictive validity of the susceptibility construct for the use of tobacco cigarettes among secondary school students in Ontario, Canada. 
It also presents a novel use of the susceptibility construct for predicting the use of e-cigarettes, cigarillos or little cigars, cigars, hookah, and smokeless tobacco among secondary school students in Ontario, Canada. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
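The wide spread in positive predictive value reported in this abstract (2.6% to 32.2%), despite fairly similar sensitivity and specificity, is what Bayes' rule predicts when the outcome is rare for some products. The sketch below uses the sensitivity and specificity values quoted in the abstract; the prevalence values are assumptions chosen only for illustration, and `positive_predictive_value` is a hypothetical helper name.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """PPV from sensitivity, specificity and outcome prevalence (Bayes' rule)."""
    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1.0 - specificity) * (1.0 - prevalence)
    return true_positive_rate / (true_positive_rate + false_positive_rate)


# Sensitivity/specificity as quoted in the abstract for tobacco cigarettes;
# the prevalence values are illustrative assumptions, not study results.
for assumed_prevalence in (0.02, 0.05, 0.16, 0.30):
    ppv = positive_predictive_value(0.595, 0.759, assumed_prevalence)
    print(f"prevalence={assumed_prevalence:.2f} -> PPV={ppv:.3f}")
```

With sensitivity and specificity held fixed, PPV falls steeply as the outcome becomes rarer, so a low PPV for an uncommon product does not by itself mean the construct performs differently for that product.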
Assessing behavioural changes in ALS: cross-validation of ALS-specific measures.
Pinto-Grau, Marta; Costello, Emmet; O'Connor, Sarah; Elamin, Marwa; Burke, Tom; Heverin, Mark; Pender, Niall; Hardiman, Orla
2017-07-01
The Beaumont Behavioural Inventory (BBI) is a behavioural proxy report for the assessment of behavioural changes in ALS. This tool has been validated against the FrSBe, a non-ALS-specific behavioural assessment, and further comparison of the BBI against a disease-specific tool was considered. This study cross-validates the BBI against the ALS-FTD-Q. Sixty ALS patients, 8% also meeting criteria for FTD, were recruited. All patients were evaluated using the BBI and the ALS-FTD-Q, completed by a carer. Correlational analysis was performed to assess construct validity. Precision, sensitivity, specificity, and overall accuracy of the BBI when compared to the ALS-FTD-Q, were obtained. The mean score of the whole sample on the BBI was 11.45 ± 13.06. ALS-FTD patients scored significantly higher than non-demented ALS patients (31.6 ± 14.64, 9.62 ± 11.38; p < 0.0001). A significant large positive correlation between the BBI and the ALS-FTD-Q was observed (r = 0.807, p < 0.0001), and no significant correlations between the BBI and other clinical/demographic characteristics indicate good convergent and discriminant validity, respectively. 72% of overall concordance was observed. Precision, sensitivity, and specificity for the classification of severely impaired patients were adequate. However, lower concordance in the classification of mild behavioural changes was observed, with higher sensitivity using the BBI, most likely secondary to BBI items which endorsed behavioural aspects not measured by the ALS-FTD-Q. Good construct validity has been further confirmed when the BBI is compared to an ALS-specific tool. Furthermore, the BBI is a more comprehensive behavioural assessment for ALS, as it measures the whole behavioural spectrum in this condition.
Sensitivity curves for searches for gravitational-wave backgrounds
NASA Astrophysics Data System (ADS)
Thrane, Eric; Romano, Joseph D.
2013-12-01
We propose a graphical representation of detector sensitivity curves for stochastic gravitational-wave backgrounds that takes into account the increase in sensitivity that comes from integrating over frequency in addition to integrating over time. This method is valid for backgrounds that have a power-law spectrum in the analysis band. We call these graphs “power-law integrated curves.” For simplicity, we consider cross-correlation searches for unpolarized and isotropic stochastic backgrounds using two or more detectors. We apply our method to construct power-law integrated sensitivity curves for second-generation ground-based detectors such as Advanced LIGO, space-based detectors such as LISA and the Big Bang Observer, and timing residuals from a pulsar timing array. The code used to produce these plots is available at https://dcc.ligo.org/LIGO-P1300115/public for researchers interested in constructing similar sensitivity curves.
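A rough sketch of how a power-law integrated curve can be assembled is given below. It is not the authors' released code (that is at the URL above). It assumes the commonly quoted construction: for each spectral index β, find the amplitude Ω_β that reaches signal-to-noise ratio ρ after integrating over time T and over the band, then take the envelope of all such power laws. The toy effective sensitivity curve, the β range, ρ = 1 and f_ref = 100 Hz are all assumptions made for illustration.

```python
import math


def power_law_integrated_curve(freqs, omega_eff, t_obs, rho=1.0,
                               f_ref=100.0, betas=range(-8, 9)):
    """Envelope of power laws detectable at SNR `rho` (illustrative sketch).

    freqs      increasing frequencies in Hz
    omega_eff  effective energy-density sensitivity at each frequency
    t_obs      observation time in seconds
    """
    power_laws = []
    for beta in betas:
        # Integrand (f/f_ref)^(2*beta) / Omega_eff(f)^2, trapezoid rule.
        integrand = [(f / f_ref) ** (2 * beta) / oe ** 2
                     for f, oe in zip(freqs, omega_eff)]
        integral = sum(0.5 * (integrand[i] + integrand[i + 1])
                       * (freqs[i + 1] - freqs[i])
                       for i in range(len(freqs) - 1))
        # Amplitude at f_ref that yields SNR = rho for this spectral index.
        omega_beta = rho / math.sqrt(2.0 * t_obs * integral)
        power_laws.append([omega_beta * (f / f_ref) ** beta for f in freqs])
    # The curve is the pointwise maximum over all spectral indices.
    return [max(curve[i] for curve in power_laws) for i in range(len(freqs))]


# Toy detector curve (hypothetical shape), one year of data.
freqs = [10.0 * (i + 1) for i in range(100)]
omega_eff = [1e-9 * ((f / 100.0) ** -4 + (f / 100.0) ** 2) for f in freqs]
pi_curve = power_law_integrated_curve(freqs, omega_eff, t_obs=3.15e7)
print(min(pi_curve) < min(omega_eff))
```

Any power-law background that lies above the resulting curve somewhere in the band would, under these assumptions, be detected with SNR of at least ρ.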
Moving Equipment and Workers to Mine Construction Site at a Logistically Challenged Area
NASA Astrophysics Data System (ADS)
Tikasz, Laszlo; Biroscak, Dennis; Pentiah, Scheale Duvah; McCulloch, Robert I.
Social sensitivity of habitants, minimal impact on the environment, low-grade infrastructure, high altitude, frequent rock slides combined with expectations for the timely moving of equipment and workers are some of the challenges emerging from the current construction of a mine. Starting with traditional planning, and experiencing issues in the early phase of the construction, a traffic simulator was requested by the Procurement Department in order to validate daily-weekly schedules and predict likely delays or blockages on the long-term.
Yee, Chee-Seng; Farewell, Vernon; Isenberg, David A; Rahman, Anisur; Teh, Lee-Suan; Griffiths, Bridget; Bruce, Ian N; Ahmad, Yasmeen; Prabu, Athiveeraramapandian; Akil, Mohammed; McHugh, Neil; D'Cruz, David; Khamashta, Munther A; Maddison, Peter; Gordon, Caroline
2007-01-01
Objective To determine the construct and criterion validity of the British Isles Lupus Assessment Group 2004 (BILAG-2004) index for assessing disease activity in systemic lupus erythematosus (SLE). Methods Patients with SLE were recruited into a multicenter cross-sectional study. Data on SLE disease activity (scores on the BILAG-2004 index, Classic BILAG index, and Systemic Lupus Erythematosus Disease Activity Index 2000 [SLEDAI-2K]), investigations, and therapy were collected. Overall BILAG-2004 and overall Classic BILAG scores were determined by the highest score achieved in any of the individual systems in the respective index. Erythrocyte sedimentation rates (ESRs), C3 levels, C4 levels, anti–double-stranded DNA (anti-dsDNA) levels, and SLEDAI-2K scores were used in the analysis of construct validity, and increase in therapy was used as the criterion for active disease in the analysis of criterion validity. Statistical analyses were performed using ordinal logistic regression for construct validity and logistic regression for criterion validity. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Results Of the 369 patients with SLE, 92.7% were women, 59.9% were white, 18.4% were Afro-Caribbean and 18.4% were South Asian. Their mean ± SD age was 41.6 ± 13.2 years and mean disease duration was 8.8 ± 7.7 years. More than 1 assessment was obtained on 88.6% of the patients, and a total of 1,510 assessments were obtained. Increasing overall scores on the BILAG-2004 index were associated with increasing ESRs, decreasing C3 levels, decreasing C4 levels, elevated anti-dsDNA levels, and increasing SLEDAI-2K scores (all P < 0.01). Increase in therapy was observed more frequently in patients with overall BILAG-2004 scores reflecting higher disease activity. 
Scores indicating active disease (overall BILAG-2004 scores of A and B) were significantly associated with increase in therapy (odds ratio [OR] 19.3, P < 0.01). The BILAG-2004 and Classic BILAG indices had comparable sensitivity, specificity, PPV, and NPV. Conclusion These findings show that the BILAG-2004 index has construct and criterion validity. PMID:18050213
Tsai, Alexander C.; Scott, Jennifer A.; Hung, Kristin J.; Zhu, Jennifer Q.; Matthews, Lynn T.; Psaros, Christina; Tomlinson, Mark
2013-01-01
Background A major barrier to improving perinatal mental health in Africa is the lack of locally validated tools for identifying probable cases of perinatal depression or for measuring changes in depression symptom severity. We systematically reviewed the evidence on the reliability and validity of instruments to assess perinatal depression in African settings. Methods and Findings Of 1,027 records identified through searching 7 electronic databases, we reviewed 126 full-text reports. We included 25 unique studies, which were disseminated in 26 journal articles and 1 doctoral dissertation. These enrolled 12,544 women living in nine different North and sub-Saharan African countries. Only three studies (12%) used instruments developed specifically for use in a given cultural setting. Most studies provided evidence of criterion-related validity (20 [80%]) or reliability (15 [60%]), while fewer studies provided evidence of construct validity, content validity, or internal structure. The Edinburgh postnatal depression scale (EPDS), assessed in 16 studies (64%), was the most frequently used instrument in our sample. Ten studies estimated the internal consistency of the EPDS (median estimated coefficient alpha, 0.84; interquartile range, 0.71-0.87). For the 14 studies that estimated sensitivity and specificity for the EPDS, we constructed 2 x 2 tables for each cut-off score. Using a bivariate random-effects model, we estimated a pooled sensitivity of 0.94 (95% confidence interval [CI], 0.68-0.99) and a pooled specificity of 0.77 (95% CI, 0.59-0.88) at a cut-off score of ≥9, with higher cut-off scores yielding greater specificity at the cost of lower sensitivity. Conclusions The EPDS can reliably and validly measure perinatal depression symptom severity or screen for probable postnatal depression in African countries, but more validation studies on other instruments are needed. 
In addition, more qualitative research is needed to adequately characterize local understandings of perinatal depression-like syndromes in different African contexts. PMID:24340036
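As an illustration of the 2 x 2 tables mentioned in the abstract above, the following minimal sketch computes sensitivity and specificity for a single study at a single cut-off score. The function name and the counts are hypothetical and are not taken from the review; the pooled estimates in the abstract come from a bivariate random-effects model, which this sketch does not implement.

```python
def sensitivity_specificity(tp, fn, fp, tn):
    """Return (sensitivity, specificity) from the four cells of a 2 x 2 table.

    tp: screen-positive women who are cases on the reference standard
    fn: screen-negative women who are cases
    fp: screen-positive women who are non-cases
    tn: screen-negative women who are non-cases
    """
    sensitivity = tp / (tp + fn)  # proportion of true cases detected
    specificity = tn / (tn + fp)  # proportion of non-cases correctly ruled out
    return sensitivity, specificity


# Hypothetical counts for one study at one cut-off (illustrative only).
sens, spec = sensitivity_specificity(tp=45, fn=5, fp=40, tn=160)
print(sens, spec)
```

Raising the cut-off typically moves some screen-positive women into the screen-negative row, which is why specificity tends to rise while sensitivity falls.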
Validating the M. D. Anderson Symptom Inventory (MDASI) for use in patients with ovarian cancer
Sailors, Mary H.; Bodurka, Diane C.; Gning, Ibrahima; Ramondetta, Lois M.; Williams, Loretta A.; Mendoza, Tito R.; Agarwal, Sonika; Sun, Charlotte C.; Cleeland, Charles S.
2013-01-01
Objective The M. D. Anderson Symptom Inventory (MDASI) captures the severity of common cancer symptoms from the patients’ perspective. We describe the validity and sensitivity of a module of the MDASI to be used with patients having ovarian cancer (MDASI-OC). Methods Ovarian cancer–specific module items were developed from 14 qualitative patient interviews. 128 patients with invasive epithelial ovarian, peritoneal, or fallopian-tube cancer treated at MD Anderson Cancer Center were recruited. Patients completed the MDASI-OC, socio-demographic questionnaires, the Functional Assessment of Cancer Therapy-Ovary (FACT-O), and a global quality-of-life (QOL) item. Reliability was assessed using Cronbach α, and sensitivity was assessed using known-group comparisons. Construct validity was tested using exploratory factor analysis. Results The sample was primarily white (85.2%), had a mean age of 57.5 years (±12.7 years), and had previously been treated with chemotherapy (75.0%) and/or surgery (93.8%). Approximately 30% of patients reported disturbed sleep, fatigue, or numbness/tingling of at least moderate severity (≥5 on a 0–10 scale). On the ovarian-cancer-specific symptoms, approximately 20% reported back pain, feeling bloated, or constipation of at least moderate severity. Factor analysis revealed six underlying constructs (pain/sleep; cognitive; disease-related and numbness; treatment-related; affective; gastrointestinal-specific). MDASI-OC symptom and interference items had Cronbach α values of 0.90 and 0.89, respectively. The MDASI-OC was sensitive to symptom severity by performance status (p=0.009), QOL (p=0.002), and FACT-O scores (p<0.001). Conclusions The 27-item MDASI-OC meets common criteria for validation and reliability and is sensitive to expected changes in symptoms related to differences in disease and treatment status. PMID:23685012
Dzhambov, Angel M; Dimitrova, Donka D
2014-01-01
The Noise Sensitivity Scale Short Form (NSS-SF), developed in English as a more practical form of the classical Weinstein NSS, has not to date been validated in other cultures, and its validity and reliability have not yet been confirmed. This study aimed to validate NSS-SF in Bulgarian and to demonstrate its applicability. The study comprised test-retest (n = 115) and a field-testing (n = 71) of the newly validated scale. Its construct validity was examined with confirmatory factor analysis, and very good model-fit was observed. Temporal stability was assessed in a test-retest (r = 0.990), convergent validity was examined with single-item susceptibility to the noise scale (r = 0.906) and discriminant validity was confirmed with single-item noise annoyance scale (r = 0.718). The lowest observed McDonald's omega across the studies was 0.923. The cross-cultural validation of NSS-SF was successful but it proved to be somewhat problematic with respect to its annoyance-based items.
Chandler, L S; Terhorst, L; Rogers, J C; Holm, M B
2016-07-01
The purpose of this study was to establish the validity, reliability, stability and sensitivity to change of the family-centred Movement Assessment of Children (MAC) in typically developing infants/toddlers from 2 months (1 month 16 days) to 2 years (24 months 15 days) of age. Assessment of infant/toddler motor development is critical so that infants and toddlers who are at-risk for developmental delay or whose functional motor development is delayed can be monitored and receive therapy to improve their developmental outcomes. Infants/toddlers are thought to be more responsive during the MAC assessment because parents and siblings participate and elicit responses. Two hundred seventy six children and 405 assessments contributed to the establishment of age-related parameters for typically developing infants and toddlers on the MAC. The MAC assesses three core domains of functional movement (head control, upper extremities and hands, pelvis and lower extremities), and generates a core total score. Four explanatory domains serve to alert examiners to factors that may impact atypical development (general observations, special senses, primitive reflexes/reactions, muscle tone). Construct validity of functional motor development was examined using the relationship between incremental increases in scores and increases in participants' ages. Subsamples were used to establish inter-rater reliability, test-retest reliability, stability and sensitivity to change. Construct validity was established and inter-rater reliability ICCs for the core items and core total ranged from 0.83 to 0.99. Percent agreement for the explanatory items ranged from 0.72 to 0.96. Stability within age grouping was consistent from baseline to 6 months post-baseline, and sensitivity to change from baseline to 6 months was significant for all core items and the total score. The MAC has proven to be a well-constructed assessment of infant and toddler functional motor development. 
It is a family-centred and efficient tool that can be used to assess and follow up infants and toddlers from 2 months to 2 years. © 2016 John Wiley & Sons Ltd.
Tsai, Chung-Yu
2017-07-01
A refractive laser beam shaper comprising two free-form profiles is presented. The profiles are designed using a free-form profile construction method such that each incident ray is directed in a certain user-specified direction or to a particular point on the target surface so as to achieve the required illumination distribution of the output beam. The validity of the proposed design method is demonstrated by means of ZEMAX simulations. The method is mathematically straightforward and easily implemented in computer code. It thus provides a convenient tool for the design and sensitivity analysis of laser beam shapers and similar optical components.
López-de-Uralde-Villanueva, I; Gil-Martínez, A; Candelas-Fernández, P; de Andrés-Ares, J; Beltrán-Alacreu, H; La Touche, R
2016-12-08
The self-administered Leeds Assessment of Neuropathic Symptoms and Signs (S-LANSS) scale is a tool designed to identify patients with pain with neuropathic features. To assess the validity and reliability of the Spanish-language version of the S-LANSS scale. Our study included a total of 182 patients with chronic pain to assess the convergent and discriminant validity of the S-LANSS; the sample was increased to 321 patients to evaluate construct validity and reliability. The validated Spanish-language version of the ID-Pain questionnaire was used as the criterion variable. All participants completed the ID-Pain, the S-LANSS, and the Numerical Rating Scale for pain. Discriminant validity was evaluated by analysing sensitivity, specificity, and the area under the receiver operating characteristic curve (AUC). Construct validity was assessed with factor analysis and by comparing the odds ratio of each S-LANSS item to the total score. Convergent validity and reliability were evaluated with Pearson's r and Cronbach's alpha, respectively. The optimal cut-off point for S-LANSS was ≥12 points (AUC=.89; sensitivity=88.7; specificity=76.6). Factor analysis yielded one factor; furthermore, all items contributed significantly to the positive total score on the S-LANSS (P<.05). The S-LANSS showed a significant correlation with ID-Pain (r=.734, α=.71). The Spanish-language version of the S-LANSS is valid and reliable for identifying patients with chronic pain with neuropathic features. Copyright © 2016 Sociedad Española de Neurología. Publicado por Elsevier España, S.L.U. All rights reserved.
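The abstract above reports an AUC and an optimal cut-off but does not say how the cut-off was selected. The sketch below shows one common approach under that assumption: the AUC is computed as the proportion of (case, non-case) pairs ranked correctly, and the cut-off is the score that maximises Youden's J (sensitivity + specificity - 1). The function names and the scores are invented for illustration and are not study data.

```python
def auc(scores_pos, scores_neg):
    """AUC as the share of (positive, negative) pairs ranked correctly; ties count 0.5."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def best_cutoff(scores_pos, scores_neg):
    """Return (cutoff, J, sensitivity, specificity) maximising Youden's J.

    A score >= cutoff is treated as a positive screen.
    """
    best = None
    for c in sorted(set(scores_pos) | set(scores_neg)):
        sens = sum(s >= c for s in scores_pos) / len(scores_pos)
        spec = sum(s < c for s in scores_neg) / len(scores_neg)
        j = sens + spec - 1
        if best is None or j > best[1]:
            best = (c, j, sens, spec)
    return best


# Hypothetical questionnaire scores (not taken from the study).
cases = [15, 17, 13, 20, 14, 22]      # criterion-positive patients
non_cases = [6, 9, 12, 4, 16, 8]      # criterion-negative patients
print(auc(cases, non_cases))
print(best_cutoff(cases, non_cases))
```

Other selection rules exist (for example, fixing a minimum sensitivity), so the cut-off found this way is only one reasonable choice.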
Verification, Validation and Sensitivity Studies in Computational Biomechanics
Anderson, Andrew E.; Ellis, Benjamin J.; Weiss, Jeffrey A.
2012-01-01
Computational techniques and software for the analysis of problems in mechanics have naturally moved from their origins in the traditional engineering disciplines to the study of cell, tissue and organ biomechanics. Increasingly complex models have been developed to describe and predict the mechanical behavior of such biological systems. While the availability of advanced computational tools has led to exciting research advances in the field, the utility of these models is often the subject of criticism due to inadequate model verification and validation. The objective of this review is to present the concepts of verification, validation and sensitivity studies with regard to the construction, analysis and interpretation of models in computational biomechanics. Specific examples from the field are discussed. It is hoped that this review will serve as a guide to the use of verification and validation principles in the field of computational biomechanics, thereby improving the peer acceptance of studies that use computational modeling techniques. PMID:17558646
Ofenloch, R F; Weisshaar, E; Dumke, A-K; Molin, S; Diepgen, T L; Apfelbacher, C
2014-08-01
Health-related quality of life (HRQOL) is widely used as a patient-reported outcome to evaluate clinical trials. In routine care it can also be used to improve treatment strategies or to enhance patients' self-awareness and empowerment. Therefore a disease-specific instrument is needed that assesses in detail all the impairments caused by the disease of interest. For patients with hand eczema (HE) such an instrument was developed by an international expert group, but its measurement properties are unknown. To validate the German version of the Quality of Life in Hand Eczema Questionnaire (QOLHEQ), which covers the domains of (i) symptoms, (ii) emotions, (iii) functioning and (iv) treatment and prevention. The QOLHEQ was assessed up to three times in 316 patients with HE to test reliability and sensitivity to change. To test construct validity we also assessed several reference measures. The scale structure was analysed using the Rasch model for each subscale and a structural equation model was used to test the multi domain structure of the QOLHEQ. After minor adaptions of the scoring structure, all four subscales of the QOLHEQ did not significantly misfit the Rasch model (α > 0·05). The fit indices of the structural equation model showed a good fit of the multi domain construct with four subscales assessing HRQOL. Nearly all a priori-defined hypotheses relating to construct validity could be confirmed. The QOLHEQ showed a sensitivity to change that was superior compared with all reference measures. The QOLHEQ is ready to be used in its German version as a sensitive outcome measure in clinical trials and for routine monitoring. The treatment-relevant subscales enable its use to enhance patients' self-awareness and to monitor treatment decisions. © 2014 British Association of Dermatologists.
Pedersen, Sue D.; Brar, Sony; Faris, Peter; Corenblum, Bernard
2007-01-01
OBJECTIVE To construct and validate a questionnaire for use in diagnosis of polycystic ovary syndrome (PCOS). DESIGN All participants completed a questionnaire, which asked clinical questions designed to assist in the diagnosis of PCOS, before their appointments with an endocrinologist. Following completion of the questionnaire, the endocrinologist (blinded to the answers) made or excluded a diagnosis of PCOS using clinical criteria and biochemical data as indicated. Questions were then evaluated for their power to predict PCOS, and a model was constructed using the most reliable items to establish a system to predict a diagnosis of PCOS. SETTING An outpatient reproductive endocrinology clinic in Calgary, Alta. PARTICIPANTS Adult women patients who had been referred to the clinic. Fifty patients with PCOS and 50 patients without PCOS were included in the study. MAIN OUTCOME MEASURES Demographic information, medical history, related diagnoses, menstrual history, and fertility history. RESULTS A history of infrequent menses, hirsutism, obesity, and acne were strongly predictive of a diagnosis of PCOS, whereas a history of failed pregnancy attempts was not useful. A history of nipple discharge outside of pregnancy strongly predicted no diagnosis of PCOS. We constructed a 4-item questionnaire for use in diagnosis of PCOS; the questionnaire yielded a sensitivity of 85% and a specificity of 85% on multivariate logistic regression and a sensitivity of 77% and a specificity of 94% using the 4-item questionnaire. Predictive accuracy was validated using a second sample of 117 patients, in addition to internal validation using bootstrap analysis. CONCLUSION We have constructed a simple clinical tool to help diagnose PCOS. This questionnaire can be easily incorporated into family physicians’ busy practices. PMID:17872783
Optimizing Spectral Wave Estimates with Adjoint-Based Sensitivity Maps
2014-02-18
J, Orzech MD, Ngodock HE (2013) Validation of a wave data assimilation system based on SWAN. Geophys Res Abst, (15), EGU2013-5951-1, EGU General ...surface wave spectra. Sensitivity maps are generally constructed for a selected system indicator (e.g., vorticity) by computing the differential of...spectral action balance Eq. 2, generally initialized at the offshore boundary with spectral wave and other outputs from regional models such as
Design and validation of a comprehensive fecal incontinence questionnaire.
Macmillan, Alexandra K; Merrie, Arend E H; Marshall, Roger J; Parry, Bryan R
2008-10-01
Fecal incontinence can have a profound effect on quality of life. Its prevalence remains uncertain because of stigma, lack of consistent definition, and dearth of validated measures. This study was designed to develop a valid clinical and epidemiologic questionnaire, building on current literature and expertise. Patients and experts undertook face validity testing. Construct validity, criterion validity, and test-retest reliability were assessed. Construct validity comprised factor analysis and internal consistency of the quality of life scale. The validity of known groups was tested against 77 control subjects by using regression models. Questionnaire results were compared with a stool diary for criterion validity. Test-retest reliability was calculated from repeated questionnaire completion. The questionnaire achieved good face validity. It was completed by 104 patients. The quality of life scale had four underlying traits (factor analysis) and high internal consistency (overall Cronbach alpha = 0.97). Patients and control subjects answered the questionnaire significantly differently (P < 0.01) in known-groups validity testing. Criterion validity assessment found mean differences close to zero. Median reliability for the whole questionnaire was 0.79 (range, 0.35-1). This questionnaire compares favorably with other available instruments, although the interpretation of stool consistency requires further research. Its sensitivity to treatment still needs to be investigated.
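Internal consistency in the abstract above is summarised with Cronbach's alpha. A minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given here; the function name and the response data are hypothetical and unrelated to the questionnaire in the study.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    items: list of k lists, one per item, each holding the scores of the
    same respondents in the same order.
    """
    k = len(items)
    # Total scale score for each respondent.
    totals = [sum(responses) for responses in zip(*items)]
    # Sum of the sample variances of the individual items.
    item_variance_sum = sum(variance(item) for item in items)
    return (k / (k - 1)) * (1 - item_variance_sum / variance(totals))


# Hypothetical responses from five people to a three-item scale.
example_items = [
    [1, 2, 3, 4, 5],
    [2, 2, 3, 5, 5],
    [1, 3, 3, 4, 4],
]
print(round(cronbach_alpha(example_items), 3))
```

Alpha approaches 1 when items rise and fall together across respondents, which is the pattern these made-up responses were chosen to show.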
Validity, Reliability, and Sensitivity of a Volleyball Intermittent Endurance Test.
Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl
2017-03-01
To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to seasonal changes were also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P < .001) relationships between VIET distance and maximal oxygen uptake (r = .74) and GXT maximal speed (r = .78) were observed. There were no significant differences between the VIET performance test and retest (1542.1 ± 338.1 vs 1567.1 ± 358.2 m). Significant (P < .001) relationships and intraclass correlation coefficient (ICC) were found (r = .95, ICC = .96) for VIET performance. VIET performance increased significantly (P < .001) with player performance level and was sensitive to fitness changes across the season (1458.8 ± 343.5 vs 1581.1 ± 334.0 m, P < .01). The VIET may be considered a valid, reliable, and sensitive test to assess the aerobic endurance in volleyball players.
Five-level emergency triage systems: variation in assessment of validity.
Kuriyama, Akira; Urushidani, Seigo; Nakayama, Takeo
2017-11-01
Triage systems are scales developed to rate the degree of urgency among patients who arrive at EDs. A number of different scales are in use; however, the way in which they have been validated is inconsistent. Also, it is difficult to define a surrogate that accurately predicts urgency. This systematic review described reference standards and measures used in previous validation studies of five-level triage systems. We searched PubMed, EMBASE and CINAHL to identify studies that had assessed the validity of five-level triage systems and described the reference standards and measures applied in these studies. Studies were divided into those using criterion validity (reference standards developed by expert panels or triage systems already in use) and those using construct validity (prognosis, costs and resource use). A total of 57 studies examined criterion and construct validity of 14 five-level triage systems. Criterion validity was examined by evaluating (1) agreement between the assigned degree of urgency with objective standard criteria (12 studies), (2) overtriage and undertriage (9 studies) and (3) sensitivity and specificity of triage systems (7 studies). Construct validity was examined by looking at (4) the associations between the assigned degree of urgency and measures gauged in EDs (48 studies) and (5) the associations between the assigned degree of urgency and measures gauged after hospitalisation (13 studies). Particularly, among 46 validation studies of the most commonly used triages (Canadian Triage and Acuity Scale, Emergency Severity Index and Manchester Triage System), 13 and 39 studies examined criterion and construct validity, respectively. Previous studies applied various reference standards and measures to validate five-level triage systems. They either created their own reference standard or used a combination of severity/resource measures. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. 
All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Objective validation of central sensitization in the rat UVB and heat rekindling model
Weerasinghe, NS; Lumb, BM; Apps, R; Koutsikou, S; Murrell, JC
2014-01-01
Background The UVB and heat rekindling (UVB/HR) model shows potential as a translatable inflammatory pain model. However, the occurrence of central sensitization in this model, a fundamental mechanism underlying chronic pain, has been debated. Face, construct and predictive validity are key requisites of animal models; electromyogram (EMG) recordings were utilized to objectively demonstrate validity of the rat UVB/HR model. Methods The UVB/HR model was induced on the heel of the hind paw under anaesthesia. Mechanical withdrawal thresholds (MWTs) were obtained from biceps femoris EMG responses to a gradually increasing pinch at the mid hind paw region under alfaxalone anaesthesia, 96 h after UVB irradiation. MWT was compared between UVB/HR and SHAM-treated rats (anaesthetic only). Underlying central mechanisms in the model were pharmacologically validated by MWT measurement following intrathecal N-methyl-d-aspartate (NMDA) receptor antagonist, MK-801, or saline. Results Secondary hyperalgesia was confirmed by a significantly lower pre-drug MWT {mean [±standard error of the mean (SEM)]} in UVB/HR [56.3 (±2.1) g/mm2, n = 15] compared with SHAM-treated rats [69.3 (±2.9) g/mm2, n = 8], confirming face validity of the model. Predictive validity was demonstrated by the attenuation of secondary hyperalgesia by MK-801, where mean (±SEM) MWT was significantly higher [77.2 (±5.9) g/mm2 n = 7] in comparison with pre-drug [57.8 (±3.5) g/mm2 n = 7] and saline [57.0 (±3.2) g/mm2 n = 8] at peak drug effect. The occurrence of central sensitization confirmed construct validity of the UVB/HR model. Conclusions This study used objective outcome measures of secondary hyperalgesia to validate the rat UVB/HR model as a translational model of inflammatory pain. What's already known about this topic? Most current animal chronic pain models lack translatability to human subjects. 
Primary hyperalgesia is an established feature of the UVB/heat rekindling inflammatory pain model in rodents and humans, but the presence of secondary hyperalgesia, a hallmark feature of central sensitization and thus chronic pain, is contentious. What does this study add? Secondary hyperalgesia was demonstrated in the rat UVB/heat rekindling model using an objective outcome measure (electromyogram), overcoming the subjective limitations of previous behavioural studies. PMID:24590815
Orrung Wallin, Anneli; Edberg, Anna-Karin; Beck, Ingela; Jakobsson, Ulf
2013-01-01
There are many instruments assessing the wellbeing of staff, but far from all have been psychometrically investigated. When evaluating supportive interventions directed toward nurse assistants in residential care, valid and reliable instruments are needed in order to detect possible changes. The aim of the study was to investigate validity in terms of data quality, construct validity, convergent and divergent validity and reliability in terms of the internal consistency and stability of the Job Satisfaction Questionnaire, the Psychosocial Aspects of Job Satisfaction, the Strain in Dementia Care Scale (SDCS), and the Stress of Conscience Questionnaire (SCQ) in a residential care context. The psychometric properties of the instruments were investigated in terms of data quality, construct validity, convergent and divergent validity and reliability, including test-retest reliability, in a residential care context with a sample consisting of nurse assistants (n=114). The four instruments responded with different psychometric-related problems such as internal missing data, floor and ceiling effects, problems with construct validity and low test-retest reliability, especially when assessed on the item level. These problems were however reduced or disappeared completely when assessed for total and factor scores. From a psychometric perspective, the SDCS seemed to stand out as the best instrument. However, it should be modified in order to reduce floor effects on item level and thereby gain sensitivity. The Job Satisfaction Questionnaire seemed to have problems both with the construct validity and test-retest reliability. The final choice of instrument must, however, be made dependent on what one intends to measure. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Tiet, Quyen Q; Leyva, Yani; Moos, Rudolf H; Smith, Brandy
2016-07-01
The Alcohol, Smoking and Substance Involvement Screening Test (ASSIST) is a screening instrument to detect substance use in primary care (PC). To screen for illicit substances (excluding tobacco and alcohol), the ASSIST consists of 8-57 questions and requires complicated scoring. To improve the efficiency of screening of drug misuse in PC, this study constructed and validated a two-item screen for drug use from the ASSIST. Guided by previous reviews, the ASSIST was revised. Patients were recruited in VA primary care clinics (N=1283). Half of the sample was used to develop the ASSIST-Drug; the other half was used to validate it. The Mini International Neuropsychiatric Interview (MINI) and the Inventory of Drug Use Consequences were the criterion measures. A brief, two-item ASSIST-Drug was constructed. Based on the development sample, the ASSIST-Drug was 94.1% sensitive and 89.6% specific for drug use disorders. Based on the validation sample, it was 95.4% sensitive and 87.8% specific. The ASSIST-Drug also had comparable sensitivity and specificity to identify drug use negative consequences, as well as for diverse subgroups of patients in terms of gender, age, race/ethnicity, marital status, educational levels, and post traumatic stress disorder status. The ASSIST-Drug may be a useful screening tool for PC settings. It is reliable, brief, and easy to remember, administer and score. It is sensitive and specific for drug use disorders and drug use negative consequences, and the predictive properties are consistent across subgroups of patients. Published by Elsevier Ireland Ltd.
[Psychometric properties and diagnostic value of 'lexical screening for aphasias'].
Pena-Chavez, R; Martinez-Jimenez, L; Lopez-Espinoza, M
2014-09-16
INTRODUCTION. Language assessment in persons with brain injury makes it possible to know whether they require language rehabilitation or not. Given the importance of a precise evaluation, assessment instruments must be valid and reliable, so as to avoid mistaken and subjective diagnoses. AIM. To validate 'lexical screening for aphasias' in a sample of 58 Chilean individuals. SUBJECTS AND METHODS. A screening-type language test, lasting 20 minutes and based on the lexical processing model devised by Patterson and Shewell (1987), was constructed. The sample was made up of two groups containing 29 aphasic subjects and 29 control subjects from different health centres in the regions of Biobio and Maule, Chile. Participants were aged between 24 and 79 years and had between 0 and 17 years of schooling. Tests were carried out to determine discriminating validity, concurrent validity with the aphasia disorder assessment battery, reliability, sensitivity and specificity. RESULTS. The statistical analysis showed a high discriminating validity (p < 0.001), an acceptable mean concurrent validity with aphasia disorder assessment battery (rs = 0.65), high mean reliability (alpha = 0.87), moderate mean sensitivity (69%) and high mean specificity (86%). CONCLUSION. 'Lexical screening for aphasias' is valid and reliable for assessing language in persons with aphasias; it is sensitive for detecting aphasic subjects and is specific for precluding language disorders in persons with normal language abilities.
Preliminary data on validity of the Drug Addiction Treatment Efficacy Questionnaire.
Kastelic, Andrej; Mlakar, Janez; Pregelj, Peter
2013-09-01
This study describes the validation process for the Slovenian version of the Drug Addiction Treatment Efficacy Questionnaire (DATEQ). DATEQ was constructed from the questionnaires used at the Centre for the Treatment of Drug Addiction, Ljubljana University Psychiatric Hospital, and within the network of Centres for the Prevention and Treatment of Drug Addiction in Slovenia during the past 14 years. The Slovenian version of the DATEQ was translated to English using the 'forward-backward' procedure by its authors and their co-workers. The validation process included 100 male and female patients with established addiction to illicit drugs who had been prescribed opioid substitution therapy. The DATEQ questionnaire was used in the study, together with clinical evaluation to measure psychological state and to evaluate the efficacy of treatment in the last year. To determine the validity of DATEQ, the correlation with the clinical assessments of the outcome was calculated using one-way ANOVA. The F value was 44.4, p<0.001 (sum of squares: between groups 210.4, df=2, within groups 229.7, df=97, total 440.1, df=99). At the cut-off 4 the sensitivity is 81% and specificity 83%. The validation process for the Slovenian DATEQ version shows metric properties similar to those found in international studies of similar questionnaires, suggesting that it measures the same constructs in the same way as similar questionnaires. However, the relatively low sensitivity and specificity suggest caution when using DATEQ as the only measure of outcome.
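The F value reported in the abstract above can be checked from the sums of squares and degrees of freedom it lists, since in a one-way ANOVA F is the between-groups mean square divided by the within-groups mean square. The short sketch below does that arithmetic; the function name is illustrative, and only the four numbers passed to it come from the abstract.

```python
def one_way_anova_f(ss_between, df_between, ss_within, df_within):
    """F statistic for a one-way ANOVA from sums of squares and degrees of freedom."""
    ms_between = ss_between / df_between  # mean square between groups
    ms_within = ss_within / df_within     # mean square within groups
    return ms_between / ms_within


# Values as reported in the abstract.
f_value = one_way_anova_f(ss_between=210.4, df_between=2,
                          ss_within=229.7, df_within=97)
print(round(f_value, 1))  # prints 44.4
```

The reported totals are also internally consistent: 210.4 + 229.7 = 440.1 for the sums of squares and 2 + 97 = 99 for the degrees of freedom.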
Anderson, J.R.; Ackerman, J.J.H.; Garbow, J.R.
2015-01-01
Two semipermeable, hollow fiber phantoms for the validation of perfusion-sensitive magnetic resonance methods and signal models are described. Semipermeable hollow fibers harvested from a standard commercial hemodialysis cartridge serve to mimic tissue capillary function. Flow of aqueous media through the fiber lumen is achieved with a laboratory-grade peristaltic pump. Diffusion of water and solute species (e.g., Gd-based contrast agent) occurs across the fiber wall, allowing exchange between the lumen and the extralumenal space. Phantom design attributes include: i) small physical size, ii) easy and low-cost construction, iii) definable compartment volumes, and iv) experimental control over media content and flow rate. PMID:26167136
Cross-cultural construct validity study of professionalism of Vietnamese medical students.
Nhan, Vo Thanh; Violato, Claudio; Le An, Pham; Beran, Tanya N
2014-01-01
Although many studies have made efforts to define and assess medical professionalism, few have addressed issues of construct validity. The purpose of this article is to explore further construct validity of medical professionalism employing exploratory and confirmatory factor analysis. The 32-item instrument by the American Board of Internal Medicine (ABIM) was adapted to assess the perceptions on medical professionalism of Vietnamese medical students. A sample of 1,196 (487 first-year, 341 third-year, 368 sixth-year) medical students participated voluntarily in the completion of the instrument. The data were randomly divided into three samples to assess the construct validity of medical professionalism by empirically deriving and confirming a model of professionalism. Exploratory and confirmatory factor analytic techniques resulted in a six-factor well-fitting model with a comparative fit index of .963 and root mean square error approximation of .029, 90% confidence interval [.016, .039]: integrity, social responsibility, professional practice habits, ensuring quality care, altruism, and self-awareness. Social responsibility was perceived as least important, and self-awareness was perceived as most important by Vietnamese medical students. These constructs of medical professionalism were relatively similar to those found in Taiwanese medical students and the ABIM definitions but with some Vietnamese cultural differences. Although the results confirm that medical professionalism is a somewhat culturally sensitive construct, it nonetheless has many elements of medical professionalism that are universal. Future research should be conducted to test the generalizability of our six-factor model of professionalism with various samples (e.g., residents, physicians), cultures, and language groups.
Mares-García, Emma; Palazón-Bru, Antonio; Folgado-de la Rosa, David Manuel; Pereira-Expósito, Avelino; Martínez-Martín, Álvaro; Cortés-Castell, Ernesto; Gil-Guillén, Vicente Francisco
2017-01-01
Other studies have assessed nonadherence to proton pump inhibitors (PPIs), but none has developed a screening test for its detection. To construct and internally validate a predictive model for nonadherence to PPIs. This prospective observational study with a one-month follow-up was carried out in 2013 in Spain, and included 302 patients with a prescription for PPIs. The primary variable was nonadherence to PPIs (pill count). Secondary variables were gender, age, antidepressants, type of PPI, non-guideline-recommended prescription (NGRP) of PPIs, and total number of drugs. With the secondary variables, a binary logistic regression model to predict nonadherence was constructed and adapted to a points system. The ROC curve, with its area (AUC), was calculated and the optimal cut-off point was established. The points system was internally validated through 1,000 bootstrap samples and implemented in a mobile application (Android). The points system had three prognostic variables: total number of drugs, NGRP of PPIs, and antidepressants. The AUC was 0.87 (95% CI [0.83-0.91], p < 0.001). The test yielded a sensitivity of 0.80 (95% CI [0.70-0.87]) and a specificity of 0.82 (95% CI [0.76-0.87]). The three parameters were very similar in the bootstrap validation. A points system to predict nonadherence to PPIs has been constructed, internally validated and implemented in a mobile application. Provided similar results are obtained in external validation studies, we will have a screening tool to detect nonadherence to PPIs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Emter, Roger; Natsch, Andreas, E-mail: andreas.natsch@givaudan.com
2015-11-01
Heme oxygenase (decycling) 1 (HMOX1) is the most consistently found genetic marker induced by skin sensitizers. HMOX1 is often referred to as a typical gene regulated by nuclear factor erythroid 2-related factor 2 (Nrf2); however, it is also regulated by other DNA-binding factors, including BTB and CNC homolog 1 (Bach1). The KeratinoSens™ assay is the first validated in vitro assay for sensitizers that measures gene induction. It is based on luciferase expression regulated by the antioxidant response element (ARE) of the aldoketoreductase 1C2 (AKR1C2) gene. Luciferase upregulation is dependent on Nrf2, while HMOX1 upregulation is only partially Nrf2-dependent. Thus, sensitizer-dependent activation of HMOX1 may integrate multiple signals thereby providing additional information. We constructed reporter cell lines containing the full HMOX1 regulatory region or the HMOX1-ARE sequence and compared them with the construct containing the AKR1C2-ARE sequence. Induction of the AKR1C2-ARE depends on Nrf2, but not on the repressor Bach1. Results obtained with HMOX1-ARE and the full HMOX1 promoter indicate that, within the HMOX1 promoter, the HMOX1-ARE is sufficient to explain the induction by sensitizers and that (i) inhibiting Bach1 leads to strong basal expression, (ii) fold-induction by sensitizers above this level is reduced in the absence of Bach1 and (iii) these constructs are less dependent on Nrf2 as compared to the AKR1C2-ARE. Nevertheless, congruent dose response curves for luciferase activity were obtained with all constructs. Thus, while sensitizer-induced HMOX1 activation is dependent on Nrf2 and Bach1, all constructs give identical information for the in vitro prediction of the sensitization potential. - Highlights: • HMOX1 is a key genetic marker up-regulated by skin sensitizers. • HMOX1-, but not AKR1C2-upregulation, is dependent on both Nrf2 and Bach1. • AKR1C2 and HMOX1-dependent reporter constructs yield congruent dose response curves.
• Combining both constructs offers no advantage over either construct used alone.
Guetterman, Timothy C; Kron, Frederick W; Campbell, Toby C; Scerbo, Mark W; Zelenski, Amy B; Cleary, James F; Fetters, Michael D
2017-01-01
Despite interest in using virtual humans (VHs) for assessing health care communication, evidence of validity is limited. We evaluated the validity of a VH application, MPathic-VR, for assessing performance-based competence in breaking bad news (BBN) to a VH patient. We used a two-group quasi-experimental design, with residents participating in a 3-hour seminar on BBN. Group A (n=15) completed the VH simulation before and after the seminar, and Group B (n=12) completed the VH simulation only after the BBN seminar to avoid the possibility that testing alone affected performance. Pre- and postseminar differences for Group A were analyzed with a paired t-test, and comparisons between Groups A and B were analyzed with an independent t-test. Compared to the preseminar result, Group A's postseminar scores improved significantly, indicating that the VH program was sensitive to differences in assessing performance-based competence in BBN. Postseminar scores of Group A and Group B were not significantly different, indicating that both groups performed similarly on the VH program. Improved pre-post scores demonstrate acquisition of skills in BBN to a VH patient. Pretest sensitization did not appear to influence posttest assessment. These results provide initial construct validity evidence that the VH program is effective for assessing BBN performance-based communication competence.
Guetterman, Timothy C; Kron, Frederick W; Campbell, Toby C; Scerbo, Mark W; Zelenski, Amy B; Cleary, James F; Fetters, Michael D
2017-01-01
Background Despite interest in using virtual humans (VHs) for assessing health care communication, evidence of validity is limited. We evaluated the validity of a VH application, MPathic-VR, for assessing performance-based competence in breaking bad news (BBN) to a VH patient. Methods We used a two-group quasi-experimental design, with residents participating in a 3-hour seminar on BBN. Group A (n=15) completed the VH simulation before and after the seminar, and Group B (n=12) completed the VH simulation only after the BBN seminar to avoid the possibility that testing alone affected performance. Pre- and postseminar differences for Group A were analyzed with a paired t-test, and comparisons between Groups A and B were analyzed with an independent t-test. Results Compared to the preseminar result, Group A’s postseminar scores improved significantly, indicating that the VH program was sensitive to differences in assessing performance-based competence in BBN. Postseminar scores of Group A and Group B were not significantly different, indicating that both groups performed similarly on the VH program. Conclusion Improved pre–post scores demonstrate acquisition of skills in BBN to a VH patient. Pretest sensitization did not appear to influence posttest assessment. These results provide initial construct validity evidence that the VH program is effective for assessing BBN performance-based communication competence. PMID:28794664
Andrade Ortega, Juan Alfonso; Millán Gómez, Ana Pilar; Ribeiro González, Marisa; Martínez Piró, Pilar; Jiménez Anula, Juan; Sánchez Andújar, María Belén
2017-06-21
The early detection of upper limb complications is important in women operated on for breast cancer. The "FACT-B+4-UL" questionnaire, a specific variant of the Functional Assessment of Cancer Therapy-Breast (FACT-B), is one of the instruments available to measure upper limb function. The Spanish version of the upper limb subscale of the FACT-B+4 was validated in a prospective cohort of 201 women operated on for breast cancer (factor analysis, internal consistency, test-retest reliability, construct validity and sensitivity to change were determined). Its capacity to predict subsequent lymphoedema and other complications in the upper limb was explored using logistic regression. This subscale is unifactorial and has high internal consistency (Cronbach's alpha: 0.87); its test-retest reliability and construct validity are strong (intraclass correlation coefficient: 0.986; Pearson's R with "Quick DASH": 0.81), as is its sensitivity to change. It did not predict the onset of lymphoedema. Its predictive capacity for other upper limb complications is low. FACT-B+4-UL is useful in measuring upper limb disability in women surgically treated for breast cancer, but it does not predict the onset of lymphoedema and its predictive capacity for other complications in the upper limb is low. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Rachakonda, Tara; Jeffe, Donna B; Shin, Jennifer J; Mankarious, Leila; Fanning, Robert J; Lesperance, Marci M; Lieu, Judith E C
2014-02-01
The prevalence of hearing loss (HL) in adolescents has grown over the past decade, but hearing-related quality of life (QOL) has not been well-measured. We sought to develop a reliable, valid measure of hearing-related QOL for adolescents, the Hearing Environments And Reflection on Quality of Life (HEAR-QL). Multisite observational study. Adolescents with HL and siblings without HL were recruited from five centers. Participants completed the HEAR-QL and validated questionnaires measuring generic pediatric QOL (PedsQL), depression and anxiety (RCADS-25), and hearing-related QOL for adults (HHIA) to determine construct and discriminant validity. Participants completed the HEAR-QL 2 weeks later for test-retest reliability. We used exploratory principal components analysis to determine the HEAR-QL factor structure and measured reliability. Sensitivity and specificity of the HEAR-QL, PedsQL, HHIA, and RCADS-25 were assessed. We compared scores on all surveys between those with normal hearing, unilateral, and bilateral HL. A total of 233 adolescents (13-18 years old) participated: 179 with HL, 54 without HL. The original 45-item HEAR-QL was shortened to 28 items after determining factor structure. The resulting HEAR-QL-28 demonstrated excellent reliability (Cronbach's alpha = 0.95) and construct validity (HHIA: r = .845, PedsQL: r = .587; RCADS-25: r = .433). The HEAR-QL-28 displayed excellent discriminant validity, with higher area under the curve (0.932) than the PedsQL (0.597) or RCADS-25 (0.529). Teens with bilateral HL using hearing devices reported worse QOL on the HEAR-QL and HHIA than peers with HL not using devices. The HEAR-QL is a sensitive, reliable, and valid measure of hearing-related QOL for adolescents. Level of evidence: 2b. © 2013 The American Laryngological, Rhinological and Otological Society, Inc.
Rachakonda, Tara; Jeffe, Donna B.; Shin, Jennifer J.; Mankarious, Leila; Fanning, Robert J.; Lesperance, Marci M.; Lieu, Judith E.C.
2014-01-01
Objectives The prevalence of hearing loss (HL) in adolescents has grown over the past decade, but hearing-related quality of life (QOL) has not been well-measured. We sought to develop a reliable, valid measure of hearing-related QOL for adolescents, the Hearing Environments And Reflection on Quality of Life (HEAR-QL). Study Design Multi-site observational study. Methods Adolescents with HL and siblings without HL were recruited from five centers. Participants completed the HEAR-QL and validated questionnaires measuring generic pediatric QOL (PedsQL), depression and anxiety (RCADS-25), and hearing-related QOL for adults (HHIA) to determine construct and discriminant validity. Participants completed the HEAR-QL two weeks later for test-retest reliability. We used exploratory principal components analysis to determine the HEAR-QL factor structure and measured reliability. Sensitivity and specificity of the HEAR-QL, PedsQL, HHIA and RCADS-25 were assessed. We compared scores on all surveys between those with normal hearing, unilateral and bilateral HL. Results 233 adolescents (13–18 years old) participated—179 with HL, 54 without HL. The original 45-item HEAR-QL was shortened to 28 items after determining factor structure. The resulting HEAR-QL-28 demonstrated excellent reliability (Cronbach’s alpha= 0.95) and construct validity (HHIA: r =.845, PedsQL: r =.587; RCADS-25: r =.433). The HEAR-QL-28 displayed excellent discriminant validity, with higher area under the curve (0.932) than the PedsQL (0.597) or RCADS-25 (0.529). Teens with bilateral HL using hearing devices reported worse QOL on the HEAR-QL and HHIA than peers with HL not using devices. Conclusions The HEAR-QL is a sensitive, reliable and valid measure of hearing-related QOL for adolescents. PMID:23900836
Castillo-Tandazo, Wilson; Flores-Fortty, Adolfo; Feraud, Lourdes; Tettamanti, Daniel
2013-01-01
Purpose To translate, cross-culturally adapt, and validate the Questionnaire for Diabetes-Related Foot Disease (Q-DFD), originally created and validated in Australia, for its use in Spanish-speaking patients with diabetes mellitus. Patients and methods The translation and cross-cultural adaptation were based on international guidelines. The Spanish version of the survey was applied to a community-based (sample A) and a hospital clinic-based sample (samples B and C). Samples A and B were used to determine criterion and construct validity comparing the survey findings with clinical evaluation and medical records, respectively; while sample C was used to determine intra- and inter-rater reliability. Results After completing the rigorous translation process, only four items were considered problematic and required a new translation. In total, 127 patients were included in the validation study: 76 to determine criterion and construct validity and 41 to establish intra- and inter-rater reliability. For an overall diagnosis of diabetes-related foot disease, a substantial level of agreement was obtained when we compared the Q-DFD with the clinical assessment (kappa 0.77, sensitivity 80.4%, specificity 91.5%, positive likelihood ratio [LR+] 9.46, negative likelihood ratio [LR−] 0.21); while an almost perfect level of agreement was obtained when it was compared with medical records (kappa 0.88, sensitivity 87%, specificity 97%, LR+ 29.0, LR− 0.13). Survey reliability showed substantial levels of agreement, with kappa scores of 0.63 and 0.73 for intra- and inter-rater reliability, respectively. Conclusion The translated and cross-culturally adapted Q-DFD showed good psychometric properties (validity, reproducibility, and reliability) that allow its use in Spanish-speaking diabetic populations. PMID:24039434
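The likelihood ratios quoted in the abstract above follow directly from the reported sensitivity and specificity. As a minimal sketch (the function name `likelihood_ratios` is illustrative and not taken from the study), the standard definitions LR+ = sensitivity / (1 − specificity) and LR− = (1 − sensitivity) / specificity can be checked against the published figures:

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for a test, with both inputs given as proportions."""
    lr_pos = sensitivity / (1.0 - specificity)
    lr_neg = (1.0 - sensitivity) / specificity
    return lr_pos, lr_neg


# Sensitivity and specificity as reported in the abstract above.
clinical = likelihood_ratios(0.804, 0.915)  # Q-DFD vs clinical assessment
records = likelihood_ratios(0.87, 0.97)     # Q-DFD vs medical records

print(f"clinical assessment: LR+ {clinical[0]:.2f}, LR- {clinical[1]:.2f}")
print(f"medical records:     LR+ {records[0]:.1f}, LR- {records[1]:.2f}")
```

Rounded to the precision used in the abstract, both pairs reproduce the reported values (9.46 and 0.21; 29.0 and 0.13).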
2012-01-01
Background Multi attribute utility (MAU) instruments are used to include the health related quality of life (HRQoL) in economic evaluations of health programs. Comparative studies suggest different MAU instruments measure related but different constructs. The objective of this paper is to describe the methods employed to achieve content validity in the descriptive system of the Assessment of Quality of Life (AQoL)-6D, MAU instrument. Methods The AQoL program introduced the use of psychometric methods in the construction of health related MAU instruments. To develop the AQoL-6D we selected 112 items from previous research, focus groups and expert judgment and administered them to 316 members of the public and 302 hospital patients. The search for content validity across a broad spectrum of health states required both formative and reflective modelling. We employed Exploratory Factor Analysis and Structural Equation Modelling (SEM) to meet these dual requirements. Results and Discussion The resulting instrument employs 20 items in a multi-tier descriptive system. Latent dimension variables achieve sensitive descriptions of 6 dimensions which, in turn, combine to form a single latent QoL variable. Diagnostic statistics from the SEM analysis are exceptionally good and confirm the hypothesised structure of the model. Conclusions The AQoL-6D descriptive system has good psychometric properties. They imply that the instrument has achieved construct validity and provides a sensitive description of HRQoL. This means that it may be used with confidence for measuring health related quality of life and that it is a suitable basis for modelling utilities for inclusion in the economic evaluation of health programs. PMID:22507254
Richardson, Jeffrey R J; Peacock, Stuart J; Hawthorne, Graeme; Iezzi, Angelo; Elsworth, Gerald; Day, Neil A
2012-04-17
Multi attribute utility (MAU) instruments are used to include the health related quality of life (HRQoL) in economic evaluations of health programs. Comparative studies suggest different MAU instruments measure related but different constructs. The objective of this paper is to describe the methods employed to achieve content validity in the descriptive system of the Assessment of Quality of Life (AQoL)-6D, MAU instrument. The AQoL program introduced the use of psychometric methods in the construction of health related MAU instruments. To develop the AQoL-6D we selected 112 items from previous research, focus groups and expert judgment and administered them to 316 members of the public and 302 hospital patients. The search for content validity across a broad spectrum of health states required both formative and reflective modelling. We employed Exploratory Factor Analysis and Structural Equation Modelling (SEM) to meet these dual requirements. The resulting instrument employs 20 items in a multi-tier descriptive system. Latent dimension variables achieve sensitive descriptions of 6 dimensions which, in turn, combine to form a single latent QoL variable. Diagnostic statistics from the SEM analysis are exceptionally good and confirm the hypothesised structure of the model. The AQoL-6D descriptive system has good psychometric properties. They imply that the instrument has achieved construct validity and provides a sensitive description of HRQoL. This means that it may be used with confidence for measuring health related quality of life and that it is a suitable basis for modelling utilities for inclusion in the economic evaluation of health programs.
Cognitive Predictors of Rapid Picture Naming
ERIC Educational Resources Information Center
Decker, Scott L.; Roberts, Alycia M.; Englund, Julia A.
2013-01-01
Deficits in rapid automatized naming (RAN) have been found to be a sensitive cognitive marker for children with dyslexia. However, there is a lack of consensus regarding the construct validity and theoretical neuro-cognitive processes involved in RAN. Additionally, most studies investigating RAN include a narrow range of cognitive measures. The…
Psychometric evaluation of the Swedish version of Rosenberg's self-esteem scale.
Eklund, Mona; Bäckström, Martin; Hansson, Lars
2018-04-01
The widely used Rosenberg's self-esteem scale (RSES) has not been evaluated for psychometric properties in Sweden. This study aimed at analyzing its factor structure, internal consistency, criterion, convergent and discriminant validity, sensitivity to change, and whether a four-graded Likert-type response scale increased its reliability and validity compared to a yes/no response scale. People with mental illness participating in intervention studies to (1) promote everyday life balance (N = 223) or (2) remedy self-stigma (N = 103) were included. Both samples completed the RSES and questionnaires addressing quality of life and sociodemographic data. Sample 1 also completed instruments chosen to assess convergent and discriminant validity: self-mastery (convergent validity), level of functioning and occupational engagement (discriminant validity). Confirmatory factor analysis (CFA), structural equation modeling, and conventional inferential statistics were used. Based on both samples, the Swedish RSES formed one factor and exhibited high internal consistency (>0.90). The two response scales were equivalent. Criterion validity in relation to quality of life was demonstrated. RSES could distinguish between women and men (women scoring lower) and between diagnostic groups (people with depression scoring lower). Correlations >0.5 with variables chosen to reflect convergent validity and around 0.2 with variables used to address discriminant validity further highlighted the construct validity of RSES. The instrument also showed sensitivity to change. The Swedish RSES exhibited a one-component factor structure and showed good psychometric properties in terms of good internal consistency, criterion, convergent and discriminant validity, and sensitivity to change. The yes/no and the four-graded Likert-type response scales worked equivalently.
Li, Honglan; Joh, Yoon Sung; Kim, Hyunwoo; Paek, Eunok; Lee, Sang-Won; Hwang, Kyu-Baek
2016-12-22
Proteogenomics is a promising approach for various tasks ranging from gene annotation to cancer research. Databases for proteogenomic searches are often constructed by adding peptide sequences inferred from genomic or transcriptomic evidence to reference protein sequences. Such inflation of databases has potential of identifying novel peptides. However, it also raises concerns on sensitive and reliable peptide identification. Spurious peptides included in target databases may result in underestimated false discovery rate (FDR). On the other hand, inflation of decoy databases could decrease the sensitivity of peptide identification due to the increased number of high-scoring random hits. Although several studies have addressed these issues, widely applicable guidelines for sensitive and reliable proteogenomic search have hardly been available. To systematically evaluate the effect of database inflation in proteogenomic searches, we constructed a variety of real and simulated proteogenomic databases for yeast and human tandem mass spectrometry (MS/MS) data, respectively. Against these databases, we tested two popular database search tools with various approaches to search result validation: the target-decoy search strategy (with and without a refined scoring-metric) and a mixture model-based method. The effect of separate filtering of known and novel peptides was also examined. The results from real and simulated proteogenomic searches confirmed that separate filtering increases the sensitivity and reliability in proteogenomic search. However, no one method consistently identified the largest (or the smallest) number of novel peptides from real proteogenomic searches. We propose to use a set of search result validation methods with separate filtering, for sensitive and reliable identification of peptides in proteogenomic search.
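The idea of separate filtering in a target-decoy search can be illustrated with a small sketch. This is not the authors' pipeline: the function names, the tuple layout, and the simple FDR estimate (decoy hits divided by target hits above a score cut-off) are assumptions made for illustration, and tied scores are not treated specially.

```python
def accepted_at_fdr(psms, max_fdr=0.01):
    """psms: list of (score, is_decoy) pairs.

    Keeps the longest top-ranked prefix whose estimated FDR
    (decoys / targets) stays within max_fdr, and returns the
    scores of the target hits in that prefix.
    """
    ranked = sorted(psms, key=lambda p: p[0], reverse=True)
    targets = decoys = 0
    best_cut = 0  # number of top-ranked hits to keep
    for i, (_, is_decoy) in enumerate(ranked, start=1):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        if targets and decoys / targets <= max_fdr:
            best_cut = i
    return [score for score, is_decoy in ranked[:best_cut] if not is_decoy]


def separate_filtering(psms, max_fdr=0.01):
    """psms: list of (score, is_decoy, is_novel) triples.

    Assumes each decoy hit can be assigned to the known or novel class.
    The FDR threshold is applied to each class independently.
    """
    known = [(s, d) for s, d, novel in psms if not novel]
    novel = [(s, d) for s, d, novel in psms if novel]
    return accepted_at_fdr(known, max_fdr), accepted_at_fdr(novel, max_fdr)


# Hypothetical toy scores, for illustration only.
toy = [
    (10, False, False), (9, False, False), (8, False, False),
    (7, False, True), (6, True, True), (5, False, True),
]
known_hits, novel_hits = separate_filtering(toy, max_fdr=0.01)
```

Filtering the two classes separately means that decoy hits among the novel peptides do not change the cut-off applied to known peptides, and vice versa.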
Irvine, Karen-Amanda; Ferguson, Adam R.; Mitchell, Kathleen D.; Beattie, Stephanie B.; Lin, Amity; Stuck, Ellen D.; Huie, J. Russell; Nielson, Jessica L.; Talbott, Jason F.; Inoue, Tomoo; Beattie, Michael S.; Bresnahan, Jacqueline C.
2014-01-01
The IBB scale is a recently developed forelimb scale for the assessment of fine control of the forelimb and digits after cervical spinal cord injury [SCI; (1)]. The present paper describes the assessment of inter-rater reliability and face, concurrent and construct validity of this scale following SCI. It demonstrates that the IBB is a reliable and valid scale that is sensitive to severity of SCI and to recovery over time. In addition, the IBB correlates with other outcome measures and is highly predictive of biological measures of tissue pathology. Multivariate analysis using principal component analysis (PCA) demonstrates that the IBB is highly predictive of the syndromic outcome after SCI (2), and is among the best predictors of bio-behavioral function, based on strong construct validity. Altogether, the data suggest that the IBB, especially in concert with other measures, is a reliable and valid tool for assessing neurological deficits in fine motor control of the distal forelimb, and represents a powerful addition to multivariate outcome batteries aimed at documenting recovery of function after cervical SCI in rats. PMID:25071704
Spertus, John; Jones, Philip; Poler, Sherri; Rocha-Singh, Krishna
2004-02-01
The most common indication for treating patients with peripheral arterial disease is to improve their health status: their symptoms, function, and quality of life. Quantifying health status requires a valid, reproducible, and sensitive disease-specific measure. The Peripheral Artery Questionnaire (PAQ) is a 20-item questionnaire developed to meet this need by quantifying patients' physical limitations, symptoms, social function, treatment satisfaction, and quality of life. Psychometric and clinical properties of the PAQ were evaluated in a prospective cohort study of 44 patients undergoing elective percutaneous peripheral revascularization. To establish reproducibility, 2 assessments were performed 2 weeks apart and before revascularization. The changes in scores before and 6 weeks after revascularization were used to determine the instrument's responsiveness and were compared with the Short Form-36 and the Walking Impairment Questionnaire. A series of cross-sectional analyses were performed to establish the construct validity of the PAQ. The 7 domains of the PAQ were internally reliable, with Cronbach alpha = 0.80 to 0.94. The test-retest reliability analyses revealed insignificant mean changes of 0.6 to 2.3 points (P = not significant for all). Conversely, the change after revascularization ranged from 13.7 to 41.9 points (P ≤ .001 for all), reflecting substantial sensitivity of the PAQ to clinical improvement. The PAQ Summary Scale was the most sensitive of all scales tested. Construct validity was established by demonstrating correlations with other measures of patient health status. The PAQ is a valid, reliable, and responsive disease-specific measure for patients with peripheral arterial disease. It may prove to be a useful end point in clinical trials and a potential aid in disease management.
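Several abstracts in this listing report internal consistency as Cronbach's alpha. As a minimal sketch of how that coefficient is computed (the function name and the toy scores are hypothetical and are not data from any study listed here), alpha is k/(k − 1) × (1 − Σ item variances / variance of total scores), where k is the number of items:

```python
from statistics import variance


def cronbach_alpha(items):
    """items: one list of scores per questionnaire item.

    All lists must have the same length (one score per respondent),
    and at least two items and two respondents are assumed.
    """
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]  # total score per respondent
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical responses: 3 items answered by 5 respondents.
toy_items = [
    [3, 4, 2, 5, 4],
    [3, 5, 2, 4, 4],
    [2, 4, 3, 5, 3],
]
alpha = cronbach_alpha(toy_items)
```

When every item ranks respondents identically, the coefficient reaches 1; values fall as items disagree with one another.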
The measurement of threat orientations.
Thompson, Suzanne C; Schlehofer, Michèle M; Bovin, Michelle J
2006-01-01
To develop measures of 3 threat orientations that affect responses to health behavior messages. In Study 1, college students (N = 47) completed items assessing threat orientations and health behaviors. In Study 2, college students and community adults (N = 110) completed the threat orientation items and measures of convergent and discriminant validity. In Study 1, the control-based, denial-based, and heightened-sensitivity-based threat orientation scales demonstrated good internal consistency and correlated with engagement in health behaviors. In Study 2, the convergent and discriminant validity of the 3 measures was established. The 3 scales have good internal reliability and construct validity.
Jongen, S; Vuurman, E F P M; Ramaekers, J G; Vermeeren, A
2016-04-01
Laboratory tests assessing driving related skills can be useful as initial screening tools to assess potential drug induced impairment as part of a standardized behavioural assessment. Unfortunately, consensus about which laboratory tests should be included to reliably assess drug induced impairment has not yet been reached. The aim of the present review was to evaluate the sensitivity of laboratory tests to the dose dependent effects of alcohol, as a benchmark, on performance parameters. In total, 179 experimental studies were included. Results show that a cued go/no-go task and a divided attention test with primary tracking and secondary visual search were consistently sensitive to the impairing effects at medium and high blood alcohol concentrations. Driving performance assessed in a simulator was less sensitive to the effects of alcohol as compared to naturalistic, on-the-road driving. In conclusion, replication of results for several potentially useful tests, and their predictive validity for actual driving impairment, deserve further research. In addition, driving simulators should be validated and compared head to head to naturalistic driving in order to increase construct validity. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
Evaluation and construction of diagnostic criteria for inclusion body myositis
Mammen, Andrew L.; Amato, Anthony A.; Weiss, Michael D.; Needham, Merrilee
2014-01-01
Objective: To use patient data to evaluate and construct diagnostic criteria for inclusion body myositis (IBM), a progressive disease of skeletal muscle. Methods: The literature was reviewed to identify all previously proposed IBM diagnostic criteria. These criteria were applied through medical records review to 200 patients diagnosed as having IBM and 171 patients diagnosed as having a muscle disease other than IBM by neuromuscular specialists at 2 institutions, and to a validating set of 66 additional patients with IBM from 2 other institutions. Machine learning techniques were used for unbiased construction of diagnostic criteria. Results: Twenty-four previously proposed IBM diagnostic categories were identified. Twelve categories all performed with high (≥97%) specificity but varied substantially in their sensitivities (11%–84%). The best performing category was European Neuromuscular Centre 2013 probable (sensitivity of 84%). Specialized pathologic features and newly introduced strength criteria (comparative knee extension/hip flexion strength) performed poorly. Unbiased data-directed analysis of 20 features in 371 patients resulted in construction of higher-performing data-derived diagnostic criteria (90% sensitivity and 96% specificity). Conclusions: Published expert consensus–derived IBM diagnostic categories have uniformly high specificity but wide-ranging sensitivities. High-performing IBM diagnostic category criteria can be developed directly from principled unbiased analysis of patient data. Classification of evidence: This study provides Class II evidence that published expert consensus–derived IBM diagnostic categories accurately distinguish IBM from other muscle disease with high specificity but wide-ranging sensitivities. PMID:24975859
Mohaseb, Kam; Linder, Mark; Rootman, Jack; Wilkins, G E; Schechter, Martin T; Dolman, Peter J; Singer, Joel
2008-01-01
To construct a patient-based symptom questionnaire to facilitate early referral of thyroid-associated orbitopathy (TAO) in Graves' hyperthyroidism (GH). Phase I of our study involved developing a symptomatology-based questionnaire for the self-reporting of TAO symptoms in patients recently diagnosed with GH. Phase II involved administering the questionnaire along with a standard ophthalmic examination to a screening cohort of patients newly diagnosed with GH. Symptoms highly associated with the clinical diagnosis of TAO were used to construct a tool with the highest possible sensitivity. Phase III involved validation of this tool in a new cohort of patients recently diagnosed with GH. For each patient, the diagnosis of TAO was made by both a standardized orbital ophthalmic exam and the questionnaire. Results from the questionnaire were then compared to the clinical examination. The questionnaire was compared to the standardized examination and found to have a sensitivity of 0.76 and a specificity of 0.82 in the validation phase of the study. This questionnaire may be a useful tool in clinical practice to allow identification of patients with TAO secondary to GH. Future studies using this questionnaire are needed to determine whether earlier identification and management of these patients is associated with reduced morbidity from TAO.
Rodríguez-Martínez, Carlos E; Nino, Gustavo; Castro-Rodriguez, Jose A
2014-01-01
There is a critical need for validation studies of questionnaires designed to assess the level of control of asthma in children younger than 5 years old. To validate the Spanish version of the Test for Respiratory and Asthma Control in Kids (TRACK) questionnaire in children younger than age 5 years with symptoms consistent with asthma. In a prospective cohort validation study, parents and/or caregivers of children younger than age 5 years and with symptoms consistent with asthma, during a baseline and a follow-up visit 2 to 6 weeks later, completed the information required to assess the content validity, criterion validity, construct validity, test-retest reliability, sensitivity to change, internal consistency reliability, and usability of the TRACK questionnaire. Median (interquartile range) of the TRACK scores were significantly different between patients with well-controlled asthma, patients with not well-controlled asthma, and patients with very poorly controlled asthma (90.0 [75.0-95.0], 75.0 [55.0-85.0], and 35.0 [25.0-55.0], respectively, P < .001). TRACK scores were significantly different between patients classified as currently symptomatic and symptomatic in the recent past (42.5 [25.0-55.0] vs 85.0 [75.0-90.0]; P < .001). The intraclass correlation coefficient of the measurements was 0.755 (95% CI, 0.503-1.00). All patients whose clinical status changed showed an increase of 10 or more points in TRACK score between baseline and follow-up visits. The Cronbach α was 0.77 for the questionnaire as a whole. The Spanish version of the TRACK questionnaire has excellent sensitivity to change and usability; adequate criterion validity, construct validity, and test-retest reliability; and an acceptable internal consistency, when used in children younger than age 5 years with symptoms consistent with asthma. Copyright © 2014 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Patient safety and systematic reviews: finding papers indexed in MEDLINE, EMBASE and CINAHL.
Tanon, A A; Champagne, F; Contandriopoulos, A-P; Pomey, M-P; Vadeboncoeur, A; Nguyen, H
2010-10-01
To develop search strategies for identifying papers on patient safety in MEDLINE, EMBASE and CINAHL. Six journals were electronically searched for papers on patient safety published between 2000 and 2006. Identified papers were divided into two gold standards: one to build and the other to validate the search strategies. Candidate terms for strategy construction were identified using a word frequency analysis of titles, abstracts and keywords used to index the papers in the databases. Searches were run for each one of the selected terms independently in every database. Sensitivity, precision and specificity were calculated for each candidate term. Terms with sensitivity greater than 10% were combined to form the final strategies. The search strategies developed were run against the validation gold standard to assess their performance. A final step in the validation process was to compare the performance of each strategy to those of other strategies found in the literature. We developed strategies for all three databases that were highly sensitive (range 95%-100%), precise (range 40%-60%) and balanced (the product of sensitivity and precision being in the range of 30%-40%). The strategies were very specific and outperformed those found in the literature. The strategies we developed can meet the needs of users aiming to maximise either sensitivity or precision, or seeking a reasonable compromise between sensitivity and precision, when searching for papers on patient safety in MEDLINE, EMBASE or CINAHL.
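The term-selection procedure described in the abstract above can be sketched as follows. The toy corpus, the term names, and the decision to combine retained terms with a union (a Boolean OR) are assumptions for illustration; the abstract states only that terms with sensitivity greater than 10% were combined.

```python
def search_metrics(retrieved, relevant, corpus_size):
    """Sensitivity, precision and specificity of a search against a gold standard.

    Assumes the gold standard and the retrieved set are both non-empty.
    """
    tp = len(retrieved & relevant)
    fp = len(retrieved - relevant)
    fn = len(relevant - retrieved)
    tn = corpus_size - tp - fp - fn
    return {
        "sensitivity": tp / (tp + fn),
        "precision": tp / (tp + fp),
        "specificity": tn / (tn + fp),
    }


# Hypothetical corpus of records 0..99; records 0..19 form the gold standard.
corpus_size = 100
gold = set(range(20))

# Hypothetical candidate terms mapped to the record ids each one retrieves.
term_hits = {
    "term_a": set(range(0, 12)) | {40, 41},
    "term_b": set(range(10, 19)) | {50},
    "term_c": {19, 60, 61, 62},
}

# Keep terms whose individual sensitivity exceeds 10%, then OR them together.
kept = [
    term for term, hits in term_hits.items()
    if search_metrics(hits, gold, corpus_size)["sensitivity"] > 0.10
]
combined = set().union(*(term_hits[term] for term in kept))
final = search_metrics(combined, gold, corpus_size)
balance = final["sensitivity"] * final["precision"]  # product used as a balance measure
```

Dropping low-sensitivity terms trades a small loss in recall for fewer irrelevant records, which is the compromise between sensitivity and precision that the abstract describes.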
Validation of a measurement tool to assess awareness of breast cancer.
Linsell, Louise; Forbes, Lindsay J L; Burgess, Caroline; Kapari, Marcia; Thurnham, Angela; Ramirez, Amanda J
2010-05-01
Until now, there has been no universally accepted and validated measure of breast cancer awareness. This study aimed to validate the new Breast Cancer Awareness Measure (BCAM) which assesses, using a self-complete questionnaire, knowledge of breast cancer symptoms and age-related risk, and frequency of breast checking. We measured the psychometric properties of the BCAM in 1035 women attending the NHS Breast Screening Programme: acceptability was assessed using a feedback questionnaire (n=292); sensitivity to change after an intervention promoting breast cancer awareness (n=576), and test-retest reliability (n=167). We also assessed readability, and construct validity using the 'known-groups' method. The readability of the BCAM was high. Over 90% of women found it acceptable. The BCAM was sensitive to change: there was an increase in the proportion of women obtaining the full score for breast cancer awareness one month after receiving the intervention promoting breast cancer awareness; this was greater among those who received a more intensive version (less intensive version (booklet): 9.3%, 95% confidence interval (CI): 4.5-14.1%; more intensive version (interaction with health professional plus booklet): 30%, 95% CI: 23.4-36.6%). Test-retest reliability of the BCAM was moderate to good for most items. Cancer experts had higher levels of cancer awareness than non-medical academics (50% versus 6%, p=0.001), indicating good construct validity. The BCAM is a valid and robust measure of breast cancer awareness suitable for use in surveys of breast cancer awareness in the general population and to evaluate the impact of awareness-raising interventions. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
The international Hip Outcome Tool-33 (iHOT-33): multicenter validation and translation to Spanish.
Ruiz-Ibán, Miguel Angel; Seijas, Roberto; Sallent, Andrea; Ares, Oscar; Marín-Peña, Oliver; Muriel, Alfonso; Cuéllar, Ricardo
2015-05-20
The international Hip Outcome Tool-33 (iHOT-33) is a 33-item self-administered outcome measure based on a Visual Analogue Scale response format, designed for young and active patients with hip pathology. The aim of the present study is to translate the iHOT-33 into Spanish and validate the translation. Ninety-seven patients undergoing hip arthroscopy were included in this prospective multicenter study, performed between January 2012 and May 2014. Cross-cultural adaptation was used to translate the iHOT-33 into Spanish. Patients completed the questionnaire before and after surgery. Feasibility, reliability, internal consistency, construct validity (correlation with the Western Ontario and McMaster Universities Osteoarthritis Index, WOMAC), ceiling and floor effects, and sensitivity to change were assessed. Mean age was 48 years. Feasibility: 41.2% of patients had no blank questions, and 71.3% had completed all but one or two questions. Reliability: the ICC for the global questionnaire was 0.97, showing that the questionnaire is highly reproducible. Internal consistency: Cronbach's alpha was 0.98 for the global questionnaire. Construct validity: there was a high correlation with WOMAC (correlation coefficient >0.5). The ceiling effect (taking into account the minimum detectable change) was 12.1% and the floor effect was 21.6% for the global questionnaire. Large sensitivity to change was shown. The Spanish version of the iHOT-33 has been shown to be feasible, reliable, and sensitive to change for patients undergoing hip arthroscopy. This validated translation of the iHOT-33 allows for comparisons between studies involving either Spanish- or English-speaking patients. Prognostic study, Level I.
Shimoda, Shunsuke; Okubo, Nobutoshi; Kobayashi, Mai; Sato, Shigetaka; Kitamura, Hideya
2014-08-01
The Implicit Positive and Negative Affect Test (IPANAT) is an instrument for the indirect assessment of positive and negative affect. A Japanese version of the IPANAT was developed and its reliability and validity were examined. In Study 1, factor analysis identified two independent factors that could be interpreted as implicit positive and negative affect, which corresponded to the original version. The Japanese IPANAT also had sufficient internal consistency and acceptable test-retest reliability. In Study 2, we demonstrated that the Japanese IPANAT was associated with explicit state affect (e.g., PANAS), extraversion, and neuroticism, which indicated its adequate construct validity. In Study 3, we examined the extent to which the Japanese IPANAT was sensitive to changes in affect by assessing a set of IPANAT items after the presentation of positive, negative, or neutral photographs. The results indicated that the Japanese IPANAT was sufficiently sensitive to changes in affect resulting from affective stimuli. Taken together, these studies suggest that the Japanese version of the IPANAT is a useful instrument for the indirect assessment of positive and negative affect.
Development and validation of the brief esophageal dysphagia questionnaire.
Taft, T H; Riehl, M; Sodikoff, J B; Kahrilas, P J; Keefer, L; Doerfler, B; Pandolfino, J E
2016-12-01
Esophageal dysphagia is common in gastroenterology practice and has multiple etiologies. A complication for some patients with dysphagia is food impaction. A valid and reliable questionnaire to rapidly evaluate esophageal dysphagia and impaction symptoms can aid the gastroenterologist in gathering information to inform treatment approach and further evaluation, including endoscopy. 1638 patients participated over two study phases. 744 participants completed the Brief Esophageal Dysphagia Questionnaire (BEDQ) for phase 1; 869 completed the BEDQ, Visceral Sensitivity Index, Gastroesophageal Reflux Disease Questionnaire, and Hospital Anxiety and Depression Scale for phase 2. Demographic and clinical data were obtained via the electronic medical record. The BEDQ was evaluated for internal consistency, split-half reliability, ceiling and floor effects, and construct validity. The BEDQ demonstrated excellent internal consistency, reliability, and construct validity. The symptom frequency and severity scales scored above the standard acceptable cutoffs for reliability while the impaction subscale yielded poor internal consistency and split-half reliability; thus the impaction items were deemed qualifiers only and removed from the total score. No significant ceiling or floor effects were found with the exception of 1 item, and inter-item correlations fell within accepted ranges. Construct validity was supported by moderate yet significant correlations with other measures. The predictive ability of the BEDQ was small but significant. The BEDQ represents a rapid, reliable, and valid assessment tool for esophageal dysphagia with food impaction for clinical practice that differentiates between patients with major motor dysfunction and mechanical obstruction. © 2016 John Wiley & Sons Ltd.
Development and Validation of the Brief Esophageal Dysphagia Questionnaire
Taft, Tiffany H.; Riehl, Megan; Sodikoff, Jamie B.; Kahrilas, Peter J.; Keefer, Laurie; Doerfler, Bethany; Pandolfino, John E.
2017-01-01
Background Esophageal dysphagia is common in gastroenterology practice and has multiple etiologies. A complication for some patients with dysphagia is food impaction. A valid and reliable questionnaire to rapidly evaluate esophageal dysphagia and impaction symptoms can aid the gastroenterologist in gathering information to inform treatment approach and further evaluation, including endoscopy. Methods 1,638 patients participated over two study phases. 744 participants completed the Brief Esophageal Dysphagia Questionnaire (BEDQ) for phase 1; 869 completed the BEDQ, Visceral Sensitivity Index, Gastroesophageal Reflux Disease Questionnaire, and Hospital Anxiety and Depression Scale for phase 2. Demographic and clinical data were obtained via the electronic medical record. The BEDQ was evaluated for internal consistency, split-half reliability, ceiling and floor effects, and construct validity. Key Results The BEDQ demonstrated excellent internal consistency, reliability, and construct validity. The symptom frequency and severity scales scored above the standard acceptable cutoffs for reliability while the impaction subscale yielded poor internal consistency and split-half reliability; thus the impaction items were deemed qualifiers only and removed from the total score. No significant ceiling or floor effects were found with the exception of 1 item, and inter-item correlations fell within accepted ranges. Construct validity was supported by moderate yet significant correlations with other measures. The predictive ability of the BEDQ was small but significant. Conclusions & Inferences The BEDQ represents a rapid, reliable and valid assessment tool for esophageal dysphagia with food impaction for clinical practice that differentiates between patients with major motor dysfunction and mechanical obstruction. PMID:27380834
Development of the multiple sclerosis (MS) early mobility impairment questionnaire (EMIQ).
Ziemssen, Tjalf; Phillips, Glenn; Shah, Ruchit; Mathias, Adam; Foley, Catherine; Coon, Cheryl; Sen, Rohini; Lee, Andrew; Agarwal, Sonalee
2016-10-01
The Early Mobility Impairment Questionnaire (EMIQ) was developed to facilitate early identification of mobility impairments in multiple sclerosis (MS) patients. We describe the initial development of the EMIQ with a focus on the psychometric evaluation of the questionnaire using classical and item response theory methods. The initial 20-item EMIQ was constructed by clinical specialists and qualitatively tested among people with MS and physicians via cognitive interviews. Data from an observational study was used to make additional updates to the instrument based on exploratory factor analysis (EFA) and item response theory (IRT) analysis, and psychometric analyses were performed to evaluate the reliability and validity of the final instrument's scores and screening properties (i.e., sensitivity and specificity). Based on qualitative interview analyses, a revised 15-item EMIQ was included in the observational study. EFA, IRT and item-to-item correlation analyses revealed redundant items which were removed leading to the final nine-item EMIQ. The nine-item EMIQ performed well with respect to: test-retest reliability (ICC = 0.858); internal consistency (α = 0.893); convergent validity; and known-groups methods for construct validity. A cut-point of 41 on the 0-to-100 scale resulted in sufficient sensitivity and specificity statistics for viably identifying patients with mobility impairment. The EMIQ is a content valid and psychometrically sound instrument for capturing MS patients' experience with mobility impairments in a clinical practice setting. Additional research is suggested to further confirm the EMIQ's screening properties over time.
Jirapramukpitak, Tawanchai; Darawuttimaprakorn, Niphon; Punpuing, Sureeporn; Abas, Melanie
2009-11-01
To assess the concurrent and construct validity of the Euro-D in older Thai persons. Eight local psychiatrists used the major depressive episode section of the Mini International Neuropsychiatric Interview to interview 150 consecutive psychiatric clinic attendees. A trained interviewer administered the Euro-D. We used receiver operating characteristic (ROC) analysis to assess the overall discriminability of the Euro-D scale and principal components factor analysis to assess its construct validity. The area under the ROC curve for the Euro-D with respect to major depressive episode was 0.78 [95% confidence interval (CI) 0.70-0.90], indicating moderately good discriminability. At a cut-point of 5/6 the sensitivity for major depressive episode was 84.3%, the specificity 58.6%, and kappa 0.37 (95% CI 0.22-0.52), indicating fair concordance. However, at the 3/4 cut-point recommended by European studies, sensitivity was high (94%) but specificity poor (34%). The principal components analysis suggested four factors. The first two factors conformed to affective suffering (depression, suicidality and tearfulness) and motivation (interest, concentration and enjoyment). Sleep and appetite constituted a separate factor, whereas pessimism loaded on its own factor. Among Thai psychiatric clinic attendees the Euro-D is moderately valid for major depression. A much higher cut-point may be required than the one usually advocated. The Thai version also shares the two common factors reported in most previous studies.
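The trade-off between the two cut-points discussed in this abstract can be illustrated with a short Python sketch. It assumes that a cut-point written "5/6" means scores of 6 or more are classified as positive; the scores and diagnoses below are invented and are not Euro-D data, and `screen_stats` is a hypothetical helper name.

```python
def screen_stats(scores, has_condition, cut):
    """Treat score >= cut as screen-positive; return (sensitivity, specificity, kappa)."""
    tp = fp = fn = tn = 0
    for score, diseased in zip(scores, has_condition):
        positive = score >= cut
        if positive and diseased:
            tp += 1
        elif positive:
            fp += 1
        elif diseased:
            fn += 1
        else:
            tn += 1
    n = tp + fp + fn + tn
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    observed = (tp + tn) / n                                   # raw agreement
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    kappa = (observed - expected) / (1 - expected)             # Cohen's kappa
    return sensitivity, specificity, kappa

# Invented toy data: 10 respondents, 5 with the reference diagnosis.
scores = [2, 3, 4, 4, 5, 6, 6, 7, 8, 9]
has_condition = [False, False, False, True, False, False, True, True, True, True]

high_cut = screen_stats(scores, has_condition, cut=6)   # "5/6" cut-point
low_cut = screen_stats(scores, has_condition, cut=4)    # "3/4" cut-point
print(high_cut, low_cut)
```

Lowering the cut-point in the toy data raises sensitivity and lowers specificity, the same direction of change the abstract reports between the 5/6 and 3/4 cut-points.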
Franz, S; Schuld, C; Wilder-Smith, E P; Heutehaus, L; Lang, S; Gantz, S; Schuh-Hofer, S; Treede, R-D; Bryce, T N; Wang, H; Weidner, N
2017-11-01
Neuropathic pain (NeuP) is a frequent sequela of spinal cord injury (SCI). The SCI Pain Instrument (SCIPI) was developed as a SCI-specific NeuP screening tool. A preliminary validation reported encouraging results requiring further evaluation in terms of psychometric properties. The painDETECT questionnaire (PDQ), a commonly applied NeuP assessment tool, was primarily validated in German, but not specifically developed for SCI and not yet validated according to current diagnostic guidelines. We aimed to provide convergent construct validity and to identify the optimal item combination for the SCIPI. The PDQ was re-evaluated according to current guidelines with respect to SCI-related NeuP. Prospective monocentric study. Subjects received a neurological examination according to the International Standards for Neurological Classification of SCI. After linguistic validation of the SCIPI, the IASP-grading system served as reference to diagnose NeuP, accompanied by the PDQ after its re-evaluation as binary classifier. Statistics were evaluated through ROC-analysis, with the area under the ROC curve (AUROC) as optimality criterion. The SCIPI was refined by systematic item permutation. Eighty-eight individuals were assessed with the German SCIPI. Of 127 possible combinations, a 4-item-SCIPI (cut-off-score = 1.5/sensitivity = 0.864/specificity = 0.839) was identified as most reasonable. The SCIPI showed a strong correlation (r_sp = 0.76) with PDQ. ROC-analysis of SCIPI/PDQ (AUROC = 0.877) revealed comparable results to SCIPI/IASP (AUROC = 0.916). ROC-analysis of PDQ/IASP delivered a score threshold of 10.5 (sensitivity = 0.727/specificity = 0.903). The SCIPI is a valid easy-to-apply NeuP screening tool in SCI. The PDQ is recommended as complementary NeuP assessment tool in SCI, e.g. to monitor pain severity and/or its time-dependent course. In SCI-related pain, both SCIPI and painDETECT show strong convergent construct validity versus the current IASP-grading system. 
SCIPI is now optimized from a 7-item to an easy-to-apply 4-item screening tool in German and English. We provided evidence that the scope for painDETECT can be expanded to individuals with SCI. © 2017 European Pain Federation - EFIC®.
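The figure of 127 possible combinations matches the number of non-empty subsets of 7 items (2^7 - 1 = 127). The Python sketch below enumerates those subsets and ranks them by AUROC. It is an illustration under assumptions rather than the authors' procedure: the abstract does not say how a subset was scored or how "most reasonable" was judged, so the sketch simply sums the selected items and picks the highest AUROC, and the binary responses are invented.

```python
from itertools import combinations

def auroc(scores, labels):
    """Probability that a random positive case outscores a random negative one (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Invented toy data: 6 respondents x 7 yes/no items; labels are the reference diagnosis.
responses = [
    [1, 1, 0, 1, 0, 1, 0],
    [1, 0, 1, 1, 0, 0, 1],
    [0, 1, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 1, 0, 1],
    [0, 0, 1, 0, 0, 1, 0],
    [1, 0, 0, 0, 0, 0, 0],
]
labels = [1, 1, 1, 0, 0, 0]

n_items = len(responses[0])
subsets = [c for size in range(1, n_items + 1)
           for c in combinations(range(n_items), size)]

def subset_auroc(subset):
    # Score each respondent as the sum of the selected items (an assumption).
    return auroc([sum(row[i] for i in subset) for row in responses], labels)

best = max(subsets, key=subset_auroc)
best_auc = subset_auroc(best)
print(len(subsets), best, best_auc)
```

With only six invented respondents the search is trivially overfitted; in practice a subset chosen this way would need checking on independent data.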
Depletion sensitivity predicts unhealthy snack purchases.
Salmon, Stefanie J; Adriaanse, Marieke A; Fennis, Bob M; De Vet, Emely; De Ridder, Denise T D
2016-01-01
The aim of the present research is to examine the relation between depletion sensitivity - a novel construct referring to the speed or ease by which one's self-control resources are drained - and snack purchase behavior. In addition, interactions between depletion sensitivity and the goal to lose weight on snack purchase behavior were explored. Participants included in the study were instructed to report every snack they bought over the course of one week. The dependent variables were the number of healthy and unhealthy snacks purchased. The results of the present study demonstrate that depletion sensitivity predicts the amount of unhealthy (but not healthy) snacks bought. The more sensitive people are to depletion, the more unhealthy snacks they buy. Moreover, there was some tentative evidence that this relation is more pronounced for people with a weak as opposed to a strong goal to lose weight, suggesting that a strong goal to lose weight may function as a motivational buffer against self-control failures. All in all, these findings provide evidence for the external validity of depletion sensitivity and the relevance of this construct in the domain of eating behavior. Copyright © 2015 Elsevier Ltd. All rights reserved.
Delaney, Aogán; Tamás, Peter A; Crane, Todd A; Chesterman, Sabrina
2016-01-01
There is increasing interest in using systematic review to synthesize evidence on the social and environmental effects of and adaptations to climate change. Use of systematic review for evidence in this field is complicated by the heterogeneity of methods used and by uneven reporting. In order to facilitate synthesis of results and design of subsequent research a method, construct-centered methods aggregation, was designed to 1) provide a transparent, valid and reliable description of research methods, 2) support comparability of primary studies and 3) contribute to a shared empirical basis for improving research practice. Rather than taking research reports at face value, research designs are reviewed through inductive analysis. This involves bottom-up identification of constructs, definitions and operationalizations; assessment of concepts' commensurability through comparison of definitions; identification of theoretical frameworks through patterns of construct use; and integration of transparently reported and valid operationalizations into ideal-type research frameworks. Through the integration of reliable bottom-up inductive coding from operationalizations and top-down coding driven from stated theory with expert interpretation, construct-centered methods aggregation enabled both resolution of heterogeneity within identically named constructs and merging of differently labeled but identical constructs. These two processes allowed transparent, rigorous and contextually sensitive synthesis of the research presented in an uneven set of reports undertaken in a heterogenous field. If adopted more broadly, construct-centered methods aggregation may contribute to the emergence of a valid, empirically-grounded description of methods used in primary research. 
These descriptions may function as a set of expectations that improves the transparency of reporting and as an evolving comprehensive framework that supports both interpretation of existing and design of future research.
Crane, Todd A.; Chesterman, Sabrina
2016-01-01
There is increasing interest in using systematic review to synthesize evidence on the social and environmental effects of and adaptations to climate change. Use of systematic review for evidence in this field is complicated by the heterogeneity of methods used and by uneven reporting. In order to facilitate synthesis of results and design of subsequent research a method, construct-centered methods aggregation, was designed to 1) provide a transparent, valid and reliable description of research methods, 2) support comparability of primary studies and 3) contribute to a shared empirical basis for improving research practice. Rather than taking research reports at face value, research designs are reviewed through inductive analysis. This involves bottom-up identification of constructs, definitions and operationalizations; assessment of concepts’ commensurability through comparison of definitions; identification of theoretical frameworks through patterns of construct use; and integration of transparently reported and valid operationalizations into ideal-type research frameworks. Through the integration of reliable bottom-up inductive coding from operationalizations and top-down coding driven from stated theory with expert interpretation, construct-centered methods aggregation enabled both resolution of heterogeneity within identically named constructs and merging of differently labeled but identical constructs. These two processes allowed transparent, rigorous and contextually sensitive synthesis of the research presented in an uneven set of reports undertaken in a heterogenous field. If adopted more broadly, construct-centered methods aggregation may contribute to the emergence of a valid, empirically-grounded description of methods used in primary research. 
These descriptions may function as a set of expectations that improves the transparency of reporting and as an evolving comprehensive framework that supports both interpretation of existing and design of future research. PMID:26901409
Hughes, Patricia Paulsen; Sherrill, Claudine; Myers, Bettye; Rowe, Nancy; Marshall, David
2003-06-01
Martial arts and self-defense programs train fearful people, especially women, to be more competent and confident to defend themselves in dangerous situations. However, there are no validated instruments to evaluate the effectiveness of programs purporting to teach self-protection. The Perceptions of Dangerous Situations Scale (PDSS), composed of fear, likelihood and confidence subscales, was developed and validated for university women. Participants were 368 university women, ages 17 to 45 years (M age = 20.7 years). Content validity of the PDSS was established through an expert panel, and construct validity was established through principal components analysis and determination of instructional sensitivity. Reliability was established through alpha coefficients. The PDSS, when used with university women, offers promising measurement opportunities in self-defense and martial arts settings.
Au, Raymond Wing Cheong; Tam, Peter Wai Chung; Tam, Gladys Wai Chi; Ungvari, Gabor Sander
2005-01-01
The study validated a culturally sensitive community living skills rating scale for Chinese patients by adapting the St. Louis Inventory of Community Living Skills (SLICLS). The Chinese version (SLICLS-C) was produced by forward and backward translation. An expert panel evaluated its content validity. Its internal consistency, inter-rater reliability, construct and concurrent validity were tested on 80 DSM-IV schizophrenia inpatients in a long-term facility. For predictive validity, the above sample was extended to ensure at least 20 subjects discharged to each of three levels of community care were included in the study sample. The SLICLS-C was psychometrically sound and could be used for predicting level of community care, program evaluation and measuring outcome.
A Short Measure of the Revised Reinforcement Sensitivity Theory - RSQ17.
Čolović, Petar; Smederevac, Snežana; Oljača, Milan; Nikolašević, Željka; Mitrović, Dušanka
2018-04-03
The need for short, reliable, and valid personality assessment tools for research and practice prompts researchers to create shortened versions of original instruments. The Reinforcement Sensitivity Questionnaire (RSQ) was created in line with some basic premises of the revised Reinforcement Sensitivity Theory, which proposes three motivational and emotional systems: the Behavioral Inhibition System (BIS), responsible for scanning the environment for potential threats; the Behavioral Activation System (BAS), responsible for approach behavior; and the Fight/Flight/Freeze System (FFFS), responsible for behavior in the presence of threat. The RSQ comprises five scales: BIS, BAS, Fight, Flight, and Freeze. The aim of this study was to develop a short version of the RSQ that would be useful for both research and practical purposes. Item response theory analyses were used for item selection. The study comprised two samples of participants: Sample 1 (N = 837, 34.6% male, aged 18-82, M = 31.63, SD = 13.54) served as the derivation sample, while Sample 2 (N = 818, 43.6% male, aged 18-75, M = 29.65, SD = 12.52) served as the validation sample. Factorial validity of the short RSQ was examined in both Sample 1 and Sample 2. Convergent and divergent validity of the short RSQ were examined using the RST-PQ, Jackson-5, BIS/BAS scales, and Big Five Inventory. The results point to satisfactory internal consistency, factorial validity, and construct validity of the short RSQ, suggesting that it is an adequate measure for research settings or other contexts that require short personality questionnaires.
Fergus, Thomas A; Valentiner, David P
2009-08-01
The present study examined the utility of the Illness Attitudes Scale (IAS; [Kellner, R. (1986). Somatization and hypochondriasis. New York: Praeger Publishers]) in a non-clinical college sample (N=235). Relationships among five recently identified IAS dimensions (fear of illness and pain, symptom effects, treatment experience, disease conviction, and health habits) and self-report measures of several anxiety-related constructs (health anxiety, body vigilance, intolerance of uncertainty, anxiety sensitivity, and non-specific anxiety symptoms) were examined. In addition, this study investigated the incremental validity of the IAS dimensions in predicting medical utilization. The fear of illness and pain dimension and the symptom effects dimension consistently shared stronger relations with the anxiety-related constructs than the other three IAS dimensions did. The symptom effects dimension, the disease conviction dimension, and the health habits dimension showed incremental validity over the anxiety-related constructs in predicting medical utilization. Implications for the IAS and future conceptualizations of hypochondriasis (HC) are discussed.
Aguilar-Raab, Corina; Grevenstein, Dennis; Gotthardt, Linda; Jarczok, Marc N; Hunger, Christina; Ditzen, Beate; Schweitzer, Jochen
2018-06-01
We examine the sensitivity to change in the Evaluation of Social Systems (EVOS) scale, which assesses relationship quality and collective efficacy. In Study 1 we conducted a waitlist-control, short-term couple therapy RCT study (N = 43 couples) with five systemic therapy sessions treating communication and partnership problems; our intent was to provide high external validity. Construct validity of EVOS was assessed by comparison with additionally applied scales (Family Scales; Outcome Questionnaire, OQ-45.2). In Study 2, N = 332 individuals completed an experiment with high internal validity in order to verify sensitivity to change in three different social contexts. Results from Study 1 revealed a significant increase in relationship quality in the treatment group directly after treatment, as compared to the control group. Sensitivity to change was slightly better for EVOS than for other measures. While this positive change could not be fully sustained between posttreatment and a 4-week follow-up, EVOS score did not fall below baseline and pretreatment levels, supporting moderate-to-large sensitivity to change. Study 2 supported high sensitivity to change in EVOS for couple relations, family relations, and work-team relationships. Therefore, EVOS can be used as an outcome measure to monitor the process of systemic interventions focusing on relationship quality and collective efficacy. Due to its sensitivity to change, EVOS can provide evidence for treatment success with regard to relationship aspects. © 2017 Family Process Institute.
Validation of the Spanish version of the Hip Outcome Score: a multicenter study.
Seijas, Roberto; Sallent, Andrea; Ruiz-Ibán, Miguel Angel; Ares, Oscar; Marín-Peña, Oliver; Cuéllar, Ricardo; Muriel, Alfonso
2014-05-13
The Hip Outcome Score (HOS) is a self-reported questionnaire evaluating the outcomes of treatment interventions for hip pathologies, divided into 19 activities of daily living (ADL) items and 9 sports items. The aim of the present study is to translate the HOS into Spanish and validate the translation. A prospective multicenter study with 100 patients undergoing hip arthroscopy was performed between June 2012 and January 2013. Cross-cultural adaptation was used to translate the HOS into Spanish. Patients completed the questionnaire before and after surgery. Feasibility, reliability, internal consistency, construct validity (correlation with the Western Ontario and McMaster Universities Osteoarthritis Index, WOMAC), ceiling and floor effects, and sensitivity to change were assessed. Mean age was 45.05 years. Thirty-six women and 64 men were included. Feasibility: 13% had at least one missing item within the ADL subscale and 17% within the sports subscale. Reliability: the translated version of the HOS was highly reproducible, with an intraclass correlation coefficient of 0.95 for the ADL subscale and 0.94 for the sports subscale. Internal consistency was confirmed with Cronbach's alpha >0.90 in both subscales. Construct validity showed a statistically significant correlation with WOMAC. A ceiling effect was observed in 6% and 12% for the ADL and sports subscales, respectively. A floor effect was found in 3% and 37% for the ADL and sports subscales, respectively. Large sensitivity to change was shown in both subscales. The Spanish translation of the HOS has been shown to be feasible, reliable, and sensitive to change for patients undergoing hip arthroscopy. This validated translation of the HOS allows for comparisons between studies involving either Spanish- or English-speaking patients. Prognostic study, Level I.
Burns, Ted M.; Conaway, Mark; Sanders, Donald B.
2010-01-01
Objective: To study the concurrent and construct validity and test-retest reliability in the practice setting of an outcome measure for myasthenia gravis (MG). Methods: Eleven centers participated in the validation study of the Myasthenia Gravis Composite (MGC) scale. Patients with MG were evaluated at 2 consecutive visits. Concurrent and construct validities of the MGC were assessed by evaluating MGC scores in the context of other MG-specific outcome measures. We used numerous potential indicators of clinical improvement to assess the sensitivity and specificity of the MGC for detecting clinical improvement. Test-retest reliability was performed on patients at the University of Virginia. Results: A total of 175 patients with MG were enrolled at 11 sites from July 1, 2008, to January 31, 2009. A total of 151 patients were seen in follow-up. Total MGC scores showed excellent concurrent validity with other MG-specific scales. Analyses of sensitivities and specificities of the MGC revealed that a 3-point improvement in total MGC score was optimal for signifying clinical improvement. A 3-point improvement in the MGC also appears to represent a meaningful improvement to most patients, as indicated by improved 15-item myasthenia gravis quality of life scale (MG-QOL15) scores. The psychometric properties were no better for an individualized subscore made up of the 2 functional domains that the patient identified as most important to treat. The test-retest reliability coefficient of the MGC was 98%, with a lower 95% confidence interval of 97%, indicating excellent test-retest reliability. Conclusions: The Myasthenia Gravis Composite is a reliable and valid instrument for measuring clinical status of patients with myasthenia gravis in the practice setting and in clinical trials. PMID:20439845
Schoemaker, Marina M; Niemeijer, Anuschka S; Flapper, Boudien C T; Smits-Engelsman, Bouwien C M
2012-04-01
The aim of this study was to investigate the validity and reliability of the Movement Assessment Battery for Children-2 Checklist (MABC-2). Teachers completed the Checklist for 383 children (age range 5-8y; mean age 6y 9mo; 190 males; 193 females) and the parents of 130 of these children completed the Developmental Coordination Disorder Questionnaire 2007 (DCDQ'07). All children were assessed with the MABC-2 Test. The internal consistency of the 30 items of the Checklist was determined to measure reliability. Construct validity was investigated using factor analysis and discriminative validity was assessed by comparing the scores of children with and without movement difficulties. Concurrent validity was measured by calculating correlations between the Checklist, Test, and the DCDQ'07. Incremental validity was assessed to determine whether the Checklist was a better predictor of motor impairment than the DCDQ'07. Sensitivity and specificity were investigated using the MABC-2 Test as reference standard (cut-off 15th centile). The Checklist items measure the same construct. Six factors were obtained after factor analysis. This implies that a broad range of functional activities can be assessed with the Checklist, which renders the Checklist useful for assessing criterion B of the diagnostic criteria for DCD. The mean Checklist scores for children with and without motor impairments significantly differed (p<0.001). The scores for the Checklist/Test and DCDQ'07 were significantly correlated (r(S) = -0.38 and p<0.001, and r(S) = -0.36 and p<0.001, respectively). The Checklist better predicted motor impairment than the DCDQ'07. Overall, the sensitivity was low (41%) and the specificity was acceptable (88%). The Checklist meets standards for validity and reliability. © The Authors. Developmental Medicine & Child Neurology © 2012 Mac Keith Press.
Clerici, Francesca; Ghiretti, Roberta; Di Pucchio, Alessandra; Pomati, Simone; Cucumo, Valentina; Marcone, Alessandra; Vanacore, Nicola; Mariani, Claudio; Cappa, Stefano Francesco
2017-06-01
The Free and Cued Selective Reminding Test (FCSRT) is the memory test recommended by the International Working Group on Alzheimer's disease (AD) for the detection of amnestic syndrome of the medial temporal type in prodromal AD. Assessing the construct validity and internal consistency of the Italian version of the FCSRT is thus crucial. The FCSRT was administered to 338 community-dwelling participants with memory complaints (57% females, age 74.5 ± 7.7 years), including 34 with AD, 203 with Mild Cognitive Impairment, and 101 with Subjective Memory Impairment. Internal Consistency was estimated using Cronbach's alpha coefficient. To assess convergent validity, five FCSRT scores (Immediate Free Recall, Immediate Total Recall, Delayed Free Recall, Delayed Total Recall, and Index of Sensitivity of Cueing) were correlated with three well-validated memory tests: Story Recall, Rey Auditory Verbal Learning test, and Rey Complex Figure (RCF) recall (partial correlation analysis). To assess divergent validity, a principal component analysis (an exploratory factor analysis) was performed including, in addition to the above-mentioned memory tasks, the following tests: Word Fluencies, RCF copy, Clock Drawing Test, Trail Making Test, Frontal Assessment Battery, Raven Coloured Progressive Matrices, and Stroop Colour-Word Test. Cronbach's alpha coefficients for immediate recalls (IFR and ITR) and delayed recalls (DFR and DTR) were, respectively, .84 and .81. All FCSRT scores were highly correlated with those of the three well-validated memory tests. The factor analysis showed that the FCSRT does not load on the factors saturated by non-memory tests. These findings indicate that the FCSRT has a good internal consistency and has an excellent construct validity as an episodic memory measure. © 2015 The British Psychological Society.
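The internal-consistency estimate mentioned above can be sketched in a few lines of Python. This is a minimal illustration of the standard Cronbach's alpha formula, not the authors' analysis; the function name `cronbach_alpha` and the score table are hypothetical and are not FCSRT data.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for a respondents-by-items score table."""
    k = len(scores[0])  # number of items
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical scores: 5 respondents x 4 items (illustration only).
toy = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 4, 3],
]
print(round(cronbach_alpha(toy), 2))
```

Alpha approaches 1 when items rise and fall together across respondents, which is why it is read as a measure of internal consistency.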
A virtual reality test battery for assessment and screening of spatial neglect.
Fordell, H; Bodin, K; Bucht, G; Malm, J
2011-03-01
There is a need for improved screening methods for spatial neglect. To construct a VR-test battery and evaluate its accuracy and usability in patients with acute stroke. VR-DiSTRO consists of a standard desktop computer, a CRT monitor and eye shutter stereoscopic glasses, a force feedback interface, and software, developed to create an interactive and immersive 3D experience. VR-tests were developed and validated to the conventional Star Cancellation test, Line bisection, Baking Tray Task (BTT), and Visual Extinction test. A construct validation to The Rivermead Behavioral Inattention Test, used as criterion of visuospatial neglect, was made. Usability was assessed according to ISO 9241-11. Thirty-one patients with stroke were included, 9/31 patients had neglect. The sensitivity was 100% and the specificity 82% for the VR-DiSTRO to correctly identify neglect. VR-BTT and VR-Extinction had the highest correlation (r² = 0.64 and 0.78), as well as high sensitivity and specificity. The kappa values describing the agreement between traditional neglect tests and the corresponding virtual reality test were between 0.47-0.85. Usability was assessed by a questionnaire; 77% reported that the VR-DiSTRO was 'easy' to use. Eighty-eight percent reported that they felt 'focused', 'pleased' or 'alert'. No patient had adverse symptoms. The test session took 15 min. The VR-DiSTRO quickly and with a high accuracy identified visuospatial neglect in patients with stroke in this construct validation. The usability among elderly patients with stroke was high. This VR-test battery has the potential to become an important screening instrument for neglect and a valuable adjunct to the neuropsychological assessment. © 2010 John Wiley & Sons A/S.
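As a worked illustration of the accuracy figures above: 9 of 31 patients had neglect, so 22 did not. The abstract does not report how those 22 split into true negatives and false positives; 18 and 4 is the only integer split that rounds to the stated 82% specificity, so it is used here as an inferred assumption, as is zero false negatives for the stated 100% sensitivity. A minimal Python sketch:

```python
def sensitivity(tp: int, fn: int) -> float:
    """Proportion of patients with the condition that the test flags."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Proportion of patients without the condition that the test clears."""
    return tn / (tn + fp)


# Counts inferred from the abstract, not reported in it:
# 9 patients with neglect, all detected; 22 without, assumed split 18/4.
tp, fn, tn, fp = 9, 0, 18, 4

print(f"sensitivity = {sensitivity(tp, fn):.0%}")
print(f"specificity = {specificity(tn, fp):.0%}")
```

With a sample this small, a single reclassified patient shifts specificity by about 4.5 percentage points, which is worth keeping in mind when reading the headline figures.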
Chambers, David W
2011-01-01
One of the most extensively studied constructs in dental education is the four-component model of moral behavior proposed by James Rest and the set of instruments for measuring it developed by Rest, Muriel Bebeau, and others. Although significant associations have been identified between the four components Rest proposed (called here Moral Sensitivity, Moral Reasoning, Moral Integrity, and Moral Courage) and dental ethics courses and practitioners with disciplined licenses, there is no single instrument that measures all four components, and existing single component instruments require professional scoring. This article describes the development and validation of a short, self-scoring instrument, the Moral Skills Inventory, that measures all four components. Evidence of face validity, test/retest reliability, and concurrent convergent and divergent predictive validity are demonstrated in three populations: dental students, clinical dental faculty members, and regents and officers of the American College of Dentists. Significant issues remain in developing the Rest four-component model for use in dental education and practice. Specifically, further construct validation research is needed to understand the nature of the components. In particular, it remains undetermined whether moral constructs are characteristics of individuals that drive behavior in specific situations or whether particular patterns of moral behavior learned and used in response to individual circumstances are summarized by researchers and then imputed to practitioners.
Alyusuf, Raja H; Prasad, Kameshwar; Abdel Satir, Ali M; Abalkhail, Ali A; Arora, Roopa K
2013-01-01
The exponential growth in use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. The aim of this study is to develop and to validate a tool that evaluates the quality of undergraduate medical educational websites, and to apply it to the field of pathology. A tool was devised through several steps of item generation, reduction, weightage, pilot testing, post-pilot modification of the tool and validating the tool. Tool validation included measurement of inter-observer reliability; and generation of criterion related, construct related and content related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Reliability testing showed a high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ = 0.75. It showed high criterion related, construct related and content related validity. The tool showed moderately high concordance with the gold standard (κ = 0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites.
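Predictive values depend on how common the target condition is in the validation sample, which the abstract does not state. The Python sketch below applies Bayes' rule to the reported sensitivity and specificity; the 52% prevalence is an assumption, back-solved so that the result matches the reported predictive values, and the function name `predictive_values` is hypothetical.

```python
def predictive_values(sens: float, spec: float, prevalence: float) -> tuple[float, float]:
    """Return (PPV, NPV) from sensitivity, specificity and prevalence."""
    tp = sens * prevalence              # true-positive fraction
    fp = (1 - spec) * (1 - prevalence)  # false-positive fraction
    tn = spec * (1 - prevalence)        # true-negative fraction
    fn = (1 - sens) * prevalence        # false-negative fraction
    return tp / (tp + fp), tn / (tn + fn)


# Sensitivity and specificity are from the abstract; the 52% prevalence
# is an assumption back-solved from the reported predictive values.
ppv, npv = predictive_values(0.922, 0.678, 0.52)
print(f"PPV = {ppv:.1%}, NPV = {npv:.1%}")
```

Under that assumed prevalence the sketch returns values that round to the reported 75.6% and 88.9%; at a different prevalence the same tool would give different predictive values.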
Pololi, Linda H; Evans, Arthur T; Nickell, Leslie; Reboli, Annette C; Coplit, Lisa D; Stuber, Margaret L; Vasiliou, Vasilia; Civian, Janet T; Brennan, Robert T
2017-06-01
A practical, reliable, and valid instrument is needed to measure the impact of the learning environment on medical students' well-being and educational experience and to meet medical school accreditation requirements. From 2012 to 2015, medical students were surveyed at the end of their first, second, and third year of studies at four medical schools. The survey assessed students' perceptions of the following nine dimensions of the school culture: vitality, self-efficacy, institutional support, relationships/inclusion, values alignment, ethical/moral distress, work-life integration, gender equity, and ethnic minority equity. The internal reliability of each of the nine dimensions was measured. Construct validity was evaluated by assessing relationships predicted by our conceptual model and prior research. Assessment was made of whether the measurements were sensitive to differences over time and across institutions. Six hundred and eighty-six students completed the survey (49 % women; 9 % underrepresented minorities), with a response rate of 89 % (range over the student cohorts 72-100 %). Internal consistency of each dimension was high (Cronbach's α 0.71-0.86). The instrument was able to detect significant differences in the learning environment across institutions and over time. Construct validity was supported by demonstrating several relationships predicted by our conceptual model. The C-Change Medical Student Survey is a practical, reliable, and valid instrument for assessing the learning environment of medical students. Because it is sensitive to changes over time and differences across institution, results could potentially be used to facilitate and monitor improvements in the learning environment of medical students.
Jackson, Howard F; Tunstall, Victoria; Hague, Gemma; Daniels, Leanne; Crompton, Stacey; Taplin, Kimberly
2014-01-01
Jackson et al. (this edition) argue that structure is an important component in reducing the handicaps caused by cognitive impairments following acquired brain injury and that post-acute neuropsychological brain injury rehabilitation programmes should not only endeavour to provide structure but also aim to develop self-structuring. However, at present there is no standardized device for assessing self-structuring. To provide preliminary analysis of the psychometric properties of the Behavioural Assessment of Self-Structuring (BASS) staff rating scale (a 26-item informant five-point rating scale based on the degree of support the client requires to achieve each self-structuring item). BASS data were utilised for clients attending residential rehabilitation. Reliability (inter-rater and intra-rater), validity (construct, concurrent and discriminant) and sensitivity to change were investigated. Initial results indicate that the BASS has reasonably good reliability, good construct validity (via principal components analysis), good discriminant validity, and good concurrent validity correlating well with a number of other outcome measures (HoNOS; NPDS, Supervision Rating Scale, MPAI, FIM and FAM). The BASS did not correlate well with the NPCNA. Finally, the BASS was shown to demonstrate sensitivity to change. Although some caution is required in drawing firm conclusions at the present time and further exploration of the psychometric properties of the BASS is required, initial results are encouraging for the use of the BASS in assessing rehabilitation progress. These findings are discussed in terms of the value of the concept of self-structuring to the rehabilitation process for individuals with neuropsychological impairments consequent on acquired brain injury.
A tool to assess sex-gender when selecting health research projects.
Tomás, Concepción; Yago, Teresa; Eguiluz, Mercedes; Samitier, M A Luisa; Oliveros, Teresa; Palacios, Gemma
2015-04-01
To validate the questionnaire "Gender Perspective in Health Research" (GPIHR) to assess the inclusion of gender perspective in research projects. Validation study in two stages. Feasibility was analysed in the first, and reliability, internal consistency and validity in the second. Aragón Institute of Health Science, Aragón, Spain. GPIHR was applied to 118 research projects funded in national and international competitive tenders from 2003 to 2012. Analysis of inter- and intra-observer reliability with Kappa index and internal consistency with Cronbach's alpha. Content validity analysed through literature review and construct validity with an exploratory factor analysis. Validated GPIHR has 10 questions: 3 in the introduction, 1 for objectives, 3 for methodology and 3 for research purpose. Average time of application was 13 min. Inter-observer reliability (Kappa) varied between 0.35 and 0.94 and intra-observer between 0.40 and 0.94. Theoretical construct is supported in the literature. Factor analysis identifies three levels of GP inclusion: "difference by sex", "gender sensitive" and "feminist research" with an internal consistency of 0.64, 0.87 and 0.81, respectively, which explain 74.78% of variance. GPIHR questionnaire is a valid tool to assess GP and useful for those researchers who would like to include GP in their projects. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.
Reliability and validity of procedure-based assessments in otolaryngology training.
Awad, Zaid; Hayden, Lindsay; Robson, Andrew K; Muthuswamy, Keerthini; Tolley, Neil S
2015-06-01
To investigate the reliability and construct validity of procedure-based assessment (PBA) in assessing performance and progress in otolaryngology training. Retrospective database analysis using a national electronic database. We analyzed PBAs of otolaryngology trainees in North London from core trainees (CTs) to specialty trainees (STs). The tool contains six multi-item domains: consent, planning, preparation, exposure/closure, technique, and postoperative care, rated as "satisfactory" or "development required," in addition to an overall performance rating (pS) of 1 to 4. Individual domain score, overall calculated score (cS), and number of "development-required" items were calculated for each PBA. Receiver operating characteristic analysis helped determine sensitivity and specificity. There were 3,152 otolaryngology PBAs from 46 otolaryngology trainees analyzed. PBA reliability was high (Cronbach's α 0.899), and sensitivity approached 99%. cS correlated positively with pS and level in training (rs: +0.681 and +0.324, respectively). ST had higher cS and pS than CT (93% ± 0.6 and 3.2 ± 0.03 vs. 71% ± 3.1 and 2.3 ± 0.08, respectively; P < .001). cS and pS increased from CT1 to ST8 showing construct validity (rs: +0.348 and +0.354, respectively; P < .001). The technical skill domain had the highest utilization (98% of PBAs) and was the best predictor of cS and pS (rs: +0.96 and +0.66, respectively). PBA is reliable and valid for assessing otolaryngology trainees' performance and progress at all levels. It is highly sensitive in identifying competent trainees. The tool is used in a formative and feedback capacity. The technical domain is the best predictor and should be given close attention. © 2014 The American Laryngological, Rhinological and Otological Society, Inc.
ERIC Educational Resources Information Center
Jenkins-Guarnieri, Michael A.; Vaughan, Angela L.; Wright, Stephen L.
2015-01-01
We adapted a work self-determination measure to create the Basic Needs Satisfaction at College Scale. Confirmatory factor analysis and item response theory analyses with data from 525 adults supported a 3-factor model with 13 items most sensitive for lower to middle range levels of the autonomy, competence, and relatedness constructs.
ERIC Educational Resources Information Center
Al Sadi, Fatma H.; Basit, Tehmina N.
2017-01-01
The vignettes approach has emerged as a popular tool in quantitative and qualitative research. It has proven to be particularly effective in measuring sensitive topics. This paper focuses on the construction and validation process of questionnaire-based vignettes, which were used as an instrument to examine Omani secondary school girls' cultural…
Lubans, David R; Smith, Jordan J; Harries, Simon K; Barnett, Lisa M; Faigenbaum, Avery D
2014-05-01
The aim of this study was to describe the development and assess test-retest reliability and construct validity of the Resistance Training Skills Battery (RTSB) for adolescents. The RTSB provides an assessment of resistance training skill competency and includes 6 exercises (i.e., body weight squat, push-up, lunge, suspended row, standing overhead press, and front support with chest touches). Scoring for each skill is based on the number of performance criteria successfully demonstrated. An overall resistance training skill quotient (RTSQ) is created by adding participants' scores for the 6 skills. Participants (44 boys and 19 girls, mean age = 14.5 ± 1.2 years) completed the RTSB on 2 occasions separated by 7 days. Participants also completed the following fitness tests, which were used to create a muscular fitness score (MFS): handgrip strength, timed push-up, and standing long jump tests. Intraclass correlation (ICC), paired samples t-tests, and typical error were used to assess test-retest reliability. To assess construct validity, gender and RTSQ were entered into a regression model predicting MFS. The rank order repeatability of the RTSQ was high (ICC = 0.88). The model explained 39% of the variance in MFS (p ≤ 0.001) and RTSQ (r = 0.40, p ≤ 0.001) was a significant predictor. This study has demonstrated the construct validity and test-retest reliability of the RTSB in a sample of adolescents. The RTSB can reliably rank participants in regards to their resistance training competency and has the necessary sensitivity to detect small changes in resistance training skill proficiency.
Bashyam, Ashvin; Li, Matthew; Cima, Michael J
2018-07-01
Single-sided NMR has the potential for broad utility and has found applications in healthcare, materials analysis, food quality assurance, and the oil and gas industry. These sensors require a remote, strong, uniform magnetic field to perform high sensitivity measurements. We demonstrate a new permanent magnet geometry, the Unilateral Linear Halbach, that combines design principles from "sweet-spot" and linear Halbach magnets to achieve this goal through more efficient use of magnetic flux. We perform sensitivity analysis using numerical simulations to produce a framework for Unilateral Linear Halbach design and assess tradeoffs between design parameters. Additionally, the use of hundreds of small, discrete magnets within the assembly allows for a tunable design, improved robustness to variability in magnetization strength, and increased safety during construction. Experimental validation using a prototype magnet shows close agreement with the simulated magnetic field. The Unilateral Linear Halbach magnet increases the sensitivity, portability, and versatility of single-sided NMR. Copyright © 2018 Elsevier Inc. All rights reserved.
Validity of the French form of the Somatosensory Amplification Scale in a Non-Clinical Sample
Bridou, Morgiane; Aguerre, Colette
2013-01-01
The SomatoSensory Amplification Scale (SSAS) is a 10-item self-report instrument designed to assess the tendency to detect somatic and visceral sensations and experience them as unusually intense, toxic and alarming. This study examines the psychometric properties of a French version of the SSAS in a non-clinical population and, more specifically, explores its construct, convergent and discriminant validities. The SSAS was completed by 375 university students, together with measures of somatization propensity (SCL-90-R somatization subscale) and trait anxiety (STAI Y form). The results of principal component and confirmatory factor analyses suggest that the French version of the SSAS evaluates essentially a single, robust factor (Somatosensory amplification) and two kinds of somatic sensitivity (Exteroceptive sensitivity and Interoceptive sensitivity). Somatosensory amplification correlated with somatization tendency and anxiety propensity. These results encourage further investigations in French of the determinants and consequences of somatosensory amplification, and its use as a therapeutic strategy. PMID:26973888
Uchoa, Priscila Regina Candido Espinola; Bezerra, Thiago Freire Pinto; Lima, Élcio Duarte; Fornazieri, Marco Aurélio; Pinna, Fabio de Rezende; Sperandio, Fabiana de Araújo; Voegels, Richard Louis
The concept of quality of life is subjective and variably defined, and depends on the individual's perception of their state of health. Quality of life questionnaires are instruments designed to measure quality of life, but most are developed in a language other than Portuguese. Questionnaires can identify the most important symptoms, help focus the consultation, and assist in defining the goals of treatment. Some of these have been validated for the Portuguese language, but none in children. To translate, cross-culturally adapt, and validate the Sinus and Nasal Quality of Life Survey (SN-5) in Portuguese. Prospective study of children aged 2-12 years with sinonasal symptoms of over 30 days. The study comprised two stages: (I) translation and cross-cultural adaptation of the SN-5 into Portuguese (SN-5p); and (II) validation of the SN-5p. Statistical analysis was performed to assess internal consistency, test-retest reliability, and sensitivity, as well as construct and discriminant validity and standardization. The SN-5 was translated and adapted into Portuguese (SN-5p) and the author of the original version approved the process. Validation was carried out by administration of the SN-5p to 51 pediatric patients with sinonasal complaints (mean age, 5.8±2.5 years; range, 2-12 years). The questionnaire exhibited adequate construct validity (0.62, p<0.01), internal consistency (Cronbach's alpha=0.73), and discriminant validity (p<0.01), as well as good test-retest reproducibility (Goodman-Kruskal gamma=0.957, p<0.001), good correlation with a visual analog scale (r=0.62, p<0.01), and sensitivity to change. This study reports the successful translation and cross-cultural adaptation of the SN-5 instrument into Brazilian Portuguese. The translated version exhibited adequate psychometric properties for assessment of disease-specific quality of life in pediatric patients with sinonasal complaints.
Copyright © 2016 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
Deep Sequencing of Urinary RNAs for Bladder Cancer Molecular Diagnostics.
Sin, Mandy L Y; Mach, Kathleen E; Sinha, Rahul; Wu, Fan; Trivedi, Dharati R; Altobelli, Emanuela; Jensen, Kristin C; Sahoo, Debashis; Lu, Ying; Liao, Joseph C
2017-07-15
Purpose: The majority of bladder cancer patients present with localized disease and are managed by transurethral resection. However, the high rate of recurrence necessitates lifetime cystoscopic surveillance. Developing a sensitive and specific urine-based test would significantly improve bladder cancer screening, detection, and surveillance. Experimental Design: RNA-seq was used for biomarker discovery to directly assess the gene expression profile of exfoliated urothelial cells in urine derived from bladder cancer patients (n = 13) and controls (n = 10). Eight bladder cancer-specific and 3 reference genes identified by RNA-seq were quantitated by qPCR in a training cohort of 102 urine samples. A diagnostic model based on the training cohort was constructed using multiple logistic regression. The model was further validated in an independent cohort of 101 urines. Results: A total of 418 genes were found to be differentially expressed between bladder cancer and controls. Validation of a subset of these genes was used to construct an equation for computing a probability of bladder cancer score (P_BC) based on expression of three markers (ROBO1, WNT5A, and CDC42BPB). Setting P_BC = 0.45 as the cutoff for a positive test, urine testing using the three-marker panel had overall 88% sensitivity and 92% specificity in the training cohort. The accuracy of the three-marker panel in the independent validation cohort yielded an AUC of 0.87 and overall 83% sensitivity and 89% specificity. Conclusions: Urine-based molecular diagnostics using this three-marker signature could provide a valuable adjunct to cystoscopy and may lead to a reduction of unnecessary procedures for bladder cancer diagnosis. Clin Cancer Res; 23(14); 3700-10. ©2017 American Association for Cancer Research.
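The scoring step described above, a logistic-regression probability compared against a 0.45 cutoff, can be sketched as follows. Only the three marker names and the cutoff come from the abstract; the coefficients, intercept, expression values and the function name `bladder_cancer_probability` are hypothetical placeholders, since the fitted model parameters are not given.

```python
import math


def bladder_cancer_probability(expr: dict[str, float],
                               coef: dict[str, float],
                               intercept: float) -> float:
    """Logistic-regression probability from marker expression values."""
    z = intercept + sum(coef[g] * expr[g] for g in coef)
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical coefficients and expression values, for illustration only;
# the fitted model parameters are not given in the abstract.
coef = {"ROBO1": 0.8, "WNT5A": 0.6, "CDC42BPB": 0.5}
intercept = -1.0
sample = {"ROBO1": 1.2, "WNT5A": 0.4, "CDC42BPB": 0.9}

CUTOFF = 0.45  # cutoff for a positive test, as stated in the abstract
p = bladder_cancer_probability(sample, coef, intercept)
print(f"score = {p:.2f}, positive = {p >= CUTOFF}")
```

Moving the cutoff trades sensitivity against specificity, which is why the abstract reports both at one fixed threshold alongside a threshold-free AUC.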
Chen, Chia-Wei; Chu, Hsin; Tsai, Chia-Fen; Yang, Hui-Ling; Tsai, Jui-Chen; Chung, Min-Huey; Liao, Yuan-Mei; Chi, Mei-Ju; Chou, Kuei-Ru
2015-11-01
The purpose of this study was to translate the Rowland Universal Dementia Assessment Scale into Chinese and to evaluate the psychometric properties (reliability and validity) and the diagnostic properties (sensitivity, specificity and predictive values) of the Chinese version of the Rowland Universal Dementia Assessment Scale. The accurate detection of early dementia requires screening tools with favourable cross-cultural and linguistic properties and appropriate sensitivity, specificity, and predictive values, particularly for Chinese-speaking populations. This was a cross-sectional, descriptive study. Overall, 130 participants suspected to have cognitive impairment were enrolled in the study. A retest for determining reliability was scheduled four weeks after the initial test. Content validity was determined by five experts, whereas construct validity was established by using the contrasted-groups technique. The participants' clinical diagnoses were used as the standard in calculating the sensitivity, specificity, positive predictive value and negative predictive value. The study revealed that the Chinese version of the Rowland Universal Dementia Assessment Scale exhibited a test-retest reliability of 0.90, an internal consistency reliability of 0.71, an inter-rater reliability (kappa value) of 0.88 and a content validity index of 0.97. The patient group and the healthy contrast group differed significantly in cognitive ability. The optimal cut-off points for the Chinese version of the Rowland Universal Dementia Assessment Scale in the test for mild cognitive impairment and dementia were 24 and 22, respectively; moreover, for these two conditions, the sensitivities of the scale were 0.79 and 0.76, the specificities were 0.91 and 0.81, the areas under the curve were 0.85 and 0.78, the positive predictive values were 0.99 and 0.83 and the negative predictive values were 0.96 and 0.91, respectively.
The Chinese version of the Rowland Universal Dementia Assessment Scale exhibited sound reliability, validity, sensitivity, specificity and predictive values. This scale can help clinical staff members to quickly and accurately diagnose cognitive impairment and provide appropriate treatment as early as possible. © 2015 John Wiley & Sons Ltd.
Validation of the Japanese Version of the Body Vigilance Scale.
Saigo, Tatsuo; Takebayashi, Yoshitake; Tayama, Jun; Bernick, Peter J; Schmidt, Norman B; Shirabe, Susumu; Sakano, Yuji
2016-06-01
The Body Vigilance Scale is a self-report measure of attention to bodily sensations. The measure was translated into Japanese and its reliability, validity, and factor structure were verified. Participants comprised 286 university students (age: 19 ± 1 years). All participants were administered the scale, along with several indices of anxiety (i.e., Anxiety Sensitivity Index, Short Health Anxiety Inventory Illness Likelihood Scale, Social Interaction Anxiety Scale, and Hospital Anxiety and Depression Scale). The Japanese version of the Body Vigilance Scale exhibited a unidimensional factor structure and strong internal consistency. Construct validity was demonstrated by significant correlations with the above measures. Results suggest that the Japanese version of the scale is a reliable, valid tool for measuring body vigilance in Japanese university students. © The Author(s) 2016.
III. NIH Toolbox Cognition Battery (CB): measuring episodic memory.
Bauer, Patricia J; Dikmen, Sureyya S; Heaton, Robert K; Mungas, Dan; Slotkin, Jerry; Beaumont, Jennifer L
2013-08-01
One of the most significant domains of cognition is episodic memory, which allows for rapid acquisition and long-term storage of new information. For purposes of the NIH Toolbox, we devised a new test of episodic memory. The nonverbal NIH Toolbox Picture Sequence Memory Test (TPSMT) requires participants to reproduce the order of an arbitrarily ordered sequence of pictures presented on a computer. To adjust for ability, sequence length varies from 6 to 15 pictures. Multiple trials are administered to increase reliability. Pediatric data from the validation study revealed the TPSMT to be sensitive to age-related changes. The task also has high test-retest reliability and promising construct validity. Steps to further increase the sensitivity of the instrument to individual and age-related variability are described. © 2013 The Society for Research in Child Development, Inc.
Kalsi-Ryan, Sukhvinder; Beaton, Dorcas; Ahn, Henry; Askes, Heather; Drew, Brian; Curt, Armin; Popovic, Milos R; Wang, Justin; Verrier, Mary C; Fehlings, Michael G
2016-02-01
As spinal cord injury (SCI) trials begin to involve subjects with acute cervical SCI, establishing the property of an upper limb outcome measure to detect change over time is critical for its usefulness in clinical trials. The objectives of this study were to define responsiveness, sensitivity, and minimally detectable difference (MDD) of the Graded Redefined Assessment of Strength, Sensibility, and Prehension (GRASSP). An observational, longitudinal study was conducted. International Standards of Neurological Classification of SCI (ISNCSCI), GRASSP, Capabilities of Upper Extremity Questionnaire (CUE-Q), and Spinal Cord Independence Measure (SCIM) were administered 0-10 days, 1, 3, 6, and 12 months post-injury. Standardized Response Means (SRM) for GRASSP and ISNCSCI measures were calculated. Longitudinal construct validity was calculated using Pearson correlation coefficients. Smallest real difference for all subtests was calculated to define the MDD values for all GRASSP subtests. Longitudinal construct validity demonstrated GRASSP and all external measures to be responsive to neurological change for 1 year post-injury. SRM values for the GRASSP subtests ranged from 0.25 to 0.85 units greater than that for ISNCSCI strength and sensation, SCIM-SS, and CUE-Q. MDD values for GRASSP subtests ranged from 2-5 points. GRASSP demonstrates good responsiveness and excellent sensitivity that is superior to ISNCSCI and SCIM III. MDD values are useful in the evaluation of interventions in both clinical and research settings. The responsiveness and sensitivity of GRASSP make it a valuable condition-specific measure in tetraplegia, where changes in upper limb neurological and functional outcomes are essential for evaluating the efficacy of interventions.
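The two responsiveness statistics named above can be sketched in Python using their commonly cited definitions: the standardized response mean as mean change over the standard deviation of change, and the smallest real difference as 1.96 × √2 × SEM with SEM = SD × √(1 − ICC). Whether the study used exactly these forms is an assumption; the function names and all input values are hypothetical and are not GRASSP data.

```python
import math
from statistics import mean, stdev


def standardized_response_mean(baseline: list[float], follow_up: list[float]) -> float:
    """Mean change divided by the standard deviation of the change scores."""
    change = [b - a for a, b in zip(baseline, follow_up)]
    return mean(change) / stdev(change)


def smallest_real_difference(sd: float, icc: float) -> float:
    """SRD = 1.96 * sqrt(2) * SEM, with SEM = SD * sqrt(1 - ICC)."""
    sem = sd * math.sqrt(1 - icc)
    return 1.96 * math.sqrt(2) * sem


# Hypothetical subtest scores for five subjects at two time points.
baseline = [10.0, 12.0, 8.0, 15.0, 11.0]
follow_up = [14.0, 13.0, 12.0, 18.0, 16.0]

print(round(standardized_response_mean(baseline, follow_up), 2))
print(round(smallest_real_difference(sd=4.0, icc=0.9), 2))
```

A change smaller than the smallest real difference cannot be distinguished from measurement error at the 95% level, which is what makes it usable as a minimally detectable difference.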
Roets-Merken, Lieve M; Zuidema, Sytse U; Vernooij-Dassen, Myrra J F J; Kempen, Gertrudis I J M
2014-11-01
This study investigated the psychometric properties of the Severe Dual Sensory Loss screening tool, a tool designed to help nurses and care assistants to identify hearing, visual and dual sensory impairment in older adults. Construct validity of the Severe Dual Sensory Loss screening tool was evaluated using Cronbach's alpha and factor analysis. Interrater reliability was calculated using Kappa statistics. To evaluate the predictive validity, sensitivity and specificity were calculated by comparison with the criterion standard assessment for hearing and vision. The criterion used for hearing impairment was a hearing loss of ≥40 decibel measured by pure-tone audiometry, and the criterion for visual impairment was a visual acuity of ≤0.3 diopter or a visual field of ≤0.3°. Feasibility was evaluated by the time needed to fill in the screening tool and the clarity of the instruction and items. Prevalence of dual sensory impairment was calculated. A total of 56 older adults receiving aged care and 12 of their nurses and care assistants participated in the study. Cronbach's alpha was 0.81 for the hearing subscale and 0.84 for the visual subscale. Factor analysis showed two constructs for hearing and two for vision. Kappa was 0.71 for the hearing subscale and 0.74 for the visual subscale. The predictive validity showed a sensitivity of 0.71 and a specificity of 0.72 for the hearing subscale; and a sensitivity of 0.69 and a specificity of 0.78 for the visual subscale. The optimum cut-off point for each subscale was score 1. The nurses and care assistants reported that the Severe Dual Sensory Loss screening tool was easy to use. The prevalence of hearing and vision impairment was 55% and 29%, respectively, and that of dual sensory impairment was 20%.
The Severe Dual Sensory Loss screening tool was compared with the criterion standards for hearing and visual impairment and was found to be a valid and reliable tool, enabling nurses and care assistants to identify hearing, visual and dual sensory impairment among older adults. Copyright © 2014 Elsevier Ltd. All rights reserved.
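Cronbach's alpha, used in the abstract above as the internal-consistency statistic, can be sketched as below. This assumes the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of total scores); the item responses are hypothetical and are not taken from the study.

```python
import statistics


def cronbach_alpha(item_scores):
    """item_scores: one list per item, each holding one score per respondent."""
    k = len(item_scores)
    item_variances = [statistics.variance(item) for item in item_scores]
    # Total score per respondent, summed across items.
    totals = [sum(respondent) for respondent in zip(*item_scores)]
    return (k / (k - 1)) * (1 - sum(item_variances) / statistics.variance(totals))


# Hypothetical 3-item subscale answered by 5 respondents (0 = no, 1 = yes).
items = [
    [0, 1, 1, 0, 1],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 1, 1],
]
alpha = cronbach_alpha(items)
print(f"Cronbach's alpha = {alpha:.2f}")
```

Alpha rises as items covary more strongly relative to their individual variances, so it reflects how consistently the items of one subscale move together.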
Kudo, Yuka; Nakagawa, Atsuo; Tamura, Noriko; Kato, Noriko; Williams, Aya; Aida, Nobuo; Mimura, Masaru
2016-07-01
Parker et al. (2006) proposed a new approach to classify specific sub-types of non-melancholic depression caused by various stress factors and premorbid personality styles: the Temperament and Personality Questionnaire (T&P). The aim of the current study was to develop the Japanese version of the T&P and evaluate its reliability and validity. We studied 114 patients with non-melancholic depression. Reliability was assessed using the test-retest method. Convergent validity of the T&P was compared with the clinician ratings of each patient for the eight personality traits. We also assessed the impact of depressive state on the T&P. The test-retest intraclass correlation coefficients among eight constructs of the T&P ranged from 0.77 to 0.89, indicating good-to-excellent reliability. Anxious Worrying (rho=0.29), Perfectionism (rho=0.17), Personal Reserve (rho=0.18), Irritability (rho=0.38), and Social Avoidance (rho=0.32) showed adequate levels of convergent validity; Rejection Sensitivity (rho=0.16), Self-criticism (rho=-0.02), and Self-focus (rho=0.07) showed relatively weak convergent validity. Perfectionism (rho=-0.06), Social Avoidance (rho=0.17), Anxious Worrying (rho=0.40), Personal Reserve (rho=0.30), Irritability (rho=0.28), Rejection Sensitivity (rho=0.35), Self-criticism (rho=0.49), and Self-focus (rho=0.24) showed minimal sensitivity to mood state effects. Only one site was used. While a Likert scale was used, the clinician-rated personality trait measure had not been validated. The J-T&P is a reliable and valid measure for assessing temperament and personality in Japanese patients with non-melancholic depression. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Englbrecht, Matthias; Alten, Rieke; Aringer, Martin; Baerwald, Christoph G; Burkhardt, Harald; Eby, Nancy; Fliedner, Gerhard; Gauger, Bettina; Henkemeier, Ulf; Hofmann, Michael W; Kleinert, Stefan; Kneitz, Christian; Krueger, Klaus; Pohl, Christoph; Roske, Anne-Eve; Schett, Georg; Schmalzing, Marc; Tausche, Anne-Kathrin; Peter Tony, Hans; Wendler, Joerg
2017-01-01
To validate standard self-report questionnaires for depression screening in patients with rheumatoid arthritis (RA) and compare these measures to one another and to the Montgomery-Åsberg Depression Rating Scale (MADRS), a standardized structured interview. In 9 clinical centers across Germany, depressive symptomatology was assessed in 262 adult RA patients at baseline (T0) and at 12 ± 2 weeks followup (T1) using the World Health Organization 5-Item Well-Being Index (WHO-5), the Patient Health Questionnaire (PHQ-9), and the Beck Depression Inventory II (BDI-II). The construct validity of these depression questionnaires (using convergent and discriminant validity) was evaluated using Spearman's correlations at both time points. The test-retest reliability of the questionnaires was evaluated in RA patients who had not undergone a psychotherapeutic intervention or received antidepressants between T0 and T1. The sensitivity and the specificity of the questionnaires were calculated using the results of the MADRS, a structured interview, as the gold standard. According to Spearman's correlation coefficients, all questionnaires met convergent validity criteria (ρ > |0.50|), with the BDI-II performing best, while correlations with age and disease activity for all questionnaires met the criteria for discriminant validity (ρ < |0.50|). The only questionnaire to meet the predefined retest reliability criterion (ρ ≥ 0.70) was the BDI-II (r_s = 0.77), which also achieved the best results for both sensitivity and specificity (>80%) when using the MADRS as the gold standard. The BDI-II best met the predefined criteria, and the PHQ-9 met most of the validity criteria, with lower sensitivity and specificity. © 2016, American College of Rheumatology.
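The Spearman correlations used above for convergent and discriminant validity can be sketched as a Pearson correlation computed on ranks, with tied values sharing their average rank. This is an illustrative sketch only: the function names and the questionnaire scores are hypothetical, and the 0.50 threshold is the one quoted in the abstract.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for m in range(i, j + 1):
            ranks[order[m]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation of the rank-transformed data."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical scores on two depression questionnaires for five patients.
questionnaire_a = [5, 12, 9, 20, 15]
questionnaire_b = [4, 10, 11, 25, 18]
rho = spearman_rho(questionnaire_a, questionnaire_b)
meets_convergent_criterion = abs(rho) > 0.50
print(f"rho = {rho:.2f}, convergent criterion met: {meets_convergent_criterion}")
```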
Ghezeljeh, Tahereh Najafi; Ardebili, Fatimah Mohades; Rafii, Forough; Hagani, Hamid
2013-09-01
Burn as a traumatic life incident manifests severe pain and psychological problems. Specific instruments are needed to evaluate burn patients' psychological issues related to the injury. The aim of this study was to translate and evaluate the reliability and validity of the Persian versions of Impact of Burn Specific Pain Anxiety scale (BSPAS) and Impact of Event Scale (IES). In this cross-sectional study, convenience sampling was used to select 55 Iranian hospitalized burn patients. Combined translation was used for translating the scales. Cronbach's alpha, item-total correlation, and convergent and discriminative validity were evaluated. The Cronbach's α for both BSPAS- and IES-Persian version was 0.96. Item-total correlation coefficients ranged from 0.70 to 0.90. Convergent construct validity was confirmed by a high correlation between the scales designed to measure the same concepts. The mean score of BSPAS- and IES-Persian version was lower for individuals with a lower TBSA burn percentage, which supported the discriminative construct validity of the scales. BSPAS- and IES-Persian version showed high internal consistency and good validity for the assessment of burn psychological outcome in hospitalized burn patients. Future studies are needed to determine repeatability, factor structure, sensitivity and specificity of the scales. Copyright © 2013 Elsevier Ltd and ISBI. All rights reserved.
INCLEN Diagnostic Tool for Autism Spectrum Disorder (INDT-ASD): development and validation.
Juneja, Monica; Mishra, Devendra; Russell, Paul S S; Gulati, Sheffali; Deshmukh, Vaishali; Tudu, Poma; Sagar, Rajesh; Silberberg, Donald; Bhutani, Vinod K; Pinto, Jennifer M; Durkin, Maureen; Pandey, Ravindra M; Nair, M K C; Arora, Narendra K
2014-05-01
To develop and validate the INCLEN Diagnostic Tool for Autism Spectrum Disorder (INDT-ASD). Diagnostic test evaluation by cross sectional design. Four tertiary pediatric neurology centers in Delhi and Thiruvananthapuram, India. Children aged 2-9 years were enrolled in the study. INDT-ASD and Childhood Autism Rating Scale (CARS) were administered in a randomly decided sequence by a trained psychologist, followed by an expert evaluation by DSM-IV TR diagnostic criteria (gold standard). Psychometric parameters of diagnostic accuracy, validity (construct, criterion and convergent) and internal consistency. 154 children (110 boys, mean age 64.2 mo) were enrolled. The overall diagnostic accuracy (AUC=0.97, 95% CI 0.93, 0.99; P<0.001) and validity (sensitivity 98%, specificity 95%, positive predictive value 91%, negative predictive value 99%) of INDT-ASD for autism spectrum disorder were high, taking expert diagnosis using DSM-IV-TR as gold standard. The concordance rate between the INDT-ASD and expert diagnosis for 'ASD group' was 82.52% [Cohen's k=0.89; 95% CI (0.82, 0.97); P=0.001]. The internal consistency of INDT-ASD was 0.96. The convergent validity with CARS (r = 0.73, P = 0.001) and divergent validity with Binet-Kamat Test of intelligence (r = -0.37; P=0.004) were significantly high. INDT-ASD has a 4-factor structure explaining 85.3% of the variance. INDT-ASD has high diagnostic accuracy, adequate content validity, good internal consistency, high criterion validity, high to moderate convergent validity, and 4-factor construct validity for diagnosis of autism spectrum disorder.
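The four diagnostic-accuracy figures reported above (sensitivity, specificity, positive and negative predictive value) all derive from one 2×2 table of index-test results against the gold standard. The sketch below shows the arithmetic; the abstract does not report the underlying cell counts, so the counts used here are hypothetical and will not reproduce the published percentages.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Compute accuracy metrics from the cells of a 2x2 table.

    tp/fn: gold-standard positives the index test did / did not flag.
    fp/tn: gold-standard negatives the index test did / did not flag.
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


# Hypothetical counts (not the study's data).
metrics = diagnostic_metrics(tp=45, fp=10, fn=5, tn=90)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

Sensitivity and specificity are properties of the test itself, whereas the predictive values also depend on how common the condition is in the sample, which matters when a tool validated in tertiary centers is used elsewhere.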
Testing alternative ground water models using cross-validation and other methods
Foglia, L.; Mehl, S.W.; Hill, M.C.; Perona, P.; Burlando, P.
2007-01-01
Many methods can be used to test alternative ground water models. Of concern in this work are methods able to (1) rank alternative models (also called model discrimination) and (2) identify observations important to parameter estimates and predictions (equivalent to the purpose served by some types of sensitivity analysis). Some of the measures investigated are computationally efficient; others are computationally demanding. The latter are generally needed to account for model nonlinearity. The efficient model discrimination methods investigated include the information criteria: the corrected Akaike information criterion, Bayesian information criterion, and generalized cross-validation. The efficient sensitivity analysis measures used are dimensionless scaled sensitivity (DSS), composite scaled sensitivity, and parameter correlation coefficient (PCC); the other statistics are DFBETAS, Cook's D, and observation-prediction statistic. Acronyms are explained in the introduction. Cross-validation (CV) is a computationally intensive nonlinear method that is used for both model discrimination and sensitivity analysis. The methods are tested using up to five alternative parsimoniously constructed models of the ground water system of the Maggia Valley in southern Switzerland. The alternative models differ in their representation of hydraulic conductivity. A new method for graphically representing CV and sensitivity analysis results for complex models is presented and used to evaluate the utility of the efficient statistics. The results indicate that for model selection, the information criteria produce similar results at much smaller computational cost than CV. For identifying important observations, the only obviously inferior linear measure is DSS; the poor performance was expected because DSS does not include the effects of parameter correlation and PCC reveals large parameter correlations. © 2007 National Ground Water Association.
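Two of the information criteria named above can be sketched for ranking alternative models. This is a rough illustration that assumes the common least-squares forms (AICc = n ln(SSE/n) + 2k + 2k(k + 1)/(n − k − 1) and BIC = n ln(SSE/n) + k ln n); the paper's exact formulation, including how residuals are weighted and how parameters are counted, may differ. The model names, SSE values, and parameter counts are hypothetical.

```python
import math


def aicc(sse, n, k):
    """Corrected Akaike information criterion, least-squares form (assumed)."""
    aic = n * math.log(sse / n) + 2 * k
    return aic + (2 * k * (k + 1)) / (n - k - 1)


def bic(sse, n, k):
    """Bayesian information criterion, least-squares form (assumed)."""
    return n * math.log(sse / n) + k * math.log(n)


# Hypothetical alternatives: name -> (sum of squared residuals, parameter count).
models = {"model_A": (120.0, 3), "model_B": (80.0, 5), "model_C": (76.0, 9)}
n_obs = 40  # hypothetical number of observations

ranking = sorted(models, key=lambda name: aicc(models[name][0], n_obs, models[name][1]))
for name in ranking:
    sse, k = models[name]
    print(f"{name}: AICc = {aicc(sse, n_obs, k):.2f}, BIC = {bic(sse, n_obs, k):.2f}")
```

Lower values are preferred. Both criteria reward fit and penalize parameters, which is why they need only one calibration per model, in contrast to cross-validation, which recalibrates repeatedly.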
Cocaine sensitization models an anhedonia-like condition in rats.
Scheggi, Simona; Marchese, Giovanna; Grappi, Silvia; Secci, Maria Elena; De Montis, Maria Graziella; Gambarana, Carla
2011-04-01
Anhedonia is a core symptom of depression that also characterizes substance abuse-related mood disorders, in particular those secondary to stimulant abuse. This study investigated the long-lasting condition of cocaine sensitization as an inducing condition for anhedonia in rats. Cortical-mesolimbic dopamine plays a central role in assessing the incentive value of a stimulus and an increased dopamine output in these areas after a novel palatable meal seems to correlate with the ability to acquire an instrumental behaviour aimed at earning it again. This dopaminergic response is associated with consistent modifications in the phosphorylation pattern of some cAMP-dependent protein kinase (PKA) substrates and it is mediated by dopamine D1 receptor stimulation. Thus, since behavioural cocaine sensitization is characterized by tonically increased levels of phospho-Thr75 DARPP-32 that is a potent PKA inhibitor, we hypothesized that cocaine-sensitized rats might reveal deficits in palatable food responding. Indeed, non-food-deprived cocaine-sensitized rats showed no interest in palatable food, no dopaminergic response after a palatable meal in terms of increased dopamine output and DARPP-32 phosphorylation changes, and no ability to acquire a palatable food-sustained instrumental behaviour. Repeated administration of an established antidepressant compound, imipramine, corrected these deficits and reinstated the dopaminergic response in the cortico-mesolimbic areas to control values. Thus, the behavioural modifications observed in cocaine-sensitized rats satisfy some requirements for an experimental model of anhedonia since they are induced by repeated cocaine administration (aetiological validity), they mimic an anhedonia-like symptom (construct validity), and are reversed by the administration of imipramine (predictive validity).
Cavelti, Marialuisa; Contin, Giuliana; Beck, Eva-Marina; Kvrgic, Sara; Kossowsky, Joe; Stieglitz, Rolf-Dieter; Vauth, Roland
2012-01-01
Because the mere definition of insight from the therapist's viewpoint may not be sufficient to identify treatment targets for adherence enhancement, we need assessment strategies which are more sensitive to the patient's perspective. Illness perception (IP), defined as the beliefs a patient holds about his/her health problems, has been shown to affect coping in the context of a physical or mental illness, e.g. compliance behaviour. To assess IP in people diagnosed with schizophrenia, the Illness Perception Questionnaire for Schizophrenia (IPQS) was developed. The aim of the present study was to analyse the psychometric properties of the German version of the IPQS. The study sample consisted of 128 German-speaking outpatients suffering from chronic schizophrenia or schizoaffective disorder. To achieve comparability with the validation of the English scale version, the same constructs were assessed: psychopathology, depression, and beliefs about medication. Furthermore, insight into one's illness was assessed. Internal consistency, test-retest reliability and construct validity including convergent and discriminant validity were analysed. Five of eight IPQS subscales were found to be internally reliable and all subscales demonstrated high stability over time. Correlations with validity measures indicated that the subscales assess dimensions of a construct, which is distinct from psychopathology, depression, beliefs about medication and insight, except for the Identity subscale which substantially overlapped with measures of insight. The German version of the IPQS is an essentially reliable and valid measure of IP for German-speaking people with a schizophrenia spectrum disorder. This may encourage its usage in further studies investigating the impact of subjective beliefs about mental health problems on outcome and recovery in schizophrenia. Copyright © 2012 S. Karger AG, Basel.
ERIC Educational Resources Information Center
Wilkerson, Judy R.
2015-01-01
This commentary on the article titled "Examining the Internal Structure Evidence for the Performance Assessment for California Teachers: A Validation Study of the Elementary Literacy Teaching Event for Tier 1 Teacher Licensure" provides an overview of Performance Assessment for California Teachers (PACT), its relationship to edTPA and…
2011-01-01
Background The lack of culturally adapted and validated instruments for child mental health and psychosocial support in low and middle-income countries is a barrier to assessing prevalence of mental health problems, evaluating interventions, and determining program cost-effectiveness. Alternative procedures are needed to validate instruments in these settings. Methods Six criteria are proposed to evaluate cross-cultural validity of child mental health instruments: (i) purpose of instrument, (ii) construct measured, (iii) contents of construct, (iv) local idioms employed, (v) structure of response sets, and (vi) comparison with other measurable phenomena. These criteria are applied to transcultural translation and alternative validation for the Depression Self-Rating Scale (DSRS) and Child PTSD Symptom Scale (CPSS) in Nepal, which recently suffered a decade of war including conscription of child soldiers and widespread displacement of youth. Transcultural translation was conducted with Nepali mental health professionals and six focus groups with children (n = 64) aged 11-15 years old. Because of the lack of child mental health professionals in Nepal, a psychosocial counselor performed an alternative validation procedure using psychosocial functioning as a criterion for intervention. The validation sample was 162 children (11-14 years old). The Kiddie-Schedule for Affective Disorders and Schizophrenia (K-SADS) and Global Assessment of Psychosocial Disability (GAPD) were used to derive indication for treatment as the external criterion. Results The instruments displayed moderate to good psychometric properties: DSRS (area under the curve (AUC) = 0.82, sensitivity = 0.71, specificity = 0.81, cutoff score ≥ 14); CPSS (AUC = 0.77, sensitivity = 0.68, specificity = 0.73, cutoff score ≥ 20). 
The DSRS items with significant discriminant validity were "having energy to complete daily activities" (DSRS.7), "feeling that life is not worth living" (DSRS.10), and "feeling lonely" (DSRS.15). The CPSS items with significant discriminant validity were nightmares (CPSS.2), flashbacks (CPSS.3), traumatic amnesia (CPSS.8), feelings of a foreshortened future (CPSS.12), and easily irritated at small matters (CPSS.14). Conclusions Transcultural translation and alternative validation feasibly can be performed in low clinical resource settings through task-shifting the validation process to trained mental health paraprofessionals using structured interviews. This process is helpful to evaluate cost-effectiveness of psychosocial interventions. PMID:21816045
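The AUC and cutoff-based figures reported above can be sketched as follows. The AUC is computed here as the probability that a randomly chosen case scores higher than a randomly chosen non-case (ties counted as half), and sensitivity and specificity follow from a "score ≥ cutoff" rule like the one in the abstract. The scores are hypothetical, not the Nepali validation data.

```python
def auc(scores_pos, scores_neg):
    """Share of case/non-case pairs in which the case scores higher (ties = 0.5)."""
    wins = 0.0
    for p in scores_pos:
        for q in scores_neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def sens_spec_at_cutoff(scores_pos, scores_neg, cutoff):
    """Classify 'score >= cutoff' as screen-positive."""
    sensitivity = sum(s >= cutoff for s in scores_pos) / len(scores_pos)
    specificity = sum(s < cutoff for s in scores_neg) / len(scores_neg)
    return sensitivity, specificity


# Hypothetical scale scores for children with and without an indication for treatment.
cases = [18, 15, 14, 12, 20]
non_cases = [9, 13, 14, 8, 11]

area = auc(cases, non_cases)
sensitivity, specificity = sens_spec_at_cutoff(cases, non_cases, cutoff=14)
print(f"AUC = {area:.2f}, sensitivity = {sensitivity:.2f}, specificity = {specificity:.2f}")
```

Sweeping the cutoff over all observed scores and recording each sensitivity/specificity pair traces the ROC curve that the AUC summarizes.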
Construct validity of the MMPI-2 College Maladjustment (Mt) Scale.
Barthlow, Deanna L; Graham, John R; Ben-Porath, Yossef S; McNulty, John L
2004-09-01
The construct validity of the MMPI-2 (Minnesota Multiphasic Personality Inventory-2) College Maladjustment (Mt) Scale was examined using 376 student clients at a university psychological clinic. A principal components analysis and correlations of Mt scale scores with clients' and therapists' ratings of symptoms and functioning showed that the Mt scale identifies the presence of maladjustment as defined in terms of depressive and anxious symptoms. There is no evidence to show that the scale is specific to college students or that it is sensitive to severe psychological disturbance. The Mt scale does not inform the clinician as to why a person is distressed. In addition, there is no evidence from this study to suggest the superiority of the Mt scale over other MMPI-2 maladjustment measures. Therapists should use the entire MMPI-2 profile, not just the Mt scale, to gain the most comprehensive and specific understanding of clients.
Alyusuf, Raja H.; Prasad, Kameshwar; Abdel Satir, Ali M.; Abalkhail, Ali A.; Arora, Roopa K.
2013-01-01
Background: The exponential growth in the use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. Aim: The aim of this study is to develop and to validate a tool, which evaluates the quality of undergraduate medical educational websites; and apply it to the field of pathology. Methods: A tool was devised through several steps of item generation, reduction, weightage, pilot testing, post-pilot modification of the tool and validating the tool. Tool validation included measurement of inter-observer reliability; and generation of criterion related, construct related and content related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Results and Discussion: Reliability testing showed a high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ = 0.75. It showed high criterion related, construct related and content related validity. The tool showed moderately high concordance with the gold standard (κ = 0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. Conclusion: A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites. PMID:24392243
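The κ statistic reported above for inter-observer agreement can be sketched with Cohen's formula, κ = (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e the agreement expected by chance from each rater's category frequencies. The ratings below are hypothetical; the three category labels simply mirror the rating categories named in the abstract.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters over the same items."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: product of the two raters' marginal proportions per category.
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical ratings of eight websites by two observers.
observer_1 = ["recommended", "recommended", "caution", "not", "not", "recommended", "caution", "not"]
observer_2 = ["recommended", "caution", "caution", "not", "not", "recommended", "caution", "recommended"]

kappa = cohens_kappa(observer_1, observer_2)
print(f"kappa = {kappa:.2f}")
```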
Yılmaz, Emel; Eser, Erhan; Şekuri, Cevad; Kültürsay, Hakan
2011-08-01
The purpose of this study was to describe the psychometric properties of the Myocardial Infarction Dimensional Assessment Scale (MIDAS). This is a methodological cultural adaptation study. The MIDAS consists of 35 items covering seven domains: physical activity, insecurity, emotional reaction, dependency, diet, concerns over medication, and side effects, which are rated on a five-point Likert scale from 1: never to 5: always. The highest score of MIDAS is 100. Quality of life (QOL) decreases as the score of the scale increases. Overall 185 myocardial infarction (MI) patients were enrolled in this study. Cronbach alpha was used for the reliability analysis. Criterion validity, structural validity, and sensitivity analysis approaches were used for validity analysis. The New York Heart Association (NYHA) and Canadian Cardiovascular Society Functional Classifications (CCSFC) were used to test criterion validity, and the SF-36 was used to test the construct validity of the Turkish version of the MIDAS. The range of Cronbach alpha values is 0.79-0.90 for seven domains of the scale. No problematic items were observed for the entire scale. Medication related domains of the MIDAS showed considerable floor effects (35.7%-22.7%). Confirmatory Factor analysis indicators [Comparative Fit Index (CFI) = 0.95 and Root Mean Square Error of Approximation (RMSEA) = 0.075] supported the construct validity of MIDAS. Convergent validity of the MIDAS was confirmed by correlation with the SF-36 scales where appropriate. Criterion validity results were also satisfactory when comparing different stages of the NYHA and the CCSFC (p<0.05). Overall results revealed that the Turkish version of the MIDAS is a reliable and valid instrument.
Validation of the CMT Pediatric Scale as an outcome measure of disability
Burns, Joshua; Ouvrier, Robert; Estilow, Tim; Shy, Rosemary; Laurá, Matilde; Pallant, Julie F.; Lek, Monkol; Muntoni, Francesco; Reilly, Mary M.; Pareyson, Davide; Acsadi, Gyula; Shy, Michael E.; Finkel, Richard S.
2012-01-01
Objective Charcot-Marie-Tooth disease (CMT) is a common heritable peripheral neuropathy. There is no treatment for any form of CMT although clinical trials are increasingly occurring. Patients usually develop symptoms during the first two decades of life but there are no established outcome measures of disease severity or response to treatment. We identified a set of items that represent a range of impairment levels and conducted a series of validation studies to build a patient-centered multi-item rating scale of disability for children with CMT. Methods As part of the Inherited Neuropathies Consortium, patients aged 3–20 years with a variety of CMT types were recruited from the USA, UK, Italy and Australia. Initial development stages involved: definition of the construct, item pool generation, peer review and pilot testing. Based on data from 172 patients, a series of validation studies were conducted, including: item and factor analysis, reliability testing, Rasch modeling and sensitivity analysis. Results Seven areas for measurement were identified (strength, dexterity, sensation, gait, balance, power, endurance), and a psychometrically robust 11-item scale constructed (Charcot-Marie-Tooth disease Pediatric Scale: CMTPedS). Rasch analysis supported the viability of the CMTPedS as a unidimensional measure of disability in children with CMT. It showed good overall model fit, no evidence of misfitting items, no person misfit and it was well targeted for children with CMT. Interpretation The CMTPedS is a well-tolerated outcome measure that can be completed in 25-minutes. It is a reliable, valid and sensitive global measure of disability for children with CMT from the age of 3 years. PMID:22522479
Validation of the Mayo Hip Score: construct validity, reliability and responsiveness to change.
Singh, Jasvinder A; Schleck, Cathy; Harmsen, W Scott; Lewallen, David G
2016-01-19
Previous studies have provided the initial evidence for construct validity and test-retest reliability of the Mayo Hip Score. Instruments used for Total Hip Arthroplasty (THA) outcomes assessment should be valid, reliable and responsive to change. Our main objective was to examine the responsiveness to change, association with subsequent revision and the construct validity of the Mayo hip score. Discriminant ability was assessed by calculating effect size (ES), standardized response mean (SRM) and Guyatt's responsiveness index (GRI). Minimal clinically important difference (MCII) and moderate improvement thresholds were calculated. We assessed construct validity by examining association of scores with preoperative patient characteristics and correlation with Harris hip score, and assessed association of scores with the risk of subsequent revision. Five thousand three hundred seven provided baseline data; of those with baseline data, 2,278 and 2,089 (39%) provided 2- and 5-year data, respectively. Large ES, SRM and GRI ranging 2.66-2.78, 2.42-2.61 and 1.67-1.88 were noted for Mayo hip scores with THA, respectively. The MCII and moderate improvement thresholds were 22.4-22.7 and 39.4-40.5 respectively. Hazard ratios of revision surgery were higher with lower final score or less improvement in Mayo hip score at 2-years and borderline significant/non-significant at 5-years, respectively: (1) score ≤55 with hazard ratios of 2.24 (95% CI, 1.45, 3.46; p = 0.0003) and 1.70 (95% CI, 1.00, 2.92; p = 0.05) of implant revision subsequently, compared to 72-80 points; (2) no improvement or worsening score with hazard ratios 3.94 (95% CI, 1.50, 10.30; p = 0.005) and 2.72 (95% CI, 0.85,8.70; p = 0.09), compared to improvement >50-points. Mayo hip score had significant positive correlation with younger age, male gender, lower BMI, lower ASA class and lower Deyo-Charlson index (p ≤ 0.003 for each) and with Harris hip scores (p < 0.001). 
Mayo Hip Score is valid, sensitive to change and associated with future risk of revision surgery in patients with primary THA.
Construct Validity: Advances in Theory and Methodology
Strauss, Milton E.; Smith, Gregory T.
2008-01-01
Measures of psychological constructs are validated by testing whether they relate to measures of other constructs as specified by theory. Each test of relations between measures reflects on the validity of both the measures and the theory driving the test. Construct validation concerns the simultaneous process of measure and theory validation. In this chapter, we review the recent history of validation efforts in clinical psychological science that has led to this perspective, and we review five recent advances in validation theory and methodology of importance for clinical researchers. These are: the emergence of nonjustificationist philosophy of science; an increasing appreciation for theory and the need for informative tests of construct validity; valid construct representation in experimental psychopathology; the need to avoid representing multidimensional constructs with a single score; and the emergence of effective new statistical tools for the evaluation of convergent and discriminant validity. PMID:19086835
Strand, Julia F; Brown, Violet A; Merchant, Madeleine B; Brown, Hunter E; Smith, Julia
2018-06-19
Listening effort (LE) describes the attentional or cognitive requirements for successful listening. Despite substantial theoretical and clinical interest in LE, inconsistent operationalization makes it difficult to make generalizations across studies. The aims of this large-scale validation study were to evaluate the convergent validity and sensitivity of commonly used measures of LE and assess how scores on those tasks relate to cognitive and personality variables. Young adults with normal hearing (N = 111) completed 7 tasks designed to measure LE, 5 tests of cognitive ability, and 2 personality measures. Scores on some behavioral LE tasks were moderately intercorrelated but were generally not correlated with subjective and physiological measures of LE, suggesting that these tasks may not be tapping into the same underlying construct. LE measures differed in their sensitivity to changes in signal-to-noise ratio and the extent to which they correlated with cognitive and personality variables. Given that LE measures do not show consistent, strong intercorrelations and differ in their relationships with cognitive and personality predictors, these findings suggest caution in generalizing across studies that use different measures of LE. The results also indicate that people with greater cognitive ability appear to use their resources more efficiently, thereby diminishing the detrimental effects associated with increased background noise during language processing.
Time-saving impact of an algorithm to identify potential surgical site infections.
Knepper, B C; Young, H; Jenkins, T C; Price, C S
2013-10-01
To develop and validate a partially automated algorithm to identify surgical site infections (SSIs) using commonly available electronic data to reduce manual chart review. Retrospective cohort study of patients undergoing specific surgical procedures over a 4-year period from 2007 through 2010 (algorithm development cohort) or over a 3-month period from January 2011 through March 2011 (algorithm validation cohort). A single academic safety-net hospital in a major metropolitan area. Patients undergoing at least 1 included surgical procedure during the study period. Procedures were identified in the National Healthcare Safety Network; SSIs were identified by manual chart review. Commonly available electronic data, including microbiologic, laboratory, and administrative data, were identified via a clinical data warehouse. Algorithms using combinations of these electronic variables were constructed and assessed for their ability to identify SSIs and reduce chart review. The most efficient algorithm identified in the development cohort combined microbiologic data with postoperative procedure and diagnosis codes. This algorithm resulted in 100% sensitivity and 85% specificity. Time savings from the algorithm was almost 600 person-hours of chart review. The algorithm demonstrated similar sensitivity on application to the validation cohort. A partially automated algorithm to identify potential SSIs was highly sensitive and dramatically reduced the amount of manual chart review required of infection control personnel during SSI surveillance.
Measurement Properties of the Central Sensitization Inventory: A Systematic Review.
Scerbo, Thomas; Colasurdo, Joseph; Dunn, Sally; Unger, Jacob; Nijs, Jo; Cook, Chad
2018-04-01
Central sensitization (CS) is a phenomenon associated with several medical diagnoses, including postcancer pain, low back pain, osteoarthritis, whiplash, and fibromyalgia. CS involves an amplification of neural signaling within the central nervous system that results in pain hypersensitivity. The purpose of this systematic review was to gather published studies of a widely used outcome measure (the Central Sensitization Inventory [CSI]), determine the quality of evidence these publications reported, and examine the measurement properties of the CSI. Four databases were searched for publications from 2011 (when the CSI was developed) to July 2017. The Consensus-Based Standards for the Selection of Health Measurement Instruments (COSMIN) checklist was applied to evaluate methodological quality and risk of bias. In instances when COSMIN did not offer a scoring system for measurement properties, qualitative analyses were performed. Fourteen studies met inclusion criteria. Quality of evidence examined with the COSMIN checklist was determined to be good to excellent for all studies for their respective measurement property reports. Interpretability measures were consistent when publications were analyzed qualitatively, and construct validity was strong when examined alongside other validated measures relating to CS. An assessment of the published measurement studies of the CSI suggests the tool generates reliable and valid data that quantify the severity of several symptoms of CS. © 2017 World Institute of Pain.
Schlauch, Robert C.; Crane, Cory A.; Houston, Rebecca J.; Molnar, Danielle S.; Schlienz, Nicolas J.; Lang, Alan R.
2015-01-01
The current project sought to examine the psychometric properties of a personality-based measure (Substance Use Risk Profile Scale; SURPS: introversion-hopelessness, anxiety sensitivity, impulsivity, and sensation seeking) designed to differentially predict substance use preferences and patterns by matching primary personality-based motives for use to the specific effects of various psychoactive substances. Specifically, we sought to validate the SURPS in a clinical sample of substance users using cue reactivity methodology to assess current inclinations to consume a wide range of psychoactive substances. Using confirmatory factor analysis and correlational analyses, the SURPS demonstrated good psychometric properties and construct validity. Further, impulsivity and sensation-seeking were associated with use of multiple substances but could be differentiated by motives for use and susceptibility to the reinforcing effects of stimulants (i.e., impulsivity) and alcohol (i.e., sensation-seeking). In contrast, introversion-hopelessness and anxiety sensitivity demonstrated a pattern of use more focused on reducing negative affect, but were not differentiated based on specific patterns of use. Taken together, results suggest that among those receiving inpatient treatment for substance use disorders, the SURPS is a valid instrument for measuring four distinct personality dimensions that may be sensitive to motivational susceptibilities to specific patterns of alcohol and drug use. PMID:26052180
Olino, Thomas M; McMakin, Dana L; Forbes, Erika E
2016-11-20
Positive emotionality, anhedonia, and reward sensitivity share motivational and experiential elements of approach motivation and pleasure. Earlier work has examined the interrelationships among these constructs from measures of extraversion. More recently, the Research Domain Criteria introduced the Positive Valence Systems as a primary dimension to better understand psychopathology. However, the suggested measures tapping this construct have not yet been integrated within the structural framework of personality, even at the level of self-report. Thus, this study conducted exploratory factor and exploratory bifactor analyses on 17 different dimensions relevant to approach motivation, spanning anhedonia, behavioral activation system functioning, and positive emotionality. Convergent validity of these dimensions is tested by examining associations with depressive symptoms. Relying on multiple indices of fit, our preferred model included a general factor along with specific factors of affiliation, positive emotion, assertiveness, and pleasure seeking. These factors demonstrated different patterns of association with depressive symptoms. We discuss the plausibility of this model and highlight important future directions for work on the structure of a broad Positive Valence Systems construct. © The Author(s) 2016.
LeBouthillier, Daniel M; Asmundson, Gordon J G
2015-01-01
Several mechanisms have been posited for the anxiolytic effects of exercise, including reductions in anxiety sensitivity through interoceptive exposure. Studies on aerobic exercise lend support to this hypothesis; however, research investigating aerobic exercise in comparison to placebo, the dose-response relationship between aerobic exercise and anxiety sensitivity, the efficacy of aerobic exercise on the spectrum of anxiety sensitivity and the effect of aerobic exercise on other related constructs (e.g. intolerance of uncertainty, distress tolerance) is lacking. We explored reductions in anxiety sensitivity and related constructs following a single session of exercise in a community sample using a randomized controlled trial design. Forty-one participants completed 30 min of aerobic exercise or a placebo stretching control. Anxiety sensitivity, intolerance of uncertainty and distress tolerance were measured at baseline, post-intervention and 3-day and 7-day follow-ups. Individuals in the aerobic exercise group, but not the control group, experienced significant reductions with moderate effect sizes in all dimensions of anxiety sensitivity. Intolerance of uncertainty and distress tolerance remained unchanged in both groups. Our trial supports the efficacy of aerobic exercise in uniquely reducing anxiety sensitivity in individuals with varying levels of the trait and highlights the importance of empirically validating the use of aerobic exercise to address specific mental health vulnerabilities. Aerobic exercise may have potential as a temporary substitute for psychotherapy aimed at reducing anxiety-related psychopathology.
Supranowicz, Piotr; Paź, Małgorzata
2014-01-01
A holistic approach to health requires tools that can measure the inner world of individuals in its physical, mental and social dimensions. The aim was to create the Physical, Mental and Social Well-being scale (PMSW-21), which gives a holistic representation of the various dimensions of well-being as they are perceived by individuals and of how they affect their health. The study was conducted on a sample of 406 inhabitants of Warsaw involved in the Social Participation in Health Reform project. The PMSW-21 scale included: headache, tiredness, abdominal pain, palpitation, joint pain, backache, sleep disturbance (physical domain), anxiety, guiltiness, helplessness, hopelessness, sadness, self-dissatisfaction, hostility (mental domain), security, communicability, protection, loneliness, rejection, sociability and appreciation (social domain). Five criterion variables of health and seven of life experiences were adopted to assess the discriminative power of the PMSW-21 scale. The total well-being scale as well as its physical, mental and social domains showed high reliability (Cronbach's α 0.81, 0.77, 0.90 and 0.72, respectively). The analysis confirmed the construct validity. All items correlated more strongly with their own domain than with the others (ranges for physical: 0.41-0.55, mental: 0.49-0.80 and social: 0.31-0.50). The total scale demonstrated high sensitivity; it significantly differentiated almost all criterion variables. The physical domain showed high sensitivity for health as well as for negative life event variables, while the mental and social domains were more sensitive to life events. The analysis confirmed the usefulness of the PMSW-21 scale for measuring holistic well-being. The reliability of the total scale and its domains, construct validity and sensitivity for health and life determinants were at an acceptable level.
2016-04-01
research has been that the feedback amplifiers are sensitive to many controllable and some, as of yet, uncontrollable environmental factors. Many of these... 3.2.3 Design, construction, and testing of GEN-1 feedback amplifier
cDNA Clones with Rare and Recurrent Mutations Found in Cancers | Office of Cancer Genomics
The CTD2 Center at UT-MD Anderson Cancer Center has developed the High-Throughput Mutagenesis and Molecular Barcoding (HiTMMoB) pipeline to construct open reading frame expression clones of mutant alleles that are either recurrent or rare in cancers. These barcoded genes can be used for context-specific functional validation, detection of novel biomarkers (pathway activation) and targets (drug sensitivity).
Development of a QDots 800 based fluorescent solid phantom for validation of NIRF imaging platforms
NASA Astrophysics Data System (ADS)
Zhu, Banghe; Sevick-Muraca, Eva M.
2013-02-01
Over the past decade, we developed near-infrared fluorescence (NIRF) devices for non-invasive lymphatic imaging using microdosages of ICG in humans and for detection of lymph node metastasis in animal models mimicking metastatic human prostate cancer. To validate imaging, a NIST traceable phantom is needed so that developed "first-in-humans" drugs may be used with different fluorescent imaging platforms. In this work, we developed a QDots 800 based fluorescent solid phantom for installation and operational qualification of clinical and preclinical NIRF imaging devices. Due to its optical clearance, polyurethane was chosen as the base material. Titanium dioxide was used as the scattering agent because of its miscibility in polyurethane. QDots 800 was chosen owing to its stability and NIR emission spectra. A first phantom was constructed for evaluation of the noise floor arising from excitation light leakage, a phenomenon that can be minimized during engineering and design of fluorescent imaging systems. A second set of phantoms was constructed to enable quantification of device sensitivity associated with our preclinical and clinical devices. The phantoms have been successfully applied for installation and operational qualification of our preclinical and clinical devices. Assessment of excitation light leakage provides a figure of merit for "noise floor" and imaging sensitivity can be used to benchmark devices for specific imaging agents.
NASA Technical Reports Server (NTRS)
Taylor, Arthur C., III; Hou, Gene W.; Korivi, Vamshi M.
1991-01-01
A gradient-based design optimization strategy for practical aerodynamic design applications is presented, which uses the 2D thin-layer Navier-Stokes equations. The strategy is based on the classic idea of constructing different modules for performing the major tasks such as function evaluation, function approximation and sensitivity analysis, mesh regeneration, and grid sensitivity analysis, all driven and controlled by a general-purpose design optimization program. The accuracy of aerodynamic shape sensitivity derivatives is validated on two viscous test problems: internal flow through a double-throat nozzle and external flow over a NACA 4-digit airfoil. A significant improvement in aerodynamic performance has been achieved in both cases. Particular attention is given to a consistent treatment of the boundary conditions in the calculation of the aerodynamic sensitivity derivatives for the classic problems of external flow over an isolated lifting airfoil on 'C' or 'O' meshes.
Heald, Alison E; Fudman, Edward J; Anklesaria, Pervin; Mease, Philip J
2010-05-01
To assess the validity, responsiveness, and reliability of single-joint outcome measures for determining target joint (TJ) response in patients with inflammatory arthritis. Patient-reported outcomes (PRO), consisting of responses to single questions about TJ global status on a 100-mm visual analog scale (VAS; TJ global score), function on a 100-mm VAS (TJ function score), and pain on a 5-point Likert scale (TJ pain score) were piloted in 66 inflammatory arthritis subjects in a phase 1/2 clinical study of an intraarticular gene transfer agent and compared to physical examination measures (TJ swelling, TJ tenderness) and validated function questionnaires (Disabilities of the Arm, Shoulder and Hand scale, Rheumatoid Arthritis Outcome Score, and the Health Assessment Questionnaire). Construct validity was assessed by evaluating the correlation between the single-joint outcome measures and validated function questionnaires using Spearman's rank correlation. Responsiveness or sensitivity to change was assessed through calculating effect size and standardized response means (SRM). Reliability of physical examination measures was assessed by determining interobserver agreement. The single-joint PRO were highly correlated with each other and correlated well with validated functional measures. The TJ global score exhibited modest effect size and modest SRM that correlated well with the patient's assessment of response on a 100-mm VAS. Physical examination measures exhibited high interrater reliability, but correlated less well with validated functional measures and the patient's assessment of response. Single-joint PRO, particularly the TJ global score, are simple to administer and demonstrate construct validity and responsiveness in patients with inflammatory arthritis. (ClinicalTrials.gov identifier NCT00126724).
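Responsiveness statistics such as those named in the abstract above are simple ratios, and a minimal sketch may make the distinction between them concrete. It assumes the common definitions (effect size as mean change divided by the baseline standard deviation, SRM as mean change divided by the standard deviation of the change scores); the abstract does not state which variants the authors used, and the scores below are hypothetical.

```python
from statistics import mean, stdev

def effect_size(baseline, followup):
    """Mean change divided by the standard deviation of baseline scores."""
    changes = [f - b for b, f in zip(baseline, followup)]
    return mean(changes) / stdev(baseline)

def standardized_response_mean(baseline, followup):
    """Mean change divided by the standard deviation of the change scores."""
    changes = [f - b for b, f in zip(baseline, followup)]
    return mean(changes) / stdev(changes)

# Hypothetical 100-mm VAS scores for five subjects (assumption, not study data).
baseline = [60, 70, 55, 80, 65]
followup = [50, 62, 50, 65, 60]

es = effect_size(baseline, followup)
srm = standardized_response_mean(baseline, followup)
```

Because the two statistics share a numerator and differ only in the denominator, the SRM is larger in magnitude than the effect size whenever subjects change by similar amounts relative to how widely they were spread at baseline.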
Development and Validation of a Job Exposure Matrix for Physical Risk Factors in Low Back Pain
Solovieva, Svetlana; Pehkonen, Irmeli; Kausto, Johanna; Miranda, Helena; Shiri, Rahman; Kauppinen, Timo; Heliövaara, Markku; Burdorf, Alex; Husgafvel-Pursiainen, Kirsti; Viikari-Juntura, Eira
2012-01-01
Objectives: The aim was to construct and validate a gender-specific job exposure matrix (JEM) for physical exposures to be used in epidemiological studies of low back pain (LBP). Materials and Methods: We utilized two large Finnish population surveys, one to construct the JEM and another to test matrix validity. The exposure axis of the matrix included exposures relevant to LBP (heavy physical work, heavy lifting, awkward trunk posture and whole body vibration) and exposures that increase the biomechanical load on the low back (arm elevation) or those that in combination with other known risk factors could be related to LBP (kneeling or squatting). Job titles with similar work tasks and exposures were grouped. Exposure information was based on face-to-face interviews. Validity of the matrix was explored by comparing the JEM (group-based) binary measures with individual-based measures. The predictive validity of the matrix against LBP was evaluated by comparing the associations of the group-based (JEM) exposures with those of individual-based exposures. Results: The matrix includes 348 job titles, representing 81% of all Finnish job titles in the early 2000s. The specificity of the constructed matrix was good, especially in women. The validity measured with kappa-statistic ranged from good to poor, being fair for most exposures. In men, all group-based (JEM) exposures were statistically significantly associated with one-month prevalence of LBP. In women, four out of six group-based exposures showed an association with LBP. Conclusions: The gender-specific JEM for physical exposures showed relatively high specificity without compromising sensitivity. The matrix can therefore be considered as a valid instrument for exposure assessment in large-scale epidemiological studies, when more precise but more labour-intensive methods are not feasible. Although the matrix was based on Finnish data we foresee that it could be applicable, with some modifications, in other countries with a similar level of technology. PMID:23152793
Schmidt, Carsten Oliver; Kohlmann, T; Pfingsten, M; Lindena, G; Marnitz, U; Pfeifer, K; Chenot, J F
2016-01-01
Recognizing patients at risk of developing chronic low back pain is essential for targeted interventions. One of the best researched screening instruments for this purpose is the Örebro Musculoskeletal Pain Questionnaire (ÖMSPQ). This work addresses psychometric properties of the German ÖMSPQ short form and its construct and prognostic validity. Analyses are based on a cluster-randomized trial assessing a risk tailored intervention for patients consulting for low back pain in 35 general practices. A total of 360 patients consulting for acute and sub-acute back pain, aged 20-60 years, were included. All patients received a 10-item German short version of the ÖMSPQ, and other generic instruments (Graded Chronic Pain Scale, Patient Health Questionnaire-Depression, Hannover Functional Ability Questionnaire, Fear-Avoidance Beliefs Questionnaire). The construct validity was assessed based on the factorial structure of the items and correlations with generic instruments. The area under the curve (AUC), sensitivity and specificity were calculated as measures of prognostic validity. ÖMSPQ items belonging to the same subscale correlated highest among each other. The internal consistency of the ÖMSPQ items was 0.80 (Cronbach's α). The factorial structure corresponds with theoretic expectations. ÖMSPQ subscales on pain related disability, depression, and fear-avoidance beliefs correlated highest with their counterpart generic scales. The AUC for three ÖMSPQ-based prediction models ranged from 0.77 to 0.81. Our results support a satisfactory factorial and prognostic validity of the German short ÖMSPQ. The instrument may guide the provision of targeted interventions. Further research should link it to targeted treatments.
Mojtabai, Ramin; Corey-Lisle, Patricia K; Ip, Edward Hak-Sing; Kopeykina, Irina; Haeri, Sophia; Cohen, Lisa Janet; Shumaker, Sally
2012-12-30
Investigation of patients' subjective perspective regarding the effectiveness - as opposed to efficacy - of antipsychotic medication has been hampered by a relative shortage of self-report measures of global clinical outcome. This paper presents data supporting the feasibility, inter-item consistency, and construct validity of the Patient Assessment Questionnaire (PAQ)-a self-report measure of psychiatric symptoms, medication side effects and general wellbeing, ultimately intended to assess effectiveness of interventions for schizophrenia-spectrum patients. The original 53-item instrument was developed by a multidisciplinary team which utilized brainstorming sessions for item generation and content analysis, patient focus groups, and expert panel reviews. This instrument and additional validation measures were administered, via Audio Computer-Assisted Self-Interviewing (ACASI), to 300 stable, medicated outpatients diagnosed with schizophrenia or schizoaffective disorder. Item elimination was based on psychometric properties and Item-Response Theory information functions and characteristic curves. Exploratory factor analysis of the resulting 40-item scale yielded a five factor solution. The five subscales (General Distress, Side Effects, Psychotic Symptoms, Cognitive Symptoms, Sleep) showed robust convergent (β's=0.34-0.75, average β=0.49) and discriminant validity. The PAQ demonstrates feasibility, reliability, and construct validity as a self-report measure of multiple domains pertinent to effectiveness. Future research needs to establish the PAQ's sensitivity to change. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Construct Validity of the Nepalese School Leaving English Reading Test
ERIC Educational Resources Information Center
Dawadi, Saraswati; Shrestha, Prithvi N.
2018-01-01
There has been a steady interest in investigating the validity of language tests in the last decades. Despite numerous studies on construct validity in language testing, there are not many studies examining the construct validity of a reading test. This paper reports on a study that explored the construct validity of the English reading test in…
Quek, Kia Fatt; Chua, Chong Beng; Razack, Azad Hassan; Low, Wah Yun; Loh, Chit Sin
2005-01-01
The purpose of the present study was to validate the Mandarin version of the International Prostate Symptom Score (Mand-IPSS) in a Malaysian population. The validity and reliability were studied in patients with lower urinary tract symptoms (LUTS; benign prostatic hyperplasia [BPH] group) and without LUTS (control group). Test-retest methodology was used to assess the reliability while Cronbach alpha was used to assess the internal consistency. Sensitivity to change was expressed as the effect size index of the preintervention versus post-intervention score in patients with LUTS who underwent transurethral resection of the prostate. For the control group and BPH group, the internal consistency was excellent and a high degree of internal consistency was observed for all seven items (Cronbach alpha = 0.86-0.98 and 0.90-0.98, respectively). Test-retest correlation coefficients for all items were highly significant. Intraclass correlation coefficient (ICC) was high for the control (ICC = 0.93-0.99) and BPH group (ICC = 0.91-0.99). The instrument showed a high degree of sensitivity and specificity to the effects of treatment. A high degree of significance between baseline and post-treatment scores was observed across all seven items in the BPH group but not in the control group. The Mand-IPSS is a suitable, reliable, valid and sensitive instrument to measure clinical change in the Malaysian population.
Estuarine Human Activities Modulate the Fate of Changjiang-derived Materials in Adjacent Seas
NASA Astrophysics Data System (ADS)
WU, H.
2017-12-01
Mega constructions have been built in many river estuaries, but their environmental consequences in the adjacent coastal oceans have often been overlooked. This issue was addressed using the example of the Changjiang River Estuary, where massive navigation and reclamation constructions have been built in recent years. Based on model validations against cruise data and numerical scenario experiments, it is shown that the estuarine constructions profoundly affected the fates of riverine materials over a large offshore area. This is because estuarine dynamics are highly sensitive to bathymetry. Previously, the Three Gorges Dam (TGD) was thought to be responsible for some offshore environmental changes through modulating the river plume extension, but here we show that its influences are secondary. Since the TGD and the mega estuarine constructions were built during a similar period, their influences might be confused.
Alanazi, Fahad; Gleeson, Peggy; Olson, Sharon; Roddey, Toni
2017-04-01
Prospective cohort study of a cross-cultural low back pain (LBP) questionnaire. The objectives of the present study were to translate and cross-culturally adapt the Fear-Avoidance Beliefs Questionnaire (FABQ) to create a version in Arabic and to test its psychometric properties. The FABQ measures the effects that fear and avoidance beliefs have on work and on physical activity. An FABQ cross-culturally adapted for Arabic readers and speakers was created by forward translation, translation synthesis, and backward translation. Forty patients in Riyadh, Saudi Arabia, with LBP evaluated use of the questionnaire, and 70 patients from the same hospital participated in reliability, validity, and sensitivity studies. To determine test-retest reliability of the Arabic FABQ, patients completed it twice within 48 hours without receiving any active treatment between these two sessions. Patients completed the Arabic FABQ (and three other scales) at baseline and 14 days later to determine its validity and sensitivity. Test-retest reliability was good (FABQ-work: intraclass coefficient [ICC] = 0.74; FABQ-physical activity: ICC = 0.90; FABQ overall: ICC = 0.76). Correlations between the FABQ and three other instruments for measuring pain and disability were weak. The strongest correlation was found at the follow-up session with the Arabic Oswestry Questionnaire (r = 0.283; P ≤ 0.05). Sensitivity to change was low. The translation and adaptation of the Arabic version of the FABQ was successful. Overall, the Arabic FABQ had good test-retest reliability, acceptable construct validity, and low sensitivity to change. The Arabic version of the FABQ shows promise in the assessment of fear-avoidance beliefs among patients with LBP who speak and read Arabic.
Moving to Capture Children's Attention: Developing a Methodology for Measuring Visuomotor Attention.
Hill, Liam J B; Coats, Rachel O; Mushtaq, Faisal; Williams, Justin H G; Aucott, Lorna S; Mon-Williams, Mark
2016-01-01
Attention underpins many activities integral to a child's development. However, methodological limitations currently make large-scale assessment of children's attentional skill impractical, costly and lacking in ecological validity. Consequently we developed a measure of 'Visual Motor Attention' (VMA)-a construct defined as the ability to sustain and adapt visuomotor behaviour in response to task-relevant visual information. In a series of experiments, we evaluated the capability of our method to measure attentional processes and their contributions in guiding visuomotor behaviour. Experiment 1 established the method's core features (ability to track stimuli moving on a tablet-computer screen with a hand-held stylus) and demonstrated its sensitivity to principled manipulations in adults' attentional load. Experiment 2 standardised a format suitable for use with children and showed construct validity by capturing developmental changes in executive attention processes. Experiment 3 tested the hypothesis that children with and without coordination difficulties would show qualitatively different response patterns, finding an interaction between the cognitive and motor factors underpinning responses. Experiment 4 identified associations between VMA performance and existing standardised attention assessments and thereby confirmed convergent validity. These results establish a novel approach to measuring childhood attention that can produce meaningful functional assessments that capture how attention operates in an ecologically valid context (i.e. attention's specific contribution to visuomanual action).
Proposal and validation of a clinical trunk control test in individuals with spinal cord injury.
Quinzaños, J; Villa, A R; Flores, A A; Pérez, R
2014-06-01
One of the problems that arise in spinal cord injury (SCI) is alteration in trunk control. Despite the need for standardized scales, these do not exist for evaluating trunk control in SCI. To propose and validate a trunk control test in individuals with SCI. National Institute of Rehabilitation, Mexico. The test was developed and later evaluated for reliability and criterion, content, and construct validity. We carried out 531 tests on 177 patients and found high inter- and intra-rater reliability. In terms of criterion validity, analysis of variance demonstrated a statistically significant difference in the test score of patients with adequate or inadequate trunk control according to the assessment of a group of experts. A receiver operating characteristic curve was plotted for optimizing the instrument's cutoff point, which was determined at 13 points, with a sensitivity of 98% and a specificity of 92.2%. With regard to construct validity, the correlation between the proposed test and the spinal cord independence measure (SCIM) was 0.873 (P=0.001) and that with the evolution time was 0.437 (P=0.001). For testing the hypothesis with qualitative variables, the Kruskal-Wallis test was performed, which resulted in a statistically significant difference between the scores in the proposed scale of each group defined by these variables. It was proven experimentally that the proposed trunk control test is valid and reliable. Furthermore, the test can be used for all patients with SCI regardless of the type and level of injury.
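Choosing a cutoff from a receiver operating characteristic curve, as in the abstract above, amounts to scanning candidate thresholds and picking the one with the best trade-off between sensitivity and specificity. The sketch below uses Youden's J (sensitivity + specificity - 1) as the criterion; this is an assumption, since the abstract does not say which optimization rule was applied, and the scores and expert labels are hypothetical.

```python
def best_cutoff(scores, labels):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    A score >= cutoff is classified as positive (label 1).
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    best = None
    for cutoff in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
        tn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 0)
        sens = tp / positives
        spec = tn / negatives
        j = sens + spec - 1
        # Keep the first (lowest) cutoff that reaches the highest J.
        if best is None or j > best[0]:
            best = (j, cutoff, sens, spec)
    return best[1:]

# Hypothetical test scores and expert ratings (1 = adequate trunk control).
scores = [5, 8, 10, 11, 12, 14, 13, 14, 16, 18, 20]
labels = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

cutoff, sensitivity, specificity = best_cutoff(scores, labels)
```

Other criteria (for example, the point closest to the top-left corner of the ROC plot, or a cutoff weighted by the cost of false negatives) can select a different threshold from the same data.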
Psychometrics of the PHQ-9 as a measure of depressive symptoms in patients with heart failure.
Hammash, Muna H; Hall, Lynne A; Lennie, Terry A; Heo, Seongkum; Chung, Misook L; Lee, Kyoung Suk; Moser, Debra K
2013-10-01
Depression in patients with heart failure commonly goes undiagnosed and untreated. The Patient Health Questionnaire-9 (PHQ-9) is a simple, valid measure of depressive symptoms that may facilitate clinical assessment. It has not been validated in patients with heart failure. To test the reliability, and concurrent and construct validity of the PHQ-9 in patients with heart failure. A total of 322 heart failure patients (32% female, 61 ± 12 years, 56% New York Heart Association class III/IV) completed the PHQ-9, the Beck Depression Inventory-II (BDI-II), and the Control Attitudes Scale (CAS). Cronbach's alpha of .83 supported the internal consistency reliability of the PHQ-9 in this sample. Inter-item correlations (range .22-.66) and item-total correlation (except item 9) supported homogeneity of the PHQ-9. Spearman's rho of .80, (p < .001) between the PHQ-9 and the BDI-II supported the concurrent validity as did the agreement between the PHQ-9 and the BDI-II (Kappa = 0.64, p < .001). At cut-off score of 10, the PHQ-9 was 70% sensitive and 92% specific in identifying depressive symptoms, using the BDI-II scores as the criterion for comparison. Differences in PHQ-9 scores by level of perceived control measured by CAS (t(318) = -5.05, p < .001) supported construct validity. The PHQ-9 is a reliable, valid measure of depressive symptoms in patients with heart failure.
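Several abstracts in this listing report Cronbach's alpha as the index of internal consistency, so a short sketch of the calculation may be useful. It uses the standard formula, alpha = k/(k-1) × (1 - sum of item variances / variance of total scores); the function name and the item responses are hypothetical and unrelated to the PHQ-9 data.

```python
from statistics import variance

def cronbach_alpha(items):
    """Cronbach's alpha for a list of per-item score lists.

    Each inner list holds one item's scores, ordered by respondent.
    """
    k = len(items)
    totals = [sum(responses) for responses in zip(*items)]  # total score per respondent
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))

# Hypothetical responses: 3 items scored 0-3 by 5 respondents (assumption).
items = [
    [0, 1, 2, 3, 3],
    [1, 1, 2, 2, 3],
    [0, 2, 1, 3, 3],
]
alpha = cronbach_alpha(items)
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is read as evidence that the items measure a common construct.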
Measuring Cognitive and Affective Constructs in the Context of an Acute Health Event
Boudreaux, Edwin D.; Moon, Simon; Tappe, Karyn A.; Bock, Beth; Baumann, Brigitte; Chapman, Gretchen B.
2013-01-01
The latest recommendations for building dynamic health behavior theories emphasize that cognitions, emotions, and behaviors, and the nature of their inter-relationships, can change over time. This paper describes the development and psychometric validation of four scales created to measure smoking-related causal attributions, perceived illness severity, event-related emotions, and intention to quit smoking among patients experiencing acute cardiac symptoms. After completing qualitative work with a sample of 50 cardiac patients, we administered the scales to 300 patients presenting to the emergency department for cardiac-related symptoms. Factor analyses, alpha coefficients, ANOVAs, and Pearson correlation coefficients were used to establish the scales' reliability and validity. Factor analyses revealed stable factor structures for each of the four constructs. The scales were internally consistent, with the majority having an alpha of >0.80 (range: 0.57 to 0.89). Mean differences in ratings of the perceived illness severity and event-related emotions were noted across the three time anchors. Significant increases in intention to quit at the time of enrollment, compared to retrospective ratings of intention to quit before the event, provide preliminary support for the sensitivity of this measure to the motivating impact of the event. Finally, smoking-related causal attributions, perceived illness severity, and event-related emotions correlated in the expected directions with intention to quit smoking, providing preliminary support for construct validity. PMID:22970703
Orthorexia nervosa: validation of a diagnosis questionnaire.
Donini, L M; Marsili, D; Graziani, M P; Imbriale, M; Cannella, C
2005-06-01
To validate a questionnaire for the diagnosis of orthorexia nervosa, an eating disorder defined as "maniacal obsession for healthy food". 525 subjects were enrolled. They were then randomized into two samples (a sample of 404 subjects for the construction of the test for the diagnosis of orthorexia, ORTO-15; a sample of 121 subjects for the validation of the test). The ORTO-15 questionnaire, validated for the diagnosis of orthorexia, is made up of 15 multiple-choice items. The test we proposed for the diagnosis of orthorexia (ORTO-15) showed a good predictive capability at a threshold value of 40 (efficacy 73.8%, sensitivity 55.6% and specificity 75.8%), also on verification with a control sample. However, it has a limit in identifying the obsessive disorder. For this reason we maintain that further investigation is necessary and that new questions useful for the evaluation of obsessive-compulsive behavior should be added to the ORTO-15 questionnaire.
Lombardo, Caterina; Iani, Luca; Barbaranelli, Claudio
2016-08-01
The present paper describes two studies designed to evaluate the construct and the predictive validity of an Italian version of the Food Craving Questionnaire-State (FCQ-S). In the first study 368 volunteers aged 18-65 years completed the FCQ-S and the Disordered Eating Questionnaire (DEQ). In the second study 41 females with eating disorders symptoms (mean age: 24.4 years; DEQ ≥ 30; Body Mass Index (BMI) in the range 17 to 30.9 kg/m², 87.5% in the normal range) and 43 female healthy controls (mean age: 25.6 years; DEQ < 30; BMI in the normal range) took part in an experiment aimed at assessing changes in FCQ-S after exposure to words or images of highly palatable foods. The results of Study 1 showed that the five-factor model had acceptable fit indices. All subscales of the FCQ-S (except Desire) significantly correlated with the disordered eating measure. The strongest relationship was found between disordered eating and fear of losing control over food intake. The results of Study 2 revealed that four out of five FCQ-S subscales significantly increased after exposure to food stimuli. Participants with eating disorders symptoms, as compared to controls, also showed higher fear of losing control over food and higher negative reinforcement, although this difference was only marginally significant. The Italian version of the FCQ-S has good construct and concurrent validity, and it seems sensitive in detecting changes induced by stimuli related to highly palatable foods. Copyright © 2016 Elsevier Ltd. All rights reserved.
Butler, Stephen F.; Black, Ryan A.; McCaffrey, Stacey A.; Ainscough, Jessica; Doucette, Ann M.
2017-01-01
The purpose of this study was to develop and validate a computer adaptive testing (CAT) version of the Addiction Severity Index-Multimedia Version (ASI-MV®), the Addiction Severity CAT. This goal was accomplished in four steps. First, new candidate items for Addiction Severity CAT domains were evaluated after brainstorming sessions with experts in substance abuse treatment. Next, this new item bank was psychometrically evaluated on a large non-clinical (n =4419) and substance abuse treatment sample (n =845). Based on these results, final items were selected and calibrated for the creation of the Addiction Severity CAT algorithms. Once the algorithms were developed for the entire assessment, a fully functioning prototype of an Addiction Severity CAT was created. CAT simulations were conducted and optimal termination criteria were selected for the Addiction Severity CAT algorithms. Finally, construct validity of the CAT algorithms was evaluated by examining convergent/discriminant validity and sensitivity to change. The Addiction Severity CAT was determined to be valid, sensitive to change, and reliable. Further, the Addiction Severity CAT’s time of administration was found to be significantly less than the average time of administration for the ASI-MV composite scores. This study represents the initial validation of an IRT-based Addiction Severity CAT, and further exploration of the Addiction Severity CAT is needed. PMID:28230387
Butler, Stephen F; Black, Ryan A; McCaffrey, Stacey A; Ainscough, Jessica; Doucette, Ann M
2017-05-01
The purpose of this study was to develop and validate a computer adaptive testing (CAT) version of the Addiction Severity Index-Multimedia Version (ASI-MV), the Addiction Severity CAT. This goal was accomplished in 4 steps. First, new candidate items for Addiction Severity CAT domains were evaluated after brainstorming sessions with experts in substance abuse treatment. Next, this new item bank was psychometrically evaluated on a large nonclinical (n = 4,419) and substance abuse treatment (n = 845) sample. Based on these results, final items were selected and calibrated for the creation of the Addiction Severity CAT algorithms. Once the algorithms were developed for the entire assessment, a fully functioning prototype of an Addiction Severity CAT was created. CAT simulations were conducted, and optimal termination criteria were selected for the Addiction Severity CAT algorithms. Finally, construct validity of the CAT algorithms was evaluated by examining convergent and discriminant validity and sensitivity to change. The Addiction Severity CAT was determined to be valid, sensitive to change, and reliable. Further, the Addiction Severity CAT's time of completion was found to be significantly less than the average time of completion for the ASI-MV composite scores. This study represents the initial validation of an Addiction Severity CAT based on item response theory, and further exploration of the Addiction Severity CAT is needed. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
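Computer adaptive tests built on item response theory typically choose each next item to be as informative as possible at the respondent's current trait estimate. The sketch below illustrates that selection step under a two-parameter logistic (2PL) model. The 2PL model, the function names, and the item bank are assumptions made for illustration; the abstract does not state which IRT model or selection rule the Addiction Severity CAT uses.

```python
import math


def p_endorse(theta, a, b):
    """2PL probability of endorsing an item (discrimination a, difficulty b)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at trait level theta."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)


def next_item(theta, bank, administered):
    """Pick the not-yet-administered item that is most informative at theta."""
    candidates = [item for item in bank if item not in administered]
    return max(candidates, key=lambda item: item_information(theta, *bank[item]))


# Hypothetical item bank: item id -> (discrimination a, difficulty b).
bank = {"q1": (1.0, -1.0), "q2": (1.5, 0.0), "q3": (1.2, 1.5)}

first = next_item(theta=0.0, bank=bank, administered=set())
second = next_item(theta=0.0, bank=bank, administered={first})
print(first, second)
```

In a full adaptive test the trait estimate would be updated after every response and the loop would stop once a termination criterion is met; both steps are omitted here.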
Validation of the M. D. Anderson Symptom Inventory multiple myeloma module
2013-01-01
Background: The symptom burden associated with multiple myeloma (MM) is often severe. Presently, no instrument comprehensively assesses disease-related and treatment-related symptoms in patients with MM. We sought to validate a module of the M. D. Anderson Symptom Inventory (MDASI) developed specifically for patients with MM (MDASI-MM). Methods: The MDASI-MM was developed with clinician input, cognitive debriefing, and literature review, and administered to 132 patients undergoing induction chemotherapy or stem cell transplantation. We demonstrated the MDASI-MM’s reliability (Cronbach α values); criterion validity (item and subscale correlations between the MDASI-MM and the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire (EORTC QLQ-C30) and the EORTC MM module (QLQ-MY20)), and construct validity (differences between groups by performance status). Ratings from transplant patients were examined to demonstrate the MDASI-MM’s sensitivity in detecting the acute worsening of symptoms post-transplantation. Results: The MDASI-MM demonstrated excellent correlations with subscales of the 2 EORTC instruments, strong ability to distinguish clinically different patient groups, high sensitivity in detecting change in patients’ performance status, and high reliability. Cognitive debriefing confirmed that the MDASI-MM encompasses the breadth of symptoms relevant to patients with MM. Conclusion: The MDASI-MM is a valid, reliable, comprehensive-yet-concise tool that is recommended as a uniform symptom assessment instrument for patients with MM. PMID:23384030
Johnston, Marie; Dixon, Diane; Hart, Jo; Glidewell, Liz; Schröder, Carin; Pollard, Beth
2014-05-01
In studies involving theoretical constructs, it is important that measures have good content validity and that there is not contamination of measures by content from other constructs. While reliability and construct validity are routinely reported, to date, there has not been a satisfactory, transparent, and systematic method of assessing and reporting content validity. In this paper, we describe a methodology of discriminant content validity (DCV) and illustrate its application in three studies. Discriminant content validity involves six steps: construct definition, item selection, judge identification, judgement format, single-sample test of content validity, and assessment of discriminant items. In three studies, these steps were applied to a measure of illness perceptions (IPQ-R) and control cognitions. The IPQ-R performed well with most items being purely related to their target construct, although timeline and consequences had small problems. By contrast, the study of control cognitions identified problems in measuring constructs independently. In the final study, direct estimation response formats for theory of planned behaviour constructs were found to have as good DCV as Likert format. The DCV method allowed quantitative assessment of each item and can therefore inform the content validity of the measures assessed. The methods can be applied to assess content validity before or after collecting data to select the appropriate items to measure theoretical constructs. Further, the data reported for each item in Appendix S1 can be used in item or measure selection. Statement of contribution What is already known on this subject? There are agreed methods of assessing and reporting construct validity of measures of theoretical constructs, but not their content validity. Content validity is rarely reported in a systematic and transparent manner. What does this study add? 
The paper proposes discriminant content validity (DCV), a systematic and transparent method of assessing and reporting whether items assess the intended theoretical construct and only that construct. In three studies, DCV was applied to measures of illness perceptions, control cognitions, and theory of planned behaviour response formats. Appendix S1 gives content validity indices for each item of each questionnaire investigated. Discriminant content validity is ideally applied while the measure is being developed, before using to measure the construct(s), but can also be applied after using a measure. © 2014 The British Psychological Society.
Evaluating the Dimensionality of Pornography.
Busby, Dean M; Chiu, Hsin-Yao; Olsen, Joseph A; Willoughby, Brian J
2017-08-01
Pornography may be a construct with a single trait or one with many traits. Research in the past was inconsistent in this regard with most researchers assuming that pornography was unidimensional (with one single trait of pornography). However, the considerable amounts of residual variation found in these studies beyond that explained by the single trait hints at what might be a multidimensional construct (with multiple traits such as sensitization and differentiation). Consequently, in this study, we intended to address the question of whether pornography consisted of a single trait or if it was multidimensional. Using MTurk, 2173 participants from the United States and the Commonwealth of Nations (in which pornography is not strictly illegal) were recruited and asked to rate how pornographic they thought a list of different depictions were. The data were analyzed utilizing the cross-validation procedure in which two subsamples were created from the main sample and one was used to establish the model building and the other to validate the model. Various models, including first-order and higher-order exploratory and confirmatory factor models, were tested. Results indicated that a bi-factor (multidimensional) model generated the best model fit, and that it was most appropriate to consider pornography multidimensional. The final model contained two dimensions ("Sensitization" and "Differentiation"). While sensitization revealed the participants' general tendency to rate all items to be more or less pornographic, differentiation revealed the participants' tendency to differentiate highly pornographic items from less pornographic items. Based on the findings of this study, we suggest that future research on the usage and effects of pornography be conducted while taking into consideration the multidimensional nature of pornography.
Measuring social impacts of breast carcinoma treatment in Chinese women.
Fielding, Richard; Lam, Wendy W T
2004-06-15
There is no existing instrument that is suitable for measuring the social impact of breast carcinoma (BC) and its treatment among women of Southern Chinese descent. In the current study, the authors assessed the validity of the Chinese Social Adjustment Scale, which was designed to address the need for such an instrument. Five dimensions of social concern were identified in a previous study of Cantonese-speaking Chinese women with BC; these dimensions were family and other relationships, intimacy, private self-image, and public self-image. The authors designed 40 items to address perceptions of change in these areas. These items were administered to a group of 226 women who had received treatment for BC, and factor analysis subsequently was performed to determine construct characteristics. The resulting draft instrument then was administered, along with other measures for the assessment of basic psychometric properties, to a second group of 367 women who recently had undergone surgery for BC. Factor analysis optimally identified 5 factors (corresponding to 33 items): 1) Relationships with Family (10 items, accounting for 22% of variance); 2) Self-Image (7 items, accounting for 15% of variance); 3) Relationships with Friends (7 items, accounting for 8% of variance); 4) Social Enjoyment (4 items, accounting for 6% of variance); and 5) Attractiveness and Sexuality (5 items, accounting for 5% of variance). Subscales were reliable (alpha = 0.63-0.93) and exhibited convergent validity in positive correlations with related measures and divergent validity in appropriate inverse or nonsignificant correlations with other measures. Criterion validity was good, and sensitivity was acceptable. Patterns of change on the scales were consistent with reports in the literature. Self-administration resulted in improved sensitivity. 
The 33-item Chinese Social Adjustment Scale validly, reliably, and sensitively measures the social impact of BC on Cantonese-speaking Hong Kong Chinese women. Further development of the scale to increase its sensitivity is underway. Copyright 2004 American Cancer Society.
Strand, Edythe A; McCauley, Rebecca J; Weigand, Stephen D; Stoeckel, Ruth E; Baas, Becky S
2013-04-01
In this article, the authors report reliability and validity evidence for the Dynamic Evaluation of Motor Speech Skill (DEMSS), a new test that uses dynamic assessment to aid in the differential diagnosis of childhood apraxia of speech (CAS). Participants were 81 children between 36 and 79 months of age who were referred to the Mayo Clinic for diagnosis of speech sound disorders. Children were given the DEMSS and a standard speech and language test battery as part of routine evaluations. Subsequently, intrajudge, interjudge, and test-retest reliability were evaluated for a subset of participants. Construct validity was explored for all 81 participants through the use of agglomerative cluster analysis, sensitivity measures, and likelihood ratios. The mean percentage of agreement for 171 judgments was 89% for test-retest reliability, 89% for intrajudge reliability, and 91% for interjudge reliability. Agglomerative hierarchical cluster analysis showed that total DEMSS scores largely differentiated clusters of children with CAS vs. mild CAS vs. other speech disorders. Positive and negative likelihood ratios and measures of sensitivity and specificity suggested that the DEMSS does not overdiagnose CAS but sometimes fails to identify children with CAS. The value of the DEMSS in differential diagnosis of severe speech impairments was supported on the basis of evidence of reliability and validity.
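Positive and negative likelihood ratios follow directly from sensitivity and specificity. The short Python sketch below shows the standard formulas; the input values are hypothetical and are not the DEMSS results.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for a dichotomous test."""
    lr_pos = sensitivity / (1.0 - specificity)   # how much a positive result raises the odds
    lr_neg = (1.0 - sensitivity) / specificity   # how much a negative result lowers the odds
    return lr_pos, lr_neg


# Hypothetical test: high specificity, moderate sensitivity.
lr_pos, lr_neg = likelihood_ratios(sensitivity=0.65, specificity=0.97)
print(round(lr_pos, 2), round(lr_neg, 2))
```

A large LR+ combined with an LR- that is not close to zero is the numerical signature of a test that rarely overdiagnoses but sometimes misses true cases.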
Screening for confabulations with the confabulation screen.
Dalla Barba, Gianfranco; Brazzarola, Marta; Marangoni, Sara; La Corte, Valentina
2018-04-24
The objective of this work is to devise and validate a sensitive and specific test for confabulatory impairment. We conceived a screening test for confabulation, the Confabulation Screen (CS), a brief test using 10 questions of episodic memory (EM), where confabulators most frequently confabulate. It was postulated that the CS would predict confabulations not only in EM, but also in the other subordinate structures of personal temporality, namely the present and the future. Thirty confabulating amnesic patients of various aetiologies and 97 normal controls entered the study. Participants were administered the CS and the Confabulation Battery (Dalla Barba, G., & Decaix, C. (2009). "Do you remember what you did on March 13 1985?" A case study of confabulatory hypermnesia. Cortex, 45(5), 566-574). Confabulations in the CS positively and significantly correlated with confabulations in personal temporality domains of the CB, namely EM, orientation in time and place and episodic plans. Conversely, as expected, they did not correlate with confabulations in impersonal temporality domains of the CB. Consistent with results of previous studies, the most frequently observed type of confabulation in the CS was Habits Confabulation. The CS had high construct validity and good discriminative validity in terms of sensitivity and specificity. Cut-off scores for clinical and research purposes are proposed. The CS provides efficient and valid screening for confabulatory impairment.
Validity of the posttraumatic stress disorders (PTSD) checklist in pregnant women.
Gelaye, Bizu; Zheng, Yinnan; Medina-Mora, Maria Elena; Rondon, Marta B; Sánchez, Sixto E; Williams, Michelle A
2017-05-12
The PTSD Checklist-civilian (PCL-C) is one of the most commonly used self-report measures of PTSD symptoms; however, little is known about its validity when used in pregnancy. This study aims to evaluate the reliability and validity of the PCL-C as a screen for detecting PTSD symptoms among pregnant women. A total of 3372 pregnant women who attended their first prenatal care visit in Lima, Peru participated in the study. We assessed the reliability of the PCL-C items using Cronbach's alpha. Criterion validity and performance characteristics of PCL-C were assessed against an independent, blinded Clinician-Administered PTSD Scale (CAPS) interview using measures of sensitivity, specificity and receiver operating characteristics (ROC) curves. We tested construct validity using exploratory and confirmatory factor analytic approaches. The reliability of the PCL-C was excellent (Cronbach's alpha = 0.90). ROC analysis showed that a cut-off score of 26 offered optimal discriminatory power, with a sensitivity of 0.86 (95% CI: 0.78-0.92) and a specificity of 0.63 (95% CI: 0.62-0.65). The area under the ROC curve was 0.75 (95% CI: 0.71-0.78). A three-factor solution was extracted using exploratory factor analysis and was further complemented with three other models using confirmatory factor analysis (CFA). In a CFA, a three-factor model based on DSM-IV symptom structure had reasonable fit statistics with comparative fit index of 0.86 and root mean square error of approximation of 0.09. The Spanish-language version of the PCL-C may be used as a screening tool for pregnant women. The PCL-C has good reliability, criterion validity and factorial validity. The optimal cut-off score obtained by maximizing the sensitivity and specificity should be considered cautiously; women who screened positive may require further investigation to confirm PTSD diagnosis.
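One common way to choose a cut-off that jointly maximizes sensitivity and specificity is Youden's J statistic (sensitivity + specificity - 1). The Python sketch below scans every observed score as a candidate cut-off. The use of Youden's J is an assumption, since the abstract does not name the criterion, and the scores and diagnoses are hypothetical.

```python
def best_cutoff(scores, has_condition):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    A case screens positive when its score is >= cutoff.
    Ties in J keep the lowest cutoff.
    """
    positives = [s for s, d in zip(scores, has_condition) if d]
    negatives = [s for s, d in zip(scores, has_condition) if not d]
    best = None
    for cutoff in sorted(set(scores)):
        sens = sum(s >= cutoff for s in positives) / len(positives)
        spec = sum(s < cutoff for s in negatives) / len(negatives)
        j = sens + spec - 1.0
        if best is None or j > best[0]:
            best = (j, cutoff, sens, spec)
    return best[1:]


# Hypothetical screening scores and reference-interview diagnoses.
scores = [18, 20, 22, 25, 27, 30, 33, 36, 40, 45]
has_condition = [False, False, False, False, True, False, True, True, True, True]

cutoff, sens, spec = best_cutoff(scores, has_condition)
print(cutoff, sens, spec)
```

Because J weights missed cases and false alarms equally, a cut-off chosen this way may not suit a screening setting where one kind of error is costlier than the other.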
Anota, Amélie; Mariet, Anne-Sophie; Maingon, Philippe; Joly, Florence; Bosset, Jean-François; Guizard, Anne-Valérie; Bittard, Hugues; Velten, Michel; Mercier, Mariette
2016-12-06
Health-related quality of life (HRQoL) has been positioned as one of the major endpoints in oncology. Thus, there is a need to validate cancer-site specific survey instruments. This study aimed to perform a transcultural adaptation of the 50-item Expanded Prostate cancer Index Composite (EPIC) questionnaire for HRQoL in prostate cancer patients and to validate the psychometric properties of the French-language version. The EPIC questionnaire measures urinary, bowel, sexual and hormonal domains. The first step, corresponding to transcultural adaptation of the original English version of the EPIC, was performed according to the back translation technique. The second step, comprising the validation of the psychometric properties of the EPIC questionnaire, was performed in patients under treatment for localized prostate cancer (treatment group) and in patients cured of prostate cancer (cured group). The EORTC QLQ-C30 and QLQ-PR25 prostate cancer module were also completed by patients to assess criterion validity. Two assessments were performed, i.e., before and at the end of treatment for the treatment group, to assess sensitivity to change; and at 2 weeks' interval in the cured group to assess test-retest reliability. Psychometric properties were explored according to classical test theory. The first step showed overall good acceptability and understanding of the questionnaire. In the second step, 215 patients were included from January 2012 to June 2014: 125 in the treatment group, and 90 in the cured group. All domains exhibited good internal consistency, except the bowel domain (Cronbach's α = 0.61). No floor effect was observed. Test-retest reliability assessed in the cured group was acceptable, except for bowel function (intraclass coefficient = 0.68). Criterion validity was good for each domain and subscale. Construct validity was not demonstrated for the hormonal and bowel domains. Sensitivity to change was exhibited for 5/8 subscales and 2/4 summary scores for patients who experienced toxicities during treatment. The French EPIC questionnaire seems to have adequate psychometric properties, comparable to those exhibited by the original English-language version, except for the construct validity, which was not available in the original version.
Stoll, C; Kapfhammer, H P; Rothenhäusler, H B; Haller, M; Briegel, J; Schmidt, M; Krauseneck, T; Durst, K; Schelling, G
1999-07-01
Many survivors of critical illness and intensive care unit (ICU) treatment have traumatic memories such as nightmares, panic or pain which can be associated with the development of posttraumatic stress disorder (PTSD). In order to simplify the rapid and early detection of PTSD in such patients, we modified an existing questionnaire for diagnosis of PTSD and validated the instrument in a cohort of ARDS patients after long-term ICU therapy. Follow-up cohort study. The 20-bed ICU of a university teaching hospital. A cohort of 52 long-term survivors of the acute respiratory distress syndrome (ARDS). The questionnaire was administered to the study cohort at two time points 2 years apart. At the second evaluation, the patients underwent a structured interview with two trained psychiatrists to diagnose PTSD according to Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV) criteria. The reliability and validity of the questionnaire were then estimated and its specificity, sensitivity and optimal decision threshold determined using receiver operating characteristic (ROC) curve analyses. The questionnaire showed a high internal consistency (Cronbach's alpha = 0.93) and a high test-retest reliability (intraclass correlation coefficient alpha = 0.89). There was evidence of construct validity by a linear relationship between scores and the number of traumatic memories from the ICU the patients described (Spearman's rho = 0.48, p < 0.01). Criterion validity was demonstrated by ROC curve analyses resulting in a sensitivity of 77.0% and a specificity of 97.5% for the diagnosis of PTSD. The questionnaire was found to be a responsive, valid and reliable instrument to screen survivors of intensive care for PTSD.
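Internal consistency is usually summarized with Cronbach's alpha, which compares the sum of the item variances with the variance of the total score. The following Python sketch implements the standard formula; the function name and the response matrix are hypothetical and unrelated to the questionnaire data above.

```python
from statistics import pvariance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    # Variance of each item across respondents.
    item_vars = [pvariance([r[i] for r in responses]) for i in range(k)]
    # Variance of the total (summed) score across respondents.
    total_var = pvariance([sum(r) for r in responses])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)


# Hypothetical data: three items that rank four respondents identically.
responses = [[1, 2, 1], [2, 3, 2], [3, 4, 3], [4, 5, 4]]
alpha = cronbach_alpha(responses)
print(alpha)
```

The items in this toy matrix move in lockstep, so alpha reaches its upper bound; real questionnaires fall below it to the degree that items disagree.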
Koshy, Anson J.; Watkins, Marley W.; Cassano, Michael C.; Wahlberg, Andrea C.; Mautone, Jennifer A.; Blum, Nathan J.
2013-01-01
Objective: To evaluate the construct validity of the Behavioral Health Checklist (BHCL) for children aged from 4 to 12 years from diverse backgrounds. Method: The parents of 4–12-year-old children completed the BHCL in urban and suburban primary care practices affiliated with a tertiary-care children’s hospital. Across practices, 1,702 were eligible and 1,406 (82.6%) provided consent. Children of participating parents were primarily non-Hispanic black/African American and white/Caucasian from low- to middle-income groups. Confirmatory factor analyses examined model fit for the total sample and subsamples defined by demographic characteristics. Results: The findings supported the hypothesized 3-factor structure: Internalizing Problems, Externalizing Problems, and Inattention/Hyperactivity. The model demonstrated adequate to good fit across age-groups, gender, races, income groups, and suburban versus urban practices. Conclusion: The findings provide strong evidence of the construct validity, developmental appropriateness, and cultural sensitivity of the BHCL when used for screening in primary care. PMID:23978505
Gu, Hairong; Kim, Woojae; Hou, Fang; Lesmes, Luis Andres; Pitt, Mark A.; Lu, Zhong-Lin; Myung, Jay I.
2016-01-01
Measurement efficiency is of concern when a large number of observations are required to obtain reliable estimates for parametric models of vision. The standard entropy-based Bayesian adaptive testing procedures addressed the issue by selecting the most informative stimulus in sequential experimental trials. Noninformative, diffuse priors were commonly used in those tests. Hierarchical adaptive design optimization (HADO; Kim, Pitt, Lu, Steyvers, & Myung, 2014) further improves the efficiency of the standard Bayesian adaptive testing procedures by constructing an informative prior using data from observers who have already participated in the experiment. The present study represents an empirical validation of HADO in estimating the human contrast sensitivity function. The results show that HADO significantly improves the accuracy and precision of parameter estimates, and therefore requires many fewer observations to obtain reliable inference about contrast sensitivity, compared to the method of quick contrast sensitivity function (Lesmes, Lu, Baek, & Albright, 2010), which uses the standard Bayesian procedure. The improvement with HADO was maintained even when the prior was constructed from heterogeneous populations or a relatively small number of observers. These results of this case study support the conclusion that HADO can be used in Bayesian adaptive testing by replacing noninformative, diffuse priors with statistically justified informative priors without introducing unwanted bias. PMID:27105061
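Entropy-based Bayesian adaptive testing picks, on each trial, the stimulus expected to leave the least uncertainty about the model parameters. The Python sketch below shows that selection step for a single threshold parameter on a discrete grid. The logistic psychometric function, the one-parameter model, the grids, and all names are assumptions for illustration; they are not the contrast sensitivity function model or the HADO implementation used in the study.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def p_detect(stimulus, threshold, slope=2.0):
    """Hypothetical logistic psychometric function."""
    return 1.0 / (1.0 + math.exp(-slope * (stimulus - threshold)))


def expected_posterior_entropy(stimulus, thresholds, prior):
    """Posterior entropy averaged over the two possible responses."""
    likelihood = [p_detect(stimulus, t) for t in thresholds]
    expected = 0.0
    for detected in (True, False):
        joint = [pr * (lk if detected else 1.0 - lk)
                 for pr, lk in zip(prior, likelihood)]
        p_response = sum(joint)                      # marginal probability of this response
        posterior = [j / p_response for j in joint]  # Bayes update on the grid
        expected += p_response * entropy(posterior)
    return expected


def most_informative(stimuli, thresholds, prior):
    """Stimulus with the lowest expected posterior entropy."""
    return min(stimuli, key=lambda s: expected_posterior_entropy(s, thresholds, prior))


thresholds = [-2.0, -1.0, 0.0, 1.0, 2.0]   # candidate parameter values
diffuse_prior = [0.2] * 5                  # noninformative prior
stimuli = [-3.0, -1.5, 0.0, 1.5, 3.0]      # candidate stimulus levels

chosen = most_informative(stimuli, thresholds, diffuse_prior)
print(chosen)
```

The selection function accepts any prior over the grid, so an informative prior built from earlier observers, as the abstract describes for HADO, could be passed in place of the diffuse one without changing the selection step.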
Chien, Wai-Tong; Lee, Isabella Yuet-Ming; Wang, Li-Qun
2017-01-01
The purpose of this study was to test the reliability, validity, and factor structure of a Chinese version of the Psychotic Symptom Rating Scale (PSYRATS) in 198 and 202 adult patients with recent-onset and chronic psychosis, respectively. The PSYRATS has been translated into different language versions and has been validated for clinical and research use mainly in chronic psychotic patients but not in recent-onset psychosis patients or in Chinese populations. The psychometric analysis of the translated Chinese version included assessment of its content validity, semantic equivalence, interrater and test-retest reliability, reproducibility, sensitivity to changes in psychotic symptoms, internal consistency, concurrent validity (compared to a valid psychotic symptom scale), and factor structure. The Chinese version demonstrated very satisfactory content validity as rated by an expert panel, good semantic equivalence with the original version, and high interrater and test-retest (at 2-week interval) reliability. It also indicated very good reproducibility of and sensitivity to changes in psychotic symptoms in line with the symptom severity measured with the Positive and Negative Syndrome Scale (PANSS). The scale consisted of four factors for the hallucination subscale and two factors for the delusion subscale, explaining about 80% of the total variance of the construct, indicating satisfactory correlations between the hallucination and delusion factors themselves, between items, factors, subscales, and overall scale, and between factors and relevant item and subscale scores of the PANSS. The Chinese version of the PSYRATS is a reliable and valid instrument to measure symptom severity in Chinese psychotic patients complementary to other existing measures mainly in English language.
A framework to enhance security of physically unclonable functions using chaotic circuits
NASA Astrophysics Data System (ADS)
Chen, Lanxiang
2018-05-01
As a new technique for authentication and key generation, the physically unclonable function (PUF) has attracted considerable attention, and extensive research results have already been achieved. To resist the popular machine learning modeling attacks, a framework to enhance the security of PUFs is proposed. The basic idea is to combine PUFs with a chaotic system whose response is highly sensitive to initial conditions. For this framework, a specific construction that combines the common arbiter PUF circuit, a converter, and Chua's circuit is given to implement a more secure PUF. Simulation experiments are presented to further validate the framework. Finally, some practical suggestions for the framework and the specific construction are also discussed.
Climate change and heat-related mortality in six cities Part 1: model construction and validation
NASA Astrophysics Data System (ADS)
Gosling, Simon N.; McGregor, Glenn R.; Páldy, Anna
2007-08-01
Heat waves are expected to increase in frequency and magnitude with climate change. The first part of a study to produce projections of the effect of future climate change on heat-related mortality is presented. Separate city-specific empirical statistical models that quantify significant relationships between summer daily maximum temperature (Tmax) and daily heat-related deaths are constructed from historical data for six cities: Boston, Budapest, Dallas, Lisbon, London, and Sydney. ‘Threshold temperatures’ above which heat-related deaths begin to occur are identified. The results demonstrate significantly lower thresholds in ‘cooler’ cities exhibiting lower mean summer temperatures than in ‘warmer’ cities exhibiting higher mean summer temperatures. Analysis of individual ‘heat waves’ illustrates that a greater proportion of mortality is due to mortality displacement in cities with less sensitive temperature-mortality relationships than in those with more sensitive relationships, and that mortality displacement is no longer a feature more than 12 days after the end of the heat wave. Validation techniques through residual and correlation analyses of modelled and observed values and comparisons with other studies indicate that the observed temperature-mortality relationships are represented well by each of the models. The models can therefore be used with confidence to examine future heat-related deaths under various climate change scenarios for the respective cities (presented in Part 2).
Månsson, Viktor; Gilsdorf, Janet R; Kahlmeter, Gunnar; Kilian, Mogens; Kroll, J Simon; Riesbeck, Kristian; Resman, Fredrik
2018-03-01
Encapsulated Haemophilus influenzae strains belong to type-specific genetic lineages. Reliable capsule typing requires PCR, but a more efficient method would be useful. We evaluated capsule typing by using matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry. Isolates of all capsule types (a-f and nontypeable; n = 258) and isogenic capsule transformants (types a-d) were investigated. Principal component and biomarker analyses of mass spectra showed clustering, and mass peaks correlated with capsule type-specific genetic lineages. We used 31 selected isolates to construct a capsule typing database. Validation with the remaining isolates (n = 227) showed 100% sensitivity and 92.2% specificity for encapsulated strains (a-f; n = 61). Blinded validation of a supplemented database (n = 50) using clinical isolates (n = 126) showed 100% sensitivity and 100% specificity for encapsulated strains (b, e, and f; n = 28). MALDI-TOF mass spectrometry is an accurate method for capsule typing of H. influenzae.
Cabrera-Arana, Gustavo A; Londoño-Pimienta, Jaime L; Bello-Parías, León D
2008-01-01
Validating an instrument for measuring the perceived quality of services received by people using hospitals forming part of the Colombian Ministry of Social Protection's restructuring, redesigning and modernisation programme for health-service providing networks. Sánchez and Echeverri's guidelines for validating health quality measurement scales were followed due to the lack of a valid instrument for doing this in Colombia. Conceptual synthesis led to identifying a structure of constituent indicators, domains and sub-domains regarding the perception of health service quality. A list of reactions (having a scale for categorising the replies) was analysed according to the validity of appearance, construct, criteria and utility as criteria for sensitivity and usefulness. Successive revisions and three rounds of field-trials led to producing PECASUSS, an acronym given to the instrument for measuring users' perception of health service quality (Percepción de Calidad Según Usuarios de Servicios de Salud). The guidelines effectively orientated the validation of the instrument required for measuring the perceived quality of health services received by people using hospitals forming part of the programme.
Kashikar-Zuck, Susmita; Carle, Adam; Barnett, Kimberly; Goldschneider, Kenneth R.; Sherry, David D.; Mara, Constance A.; Cunningham, Natoshia; Farrell, Jennifer; Tress, Jenna; DeWitt, Esi Morgan
2015-01-01
The Patient Reported Outcomes Measurement Information System (PROMIS) initiative is a comprehensive strategy by the National Institutes of Health to support the development and validation of precise instruments to assess self-reported health domains across healthy and disease-specific populations. Much progress has been made in instrument development but there remains a gap in the validation of PROMIS measures for pediatric chronic pain. The purpose of this study was to investigate the construct validity and responsiveness to change of seven PROMIS domains for the assessment of children (ages 8-18) with chronic pain – Pain Interference, Fatigue, Anxiety, Depression, Mobility, Upper Extremity Function and Peer Relationships. PROMIS measures were administered at the initial visit and two follow-up visits at an outpatient chronic pain clinic (CPC; N=82) and at an intensive amplified pain day-treatment program (AMP; N=63). Aim 1 examined construct validity of PROMIS measures by comparing them with corresponding “legacy” measures administered as part of usual care in the CPC sample. Aim 2 examined sensitivity to change in both CPC and AMP samples. Longitudinal growth models showed that PROMIS Pain Interference, Anxiety, Depression, Mobility, Upper Extremity and Peer Relationship measures and legacy instruments generally performed similarly, with slightly steeper slopes of improvement in legacy measures. All seven PROMIS domains showed responsiveness to change. Results offered initial support for the validity of PROMIS measures in pediatric chronic pain. Further validation with larger and more diverse pediatric pain samples and additional legacy measures would broaden the scope of use of PROMIS in clinical research. PMID:26447704
An approach to measure parameter sensitivity in watershed ...
Hydrologic responses vary spatially and temporally according to watershed characteristics. In this study, the hydrologic models that we developed earlier for the Little Miami River (LMR) and Las Vegas Wash (LVW) watersheds were used for detailed sensitivity analyses. To compare the relative sensitivities of the hydrologic parameters of these two models, we used Normalized Root Mean Square Error (NRMSE). By combining the NRMSE index with the flow duration curve analysis, we derived an approach to measure parameter sensitivities under different flow regimes. Results show that the parameters related to groundwater are highly sensitive in the LMR watershed, whereas the LVW watershed is primarily sensitive to near-surface and impervious parameters. The high and medium flows are more impacted by most of the parameters. The low-flow regime was highly sensitive to groundwater-related parameters. Moreover, our approach is found to be useful in facilitating model development and calibration. This journal article describes hydrological modeling of climate change and land use changes on stream hydrology, and elucidates the importance of hydrological model construction in generating valid modeling results.
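As an illustration of the NRMSE index mentioned above, here is a minimal sketch. The abstract does not say how the error is normalised, so dividing by the range of the reference series is an assumption (dividing by the mean is another common convention). The function name and the flow values are hypothetical and are not taken from the study.

```python
import math

def nrmse(reference, simulated):
    """RMSE between two series, divided by the range of the reference series.

    Normalising by the range is an assumed convention, not the study's stated one.
    """
    if len(reference) != len(simulated) or not reference:
        raise ValueError("series must be non-empty and of equal length")
    mse = sum((r - s) ** 2 for r, s in zip(reference, simulated)) / len(reference)
    spread = max(reference) - min(reference)
    if spread == 0:
        raise ValueError("reference series has zero range")
    return math.sqrt(mse) / spread

# Hypothetical daily flows (arbitrary units) from a base run and from a run
# in which a single model parameter has been perturbed.
base_run = [10.0, 12.0, 15.0, 20.0, 18.0]
perturbed_run = [11.0, 13.0, 16.0, 21.0, 19.0]

parameter_sensitivity = nrmse(base_run, perturbed_run)
```

A larger value for one parameter than for another would suggest that the simulated flow responds more strongly to the first; applying the same function separately to the high, medium and low portions of the flow record is one way such a comparison could be split by flow regime.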
Exhaled molecular profiles in the assessment of cystic fibrosis and primary ciliary dyskinesia.
Paff, T; van der Schee, M P; Daniels, J M A; Pals, G; Postmus, P E; Sterk, P J; Haarman, E G
2013-09-01
Early diagnosis and monitoring of disease activity are essential in cystic fibrosis (CF) and primary ciliary dyskinesia (PCD). We aimed to establish exhaled molecular profiles as the first step in assessing the potential of breath analysis. Exhaled breath was analyzed by electronic nose in 25 children with CF, 25 with PCD and 23 controls. Principal component reduction and canonical discriminant analysis were used to construct internally cross-validated ROC curves. CF and PCD patients had significantly different breath profiles when compared to healthy controls (CF: sensitivity 84%, specificity 65%; PCD: sensitivity 88%, specificity 52%) and from each other (sensitivity 84%, specificity 60%). Patients with and without exacerbations had significantly different breath profiles (CF: sensitivity 89%, specificity 56%; PCD: sensitivity 100%, specificity 90%). Exhaled molecular profiles significantly differ between patients with CF, PCD and controls. The eNose may have potential in disease monitoring based on the influence of exacerbations on the VOC-profile. Copyright © 2012 European Cystic Fibrosis Society. Published by Elsevier B.V. All rights reserved.
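For readers less familiar with the two measures reported above, the following sketch shows how sensitivity and specificity follow from a two-by-two classification table. The abstract gives only group sizes and percentages, so the counts below are hypothetical values chosen to be consistent with the CF-versus-control figures (25 patients, 23 controls); they are not the study's actual table.

```python
def sensitivity(true_pos, false_neg):
    """Proportion of diseased subjects that the test classifies as diseased."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Proportion of healthy subjects that the test classifies as healthy."""
    return true_neg / (true_neg + false_pos)

# Hypothetical counts: 21 of 25 CF patients flagged, 15 of 23 controls cleared.
cf_sensitivity = sensitivity(true_pos=21, false_neg=4)
cf_specificity = specificity(true_neg=15, false_pos=8)

print(f"sensitivity {cf_sensitivity:.0%}, specificity {cf_specificity:.0%}")
```

With groups this small, a single reclassified subject shifts either figure by about four percentage points, which is worth keeping in mind when comparing the reported values.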
Peters, Lorna; Sunderland, Matthew; Andrews, Gavin; Rapee, Ronald M; Mattick, Richard P
2012-03-01
Shortened forms of the Social Interaction Anxiety Scale (SIAS) and the Social Phobia Scale (SPS) were developed using nonparametric item response theory methods. Using data from socially phobic participants enrolled in 5 treatment trials (N = 456), 2 six-item scales (the SIAS-6 and the SPS-6) were developed. The validity of the scores on the SIAS-6 and the SPS-6 was then tested using traditional methods for their convergent validity in an independent clinical sample and a student sample, as well as for their sensitivity to change and diagnostic sensitivity in the clinical sample. The scores on the SIAS-6 and the SPS-6 correlated as well as the scores on the original SIAS and SPS with scores on measures of related constructs, discriminated well between those with and without a diagnosis of social phobia (providing cutoffs for diagnosis), and were as sensitive to change associated with treatment as were the SIAS and SPS. Together, the SIAS-6 and the SPS-6 appear to be an efficient method of measuring symptoms of social phobia and provide a brief screening tool.
Construct Validation of the Dietary Inflammatory Index among Postmenopausal Women
Tabung, Fred K.; Steck, Susan E.; Zhang, Jiajia; Ma, Yunsheng; Liese, Angela D.; Agalliu, Ilir; Hingle, Melanie; Hou, Lifang; Hurley, Thomas G.; Jiao, Li; Martin, Lisa W.; Millen, Amy E.; Park, Hannah L.; Rosal, Milagros C.; Shikany, James M.; Shivappa, Nitin; Ockene, Judith K.; Hebert, James R.
2015-01-01
Purpose Many dietary factors have either pro- or anti-inflammatory properties. We previously developed a dietary inflammatory index (DII) to assess the inflammatory potential of diet. In this study we conducted a construct validation of the DII based on data from a food frequency questionnaire and three inflammatory biomarkers in a subsample of 2,567 postmenopausal women in the Women’s Health Initiative Observational Study. Methods We used multiple linear and logistic regression models, controlling for potential confounders, to test whether baseline DII predicted concentrations of interleukin-6 (IL-6), high-sensitivity C-reactive protein (hs-CRP), tumor necrosis factor alpha receptor 2 (TNFα-R2), or an overall biomarker score combining all three inflammatory biomarkers. Results The DII was associated with the four biomarkers with beta estimates (95%CI) comparing the highest with lowest DII quintiles as follows: IL-6: 1.26 (1.15, 1.38), Ptrend<0.0001; TNFα-R2: 81.43 (19.15, 143.71), Ptrend=0.004; dichotomized hs-CRP (odds ratio for higher versus lower hs-CRP): 1.30 (0.97, 1.67), Ptrend=0.34; and the combined inflammatory biomarker score: 0.26 (0.12, 0.40), Ptrend=0.0001. Conclusion The DII was significantly associated with inflammatory biomarkers. Construct validity of the DII indicates its utility for assessing the inflammatory potential of diet and for expanding its use to include associations with common chronic diseases in future studies. PMID:25900255
Moving to Capture Children’s Attention: Developing a Methodology for Measuring Visuomotor Attention
Coats, Rachel O.; Mushtaq, Faisal; Williams, Justin H. G.; Aucott, Lorna S.; Mon-Williams, Mark
2016-01-01
Attention underpins many activities integral to a child’s development. However, methodological limitations currently make large-scale assessment of children’s attentional skill impractical, costly and lacking in ecological validity. Consequently we developed a measure of ‘Visual Motor Attention’ (VMA)—a construct defined as the ability to sustain and adapt visuomotor behaviour in response to task-relevant visual information. In a series of experiments, we evaluated the capability of our method to measure attentional processes and their contributions in guiding visuomotor behaviour. Experiment 1 established the method’s core features (ability to track stimuli moving on a tablet-computer screen with a hand-held stylus) and demonstrated its sensitivity to principled manipulations in adults’ attentional load. Experiment 2 standardised a format suitable for use with children and showed construct validity by capturing developmental changes in executive attention processes. Experiment 3 tested the hypothesis that children with and without coordination difficulties would show qualitatively different response patterns, finding an interaction between the cognitive and motor factors underpinning responses. Experiment 4 identified associations between VMA performance and existing standardised attention assessments and thereby confirmed convergent validity. These results establish a novel approach to measuring childhood attention that can produce meaningful functional assessments that capture how attention operates in an ecologically valid context (i.e. attention's specific contribution to visuomanual action). PMID:27434198
Bozcuk, H; Yıldız, M; Artaç, M; Kocer, M; Kaya, Ç; Ulukal, E; Ay, S; Kılıç, M P; Şimşek, E H; Kılıçkaya, P; Uçar, S; Coskun, H S; Savas, B
2015-06-01
There is a clinical need to predict the risk of febrile neutropenia before a specific cycle of chemotherapy in cancer patients. Data on 3882 chemotherapy cycles in 1089 consecutive patients with lung, breast, and colon cancer from four teaching hospitals were used to construct a predictive model for febrile neutropenia. A final nomogram derived from the multivariate predictive model was prospectively confirmed in a second cohort of 960 consecutive cases and 1444 cycles. The following factors were used to construct the nomogram: previous history of febrile neutropenia, pre-cycle lymphocyte count, type of cancer, cycle of current chemotherapy, and patient age. The predictive model had a concordance index of 0.95 (95% confidence interval (CI) = 0.91-0.99) in the derivation cohort and 0.85 (95% CI = 0.80-0.91) in the external validation cohort. A threshold of 15% for the risk of febrile neutropenia in the derivation cohort was associated with a sensitivity of 0.76 and specificity of 0.98. These figures were 1.00 and 0.49 in the validation cohort if a risk threshold of 50% was chosen. This nomogram is helpful in the prediction of febrile neutropenia after chemotherapy in patients with lung, breast, and colon cancer. Usage of this nomogram may help decrease the morbidity and mortality associated with febrile neutropenia and deserves further validation.
Burke, Kylie; McCarthy, Maria; Lowe, Cherie; Sanders, Matthew R; Lloyd, Erin; Bowden, Madeleine; Williams, Lauren
2017-03-01
Childhood cancer is associated with child adjustment difficulties, including eating and sleep disturbance and emotional and other behavioral difficulties. However, there is a lack of validated instruments to measure the specific child adjustment issues associated with pediatric cancer treatments. The aim of this study was to develop and evaluate the reliability and validity of a parent-reported child adjustment scale. One hundred thirty-two parents from two pediatric oncology centers who had children (aged 2-10 years) diagnosed with cancer completed the newly developed measure and additional measures of child behavior, sleep, diet, and quality of life. Children were more than 4 weeks postdiagnosis and less than 12 months postactive treatment. Factor structure, internal consistency, and construct (convergent) validity analyses were conducted. Principal component analysis revealed five distinct and theoretically coherent factors: Sleep Difficulties, Impact of Child's Illness, Eating Difficulties, Hospital-Related Behavior Difficulties, and General Behavior Difficulties. The final 25-item measure, the Children's Oncology Child Adjustment Scale (ChOCs), demonstrated good internal consistency (α = 0.79-0.91). Validity of the ChOCs was demonstrated by significant correlations between the subscales and measures of corresponding constructs. The ChOCs provides a new measure of child adjustment difficulties designed specifically for pediatric oncology. Preliminary analyses indicate strong theoretical and psychometric properties. Future studies are required to further examine reliability and validity of the scale, including test-retest reliability, discriminant validity, as well as change sensitivity and generalizability across different oncology samples and ages of children. The ChOCs shows promise as a measure of child adjustment relevant for oncology clinical settings and research purposes. © 2016 Wiley Periodicals, Inc.
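The internal-consistency coefficient α quoted above is conventionally Cronbach's alpha; assuming that is what was computed, the sketch below shows the standard formula, α = k/(k−1) · (1 − Σ item variances / variance of total scores). The response matrix is invented for illustration and is unrelated to the ChOCs data.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    n_items = len(responses[0])
    if n_items < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_variances = [variance(column) for column in zip(*responses)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)

# Hypothetical ratings from five respondents on a three-item subscale.
ratings = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 1],
    [5, 4, 5],
    [3, 3, 4],
]
alpha = cronbach_alpha(ratings)
```

Alpha rises both when items correlate more strongly and when a subscale has more items, so values for subscales of different lengths are not directly comparable.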
Validation of an Arabic version of the Diabetes Treatment Satisfaction Questionnaire in Qatar.
Wilbur, Kerry; Al Hammaq, Abdulla O
2016-03-01
Several instruments evaluate patient-reported outcomes in diabetes mellitus (DM), but almost none are validated for use in the Arabic language. The aim of this study is to test the psychometric properties and responsiveness of the Arabic version of the Diabetes Treatment Satisfaction Questionnaire (DTSQs) in Qatar. Ambulatory Arabic-speaking DM patients were interviewed at two consecutive time points in Doha, Qatar. The 8-item DTSQs was administered in conjunction with the Medical Outcomes Study 36-Item Short-Form Health Survey (SF-36) and the World Health Organization Quality of Life Measure (WHOQOL-Bref) to assess convergent validity. Reliability was evaluated by internal consistency and item analysis. Construct validity was evaluated using "known groups" comparisons (including gender, insulin use, and HbA1c). Sensitivity of DTSQs scores to the subject's metabolic conditions was determined. One hundred subjects (mean age 50.7) participated. Just over half (54%) were female. The majority (93%) had Type 2 DM, but 39 (42%) were using insulin. Results revealed satisfactory internal consistency. Metabolic measures (fasting blood glucose and A1C) had significant inverse correlations with DTSQs scores (interview 1, Pearson's r=-0.333 and r=-0.401, respectively, p<0.01). Scale criterion and construct validity were found to be satisfactory. Most sub-dimensions of the SF-36 and WHOQOL-Bref were correlated with the DTSQs, indicating good concurrent validity. As in prior studies, women demonstrated poorer treatment satisfaction. The Qatar Arabic DTSQs version was found to be a reliable and valid instrument for the assessment of treatment satisfaction in Arabic diabetes mellitus patients in the country. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
A whole blood gene expression-based signature for smoking status
2012-01-01
Background Smoking is the leading cause of preventable death worldwide and has been shown to increase the risk of multiple diseases including coronary artery disease (CAD). We sought to identify genes whose levels of expression in whole blood correlate with self-reported smoking status. Methods Microarrays were used to identify gene expression changes in whole blood which correlated with self-reported smoking status; a set of significant genes from the microarray analysis were validated by qRT-PCR in an independent set of subjects. Stepwise forward logistic regression was performed using the qRT-PCR data to create a predictive model whose performance was validated in an independent set of subjects and compared to cotinine, a nicotine metabolite. Results Microarray analysis of whole blood RNA from 209 PREDICT subjects (41 current smokers, 4 quit ≤ 2 months, 64 quit > 2 months, 100 never smoked; NCT00500617) identified 4214 genes significantly correlated with self-reported smoking status. qRT-PCR was performed on 1,071 PREDICT subjects across 256 microarray genes significantly correlated with smoking or CAD. A five gene (CLDND1, LRRN3, MUC1, GOPC, LEF1) predictive model, derived from the qRT-PCR data using stepwise forward logistic regression, had a cross-validated mean AUC of 0.93 (sensitivity=0.78; specificity=0.95), and was validated using 180 independent PREDICT subjects (AUC=0.82, CI 0.69-0.94; sensitivity=0.63; specificity=0.94). Plasma from the 180 validation subjects was used to assess levels of cotinine; a model using a threshold of 10 ng/ml cotinine resulted in an AUC of 0.89 (CI 0.81-0.97; sensitivity=0.81; specificity=0.97; kappa with expression model = 0.53). Conclusion We have constructed and validated a whole blood gene expression score for the evaluation of smoking status, demonstrating that clinical and environmental factors contributing to cardiovascular disease risk can be assessed by gene expression. PMID:23210427
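The kappa statistic reported above measures how far two classifications agree beyond what chance alone would produce. A minimal sketch of Cohen's kappa for two sets of labels follows; it assumes the abstract's "kappa" is Cohen's kappa, and the label vectors are invented (1 = classified as smoker, 0 = non-smoker) rather than taken from the PREDICT subjects.

```python
def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa: (observed agreement - chance agreement) / (1 - chance agreement)."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    categories = set(labels_a) | set(labels_b)
    # Agreement expected if the two raters assigned labels independently.
    expected = sum(
        (labels_a.count(c) / n) * (labels_b.count(c) / n) for c in categories
    )
    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)

# Hypothetical calls for ten subjects from two methods.
expression_calls = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
cotinine_calls = [1, 1, 0, 0, 0, 0, 0, 0, 0, 1]
kappa = cohens_kappa(expression_calls, cotinine_calls)
```

In this toy case the two methods agree on 8 of 10 subjects, yet kappa is only about 0.52 because most of that agreement would be expected by chance when non-smokers dominate.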
McAlinden, Colm; Pesudovs, Konrad; Moore, Jonathan E
2010-11-01
To develop an instrument to measure subjective quality of vision: the Quality of Vision (QoV) questionnaire. A 30-item instrument was designed with 10 symptoms rated in each of three scales (frequency, severity, and bothersome). The QoV was completed by 900 subjects in groups of spectacle wearers, contact lens wearers, and those having had laser refractive surgery, intraocular refractive surgery, or eye disease and investigated with Rasch analysis and traditional statistics. Validity and reliability were assessed by Rasch fit statistics, principal components analysis (PCA), person separation, differential item functioning (DIF), item targeting, construct validity (correlation with visual acuity, contrast sensitivity, total root mean square [RMS] higher order aberrations [HOA]), and test-retest reliability (two-way random intraclass correlation coefficients [ICC] and 95% repeatability coefficients [R(c)]). Rasch analysis demonstrated good precision, reliability, and internal consistency for all three scales (mean square infit and outfit within 0.81-1.27; PCA >60% variance explained by the principal component; person separation 2.08, 2.10, and 2.01 respectively; and minimal DIF). Construct validity was indicated by strong correlations with visual acuity, contrast sensitivity and RMS HOA. Test-retest reliability was evidenced by a minimum ICC of 0.867 and a minimum 95% R(c) of 1.55 units. The QoV Questionnaire consists of a Rasch-tested, linear-scaled, 30-item instrument on three scales providing a QoV score in terms of symptom frequency, severity, and bothersome. It is suitable for measuring QoV in patients with all types of refractive correction, eye surgery, and eye disease that cause QoV problems.
Yun, Young Ho; Kang, Eun Kyo; Lee, Jihye; Choo, Jiyeon; Ryu, Hyewon; Yun, Hye-Min; Kang, Jung Hun; Kim, Tae You; Sim, Jin-Ah; Kim, Yaeji
2018-03-05
In this study, we aimed to develop and validate an instrument that could be used by patients with cancer to evaluate their quality of palliative care. Development of the questionnaire followed a four-phase process: item generation and reduction, construction, pilot testing, and field testing. Based on the literature, we constructed a list of items for the quality of palliative care from 104 quality care issues divided into 14 subscales. We constructed scales of 43 items that only the cancer patients were asked to answer. Using relevance and feasibility criteria and pilot testing, we developed a 44-item questionnaire. To assess the sensitivity and validity of the questionnaire, we recruited 220 patients over 18 years of age from three Korean hospitals. Factor analysis of the data and fit statistics resulted in the 4-factor, 32-item Quality Care Questionnaire-Palliative Care (QCQ-PC), which covers appropriate communication with health care professionals (ten items), discussing value of life and goals of care (nine items), support and counseling for needs of holistic care (seven items), and accessibility and sustainability of care (six items). All subscales and total scores showed a high internal consistency (Cronbach alpha range, 0.89 to 0.97). Multi-trait scaling analysis showed good convergent (0.568-0.995) and discriminant (0.472-0.869) validity. The correlation between the total and subscale scores of QCQ-PC and those of EORTC QLQ-C15-PAL, MQOL, SAT-SF, and DCS was obtained. This study demonstrates that the QCQ-PC can be adopted to assess the quality of care in patients with cancer.
Automatic identification of variables in epidemiological datasets using logic regression.
Lorenz, Matthias W; Abdi, Negin Ashtiani; Scheckenbach, Frank; Pflug, Anja; Bülbül, Alpaslan; Catapano, Alberico L; Agewall, Stefan; Ezhov, Marat; Bots, Michiel L; Kiechl, Stefan; Orth, Andreas
2017-04-13
For an individual participant data (IPD) meta-analysis, multiple datasets must be transformed into a consistent format, e.g. using uniform variable names. When large numbers of datasets have to be processed, this can be a time-consuming and error-prone task. Automated or semi-automated identification of variables can help to reduce the workload and improve the data quality. For semi-automation, high sensitivity in the recognition of matching variables is particularly important, because it allows creating software which for a target variable presents a choice of source variables, from which a user can choose the matching one, with only low risk of having missed a correct source variable. For each variable in a set of target variables, a number of simple rules were manually created. With logic regression, an optimal Boolean combination of these rules was searched for every target variable, using a random subset of a large database of epidemiological and clinical cohort data (construction subset). In a second subset of this database (validation subset), these optimal combinations of rules were validated. In the construction sample, 41 target variables were allocated on average with a positive predictive value (PPV) of 34%, and a negative predictive value (NPV) of 95%. In the validation sample, PPV was 33%, whereas NPV remained at 94%. In the construction sample, PPV was 50% or less in 63% of all variables, in the validation sample in 71% of all variables. We demonstrated that the application of logic regression in a complex data management task in large epidemiological IPD meta-analyses is feasible. However, the performance of the algorithm is poor, which may require backup strategies.
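To make the predictive values above concrete, the sketch below computes PPV and NPV from classification counts. The abstract reports only the percentages, so the counts are hypothetical and were picked merely to reproduce a PPV of 34% and an NPV of 95%; the function and variable names are likewise illustrative.

```python
def predictive_values(true_pos, false_pos, true_neg, false_neg):
    """Return (PPV, NPV) from the four cells of a two-by-two table."""
    flagged = true_pos + false_pos      # source variables proposed as a match
    rejected = true_neg + false_neg     # source variables not proposed
    if flagged == 0 or rejected == 0:
        raise ValueError("PPV or NPV is undefined when a margin is zero")
    return true_pos / flagged, true_neg / rejected

# Hypothetical matching results for one target variable.
ppv, npv = predictive_values(true_pos=34, false_pos=66, true_neg=950, false_neg=50)

print(f"PPV {ppv:.0%}, NPV {npv:.0%}")
```

A low PPV combined with a high NPV fits the semi-automated use the abstract describes: a shortlist containing many wrong candidates is tolerable as long as few correct matches are left off it.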
Kyratzis, Amy; Ross, Tamara Shuqum; Koymen, S Bahar
2010-01-01
Children are believed to construct their causal theories through talk and interaction, but with the exception of a few studies, little or nothing is known about how young children justify and build theories of the world together with same-age peers through naturally occurring interaction. Children's sensitivity to when a pair or group of interlocutors who interact frequently together feel that a justification is needed is an index of developing pragmatic competence (Goetz & Shatz, 1999) and may be influenced by interactive goals and gender identity positioning. Studies suggest that salient contexts for justifications for young children are disagreement and control (e.g. Veneziano & Sinclair, 1995) but researchers have been less cognizant of 'situations in which partners verbally assist in the construction of justifications as a means to maintain contact or create solidarity' (Goetz & Shatz, 1999: 722) as contexts for justifications. The present study examined the spontaneously produced justification constructions in the naturally occurring free play of five friendship groups of preschool-aged children (aged from 3;6 to 5;4), in terms of the motivating context of the justification, marking of the causal relationship with a connective, and causal theories accessed in the talk. Partner expansion (validating justifications) was a salient motivating context for justifications, especially in the talk of friendship groups of girls, and seemed to privilege greater marking of the causal relationship with a connective and less arbitrary reasoning. One group of girls varied their use of validating justifications depending on the theme of play. Results are discussed in terms of the implications of use of validating justifications for children's causal theory building with peers, linguistic development, and pragmatic development.
AlHeresh, Rawan; LaValley, Michael P; Coster, Wendy; Keysor, Julie J
2017-06-01
To evaluate construct validity and scoring methods of the World Health Organization Health and Work Performance Questionnaire (HPQ) for people with arthritis. Construct validity was examined through hypothesis testing using the recommended guidelines of the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN). The HPQ using the absolute scoring method showed moderate construct validity, as four of the seven hypotheses were met. The HPQ using the relative scoring method had weak construct validity, as only one of the seven hypotheses was met. The absolute scoring method for the HPQ is superior in construct validity to the relative scoring method in assessing work performance among people with arthritis and related rheumatic conditions; however, more research is needed to further explore other psychometric properties of the HPQ.
Won, Jongsung; Cheng, Jack C P; Lee, Ghang
2016-03-01
Waste generated in construction and demolition processes comprised around 50% of the solid waste in South Korea in 2013. Many cases show that design validation based on building information modeling (BIM) is an effective means to reduce the amount of construction waste, since construction waste is mainly generated due to improper design and unexpected changes in the design and construction phases. However, the amount of construction waste that could be avoided by adopting BIM-based design validation has been unknown. This paper aims to estimate the amount of construction waste prevented by a BIM-based design validation process based on the amount of construction waste that might be generated due to design errors. Two project cases in South Korea were studied in this paper, with 381 and 136 design errors detected, respectively, during the BIM-based design validation. Each design error was categorized according to its cause and the likelihood of detection before construction. The case studies show that BIM-based design validation could prevent 4.3-15.2% of construction waste that might have been generated without using BIM. Copyright © 2015 Elsevier Ltd. All rights reserved.
The Intelligibility in Context Scale: validity and reliability of a subjective rating measure.
McLeod, Sharynne; Harrison, Linda J; McCormack, Jane
2012-04-01
To describe a new measure of functional intelligibility, the Intelligibility in Context Scale (ICS), and evaluate its validity, reliability, and sensitivity using 3 clinical measures of severity of speech sound disorder: (a) percentage of phonemes correct (PPC), (b) percentage of consonants correct (PCC), and (c) percentage of vowels correct (PVC). Speech skills of 120 preschool children (109 with parent-/teacher-identified concern about how they talked and made speech sounds and 11 with no identified concern) were assessed with the Diagnostic Evaluation of Articulation and Phonology (Dodd, Hua, Crosbie, Holm, & Ozanne, 2002). Parents completed the 7-item ICS, which rates the degree to which children's speech is understood by different communication partners (parents, immediate family, extended family, friends, acquaintances, teachers, and strangers) on a 5-point scale. Parents' ratings showed that most children were always (5) or usually (4) understood by parents, immediate family, and teachers, but only sometimes (3) by strangers. Factor analysis confirmed the internal consistency of the ICS items; therefore, ratings were averaged to form an overall intelligibility score. The ICS had high internal reliability (α = .93), sensitivity, and construct validity. Criterion validity was established through significant correlations between the ICS and PPC (r = .54), PCC (r = .54), and PVC (r = .36). The ICS is a promising new measure of functional intelligibility. These data provide initial support for the ICS as an easily administered, valid, and reliable estimate of preschool children's intelligibility when speaking with people of varying levels of familiarity and authority.
Kim, Myoung-Hee; Cho, Young-Shin; Uhm, Wan-Sik; Kim, Sehyun; Bae, Sang-Cheol
2005-06-01
This study aimed to perform the cross-cultural adaptation and validation of the Korean version of the EQ-5D in rheumatic conditions. Translation, back-translation and cognitive debriefing were performed according to the EuroQol group's guidelines. For validity, 508 patients were recruited and administered the EQ-5D, Short-Form 36 and condition-specific measures. Construct validity and sensitivity were evaluated by testing a priori hypotheses. For reliability, another 57 patients repeated the EQ-5D at a 1-week interval, and intra-class correlations (ICC) and kappa statistics were estimated. For responsiveness, another 60 patients repeated it at a 12-week interval within the context of a clinical trial, and standardized response means (SRM) were calculated. The cross-cultural adaptation produced no major modifications in the scale. The associations of the EQ-5D with the generic and condition-specific measures were observed as expected in the hypotheses: the higher the EQ-5D index and EQ-5D(VAS) scores, the better the health status by generic or condition-specific measures, and the better the functional class. The ICCs were 0.751 and 0.767, respectively, and kappa ranged from 0.455 to 0.772. The SRMs were 0.649 and 0.410, respectively. The Korean EQ-5D exhibits good validity and sensitivity in various rheumatic conditions. Although its reliability and responsiveness were not excellent, it seems acceptable if condition-specific measures are applied together.
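The standardized response mean used above as the responsiveness index is, by its usual definition, the mean change score divided by the standard deviation of the change scores. A short sketch under that definition follows; the paired scores are hypothetical visual-analogue values and do not come from the Korean EQ-5D sample.

```python
from statistics import mean, stdev

def standardized_response_mean(baseline, follow_up):
    """Mean of the paired change scores divided by their sample standard deviation."""
    if len(baseline) != len(follow_up) or len(baseline) < 2:
        raise ValueError("need at least two paired observations")
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)

# Hypothetical VAS scores (0-100) for four patients at baseline and at week 12.
vas_baseline = [50, 60, 40, 70]
vas_week12 = [60, 80, 50, 80]
srm = standardized_response_mean(vas_baseline, vas_week12)
```

Because the denominator is the spread of the changes rather than of the baseline scores, the SRM can be large even for a small average improvement when patients change by similar amounts.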
Predicting distant failure in early stage NSCLC treated with SBRT using clinical parameters.
Zhou, Zhiguo; Folkert, Michael; Cannon, Nathan; Iyengar, Puneeth; Westover, Kenneth; Zhang, Yuanyuan; Choy, Hak; Timmerman, Robert; Yan, Jingsheng; Xie, Xian-J; Jiang, Steve; Wang, Jing
2016-06-01
The aim of this study is to predict early distant failure in early stage non-small cell lung cancer (NSCLC) treated with stereotactic body radiation therapy (SBRT) using clinical parameters by machine learning algorithms. The dataset used in this work includes 81 early stage NSCLC patients with at least 6 months of follow-up who underwent SBRT between 2006 and 2012 at a single institution. The clinical parameters (n=18) for each patient include demographic parameters, tumor characteristics, treatment fraction schemes, and pretreatment medications. Three predictive models were constructed based on different machine learning algorithms: (1) artificial neural network (ANN), (2) logistic regression (LR) and (3) support vector machine (SVM). Furthermore, to select an optimal clinical parameter set for the model construction, three strategies were adopted: (1) clonal selection algorithm (CSA) based selection strategy; (2) sequential forward selection (SFS) method; and (3) statistical analysis (SA) based strategy. Five-fold cross-validation is used to validate the performance of each predictive model. Accuracy was assessed by the area under the receiver operating characteristic (ROC) curve (AUC); the sensitivity and specificity of the system were also evaluated. The AUCs for ANN, LR and SVM were 0.75, 0.73, and 0.80, respectively. The sensitivity values for ANN, LR and SVM were 71.2%, 72.9% and 83.1%, while the specificity values for ANN, LR and SVM were 59.1%, 63.6% and 63.6%, respectively. Meanwhile, the CSA based strategy outperformed SFS and SA in terms of AUC, sensitivity and specificity. Based on clinical parameters, the SVM with the CSA optimal parameter set selection strategy achieves better performance than other strategies for predicting distant failure in lung SBRT patients. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
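The evaluation scheme described above can be sketched in a few lines. The example assumes that the abstract's cross-validation is the usual five-fold procedure, and it computes the AUC as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case (ties counting one half). The helper names, labels and scores are hypothetical; a real analysis would normally shuffle or stratify the folds and use an established library.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = sum((p > q) + 0.5 * (p == q) for p in positives for q in negatives)
    return wins / (len(positives) * len(negatives))

def k_fold_indices(n_samples, k=5):
    """Split range(n_samples) into k contiguous folds of near-equal size."""
    folds, start = [], 0
    for i in range(k):
        size = n_samples // k + (1 if i < n_samples % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds

# Hypothetical outcomes (1 = distant failure) and model scores.
outcomes = [1, 1, 1, 0, 0]
model_scores = [0.9, 0.8, 0.4, 0.5, 0.3]
example_auc = auc(outcomes, model_scores)

# Fold sizes for a cohort of 81 patients, as in the abstract.
fold_sizes = [len(fold) for fold in k_fold_indices(81, k=5)]
```

Each fold would serve once as the held-out test set while the model is fitted on the remaining four; with 81 patients every test fold holds only 16 or 17 cases, so per-fold AUC estimates are necessarily noisy.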
Niimi, Shingo; Nishimiya, Kazuhiro; Nishidate, Masanobu; Saito, Tetsu; Minoura, Kyoko; Kadotsuji, Kenta; Shimakura, Jin; Shigemizu, Hiroko; Hosogi, Jun; Adachi, Maiko; Hashimoto, Tsutomu; Mori, Tamiki; Harada, Hideki; Yamamoto, Ken-Ichi; Nakamura, Takahiro; Nomura, Tatsuki; Yamaguchi, Itadaki; Sonehara, Kazuhiko; Ishii-Watabe, Akiko; Kawasaki, Nana
2018-04-01
This study was undertaken to evaluate the performance of anti-drug antibody (ADA) assays constructed by each participating company using common samples including ADA, drug and human serum. The ADA assays constructed by each company showed good sensitivity and precision for evaluation of ADA. Cut points for screening and confirmatory assays and assay selectivity were determined by various calculation methods. In evaluations of blind ADA samples, the study companies obtained broadly similar results in determining whether samples were positive or negative, except at the lowest sample concentration (5 ng/mL). In measurement of drug tolerance, for almost all samples containing ADA and drugs, more positive results were obtained in assays using acid dissociation compared to those without acid dissociation. Overall, the performance of ADA assays constructed by the 10 companies participating in this study was acceptable in terms of sensitivity and reproducibility for detection and evaluation of immunogenicity in both patients and healthy subjects. On the other hand, based on results for samples containing ADA and drugs, validity of results for ADA assays conducted without acid dissociation was less meaningful and more difficult to evaluate. Thus, acid dissociation was confirmed to be useful for improving drug tolerance. Copyright © 2018 The Japanese Society for the Study of Xenobiotics. Published by Elsevier Ltd. All rights reserved.
[Reliability and validity of the Braden Scale for predicting pressure sore risk].
Boes, C
2000-12-01
For more accurate and objective pressure sore risk assessment, various risk assessment tools were developed, mainly in the USA and Great Britain. The Braden Scale for Predicting Pressure Sore Risk is one such example. By means of a literature analysis of German and English texts referring to the Braden Scale, the scientific quality criteria of reliability and validity are traced and consequences for application of the scale in Germany are demonstrated. Analysis of 4 reliability studies shows an exclusive focus on interrater reliability. Further, even though the 19 validity studies were conducted in many different settings, their examination is limited to the criteria sensitivity and specificity (accuracy). The range of sensitivity and specificity levels is 35-100%. The recommended cut-off points range from 10 to 19 points. The studies prove not to be comparable with each other. Furthermore, distortions that affect the accuracy of the scale can be found in these studies. The results of the analysis presented here show insufficient proof of reliability and validity in the American studies. In Germany, the Braden Scale has not yet been tested under scientific criteria. Such testing is needed before using the scale in different German settings. In the course of such testing, the construction and study procedures of the American studies can be used as a basis, as can the problems identified in the analysis presented here.
Hernansaiz-Garrido, Helena; Alonso-Tapia, Jesús
2017-01-01
Internalized stigma and disclosure concerns are key elements for the study of mental health in people living with HIV. Since no measures of these constructs were available for Spanish population, this study sought to develop such instruments, to analyze their reliability and validity and to provide a short version. A heterogeneous sample of 458 adults from different Spanish-speaking countries completed the HIV-Internalized Stigma Scale and the HIV-Disclosure Concerns Scale, along with the Hospital Anxiety and Depression Scale, Rosenberg's Self-esteem Scale and other socio-demographic variables. Reliability and correlation analyses, exploratory factor analyses, path analyses with latent variables, and ANOVAs were conducted to test the scales' psychometric properties. The scales showed good reliability in terms of internal consistency and temporal stability, as well as good sensitivity and factorial and criterion validity. The HIV-Internalized Stigma Scale and the HIV-Disclosure Concerns Scale are reliable and valid means to assess these variables in several contexts.
Wellen, B; Skriner, L C; Freeman, J; Stewart, E; Garcia, A; Sapyta, J; Franklin, M
2017-02-01
Researchers have demonstrated that quality of life (QOL) is an important construct to measure in individuals with mental health disorders, yet only a small amount of research has been dedicated to examining QOL and its response to treatment in children and adolescents with obsessive-compulsive disorder (OCD). The current study explored the psychometric properties of a measure of QOL, the Pediatric Quality of Life Enjoyment and Satisfaction Questionnaire (PQ-LES-Q), by examining the reliability, validity, and treatment sensitivity of this measure delivered in two separate RCTs for OCD (total N = 251 across both studies). Our results provide evidence for the reliability and validity of the PQ-LES-Q in adolescents with OCD (all Cronbach's alphas >.89, convergent validity correlations significant at the p < .05 level), but suggest that an adaptation of the measure may be necessary for valid use in younger children with OCD.
The trucker strain monitor: an occupation-specific questionnaire measuring psychological job strain.
De Croon, E M; Blonk, R W; Van der Beek, J; Frings-Dresen, M H
2001-08-01
To develop and validate a short and user-friendly questionnaire measuring psychological job strain in truck drivers. In cooperation with an occupational physician in the Dutch road transport industry we developed items on the basis of face validity and information of existing questionnaires on the subject. These items were pilot-tested, by means of interviews, in 15 truck drivers. Study I examined the factorial structure of the initial 30-item trucker strain monitor (TSM) in a sample of 153 truck drivers. Subsequently, number of items per factor was reduced on the basis of reliability analyses (Cronbach's alpha). Study II examined construct and criterion validity of the TSM in a randomly selected group of 2,000 truck drivers, of whom 1,111 participated (adjusted response = 63%). Additionally, sensitivity and specificity were assessed by examining the ability of the TSM to identify truck drivers with or without self-reported sickness absence in the past 12 months because of psychological complaints. Factor analyses of the initial 30-item TSM revealed a two-factor solution. Item reduction resulted in a six-item work-related fatigue scale and four-item sleeping problems scale with high internal consistency. Results of study II confirmed the internal consistency of the TSM scales and provided support for construct and criterion validity. The composite, work-related fatigue, and sleeping problems scale had a sensitivity of 83%, 80% and 71% respectively, in identifying truck drivers with prior sickness absence because of psychological complaints. Specificity rates were 72%, 73% and 72% respectively. Despite methodological limitations, the results suggest that the TSM is a reliable and valid indicator of psychological job strain in truck drivers. In particular, the composite and work-related fatigue scale identified drivers with prior absenteeism because of psychological complaints, quite accurately. 
Future longitudinal research in specific sub-groups of truck drivers including both self-reported and objective psychological health measures should evidence whether (1) the distinction between two indicators of psychological job strain is useful, and whether (2) the TSM can be used in screening out truck drivers at risk of developing psychological health problems.
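The item reduction described above relies on Cronbach's alpha. As a minimal sketch of that coefficient, assuming the usual formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), the helper below works on a respondents-by-items table. The function name and the response data are illustrative only and are not taken from the TSM study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a table with one row per respondent
    and one column per item (sample variances throughout)."""
    k = len(rows[0])                                  # number of items
    item_vars = [variance(col) for col in zip(*rows)]  # per-item variance
    total_var = variance([sum(r) for r in rows])       # variance of sum scores
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 4 respondents, 3 items on a 1-5 scale.
responses = [
    [3, 4, 3],
    [2, 2, 3],
    [4, 5, 4],
    [1, 2, 2],
]
print(round(cronbach_alpha(responses), 2))
```

Dropping an item and recomputing alpha on the remaining columns is one common way to judge whether that item contributes to internal consistency.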
Simons, Janine A; Fietzek, Urban M; Waldmann, Annika; Warnecke, Tobias; Schuster, Tibor; Ceballos-Baumann, Andrés O
2014-09-01
Dysphagia in patients with Parkinson's disease (PD) significantly reduces quality of life and predicted lifetime. Current screening procedures are insufficiently evaluated. We aimed to develop and validate a patient-reported outcome questionnaire for early diagnosis of dysphagia in patients with PD. The two-phased project comprised the questionnaire, diagnostic scales construction (N = 105), and a validation study (N = 82). Data for the project were gathered from PD patients at a German Movement Disorder Center. For validation purposes, a clinical evaluation focusing on swallowing tests, tests of sensory reflexes, and fiberoptic endoscopic evaluation of swallowing (FEES) was performed that yielded a criteria sum score against which the results of the questionnaire were compared. Specificity and sensitivity were evaluated for the detection of noticeable dysphagia and for the risk of aspiration. The Munich Dysphagia Test - Parkinson's disease (MDT-PD) consists of 26 items that show high internal consistency (α = 0.91). For the validation study, 82 patients, aged 70.9 ± 8.7 (mean ± SD), with a median Hoehn & Yahr stage of 3, were assessed. 73% of patients had dysphagia with noticeable oropharyngeal symptoms (44%) or with penetration/aspiration (29%). The criteria sum score correlated positively with the screening result (r = 0.70, p < 0.001). The MDT-PD sum score classified not noticeable dysphagia vs. risk of aspiration (noticeable dysphagia) with a sensitivity of 90% (82%) and a specificity of 86% (71%), and yielded similar results in cross-validation, respectively. MDT-PD is a valid screening tool for early diagnosis of swallowing problems and aspiration risk, as well as initial graduation of dysphagia severity in PD patients. Copyright © 2014 Elsevier Ltd. All rights reserved.
Chiner, Eusebi; Landete, Pedro; Sancho-Chust, José Norberto; Martínez-García, Miguel Ángel; Pérez-Ferrer, Patricia; Pastor, Esther; Senent, Cristina; Arlandis, Mar; Navarro, Cristina; Selma, María José
2016-11-01
To analyze the reliability and validity of the Spanish version of the OSA-18 quality of life questionnaire in children with apnea-hypopnea syndrome (SAHS). Children with suspected SAHS were studied with polysomnography (PSG) before and after adenotonsillectomy (AA). Age, gender, clinical data, PSG, anthropometric data, and Mallampati and Brodsky scales were analyzed. OSA-18 was administered at baseline and 3-6 months post AA. After translation and backtranslation by bilingual professionals, the internal consistency, reliability, construct validity, concurrent validity, predictive validity and sensitivity to change of the questionnaire were assessed. In total, 45 boys and 15 girls were evaluated, showing BMI 18±4, neck 28±5, Brodsky (0: 7%; <25%: 12%; 25-50%: 27%; >50 to <75%: 45%; >75%: 6%), AHI 12±7 pre AA. Global Cronbach alpha was 0.91. Correlations between domains were significant except for emotional aspects, although the total scores correlated with all domains (0.50 to 0.90). The factorial analysis was virtually identical to the original structure. The total scores showed good correlation for concurrent validity (0.2-0.45). With regard to predictive validity, the questionnaire adequately differentiated levels of severity according to Mallampati (ANOVA P=.002) and apnea-hypopnea index (ANOVA P=.006). Test-retest reliability was excellent, as was sensitivity to change, both in the total scores (P<.001) and in each domain (P<.001). The Spanish adaptation of the OSA-18 and its psychometric characteristics suggest that the Spanish version is equivalent to the original and can be used in Spanish-speaking countries. Copyright © 2016 SEPAR. Publicado por Elsevier España, S.L.U. All rights reserved.
Validation of Multilevel Constructs: Validation Methods and Empirical Findings for the EDI
ERIC Educational Resources Information Center
Forer, Barry; Zumbo, Bruno D.
2011-01-01
The purposes of this paper are to highlight the foundations of multilevel construct validation, describe two methodological approaches and associated analytic techniques, and then apply these approaches and techniques to the multilevel construct validation of a widely-used school readiness measure called the Early Development Instrument (EDI;…
de Alwis, Manudul Pahansen; Äng, Björn Olov; Garme, Karl
2017-01-01
Objective High-performance marine craft personnel (HPMCP) are regularly exposed to vibration and repeated shock (VRS) levels exceeding maximum limitations stated by international legislation. Whereas such exposure reportedly is detrimental to health and performance, the epidemiological data necessary to link these adverse effects causally to VRS are not available in the scientific literature, and no suitable tools for acquiring such data exist. This study therefore constructed a questionnaire for longitudinal investigations in HPMCP. Methods A consensus panel defined content domains, identified relevant items and outlined a questionnaire. The relevance and simplicity of the questionnaire’s content were then systematically assessed by expert raters in three consecutive stages, each followed by revisions. An item-level content validity index (I-CVI) was computed as the proportion of experts rating an item as relevant and simple, and a scale-level content validity index (S-CVI/Ave) as the average I-CVI across items. The thresholds for acceptable content validity were 0.78 and 0.90, respectively. Finally, a dynamic web version of the questionnaire was constructed and pilot tested over a 1-month period during a marine exercise in a study population sample of eight subjects, while accelerometers simultaneously quantified VRS exposure. Results Content domains were defined as work exposure, musculoskeletal pain and human performance, and items were selected to reflect these constructs. Ratings from nine experts yielded S-CVI/Ave of 0.97 and 1.00 for relevance and simplicity, respectively, and the pilot test suggested that responses were sensitive to change in acceleration and that the questionnaire, following some adjustments, was feasible for its intended purpose. Conclusions A dynamic web-based questionnaire for longitudinal survey of key variables in HPMCP was constructed. 
Expert ratings supported that the questionnaire content is relevant, simple and sufficiently comprehensive, and the pilot test suggested that the questionnaire is feasible for longitudinal measurements in the study population. PMID:28729320
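The two content validity indices defined in the abstract above can be sketched in a few lines. The definitions (I-CVI as the proportion of experts endorsing an item, S-CVI/Ave as the mean I-CVI) and the 0.78 and 0.90 thresholds come from the abstract; the rating matrix is hypothetical and only illustrates the arithmetic for a panel of nine raters.

```python
from statistics import mean

I_CVI_THRESHOLD = 0.78  # item-level threshold stated in the abstract
S_CVI_THRESHOLD = 0.90  # scale-level threshold stated in the abstract


def item_cvi(endorsements):
    """Proportion of experts endorsing one item (1 = endorsed, 0 = not)."""
    return sum(endorsements) / len(endorsements)


def scale_cvi_ave(matrix):
    """S-CVI/Ave: the average I-CVI across all items."""
    return mean(item_cvi(row) for row in matrix)


# Hypothetical ratings: one row per item, one column per expert (9 experts).
ratings = [
    [1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 0],
    [1, 1, 1, 1, 1, 1, 1, 0, 0],
]
i_cvis = [item_cvi(row) for row in ratings]
s_cvi = scale_cvi_ave(ratings)
flagged = [idx for idx, v in enumerate(i_cvis) if v < I_CVI_THRESHOLD]
print(i_cvis, s_cvi, flagged)
```

Items whose index falls below the item-level threshold would be candidates for revision in the next rating stage.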
Evaluating process in child and family interventions: aggression prevention as an example.
Tolan, Patrick H; Hanish, Laura D; McKay, Mary M; Dickey, Mitchell H
2002-06-01
This article reports on 2 studies designed to develop and validate a set of measures for use in evaluating processes of child and family interventions. In Study 1 responses from 187 families attending an outpatient clinic for child behavior problems were factor analyzed to identify scales, consistent across sources: Alliance (Satisfactory Relationship with Interventionist and Program Satisfaction), Parenting Skill Attainment, Child Cooperation During Session, Child Prosocial Behavior, and Child Aggressive Behavior. Study 2 focused on patterns of scale scores among 78 families taking part in a 22-week preventive intervention designed to affect family relationships, parenting, and child antisocial and prosocial behaviors. The factor structure identified in Study 1 was replicated. Scale construct validity was demonstrated through across-source convergence, sensitivity to intervention change, and ability to discriminate individual differences. Path analysis validated the scales' utility in explaining key aspects of the intervention process. Implications for evaluating processes in family interventions are discussed.
Using entropy measures to characterize human locomotion.
Leverick, Graham; Szturm, Tony; Wu, Christine Q
2014-12-01
Entropy measures have been widely used to quantify the complexity of theoretical and experimental dynamical systems. In this paper, the value of using entropy measures to characterize human locomotion is demonstrated based on their construct validity, predictive validity in a simple model of human walking and convergent validity in an experimental study. Results show that four of the five considered entropy measures increase meaningfully with the increased probability of falling in a simple passive bipedal walker model. The same four entropy measures also experienced statistically significant increases in response to increasing age and gait impairment caused by cognitive interference in an experimental study. Of the considered entropy measures, the proposed quantized dynamical entropy (QDE) and quantization-based approximation of sample entropy (QASE) offered the best combination of sensitivity to changes in gait dynamics and computational efficiency. Based on these results, entropy appears to be a viable candidate for assessing the stability of human locomotion.
Scale of attitudes toward alcohol - Spanish version: evidences of validity and reliability
Ramírez, Erika Gisseth León; de Vargas, Divane
2017-01-01
ABSTRACT Objective: validate the Scale of attitudes toward alcohol, alcoholism and individuals with alcohol use disorders in its Spanish version. Method: methodological study, involving 300 Colombian nurses. Adopting the classical theory, confirmatory factor analysis was applied without prior examination, based on the strong historical evidence of the factorial structure of the original scale to determine the construct validity of this Spanish version. To assess the reliability, Cronbach’s Alpha and McDonald’s Omega coefficients were used. Results: the confirmatory factor analysis indicated the good fit of the scale model in a four-factor distribution, with a cut-off point at 3.2, demonstrating 66.7% of sensitivity. Conclusions: the Scale of attitudes toward alcohol, alcoholism and individuals with alcohol use disorders in Spanish presented robust psychometric qualities, affirming that the instrument possesses a solid factorial structure and reliability and is capable of precisely measuring the nurses’ attitudes towards the phenomenon proposed. PMID:28793126
Psychometric properties of the postpartum depression screening scale beyond the postpartum period.
Vogeli, Jo M; Hooker, Stephanie A; Everhart, Kevin D; Kaplan, Peter S
2018-04-01
Accurate postpartum depression screening measures are needed to identify mothers with depressive symptoms both in the postpartum period and beyond. Because it had not been tested beyond the immediate postpartum period, the reliability and validity of the Postpartum Depression Screening Scale (PDSS) and its sensitivity, specificity, and predictive value for diagnoses of major depressive disorder (MDD) were assessed in a diverse community sample of 238 mothers of 4- to 15-month-old infants. Mothers (N = 238; M age = 30.2, SD = 5.3) attended a lab session and completed the PDSS, the Beck Depression Inventory-II (BDI-II), and a structured clinical interview (SCID) to diagnose MDD. The reliability, validity, specificity, sensitivity, and predictive value of the PDSS to identify maternal depression were assessed. Confirmatory factor analysis supported the construct validity of five but not seven content subscales. The PDSS total and subscale scores demonstrated acceptable to high reliability (α = 0.68-0.95). Discriminant function analysis showed the scale correctly provided diagnostic classification at a rate higher than chance alone. Sensitivity and specificity for major depressive disorder (MDD) diagnosis were good and comparable to those of the BDI-II. Even in mothers who were somewhat more diverse and had older infants than those in the original normative study, the PDSS appears to be a psychometrically sound screener for identifying depressed mothers in the 15 months after childbirth. © 2018 Wiley Periodicals, Inc.
Rane, Smita; Prabhakar, Bala
2013-07-01
The aim of this study was to investigate the combined influence of 3 independent variables in the preparation of paclitaxel containing pH-sensitive liposomes. A 3 factor, 3 levels Box-Behnken design was used to derive a second order polynomial equation and construct contour plots to predict responses. The independent variables selected were molar ratio phosphatidylcholine:diolylphosphatidylethanolamine (X1), molar concentration of cholesterylhemisuccinate (X2), and amount of drug (X3). Fifteen batches were prepared by thin film hydration method and evaluated for percent drug entrapment, vesicle size, and pH sensitivity. The transformed values of the independent variables and the percent drug entrapment were subjected to multiple regression to establish full model second order polynomial equation. F was calculated to confirm the omission of insignificant terms from the full model equation to derive a reduced model polynomial equation to predict the dependent variables. Contour plots were constructed to show the effects of X1, X2, and X3 on the percent drug entrapment. A model was validated for accurate prediction of the percent drug entrapment by performing checkpoint analysis. The computer optimization process and contour plots predicted the levels of independent variables X1, X2, and X3 (0.99, -0.06, 0, respectively), for maximized response of percent drug entrapment with constraints on vesicle size and pH sensitivity.
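As an illustration of the design named above, a three-factor Box-Behnken layout in coded units (-1, 0, +1) can be generated as follows. The abstract states three factors and fifteen batches; reaching fifteen runs with three centre points is an assumption here (the usual 12 edge-midpoint runs plus 3 centre replicates), and the function name is hypothetical.

```python
from itertools import combinations, product


def box_behnken(n_factors, n_center):
    """Box-Behnken runs in coded units: for each pair of factors, a 2x2
    factorial at +/-1 with every other factor held at 0, plus centre points."""
    runs = []
    for i, j in combinations(range(n_factors), 2):
        for a, b in product((-1, 1), repeat=2):
            run = [0] * n_factors
            run[i], run[j] = a, b
            runs.append(tuple(run))
    runs.extend([(0,) * n_factors] * n_center)  # replicated centre points
    return runs


# Assumed: 3 centre points, giving 12 + 3 = 15 runs for factors X1, X2, X3.
design = box_behnken(n_factors=3, n_center=3)
for x1, x2, x3 in design:
    print(x1, x2, x3)
```

A full second-order model fitted to such a design has ten coefficients for three factors: an intercept, three linear terms, three two-factor interactions and three squared terms.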
Bussing, Regina; Murphy, Tanya K; Storch, Eric A; McNamara, Joseph P H; Reid, Adam M; Garvan, Cynthia W; Goodman, Wayne K
2013-02-28
This study evaluated the psychometric properties of the treatment-emergent activation and suicidality assessment profile (TEASAP) in a clinical sample of 56 youth aged 7-17 with obsessive-compulsive disorder (OCD) who participated in a double-blind randomized controlled trial. The 38-item TEASAP demonstrated good internal consistency for its total score (α=0.93) and adequate to good performance for its five subscale scores (α=0.65-0.92). One-week test-retest stability (N=18) was adequate (Intraclass correlation coefficient [ICC]=0.68-0.80) except for Self-Injury (ICC=0.46). Construct validity was supported by total and subscale TEASAP score relationships with related constructs, including irritability, hyperactivity, externalizing behaviors, manic symptoms, and suicidal ideation, and the absence of relationships with unrelated constructs. Predictive validity was established for the Disinhibition subscale through significant associations with subsequent activation events. Furthermore, TEASAP sensitivity to change in activation scores over time was supported by longitudinal associations of TEASAP scores with clinician ratings of activation over the course of treatment. Findings indicate that the TEASAP has acceptable psychometric properties in a clinical sample of youth with OCD and merits further study in larger samples for additional refinement of its measurement approaches. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
2006-10-01
Investigation of Item-Pair Presentation and Construct Validity of the Navy Computer Adaptive Personality Scales (NCAPS). Christina M. Underhill, Ph.D. Reviewed and approved by Jacqueline A. Mottern... Program element numbers: 0602236N and 0603236N.
Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.
2008-01-01
Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, e.g., Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR) and partial least squares coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χ²_HL, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0% to 86.7%, while that of PLS-logistic regression models ranges from 73.3% to 80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934
Batista-Foguet, Joan; Sipahi-Dantas, Alaide; Guillén, Laura; Martínez Arias, Rosario; Serlavós, Ricard
2016-03-22
Most questionnaires used for managerial purposes have been developed in Anglo-Saxon countries and then adapted for other cultures. However, this process is controversial. This paper addresses the need for more culturally sensitive assessment instruments in the specific field of human resources while also addressing the methodological issues that scientists and practitioners face in the development of questionnaires. First, we present the development process of a Personal and Motive-based competencies questionnaire targeted to Spanish-speaking countries. Second, we address the validation process by guiding the reader through testing the questionnaire's construct validity. We performed two studies: a first study with 274 experts and practitioners of competency development and a definitive study with 482 members of the general public. Our results support a model of nineteen competencies grouped into four higher-order factors. To assure valid construct comparisons, we tested factorial invariance across gender and work experience. Subsequent analyses found that women rate themselves significantly higher than men on only two of the nineteen competencies, empathy (p < .001) and service orientation (p < .05). The effect of work experience was significant in twelve competencies (p < .001), in which less experienced workers rate themselves higher than experienced workers. Finally, we derive theoretical and practical implications.
Cataudella, Danielle; Morley, Tara Elise; Nesin, April; Fernandez, Conrad V; Johnston, Donna Lynn; Sung, Lillian; Zelcer, Shayna
2014-10-01
There are currently no published, validated measures available that comprehensively capture quality of life (QoL) symptoms for children with poor-prognosis malignancies. The pediatric advanced care-quality of life scale (PAC-QoL) has been developed to address this gap. The current paper describes the first two phases in the development of this measure. The first two phases included: (1) construct and item generation, and (2) preliminary content validation. Domains of QoL relevant to this population were identified from the literature and items generated to capture each; items were then adapted to create versions sensitive to age/developmental differences. Two types of experts reviewed the draft PAC-QoL and rated items for relevance, understandability, and sensitivity of wording: bereaved parents (n = 8) and health care professionals (HCP; n = 7). Content validity was calculated using the index of content validity (CVI [Lynn. Nurs Res 1986;35:382-385]). One hundred and forty-one candidate items congruent with the domains identified as relevant to children with advanced malignancies were generated, and four report versions with a 5-choice response scale created. Parent mean scores for importance, understandability, and sensitivity of wording ranged from 4.29 (SD = 0.52) to 4.66 (SD = 0.50). The CVI ranged from 95% to 100%. These steps resulted in reductions of the PAC-QoL to 57-65 items, as well as a modification of the response scale to a 4-choice option with new anchors. The next phase of this study will be to conduct cognitive probing with the intended population to further modify and reduce candidate items prior to psychometric evaluation. © 2014 Wiley Periodicals, Inc.
Billing code algorithms to identify cases of peripheral artery disease from administrative data
Fan, Jin; Arruda-Olson, Adelaide M; Leibson, Cynthia L; Smith, Carin; Liu, Guanghui; Bailey, Kent R; Kullo, Iftikhar J
2013-01-01
Objective To construct and validate billing code algorithms for identifying patients with peripheral arterial disease (PAD). Methods We extracted all encounters and line item details including PAD-related billing codes at Mayo Clinic Rochester, Minnesota, between July 1, 1997 and June 30, 2008; 22 712 patients evaluated in the vascular laboratory were divided into training and validation sets. Multiple logistic regression analysis was used to create an integer code score from the training dataset, and this was tested in the validation set. We applied a model-based code algorithm to patients evaluated in the vascular laboratory and compared this with a simpler algorithm (presence of at least one of the ICD-9 PAD codes 440.20–440.29). We also applied both algorithms to a community-based sample (n=4420), followed by a manual review. Results The logistic regression model performed well in both training and validation datasets (c statistic=0.91). In patients evaluated in the vascular laboratory, the model-based code algorithm provided better negative predictive value. The simpler algorithm was reasonably accurate for identification of PAD status, with lesser sensitivity and greater specificity. In the community-based sample, the sensitivity (38.7% vs 68.0%) of the simpler algorithm was much lower, whereas the specificity (92.0% vs 87.6%) was higher than the model-based algorithm. Conclusions A model-based billing code algorithm had reasonable accuracy in identifying PAD cases from the community, and in patients referred to the non-invasive vascular laboratory. The simpler algorithm had reasonable accuracy for identification of PAD in patients referred to the vascular laboratory but was significantly less sensitive in a community-based sample. PMID:24166724
Kneebone, Ian I; Fife-Schaw, Chris; Lincoln, Nadina B; Harder, Helena
2016-12-01
To investigate the validity and reliability of the Geriatric Anxiety Inventory in screening for anxiety in older inpatients post-stroke. Longitudinal. A total of 81 inpatients with stroke aged 65 years or older were recruited at four centres in England. At phase 1 the Geriatric Anxiety Inventory and the Hospital Anxiety and Depression Scale were administered and then the Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders 4th edition (phase 2). The Geriatric Anxiety Inventory was repeated a median of seven days later (phase 3). Internal reliability of the Geriatric Anxiety Inventory was high (α = 0.95) and test-retest reliability acceptable (τB = 0.53). Construct validity was evident relative to the Hospital Anxiety and Depression Scale - Anxiety subscale (τB = 0.61). At a cut off of 6/7, sensitivity of the Geriatric Anxiety Inventory was 0.88, specificity 0.84, with respect to the Structured Clinical Interview anxiety diagnosis. Hospital Anxiety and Depression Scale - Anxiety subscale sensitivity was 0.88, specificity 0.54 at the optimum cut off of 5/6. A comparison of the areas under the curve of the Receiver Operating Characteristics for the two instruments indicated that the area under the curve of the Geriatric Anxiety Inventory was significantly larger than that of the Hospital Anxiety and Depression Scale - Anxiety subscale, supporting its superiority. The Geriatric Anxiety Inventory is an internally consistent, reliable (stable) and valid instrument with acceptable sensitivity and specificity to screen for anxiety in older inpatients with stroke. © The Author(s) 2015.
Validating MEDIQUAL Constructs
NASA Astrophysics Data System (ADS)
Lee, Sang-Gun; Min, Jae H.
In this paper, we validate MEDIQUAL constructs across users of different media in help desk service. In previous research, only two end-users' constructs were used: assurance and responsiveness. In this paper, we extend MEDIQUAL constructs to include reliability, empathy, assurance, tangibles, and responsiveness, which are based on the SERVQUAL theory. The results suggest that: 1) the five MEDIQUAL constructs are validated by factor analysis; that is, the importance ratings of the constructs show relatively high correlations between measures of the same construct obtained with different methods and low correlations between measures of constructs that are expected to differ; and 2) the five MEDIQUAL constructs have a statistically significant effect on media users' satisfaction with help desk service in regression analysis.
NASA Astrophysics Data System (ADS)
Astuti, Sri Rejeki Dwi; Suyanta, LFX, Endang Widjajanti; Rohaeti, Eli
2017-05-01
The demands placed on assessment in the learning process have been affected by policy changes. Nowadays, assessment emphasizes not only knowledge but also skills and attitudes. In practice, however, there are many obstacles to measuring them. This paper aims to describe how to develop an integrated assessment instrument and to verify the instrument's content validity and construct validity. The instrument was developed using the test development model by McIntire. Development process data were acquired following the steps of the test development model. The initial product was reviewed by three peer reviewers and six expert judges (two subject matter experts, two evaluation experts and two chemistry teachers) to establish content validity. The research involved 376 first-grade students of two senior high schools in Bantul Regency to establish construct validity. Content validity was analyzed using Aiken's formula. Construct validity was verified by exploratory factor analysis using SPSS ver 16.0. The results show that all constructs in the integrated assessment instrument are valid according to content validity and construct validity. Therefore, the integrated assessment instrument is suitable for measuring the critical thinking abilities and science process skills of senior high school students on the topic of electrolyte solutions.
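Aiken's formula for content validity is commonly written as V = Σ(r − lo) / (n(c − 1)), where r is each judge's rating, lo is the lowest possible rating, c is the number of rating categories and n is the number of judges. A minimal sketch follows; the 1 to 5 rating scale and the six ratings are assumptions made for illustration and are not reported in the abstract.

```python
def aiken_v(ratings, lo, hi):
    """Aiken's V for one item rated by several judges.

    ratings: integer ratings, one per judge.
    lo, hi:  lowest and highest possible rating (so c - 1 == hi - lo).
    """
    n = len(ratings)
    return sum(r - lo for r in ratings) / (n * (hi - lo))


# Hypothetical ratings from six judges on an assumed 1-5 scale.
v = aiken_v([4, 5, 4, 5, 4, 5], lo=1, hi=5)
print(round(v, 3))
```

V ranges from 0 (all judges give the lowest rating) to 1 (all judges give the highest rating).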
Zuithoff, Nicolaas P A; Vergouwe, Yvonne; King, Michael; Nazareth, Irwin; van Wezep, Manja J; Moons, Karel G M; Geerlings, Mirjam I
2010-12-13
There is a need for brief instruments to ascertain the diagnosis of major depressive disorder. In this study, we present the reliability, construct validity and accuracy of the PHQ-9 and PHQ-2 to detect major depressive disorder in primary care. Cross-sectional analyses within a large prospective cohort study (PREDICT-NL). Data was collected in seven large general practices in the centre of the Netherlands. 1338 subjects were recruited in the general practice waiting room, irrespective of their presenting complaint. The diagnostic accuracy (the area under the ROC curve and sensitivities and specificities for various thresholds) was calculated against a diagnosis of major depressive disorder determined with the Composite International Diagnostic Interview (CIDI). The PHQ-9 showed a high degree of internal consistency (ICC = 0.88) and test-retest reliability (correlation = 0.94). With respect to construct validity, it showed a clear association with functional status measurements, sick days and number of consultations. The discriminative ability was good for the PHQ-9 (area under the ROC curve = 0.87, 95% CI: 0.84-0.90) and the PHQ-2 (ROC area = 0.83, 95% CI 0.80-0.87). Sensitivities at the recommended thresholds were 0.49 for the PHQ-9 at a score of 10 and 0.28 for a categorical algorithm. Adjustment of the threshold and the algorithm improved sensitivities to 0.82 and 0.84 respectively but the specificity decreased from 0.95 to 0.82 (threshold) and from 0.98 to 0.81 (algorithm). Similar results were found for the PHQ-2: the recommended threshold of 3 had a sensitivity of 0.42 and lowering the threshold resulted in an improved sensitivity of 0.81. The PHQ-9 and the PHQ-2 are useful instruments to detect major depressive disorder in primary care, provided a high score is followed by an additional diagnostic work-up. However, often recommended thresholds for the PHQ-9 and the PHQ-2 resulted in many undetected major depressive disorders.
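The area under the ROC curve reported above can be read as the probability that a randomly chosen case scores higher than a randomly chosen non-case, with ties counted as one half. A minimal sketch of that pairwise definition follows; the scores are hypothetical and are not PREDICT-NL data.

```python
def auc_pairwise(case_scores, noncase_scores):
    """AUC as the share of (case, non-case) pairs ranked correctly.

    A pair counts 1 if the case scores higher, 0.5 on a tie, 0 otherwise.
    """
    total = 0.0
    for c in case_scores:
        for n in noncase_scores:
            if c > n:
                total += 1.0
            elif c == n:
                total += 0.5
    return total / (len(case_scores) * len(noncase_scores))


# Hypothetical questionnaire scores for illustration only.
auc = auc_pairwise(case_scores=[3, 4, 5], noncase_scores=[1, 2, 3])
print(round(auc, 3))
```

Lowering a cut-off moves along this curve: more cases are detected (higher sensitivity) at the cost of more non-cases being flagged (lower specificity), which is the trade-off the abstract describes.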
Wang, X T; Gao, L M; Xu, W; Ding, X
2016-10-20
Objective: To test the Beijing questionnaire as a means of identifying patients with obstructive sleep apnea hypopnea syndrome (OSAHS). Method: The Beijing questionnaire is designed as an explorative tool consisting of 11 questions for patients with obstructive sleep apnea hypopnea, and is targeted toward key symptoms including snoring, apneas, daytime sleepiness, hypertension and overweight. Questionnaires were given to 1 336 female participants aged ≥40 years living in communities and to 198 adult male subjects visiting clinics. Finally, 59 female and 198 male subjects underwent sleep studies after factor analysis, reliability check and internal consistency study. Correlation analysis was performed between the scores from the Beijing questionnaire and the apnea-hypopnea index from in-laboratory polysomnography. Receiver operating characteristic curves were constructed to determine optimal sensitivity and specificity. Twenty-four male subjects were recorded in the sleep laboratory again after surgery. Result: Factor analysis reduced the 11 questions of the scale to four common factors, as designed: snoring, apneas, other symptoms, and risk factors. Cronbach's α coefficient of the scale reached 0.7. There was an acceptable level of test-retest reliability (r=0.619, P<0.01). The apnea hypopnea indices were significantly correlated with the Beijing questionnaire scores (P<0.01). For women, a Beijing questionnaire score of 19.5 provided a sensitivity of 74.3% and a specificity of 62.5%. For men, a Beijing questionnaire score of 22.5 provided a sensitivity of 90.9% and a specificity of 54.5%. The postoperative Beijing questionnaire scores changed with the apnea hypopnea indices. Conclusion: This questionnaire has good validity and reliability and appears to be sensitive to clinical change. Copyright© by the Editorial Department of Journal of Clinical Otorhinolaryngology Head and Neck Surgery.
Merolla, Giovanni; Corona, Katia; Zanoli, Gustavo; Cerciello, Simone; Giannotti, Stefano; Porcellini, Giuseppe
2017-12-01
The Kerlan-Jobe Orthopaedic Clinic (KJOC) Shoulder and Elbow score is a reliable and sensitive tool to measure the performance of overhead athletes. The purpose of this study was to carry out a cross-cultural adaptation and validation of the KJOC questionnaire in Italian and to assess its reliability, validity, and responsiveness. Ninety professional athletes with a painful shoulder were included in this study and were assigned to the "injury group" (n = 32) or the "overuse group" (n = 58); 65 were managed conservatively and 25 were treated by arthroscopic surgery. To assess the reliability of the KJOC score, patients were asked to fill in the questionnaire at baseline and after 2 weeks. To test the construct validity, KJOC scores were compared to those obtained with the Italian version of the Disabilities of the Arm, Shoulder, and Hand (DASH) scale, and with the DASH sports/performing arts module. To test KJOC score responsiveness, the follow-up KJOC scores of the participants treated conservatively were compared to those of the patients treated by arthroscopic surgery. Statistical analysis demonstrated that the KJOC questionnaire is reliable in terms of the single items and the overall score (ICC 0.95-0.99); that it has high construct validity (r s = -0.697; p < 0.01); and that it is responsive to clinical differences in shoulder function (p < 0.0001). The Italian version of the KJOC Shoulder and Elbow score performed in a similar way to the English version and demonstrated good validity, reliability, and responsiveness after conservative and surgical treatment. II.
Patterns of pulmonary maturation in normal and abnormal pregnancy.
Goldkrand, J W; Slattery, D S
1979-03-01
Fetal pulmonary maturation may be a variable event depending on various feto-maternal environmental and biochemical influences. The patterns of maturation were studied in 211 amniotic fluid samples from 123 patients (normal 55; diabetes 23; Rh sensitization 19; preeclampsia 26). The phenomenon of globule formation from the amniotic fluid lipid extract and its relation to pulmonary maturity was utilized for this analysis. Validation of this technique is presented. A normal curve was constructed from 22 to 42 weeks' gestation and compared to the abnormal pregnancies. Patients with class A, B, and C diabetes and Rh-sensitized pregnancies had delayed pulmonary maturation. Patients with class D diabetes and preeclampsia paralleled the normal course of maturation. A discussion of these results and their possible cause is presented.
Cheng, Alice; Humayun, Aiza; Schwartz, Zvi
2016-01-01
The addition of porosity to the traditionally used solid titanium metal implants has been suggested to more closely mimic the natural mechanical properties of bone and increase osseointegration in dental and orthopedic implants. The objective of this study was to evaluate cellular response to three-dimensional (3D) porous Ti-6Al-4V constructs fabricated by additive manufacturing using laser sintering with low porosity (LP), medium porosity (MP), and high porosity (HP) with low resolution (LR) and high resolution (HR) based on a computed tomography scan of human trabecular bone. After surface processing, construct porosity ranged from 41.0% to 76.1%, but all possessed micro-/nanoscale surface roughness and similar surface chemistry containing mostly Ti, O, and C. Biological responses (osteoblast differentiation, maturation, and local factor production) by MG63 osteoblast-like cells and normal human osteoblasts favored 3D than two-dimensional (2D) solid constructs. First, MG63 cells were used to assess differences in cell response to 2D compared to LR and HR porous 3D constructs. MG63 cells were sensitive to porosity resolution and exhibited increased osteocalcin (OCN), vascular endothelial growth factor (VEGF), osteoprotegerin (OPG), and bone morphogenetic protein 2 (BMP2) on HR 3D constructs than on 2D and LR 3D constructs. MG63 cells also exhibited porosity-dependent responses on HR constructs, with up to a 6.9-fold increase in factor production on LP-HR and MP-HR constructs than on HP-HR constructs. NHOsts were then used to validate biological response on HR constructs. NHOsts exhibited decreased DNA content and alkaline phosphatase activity and up to a 2.9-fold increase in OCN, OPG, VEGF, BMP2, and BMP4 on 3D HR constructs than on 2D controls. These results indicate that osteoblasts prefer a 3D architecture than a 2D surface and that osteoblasts are sensitive to the resolution of trabecular detail and porosity parameters of laser-sintered 3D Ti-6Al-4V constructs. PMID:28804735
Cheng, Alice; Humayun, Aiza; Boyan, Barbara D; Schwartz, Zvi
2016-03-01
The addition of porosity to the traditionally used solid titanium metal implants has been suggested to more closely mimic the natural mechanical properties of bone and increase osseointegration in dental and orthopedic implants. The objective of this study was to evaluate cellular response to three-dimensional (3D) porous Ti-6Al-4V constructs fabricated by additive manufacturing using laser sintering with low porosity (LP), medium porosity (MP), and high porosity (HP) with low resolution (LR) and high resolution (HR) based on a computed tomography scan of human trabecular bone. After surface processing, construct porosity ranged from 41.0% to 76.1%, but all possessed micro-/nanoscale surface roughness and similar surface chemistry containing mostly Ti, O, and C. Biological responses (osteoblast differentiation, maturation, and local factor production) by MG63 osteoblast-like cells and normal human osteoblasts favored 3D than two-dimensional (2D) solid constructs. First, MG63 cells were used to assess differences in cell response to 2D compared to LR and HR porous 3D constructs. MG63 cells were sensitive to porosity resolution and exhibited increased osteocalcin (OCN), vascular endothelial growth factor (VEGF), osteoprotegerin (OPG), and bone morphogenetic protein 2 (BMP2) on HR 3D constructs than on 2D and LR 3D constructs. MG63 cells also exhibited porosity-dependent responses on HR constructs, with up to a 6.9-fold increase in factor production on LP-HR and MP-HR constructs than on HP-HR constructs. NHOsts were then used to validate biological response on HR constructs. NHOsts exhibited decreased DNA content and alkaline phosphatase activity and up to a 2.9-fold increase in OCN, OPG, VEGF, BMP2, and BMP4 on 3D HR constructs than on 2D controls. These results indicate that osteoblasts prefer a 3D architecture than a 2D surface and that osteoblasts are sensitive to the resolution of trabecular detail and porosity parameters of laser-sintered 3D Ti-6Al-4V constructs.
Lai, Y-C; Li, H-Y; Hung, C-S; Lin, M-S; Shih, S-R; Ma, W-Y; Hua, C-H; Chuang, L-M; Sung, F-C; Wei, J-N
2013-03-01
To evaluate whether homeostasis model assessment and high-sensitivity C-reactive protein improve the prediction of isolated post-load hyperglycaemia. The subjects were 1458 adults without self-reported diabetes recruited between 2006 and 2010. Isolated post-load hyperglycaemia was defined as fasting plasma glucose < 7 mmol/l and 2-h post-load plasma glucose ≥ 11.1 mmol/l. Risk scores of isolated post-load hyperglycaemia were constructed by multivariate logistic regression. An independent group (n = 154) was enrolled from 2010 to 2011 to validate the models' performance. One hundred and twenty-three subjects (8.28%) were newly diagnosed as having diabetes mellitus. Among those with undiagnosed diabetes, 64 subjects (52%) had isolated post-load hyperglycaemia. Subjects with isolated post-load hyperglycaemia were older, more centrally obese and had higher blood pressure, HbA(1c), fasting plasma glucose, triglycerides, LDL cholesterol, high-sensitivity C-reactive protein and homeostasis model assessment of insulin resistance and lower homeostasis model assessment of β-cell function than those without diabetes. The risk scores included age, gender, BMI, homeostasis model assessment, high-sensitivity C-reactive protein and HbA(1c). The full model had high sensitivity (84%) and specificity (87%) and area under the receiver operating characteristic curve (0.91), with a cut-off point of 23.81; validation in an independent data set showed 88% sensitivity, 77% specificity and an area under curve of 0.89. Over half of those with undiagnosed diabetes had isolated post-load hyperglycaemia. Homeostasis model assessment and high-sensitivity C-reactive protein are useful to identify subjects with isolated post-load hyperglycaemia, with improved performance over fasting plasma glucose or HbA(1c) alone. © 2012 The Authors. Diabetic Medicine © 2012 Diabetes UK.
McCormack, K.; Howell, B. R.; Guzman, D.; Villongco, C.; Pears, K.; Kim, H.; Gunnar, M.R.; Sanchez, M.M.
2014-01-01
One of the strongest predictors of healthy child development is the quality of maternal care. Although many measures of observation and self-report exist in humans to assess global aspects of maternal care, such qualitative measures are lacking in nonhuman primates. In this study we developed an instrument to measure global aspects of maternal care in rhesus monkeys, with the goal of complementing the individual behavioral data collected using a well-established rhesus macaque ethogram during the first months postpartum. The 22 items of the instrument were adapted from human maternal sensitivity assessments and a maternal Q-sort instrument already published for macaques. The 22 items formed four dimensions with high levels of internal reliability that represented major constructs of maternal care: 1) Sensitivity/Responsivity, 2) Protectiveness, 3) Permissiveness, and 4) Irritability. These dimensions yielded high construct validity when correlated with mother-infant frequency and duration behavior that was collected from focal observations across the first three postnatal months. In addition, comparisons of two groups of mothers (Maltreating versus Competent mothers), showed significant differences across the dimensions suggesting that this instrument has strong concurrent validity, even after controlling for focal observation variables that have been previously shown to significantly differentiate these groups. Our findings suggest that this Instrument of Macaque Maternal Care (IMMC) has the potential to capture global aspects of the mother-infant relationship that complement individual behaviors collected through focal observations. PMID:25066041
McCormack, K; Howell, B R; Guzman, D; Villongco, C; Pears, K; Kim, H; Gunnar, M R; Sanchez, M M
2015-01-01
One of the strongest predictors of healthy child development is the quality of maternal care. Although many measures of observation and self-report exist in humans to assess global aspects of maternal care, such qualitative measures are lacking in nonhuman primates. In this study, we developed an instrument to measure global aspects of maternal care in rhesus monkeys, with the goal of complementing the individual behavioral data collected using a well-established rhesus macaque ethogram during the first months postpartum. The 22 items of the instrument were adapted from human maternal sensitivity assessments and a maternal Q-sort instrument already published for macaques. The 22 items formed four dimensions with high levels of internal reliability that represented major constructs of maternal care: (1) Sensitivity/Responsivity, (2) Protectiveness, (3) Permissiveness, and (4) Irritability. These dimensions yielded high construct validity when correlated with mother-infant frequency and duration behavior that was collected from focal observations across the first 3 postnatal months. In addition, comparisons of two groups of mothers (Maltreating vs. Competent mothers) showed significant differences across the dimensions suggesting that this instrument has strong concurrent validity, even after controlling for focal observation variables that have been previously shown to significantly differentiate these groups. Our findings suggest that this Instrument of Macaque Maternal Care has the potential to capture global aspects of the mother-infant relationship that complement individual behaviors collected through focal observations. © 2014 Wiley Periodicals, Inc.
ERIC Educational Resources Information Center
Rossi, Robert Joseph
Methods drawn from four logical theories associated with studies of inductive processes are applied to the assessment and evaluation of experimental episode construct validity. It is shown that this application provides for estimates of episode informativeness with respect to the person examined in terms of the construct and to the construct…
Determining the Scoring Validity of a Co-Constructed CEFR-Based Rating Scale
ERIC Educational Resources Information Center
Deygers, Bart; Van Gorp, Koen
2015-01-01
Considering scoring validity as encompassing both reliable rating scale use and valid descriptor interpretation, this study reports on the validation of a CEFR-based scale that was co-constructed and used by novice raters. The research questions this paper wishes to answer are (a) whether it is possible to construct a CEFR-based rating scale with…
Environmental sensitivity: equivocal illness in the context of place.
Fletcher, Christopher M
2006-03-01
This article presents a phenomenologically oriented description of the interaction of illness experience, social context, and place. This is used to explore an outbreak of environmental sensitivities in Nova Scotia, Canada. Environmental Sensitivity (ES) is a popular designation for bodily reactions to mundane environmental stimuli that are insignificant for most people. Mainstream medicine cannot support the popular models of this disease process and consequently illness experience is subject to ambiguity and contestation. As an 'equivocal illness', ES generates considerable social action around the nature, meaning and validity of suffering. Sense of place plays an important role in this process. In this case, the meanings that accrue to illness experience and that produce salient popular disease etiology are grounded in the experience and social construction of the Nova Scotian landscape over time. Shifting representations of place are reflected in illness experience and the meanings that arise around illness are emplaced in landscape.
Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
2018-01-01
Introduction Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are among the most pivotal measures applied to tackle the problem, but much criticism has been raised regarding the validity and reliability of these scales. Objective To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. Method The methodology used was a narrative literature review. The literature was searched through CINAHL, PubMed, EBSCO, Medline and Google Scholar, and 26 scientific articles were identified. The articles were chosen for their direct relation to the objective under study and their scientific relevance. Results The construct and face validity of the Waterlow scale appear adequate, but with regard to content validity, changes in the age and gender categories could be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow scale is characterized by high specificity and low sensitivity. The inter-rater reliability has been shown to be inadequate, which may be due to a lack of clear definitions within the categories and differing levels of knowledge among users. Conclusion Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results. PMID:29736104
Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale.
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
2018-04-01
Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are among the most pivotal measures applied to tackle the problem, but much criticism has been raised regarding the validity and reliability of these scales. To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. The methodology used was a narrative literature review. The literature was searched through CINAHL, PubMed, EBSCO, Medline and Google Scholar, and 26 scientific articles were identified. The articles were chosen for their direct relation to the objective under study and their scientific relevance. The construct and face validity of the Waterlow scale appear adequate, but with regard to content validity, changes in the age and gender categories could be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow scale is characterized by high specificity and low sensitivity. The inter-rater reliability has been shown to be inadequate, which may be due to a lack of clear definitions within the categories and differing levels of knowledge among users. Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results.
Construct Validity of Neuropsychological Tests in Schizophrenia.
ERIC Educational Resources Information Center
Allen, Daniel N.; Aldarondo, Felito; Goldstein, Gerald; Huegel, Stephen G.; Gilbertson, Mark; van Kammen, Daniel P.
1998-01-01
The construct validity of neuropsychological tests in patients with schizophrenia was studied with 39 patients who were evaluated with a battery of six tests assessing attention, memory, and abstract reasoning abilities. Results support the construct validity of the neuropsychological tests in patients with schizophrenia. (SLD)
Ulloa, R E; Narváez, M R; Arroyo, E; del Bosque, J; de la Peña, F
2009-01-01
Teachers' rating scales for the evaluation of attention deficit hyperactivity disorder (ADHD) and conduct disorders have been shown to be useful and valid tools. The Child Psychiatric Hospital Teacher Questionnaire (CPHTQ) of the Hospital Psiquiátrico Infantil Dr. Juan N. Navarro was designed for the assessment of ADHD symptoms, externalizing symptoms and school functioning difficulties of children and adolescents. Internal consistency, criterion validity, construct validity and sensitivity of the scale to changes in symptom severity were evaluated in this study. The scale was administered to 282 teachers of children and adolescents aged 5 to 17 years who came to a unit specialized in child psychiatry. The validity analysis of the instrument showed that the internal consistency measured by Cronbach's alpha was 0.94. The factor analysis yielded 5 factors accounting for 59.1% of the variance: hyperactivity and conduct symptoms, predatory conduct disorder, inattentive, poor functioning and motor disturbances. The CPHTQ scores showed a positive correlation with the Clinical Global Impression (CGI) scale in the patients' response to drug treatment. The CPHTQ shows adequate validity characteristics that demonstrate its utility in the evaluation of patients with ADHD and its comorbidity with other behavior disorders.
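Cronbach's alpha, the internal consistency statistic reported above, is defined as α = k/(k − 1) · (1 − Σ item variances / variance of total scores), where k is the number of items. A minimal sketch follows; the response matrix is hypothetical and is not CPHTQ data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items matrix of scores.

    responses: list of rows, one row per respondent, one column per item.
    Sample variance is used for both the items and the total scores.
    """
    k = len(responses[0])
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical ratings: 4 respondents, 3 items.
ratings = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
]
alpha = cronbach_alpha(ratings)
print(round(alpha, 3))
```

Alpha rises when items covary strongly relative to their individual variances, so a value such as 0.94 indicates that the items largely rank respondents in the same order.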
Wood, Lisa; Burke, Eilish; Byrne, Rory; Enache, Gabriela; Morrison, Anthony P
2016-10-01
Stigma is a significant difficulty for people who experience psychosis. To date, there have been no outcome measures developed to examine stigma exclusively in people with psychosis. The aim of this study was to develop and validate a semi-structured interview measure of stigma (SIMS) in psychosis. The SIMS is an eleven-item measure of stigma developed in consultation with service users who have experienced psychosis. 79 participants with experience of psychosis were recruited for the purposes of this study. They were administered the SIMS alongside a battery of other relevant outcome measures to examine reliability and validity. A one-factor solution was identified for the SIMS which encompassed all ten rateable items. The measure met all reliability and validity criteria and illustrated good internal consistency, inter-rater reliability, test-retest reliability, criterion validity, construct validity, sensitivity to change and had no floor or ceiling effects. The SIMS is a reliable and valid measure of stigma in psychosis. It may be more engaging and acceptable than other stigma measures due to its semi-structured interview format. Crown Copyright © 2016. Published by Elsevier B.V. All rights reserved.
Student mathematical imagination instruments: construction, cultural adaptation and validity
NASA Astrophysics Data System (ADS)
Dwijayanti, I.; Budayasa, I. K.; Siswono, T. Y. E.
2018-03-01
Imagination has an important role as the center of sensorimotor activity of the students. The purpose of this research is to construct an instrument of students’ mathematical imagination in understanding the concept of algebraic expression. The researchers assessed validity using questionnaire and test techniques and analyzed the data using a descriptive method. The stages performed include: 1) constructing the embodiment of the imagination; 2) determining the learning style questionnaire; 3) constructing the instruments; 4) translating into Indonesian and adapting the learning style questionnaire content to student culture; 5) performing content validation. The results show that the constructed instrument is valid by content validation and empirical validation, so that it can be used with revisions. Content validation involved Indonesian linguists, English linguists and mathematics material experts. Empirical validation was done through a legibility test (10 students) and shows that in general the language used can be understood. In addition, a questionnaire trial (86 students) was analyzed using a point-biserial correlation technique, resulting in 16 valid items, with a KR-20 reliability in the medium category. The trial of the test instrument (32 students) found all items valid, with a KR-21 reliability of 0.62.
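The KR-20 coefficient mentioned above applies to dichotomously scored items and is defined as KR-20 = k/(k − 1) · (1 − Σ p·q / σ²), where p is the proportion answering an item correctly, q = 1 − p, and σ² is the variance of total scores. A minimal sketch follows; the response matrix is hypothetical, and the use of the population variance is an assumption (some texts use the sample variance instead).

```python
from statistics import pvariance


def kr20(responses):
    """Kuder-Richardson formula 20 for 0/1 item scores.

    responses: list of rows, one per student, each a list of 0/1 item scores.
    Population variance of total scores is assumed here.
    """
    n = len(responses)
    k = len(responses[0])
    total_var = pvariance([sum(row) for row in responses])
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n  # proportion correct on item j
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / total_var)


# Hypothetical answers: 4 students, 3 items (1 = correct, 0 = incorrect).
answers = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
reliability = kr20(answers)
print(reliability)
```

KR-21 is a simplification of the same formula that assumes all items are equally difficult, so it needs only the mean and variance of total scores.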
Lu, Hengyu; Villafane, Nicole; Dogruluk, Turgut; Grzeskowiak, Caitlin L; Kong, Kathleen; Tsang, Yiu Huen; Zagorodna, Oksana; Pantazi, Angeliki; Yang, Lixing; Neill, Nicholas J; Kim, Young Won; Creighton, Chad J; Verhaak, Roel G; Mills, Gordon B; Park, Peter J; Kucherlapati, Raju; Scott, Kenneth L
2017-07-01
Oncogenic gene fusions drive many human cancers, but tools to more quickly unravel their functional contributions are needed. Here we describe methodology permitting fusion gene construction for functional evaluation. Using this strategy, we engineered the known fusion oncogenes, BCR-ABL1, EML4-ALK, and ETV6-NTRK3, as well as 20 previously uncharacterized fusion genes identified in The Cancer Genome Atlas datasets. In addition to confirming oncogenic activity of the known fusion oncogenes engineered by our construction strategy, we validated five novel fusion genes involving MET, NTRK2, and BRAF kinases that exhibited potent transforming activity and conferred sensitivity to FDA-approved kinase inhibitors. Our fusion construction strategy also enabled domain-function studies of BRAF fusion genes. Our results confirmed other reports that the transforming activity of BRAF fusions results from truncation-mediated loss of inhibitory domains within the N-terminus of the BRAF protein. BRAF mutations residing within this inhibitory region may provide a means for BRAF activation in cancer; therefore, we leveraged the modular design of our fusion gene construction methodology to screen N-terminal domain mutations discovered in tumors that are wild-type at the BRAF mutation hotspot, V600. We identified an oncogenic mutation, F247L, whose expression robustly activated the MAPK pathway and sensitized cells to BRAF and MEK inhibitors. When applied broadly, these tools will facilitate rapid fusion gene construction for subsequent functional characterization and translation into personalized treatment strategies. Cancer Res; 77(13); 3502-12. ©2017 American Association for Cancer Research.
Measuring Work Functioning: Validity of a Weighted Composite Work Functioning Approach.
Boezeman, Edwin J; Sluiter, Judith K; Nieuwenhuijsen, Karen
2015-09-01
To examine the construct validity of a weighted composite work functioning measurement approach. Workers (health-impaired/healthy) (n = 117) completed a composite measure survey that recorded four central work functioning aspects with existing scales: capacity to work, quality of work performance, quantity of work, and recovery from work. Previously derived weights reflecting the relative importance of these aspects of work functioning were used to calculate the composite weighted work functioning score of the workers. Work role functioning, productivity, and quality of life were used for validation. Correlations were calculated and norms applied to examine convergent and divergent construct validity. A t test was conducted and a norm applied to examine discriminative construct validity. Overall the weighted composite work functioning measure demonstrated construct validity. As predicted, the weighted composite score correlated (p < .001) strongly (r > .60) with work role functioning and productivity (convergent construct validity), and moderately (.30 < r < .60) with physical quality of life and less strongly than work role functioning and productivity with mental quality of life (divergent validity). Further, the weighted composite measure detected that health-impaired workers show significantly worse work functioning than healthy workers, with a large effect size (Cohen's d > .80) (discriminative validity). The weighted composite work functioning measurement approach takes into account the relative importance of the different work functioning aspects and demonstrated good convergent, fair divergent, and good discriminative construct validity.
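The effect size used above for discriminative validity, Cohen's d, is the difference between two group means divided by their pooled standard deviation. A minimal sketch follows; the two score lists are hypothetical and do not come from the study.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Cohen's d using the pooled sample standard deviation."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)


# Hypothetical work functioning scores for two small groups.
d = cohens_d([2, 4, 6], [1, 3, 5])
print(d)
```

By the conventional norm cited in the abstract, d above .80 counts as a large effect; the toy value here would count as medium.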
Chan, Kin Sun
2018-01-01
Objectives This study aimed to evaluate the internal consistency, reliability, convergent validity, known-group comparisons, and structural validity of the Chinese version of Fear of Intimacy with Helping Professionals (C–FIS–HP) scale in Macau. Methods A cross-sectional design was used on a sample of 593 older people in 6 health centers. We used the Chinese version of the Exercise of Self-Care Agency Scale (C-ESCAS) and the Morisky 4-item medication adherence scale to evaluate self-care actions and medication adherence. The internal consistency and reliability of C–FIS–HP were analyzed using the Spearman-Brown split-half reliability, Cronbach’s alpha, and test–retest reliability. Convergent validity was tested between the construct of C–FIS–HP and self-care actions. Known-group comparisons differentiated predefined groups in an expected direction. Two separate samples were used to test the structural validity. An exploratory factor analysis (EFA) tested the factor structure of C–FIS–HP using principal axis factoring. A confirmatory factor analysis (CFA) was further conducted to confirm the factor structure constructed in the prior EFA. Results The C–FIS–HP had a Spearman-Brown split-half coefficient, Cronbach’s alpha, and intraclass correlation coefficient of 0.96, 0.93, and 0.96, respectively. Convergent validity was satisfactory, with significant correlations between the C–FIS–HP and C-ESCAS. C–FIS–HP differentiated between high-, moderate-, and low-medication-adherence groups. EFA demonstrated a two-factor structure among 297 older people. A first-order CFA was performed to confirm the construct dimensionality of C–FIS–HP with satisfactory fit indices (NFI = 0.92; IFI = 0.95; TLI = 0.94; CFI = 0.95 and RMSEA = 0.07) among 296 older people. Conclusions C–FIS–HP is a reliable and valid test for assessing helping relationships in older Chinese people.
Health professionals can use C–FIS–HP as a clinical tool to assess the comfort level of patients in a helping relationship, and use this information to develop culturally sensitive therapeutic interventions and treatment plans. Further studies need to be conducted concerning the different psychometric properties, as well as the application of C–FIS–HP in various regions. PMID:29795563
Psychometric properties and clinical utility of the Scale for Suicidal Ideation (SSI) in adolescents
Holi, Matti M; Pelkonen, Mirjami; Karlsson, Linnea; Kiviruusu, Olli; Ruuttu, Titta; Heilä, Hannele; Tuisku, Virpi; Marttunen, Mauri
2005-01-01
Background Accurate assessment of suicidality is of major importance in both clinical and research settings. The Scale for Suicidal Ideation (SSI) is a well-established clinician-rating scale but its suitability to adolescents has not been studied. The aim of this study was to evaluate the reliability and validity, and to test an appropriate cutoff threshold for the SSI in a depressed adolescent outpatient population and controls. Methods 218 adolescent psychiatric outpatient clinic patients suffering from depressive disorders and 200 age- and sex-matched school-attending controls were evaluated by the SSI for presence and severity of suicidal ideation. Internal consistency, discriminative-, concurrent-, and construct validity as well as the screening properties of the SSI were evaluated. Results Cronbach's α for the whole SSI was 0.95. The SSI total score differentiated patients and controls, and increased statistically significantly in classes with increasing severity of suicidality derived from the suicidality items of the K-SADS-PL diagnostic interview. Varimax-rotated principal component analysis of the SSI items yielded three theoretically coherent factors suggesting construct validity. Area under the receiver operating characteristic (ROC) curve was 0.84 for the whole sample and 0.80 for the patient sample. The optimal cutoff threshold for the SSI total score was 3/4 yielding sensitivity of 75% and specificity of 88.9% in this population. Conclusions SSI appears to be a reliable and a valid measure of suicidal ideation for depressed adolescents. PMID:15691388
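To make the screening figures concrete, the sketch below computes sensitivity and specificity at a given cutoff and then scans candidate cutoffs. All scores and case labels are invented for illustration. Selecting the cutoff by Youden's J (sensitivity + specificity − 1) is an assumption made here, as the abstract does not say how the optimal threshold was chosen. A "3/4" threshold is read as "score of 4 or more is screen-positive".

```python
def sens_spec(scores, has_condition, cutoff):
    """Treat score >= cutoff as screen-positive; return (sensitivity, specificity)."""
    pairs = list(zip(scores, has_condition))
    tp = sum(1 for s, c in pairs if s >= cutoff and c)
    fn = sum(1 for s, c in pairs if s < cutoff and c)
    tn = sum(1 for s, c in pairs if s < cutoff and not c)
    fp = sum(1 for s, c in pairs if s >= cutoff and not c)
    return tp / (tp + fn), tn / (tn + fp)


def best_cutoff(scores, has_condition):
    """Return the observed score that maximises Youden's J when used as cutoff."""
    def youden(cutoff):
        sens, spec = sens_spec(scores, has_condition, cutoff)
        return sens + spec - 1
    return max(sorted(set(scores)), key=youden)


# Hypothetical total scores and reference-standard labels (True = case).
scores = [0, 1, 2, 5, 3, 4, 6, 8]
cases = [False, False, False, False, True, True, True, True]

print(sens_spec(scores, cases, cutoff=4))
print(best_cutoff(scores, cases))
```

Both functions assume at least one case and one non-case in the sample; otherwise the denominators are zero.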
Advancing implementation science through measure development and evaluation: a study protocol.
Lewis, Cara C; Weiner, Bryan J; Stanick, Cameo; Fischer, Sarah M
2015-07-22
Significant gaps related to measurement issues are among the most critical barriers to advancing implementation science. Three issues motivated the study aims: (a) the lack of stakeholder involvement in defining pragmatic measure qualities; (b) the dearth of measures, particularly for implementation outcomes; and (c) unknown psychometric and pragmatic strength of existing measures. Aim 1: Establish a stakeholder-driven operationalization of pragmatic measures and develop reliable, valid rating criteria for assessing the construct. Aim 2: Develop reliable, valid, and pragmatic measures of three critical implementation outcomes, acceptability, appropriateness, and feasibility. Aim 3: Identify Consolidated Framework for Implementation Research and Implementation Outcome Framework-linked measures that demonstrate both psychometric and pragmatic strength. For Aim 1, we will conduct (a) interviews with stakeholder panelists (N = 7) and complete a literature review to populate pragmatic measure construct criteria, (b) Q-sort activities (N = 20) to clarify the internal structure of the definition, (c) Delphi activities (N = 20) to achieve consensus on the dimension priorities, (d) test-retest and inter-rater reliability assessments of the emergent rating system, and (e) known-groups validity testing of the top three prioritized pragmatic criteria. For Aim 2, our systematic development process involves domain delineation, item generation, substantive validity assessment, structural validity assessment, reliability assessment, and predictive validity assessment. We will also assess discriminant validity, known-groups validity, structural invariance, sensitivity to change, and other pragmatic features. For Aim 3, we will refine our established evidence-based assessment (EBA) criteria, extract the relevant data from the literature, rate each measure using the EBA criteria, and summarize the data. 
The study outputs of each aim are expected to have a positive impact as they will establish and guide a comprehensive measurement-focused research agenda for implementation science and provide empirically supported measures, tools, and methods for accomplishing this work.
Podsakoff, Nathan P; Podsakoff, Philip M; Mackenzie, Scott B; Klinger, Ryan L
2013-01-01
Several researchers have persuasively argued that the most important evidence to consider when assessing construct validity is whether variations in the construct of interest cause corresponding variations in the measures of the focal construct. Unfortunately, the literature provides little practical guidance on how researchers can go about testing this. Therefore, the purpose of this article is to describe how researchers can use video techniques to test whether their scales measure what they purport to measure. First, we discuss how researchers can develop valid manipulations of the focal construct that they hope to measure. Next, we explain how to design a study to use this manipulation to test the validity of the scale. Finally, comparing and contrasting traditional and contemporary perspectives on validation, we discuss the advantages and limitations of video-based validation procedures. PsycINFO Database Record (c) 2013 APA, all rights reserved.
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs.
Harvey, Naomi D; Craigon, Peter J; Blythe, Simon A; England, Gary C W; Asher, Lucy
2017-01-01
Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5-8, 8-12 and 5-12 months was high. All scales except Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations were found with training outcome at at least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs.
This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs.
Liu, May; Purohit, Shreya; Mazanetz, Joshua; Allen, Whitney; Kreaden, Usha S; Curet, Myriam
2018-01-01
Skill assessment during robotically assisted surgery remains challenging. While the popularity of the Global Evaluative Assessment of Robotics Skills (GEARS) has grown, its lack of discrimination between independent console skills limits its usefulness. The purpose of this study was to evaluate construct validity and interrater reliability of a novel assessment designed to overcome this limitation. We created the Assessment of Robotic Console Skills (ARCS), a global rating scale with six console skill domains. Fifteen volunteers who were console surgeons for 0 ("novice"), 1-100 ("intermediate"), or >100 ("experienced") robotically assisted procedures performed three standardized tasks. Three blinded raters scored the task videos using ARCS, with a 5-point Likert scale for each skill domain. Scores were analyzed for evidence of construct validity and interrater reliability. Group demographics were indistinguishable except for the number of robotically assisted procedures performed (p = 0.001). The mean scores of experienced subjects exceeded those of novices in dexterity (3.8 > 1.4, p < 0.001), field of view (4.1 > 1.8, p < 0.001), instrument visualization (3.9 > 2.2, p < 0.001), manipulator workspace (3.6 > 1.9, p = 0.001), and force sensitivity (4.3 > 2.6, p < 0.001). The mean scores of intermediate subjects exceeded those of novices in dexterity (2.8 > 1.4, p = 0.002), field of view (2.8 > 1.8, p = 0.021), instrument visualization (3.2 > 2.2, p = 0.045), manipulator workspace (3.1 > 1.9, p = 0.004), and force sensitivity (3.7 > 2.6, p = 0.033). The mean scores of experienced subjects exceeded those of intermediates in dexterity (3.8 > 2.8, p = 0.003), field of view (4.1 > 2.8, p < 0.001), and instrument visualization (3.9 > 3.2, p = 0.044). Rater agreement in each domain demonstrated statistically significant concordance (p < 0.05). We present strong evidence for construct validity and interrater reliability of ARCS. 
Our study shows that learning curves for some console skills plateau faster than others. Therefore, ARCS may be more useful than GEARS to evaluate distinct console skills. Future studies will examine why some domains did not adequately differentiate between subjects and applications for intraoperative use.
Bush, Hillary H; Eisenhower, Abbey; Briggs-Gowan, Margaret; Carter, Alice S
2015-01-01
Rooted in the theory of attention put forth by Mirsky, Anthony, Duncan, Ahearn, and Kellam (1991), the Structured Attention Module (SAM) is a developmentally sensitive, computer-based performance task designed specifically to assess sustained selective attention among 3- to 6-year-old children. The current study addressed the feasibility and validity of the SAM among 64 economically disadvantaged preschool-age children (mean age = 58 months; 55% female); a population known to be at risk for attention problems and adverse math performance outcomes. Feasibility was demonstrated by high completion rates and strong associations between SAM performance and age. Principal Factor Analysis with rotation produced robust support for a three-factor model (Accuracy, Speed, and Endurance) of SAM performance, which largely corresponded with existing theorized models of selective and sustained attention. Construct validity was evidenced by positive correlations between SAM Composite scores and all three SAM factors and IQ, and between SAM Accuracy and sequential memory. Value-added predictive validity was not confirmed through main effects of SAM on math performance above and beyond age and IQ; however, significant interactions by child sex were observed: Accuracy and Endurance both interacted with child sex to predict math performance. In both cases, the SAM factors predicted math performance more strongly for girls than for boys. There were no overall sex differences in SAM performance. In sum, the current findings suggest that interindividual variation in sustained selective attention, and potentially other aspects of attention and executive function, among young, high-risk children can be captured validly with developmentally sensitive measures.
[Spanish validation of the Boston Carpal Tunnel Questionnaire].
Oteo-Álvaro, Ángel; Marín, María T; Matas, José A; Vaquero, Javier
2016-03-18
To describe the process of cultural adaptation and validation of the Boston Carpal Tunnel Questionnaire (BCTQ) measuring symptom intensity, functional status and quality of life in carpal tunnel syndrome patients and to report the psychometric properties of this version. A 3-expert panel supervised the adaptation process. After translation, review and back-translation of the original instrument, a new Spanish version was obtained, which was administered to 2 patient samples: a pilot sample of 20 patients for assessing comprehension, and a 90-patient sample for assessing structural validity (factor analysis and reliability), construct validity and sensitivity to change. A re-test measurement was carried out in 21 patients. Follow-up was accomplished in 40 patients. The questionnaire was well accepted by all participants. A ceiling effect was observed for 3 items. Reliability was very good, internal consistency: αS=0.91 and αF=0.87; test-retest stability: rS=0.939 and rF=0.986. Both subscales fitted a general dimension. Subscales correlated with dynamometer measurements (rS=0.77 and rF=0.75) and were related to abnormal 2-point discrimination, muscle atrophy and electromyography deterioration level. Scores correlated appropriately with other validated instruments: Douleur Neuropathique 4 questions and Brief Pain Inventory. The BCTQ proved sensitive to clinical changes, with large effect sizes (dS=-3.3 and dF=-1.9). The Spanish version of the BCTQ shows good psychometric properties warranting its use in clinical settings. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Modeling Liver-Related Adverse Effects of Drugs Using kNN QSAR Method
Rodgers, Amie D.; Zhu, Hao; Fourches, Dennis; Rusyn, Ivan; Tropsha, Alexander
2010-01-01
Adverse effects of drugs (AEDs) continue to be a major cause of drug withdrawals both in development and post-marketing. While liver-related AEDs are a major concern for drug safety, there are few in silico models for predicting human liver toxicity for drug candidates. We have applied the Quantitative Structure Activity Relationship (QSAR) approach to model liver AEDs. In this study, we aimed to construct a QSAR model capable of binary classification (active vs. inactive) of drugs for liver AEDs based on chemical structure. To build QSAR models, we have employed an FDA spontaneous reporting database of human liver AEDs (elevations in activity of serum liver enzymes), which contains data on approximately 500 approved drugs. Approximately 200 compounds with wide clinical data coverage, structural similarity and balanced (40/60) active/inactive ratio were selected for modeling and divided into multiple training/test and external validation sets. QSAR models were developed using the k nearest neighbor method and validated using external datasets. Models with high sensitivity (>73%) and specificity (>94%) for prediction of liver AEDs in external validation sets were developed. To test applicability of the models, three chemical databases (World Drug Index, Prestwick Chemical Library, and Biowisdom Liver Intelligence Module) were screened in silico and the validity of predictions was determined, where possible, by comparing model-based classification with assertions in publicly available literature. Validated QSAR models of liver AEDs based on the data from the FDA spontaneous reporting system can be employed as sensitive and specific predictors of AEDs in pre-clinical screening of drug candidates for potential hepatotoxicity in humans. PMID:20192250
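The classification step can be illustrated with a plain k-nearest-neighbour vote. This is a generic sketch, not the authors' kNN QSAR procedure, which is not described here beyond its name. The two-dimensional descriptor vectors, the labels, and the choice of Euclidean distance with k = 3 are all assumptions made for the example.

```python
from collections import Counter
from math import dist


def knn_predict(train_x, train_y, query, k=3):
    """Label a query by majority vote among its k nearest training points."""
    nearest = sorted(range(len(train_x)),
                     key=lambda i: dist(train_x[i], query))[:k]
    votes = Counter(train_y[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical descriptor vectors; 1 = active (liver AED reported), 0 = inactive.
train_x = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
train_y = [0, 0, 0, 1, 1, 1]

# Compounds held out from training play the role of an external validation set.
external = [((0.2, 0.2), 0), ((5.5, 5.5), 1)]
for descriptors, observed in external:
    print(knn_predict(train_x, train_y, descriptors), observed)
```

Comparing predicted and observed labels on held-out compounds is what yields the sensitivity and specificity figures quoted for external validation.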
Relapse Risk Assessment for Schizophrenia Patients (RASP): A New Self-Report Screening Tool.
Velligan, Dawn; Carpenter, William; Waters, Heidi C; Gerlanc, Nicole M; Legacy, Susan N; Ruetsch, Charles
2018-01-01
The Relapse Assessment for Schizophrenia Patients (RASP) was developed as a six-question self-report screener that measures indicators of Increased Anxiety and Social Isolation to assess patient stability and predict imminent relapse. This paper describes the development and psychometric characteristics of the RASP. The RASP and Positive and Negative Syndrome Scale (PANSS) were administered to patients with schizophrenia (n=166) three separate times. Chart data were collected on a subsample of patients (n=81). Psychometric analyses of RASP included tests of reliability, construct validity, and concurrent validity of items. Factors from RASP were correlated with subscales from PANSS (sensitivity to change and criterion validity [agreement between RASP and evidence of relapse]). Test-retest reliability returned modest to strong agreement at the item level and strong agreement at the questionnaire level. RASP showed good item response curves and internal consistency for the total instrument and within each of the two subscales (Increased Anxiety and Social Isolation). RASP Total Score and subscales showed good concurrent validity when correlated with PANSS Total Score, Positive, Excitement, and Anxiety subscales. RASP correctly predicted relapse in 67% of cases, with good specificity and negative predictive power and acceptable positive predictive power and sensitivity. The reliability and validity data presented support the use of RASP in settings where addition of a brief self-report assessment of relapse risk among patients with schizophrenia may be of benefit. Ease of use and scoring, and the ability to administer without clinical supervision allows for routine administration and assessment of relapse risk.
Costa, Marta; Manton, James D; Ostrovsky, Aaron D; Prohaska, Steffen; Jefferis, Gregory S X E
2016-07-20
Neural circuit mapping is generating datasets of tens of thousands of labeled neurons. New computational tools are needed to search and organize these data. We present NBLAST, a sensitive and rapid algorithm for measuring pairwise neuronal similarity. NBLAST considers both position and local geometry, decomposing neurons into short segments; matched segments are scored using a probabilistic scoring matrix defined by statistics of matches and non-matches. We validated NBLAST on a published dataset of 16,129 single Drosophila neurons. NBLAST can distinguish neuronal types down to the finest level (single identified neurons) without a priori information. Cluster analysis of extensively studied neuronal classes identified new types and unreported topographical features. Fully automated clustering organized the validation dataset into 1,052 clusters, many of which map onto previously described neuronal types. NBLAST supports additional query types, including searching neurons against transgene expression patterns. Finally, we show that NBLAST is effective with data from other invertebrates and zebrafish. Copyright © 2016 MRC Laboratory of Molecular Biology. Published by Elsevier Inc. All rights reserved.
Construct validity of the individual work performance questionnaire.
Koopmans, Linda; Bernaards, Claire M; Hildebrandt, Vincent H; de Vet, Henrica C W; van der Beek, Allard J
2014-03-01
To examine the construct validity of the Individual Work Performance Questionnaire (IWPQ). A total of 1424 Dutch workers from three occupational sectors (blue, pink, and white collar) participated in the study. First, IWPQ scores were correlated with related constructs (convergent validity). Second, differences between known groups were tested (discriminative validity). First, IWPQ scores correlated weakly to moderately with absolute and relative presenteeism, and work engagement. Second, significant differences in IWPQ scores were observed for workers differing in job satisfaction, and workers differing in health. Overall, the results indicate acceptable construct validity of the IWPQ. Researchers are provided with a reliable and valid instrument to measure individual work performance comprehensively and generically, among workers from different occupational sectors, with and without health problems.
Development and Construct Validation of an Academic Emotions Scale
ERIC Educational Resources Information Center
Govaerts, Sophie; Gregoire, Jacques
2008-01-01
This article describes the development and two studies on the construct validity of the Academic Emotions Scale (AES). The AES is a French self-report questionnaire assessing six emotions in the context of school learning: enjoyment, hope, pride, anxiety, shame and frustration. Its construct validity was studied through exploratory and…
Construction and Validation of a Professional Suitability Scale for Social Work Practice
ERIC Educational Resources Information Center
Tam, Dora M. Y.; Coleman, Heather
2009-01-01
This article reports on the construction and validation of a professional suitability scale, designed for assessing students' suitability for social work practice. Data were collected from 188 field supervisors who provided usable questionnaires, representing a response rate of 74%. Construct validation by exploratory factor analysis identified a…
Construct Validation of the Fairy Tale Test--Standardization Data.
ERIC Educational Resources Information Center
Coulacoglou, Carina
2002-01-01
Studied the construct validity of the Fairy Tale Test (C. Coulacoglu, 1993), a personality projective test for children, in a sample of 800 Greek children aged 8, 10, and 12. Factor analysis led to identification of eight primary factors, and correlations with other measures provide construct validity evidence. (SLD)
2013-01-01
Background Yearly formative knowledge testing (also known as progress testing) was shown to have a limited construct-validity and reliability in postgraduate medical education. One way to improve construct-validity and reliability is to improve the authenticity of a test. As easily accessible internet has become inseparably linked to daily clinical practice, we hypothesized that allowing internet access for a limited amount of time during the progress test would improve the perception of authenticity (face-validity) of the test, which would in turn improve the construct-validity and reliability of postgraduate progress testing. Methods Postgraduate trainees taking the yearly knowledge progress test were asked to participate in a study where they could access the internet for 30 minutes at the end of a traditional pen and paper test. Before and after the test they were asked to complete a short questionnaire regarding the face-validity of the test. Results Mean test scores increased significantly for all training years. Trainees indicated that the face-validity of the test improved with internet access and that they would like to continue to have internet access during future testing. Internet access did not improve the construct-validity or reliability of the test. Conclusion Improving the face-validity of postgraduate progress testing, by adding the possibility to search the internet for a limited amount of time, positively influences test performance and face-validity. However, it did not change the reliability or the construct-validity of the test. PMID:24195696
Developing a validation for environmental sustainability
NASA Astrophysics Data System (ADS)
Adewale, Bamgbade Jibril; Mohammed, Kamaruddeen Ahmed; Nawi, Mohd Nasrun Mohd; Aziz, Zulkifli
2016-08-01
One of the agendas for addressing environmental protection in construction is to reduce impacts and make construction activities more sustainable. This consideration has generated considerable research interest within the construction industry, especially given construction's damaging effects on the ecosystem, such as various forms of environmental pollution, resource depletion and biodiversity loss on a global scale. Using the Partial Least Squares-Structural Equation Modeling technique, this study validates the environmental sustainability (ES) construct in the context of large construction firms in Malaysia. A cross-sectional survey was carried out in which data were collected from large Malaysian construction firms using a structured questionnaire. Results of this study revealed that business innovativeness and new technology are important in determining the environmental sustainability (ES) of Malaysian construction firms. The study also established an adequate level of internal consistency reliability, convergent validity and discriminant validity for each of its constructs. Based on this result, the indicators for the organisational innovativeness dimensions (business innovativeness and new technology) appear useful for measuring these constructs in order to study construction firms' tendency to adopt environmental sustainability (ES) in their project execution.
Divya, O; Mishra, Ashok K
2007-05-29
Quantitative determination of kerosene fraction present in diesel has been carried out based on excitation emission matrix fluorescence (EEMF) along with parallel factor analysis (PARAFAC) and N-way partial least squares regression (N-PLS). EEMF is a simple, sensitive and nondestructive method suitable for the analysis of multifluorophoric mixtures. Calibration models consisting of varying compositions of diesel and kerosene were constructed and their validation was carried out using leave-one-out cross validation method. The accuracy of the model was evaluated through the root mean square error of prediction (RMSEP) for the PARAFAC, N-PLS and unfold PLS methods. N-PLS was found to be a better method compared to PARAFAC and unfold PLS method because of its low RMSEP values.
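Leave-one-out cross-validation of a calibration model can be sketched as below. This is a deliberately simplified stand-in: it uses a univariate straight-line calibration rather than PARAFAC or N-PLS, and the signal and composition values are hypothetical. The error is reported as a root mean square over the held-out predictions, which is how the RMSEP figure in the abstract is understood here.

```python
from math import sqrt


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return slope, my - slope * mx


def loo_rmse(xs, ys):
    """Refit with each sample held out in turn and pool the prediction errors."""
    squared_errors = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        squared_errors.append((ys[i] - (slope * xs[i] + intercept)) ** 2)
    return sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical fluorescence signal vs. kerosene fraction (%) in diesel.
signal = [1.0, 2.1, 2.9, 4.2, 5.0]
kerosene_pct = [10.0, 20.0, 30.0, 40.0, 50.0]
print(loo_rmse(signal, kerosene_pct))
```

A lower value indicates better predictive accuracy, which is the basis on which the abstract ranks N-PLS above the other two methods.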
Construction of a technological semi-digital hadronic calorimeter using GRPC
NASA Astrophysics Data System (ADS)
Laktineh, I.
2011-04-01
A high-granularity semi-digital hadronic calorimeter using GRPC as the sensitive medium is one of the two HCAL options considered by the ILD collaboration to be proposed for the detector of the future International Linear Collider project. A prototype of 1 m³ has been conceived within the CALICE collaboration in order to validate this option. The prototype is intended to be as close as possible to the one proposed in the ILD Letter Of Intent. A few units made of 1 m² GRPC, fully equipped with semi-digital readout electronics and a new gas distribution design, were produced and successfully tested. In 2010 we intend to produce 40 similar units to be inserted in a self-supporting mechanical structure. The prototype will then be exposed to test beams at CERN for final validation.
The Child Adolescent Bullying Scale (CABS): Psychometric evaluation of a new measure.
Strout, Tania D; Vessey, Judith A; DiFazio, Rachel L; Ludlow, Larry H
2018-06-01
While youth bullying is a significant public health problem, healthcare providers have been limited in their ability to identify bullied youths due to the lack of a reliable and valid instrument appropriate for use in clinical settings. We conducted a multisite study to evaluate the psychometric properties of a new 22-item instrument for assessing youths' experiences of being bullied, the Child Adolescent Bullying Scale (CABS). The 20 items summed to produce the measure's score were evaluated here. Diagnostic performance was assessed through evaluation of sensitivity, specificity, predictive values, and area under the receiver operating characteristic (AUROC) curve. A sample of 352 youths from diverse racial, ethnic, and geographic backgrounds (188 female, 159 male, 5 transgender, sample mean age 13.5 years) was recruited from two clinical sites. Participants completed the CABS and existing youth bullying measures. Analyses grounded in classical test theory, including assessments of reliability and validity, item analyses, and principal components analysis, were conducted. The diagnostic performance and test characteristics of the CABS were also evaluated. The CABS comprises one component, accounting for 67% of observed variance. Analyses established evidence of internal consistency reliability (Cronbach's α = 0.97) and of construct and convergent validity. Sensitivity was 84%, specificity was 65%, and the AUROC curve was 0.74 (95% CI: 0.69-0.80). Findings suggest that the CABS holds promise as a reliable, valid tool for healthcare provider use in screening for bullying exposure in the clinical setting. © 2018 Wiley Periodicals, Inc.
ERIC Educational Resources Information Center
Lee, Hee-Sun; Liu, Ou Lydia; Linn, Marcia C.
2011-01-01
This study explores measurement of a construct called knowledge integration in science using multiple-choice and explanation items. We use construct and instructional validity evidence to examine the role multiple-choice and explanation items play in measuring students' knowledge integration ability. For construct validity, we analyze item…
Validity and Reliability of Psychosocial Factors Related to Breast Cancer Screening.
ERIC Educational Resources Information Center
Zapka, Jane G.; And Others
1991-01-01
The construct validity of hypothesized survey items and data reduction procedures for selected psychosocial constructs frequently used in breast cancer screening research were investigated in telephone interviews with randomly selected samples of 1,184 and 903 women and a sample of 169 Hispanic clinic clients. Validity of the constructs is…
2017-09-01
Validation of Model Updating and Damage Detection via Eigenvalue Sensitivity Methods with Artificial Boundary Conditions, by Matthew D. Bouwense...
Vaughan, Frances L; Neal, Jo Anne; Mulla, Farzana Nizam; Edwards, Barbara; Coetzer, Rudi
2017-04-01
The Brain Injury Cognitive Screen (BICS) was developed as an in-service cognitive assessment battery for acquired brain injury patients entering community rehabilitation. The BICS focuses on domains that are particularly compromised following TBI, and provides a broader and more detailed assessment of executive function, attention and information processing than comparable screening assessments. The BICS also includes brief assessments of perception, naming, and construction, which were predicted to be more sensitive to impairments following non-traumatic brain injury. The studies reported here examine preliminary evidence for its validity in post-acute rehabilitation. In Study 1, TBI patients completed the BICS and were compared with matched controls. Patients with focal lesions and matched controls were compared in Study 2. Study 3 examined demographic effects in a sample of normative data. TBI and focal lesion patients obtained significantly lower composite memory, executive function and attention and information processing BICS scores than healthy controls. Injury severity effects were also obtained. Logistic regression analyses indicated that each group of BICS memory, executive function and attention measures reliably differentiated TBI and focal lesion participants from controls. Design Recall, Prospective Memory, Verbal Fluency, and Visual Search test scores showed significant independent regression effects. Other subtest measures showed evidence of sensitivity to brain injury. The study provides preliminary evidence of the BICS' sensitivity to cognitive impairment caused by acquired brain injury, and its potential clinical utility as a cognitive screen. Further validation based on a revised version of the BICS and more normative data are required.
Koutsogiannou, Persa; Dimoliatis, Ioannis D K; Mavridis, Dimitris; Bellos, Stefanos; Karathanos, Vassilis; Jelastopulu, Eleni
2015-11-30
The Greek version of the Postgraduate Hospital Educational Environment Measure (PHEEM) was evaluated to determine its psychometric properties, i.e., validity, internal consistency, sensitivity and responsiveness, for use in measuring the learning environment in Greek hospitals. The PHEEM was administered to Greek hospital residents. Internal consistency was measured using Cronbach's alpha. Root Mean Square Error of Approximation (RMSEA) was used to evaluate the fit of Structural Equation Models. Content validity was addressed by the original study. Construct validity was tested using confirmatory (to test the set of underlying dimensions suggested by the original study) and exploratory (to explore the dimensions needed to explain the variability of the given answers) factor analysis using Varimax rotation. Convergent validity was calculated by Pearson's correlation coefficient regarding the participant's PHEEM score and participant's overall satisfaction score of the added item "Overall, I am very satisfied with my specialization in this post". Sensitivity was checked by comparing good versus poor aspects of the educational environment and by satisfied versus unsatisfied participants. A total of 731 residents from 83 hospitals and 41 prefectures responded to the PHEEM. The original three-factor model did not fit better than a one-factor model, which accounted for 32% of the variance. Cronbach's α was 0.933 when assuming a one-factor model. Using a three-factor model (autonomy, teaching, social support), Cronbach's α values were 0.815 (expected 0.830), 0.908 (0.839), 0.734 (0.793), respectively. The three-factor model gave an RMSEA value of 0.074 (90% confidence interval 0.071, 0.076), suggesting a fair fit. Pearson's correlation coefficient between total PHEEM and global satisfaction was 0.765. Mean question scores ranged from 19.0 (very poor) to 73.7 (very good), and mean participant scores from 5.5 (very unsatisfied) to 96.5 (very satisfied). The Greek version of PHEEM is a valid, reliable, and sensitive instrument measuring the educational environment among junior doctors in Greek hospitals and it can be used for evidence-based SWOT analysis and policy.
Qin, Caidie; Bai, Xue; Zhang, Yue; Gao, Kai
2018-05-03
A photoelectrochemical wire microelectrode was constructed based on the use of a TiO2 nanotube array with electrochemically deposited CdSe semiconductor. A strongly amplified photocurrent is generated on the sensor surface. The microsensor has a response in the 0.05-20 μM dopamine (DA) concentration range and a 16.7 μM detection limit at a signal-to-noise ratio of 3. Sensitivity, recovery and reproducibility of the sensor were validated by detecting DA in spiked human urine, and satisfactory results were obtained. Graphical abstract: Schematic of a sensitive photoelectrochemical microsensor based on CdSe modified TiO2 nanotube array. The photoelectrochemical microsensor was successfully applied to the determination of dopamine in urine samples.
[Validation of the German Version of Tinnitus Functional Index (TFI)].
Brüggemann, Petra; Szczepek, Agnieszka J; Kleinjung, Tobias; Ojo, Michael; Mazurek, Birgit
2017-09-01
Tinnitus is among the seriously debilitating auditory conditions and is often complicated by comorbidities such as insomnia, difficulties with concentration, depression, frustration and irritability. To facilitate the grading of symptoms and the effects of therapeutic strategies, we validated a German-version Tinnitus Functional Index (TFI) in 229 subjects suffering from chronic tinnitus. Outcome validity was assessed using the Tinnitus Questionnaire (TQ, German adaptation by Goebel and Hiller [1998]). Construct validity was assessed using the "Hamburger Allgemeine Depressionsskala" (HADS). The German TFI featured excellent internal consistency (total score Cronbach's α=0.93). Factor analysis disclosed eight TFI subscales as proposed earlier by Meikle et al. [2012]. Intercorrelations were strong both between the TFI and the TQ (r=0.83), and between the TFI and the HADS (depression r=0.49, anxiety r=0.51). The German-version TFI qualifies as a rapid and statistically robust tool for grading the impact of tinnitus on daily living and for the measurements of therapeutic effects. Regarding depressive symptomatology, sensitivity of the TFI was comparable to that of the TQ. © Georg Thieme Verlag KG Stuttgart · New York.
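Several abstracts in this listing report internal consistency as Cronbach's α. As a minimal sketch of the standard formula, α = k/(k-1) · (1 - Σ item variances / variance of total scores), the Python below computes α for a small response matrix. The function name `cronbach_alpha` and the four-respondent, three-item data set are hypothetical and unrelated to any study above.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])                       # number of items
    items = list(zip(*rows))               # transpose: one tuple per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical responses (assumption): 4 respondents x 3 items.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.3f}")
```

The sketch uses sample variances (n - 1 denominator) for both the items and the totals; the ratio, and therefore α, is the same if population variances are used consistently instead.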
The Cerebral Palsy Quality of Life for Children (CP QOL-Child): Evidence of Construct Validity
ERIC Educational Resources Information Center
Chen, Kuan-Lin; Wang, Hui-Yi; Tseng, Mei-Hui; Shieh, Jeng-Yi; Lu, Lu; Yao, Kai-Ping Grace; Huang, Chien-Yu
2013-01-01
The Cerebral Palsy Quality of Life for Children (CP QOL-Child) is the first health condition-specific questionnaire designed for measuring QOL in children with cerebral palsy (CP). However, its construct validity has not yet been confirmed by confirmatory factor analysis (CFA). Hence, this study assessed the construct validity of the caregiver…
ERIC Educational Resources Information Center
Gold, Bernadette; Holodynski, Manfred
2015-01-01
The current study describes the development and construct validation of a situational judgment test for assessing the strategic knowledge of classroom management in elementary schools. Classroom scenarios and accompanying courses of action were constructed, of which 17 experts confirmed the content validity. A pilot study and a cross-validation…
Bayard, Sophie; Lebrun, Cindy; Maudarbocus, Khaalid Hassan; Schellaert, Vanessa; Joffre, Alicia; Ferrante, Esther; Le Louedec, Marie; Cournoulat, Alice; Gely-Nargeot, Marie-Christine; Luik, Annemarie I
2017-12-01
Insomnia disorder is frequent in the population, yet there is no French screening instrument available that is based on the updated DSM-5 criteria. We evaluated the validity and reliability of the French version of an insomnia screening instrument based on DSM-5 criteria, the Sleep Condition Indicator, in a population-based sample of adults. A total of 366 community-dwelling participants completed a face-to-face clinical interview to determine insomnia disorder against DSM-5 criteria and several questionnaires including the French Sleep Condition Indicator version. Three hundred and twenty-nine participants completed the Sleep Condition Indicator again after 1 month. Statistical analyses were performed to determine the reliability, construct validity, divergent validity and temporal stability of the French translation of the Sleep Condition Indicator. In addition, an exploratory factor analysis was performed to assess the underlying structure. The internal consistency (α = 0.87) and temporal stability (r = 0.86, P < 0.001) of the French Sleep Condition Indicator were high. When using the previously defined cut-off value of ≤ 16, the area under the receiver operating characteristic curve was 0.93 with a sensitivity of 95% and a specificity of 75%. Additionally, good construct and divergent validity were demonstrated. The factor analyses showed a two-factor structure with a focus on sleep and daytime effects. The French version of the Sleep Condition Indicator demonstrates satisfactory psychometric properties while being a useful instrument in detecting cases of insomnia disorder, consistent with features of DSM-5, in the general population. © 2017 European Sleep Research Society.
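To illustrate how an area under the ROC curve and a cut-off of the form "score ≤ 16" relate, the Python sketch below computes the AUROC as the probability that a randomly chosen case scores lower than a randomly chosen non-case (ties count one half), then evaluates sensitivity and specificity at the cut-off. The function names and all scores are hypothetical; they assume a scale on which lower scores indicate the disorder and are not data from the study above.

```python
def auroc_lower_is_positive(case_scores, noncase_scores):
    """AUROC when LOWER scores indicate the condition (pairwise comparison)."""
    wins = 0.0
    for c in case_scores:
        for n in noncase_scores:
            if c < n:
                wins += 1.0      # case correctly ranked below non-case
            elif c == n:
                wins += 0.5      # ties count one half
    return wins / (len(case_scores) * len(noncase_scores))


def sens_spec_at_cutoff(case_scores, noncase_scores, cutoff):
    """Sensitivity and specificity when 'score <= cutoff' screens positive."""
    sens = sum(c <= cutoff for c in case_scores) / len(case_scores)
    spec = sum(n > cutoff for n in noncase_scores) / len(noncase_scores)
    return sens, spec


# Hypothetical scores (assumption, not study data).
cases = [8, 12, 15, 16, 19]
noncases = [14, 18, 22, 25, 27, 30]

auc = auroc_lower_is_positive(cases, noncases)
sens, spec = sens_spec_at_cutoff(cases, noncases, cutoff=16)
print(f"AUROC = {auc:.3f}, sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

The AUROC summarizes discrimination across all possible cut-offs, whereas sensitivity and specificity describe a single chosen threshold, which is why abstracts typically report both.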
Herzog, Annabel; Voigt, Katharina; Meyer, Björn; Wollburg, Eileen; Weinmann, Nina; Langs, Gernot; Löwe, Bernd
2015-06-01
The new DSM-5 Somatic Symptom Disorder (SSD) emphasizes the importance of psychological processes related to somatic symptoms in patients with somatoform disorders. To address this, the Somatic Symptoms Experiences Questionnaire (SSEQ), the first self-report scale that assesses a broad range of psychological and interactional characteristics relevant to patients with a somatoform disorder or SSD, was developed. This prospective study was conducted to validate the SSEQ. The 15-item SSEQ was administered along with a battery of self-report questionnaires to psychosomatic inpatients. Patients were assessed with the Structured Clinical Interview for DSM-IV to confirm a somatoform, depressive, or anxiety disorder. Confirmatory factor analyses, tests of internal consistency and tests of validity were performed. Patients (n=262) with a mean age of 43.4 years, 60.3% women, were included in the analyses. The previously observed four-factor model was replicated and internal consistency was good (Cronbach's α=.90). Patients with a somatoform disorder had significantly higher scores on the SSEQ (t=4.24, p<.001) than patients with a depressive/anxiety disorder. Construct validity was shown by high correlations with other instruments measuring related constructs. Hierarchical multiple regression analyses showed that the questionnaire predicted health-related quality of life. Sensitivity to change was shown by significantly higher effect sizes of the SSEQ change scores for improved patients than for patients without improvement. The SSEQ appears to be a reliable, valid, and efficient instrument to assess a broad range of psychological and interactional features related to the experience of somatic symptoms. Copyright © 2015 Elsevier Inc. All rights reserved.
Validation of the Virtual MET as an assessment tool for executive functions.
Rand, Debbie; Basha-Abu Rukan, Soraya; Weiss, Patrice L Tamar; Katz, Noomi
2009-08-01
The purpose of this study was to establish ecological validity and initial construct validity of a Virtual Multiple Errands Test (VMET) as an assessment tool for executive functions. It was implemented within the Virtual Mall (VMall), a novel functional video-capture virtual shopping environment. The main objectives were (1) to examine the relationships between the performance of three groups of participants in the Multiple Errands Test (MET) carried out in a real shopping mall and their performance in the VMET, (2) to assess the relationships between the MET and VMET of the post-stroke participant's level of executive functioning and independence in instrumental activities of daily living, and (3) to compare the performance of post-stroke participants to those of healthy young and older controls in both the MET and VMET. The study population included three groups; post-stroke participants (n = 9), healthy young participants (n = 20), and healthy older participants (n = 20). The VMET was able to differentiate between two age groups of healthy participants and between healthy and post-stroke participants thus demonstrating that it is sensitive to brain injury and ageing and supports construct validity between known groups. In addition, significant correlations were found between the MET and the VMET for both the post-stroke participants and older healthy participants. This provides initial support for the ecological validity of the VMET as an assessment tool of executive functions. However, further psychometric data on temporal stability are needed, namely test-retest reliability and responsiveness, before it is ready for clinical application. Further research using the VMET as an assessment tool within the VMall with larger groups and in additional populations is also recommended.
Edmonds, Lisa A; Donovan, Neila J
2012-04-01
There is a pressing need for psychometrically sound naming materials for Spanish/English bilingual adults. To address this need, in this study the authors examined the psychometric properties of An Object and Action Naming Battery (An O&A Battery; Druks & Masterson, 2000) in bilingual speakers. Ninety-one Spanish/English bilinguals named O&A Battery items in English and Spanish. Responses underwent a Rasch analysis. Using correlation and regression analyses, the authors evaluated the effect of psycholinguistic (e.g., imageability) and participant (e.g., proficiency ratings) variables on accuracy. Rasch analysis determined unidimensionality across English and Spanish nouns and verbs and robust item-level psychometric properties, evidence for content validity. Few items did not fit the model, there were no ceiling or floor effects after uninformative and misfit items were removed, and items reflected a range of difficulty. Reliability coefficients were high, and the number of statistically different ability levels provided indices of sensitivity. Regression analyses revealed significant correlations between psycholinguistic variables and accuracy, providing preliminary construct validity. The participant variables that contributed most to accuracy were proficiency ratings and time of language use. Results suggest adequate content and construct validity of O&A items retained in the analysis for Spanish/English bilingual adults and support future efforts to evaluate naming in older bilinguals and persons with bilingual aphasia.
NASA Astrophysics Data System (ADS)
Flanagan, S.; Hurtt, G. C.; Fisk, J. P.; Rourke, O.
2012-12-01
A robust understanding of the sensitivity of the pattern, structure, and dynamics of ecosystems to climate, climate variability, and climate change is needed to predict ecosystem responses to current and projected climate change. We present results of a study designed to first quantify the sensitivity of ecosystems to climate through the use of climate and ecosystem data, and then use the results to test the sensitivity of the climate data in a state-of-the-art ecosystem model. A database of available ecosystem characteristics such as mean canopy height, above ground biomass, and basal area was constructed from sources like the National Biomass and Carbon Dataset (NBCD). The ecosystem characteristics were then paired by latitude and longitude with the corresponding climate characteristics temperature, precipitation, photosynthetically active radiation (PAR) and dew point that were retrieved from the North American Regional Reanalysis (NARR). The average yearly and seasonal means of the climate data, and their associated maximum and minimum values, over the 1979-2010 time frame provided by NARR were constructed and paired with the ecosystem data. The compiled results provide natural patterns of vegetation structure and distribution with regard to climate data. An advanced ecosystem model, the Ecosystem Demography model (ED), was then modified to allow yearly alterations to its mechanistic climate lookup table and used to predict the sensitivities of ecosystem pattern, structure, and dynamics to climate data. The combined ecosystem structure and climate data results were compared to ED's output to check the validity of the model. After verification, climate change scenarios such as those used in the last IPCC were run and future forest structure changes due to climate sensitivities were identified. The results of this study can be used to both quantify and test key relationships for next-generation models. The sensitivity of ecosystem characteristics to climate data shown in the database construction and by the model reinforces the need for high-resolution datasets and stresses the importance of understanding and incorporating climate change scenarios into earth system models.
Bell, Cheryl; Johnston, Derek; Allan, Julia; Pollard, Beth; Johnston, Marie
2017-05-01
The Demand-Control (DC) and Effort-Reward Imbalance (ERI) models predict health in a work context. Self-report measures of the four key constructs (demand, control, effort, and reward) have been developed and it is important that these measures have good content validity uncontaminated by content from other constructs. We assessed relevance (whether items reflect the constructs) and representativeness (whether all aspects of the construct are assessed, and all items contribute to that assessment) across the instruments and items. Two studies examined fourteen demand/control items from the Job Content Questionnaire and seventeen effort/reward items from the Effort-Reward Imbalance measure using discriminant content validation and a third study developed new methods to assess instrument representativeness. Both methods use judges' ratings and construct definitions to get transparent quantitative estimates of construct validity. Study 1 used dictionary definitions while studies 2 and 3 used published phrases to define constructs. Overall, 3/5 demand items, 4/9 control items, 1/6 effort items, and 7/11 reward items were uniquely classified to the appropriate theoretical construct and were therefore 'pure' items with discriminant content validity (DCV). All pure items measured a defining phrase. However, both the DC and ERI assessment instruments failed to assess all defining aspects. Finding good discriminant content validity for demand and reward measures means these measures are usable and our quantitative results can guide item selection. By contrast, effort and control measures had limitations (in relevance and representativeness) presenting a challenge to the implementation of the theories. Statement of contribution What is already known on this subject? 
While the reliability and construct validity of Demand-Control and Effort-Reward-Imbalance (DC and ERI) work stress measures are routinely reported, there has not been adequate investigation of their content validity. This paper investigates their content validity in terms of both relevance and representativeness and provides a model for the investigation of content validity of measures in health psychology more generally. What does this study add? A new application of an existing method, discriminant content validity, and a new method of assessing instrument representativeness. 'Pure' DC and ERI items are identified, as are constructs that are not fully represented by their assessment instruments. The findings are important for studies attempting to distinguish between the main DC and ERI work stress constructs. The quantitative results can be used to guide item selection for future studies. © 2017 The British Psychological Society.
Nigg, Claudio R; Motl, Robert W; Horwath, Caroline; Dishman, Rod K
2012-01-01
Objectives: Physical activity (PA) research applying the Transtheoretical Model (TTM) to examine group differences and/or change over time requires preliminary evidence of factorial validity and invariance. The current study examined the factorial validity and longitudinal invariance of TTM constructs recently revised for PA. Method: Participants from an ethnically diverse sample in Hawaii (N=700) completed questionnaires capturing each TTM construct. Results: Factorial validity was confirmed for each construct using confirmatory factor analysis with full-information maximum likelihood. Longitudinal invariance was evidenced across a shorter (3-month) and longer (6-month) time period via nested model comparisons. Conclusions: The questionnaires for each validated TTM construct are provided, and can now be generalized across similar subgroups and time points. Further validation of the provided measures is suggested in additional populations and across extended time points. PMID:22778669
Jiménez-Huete, Adolfo; Riva, Elena; Toledano, Rafael; Campo, Pablo; Esteban, Jesús; Barrio, Antonio Del; Franch, Oriol
2014-12-01
The validity of neuropsychological tests for the differential diagnosis of degenerative dementias may depend on the clinical context. We constructed a series of logistic models taking into account this factor. We retrospectively analyzed the demographic and neuropsychological data of 301 patients with probable Alzheimer's disease (AD), frontotemporal degeneration (FTLD), or dementia with Lewy bodies (DLB). Nine models were constructed taking into account the diagnostic question (eg, AD vs DLB) and subpopulation (incident vs prevalent). The AD versus DLB model for all patients, including memory recovery and phonological fluency, was highly accurate (area under the curve = 0.919, sensitivity = 90%, and specificity = 80%). The results were comparable in incident and prevalent cases. The FTLD versus AD and DLB versus FTLD models were both inaccurate. The models constructed from basic neuropsychological variables allowed an accurate differential diagnosis of AD versus DLB but not of FTLD versus AD or DLB. © The Author(s) 2014.
Moghadam, Manije; Salavati, Mahyar; Sahaf, Robab; Rassouli, Maryam; Moghadam, Mojgan; Kamrani, Ahmad Ali Akbari
2018-03-01
After forward-backward translation, the LSS was administered to 334 Persian-speaking, cognitively healthy elderly aged 60 years and over recruited through convenience sampling. To analyze the validity of the model's constructs and the relationships between the constructs, a confirmatory factor analysis followed by PLS analysis was performed. Construct validity was further investigated by calculating the correlations between the LSS and the "Short Form Health Survey" (SF-36) subscales measuring similar and dissimilar constructs. The LSS was re-administered to 50 participants a month later to assess the reliability. For the eight-factor model of the life satisfaction construct, adequate goodness of fit between the hypothesized model and the model derived from the sample data was attained (positive and statistically significant beta coefficients, good R-squares and acceptable GoF). Construct validity was supported by convergent and discriminant validity, and correlations between the LSS and SF-36 subscales. The minimum Intraclass Correlation Coefficient level of 0.60 was exceeded by all subscales. The minimum level of reliability indices (Cronbach's α, composite reliability and indicator reliability) was exceeded by all subscales. The Persian version of the Life Satisfaction Scale is a reliable and valid instrument, with psychometric properties which are consistent with the original version.
Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin
2014-01-01
The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. 
The instrument is sensitive to sociodemographic and clinical variables pertinent to persons with urostomies, colostomies, and ileostomies.
Singh, Amika S; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Vik, Froydis N; van Lippevelde, Wendy; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; van der Sluijs, Maria; Terwee, Caroline; Brug, Johannes
2012-08-13
Insight into parental energy balance-related behaviours, their determinants and parenting practices is important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10-12 year old children. We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10-12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement. All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. Of the other 29 items, construct validity was moderate for 24 and poor for 5 items. The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
2013-01-01
Background A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Results Of the three domains identified by factor analysis, the internal consistency was excellent for ‘pain expression’ and ‘psychomotor change’ (0.86 and 0.87) but not for ‘physiological variables’ (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and ‘gold standard’ evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0–30 points) with 96.5% sensitivity and 99.5% specificity. Conclusions The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy. PMID:23867090
Brondani, Juliana T; Mama, Khursheed R; Luna, Stelio P L; Wright, Bonnie D; Niyom, Sirirat; Ambrosio, Jennifer; Vogel, Pamela R; Padovani, Carlos R
2013-07-17
A scale validated in one language is not automatically valid in another language or culture. The purpose of this study was to validate the English version of the UNESP-Botucatu multidimensional composite pain scale (MCPS) to assess postoperative pain in cats. The English version was developed using translation, back-translation, and review by individuals with expertise in feline pain management. In sequence, validity and reliability tests were performed. Of the three domains identified by factor analysis, the internal consistency was excellent for 'pain expression' and 'psychomotor change' (0.86 and 0.87) but not for 'physiological variables' (0.28). Relevant changes in pain scores at clinically distinct time points (e.g., post-surgery, post-analgesic therapy), confirmed the construct validity and responsiveness (Wilcoxon test, p < 0.001). Favorable correlation with the IVAS scores (p < 0.001) and moderate to very good agreement between blinded observers and 'gold standard' evaluations, supported criterion validity. The cut-off point for rescue analgesia was > 7 (range 0-30 points) with 96.5% sensitivity and 99.5% specificity. The English version of the UNESP-Botucatu-MCPS is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy, when used by anesthesiologists or anesthesia technicians. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy.
VanDierendonck, Machteld C; van Loon, Johannes P A M
2016-10-01
This study presents the validation of two recently described pain scales, the Equine Utrecht University Scale for Composite Pain Assessment (EQUUS-COMPASS) and the Equine Utrecht University Scale for Facial Assessment of Pain (EQUUS-FAP), in horses with acute colic. A follow-up cohort study of 46 adult horses (n = 23 with acute colic; n = 23 healthy control horses) was performed for validation and refinement of the constructed scales. Both pain scales showed statistically significant differences between horses with colic and healthy control horses, and between horses with colic that could be treated conservatively and those that required surgical treatment or were euthanased. Sensitivity and specificity were good for both EQUUS-COMPASS (87% and 71%, respectively) and EQUUS-FAP (77% and 100%, respectively) and were not substantially influenced by applying weighting factors to the individual parameters. Copyright © 2016. Published by Elsevier Ltd.
[A short form of the positions on nursing diagnosis scale: development and psychometric testing].
Romero-Sánchez, José Manuel; Paloma-Castro, Olga; Paramio-Cuevas, Juan Carlos; Pastor-Montero, Sonia María; O'Ferrall-González, Cristina; Gabaldón-Bravo, Eva Maria; González-Domínguez, Maria Eugenia; Castro-Yuste, Cristina; Frandsen, Anna J; Martínez-Sabater, Antonio
2013-06-01
The Positions on Nursing Diagnosis (PND) is a scale that uses the semantic differential technique to measure nurses' attitudes towards the nursing diagnosis concept. The aim of this study was to develop a shortened form of the Spanish version of this scale and evaluate its psychometric properties and efficiency. A double theoretical-empirical approach was used to obtain a short form of the PND, the PND-7-SV, which would be equivalent to the original. Using a cross-sectional survey design, the reliability (internal consistency and test-retest reliability), construct (exploratory factor analysis, known-groups technique and discriminant validity) and criterion-related validity (concurrent validity), sensitivity to change and efficiency of the PND-7-SV were assessed in a sample of 476 Spanish nursing students. The results endorsed the utility of the PND-7-SV to measure attitudes toward nursing diagnosis in an equivalent manner to the complete form of the scale and in a shorter time.
Validation of the Mobile Information Software Evaluation Tool (MISET) With Nursing Students.
Secco, M Loretta; Furlong, Karen E; Doyle, Glynda; Bailey, Judy
2016-07-01
This study evaluated the Mobile Information Software Evaluation Tool (MISET) with a sample of Canadian undergraduate nursing students (N = 240). Psychometric analyses determined how well the MISET assessed the extent that nursing students find mobile device-based information resources useful and supportive of learning in the clinical and classroom settings. The MISET has a valid three-factor structure with high explained variance (74.7%). Internal consistency reliabilities were high for the MISET total (.90) and three subscales: Usefulness/Helpfulness, Information Literacy Support, and Use of Evidence-Based Sources (.87 to .94). Construct validity evidence included significantly higher mean total MISET, Helpfulness/Usefulness, and Information Literacy Support scores for senior students and those with higher computer competence. The MISET is a promising tool to evaluate mobile information technologies and information literacy support; however, longitudinal assessment of changes in scores over time would determine scale sensitivity and responsiveness. [J Nurs Educ. 2016;55(7):385-390.]. Copyright 2016, SLACK Incorporated.
van Steensel, Francisca J A; Deutschman, Amber A C G; Bögels, Susan M
2013-11-01
The psychometric properties of a questionnaire developed to assess symptoms of anxiety disorders (SCARED-71) were compared between two groups of children: children with high-functioning autism spectrum disorder and comorbid anxiety disorders (ASD-group; n = 115), and children with anxiety disorders (AD-group; n = 122). Anxiety disorders were established with a semi-structured interview (ADIS-C/P), using child- as well as parent-report. Internal consistency, construct validity, sensitivity, specificity, and discriminant validity of the SCARED-71 were investigated. Results revealed that the psychometric properties of the SCARED-71 for the ASD-group were quite comparable to those for the AD-group; however, the discriminant validity of the SCARED-71 child-report was lower in the ASD-group. Raising the parental cutoffs of the SCARED-71 resulted in higher specificity rates, which suggests that research should focus more on establishing alternative cutoffs for the ASD-population.
An examination of the MASC Social Anxiety Scale in a non-referred sample of adolescents.
Anderson, Emily R; Jordan, Judith A; Smith, Ashley J; Inderbitzen-Nolan, Heidi M
2009-12-01
Social phobia is prevalent during adolescence and is associated with negative outcomes. Two self-report instruments are empirically validated to specifically assess social phobia symptomatology in youth: the Social Phobia and Anxiety Inventory for Children and the Social Anxiety Scale for Adolescents. The Multidimensional Anxiety Scale for Children is a broad-band measure of anxiety containing a scale assessing the social phobia construct. The present study investigated the MASC Social Anxiety Scale in relation to other well-established measures of social phobia and depression in a non-referred sample of adolescents. Results support the convergent validity of the MASC Social Anxiety Scale and provide some support for its discriminant validity, suggesting its utility in the initial assessment of social phobia. Receiver Operating Characteristics (ROCs) calculated the sensitivity and specificity of the MASC Social Anxiety Scale. Binary logistic regression analyses determined the predictive utility of the MASC Social Anxiety Scale. Implications for assessment are discussed.
McWilliams, L A; Kowal, J; Wilson, K G
2015-10-01
To facilitate efficient screening and reduce the length of comprehensive self-report batteries, a four-item short form of the Pain Catastrophizing Scale (PCS) and a two-item short form of the Pain Self-Efficacy Questionnaire (PSEQ) have been developed and evaluated in samples of patients with arm and upper extremity pain. The first aim of this study was to evaluate these short forms in a heterogeneous sample of patients seeking treatment for chronic musculoskeletal pain, using a priori criteria for determining adequate internal consistency, construct validity and sensitivity to change. In addition, the findings of past studies were used to identify items suitable for new and potentially stronger short forms of these measures. Data were provided by 280 patients who completed the original PCS and PSEQ as part of an interdisciplinary rehabilitation programme. The previously developed four-item PCS and the newly developed six-item short form of the PCS both met the internal consistency and construct validity criteria. They did not meet the criterion regarding sensitivity to change. However, similar to what was obtained using the original PCS, large effect sizes were found when using these short forms to examine pre-treatment to post-treatment changes in catastrophizing. For the PSEQ, the new four-item short form was clearly superior to the other alternatives and met all three criteria. The strongest short forms of the PCS and PSEQ could facilitate the assessment of pain catastrophizing and self-efficacy in situations in which the use of the longer original measures is not feasible. © 2015 European Pain Federation - EFIC®
Sa-Ngamuang, Chaitawat; Haddawy, Peter; Luvira, Viravarn; Piyaphanee, Watcharapong; Iamsirithaworn, Sopon; Lawpoolsri, Saranath
2018-06-18
Differentiating dengue patients from other acute febrile illness patients is a great challenge among physicians. Several dengue diagnosis methods are recommended by WHO. The application of specific laboratory tests is still limited due to high cost, lack of equipment, and uncertain validity. Therefore, clinical diagnosis remains a common practice especially in resource limited settings. Bayesian networks have been shown to be a useful tool for diagnostic decision support. This study aimed to construct Bayesian network models using basic demographic, clinical, and laboratory profiles of acute febrile illness patients to diagnose dengue. Data of 397 acute undifferentiated febrile illness patients who visited the fever clinic of the Bangkok Hospital for Tropical Diseases, Thailand, were used for model construction and validation. The two best final models were selected: one with and one without NS1 rapid test result. The diagnostic accuracy of the models was compared with that of physicians on the same set of patients. The Bayesian network models provided good diagnostic accuracy of dengue infection, with ROC AUC of 0.80 and 0.75 for models with and without NS1 rapid test result, respectively. The models had approximately 80% specificity and 70% sensitivity, similar to the diagnostic accuracy of the hospital's fellows in infectious disease. Including information on NS1 rapid test improved the specificity, but reduced the sensitivity, both in model and physician diagnoses. The Bayesian network model developed in this study could be useful to assist physicians in diagnosing dengue, particularly in regions where experienced physicians and laboratory confirmation tests are limited.
Ramanah, Rajeev; Omar, Sikiyah; Guillien, Alicia; Pugin, Aurore; Martin, Alain; Riethmuller, Didier; Mottet, Nicolas
2018-06-01
Nomograms are statistical models that combine variables to obtain the most accurate and reliable prediction for a particular risk. Fetal heart rate (FHR) interpretation alone has been found to be poorly predictive for fetal acidosis while other clinical risk factors exist. The aim of this study was to create and validate a nomogram based on FHR patterns and relevant clinical parameters to provide a non-invasive individualized prediction of umbilical artery pH during labour. A retrospective observational study was conducted on 4071 patients in labour presenting singleton pregnancies at >34 gestational weeks and delivering vaginally. Clinical characteristics, FHR patterns and umbilical cord gas of 1913 patients were used to construct a nomogram predicting an umbilical artery (Ua) pH <7.18 (10th centile of the study population) after an univariate and multivariate stepwise logistic regression analysis. External validation was obtained from an independent cohort of 2158 patients. Area under the receiver operating characteristics (ROC) curve, sensitivity, specificity, positive and negative predictive values of the nomogram were determined. Upon multivariate analysis, parity (p < 0.01), induction of labour (p = 0.01), a prior uterine scar (p = 0.02), maternal fever (p = 0.02) and the type of FHR (p < 0.01) were significantly associated with an Ua pH <7.18 (p < 0.05). Apgar score at 1, 5 and 10 min were significantly lower in the group with an Ua pH <7.18 (p < 0.01). The nomogram constructed had a Concordance Index of 0.75 (area under the curve) with a sensitivity of 57%, a specificity of 91%, a negative predictive value of 5% and a positive predictive value of 99%. Calibration found no difference between the predicted probabilities and the observed rate of Ua pH <7.18 (p = 0.63). The validation set had a Concordance Index of 0.72 and calibration with a p < 0.77. 
We successfully developed and validated a nomogram to predict Ua pH by combining easily available clinical variables and FHR. Discrimination and calibration of the model were statistically good. This mathematical tool can help clinicians in the management of labour by predicting umbilical artery pH based on FHR tracings. Copyright © 2018 Elsevier B.V. All rights reserved.
Construct Validity of Fresh Frozen Human Cadaver as a Training Model in Minimal Access Surgery
Macafee, David; Pranesh, Nagarajan; Horgan, Alan F.
2012-01-01
Background: The construct validity of fresh human cadaver as a training tool has not been established previously. The aims of this study were to investigate the construct validity of fresh frozen human cadaver as a method of training in minimal access surgery and determine if novices can be rapidly trained using this model to a safe level of performance. Methods: Junior surgical trainees, novices (<3 laparoscopic procedures performed) in laparoscopic surgery, performed 10 repetitions of a set of structured laparoscopic tasks on fresh frozen cadavers. Expert laparoscopists (>100 laparoscopic procedures) performed 3 repetitions of identical tasks. Performances were scored using a validated, objective Global Operative Assessment of Laparoscopic Skills scale. Scores for 3 consecutive repetitions were compared between experts and novices to determine construct validity. Furthermore, to determine if the novices reached a safe level, a trimmed mean of the experts' scores was used to define a benchmark. The Mann-Whitney U test was used for construct validity analysis and a 1-sample t test to compare performances of the novice group with the benchmark safe score. Results: Ten novices and 2 experts were recruited. Four out of 5 tasks (nondominant to dominant hand transfer; simulated appendicectomy; intracorporeal and extracorporeal knot tying) showed construct validity. Novices’ scores became comparable to benchmark scores between the eighth and tenth repetition. Conclusion: Minimal access surgical training using fresh frozen human cadavers appears to have construct validity. The laparoscopic skills of novices can be accelerated through to a safe level within 8 to 10 repetitions. PMID:23318058
Social anxiety and fear of negative evaluation: construct validity of the BFNE-II.
Carleton, R Nicholas; Collimore, Kelsey C; Asmundson, Gordon J G
2007-01-01
The Brief Fear of Negative Evaluation Scale [BFNE; Leary, M. R. (1983). A brief version of the Fear of Negative Evaluation Scale. Personality and Social Psychology Bulletin, 9, 371-375] is a self-report measure designed to assess fear of negative evaluation, a characteristic feature of social anxiety disorders [Rapee, R. M., & Heimberg, R. G. (1997). A cognitive-behavioral model of anxiety in social phobia. Behaviour Research and Therapy, 35, 741-756]. Recent psychometric assessments have suggested that a 2-factor model is most appropriate, with the first factor comprising the straightforwardly worded items and the second factor comprising the reverse-worded items [Carleton, R. N., McCreary, D., Norton, P. J., & Asmundson, G. J. G. (in press-a). The Brief Fear of Negative Evaluation Scale, Revised. Depression & Anxiety; Rodebaugh, T. L., Woods, C. M., Thissen, D. M., Heimberg, R. G., Chambless, D. L., & Rapee, R. M. (2004). More information from fewer questions: the factor structure and item properties of the original and brief fear of negative evaluation scale. Psychological Assessment, 2, 169-181; Weeks, J. W., Heimberg, R. G., Fresco, D. M., Hart, T. A., Turk, C. L., Schneier, F. R., et al. (2005). Empirical validation and psychometric evaluation of the Brief Fear of Negative Evaluation Scale in patients with social anxiety disorder. Psychological Assessment, 17, 179-190]. Some researchers recommend the reverse-worded items be removed from scoring [e.g., Rodebaugh, T. L., Woods, C. M., Thissen, D. M., Heimberg, R. G., Chambless, D. L., & Rapee, R. M. (2004). More information from fewer questions: the factor structure and item properties of the original and brief fear of negative evaluation scale. Psychological Assessment, 2, 169-181; Weeks, J. W., Heimberg, R. G., Fresco, D. M., Hart, T. A., Turk, C. L., Schneier, F. R., et al. (2005). 
Empirical validation and psychometric evaluation of the Brief Fear of Negative Evaluation Scale in patients with social anxiety disorder. Psychological Assessment, 17, 179-190]; however [Carleton, R. N., McCreary, D., Norton, P. J., & Asmundson, G. J. G. (in press-a). The Brief Fear of Negative Evaluation Scale, Revised. Depression & Anxiety; Collins, K. A., Westra, H. A., Dozois, D. J. A., & Stewart, S. H. (2005). The validity of the brief version of the fear of negative evaluation scale. Journal of Anxiety Disorders, 19, 345-359] recommend that these items be reworded to maintain scale sensitivity. The present study examined the reliability and validity of the BFNE-II, a version of the BFNE evaluating revisions of the reverse-worded items in a community sample. A unitary model of the BFNE-II resulted in excellent confirmatory factor analysis fit indices. Moderate convergent and discriminant validity were found when BFNE-II items were correlated with additional independent measures of social anxiety [i.e., Social Interaction Anxiety & Social Phobia Scales; Mattick, R. P., & Clarke, J. C. (1998). Development and validation of measures of social phobia scrutiny fear and social interaction anxiety. Behaviour Research and Therapy, 36, 455-470], and fear [i.e., Anxiety Sensitivity Index; Reiss, S., & McNally, R. J. (1985). The expectancy model of fear. In S. Reiss & R. R. Bootzin (Eds.), Theoretical issues in behaviour therapy (pp. 107-121). New York: Academic Press; and the Illness/Injury Sensitivity Index; Carleton, R. N., Park, I., & Asmundson, G. J. G. (in press-b). The Illness/Injury Sensitivity Index: an examination of construct validity. Depression & Anxiety]. These findings support the utility of the revised items and the validity of the BFNE-II as a measure of the fear of negative evaluation. Implications and future research directions are discussed.
Dougados, Maxime; Jousse-Joulin, Sandrine; Mistretta, Frederic; d'Agostino, Maria-Antonietta; Backhaus, Marina; Bentin, Jacques; Chalès, Gérard; Chary-Valckenaere, Isabelle; Conaghan, Philip; Etchepare, Fabien; Gaudin, Philippe; Grassi, Walter; van der Heijde, Désirée; Sellam, Jérémie; Naredo, Esperanza; Szkudlarek, Marcin; Wakefield, Richard; Saraux, Alain
2010-05-01
To evaluate different global ultrasonographic (US) synovitis scoring systems as potential outcome measures of rheumatoid arthritis (RA) according to the Outcome Measures in Rheumatoid Arthritis Clinical Trials (OMERACT) filter. To study selected global scoring systems, for the clinical, B mode and power Doppler techniques, the following joints were evaluated: 28 joints (28-joint Disease Activity Score (DAS28)), 20 joints (metacarpophalangeals (MCPs) + metatarsophalangeals (MTPs)) and 38 joints (28 joints + MTPs) using either a binary (yes/no) or a 0-3 grade. The study was a prospective, 4-month duration follow-up of 76 patients with RA requiring anti-tumour necrosis factor (TNF) therapy (complete follow-up data: 66 patients). Intraobserver reliability was evaluated using the intraclass correlation coefficient (ICC), construct validity was evaluated using the Cronbach alpha test and external validity was evaluated using level of correlation between scoring system and C reactive protein (CRP). Sensitivity to change was evaluated using the standardised response mean. Discriminating capacity was evaluated using the standardised mean differences in patients considered by the doctor as significantly improved or not at the end of the study. Different clinimetric properties of various US scoring systems were at least as good as the clinical scores with, for example, intraobserver reliability ranging from 0.61 to 0.97 versus from 0.53 to 0.82, construct validity ranging from 0.76 to 0.89 versus from 0.76 to 0.88, correlation with CRP ranging from 0.28 to 0.34 versus from 0.28 to 0.35 and sensitivity to change ranging from 0.60 to 1.21 versus from 0.96 to 1.36 for US versus clinical scoring systems, respectively. This study suggests that US evaluation of synovitis is an outcome measure at least as relevant as physical examination. 
Further studies are required in order to achieve optimal US scoring systems for monitoring patients with RA in clinical trials and in clinical practice.
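The abstract above evaluates sensitivity to change with the standardised response mean. A minimal sketch of that statistic follows, assuming the usual definition (mean of the paired changes divided by the sample standard deviation of those changes). The function name and the scores are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def standardised_response_mean(baseline, follow_up):
    """Mean of the paired changes divided by the sample SD of those changes."""
    if len(baseline) != len(follow_up):
        raise ValueError("paired scores must have equal length")
    # Change is defined here as baseline minus follow-up, so improvement
    # (a falling score) gives a positive value. This sign is an assumption.
    changes = [b - f for b, f in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Hypothetical synovitis scores for five patients (illustrative only).
baseline = [10, 12, 9, 14, 11]
follow_up = [6, 9, 7, 8, 8]
srm = standardised_response_mean(baseline, follow_up)
print(f"SRM = {srm:.2f}")
```

A larger absolute value indicates that the score moved consistently in one direction relative to its variability between patients.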
Validation of the Hospital Ethical Climate Survey for older people care.
Suhonen, Riitta; Stolt, Minna; Katajisto, Jouko; Charalambous, Andreas; Olson, Linda L
2015-08-01
The exploration of the ethical climate in the care settings for older people is highlighted in the literature, and it has been associated with various aspects of clinical practice and nurses' jobs. However, ethical climate is seldom studied in the older people care context. Valid, reliable, feasible measures are needed for the measurement of ethical climate. This study aimed to test the reliability, validity, and sensitivity of the Hospital Ethical Climate Survey in healthcare settings for older people. A non-experimental cross-sectional study design was employed, and a survey using questionnaires, including the Hospital Ethical Climate Survey was used for data collection. Data were analyzed using descriptive statistics, inferential statistics, and multivariable methods. Survey data were collected from a sample of nurses working in the care settings for older people in Finland (N = 1513, n = 874, response rate = 58%) in 2011. This study was conducted according to good scientific inquiry guidelines, and ethical approval was obtained from the university ethics committee. The mean score for the Hospital Ethical Climate Survey total was 3.85 (standard deviation = 0.56). Cronbach's alpha was 0.92. Principal component analysis provided evidence for factorial validity. LISREL provided evidence for construct validity based on goodness-of-fit statistics. Pearson's correlations of 0.68-0.90 were found between the sub-scales and the Hospital Ethical Climate Survey. The Hospital Ethical Climate Survey was found able to reveal discrimination across care settings and proved to be a valid and reliable tool for measuring ethical climate in care settings for older people and sensitive enough to reveal variations across various clinical settings. The Finnish version of the Hospital Ethical Climate Survey, used mainly in the hospital settings previously, proved to be a valid instrument to be used in the care settings for older people. 
Further studies are due to analyze the factor structure and some items of the Hospital Ethical Climate Survey. © The Author(s) 2014.
Development and validation of the French-Canadian Chronic Pain Self-efficacy Scale
Lacasse, Anaïs; Bourgault, Patricia; Tousignant-Laflamme, Yannick; Courtemanche-Harel, Roxanne; Choinière, Manon
2015-01-01
BACKGROUND: Perceived self-efficacy is a non-negligible outcome when measuring the impact of self-management interventions for chronic pain patients. However, no validated, chronic pain-specific self-efficacy scales exist for studies conducted with French-speaking populations. OBJECTIVES: To establish the validity of the use of the French-Canadian Chronic Pain Self-efficacy Scale (FC-CPSES) among chronic pain patients. METHODS: The Chronic Disease Self-Efficacy Scale is a validated 33-item self-administered questionnaire that measures perceived self-efficacy to perform self-management behaviours, manage chronic disease in general and achieve outcomes (a six-item version is also available). This scale was adapted to the context of chronic pain patients following cross-cultural adaptation guidelines. The FC-CPSES was administered to 109 fibromyalgia and 34 chronic low back pain patients (n=143) who participated in an evidence-based self-management intervention (the PASSAGE program) offered in 10 health care centres across the province of Quebec. Cronbach’s alpha coefficients (α) were calculated to determine the internal consistency of the 33- and six-item versions of the FC-CPSES. With regard to convergent construct validity, the association between the FC-CPSES baseline scores and related clinical outcomes was examined. With regard to the scale’s sensitivity to change, pre- and postintervention FC-CPSES scores were compared. RESULTS: Internal consistency was high for both versions of the FC-CPSES (α=0.86 to α=0.96). Higher self-efficacy was significantly associated with higher mental health-related quality of life and lower pain intensity and catastrophizing (P<0.05), supporting convergent validity of the scale. There was a statistically significant increase in FC-CPSES scores between pre- and postintervention measures for both versions of the FC-CPSES (P<0.003), which supports their sensitivity to clinical change during an intervention. 
CONCLUSIONS: These data suggest that both versions of the FC-CPSES are reliable and valid for the measurement of pain management self-efficacy among chronic pain patients. PMID:25848845
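Internal consistency in the abstract above is reported as Cronbach's alpha. The sketch below shows how that coefficient is commonly computed from item-level responses, assuming the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The response matrix is hypothetical and unrelated to the FC-CPSES data.

```python
from statistics import variance


def cronbach_alpha(rows):
    """rows: one list of k item scores per respondent."""
    k = len(rows[0])
    if k < 2 or len(rows) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([r[i] for r in rows]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical answers from five respondents to a three-item scale.
responses = [
    [1, 2, 1],
    [2, 3, 2],
    [3, 3, 3],
    [4, 5, 4],
    [5, 5, 5],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.2f}")
```

Items that rise and fall together push the variance of the total well above the sum of the item variances, which drives alpha toward 1.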
Lo Martire, Riccardo; de Alwis, Manudul Pahansen; Äng, Björn Olov; Garme, Karl
2017-07-20
High-performance marine craft personnel (HPMCP) are regularly exposed to vibration and repeated shock (VRS) levels exceeding maximum limitations stated by international legislation. Whereas such exposure reportedly is detrimental to health and performance, the epidemiological data necessary to link these adverse effects causally to VRS are not available in the scientific literature, and no suitable tools for acquiring such data exist. This study therefore constructed a questionnaire for longitudinal investigations in HPMCP. A consensus panel defined content domains, identified relevant items and outlined a questionnaire. The relevance and simplicity of the questionnaire's content were then systematically assessed by expert raters in three consecutive stages, each followed by revisions. An item-level content validity index (I-CVI) was computed as the proportion of experts rating an item as relevant and simple, and a scale-level content validity index (S-CVI/Ave) as the average I-CVI across items. The thresholds for acceptable content validity were 0.78 and 0.90, respectively. Finally, a dynamic web version of the questionnaire was constructed and pilot tested over a 1-month period during a marine exercise in a study population sample of eight subjects, while accelerometers simultaneously quantified VRS exposure. Content domains were defined as work exposure, musculoskeletal pain and human performance, and items were selected to reflect these constructs. Ratings from nine experts yielded S-CVI/Ave of 0.97 and 1.00 for relevance and simplicity, respectively, and the pilot test suggested that responses were sensitive to change in acceleration and that the questionnaire, following some adjustments, was feasible for its intended purpose. A dynamic web-based questionnaire for longitudinal survey of key variables in HPMCP was constructed. 
Expert ratings supported that the questionnaire content is relevant, simple and sufficiently comprehensive, and the pilot test suggested that the questionnaire is feasible for longitudinal measurements in the study population. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
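The abstract above defines the item-level content validity index (I-CVI) as the proportion of experts endorsing an item and the scale-level index (S-CVI/Ave) as the average I-CVI across items, with acceptance thresholds of 0.78 and 0.90. A minimal sketch of that arithmetic follows. The function names and the ratings are hypothetical and do not reproduce the study's data.

```python
I_CVI_THRESHOLD = 0.78      # item-level threshold quoted in the abstract
S_CVI_AVE_THRESHOLD = 0.90  # scale-level threshold quoted in the abstract


def item_cvi(ratings):
    """ratings: one boolean per expert, True if the expert endorsed the item."""
    return sum(ratings) / len(ratings)


def scale_cvi_ave(rating_matrix):
    """rating_matrix: one list of expert ratings per item."""
    i_cvis = [item_cvi(item) for item in rating_matrix]
    return sum(i_cvis) / len(i_cvis)


# Hypothetical ratings from nine experts on three items.
ratings = [
    [True] * 9,                # endorsed by all nine experts
    [True] * 8 + [False],      # endorsed by eight of nine
    [True] * 7 + [False] * 2,  # endorsed by seven of nine
]
i_cvis = [item_cvi(item) for item in ratings]
s_cvi = scale_cvi_ave(ratings)
flagged = [i for i, v in enumerate(i_cvis) if v < I_CVI_THRESHOLD]
print(i_cvis, s_cvi, flagged)
```

In this illustration the third item falls just under 0.78 and would be a candidate for revision, while the scale-level average also falls short of 0.90.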
Algamdi, Maaidah M; Hanneman, Sandra K
2018-02-14
Valid and reliable instruments in Arabic are needed to measure self-efficacy and quality of life for Arabic patients with cancer. The aim of this study was to test the psychometric performance of the Cancer Behavior Inventory-Brief Arabic (CBI-BA), including participant understanding of items, and the Functional Assessment of Cancer Therapy-Breast Arabic (FACT-BA). Using a cross-sectional design, 438 cancer patients completed the CBI-BA, 30 of whom completed cognitive interviews. A subsample of 167 women with breast cancer also completed the FACT-BA. Internal consistency evidence was assessed with Cronbach's α and construct validity with principal axis factoring. Internal consistency estimates were acceptable for the total CBI-BA (α = .81) and FACT-BA (α = .88) scales. Exploratory factor analyses showed evidence of construct validity for the CBI-BA; 1 factor was derived, compared with 4 in the original English version. Cognitive interviews indicated satisfactory patient understanding of CBI-BA items. The Arabic version of the FACT-General scale had 4 factors, as expected. The CBI-BA has adequate psychometric performance for the measurement of self-efficacy for coping with cancer in Arabic patients. The FACT-General Arabic has adequate evidence of reliability and validity for the measurement of quality of life in Arabic women with breast cancer. The availability of culturally sensitive and psychometrically sound instruments for Arabic patients diagnosed with cancer should be valuable for healthcare clinicians and researchers to assess self-efficacy for coping with cancer and quality of life.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lin, Zhenhong; Dong, Jing; Liu, Changzheng
2012-01-01
The petroleum and electricity consumptions of plug-in hybrid electric vehicles (PHEVs) are sensitive to the variation of daily vehicle miles traveled (DVMT). Some studies assume DVMT to follow a Gamma distribution, but such a Gamma assumption is yet to be validated. This study finds the Gamma assumption valid in the context of PHEV energy analysis, based on continuous GPS travel data of 382 vehicles, each tracked for at least 183 days. The validity conclusion is based on the found small prediction errors, resulting from the Gamma assumption, in PHEV petroleum use, electricity use, and energy cost. The finding that the Gamma distribution is valid and reliable is important. It paves the way for the Gamma distribution to be assumed for analyzing energy uses of PHEVs in the real world. The Gamma distribution can be easily specified with very few pieces of driver information and is relatively easy for mathematical manipulation. Given the validation in this study, the Gamma distribution can now be used with better confidence in a variety of applications, such as improving vehicle consumer choice models, quantifying range anxiety for battery electric vehicles, investigating roles of charging infrastructure, and constructing online calculators that provide personal estimates of PHEV energy use.
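The abstract above notes that a Gamma distribution can be specified from very few pieces of driver information. One simple way to illustrate this is a method-of-moments fit, which needs only the mean and variance of daily mileage. This is a sketch under that assumption and is not necessarily the fitting procedure used in the study. The function name and the mileage values are hypothetical.

```python
from statistics import mean, variance


def fit_gamma_by_moments(samples):
    """Return (shape, scale) matched to the sample mean and variance.

    Method of moments, not maximum likelihood: for a Gamma distribution
    mean = shape * scale and variance = shape * scale**2.
    """
    m = mean(samples)
    v = variance(samples)
    if m <= 0 or v <= 0:
        raise ValueError("samples must have positive mean and variance")
    return m * m / v, v / m


# Hypothetical daily vehicle miles traveled for one driver.
daily_miles = [10, 20, 30, 40, 50]
shape, scale = fit_gamma_by_moments(daily_miles)
print(f"shape = {shape:.2f}, scale = {scale:.2f} miles")
```

The product of the fitted shape and scale reproduces the sample mean, which is a quick sanity check on the fit.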
Nishi, Daisuke; Uehara, Ritei; Yoshikawa, Eisho; Sato, Goro; Ito, Masaya; Matsuoka, Yutaka
2013-04-01
Although scales specific to resilience are available and widely used, qualities of resilience could be culturally sensitive. This study aimed to develop a concise scale of resilience for Japanese populations, and compare its validity to that of the Resilience Scale 14-item version (RS-14), one of the most widely used scales for measuring resilience. The Tachikawa Resilience Scale (TRS) was developed on the basis of data obtained from unstructured interviews with Japanese motor vehicle accident survivors without psychiatric disorder. The reliability and validity of the TRS and RS-14 were then examined in cross-sectional studies performed with 523 company workers and 140 psychiatric outpatients. The TRS and RS-14 were negatively correlated with depressive symptoms in company workers and psychiatric outpatients and with anxiety in psychiatric outpatients, and were positively correlated with social support in company workers. Internal consistency and test-retest reliability of the TRS were high. Construct validity of the TRS was equivalent to that of the RS-14 in company workers, and higher than that of the RS-14 in psychiatric outpatients. The reliability and validity of the TRS and RS-14 in Japanese company workers and patients with psychiatric disorders were acceptable. The validity of the TRS was equivalent to or better than that of the RS-14. Although the TRS cannot be regarded as an established scale due to a lack of theoretical rationale, the results of this study suggest that scales measuring resilience that cover cultural aspects might be more relevant in given populations. © 2013 The Authors. Psychiatry and Clinical Neurosciences © 2013 Japanese Society of Psychiatry and Neurology.
Tsai, Alexander C.
2014-01-01
OBJECTIVES To systematically review the reliability and validity of instruments used to screen for major depressive disorder or assess depression symptom severity among persons with HIV in sub-Saharan Africa. DESIGN Systematic review and meta-analysis. METHODS A systematic evidence search protocol was applied to seven bibliographic databases. Studies examining the reliability and/or validity of depression assessment tools were selected for inclusion if they were based on data collected from HIV-positive adults in any African member state of the United Nations. Random-effects meta-analysis was employed to calculate pooled estimates of depression prevalence. In a subgroup of studies of criterion-related validity, the bivariate random-effects model was used to calculate pooled estimates of sensitivity and specificity. RESULTS Of 1,117 records initially identified, I included 13 studies of 5,373 persons with HIV in 7 sub-Saharan African countries. Reported estimates of Cronbach’s alpha ranged from 0.63–0.95, and analyses of internal structure generally confirmed the existence of a depression-like construct accounting for a substantial portion of variance. The pooled prevalence of probable depression was 29.5% (95% CI, 20.5–39.4), while the pooled prevalence of major depressive disorder was 13.9% (95% CI, 9.7–18.6). The Center for Epidemiologic Studies-Depression scale was the most frequently studied instrument, with a pooled sensitivity of 0.82 (95% CI, 0.73–0.87) for detecting major depressive disorder. CONCLUSIONS Depression screening instruments yielded relatively high false positive rates. Overall, few studies described the reliability and/or validity of depression instruments in sub-Saharan Africa. PMID:24853307
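As a rough illustration of random-effects pooling, the sketch below implements the DerSimonian-Laird estimator on study-level estimates and their variances. This is an assumption made for illustration: the abstract does not name the estimator, and pooled prevalences are often computed on a transformed scale, which this sketch does not do. The function name and the input values are hypothetical and are not taken from the review.

```python
def dersimonian_laird(estimates, variances):
    """Random-effects pooled estimate using the DerSimonian-Laird method.

    Returns (pooled estimate, standard error, between-study variance tau^2).
    """
    k = len(estimates)
    w = [1.0 / v for v in variances]  # inverse-variance (fixed-effect) weights
    fixed = sum(wi * yi for wi, yi in zip(w, estimates)) / sum(w)
    # Cochran's Q measures heterogeneity around the fixed-effect estimate.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, estimates))
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (k - 1)) / c)  # truncated at zero
    w_star = [1.0 / (v + tau2) for v in variances]  # random-effects weights
    pooled = sum(wi * yi for wi, yi in zip(w_star, estimates)) / sum(w_star)
    se = (1.0 / sum(w_star)) ** 0.5
    return pooled, se, tau2


if __name__ == "__main__":
    # Hypothetical study-level prevalences and variances, for illustration only.
    pooled, se, tau2 = dersimonian_laird([0.2, 0.4], [0.01, 0.01])
    print(pooled, se, tau2)
```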
Elsharkawy, Aisha; Alboraie, Mohamed; Fouad, Rabab; Asem, Noha; Abdo, Mahmoud; Elmakhzangy, Hesham; Mehrez, Mai; Khattab, Hany; Esmat, Gamal
2017-12-01
Transient elastography is widely used to assess fibrosis stage in chronic hepatitis C (CHC). We aimed to establish and validate different transient elastography cut-off values for significant fibrosis and cirrhosis in CHC genotype 4 patients. The data of 100 treatment-naive CHC patients (training set) and 652 patients (validation set) were analysed. The patients were subjected to routine pretreatment laboratory investigations, liver biopsy and histopathological staging of hepatic fibrosis according to the METAVIR scoring system. Transient elastography was performed before and in the same week as liver biopsy using FibroScan (Echosens, Paris, France). Transient elastography results were correlated to different stages of hepatic fibrosis in both the training and validation sets. ROC curves were constructed. In the training set, the best transient elastography cut-off values for significant hepatic fibrosis (≥F2 METAVIR), advanced hepatic fibrosis (≥F3 METAVIR) and cirrhosis (F4 METAVIR) were 7.1, 9 and 12.2 kPa, with sensitivities of 87%, 87.5% and 90.9% and specificities of 100%, 99.9% and 99.9%, respectively. The application of these cut-offs in the validation set showed sensitivities of 85.5%, 82.8% and 92% and specificities of 86%, 89.4% and 99.01% for significant hepatic fibrosis, advanced hepatic fibrosis and cirrhosis, respectively. Transient elastography performs well for significant hepatic fibrosis, advanced hepatic fibrosis and cirrhosis, with validated cut-offs of 7.1, 9 and 12.2 kPa, respectively, in genotype 4 CHC patients. Copyright © 2017 Pan-Arab Association of Gastroenterology. Published by Elsevier B.V. All rights reserved.
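The way a cut-off translates into sensitivity and specificity can be sketched as follows. The selection rule shown (maximising Youden's J over observed values) is an assumption; the abstract only states that ROC curves were constructed. The function names are hypothetical, and the stiffness values in the usage example are made-up toy data, not study data.

```python
def sensitivity_specificity(values, has_condition, cutoff):
    """Sensitivity and specificity when 'value >= cutoff' counts as test-positive."""
    tp = fn = tn = fp = 0
    for value, positive in zip(values, has_condition):
        test_positive = value >= cutoff
        if positive and test_positive:
            tp += 1
        elif positive:
            fn += 1
        elif test_positive:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


def best_cutoff_youden(values, has_condition):
    """Observed value that maximises Youden's J = sensitivity + specificity - 1."""
    def youden(cutoff):
        sens, spec = sensitivity_specificity(values, has_condition, cutoff)
        return sens + spec - 1

    return max(sorted(set(values)), key=youden)


if __name__ == "__main__":
    # Toy liver stiffness readings (kPa) and biopsy-confirmed status; hypothetical.
    kpa = [4.0, 5.5, 6.8, 7.5, 8.0, 6.0, 9.5, 13.0]
    fibrosis = [False, False, False, True, True, True, True, True]
    print(sensitivity_specificity(kpa, fibrosis, cutoff=7.1))
    print(best_cutoff_youden(kpa, fibrosis))
```

Applying a cut-off chosen on one data set to a second, independent set, as the abstract describes for its training and validation sets, only requires calling `sensitivity_specificity` again with the fixed cut-off.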
DOE Office of Scientific and Technical Information (OSTI.GOV)
Woicik, P.A.; Stewart, S.H.; Pihl, R.O.
The Substance Use Risk Profile Scale (SURPS) is based on a model of personality risk for substance abuse in which four personality dimensions (hopelessness, anxiety sensitivity, impulsivity, and sensation seeking) are hypothesized to differentially relate to specific patterns of substance use. The current series of studies is a preliminary exploration of the psychometric properties of the SURPS in two populations (undergraduate and high school students). In study 1, an analysis of the internal structure of two versions of the SURPS shows that the abbreviated version best reflects the 4-factor structure. Concurrent, discriminant, and incremental validity of the SURPS is supported by convergent/divergent relationships between the SURPS subscales and other theoretically relevant personality and drug use criterion measures. In Study 2, the factorial structure of the SURPS is confirmed and evidence is provided for its test-retest reliability and validity with respect to measuring personality vulnerability to reinforcement-specific substance use patterns. In Study 3, the SURPS was administered in a more youthful population to test its sensitivity in identifying younger problematic drinkers. The results from the current series of studies demonstrate support for the reliability and construct validity of the SURPS, and suggest that four personality dimensions may be linked to substance-related behavior through different reinforcement processes. This brief assessment tool may have important implications for clinicians and future research.
NASA Astrophysics Data System (ADS)
Douglass, D. H.; Kalnay, E.; Li, H.; Cai, M.
2005-05-01
Carbon monoxide (CO) is present in the troposphere as a product of fossil fuel combustion, biomass burning and the oxidation of volatile hydrocarbons. It is the principal sink of the hydroxyl radical (OH), thereby affecting the concentrations of greenhouse gases such as CH4 and O3. In addition, CO has a lifetime of 1-3 months, making it a good tracer for studying the long range transport of pollution. Satellite observations present a valuable tool in the investigation of tropospheric CO. The Atmospheric InfraRed Sounder (AIRS), onboard the Aqua satellite, is sensitive to tropospheric CO in a number of its 2378 channels. This sensitivity to CO, combined with the daily global coverage provided by AIRS, makes AIRS a potentially useful instrument for observing CO sources and transport. A maximum a posteriori (MAP) retrieval scheme (Rodgers 2000) has been developed for AIRS, to provide CO profiles from near-surface altitudes to around 150 hPa. An extensive validation data set, consisting of over 50 in-situ aircraft CO profiles, has been constructed. This data set combines CO data from a number of independent aircraft campaigns. Results from this validation study and comparisons with the AIRS level 2 CO product will be presented. Rodgers, C. D. (2000), Inverse Methods for Atmospheric Sounding : Theory and Practice, World Scientific, Singapore.
Woicik, Patricia A; Stewart, Sherry H; Pihl, Robert O; Conrod, Patricia J
2009-12-01
The Substance Use Risk Profile Scale (SURPS) is based on a model of personality risk for substance abuse in which four personality dimensions (hopelessness, anxiety sensitivity, impulsivity, and sensation seeking) are hypothesized to differentially relate to specific patterns of substance use. The current series of studies is a preliminary exploration of the psychometric properties of the SURPS in two populations (undergraduate and high school students). In study 1, an analysis of the internal structure of two versions of the SURPS shows that the abbreviated version best reflects the 4-factor structure. Concurrent, discriminant, and incremental validity of the SURPS is supported by convergent/divergent relationships between the SURPS subscales and other theoretically relevant personality and drug use criterion measures. In Study 2, the factorial structure of the SURPS is confirmed and evidence is provided for its test-retest reliability and validity with respect to measuring personality vulnerability to reinforcement-specific substance use patterns. In Study 3, the SURPS was administered in a more youthful population to test its sensitivity in identifying younger problematic drinkers. The results from the current series of studies demonstrate support for the reliability and construct validity of the SURPS, and suggest that four personality dimensions may be linked to substance-related behavior through different reinforcement processes. This brief assessment tool may have important implications for clinicians and future research.
Burger, Elise; Selles, Ruud; van Nieuwkasteele, Shelly; Bessems, Gert; Pollet, Virginie; Hovius, Steven; van Nieuwenhoven, Christianne
2017-11-04
The purpose of this study is to develop a Dutch version of the Oxford Ankle and Foot Questionnaire for Children (OxAFQ-c) to allow evaluation of pediatric foot care. The OxAFQ-c was translated into Dutch, according to the ISPOR-guidelines. Children with different foot and ankle complaints completed the OxAFQ-c at baseline, after two weeks, and after 4-6 months. Measurement properties were assessed in terms of reliability, responsiveness, and construct validity. Test-retest reliability showed moderate intraclass correlation coefficients. Bland-Altman plots showed wide limits of agreement. After 4-6 months, the group that experienced improvement also showed improved questionnaire outcomes, indicating responsiveness. Moderate correlation between the OxAFQ-c and the Kidscreen and foot-specific VAS-scores were observed, indicating moderate construct validity. The Dutch OxAFQ-c showed moderate to good measurement properties. However, because we observed limited sensitivity to changes and wide limits of agreement in individual patients, we think the questionnaire should only be used in groups. Copyright © 2017 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.
Liang, M H
2000-09-01
Although widely used and reported in research for the evaluation of groups, measures of health status and health-related quality of life have had little application in clinical practice for the assessment of individual patients. One of the principal barriers is the demonstration that these measures add clinically significant information to measures of function or symptoms alone. Here, we review the methods for evaluation of construct validity in longitudinal studies and make recommendations for nomenclature, reporting of study results, and future research agenda. Analytical review. The terms "sensitivity" and "responsiveness" have been used interchangeably, and there are few studies that evaluate the extent to which health status or health-related quality-of life measures capture clinically important changes ("responsiveness"). Current methods of evaluating responsiveness are not standardized or evaluated. Approaches for the assessment of a clinically significant or meaningful change are described; rather than normative information, however, standardized transition questions are proposed. They would be reported routinely and as separate axes of description to capture individual perceptions. Research in methods to assess the subject's evaluation of the importance and magnitude of a measured change are critical if health status and health-related quality-of-life measures are to have an impact on patient care.
Measuring value sensitivity in medicine.
Ineichen, Christian; Christen, Markus; Tanner, Carmen
2017-01-28
Value sensitivity - the ability to recognize value-related issues when they arise in practice - is an indispensable competence for medical practitioners to enter decision-making processes related to ethical questions. However, the psychological competence of value sensitivity is seldom an explicit subject in the training of medical professionals. In this contribution, we outline the traditional concept of moral sensitivity in medicine and its revised form conceptualized as value sensitivity and we propose an instrument that measures value sensitivity. We developed an instrument for assessing the sensitivity for three value groups (moral-related values, values related to the principles of biomedical ethics, strategy-related values) in a four step procedure: 1) value identification (n = 317); 2) value representation (n = 317); 3) vignette construction and quality evaluation (n = 37); and 4) instrument validation by comparing nursing professionals with hospital managers (n = 48). We find that nursing professionals recognize and ascribe importance to principle-related issues more than professionals from hospital management. The latter are more likely to recognize and ascribe importance to strategy-related issues. These hypothesis-driven results demonstrate the discriminatory power of our newly developed instrument, which makes it useful not only for health care professionals in practice but for students and people working in the clinical context as well.
Subgroups of physically abusive parents based on cluster analysis of parenting behavior and affect.
Haskett, Mary E; Smith Scott, Susan; Sabourin Ward, Caryn
2004-10-01
Cluster analysis of observed parenting and self-reported discipline was used to categorize 83 abusive parents into subgroups. A 2-cluster solution received support for validity. Cluster 1 parents were relatively warm, positive, sensitive, and engaged during interactions with their children, whereas Cluster 2 parents were relatively negative, disengaged or intrusive, and insensitive. Further, clusters differed in emotional health, parenting stress, perceptions of children, and problem solving. Children of parents in the 2 clusters differed on several indexes of social adjustment. Cluster 1 parents were similar to nonabusive parents (n = 66) on parenting and related constructs, but Cluster 2 parents differed from nonabusive parents on all clustering variables and many validation variables. Results highlight clinically relevant diversity in parenting practices and functioning among abusive parents. ((c) 2004 APA, all rights reserved).
Development of a Self-Report Measure of Reward Sensitivity: A Test in Current and Former Smokers.
Hughes, John R; Callas, Peter W; Priest, Jeff S; Etter, Jean-Francois; Budney, Alan J; Sigmon, Stacey C
2017-06-01
Tobacco use or abstinence may increase or decrease reward sensitivity. Most existing measures of reward sensitivity were developed decades ago, and few have undergone extensive psychometric testing. We developed a 58-item survey of the anticipated enjoyment from, wanting for, and frequency of common rewards (the Rewarding Events Inventory-REI). The current analysis focuses on ratings of anticipated enjoyment. The first validation study recruited current and former smokers from Internet sites. The second study recruited smokers who wished to quit and monetarily reinforced them to stay abstinent in a laboratory study and a comparison group of former smokers. In both studies, participants completed the inventory on two occasions, 3-7 days apart. They also completed four anhedonia scales and a behavioral test of reduced reward sensitivity. Half of the enjoyment ratings loaded on four factors: socializing, active hobbies, passive hobbies, and sex/drug use. Cronbach's alpha coefficients were all ≥0.73 for overall mean and factor scores. Test-retest correlations were all ≥0.83. Correlations of the overall and factor scores with frequency of rewards and anhedonia scales were 0.19-0.53, except for the sex/drugs factor. The scores did not correlate with behavioral tests of reward and did not differ between current and former smokers. Lower overall mean enjoyment score predicted a shorter time to relapse. Internal reliability and test-retest reliability of the enjoyment outcomes of the REI are excellent, and construct and predictive validity are modest but promising. The REI is comprehensive and up-to-date, yet is short enough to use on repeated occasions. Replication tests, especially predictive validity tests, are needed. Both use of and abstinence from nicotine appear to increase or decrease how rewarding nondrug rewards are; however, self-report scales to test this have limitations. 
Our inventory of enjoyment from 58 rewards appears to be reliable and valid as well as comprehensive and up-to-date, yet is short enough to use on repeated occasions. Replication tests, especially of the predictive validity of our scale, are needed. © The Author 2017. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
ERIC Educational Resources Information Center
Spurgeon, Shawn L.
2017-01-01
Construct irrelevance (CI) and construct underrepresentation (CU) are 2 major threats to validity, yet they are rarely discussed within the counseling literature. This article provides information about the relevance of these threats to internal validity. An illustrative case example will be provided to assist counselors in understanding these…
Attentional Bias for Reward and Punishment in Overweight and Obesity: The TRAILS Study.
Jonker, Nienke C; Glashouwer, Klaske A; Ostafin, Brian D; van Hemel-Ruiter, Madelon E; Smink, Frédérique R E; Hoek, Hans W; de Jong, Peter J
2016-01-01
More than 80% of obese adolescents will become obese adults, and it is therefore important to enhance insight into characteristics that underlie the development and maintenance of overweight and obesity at a young age. The current study is the first to focus on attentional biases towards rewarding and punishing cues as potentially important factors. Participants were young adolescents (N = 607) who were followed from the age of 13 until the age of 19, and completed a motivational game indexing the attentional bias to general cues of reward and punishment. Additionally, self-reported reward and punishment sensitivity was measured. This study showed that attentional biases to cues that signal reward or punishment and self-reported reward and punishment sensitivity were not related to body mass index or the change in body mass index over six years in adolescents. Thus, attentional bias to cues of reward and cues of punishment, and self-reported reward and punishment sensitivity, do not seem to be crucial factors in the development and maintenance of overweight and obesity in adolescents. Exploratory analyses of the current study suggest that the amount of effort to gain reward and to avoid punishment may play a role in the development and maintenance of overweight and obesity. However, since the effort measure was a construct based on face validity and has not been properly validated, more studies are necessary before firm conclusions can be drawn.
Validity of Self-Report Screening Scale for Elder Abuse: Women's Health Australia Study.
ERIC Educational Resources Information Center
Schofield, Margot J.; Mishra, Gita D.
2003-01-01
Examines the reliability and validity of the Vulnerability to Abuse Screening Scale (VASS) for the early identification of elder abuse. Results confirmed the VASS factor structure and construct validity. The Vulnerability and Coercion factors held the strongest face and construct validity for physical and psychological abuse. (Contains 52…
Rater Cognition: Implications for Validity
ERIC Educational Resources Information Center
Bejar, Issac I.
2012-01-01
The scoring process is critical in the validation of tests that rely on constructed responses. Documenting that readers carry out the scoring in ways consistent with the construct and measurement goals is an important aspect of score validity. In this article, rater cognition is approached as a source of support for a validity argument for scores…
Urbanowicz, Richard A; McClure, C Patrick; King, Barnabas; Mason, Christopher P; Ball, Jonathan K; Tarr, Alexander W
2016-09-01
Retrovirus pseudotypes are a highly tractable model used to study the entry pathways of enveloped viruses. This model has been extensively applied to the study of the hepatitis C virus (HCV) entry pathway, preclinical screening of antiviral antibodies and for assessing the phenotype of patient-derived viruses using HCV pseudoparticles (HCVpp) possessing the HCV E1 and E2 glycoproteins. However, not all patient-isolated clones produce particles that are infectious in this model. This study investigated factors that might limit phenotyping of patient-isolated HCV glycoproteins. Genetically related HCV glycoproteins from quasispecies in individual patients were discovered to behave very differently in this entry model. Empirical optimization of the ratio of packaging construct and glycoprotein-encoding plasmid was required for successful HCVpp genesis for different clones. The selection of retroviral packaging construct also influenced the function of HCV pseudoparticles. Some glycoprotein constructs tolerated a wide range of assay parameters, while others were much more sensitive to alterations. Furthermore, glycoproteins previously characterized as unable to mediate entry were found to be functional. These findings were validated using chimeric cell-cultured HCV bearing these glycoproteins. Using the same empirical approach we demonstrated that generation of infectious ebolavirus pseudoviruses (EBOVpv) was also sensitive to the amount and ratio of plasmids used, and that protocols for optimal production of these pseudoviruses are dependent on the exact virus glycoprotein construct. These findings demonstrate that it is crucial for studies utilizing pseudoviruses to conduct empirical optimization of pseudotype production for each specific glycoprotein sequence to achieve optimal titres and facilitate accurate phenotyping.
Johansen, Kristoffer; Song, Jae Hee; Prentice, Paul
2018-05-01
We describe the design, construction and characterisation of a broadband passive cavitation detector, with the specific aim of detecting low frequency components of periodic shock waves, with high sensitivity. A finite element model is used to guide selection of matching and backing layers for the shock wave passive cavitation detector (swPCD), and the performance is evaluated against a commercially available device. Validation of the model, and characterisation of the swPCD is achieved through experimental detection of laser-plasma bubble collapse shock waves. The final swPCD design is 20 dB more sensitive to the subharmonic component, from acoustic cavitation driven at 220 kHz, than the comparable commercial device. This work may be significant for monitoring cavitation in medical applications, where sensitive detection is critical, and higher frequencies are more readily absorbed by tissue. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Finding knowledge translation articles in CINAHL.
Lokker, Cynthia; McKibbon, K Ann; Wilczynski, Nancy L; Haynes, R Brian; Ciliska, Donna; Dobbins, Maureen; Davis, David A; Straus, Sharon E
2010-01-01
The process of moving research into practice has a number of names including knowledge translation (KT). Researchers and decision makers need to be able to readily access the literature on KT for the field to grow and to evaluate the existing evidence. To develop and validate search filters for finding KT articles in the database Cumulative Index to Nursing and Allied Health (CINAHL). A gold standard database was constructed by hand searching and classifying articles from 12 journals as KT Content, KT Applications and KT Theory. Sensitivity, specificity, precision, and accuracy of the search filters. Optimized search filters had fairly low sensitivity and specificity for KT Content (58.4% and 64.9% respectively), while sensitivity and specificity increased for retrieving KT Application (67.5% and 70.2%) and KT Theory articles (70.4% and 77.8%). Search filter performance was suboptimal marking the broad base of disciplines and vocabularies used by KT researchers. Such diversity makes retrieval of KT studies in CINAHL difficult.
Singh, Amika S; Vik, Froydis N; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Verloigne, Maïté; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; Martens, Marloes; Brug, Johannes
2011-12-09
Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
2011-01-01
Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
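The two test-retest statistics named above can be sketched for a single questionnaire item answered twice. The ICC form used here is the one-way random-effects, single-measure ICC, which is an assumption; the abstract does not state which ICC model was applied. The function names are hypothetical, and the item responses in the usage example are toy values.

```python
def percentage_agreement(test, retest):
    """Percentage of respondents giving the identical answer on both occasions."""
    matches = sum(a == b for a, b in zip(test, retest))
    return 100.0 * matches / len(test)


def icc_oneway(test, retest):
    """One-way random-effects, single-measure ICC for two administrations."""
    n, k = len(test), 2
    subject_means = [(a + b) / 2 for a, b in zip(test, retest)]
    grand_mean = sum(subject_means) / n
    # Between-subject and within-subject mean squares from a one-way ANOVA.
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    ms_within = sum(
        (a - m) ** 2 + (b - m) ** 2
        for a, b, m in zip(test, retest, subject_means)
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


if __name__ == "__main__":
    # Toy item scores from four children, one week apart; hypothetical.
    week1 = [1, 2, 3, 4]
    week2 = [1, 2, 3, 3]
    print(percentage_agreement(week1, week2))
    print(icc_oneway(week1, week2))
```

An item could then be classified against the thresholds the abstract reports (ICC > .60 or percentage agreement ≥ 75%).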
de Vente, Wieke; Majdandžić, Mirjana; Voncken, Marisol J; Beidel, Deborah C; Bögels, Susan M
2014-03-01
We developed a new version of the Social Phobia and Anxiety Inventory (SPAI) in order to have a brief instrument for measuring social anxiety and social anxiety disorder (SAD) with a strong conceptual foundation. In the construction phase, a set of items representing 5 core aspects of social anxiety was selected by a panel of social anxiety experts. The selected item pool was validated using factor analysis, reliability analysis, and diagnostic analysis in a sample of healthy participants (N = 188) and a sample of clinically referred participants diagnosed with SAD (N = 98). This procedure resulted in an abbreviated version of the Social Phobia Subscale of the SPAI consisting of 18 items (i.e. the SPAI-18), which correlated strongly with the Social Phobia Subscale of the original SPAI (both groups r = .98). Internal consistency and diagnostic characteristics using a clinical cut-off score > 48 were good to excellent (Cronbach's alpha healthy group = .93; patient group = .91; sensitivity: .94; specificity: .88). The SPAI-18 was further validated in a community sample of parents-to-be without SAD (N = 237) and with SAD (N = 65). Internal consistency was again excellent (both groups Cronbach's alpha = .93) and a screening cut-off of > 36 proved to result in good sensitivity and specificity. The SPAI-18 also correlated strongly with other social anxiety instruments, supporting convergent validity. In sum, the SPAI-18 is a psychometrically sound instrument with good screening capacity for social anxiety disorder in clinical as well as community samples. Copyright © 2013 Elsevier Ltd. All rights reserved.
The bogus taste test: Validity as a measure of laboratory food intake.
Robinson, Eric; Haynes, Ashleigh; Hardman, Charlotte A; Kemps, Eva; Higgs, Suzanne; Jones, Andrew
2017-09-01
Because overconsumption of food contributes to ill health, understanding what affects how much people eat is of importance. The 'bogus' taste test is a measure widely used in eating behaviour research to identify factors that may have a causal effect on food intake. However, there has been no examination of the validity of the bogus taste test as a measure of food intake. We conducted a participant level analysis of 31 published laboratory studies that used the taste test to measure food intake. We assessed whether the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. We examined construct validity by testing whether participant sex, hunger and liking of taste test food were associated with the amount of food consumed in the taste test. In addition, we also examined whether BMI (body mass index), trait measures of dietary restraint and over-eating in response to palatable food cues were associated with food consumption. Results indicated that the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. Factors that were reliably associated with increased consumption during the taste test were being male, having higher baseline hunger, liking of the taste test food and a greater tendency to overeat in response to palatable food cues, whereas trait dietary restraint and BMI were not. These results indicate that the bogus taste test is likely to be a valid measure of food intake and can be used to identify factors that have a causal effect on food intake. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.
Hsu, Lan-Fang; Kao, Ching-Chiu; Wang, Mei-Yeh; Chang, Chun-Jen; Tsai, Pei-Shan
2014-12-01
The Clinically Useful Depression Outcome Scale (CUDOS) is a self-report instrument that assesses symptoms and the severity of depression, but its psychometric properties in patients with type 2 diabetes mellitus in Chinese-speaking populations are unknown. To examine the psychometric properties of the Mandarin Chinese version of the CUDOS (CUDOS-Chinese). A methodological research design. Endocrinology and metabolism outpatient clinics at 2 university-affiliated hospitals in northern Taiwan. Two hundred and fourteen type 2 diabetic patients with a mean age of 62.6 years were enrolled, and two hundred and twelve of them completed the study. Internal consistency, test-retest reliability, concurrent, and contrasted-groups validity were assessed. A receiver operating characteristic curve analysis was performed to assess sensitivity and specificity. Construct validity by means of confirmatory factor analysis was conducted. Internal consistency (Cronbach α of total scale and four subscales=0.93, 0.80, 0.66, 0.80, and 0.83, respectively), test-retest reliability (intra-class correlation coefficients of total scale and four subscales=0.92, 0.89, 0.94, 0.89, and 0.91, respectively), and strong correlations with the Beck Depression Inventory-II (r=0.87) suggested good reliability and validity. The confirmatory factor analysis supported a four-factor model. A cut-off score of 19/20 yielded 77.8% sensitivity and 75.6% specificity. The CUDOS-Chinese demonstrated satisfactory validity and reliability for detecting depression in type 2 diabetic patients in Taiwan. Copyright © 2014 Elsevier Ltd. All rights reserved.
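Internal consistency figures such as the Cronbach α values above follow from item variances and the variance of the total score. The sketch below uses the standard formula with sample variances; the function name is hypothetical and the response matrix in the usage example is toy data, not CUDOS data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for rows of respondents and columns of item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total score)
    """
    n_items = len(responses[0])
    item_variances = [
        variance([row[j] for row in responses]) for j in range(n_items)
    ]
    total_variance = variance([sum(row) for row in responses])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Three respondents answering two items; hypothetical scores.
    print(cronbach_alpha([[1, 2], [2, 1], [3, 3]]))
```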
A robust high-throughput fungal biosensor assay for the detection of estrogen activity.
Zutz, Christoph; Wagener, Karen; Yankova, Desislava; Eder, Stefanie; Möstl, Erich; Drillich, Marc; Rychli, Kathrin; Wagner, Martin; Strauss, Joseph
2017-10-01
Estrogenic active compounds are present in a variety of sources and may alter biological functions in vertebrates. Therefore, it is crucial to develop innovative analytical systems that allow us to screen a broad spectrum of matrices and deliver fast and reliable results. We present the adaptation and validation of a fungal biosensor for the detection of estrogen activity in cow-derived samples and tested the clinical applicability for pregnancy diagnosis in 140 mares and 120 cows. As the biosensor, we used a previously engineered genetically modified strain of the filamentous fungus Aspergillus nidulans, which contains the human estrogen receptor alpha and a reporter construct, in which β-galactosidase gene expression is controlled by an estrogen-responsive element. The estrogen response of the fungal biosensor was validated with blood, urine, feces, milk and saliva. All matrices were screened for estrogenic activity prior to and after chemical extraction and the results were compared to an enzyme immunoassay (EIA). The biosensor showed consistent results in milk, urine and feces, which were comparable to those of the EIA. In contrast to the EIA, no sample pre-treatment by chemical extraction was needed. For 17β-estradiol, the biosensor showed a limit of detection of 1 ng/L. The validation of the biosensor for pregnancy diagnosis revealed a specificity of 100% and a sensitivity of more than 97%. In conclusion, we developed and validated a highly robust fungal biosensor for detection of estrogen activity, which is highly sensitive and economic as it allows analyzing in high-throughput formats without the necessity for organic solvents. Copyright © 2017 Elsevier Inc. All rights reserved.
Dusenberry, Michael W; Brown, Charles K; Brewer, Kori L
2017-02-01
To construct an artificial neural network (ANN) model that can predict the presence of acute CT findings with both high sensitivity and high specificity when applied to the population of patients ≥ age 65 years who have incurred minor head injury after a fall. An ANN was created in the Python programming language using a population of 514 patients ≥ age 65 years presenting to the ED with minor head injury after a fall. The patient dataset was divided into three parts: 60% for "training", 20% for "cross validation", and 20% for "testing". Sensitivity, specificity, positive and negative predictive values, and accuracy were determined by comparing the model's predictions to the actual correct answers for each patient. On the "cross validation" data, the model attained a sensitivity ("recall") of 100.00%, specificity of 78.95%, PPV ("precision") of 78.95%, NPV of 100.00%, and accuracy of 88.24% in detecting the presence of positive head CTs. On the "test" data, the model attained a sensitivity of 97.78%, specificity of 89.47%, PPV of 88.00%, NPV of 98.08%, and accuracy of 93.14% in detecting the presence of positive head CTs. ANNs show great potential for predicting CT findings in the population of patients ≥ 65 years of age presenting with minor head injury after a fall. As a good first step, the ANN showed comparable sensitivity, predictive values, and accuracy, with a much higher specificity than the existing decision rules in clinical usage for predicting head CTs with acute intracranial findings. Copyright © 2016 Elsevier Inc. All rights reserved.
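The five metrics reported above all derive from one 2×2 confusion matrix, which the sketch below makes explicit. The abstract does not report raw counts; the counts used here (44 true positives, 1 false negative, 6 false positives, 51 true negatives, 102 patients) are an assumption, back-calculated as one set of integers that reproduces the reported "test" percentages. The function name `diagnostic_metrics` is likewise illustrative.

```python
def diagnostic_metrics(tp, fn, fp, tn):
    """Standard diagnostic accuracy metrics from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),  # recall: positives correctly detected
        "specificity": tn / (tn + fp),  # negatives correctly ruled out
        "ppv": tp / (tp + fp),          # precision
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fn + fp + tn),
    }


# ASSUMPTION: counts inferred from the reported percentages, not given in the abstract.
test_metrics = diagnostic_metrics(tp=44, fn=1, fp=6, tn=51)
for name, value in test_metrics.items():
    print(f"{name}: {value * 100:.2f}%")
```

With these assumed counts the output matches the reported test-set figures (97.78%, 89.47%, 88.00%, 98.08%, 93.14%), which illustrates how a single false negative among 45 positive scans is enough to move sensitivity below 100%.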
An FMRI-compatible Symbol Search task.
Liebel, Spencer W; Clark, Uraina S; Xu, Xiaomeng; Riskin-Jones, Hannah H; Hawkshead, Brittany E; Schwarz, Nicolette F; Labbe, Donald; Jerskey, Beth A; Sweet, Lawrence H
2015-03-01
Our objective was to determine whether a Symbol Search paradigm developed for functional magnetic resonance imaging (FMRI) is a reliable and valid measure of cognitive processing speed (CPS) in healthy older adults. As all older adults are expected to experience cognitive declines due to aging, and CPS is one of the domains most affected by age, establishing a reliable and valid measure of CPS that can be administered inside an MR scanner may prove invaluable in future clinical and research settings. We evaluated the reliability and construct validity of a newly developed FMRI Symbol Search task by comparing participants' performance in and outside of the scanner and to the widely used and standardized Symbol Search subtest of the Wechsler Adult Intelligence Scale (WAIS). A brief battery of neuropsychological measures was also administered to assess the convergent and discriminant validity of the FMRI Symbol Search task. The FMRI Symbol Search task demonstrated high test-retest reliability when compared to performance on the same task administered out of the scanner (r=.791; p<.001). The criterion validity of the new task was supported, as it exhibited a strong positive correlation with the WAIS Symbol Search (r=.717; p<.001). Predicted convergent and discriminant validity patterns of the FMRI Symbol Search task were also observed. The FMRI Symbol Search task is a reliable and valid measure of CPS in healthy older adults and exhibits expected sensitivity to the effects of age on CPS performance.
Zhang, Jinshui; Yuan, Zhoumiqi; Shuai, Guanyuan; Pan, Yaozhong; Zhu, Xiufang
2017-04-26
This paper developed an approach, the window-based validation set for support vector data description (WVS-SVDD), to determine optimal parameters for support vector data description (SVDD) model to map specific land cover by integrating training and window-based validation sets. Compared to the conventional approach where the validation set included target and outlier pixels selected visually and randomly, the validation set derived from WVS-SVDD constructed a tightened hypersphere because of the compact constraint by the outlier pixels which were located neighboring to the target class in the spectral feature space. The overall accuracies for wheat and bare land achieved were as high as 89.25% and 83.65%, respectively. However, target class was underestimated because the validation set covers only a small fraction of the heterogeneous spectra of the target class. The different window sizes were then tested to acquire more wheat pixels for validation set. The results showed that classification accuracy increased with the increasing window size and the overall accuracies were higher than 88% at all window size scales. Moreover, WVS-SVDD showed much less sensitivity to the untrained classes than the multi-class support vector machine (SVM) method. Therefore, the developed method showed its merits using the optimal parameters, tradeoff coefficient ( C ) and kernel width ( s ), in mapping homogeneous specific land cover.
Initial research program for the National Transonic Facility
NASA Technical Reports Server (NTRS)
Gloss, B. B.
1984-01-01
The construction and checkout of the National Transonic Facility (NTF) have been completed, and detailed calibration is now in progress. The initial NTF research program covers a wide range of study areas falling into three major elements: (1) the assessment of Reynolds number sensitivities for a broad range of configurations and flow phenomena; (2) validation of the ability of NTF to simulate full-scale aerodynamics; and (3) the development of test techniques for improved test simulations in existing wind tunnels. This paper, therefore, is a status report on these various elements of the initial NTF research program.
Sensitivity analysis of cool-down strategies for a transonic cryogenic tunnel
NASA Technical Reports Server (NTRS)
Thibodeaux, J. J.
1982-01-01
Guidelines and suggestions substantiated by real-time simulation data to ensure optimum time and energy use of injected liquid nitrogen for cooling the Langley 0.3-Meter Transonic Cryogenic Tunnel (TCT) are presented. It is directed toward enabling operators and researchers to become cognizant of criteria for using the 0.3-m TCT in an energy- or time-efficient manner. Operational recommendations were developed based on information collected from a validated simulator of the 0.3-m TCT and experimental data from the tunnel. Results and trends, however, can be extrapolated to other similarly constructed cryogenic wind tunnels.
Sensitivity and specificity of univariate MRI analysis of experimentally degraded cartilage
Lin, Ping-Chang; Reiter, David A.; Spencer, Richard G.
2010-01-01
MRI is increasingly used to evaluate cartilage in tissue constructs, explants, and animal and patient studies. However, while mean values of MR parameters, including T1, T2, magnetization transfer rate km, apparent diffusion coefficient ADC, and the dGEMRIC-derived fixed charge density, correlate with tissue status, the ability to classify tissue according to these parameters has not been explored. Therefore, the sensitivity and specificity with which each of these parameters was able to distinguish between normal and trypsin-degraded, and between normal and collagenase-degraded, cartilage explants were determined. Initial analysis was performed using a training set to determine simple group means to which parameters obtained from a validation set were compared. T1 and ADC showed the greatest ability to discriminate between normal and degraded cartilage. Further analysis with k-means clustering, which eliminates the need for a priori identification of sample status, generally performed comparably. Use of fuzzy c-means (FCM) clustering to define centroids likewise did not result in improvement in discrimination. Finally, a FCM clustering approach in which validation samples were assigned in a probabilistic fashion to control and degraded groups was implemented, reflecting the range of tissue characteristics seen with cartilage degradation. PMID:19705467
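The "simple group means" step described above can be read as a nearest-mean classifier on a single MR parameter: group means are computed from a training set, and each validation sample is assigned to the group whose mean is closer. The sketch below follows that reading, which is an assumption about the procedure; the function names and all parameter values are hypothetical and are not data from the study.

```python
from statistics import mean


def fit_group_means(normal_train, degraded_train):
    """Return the (normal, degraded) training-set means of one MR parameter."""
    return mean(normal_train), mean(degraded_train)


def classify(value, normal_mean, degraded_mean):
    """Assign a validation sample to the group with the nearer mean."""
    if abs(value - degraded_mean) < abs(value - normal_mean):
        return "degraded"
    return "normal"


# Hypothetical univariate values (arbitrary units), not study data.
normal_mean, degraded_mean = fit_group_means([1.0, 1.1, 0.9], [1.4, 1.5, 1.6])
validation = [(0.95, "normal"), (1.30, "normal"), (1.45, "degraded"), (1.20, "degraded")]

predictions = [(classify(v, normal_mean, degraded_mean), truth) for v, truth in validation]
tp = sum(p == "degraded" and t == "degraded" for p, t in predictions)
tn = sum(p == "normal" and t == "normal" for p, t in predictions)
n_degraded = sum(t == "degraded" for _, t in predictions)
n_normal = sum(t == "normal" for _, t in predictions)
print("sensitivity:", tp / n_degraded, "specificity:", tn / n_normal)
```

Sensitivity and specificity then follow from comparing the assigned labels with the known status of the validation samples, treating "degraded" as the positive class.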
Lam, Elegance Ting Pui; Lam, Cindy Lo Kuen; Lai, Ching Lung; Yuen, Man Fung; Fong, Daniel Yee Tak
2009-01-01
AIM: To test the psychometric properties of a Chinese [(Hong Kong) HK] translation of the chronic liver disease questionnaire (CLDQ). METHODS: A Chinese (HK) translation of the CLDQ was developed by iterative translation and cognitive debriefing. It was then administered to 72 uncomplicated and 78 complicated chronic hepatitis B (CHB) patients in Hong Kong together with a structured questionnaire on service utilization, and the Chinese (HK) SF-36 Health Survey Version 2 (SF-36v2). RESULTS: Scaling success was ≥ 80% for all but three items. A new factor assessing sleep was found and items of two (Fatigue and Systemic Symptoms) subscales tended to load on the same factor. Internal consistency and test-retest reliabilities ranged from 0.58-0.90 for different subscales. Construct validity was confirmed by the expected correlations between the SF-36v2 Health Survey and CLDQ scores. Mean scores of CLDQ were significantly lower in complicated compared with uncomplicated CHB, supporting sensitivity in detecting differences between groups. CONCLUSION: The Chinese (HK) CLDQ is valid, reliable and sensitive for patients with CHB. Some modifications to the scaling structure might further improve its psychometric properties. PMID:19598306
ERIC Educational Resources Information Center
Lowe, Patricia A.; Papanastasiou, Elena C.; DeRuyck, Kimberly A.; Reynolds, Cecil R.
2005-01-01
In this study, the authors investigated the temporal stability and construct validity of the Adult Manifest Anxiety Scale-College Version (AMAS-C; C. R. Reynolds, B. O. Richmond, & P. A. Lowe, 2003b) scores. Results indicated that the AMAS-C scores had adequate to excellent test score stability, and evidence supported the construct validity of the…
White, Darcy; Rosenberg, Eli S; Cooper, Hannah L F; del Rio, Carlos; Sanchez, Travis H; Salazar, Laura F; Sullivan, Patrick S
2014-05-01
Men who have sex with men (MSM), particularly young black MSM, are disproportionately affected in the United States' HIV epidemic. Drug use may contribute to these disparities, yet previous studies have failed to provide evidence of elevated use among black MSM, relying exclusively on self-reported usage. This study uses biological assays to validate self-reports of drug use and explore the potential for misclassification to distort findings on racial patterns of use in this population. From an Atlanta-based cohort study of 454 black and 349 white MSM from 2010 to 2012, participants' self-reported drug use was compared to urine drug screening findings. The sensitivity of self-report was calculated as the proportion reporting recent usage among those who screened positive. Multivariable regression models were constructed to examine racial patterns in self-report, urine-detection, and self-report sensitivity of marijuana and cocaine usage, adjusted for socio-demographic factors. In analyses that adjusted for age, education, income, sexual orientation, and history of arrest, black MSM were less likely to report recent use of marijuana (P<0.001) and cocaine (P=0.02), but equally likely to screen positive for either drug. This discrepancy between self-reported and urine-detected drug use was explained by significantly lower sensitivity of self-report for black participants (P<0.001 for marijuana, P<0.05 for cocaine). The contribution of individual drug-related risk behaviors to the HIV disparities between black and white MSM should be revisited with methods that validate self-reports of illegal drug use. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Cox, Zachary L; Lewis, Connie M; Lai, Pikki; Lenihan, Daniel J
2017-01-01
We aim to validate the diagnostic performance of the first fully automatic, electronic heart failure (HF) identification algorithm and evaluate the implementation of an HF Dashboard system with 2 components: real-time identification of decompensated HF admissions and accurate characterization of disease characteristics and medical therapy. We constructed an HF identification algorithm requiring 3 of 4 identifiers: B-type natriuretic peptide >400 pg/mL; admitting HF diagnosis; history of HF International Classification of Disease, Ninth Revision, diagnosis codes; and intravenous diuretic administration. We validated the diagnostic accuracy of the components individually (n = 366) and combined in the HF algorithm (n = 150) compared with a blinded provider panel in 2 separate cohorts. We built an HF Dashboard within the electronic medical record characterizing the disease and medical therapies of HF admissions identified by the HF algorithm. We evaluated the HF Dashboard's performance over 26 months of clinical use. Individually, the algorithm components displayed variable sensitivity and specificity, respectively: B-type natriuretic peptide >400 pg/mL (89% and 87%); diuretic (80% and 92%); and International Classification of Disease, Ninth Revision, code (56% and 95%). The HF algorithm achieved a high specificity (95%), positive predictive value (82%), and negative predictive value (85%) but achieved limited sensitivity (56%) secondary to missing provider-generated identification data. The HF Dashboard identified and characterized 3147 HF admissions over 26 months. Automated identification and characterization systems can be developed and used with a substantial degree of specificity for the diagnosis of decompensated HF, although sensitivity is limited by clinical data input. Copyright © 2016 Elsevier Inc. All rights reserved.
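The identification rule described above, any 3 of 4 identifiers, can be sketched as a simple count over boolean criteria. This is an illustration of the rule as stated in the abstract only; the function name, parameter names, and the handling of a missing B-type natriuretic peptide value are assumptions, not details of the authors' implementation.

```python
def meets_hf_rule(bnp_pg_ml, admitting_hf_dx, prior_hf_icd9, iv_diuretic, required=3):
    """Return True when at least `required` of the 4 identifiers are present.

    bnp_pg_ml: B-type natriuretic peptide in pg/mL, or None if not measured
               (treating None as "criterion not met" is an assumption).
    admitting_hf_dx: admitting diagnosis of heart failure.
    prior_hf_icd9: history of HF ICD-9 diagnosis codes.
    iv_diuretic: intravenous diuretic administered.
    """
    identifiers = [
        bnp_pg_ml is not None and bnp_pg_ml > 400,
        bool(admitting_hf_dx),
        bool(prior_hf_icd9),
        bool(iv_diuretic),
    ]
    return sum(identifiers) >= required


# Hypothetical admissions.
print(meets_hf_rule(650, True, False, True))   # three identifiers present
print(meets_hf_rule(300, True, False, True))   # only two identifiers present
```

A rule of this shape depends on every identifier being recorded, which fits the abstract's observation that sensitivity was limited by missing provider-generated data.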
Validation of learning style measures: implications for medical education practice.
Chapman, Dane M; Calhoun, Judith G
2006-06-01
It is unclear which learners would most benefit from the more individualised, student-structured, interactive approaches characteristic of problem-based and computer-assisted learning. The validity of learning style measures is uncertain, and there is no unifying learning style construct identified to predict such learners. This study was conducted to validate learning style constructs and to identify the learners most likely to benefit from problem-based and computer-assisted curricula. Using a cross-sectional design, 3 established learning style inventories were administered to 97 post-Year 2 medical students. Cognitive personality was measured by the Group Embedded Figures Test, information processing by the Learning Styles Inventory, and instructional preference by the Learning Preference Inventory. The 11 subscales from the 3 inventories were factor-analysed to identify common learning constructs and to verify construct validity. Concurrent validity was determined by intercorrelations of the 11 subscales. A total of 94 pre-clinical medical students completed all 3 inventories. Five meaningful learning style constructs were derived from the 11 subscales: student- versus teacher-structured learning; concrete versus abstract learning; passive versus active learning; individual versus group learning, and field-dependence versus field-independence. The concurrent validity of 10 of 11 subscales was supported by correlation analysis. Medical students most likely to thrive in a problem-based or computer-assisted learning environment would be expected to score highly on abstract, active and individual learning constructs and would be more field-independent. Learning style measures were validated in a medical student population and learning constructs were established for identifying learners who would most likely benefit from a problem-based or computer-assisted curriculum.
The Physician Values in Practice Scale: Construction and Initial Validation
ERIC Educational Resources Information Center
Hartung, Paul J.; Taber, Brian J.; Richard, George V.
2005-01-01
Measures of values typically appraise the construct globally, across life domains or relative to a broad life domain such as work. We conducted two studies to construct and initially validate an occupation- and context-specific values measure. Study 1, based on a sample of 192 medical students, describes the initial construction and item analysis…
Randhawa, Sharan; Walterfang, Mark; Miller, Kathryn; Scholes, Amelia; Mocellin, Ramon; Velakoulis, Dennis
2007-07-01
The carer history is an integral part of the assessment of patients with cognitive impairment. We aimed to develop a comprehensive yet concise carer questionnaire, the CogRisk, which captures actuarial risk variables for cognitive impairment in addition to key symptoms suggestive of cognitive decline in a number of cognitive domains, and to then assess its validity and reliability in a neuropsychiatric population. Carers of patients assessed for cognitive impairment completed the CogRisk, and patients were clinically assessed using the Mini-Mental State Examination (MMSE) and Neuropsychiatry Unit COGnitive assessment tool (NUCOG). Reliability was assessed using test-retest and interrater measures and measures of internal consistency. Construct and concurrent validity were assessed using correlation between total and subscale scores on the CogRisk, total scores on the NUCOG and MMSE, and subscale scores on the NUCOG. Predictive validity was determined using measures of sensitivity and specificity and using receiver operating characteristic (ROC) methods. The CogRisk was completed by all carers in less than 10 min. The total CogRisk score correlated significantly with total MMSE and NUCOG scores (r=-0.511 and -0.563, respectively) and remained highly significant when age and education were controlled for. Internal consistency of CogRisk items was high (alpha=0.943). Intrarater reliability of the CogRisk was high with an intraclass correlation coefficient of .978 (P<.001), and interrater reliability between carers was also high at 0.868 (P<.05). Sensitivity and specificity for the detection of dementia were .70 and .73, respectively, with area under the ROC curve not significantly different from that of the MMSE or NUCOG. The CogRisk is a brief carer-rated tool of a patient's cognitive functioning developed for use within a neuropsychiatric setting. It exhibited good concurrent validity, internal consistency, and interrater and intrarater reliability. The CogRisk also demonstrated good sensitivity and specificity for dementia. The CogRisk provides carer information, which complements the clinical assessment and can be used to focus on direct carer interview.
Singh, Varun Pratap; Singh, Rajkumar
2014-03-01
The aim of this study was to develop a reliable and valid Nepali version of the Psychosocial Impact of Dental Aesthetic Questionnaire (PIDAQ). Cross-sectional descriptive validation study. B.P. Koirala Institute of Health Sciences, Dharan, Nepal. A rigorous translation process including conceptual and semantic evaluation, translation, back translation and pre-testing was carried out. Two hundred and fifty-two undergraduates, including equal numbers of males and females with an age ranging from 18 to 29 years (mean age: 22·33±2·114 years), participated in this study. Reliability was assessed by Cronbach's alpha coefficient and the coefficient of correlation was used to assess correlation between items and test-retest reliability. The construct validity was tested by factorial analysis. Convergent construct validity was tested by comparison of PIDAQ scores with the aesthetic component of the index of orthodontic treatment needs (IOTN-AC) and perception of occlusion scale (POS), respectively. Discriminant construct validity was assessed by differences in score for those who demand treatment and those who did not. The response rate was 100%. One hundred and twenty-three individuals had a demand for orthodontic treatment. The Nepali PIDAQ had excellent reliability with Cronbach's alpha of 0·945, corrected item correlation between 0·525 and 0·790 and overall test-retest reliability of 0·978. The construct validity was good with formation of a new sub-domain 'Dental self-consciousness'. The scale had good correlation with IOTN-AC and POS fulfilling convergent construct validity. The discriminant construct validity was proved by significant differences in scores for subjects with demand and without demand for treatment. To conclude, Nepali version of PIDAQ has good psychometric properties and can be used effectively in this population group for further research.
Goldschmidt, Andrea B.
2017-01-01
Background Binge eating is a marker of weight gain and obesity, and a hallmark feature of eating disorders. Yet, its component constructs—overeating and loss of control (LOC) while eating—are poorly understood and difficult to measure. Objective To critically review the human literature concerning the validity of LOC and overeating across the age and weight spectrum. Data sources English-language articles addressing the face, convergent, discriminant, and predictive validity of LOC and overeating were included. Results LOC and overeating appear to have adequate face validity. Emerging evidence supports the convergent and predictive validity of the LOC construct, given its unique cross-sectional and prospective associations with numerous anthropometric, psychosocial, and eating behavior-related factors. Overeating may be best conceptualized as a marker of excess weight status. Limitations Binge eating constructs, particularly in the context of subjectively large episodes, are challenging to measure reliably. Few studies addressed overeating in the absence of LOC, thereby limiting conclusions about the validity of the overeating construct independent of LOC. Additional studies addressing the discriminant validity of both constructs are warranted. Discussion Suggestions for future weight-related research and for appropriately defining binge eating in the eating disorders diagnostic scheme are presented. PMID:28165655
An evidence-based decision assistance model for predicting training outcome in juvenile guide dogs
Craigon, Peter J.; Blythe, Simon A.; England, Gary C. W.; Asher, Lucy
2017-01-01
Working dog organisations, such as Guide Dogs, need to regularly assess the behaviour of the dogs they train. In this study we developed a questionnaire-style behaviour assessment completed by training supervisors of juvenile guide dogs aged 5, 8 and 12 months old (n = 1,401), and evaluated aspects of its reliability and validity. Specifically, internal reliability, temporal consistency, construct validity, predictive criterion validity (comparing against later training outcome) and concurrent criterion validity (comparing against a standardised behaviour test) were evaluated. Thirty-nine questions were sourced either from previously published literature or created to meet requirements identified via Guide Dogs staff surveys and staff feedback. Internal reliability analyses revealed seven reliable and interpretable trait scales named according to the questions within them as: Adaptability; Body Sensitivity; Distractibility; Excitability; General Anxiety; Trainability and Stair Anxiety. Intra-individual temporal consistency of the scale scores between 5–8, 8–12 and 5–12 months was high. All scales except Body Sensitivity showed some degree of concurrent criterion validity. Predictive criterion validity was supported for all seven scales, since associations were found with training outcome at at least one age. Thresholds of z-scores on the scales were identified that were able to distinguish later training outcome by identifying 8.4% of all dogs withdrawn for behaviour and 8.5% of all qualified dogs, with 84% and 85% specificity. The questionnaire assessment was reliable and could detect traits that are consistent within individuals over time, despite juvenile dogs undergoing development during the study period. By applying thresholds to scores produced from the questionnaire this assessment could prove to be a highly valuable decision-making tool for Guide Dogs. This is the first questionnaire-style assessment of juvenile dogs that has shown value in predicting the training outcome of individual working dogs. PMID:28614347
Validation of the ArthroS virtual reality simulator for arthroscopic skills.
Stunt, J J; Kerkhoffs, G M M J; van Dijk, C N; Tuijthof, G J M
2015-11-01
Virtual reality simulator training has become important for acquiring arthroscopic skills. A new simulator for knee arthroscopy ArthroS™ has been developed. The purpose of this study was to demonstrate face and construct validity, executed according to a protocol used previously to validate arthroscopic simulators. Twenty-seven participants were divided into three groups having different levels of arthroscopic experience. Participants answered questions regarding general information and the outer appearance of the simulator for face validity. Construct validity was assessed with one standardized navigation task. Face validity, educational value and user friendliness were further determined by giving participants three exercises and by asking them to fill out the questionnaire. Construct validity was demonstrated between experts and beginners. Median task times were not significantly different for all repetitions between novices and intermediates, and between intermediates and experts. Median face validity was 8.3 for the outer appearance, 6.5 for the intra-articular joint and 4.7 for surgical instruments. Educational value and user friendliness were perceived as nonsatisfactory, especially because of the lack of tactile feedback. The ArthroS™ demonstrated construct validity between novices and experts, but did not demonstrate full face validity. Future improvements should be mainly focused on the development of tactile feedback. It is necessary that a newly presented simulator is validated to prove it actually contributes to proficiency of skills.
Validity of Sensory Systems as Distinct Constructs
Su, Chia-Ting
2014-01-01
This study investigated the validity of sensory systems as distinct measurable constructs as part of a larger project examining Ayres’s theory of sensory integration. Confirmatory factor analysis (CFA) was conducted to test whether sensory questionnaire items represent distinct sensory system constructs. Data were obtained from clinical records of two age groups, 2- to 5-yr-olds (n = 231) and 6- to 10-yr-olds (n = 223). With each group, we tested several CFA models for goodness of fit with the data. The accepted model was identical for each group and indicated that tactile, vestibular–proprioceptive, visual, and auditory systems form distinct, valid factors that are not age dependent. In contrast, alternative models that grouped items according to sensory processing problems (e.g., over- or underresponsiveness within or across sensory systems) did not yield valid factors. Results indicate that distinct sensory system constructs can be measured validly using questionnaire data. PMID:25184467
Espinoza-Venegas, Maritza; Sanhueza-Alvarado, Olivia; Ramírez-Elizondo, Noé; Sáez-Carrillo, Katia
2015-01-01
OBJECTIVE: The current study aimed to validate the construct and reliability of an emotional intelligence scale. METHOD: The Trait Meta-Mood Scale-24 was applied to 349 nursing students. The process included content validation, which involved expert reviews, pilot testing, measurements of reliability using Cronbach's alpha, and factor analysis to corroborate the validity of the theoretical model's construct. RESULTS: Adequate Cronbach coefficients were obtained for all three dimensions, and factor analysis confirmed the scale's dimensions (perception, comprehension, and regulation). CONCLUSION: The Trait Meta-Mood Scale is a reliable and valid tool to measure the emotional intelligence of nursing students. Its use allows for accurate determinations of individuals' abilities to interpret and manage emotions. At the same time, this new construct is of potential importance for measurements in nursing leadership; educational, organizational, and personal improvements; and the establishment of effective relationships with patients. PMID:25806642
ERIC Educational Resources Information Center
Aebi, Marcel; Plattner, Belinda; Metzke, Christa Winkler; Bessler, Cornelia; Steinhausen, Hans-Christoph
2013-01-01
Background: Different dimensions of oppositional defiant disorder (ODD) have been found as valid predictors of further mental health problems and antisocial behaviors in youth. The present study aimed at testing the construct, concurrent, and predictive validity of ODD dimensions derived from parent- and self-report measures. Method: Confirmatory…
Sørensen, Hans Eibe; Slater, Stanley F
2008-08-01
Atheoretical measure purification may lead to construct deficient measures. The purpose of this paper is to provide a theoretically driven procedure for the development and empirical validation of symmetric component measures of multidimensional constructs. Particular emphasis is placed on establishing a formalized three-step procedure for achieving a posteriori content validity. Then the procedure is applied to development and empirical validation of two symmetrical component measures of market orientation, customer orientation and competitor orientation. Analysis suggests that average variance extracted is particularly critical to reliability in the respecification of multi-indicator measures. In relation to this, the results also identify possible deficiencies in using Cronbach alpha for establishing reliable and valid measures.
Küçükdeveci, Ayse A; Sahin, Hülya; Ataman, Sebnem; Griffiths, Bridget; Tennant, Alan
2004-02-15
Guidelines have been established for cross-cultural adaptation of outcome measures. However, invariance across cultures must also be demonstrated through analysis of Differential Item Functioning (DIF). This is tested in the context of a Turkish adaptation of the Health Assessment Questionnaire (HAQ). Internal construct validity of the adapted HAQ is assessed by Rasch analysis; reliability, by internal consistency and the intraclass correlation coefficient; external construct validity, by association with impairments and American College of Rheumatology functional stages. Cross-cultural validity is tested through DIF by comparison with data from the UK version of the HAQ. The adapted version of the HAQ demonstrated good internal construct validity through fit of the data to the Rasch model (mean item fit 0.205; SD 0.998). Reliability was excellent (alpha = 0.97) and external construct validity was confirmed by expected associations. DIF for culture was found in only 1 item. Cross-cultural validity was found to be sufficient for use in international studies between the UK and Turkey. Future adaptation of instruments should include analysis of DIF at the field testing stage in the adaptation process.
The reliability and validity of a Japanese version of symptom checklist 90 revised
Tomioka, Mitsunao; Shimura, Midori; Hidaka, Mikio; Kubo, Chiharu
2008-01-01
Objective: To examine the validity and reliability of a Japanese version of the Symptom Checklist 90 Revised (SCL-90-R (J)). Methods: The English SCL-90-R was translated to Japanese and the Japanese version confirmed by back-translation. To determine the factor validity and internal consistency of the nine primary subscales, 460 people from the community completed SCL-90-R(J). Test-retest reliability was examined for 104 outpatients and 124 healthy undergraduate students. The convergent-discriminant validity was determined for 80 inpatients who replied to both SCL-90-R(J) and the Minnesota Multiphasic Personality Inventory (MMPI). Results: The correlation coefficients between the nine primary subscales and items were .26 to .78. Cronbach's alpha coefficients were from .76 (Phobic Anxiety) to .86 (Interpersonal Sensitivity). Pearson's correlation coefficients between test-retest scores were from .81 (Psychoticism) to .90 (Somatization) for the outpatients and were from .64 (Phobic Anxiety) to .78 (Paranoid Ideation) for the students. Each of the nine primary subscales correlated well with their corresponding constructs in the MMPI. Conclusion: We confirmed the validity and reliability of SCL-90-R(J) for the measurement of individual distress. The nine primary subscales were consistent with the items of the original English version. PMID:18957078
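Several abstracts in this listing report internal consistency as Cronbach's alpha. As a minimal sketch of how that coefficient is usually computed (the function name and the example responses are hypothetical, not taken from any of the studies), alpha compares the sum of the item variances with the variance of the total score:

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: 3 respondents x 2 items (not data from the study).
example = [[2, 3], [4, 4], [6, 5]]
alpha = cronbach_alpha(example)
```

Values near 1 indicate that the items vary together; the sketch does not handle missing responses or a total score with zero variance.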
Lingner, Thomas; Kataya, Amr R. A.; Reumann, Sigrun
2012-01-01
We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals. PMID:22415050
The development and testing of a skin tear risk assessment tool.
Newall, Nelly; Lewin, Gill F; Bulsara, Max K; Carville, Keryln J; Leslie, Gavin D; Roberts, Pam A
2017-02-01
The aim of the present study is to develop a reliable and valid skin tear risk assessment tool. The six characteristics identified in a previous case control study as constituting the best risk model for skin tear development were used to construct a risk assessment tool. The ability of the tool to predict skin tear development was then tested in a prospective study. Between August 2012 and September 2013, 1466 tertiary hospital patients were assessed at admission and followed up for 10 days to see if they developed a skin tear. The predictive validity of the tool was assessed using receiver operating characteristic (ROC) analysis. When the tool was found not to have performed as well as hoped, secondary analyses were performed to determine whether a potentially better performing risk model could be identified. The tool was found to have high sensitivity but low specificity and therefore have inadequate predictive validity. Secondary analysis of the combined data from this and the previous case control study identified an alternative better performing risk model. The tool developed and tested in this study was found to have inadequate predictive validity. The predictive validity of an alternative, more parsimonious model now needs to be tested. © 2015 Medicalhelplines.com Inc and John Wiley & Sons Ltd.
Lingner, Thomas; Kataya, Amr R A; Reumann, Sigrun
2012-02-01
We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals.
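The abstract does not name the three outlier methods that were compared, so the following is only a generic illustration of the idea it describes: fit a normal distribution to the prediction scores of one group and flag scores that fall in the extreme lower tail. The function name, the 5% threshold, and the scores are assumptions made for this sketch.

```python
from statistics import NormalDist, mean, stdev


def low_tail_outliers(scores, tail_probability=0.05):
    """Return scores whose fitted-normal CDF value lies below tail_probability."""
    # Fit a normal distribution using the sample mean and standard deviation.
    fitted = NormalDist(mean(scores), stdev(scores))
    return [s for s in scores if fitted.cdf(s) < tail_probability]


# Hypothetical prediction scores for one orthologous group (not real data).
group_scores = [0.90, 0.92, 0.88, 0.91, 0.89, 0.93, 0.87, 0.90, 0.91, 0.20]
flagged = low_tail_outliers(group_scores)
```

Because the mean and standard deviation are estimated from all scores, including the suspect ones, a robust variant (for example one based on the median) might be preferable when several outliers are present.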
Satisfaction with Daily Occupations for Elderly People (SDO-E)—Adaptation and Psychometric Testing
Wästberg, Birgitta; Eklund, Mona
2017-01-01
Satisfaction with everyday occupations has been shown to be important for health and well-being in various populations. Research into satisfaction with everyday occupations among elderly persons is, however, lacking. The aim was to investigate the psychometric properties of an adapted test version of the Satisfaction with Daily Occupations instrument (SDO) for elderly people, called SDO-E. Five hospital-based occupational therapists working with elderly people evaluated the content validity and usability of the SDO-E. The elderly participants consisted of 50 people from outside of the health services and 42 inpatients at an internal medicine clinic. They completed the SDO-E and rated their perceived health, activity level, and general satisfaction with daily occupations. The SDO-E showed fair content validity and utility, acceptable internal consistency, good preliminary construct validity and relevant known-groups validity. The SDO-E thus appears to be a useful screening tool for assessing activity level and satisfaction with daily occupations among elderly people, and a complement to other self-report instruments concerning factors connected with health and well-being. Future research should further explore the content validity of the SDO-E, particularly the views of the elderly themselves, and investigate the SDO-E in terms of sensitivity to change. PMID:28946667
Suárez, María Fernanda; Sánchez, Ricardo; Calvo, José Manuel
2013-09-01
To validate the SQLS scale in Colombian patients diagnosed with schizophrenia. The self-report scale was applied to 251 patients. Test-retest reliability, internal consistency and inter-scale correlation with the SF-12 were measured by applying the scale again 2 days later in 28 patients and 30 days later in 38; 50 patients filled out the SF-12 scale to determine the concurrent validity. Three domains were found, all of them having Cronbach's alphas >0.7. The three-factor model did not show adequate fit indexes. Test-retest evaluation showed satisfactory correlation values (>0.86). Sensitivity to change did not show significant differences between the repeated measures. As regards concurrent validity, acceptable correlation values were found only in SF-12 domains related to mental health and functioning. The SQLS has a factorial structure consistent with previous reports, adequate internal consistency and temporal stability. However, a more detailed examination of some of these items is required, considering that the measurement of the construct does not appear to be adequate. Copyright © 2013 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Chaabene, Helmi; Negra, Yassine; Bouguezzi, Raja; Capranica, Laura; Franchini, Emerson; Prieske, Olaf; Hbacha, Hamdi; Granacher, Urs
2018-01-01
The regular monitoring of physical fitness and sport-specific performance is important in elite sports to increase the likelihood of success in competition. This study aimed to systematically review and to critically appraise the methodological quality, validation data, and feasibility of the sport-specific performance assessment in Olympic combat sports like amateur boxing, fencing, judo, karate, taekwondo, and wrestling. A systematic search was conducted in the electronic databases PubMed, Google-Scholar, and Science-Direct up to October 2017. Studies in combat sports were included that reported validation data (e.g., reliability, validity, sensitivity) of sport-specific tests. Overall, 39 studies were eligible for inclusion in this review. The majority of studies (74%) contained sample sizes <30 subjects. Nearly one third of the reviewed studies lacked a sufficient description (e.g., anthropometrics, age, expertise level) of the included participants. Seventy-two percent of studies did not sufficiently report inclusion/exclusion criteria of their participants. In 62% of the included studies, the description and/or inclusion of a familiarization session(s) was either incomplete or nonexistent. Sixty percent of studies did not report any details about the stability of testing conditions. Approximately half of the studies examined reliability measures of the included sport-specific tests (intraclass correlation coefficient [ICC] = 0.43-1.00). Content validity was addressed in all included studies, criterion validity (only the concurrent aspect of it) in approximately half of the studies with correlation coefficients ranging from r = -0.41 to 0.90. Construct validity was reported in 31% of the included studies and predictive validity in only one. Test sensitivity was addressed in 13% of the included studies. The majority of studies (64%) ignored and/or provided incomplete information on test feasibility and methodological limitations of the sport-specific test. In 28% of the included studies, insufficient information or a complete lack of information was provided in the respective field of the test application. Several methodological gaps exist in studies that used sport-specific performance tests in Olympic combat sports. Additional research should adopt more rigorous validation procedures in the application and description of sport-specific performance tests in Olympic combat sports.
Slappendel, Geerte; Mandy, William; van der Ende, Jan; Verhulst, Frank C; van der Sijde, Ad; Duvekot, Jorieke; Skuse, David; Greaves-Lord, Kirstin
2016-05-01
The Developmental Diagnostic Dimensional Interview-short version (3Di-sv) provides a brief standardized parental interview for diagnosing autism spectrum disorder (ASD). This study explored its validity, and compatibility with DSM-5 ASD. 3Di-sv classifications showed good sensitivity but low specificity when compared to ADOS-2-confirmed clinical diagnosis. Confirmatory factor analyses found a better fit against a DSM-5 model than a DSM-IV-TR model of ASD. Exploration of the content validity of the 3Di-sv for the DSM-5 revealed some construct underrepresentation; we therefore obtained data from a panel of 3Di-trained clinicians from ASD-specialized centers to recommend items to fill these gaps. Taken together, the 3Di-sv provides a solid basis to create a similar instrument suitable for DSM-5. Concrete recommendations are provided to improve DSM-5 compatibility.
Posttraumatic maladaptive beliefs scale: evolution of the personal beliefs and reactions scale.
Vogt, Dawne S; Shipherd, Jillian C; Resick, Patricia A
2012-09-01
The posttraumatic maladaptive beliefs scale (PMBS) was developed to measure maladaptive beliefs about current life circumstances that may occur following trauma exposure. This scale assesses maladaptive beliefs within three domains: (a) threat of harm, (b) self-worth and judgment, and (c) reliability and trustworthiness of others. Items for the PMBS were drawn from a larger preexisting measure that assesses a wide range of personal beliefs and reactions associated with trauma exposure. The construct validity of the PMBS was assessed in two independent samples of interpersonal trauma survivors. This article provides data to support the reliability and validity of the PMBS as an instrument to assess general, rather than trauma-specific, maladaptive beliefs that have relevance for functioning in the aftermath of a traumatic event. Moreover, the measure is sensitive to changes that occur in treatment, and the length of the measure (15 items) is practical for use in clinical settings.
Development and validation of a Haitian Creole screening instrument for depression
Rasmussen, Andrew; Eustache, Eddy; Raviola, Giuseppe; Kaiser, Bonnie; Grelotti, David; Belkin, Gary
2014-01-01
Developing mental health care capacity in post-earthquake Haiti is hampered by the lack of assessments that include culturally bound idioms Haitians use when discussing emotional distress. The current study describes a novel emic-etic approach to developing a depression screening for Partners In Health/Zanmi Lasante. In Study 1 Haitian key informants were asked to classify symptoms and describe categories within a pool of symptoms of common mental disorders. Study 2 tested the symptom set that best approximated depression in a sample of depressed and not depressed Haitians in order to select items for the screening tool. The resulting 13-item instrument produced scores with high internal reliability that were sensitive to culturally-informed diagnoses, and interpretations with construct and concurrent validity (vis-à-vis functional impairment). Discussion focuses on the appropriate use of this tool and integrating emic perspectives into developing psychological assessments globally. The screening tool is provided as an Appendix. PMID:25080426
Langenbucher, J; Sulesund, D; Chung, T; Morgenstern, J
1996-01-01
Illness severity and self-efficacy are two constructs of growing interest as predictors of clinical response in alcoholism. Using alternative measures of illness severity (DSM-IV symptom count, Alcohol Dependence Scale, and Addiction Severity Index) and self-efficacy (brief version of the Situational Confidence Questionnaire) rigorously controlled for theoretically important background variables, we studied their unique contribution to multiple indices of relapse, relapse latency, and use of alternative coping behaviors in a large, heterogeneous clinical sample. The Alcohol Dependence Scale contributed to the prediction of 4 of 5 relapse indicators. The SCQ failed to predict relapse behavior or its precursor, coping response. The findings emphasize the predictive validity of severity of dependence as a course specifier and underline the need for more sensitive and externally valid measures of cognitive processes such as self-efficacy for application in future studies of posttreatment behavior.
The juvenile arthritis foot disability index: development and evaluation of measurement properties.
André, Marie; Hagelberg, Stefan; Stenström, Christina H
2004-12-01
To develop a new juvenile arthritis foot disability index (JAFI) and to test it for validity and reliability. Samples of 14 children/adolescents and 30 children/adolescents with juvenile idiopathic arthritis (JIA) and 29 healthy children/adolescents participated. We used a questionnaire derived from the International Classification of Functioning, Disability and Health that included 27 statements divided into the dimensions Impairment, Activity Limitation, and Participation Restriction. Comments on the contents were invited from parents and adolescents. Convergent and divergent construct validity was examined by comparing the 3 JAFI dimensions to joint impairment scores, the Childhood Health Assessment Questionnaire (CHAQ), and self-rated, foot-related participation restriction. Known groups construct validity was assessed by comparing answers from children with JIA to those from healthy children. Test-retest stability was investigated over one week. One item was added after suggestions from 2 participants. A consistent pattern of increasing JAFI scores was found with increasing joint impairment scores, CHAQ scores, and self-rated foot-related participation restriction. Foot-related disability as assessed by JAFI was more pronounced in children with JIA than in healthy controls. One statement showing a floor effect was excluded. No internal redundancy (rs > 0.90) between items was found, and internal consistency within each subscale was satisfactory (rs > 0.50) for all items but one. No systematic differences were found between test and retest, and weighted kappa coefficients for the 3 JAFI dimensions were 0.90, 0.85, and 0.88. The JAFI appears to be valid and reliable for assessing foot-related disability among children/adolescents with JIA. Its sensitivity to change remains to be investigated.
Kasitanon, N; Wangkaew, S; Puntana, S; Sukitawut, W; Leong, K P; Louthrenoo, W
2013-03-01
The English version of the Systemic Lupus Erythematosus Quality of Life Questionnaire (SLEQOL) is a validated disease-specific quality of life instrument. The aim of this study was to evaluate the psychometric properties of the Thai version of the SLEQOL (SLEQOL-TH). Two independent translators translated the SLEQOL into Thai. The back translation of this version was performed by two other independent translators. The final version, SLEQOL-TH, was completed after resolving the discrepancies revealed by the back translation. One hundred and nine patients with SLE were enrolled to test the reliability, construct validity, floor and ceiling effects, and sensitivity to the changes of the SLEQOL-TH at six months. The differential item functioning (DIF) between the Thai and English versions was analyzed using the partial gamma. The internal consistency of the SLEQOL-TH was satisfactory with the overall Cronbach's alpha of 0.86. The test-retest reliability of the SLEQOL-TH was acceptable with the intra-class correlation coefficient of 0.86. Low correlations between the SLEQOL-TH and SLEDAI were observed. The total score of the SLEQOL-TH was moderately responsive to changes in quality of life, with a standardized response mean of 0.50. When comparing the SLEQOL-TH from Thai SLE patients with the original SLEQOL version obtained from Singapore SLE patients, 11 out of 40 items showed a moderate to large DIF. The SLEQOL-TH has acceptable psychometric properties and shows construct validity. In comparison with the English version of SLEQOL, there are some items that showed DIF. The applicability of the SLEQOL-TH in real-life clinical practice and clinical trials needs to be determined.
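The responsiveness figure quoted above is a standardized response mean (SRM), conventionally the mean change score divided by the standard deviation of the change scores. The sketch below assumes that conventional definition; the function name and the paired scores are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def standardized_response_mean(baseline, follow_up):
    """Mean of paired change scores divided by their sample standard deviation."""
    if len(baseline) != len(follow_up):
        raise ValueError("baseline and follow_up must be paired")
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Hypothetical paired scores at baseline and at follow-up.
baseline_scores = [10, 12, 14, 16]
follow_up_scores = [11, 14, 15, 18]
srm = standardized_response_mean(baseline_scores, follow_up_scores)
```

An SRM of about 0.5 is commonly read as moderate responsiveness, which agrees with the wording used in the abstract.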
NASA Astrophysics Data System (ADS)
Liu, Sijun; Chen, Jiaping; Wang, Jianming; Wu, Zhuchao; Wu, Weihua; Xu, Zhiwei; Hu, Wenbiao; Xu, Fei; Tong, Shilu; Shen, Hongbing
2017-10-01
Hand, foot, and mouth disease (HFMD) is a significant public health issue in China and an accurate prediction of epidemic can improve the effectiveness of HFMD control. This study aims to develop a weather-based forecasting model for HFMD using the information on climatic variables and HFMD surveillance in Nanjing, China. Daily data on HFMD cases and meteorological variables between 2010 and 2015 were acquired from the Nanjing Center for Disease Control and Prevention, and China Meteorological Data Sharing Service System, respectively. A multivariate seasonal autoregressive integrated moving average (SARIMA) model was developed and validated by dividing HFMD infection data into two datasets: the data from 2010 to 2013 were used to construct a model and those from 2014 to 2015 were used to validate it. Moreover, we used weekly prediction for the data between 1 January 2014 and 31 December 2015 and leave-1-week-out prediction was used to validate the performance of model prediction. SARIMA (2,0,0)₅₂ associated with the average temperature at lag of 1 week appeared to be the best model (R² = 0.936, BIC = 8.465), which also showed non-significant autocorrelations in the residuals of the model. In the validation of the constructed model, the predicted values matched the observed values reasonably well between 2014 and 2015. There was a high agreement rate between the predicted values and the observed values (sensitivity 80%, specificity 96.63%). This study suggests that the SARIMA model with average temperature could be used as an important tool for early detection and prediction of HFMD outbreaks in Nanjing, China.
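Sensitivity and specificity, reported here and in several other abstracts in this listing, follow directly from a 2x2 table of predicted versus observed outcomes. The sketch below uses the standard definitions; the function name and the counts are hypothetical and do not reproduce the study's data.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures from true/false positive and negative counts."""
    return {
        "sensitivity": tp / (tp + fn),  # share of true events that were detected
        "specificity": tn / (tn + fp),  # share of non-events correctly ruled out
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }


# Hypothetical counts chosen only to illustrate the arithmetic.
metrics = diagnostic_metrics(tp=40, fp=10, fn=10, tn=90)
```

Unlike sensitivity and specificity, the predictive values depend on how common the event is in the sample, so they do not transfer directly to populations with a different prevalence.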
Scanavino, Marco de T; Ventuneac, Ana; Rendina, H Jonathon; Abdo, Carmita H N; Tavares, Hermano; Amaral, Maria L S do; Messina, Bruna; Reis, Sirlene C dos; Martins, João P L B; Gordon, Marina C; Vieira, Julie C; Parsons, Jeffrey T
2016-01-01
Epidemiological, behavioral, and clinical data on sexual compulsivity in Brazil are very limited. This study sought to adapt and validate the Sexual Compulsivity Scale (SCS), the 22-item version of the Compulsive Sexual Behavior Inventory (CSBI-22), and the Hypersexual Disorder Screening Inventory (HDSI) for use in Brazil. A total of 153 participants underwent psychiatric assessment and completed self-reported measures. The adaptation process of the instruments from English to Portuguese followed the guidelines of the International Society for Pharmacoeconomics and Outcomes Research. The reliability and validity of the HDSI criteria were evaluated and the construct validity of all measures was examined. For the SCS and HDSI, factor analysis revealed one factor for each measure. For the CSBI-22, four factors were retained although we only calculated the scores of two factors (control and violence). All scores had good internal consistency (alpha >.75), presented high temporal stability (>.76), discriminated between patients and controls, and presented strong (ρ > .81) correlations with the Sexual Addiction Screening Test (except for the violence domain = .40) and moderate correlations with the Impulsive Sensation Seeking domain of the Zuckerman Kuhlman Personality Questionnaire (ρ between .43 and .55). The sensitivity of the HDSI was 71.93 % and the specificity was 100 %. All measures showed very good psychometric properties. The SCS, the HDSI, and the control domain of the CSBI-22 seemed to measure theoretically similar constructs, as they were highly correlated (ρ > .85). The findings support the conceptualization of hypersexuality as a cluster of problematic symptoms that are highly consistent across a variety of measures.
ERIC Educational Resources Information Center
Omizo, Michael M.; And Others
1983-01-01
Construct validity data found some support for the California Occupational Preference System constructs when its results were evaluated on a sample of 213 female undergraduates relative to the Vocational Preference Inventory results. (PN)
Measurement Invariance of the UTAUT Constructs in the Caribbean
ERIC Educational Resources Information Center
Thomas, Troy D.; Singh, Lenandlar; Gaffar, Kemuel; Thakur, Dhanaraj; Jackman, Grace-Ann; Thomas, Michael; Gajraj, Roger; Allen, Claudine; Tooma, Keron
2014-01-01
This article employs confirmatory factor analysis to evaluate the factorial validity and the cross-national comparability of the UTAUT constructs with respect to mobile learning in higher education in four Caribbean countries. Except for the measurement of one factor, the UTAUT constructs exhibit adequate reliability and validity. Though full…
The Modified Cognitive Constructions Coding System: Reliability and Validity Assessments
ERIC Educational Resources Information Center
Moran, Galia S.; Diamond, Gary M.
2006-01-01
The cognitive constructions coding system (CCCS) was designed for coding client's expressed problem constructions on four dimensions: intrapersonal-interpersonal, internal-external, responsible-not responsible, and linear-circular. This study introduces, and examines the reliability and validity of, a modified version of the CCCS--a version that…
Utility of pedometers for assessing physical activity: construct validity.
Tudor-Locke, Catrine; Williams, Joel E; Reis, Jared P; Pluto, Delores
2004-01-01
Valid assessment of physical activity is necessary to fully understand this important health-related behaviour for research, surveillance, intervention and evaluation purposes. This article is the second in a companion set exploring the validity of pedometer-assessed physical activity. The previous article published in Sports Medicine dealt with convergent validity (i.e. the extent to which an instrument's output is associated with that of other instruments intended to measure the same exposure of interest). The present focus is on construct validity. Construct validity is the extent to which the measurement corresponds with other measures of theoretically-related parameters. Construct validity is typically evaluated by correlational analysis, that is, the magnitude of concordance between two measures (e.g. pedometer-determined steps/day and a theoretically-related parameter such as age, anthropometric measures and fitness). A systematic literature review produced 29 articles published since 1980 directly relevant to construct validity of pedometers in relation to age, anthropometric measures and fitness. Reported correlations were combined and a median r-value was computed. Overall, there was a weak inverse relationship (median r = -0.21) between age and pedometer-determined physical activity. A weak inverse relationship was also apparent with both body mass index and percentage overweight (median r = -0.27 and r = -0.22, respectively). Positive relationships regarding indicators of fitness ranged from weak to moderate depending on the fitness measure utilised: 6-minute walk test (median r = 0.69), timed treadmill test (median r = 0.41) and estimated maximum oxygen uptake (median r = 0.22). Studies are warranted to assess the relationship of pedometer-determined physical activity with other important health-related outcomes including blood pressure and physiological parameters such as blood glucose and lipid profiles. The aggregated evidence of convergent validity (presented in the previous companion article) and construct validity herein provides support for considering simple and inexpensive pedometers in both research and practice.
Construct Validation of the Behavior and Instructional Management Scale
ERIC Educational Resources Information Center
Martin, Nancy K.; Sass, Daniel A.
2010-01-01
Beliefs related to classroom management vary among teachers and play an important role in classrooms. Despite the importance of this construct, valid measures have proven difficult to develop. This study evaluated the psychometric properties of the Behavior and Instructional Management Scale (BIMS), a short but valid measure of teachers'…
Development and Construct Validation of the Mentor Behavior Scale
ERIC Educational Resources Information Center
Brodeur, Pascale; Larose, Simon; Tarabulsy, George; Feng, Bei; Forget-Dubois, Nadine
2015-01-01
Researchers suggest that certain supportive behaviors of mentors could increase the benefits of school-based mentoring for youth. However, the literature contains few validated instruments to measure these behaviors. In our present study, we aimed to construct and validate a tool to measure the supportive behaviors of mentors participating in…
ERIC Educational Resources Information Center
Maiano, Christophe; Begarie, Jerome; Morin, Alexandre J. S.; Garbarino, Jean-Marie; Ninot, Gregory
2010-01-01
The purpose of this study was to test the reliability (i.e. internal consistency and test-retest reliability) and construct validity (i.e. content validity, factor validity, measurement invariance, and latent mean invariance) of the Nutrition and Activity Knowledge Scale (NAKS) in a sample of French adolescents with mild to moderate Intellectual…
Sirimanna, Pramudith; Gladman, Marc A
2017-10-01
Proficiency-based virtual reality (VR) training curricula improve intraoperative performance, but have not been developed for laparoscopic appendicectomy (LA). This study aimed to develop an evidence-based training curriculum for LA. A total of 10 experienced (>50 LAs), eight intermediate (10-30 LAs) and 20 inexperienced (<10 LAs) operators performed guided and unguided LA tasks on a high-fidelity VR simulator using internationally relevant techniques. The ability to differentiate levels of experience (construct validity) was measured using simulator-derived metrics. Learning curves were analysed. Proficiency benchmarks were defined by the performance of the experienced group. Intermediate and experienced participants completed a questionnaire to evaluate the realism (face validity) and relevance (content validity). Of 18 surgeons, 16 (89%) considered the VR model to be visually realistic and 17 (95%) believed that it was representative of actual practice. All 'guided' modules demonstrated construct validity (P < 0.05), with learning curves that plateaued between sessions 6 and 9 (P < 0.01). When comparing inexperienced to intermediates to experienced, the 'unguided' LA module demonstrated construct validity for economy of motion (5.00 versus 7.17 versus 7.84, respectively; P < 0.01) and task time (864.5 s versus 477.2 s versus 352.1 s, respectively, P < 0.01). Construct validity was also confirmed for number of movements, path length and idle time. Validated modules were used for curriculum construction, with proficiency benchmarks used as performance goals. A VR LA model was realistic and representative of actual practice and was validated as a training and assessment tool. Consequently, the first evidence-based internationally applicable training curriculum for LA was constructed, which facilitates skill acquisition to proficiency. © 2017 Royal Australasian College of Surgeons.
Yen, Po-Yin; Sousa, Karen H; Bakken, Suzanne
2014-01-01
Background: In a previous study, we developed the Health Information Technology Usability Evaluation Scale (Health-ITUES), which is designed to support customization at the item level. Such customization matches the specific tasks/expectations of a health IT system while retaining comparability at the construct level, and provides evidence of its factorial validity and internal consistency reliability through exploratory factor analysis. Objective: In this study, we advanced the development of Health-ITUES to examine its construct validity and predictive validity. Methods: The health IT system studied was a web-based communication system that supported nurse staffing and scheduling. Using Health-ITUES, we conducted a cross-sectional study to evaluate users’ perception toward the web-based communication system after system implementation. We examined Health-ITUES's construct validity through first and second order confirmatory factor analysis (CFA), and its predictive validity via structural equation modeling (SEM). Results: The sample comprised 541 staff nurses in two healthcare organizations. The CFA (n=165) showed that a general usability factor accounted for 78.1%, 93.4%, 51.0%, and 39.9% of the explained variance in ‘Quality of Work Life’, ‘Perceived Usefulness’, ‘Perceived Ease of Use’, and ‘User Control’, respectively. The SEM (n=541) supported the predictive validity of Health-ITUES, explaining 64% of the variance in intention for system use. Conclusions: The results of CFA and SEM provide additional evidence for the construct and predictive validity of Health-ITUES. The customizability of Health-ITUES has the potential to support comparisons at the construct level, while allowing variation at the item level. We also illustrate application of Health-ITUES across stages of system development. PMID:24567081
[Validation of the German version of the Oxford Elbow Score : A cross-sectional study].
Marquardt, J; Schöttker-Königer, T; Schäfer, A
2016-08-01
Elbow complaints are complex problems leading to severe consequences for affected people and the healthcare system. The German version of the Oxford Elbow Score (OES) is the first German-language instrument that specifically measures elbow complaints and changes in health status from the patient's perspective. The aim of this study is the validation of the German version of the OES. In this context the internal consistency and the construct validity were investigated. A total of 59 patients with elbow complaints completed the German version of the OES, the DASH and the SF-36 in a cross-sectional study. The internal consistency was calculated with Cronbach's alpha coefficients. Spearman's correlation coefficients were used to confirm construct validity. Cronbach's alpha for pain, function and psychological subscales was 0.88, 0.81 and 0.90, respectively. The whole questionnaire presents a Cronbach's alpha value of 0.93. Convergent construct validity was confirmed with correlation coefficients of -0.84, -0.77 and -0.82 compared to DASH and values ranging from 0.41 to 0.80 compared with the physical domains of the SF-36. The divergent construct validity presented values ranging from 0.07 to 0.20 with the SF-36 domains of "general health perception" and "mental health". The German OES is an internally consistent instrument with good convergent and divergent construct validity. Other aspects of the validity, the reliability and the responsiveness should be confirmed through further studies.
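As an illustration of the internal-consistency statistic reported above, the following is a minimal sketch of Cronbach's alpha using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for items and for total scores.
    """
    k = len(scores[0])
    items = list(zip(*scores))  # transpose: one tuple of scores per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical toy data: 4 respondents, 3 perfectly consistent items.
toy_scores = [[1, 2, 3], [2, 3, 4], [3, 4, 5], [4, 5, 6]]
alpha = cronbach_alpha(toy_scores)
print(f"alpha = {alpha:.2f}")
```

With real questionnaire data the items are never perfectly consistent, so alpha falls below 1; values such as the 0.81 to 0.93 reported in the abstract are conventionally read as good internal consistency.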
Weiss, Maureen R; Bolter, Nicole D; Kipp, Lindsay E
2014-09-01
A signature characteristic of positive youth development (PYD) programs is the opportunity to develop life skills, such as social, behavioral, and moral competencies, that can be generalized to domains beyond the immediate activity. Although context-specific instruments are available to assess developmental outcomes, a measure of life skills transfer would enable evaluation of PYD programs in successfully teaching skills that youth report using in other domains. The purpose of our studies was to develop and validate a measure of perceived life skills transfer, based on data collected with The First Tee, a physical activity-based PYD program. In 3 studies, we conducted a series of steps to provide content and construct validity and internal consistency reliability for the Life Skills Transfer Survey (LSTS), a measure of perceived life skills transfer. Study 1 provided content validity for the LSTS that included 8 life skills and 50 items. Study 2 revealed construct validity (structural validity) through a confirmatory factor analysis and convergent validity by correlating scores on the LSTS with scores on an assessment tool that measures a related construct. Study 3 offered additional construct validity by reassessing youth 1 year later and showing that scores during both time periods were invariant in factor pattern, loadings, and variances and covariances. Studies 2 and 3 demonstrated internal consistency reliability of the LSTS. Results from 3 studies provide evidence of content and construct validity and internal consistency reliability for the LSTS, which can be used in evaluation research with youth development programs.
Predicting the risk of toxic blooms of golden alga from cell abundance and environmental covariates
Patino, Reynaldo; VanLandeghem, Matthew M.; Denny, Shawn
2016-01-01
Golden alga (Prymnesium parvum) is a toxic haptophyte that has caused considerable ecological damage to marine and inland aquatic ecosystems worldwide. Studies focused primarily on laboratory cultures have indicated that toxicity is poorly correlated with the abundance of golden alga cells. This relationship, however, has not been rigorously evaluated in the field where environmental conditions are much different. The ability to predict toxicity using readily measured environmental variables and golden alga abundance would allow managers rapid assessments of ichthyotoxicity potential without laboratory bioassay confirmation, which requires additional resources to accomplish. To assess the potential utility of these relationships, several a priori models relating lethal levels of golden alga ichthyotoxicity to golden alga abundance and environmental covariates were constructed. Model parameters were estimated using archived data from four river basins in Texas and New Mexico (Colorado, Brazos, Red, Pecos). Model predictive ability was quantified using cross-validation, sensitivity, and specificity, and the relative ranking of environmental covariate models was determined by Akaike Information Criterion values and Akaike weights. Overall, abundance was a generally good predictor of ichthyotoxicity as cross validation of golden alga abundance-only models ranged from ∼ 80% to ∼ 90% (leave-one-out cross-validation). Environmental covariates improved predictions, especially the ability to predict lethally toxic events (i.e., increased sensitivity), and top-ranked environmental covariate models differed among the four basins. These associations may be useful for monitoring as well as understanding the abiotic factors that influence toxicity during blooms.
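The model ranking by Akaike weights mentioned above can be sketched as follows. This is a generic illustration of the standard formula w_i = exp(-delta_i / 2) / sum_j exp(-delta_j / 2), where delta_i is each model's AIC minus the smallest AIC; the AIC values used here are hypothetical and are not the study's results.

```python
import math


def akaike_weights(aic_values):
    """Return the Akaike weight of each candidate model given its AIC."""
    best = min(aic_values)
    # Relative likelihood of each model compared with the best one.
    rel = [math.exp(-(aic - best) / 2) for aic in aic_values]
    total = sum(rel)
    return [r / total for r in rel]


# Hypothetical AIC values for three candidate covariate models.
weights = akaike_weights([100.0, 102.0, 110.0])
for i, w in enumerate(weights, start=1):
    print(f"model {i}: weight = {w:.3f}")
```

The weights sum to 1 and can be read as the relative support for each model within the candidate set, which is why they are convenient for ranking models separately within each basin.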
Yu, Zhangbin; Han, Shuping; Wu, Jinxia; Li, Mingxia; Wang, Huaiyan; Wang, Jimei; Liu, Jiebo; Pan, Xinnian; Yang, Jie; Chen, Chao
2014-01-01
To prospectively validate a previously constructed transcutaneous bilirubin (TcB) nomogram for identifying severe hyperbilirubinemia in healthy Chinese term and late-preterm infants. This was a multicenter study that included 9,174 healthy term and late-preterm infants in eight hospitals of China. TcB measurements were performed using a JM-103 bilirubinometer. TcB values were plotted on a previously developed TcB nomogram, to identify the predictive ability for subsequent significant hyperbilirubinemia. In the present study, 972 neonates (10.6%) developed significant hyperbilirubinemia. The 40th percentile of the nomogram could identify all neonates who were at risk of significant hyperbilirubinemia, but with a low positive predictive value (PPV) (18.9%). Of the 453 neonates above the 95th percentile, 275 subsequently developed significant hyperbilirubinemia, with a high PPV (60.7%), but with low sensitivity (28.3%). The 75th percentile was highly specific (81.9%) and moderately sensitive (79.8%). The area under the curve (AUC) for the TcB nomogram was 0.875. This study validated the previously developed TcB nomogram, which could be used to predict subsequent significant hyperbilirubinemia in healthy Chinese term and late-preterm infants. However, combining TcB nomogram and clinical risk factors could improve the predictive accuracy for severe hyperbilirubinemia, which was not assessed in the study. Further studies are necessary to confirm this combination. Copyright © 2014 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.
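The PPV and sensitivity quoted for the 95th percentile cut-off follow directly from the counts given in the abstract (453 neonates above the cut-off, 275 of them true cases, 972 cases overall). A minimal sketch of that arithmetic, with hypothetical function and variable names:

```python
def ppv(true_pos, false_pos):
    """Positive predictive value: share of test-positives that are true cases."""
    return true_pos / (true_pos + false_pos)


def sensitivity(true_pos, false_neg):
    """Sensitivity: share of true cases that the test flags as positive."""
    return true_pos / (true_pos + false_neg)


# Counts taken from the abstract.
above_cutoff = 453   # neonates above the 95th percentile
true_pos = 275       # of those, later developed significant hyperbilirubinemia
total_cases = 972    # all neonates who developed significant hyperbilirubinemia

false_pos = above_cutoff - true_pos   # flagged but did not develop the condition
false_neg = total_cases - true_pos    # cases at or below the cut-off

ppv_pct = round(100 * ppv(true_pos, false_pos), 1)
sens_pct = round(100 * sensitivity(true_pos, false_neg), 1)
print(f"PPV = {ppv_pct}%, sensitivity = {sens_pct}%")
```

Both values reproduce the 60.7% and 28.3% reported in the abstract.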
Rice, Simon M; Ogrodniczuk, John S; Kealy, David; Seidler, Zac E; Dhillon, Haryana M; Oliffe, John L
2017-12-22
Clinical practice and literature have supported the existence of a phenotypic sub-type of depression in men. While a number of self-report rating scales have been developed in order to empirically test the male depression construct, psychometric validation of these scales is limited. To confirm the psychometric properties of the multidimensional Male Depression Risk Scale (MDRS-22) and to develop clinical cut-off scores for the MDRS-22. Data were obtained from an online sample of 1000 Canadian men (median age (M) = 49.63, standard deviation (SD) = 14.60). Confirmatory factor analysis (CFA) was used to replicate the established six-factor model of the MDRS-22. Psychometric values of the MDRS subscales were comparable to the widely used Patient Health Questionnaire-9. CFA model fit indices indicated adequate model fit for the six-factor MDRS-22 model. ROC curve analysis indicated the MDRS-22 was effective for identifying those with a recent (previous four-weeks) suicide attempt (area under curve (AUC) values = 0.837). The MDRS-22 cut-off identified proportionally more (84.62%) cases of recent suicide attempt relative to the PHQ-9 moderate range (53.85%). The MDRS-22 is the first male-sensitive depression scale to be psychometrically validated using CFA techniques in independent and cross-nation samples. Additional studies should identify differential item functioning and evaluate cross-cultural effects.
Twomey, Michèle; Wallis, Lee A; Myers, Jonathan E
2014-07-01
To evaluate the construct of triage acuity as measured by the South African Triage Scale (SATS) against a set of reference vignettes. A modified Delphi method was used to develop a set of reference vignettes. Delphi participants completed a 2-round consensus-building process, and independently assigned triage acuity ratings to 100 written vignettes unaware of the ratings given by others. Triage acuity ratings were summarised for all vignettes, and only those that reached 80% consensus during round 2 were included in the reference set. Triage ratings for the reference vignettes given by two independent experts using the SATS were compared with the ratings given by the international Delphi panel. Measures of sensitivity, specificity, associated percentages for over-triage/under-triage were used to evaluate the construct of triage acuity (as measured by the SATS) by examining the association between the ratings by the two experts and the international panel. On completion of the Delphi process, 42 of the 100 vignettes reached 80% consensus on their acuity rating and made up the reference set. On average, over all acuity levels, sensitivity was 74% (CI 64% to 82%), specificity 92% (CI 87% to 94%), under-triage occurred 14% (CI 8% to 23%) and over-triage 12% (CI 8% to 23%) of the time. The results of this study provide an alternative to evaluating triage scales against the construct of acuity as measured with the SATS. This method of using 80% consensus vignettes may, however, systematically bias the validity estimate towards better performance. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Photoacoustic Spectroscopy Analysis of Traditional Chinese Medicine
NASA Astrophysics Data System (ADS)
Chen, Lu; Zhao, Bin-xing; Xiao, Hong-tao; Tong, Rong-sheng; Gao, Chun-ming
2013-09-01
Chinese medicine is a historic cultural legacy of China. It has made a significant contribution to medicine and healthcare for generations. The development of Chinese herbal medicine analysis is emphasized by the Chinese pharmaceutical industry. This study carried out an experimental analysis of ten kinds of Chinese herbal powder, including Fritillaria powder, based on the photoacoustic spectroscopy (PAS) method. First, a photoacoustic spectroscopy system was designed and constructed; in particular, a highly sensitive solid photoacoustic cell was built. Second, the experimental setup was verified through the characteristic emission spectrum of the light source, obtained by using carbon as a sample in the photoacoustic cell. Finally, once the photoacoustic spectroscopy analysis of Fritillaria and the other powders was completed, the specificity of the Chinese herbal medicine analysis was verified. This study shows that PAS can provide a valid, highly sensitive analytical method for the specificity of Chinese herbal medicine without preparing and damaging samples.
A Transcription and Translation Protocol for Sensitive Cross-Cultural Team Research.
Clark, Lauren; Birkhead, Ana Sanchez; Fernandez, Cecilia; Egger, Marlene J
2017-10-01
Assurance of transcript accuracy and quality in interview-based qualitative research is foundational for data accuracy and study validity. Based on our experience in a cross-cultural ethnographic study of women's pelvic organ prolapse, we provide practical guidance to set up step-by-step interview transcription and translation protocols for team-based research on sensitive topics. Beginning with team decisions about level of detail in transcription, completeness, and accuracy, we operationalize the process of securing vendors to deliver the required quality of transcription and translation. We also share rubrics for assessing transcript quality and the team protocol for managing transcripts (assuring consistency of format, insertion of metadata, anonymization, and file labeling conventions) and procuring an acceptable initial translation of Spanish-language interviews. Accurate, complete, and systematically constructed transcripts in both source and target languages respond to the call for more transparency and reproducibility of scientific methods.
Laser interferometer for space-based mapping of Earth's gravity field
NASA Astrophysics Data System (ADS)
Dehne, Marina; Sheard, Benjamin; Gerberding, Oliver; Mahrdt, Christoph; Heinzel, Gerhard; Danzmann, Karsten
2010-05-01
Laser interferometry will play a key role in the next generation of GRACE-type satellite gravity missions. The measurement concepts for future missions include a heterodyne laser interferometer. Furthermore, it is favourable to use polarising components in the laser interferometer for beam splitting. In the first step the influence of these components on the interferometer sensitivity has been investigated. Additionally, a length stability on a nm-scale has been validated. The next step will include a performance test of an interferometric SST system in an active symmetric transponder setup including two lasers and two optical benches. The design and construction of a quasi-monolithic interferometer for comparing the interferometric performance of non-polarising and polarising optics will be discussed. The results of the interferometric readout of a heterodyne configuration together with polarising optics will be presented to fulfil the phase sensitivity requirement of 1 nm/√Hz for a typical SSI scenario.
Niméus, A; Hjalmarsson Ståhlfors, F; Sunnqvist, C; Stanley, B; Träskman-Bendz, L
2006-10-01
The Suicide Assessment Scale (SUAS) was constructed to be sensitive to change of suicidality. It was recently found to be predictive of suicide in a group of suicide attempters. The aim of the present study was to evaluate the reliability and validity of a modified interview version of SUAS with defined scores and also a new self-rating version (SUAS-S). The subjects consisted of former inpatients, 42 persons who had been admitted because of a suicide attempt about 12 years ago and 22 control patients. The subjects were rated according to the SUAS, the SUAS-S, as well as the Montgomery Asberg Depression Rating Scale (MADRS). The interrater reliability was found to be high. The SUAS correlated significantly with the MADRS, but the concordance was not consistent, which indicates that the SUAS measures something different from depression. The SUAS-S correlated significantly with the interview-rated SUAS, thus exhibiting good concurrent validity. In summary, both the modified interview version of SUAS and the SUAS-S seem to be valid, reliable and easily used suicide assessment instruments.
The Construct Validity of Language Aptitude: A Meta-Analysis
ERIC Educational Resources Information Center
Li, Shaofeng
2016-01-01
A meta-analysis was conducted to examine the construct validity of language aptitude by synthesizing the existing research that has been accumulated over the past five decades. The study aimed to provide a thorough understanding of the construct by aggregating the data reported in the primary research on its correlations with other individual…
ERIC Educational Resources Information Center
Martin, Andrew J.
2009-01-01
From a developmental construct validity perspective, this study examines motivation and engagement across elementary school, high school, and university/college, with particular focus on the Motivation and Engagement Scale (comprising adaptive, impeding/maladaptive, and maladaptive factors). Findings demonstrated developmental construct validity…
Selection of Marine Corps Drill Instructors
1980-03-01
Key-Construction and Cross-Validation Statistics for Drill Instructor School Performance Success Keys... Race, and School Attrition... Key-Construction and Cross-Validation Statistics for Drill... constructed form, the Alternation Ranking of Series Drill Instructors. In this form, DIs in a Series are ranked from highest to lowest in terms of their
Nemoto, Hitoshi; Watson, Deborah; Masuda, Koichi
2015-01-01
Tissue engineering holds great promise for cartilage repair with minimal donor-site morbidity. The in vivo maturation of a tissue-engineered construct can be tested in the subcutaneous tissues of the same species for autografts or of immunocompromised animals for allografts or xenografts. This section describes detailed protocols for the surgical transplantation of a tissue-engineered construct into an animal model to assess construct validity.
ERIC Educational Resources Information Center
Ercikan, Kadriye; Oliveri, María Elena
2016-01-01
Assessing complex constructs such as those discussed under the umbrella of 21st century constructs highlights the need for a principled assessment design and validation approach. In our discussion, we made a case for three considerations: (a) taking construct complexity into account across various stages of assessment development such as the…
ERIC Educational Resources Information Center
Koskey, Kristin L. K.; Sondergeld, Toni A.; Stewart, Victoria C.; Pugh, Kevin J.
2018-01-01
Onwuegbuzie and colleagues proposed the Instrument Development and Construct Validation (IDCV) process as a mixed methods framework for creating and validating measures. Examples applying IDCV are lacking. We provide an illustrative case integrating the Rasch model and cognitive interviews applied to the development of the Transformative…
Validity of Childhood Career Development Scale Scores in South Africa
ERIC Educational Resources Information Center
Stead, Graham B.; Schultheiss, Donna E. Palladino
2010-01-01
The purpose of this study was to provide evidence of the construct and concurrent validity of the Childhood Career Development Scale's (CCDS) scores among South African primary school children. Using a sample of 808 children in grades four through seven, evidence for the CCDS's construct validity was provided using confirmatory factor analysis,…
ERIC Educational Resources Information Center
St. Louis, Kenneth O.; Reichel, Isabella K.; Yaruss, J. Scott; Lubker, Bobbie Boyd
2009-01-01
Purpose: Construct validity and concurrent validity were investigated in a prototype survey instrument, the "Public Opinion Survey of Human Attributes-Experimental Edition" (POSHA-E). The POSHA-E was designed to measure public attitudes toward stuttering within the context of eight other attributes, or "anchors," assumed to range from negative…
Development and Validation of an Observation System for Analyzing Teaching Roles.
ERIC Educational Resources Information Center
Southwell, Reba K.; Webb, Jeaninne N.
The construction and validation of a theoretically based sign system for the analysis of teaching roles in childhood education is described. A theoretical and empirical approach to validation were developed. In the first, the general concept of teacher role was identified as a viable construct for investigating characteristic patterns of classroom…
The Construct of the Learning Organization: Dimensions, Measurement, and Validation
ERIC Educational Resources Information Center
Yang, Baiyin; Watkins, Karen E.; Marsick, Victoria J.
2004-01-01
This research describes efforts to develop and validate a multidimensional measure of the learning organization. An instrument was developed based on a critical review of both the conceptualization and practice of this construct. Supporting validity evidence for the instrument was obtained from several sources, including best model-data fit among…
Multinational Validation of the Spanish Bracken Basic Concept Scale for Cross-Cultural Assessments.
ERIC Educational Resources Information Center
Bracken, Bruce A.; And Others
1990-01-01
Investigated construct validity of the Spanish translation of the Bracken Basic Concept Scale (BBCS) in Latino children (n=293) including monolingual Spanish-speaking children from Puerto Rico and Venezuela and Spanish-dominant bilingual Latino children from Texas. Results provided support for construct validity of the Spanish version of the…
Mobile Phone Use in a Developing Country: A Malaysian Empirical Study
ERIC Educational Resources Information Center
Yeow, Paul H. P.; Yen Yuen, Yee; Connolly, Regina
2008-01-01
This study examined the factors that influence consumer satisfaction with mobile telephone use in Malaysia. The validity of the study's constructs, criterion, and content was confirmed. Construct validity was verified through the factor analysis with a total variance of 73.72 percent explained by all six independent factors. Content validity was…
Davis, Sarah K; Wigelsworth, Michael
2018-01-01
Emotional intelligence (EI) is a popular construct with concentrated areas of application in education and health contexts. There is a need for reliable and valid measurement of EI in young people, with brief yet sensitive measures of the construct preferable for use in time-limited settings. However, the proliferation of EI measures has often outpaced rigorous psychometric evaluation (Gignac, 2009). Using data from 849 adolescents (407 females, 422 males) aged 11 to 16 years (M age 13.4, SD = 1.2 years), this article systematically examines the structural and predictive properties of a frequently employed measure of adolescent trait EI, the Emotional Quotient Inventory Youth Version-Short Form (EQ-i:YV[S]; Bar-On & Parker, 2000). Although the intended multidimensional factor structure was recovered through confirmatory factor analysis, the statistical and conceptual coherency of the underlying model was inadequate. Using a multitrait-multimethod approach, the EQ-i:YV(S) was found to converge with other measures of EI; however, evidence for divergent validity (Big Five personality dimensions) was less robust. Predictive utility for adolescent mental health outcomes (depression, disruptive behavior) was also limited. Findings suggest that use of the EQ-i:YV(S) for predictive or evaluative purposes should be avoided until refinements to the scale are made.
Measuring the development of inhibitory control: The challenge of heterotypic continuity
Petersen, Isaac T.; Hoyniak, Caroline P.; McQuillan, Maureen E.; Bates, John E.; Staples, Angela D.
2016-01-01
Inhibitory control is thought to demonstrate heterotypic continuity, in other words, continuity in its purpose or function but changes in its behavioral manifestation over time. This creates major methodological challenges for studying the development of inhibitory control in childhood including construct validity, developmental appropriateness and sensitivity of measures, and longitudinal factorial invariance. We meta-analyzed 198 studies using measures of inhibitory control, a key aspect of self-regulation, to estimate age ranges of usefulness for each measure. The inhibitory control measures showed limited age ranges of usefulness owing to ceiling/floor effects. Tasks were useful, on average, for a developmental span of less than 3 years. This suggests that measuring inhibitory control over longer spans of development may require use of different measures at different time points, seeking to measure heterotypic continuity. We suggest ways to study the development of inhibitory control, with overlapping measurement in a structural equation modeling framework and tests of longitudinal factorial or measurement invariance. However, as valuable as this would be for the area, we also point out that establishing longitudinal factorial invariance is neither sufficient nor necessary for examining developmental change. Any study of developmental change should be guided by theory and construct validity, aiming toward a better empirical and theoretical approach to the selection and combination of measures. PMID:27346906
Attentional Bias for Reward and Punishment in Overweight and Obesity: The TRAILS Study
Glashouwer, Klaske A.; Ostafin, Brian D.; van Hemel-Ruiter, Madelon E.; Smink, Frédérique R. E.; Hoek, Hans W.; de Jong, Peter J.
2016-01-01
More than 80% of obese adolescents will become obese adults, and it is therefore important to enhance insight into characteristics that underlie the development and maintenance of overweight and obesity at a young age. The current study is the first to focus on attentional biases towards rewarding and punishing cues as potentially important factors. Participants were young adolescents (N = 607) who were followed from the age of 13 until the age of 19, and completed a motivational game indexing the attentional bias to general cues of reward and punishment. Additionally, self-reported reward and punishment sensitivity was measured. This study showed that attentional biases to cues that signal reward or punishment and self-reported reward and punishment sensitivity were not related to body mass index or the change in body mass index over six years in adolescents. Thus, attentional bias to cues of reward and cues of punishment, and self-reported reward and punishment sensitivity, do not seem to be crucial factors in the development and maintenance of overweight and obesity in adolescents. Exploratory analyses of the current study suggest that the amount of effort to gain reward and to avoid punishment may play a role in the development and maintenance of overweight and obesity. However, since the effort measure was a construct based on face validity and has not been properly validated, more studies are necessary before firm conclusions can be drawn. PMID:27391017
Skritskaya, Natalia A; Carson-Wong, Amanda R; Moeller, James R; Shen, Sa; Barsky, Arthur J; Fallon, Brian A
2012-07-01
Clinician-administered measures to assess severity of illness anxiety and response to treatment are few. The authors evaluated a modified version of the hypochondriasis-Y-BOCS (H-YBOCS-M), a 19-item, semistructured, clinician-administered instrument designed to rate severity of illness-related thoughts, behaviors, and avoidance. The scale was administered to 195 treatment-seeking adults with DSM-IV hypochondriasis. Test-retest reliability was assessed in a subsample of 20 patients. Interrater reliability was assessed by 27 interviews independently rated by four raters. Sensitivity to change was evaluated in a subsample of 149 patients. Convergent and discriminant validity was examined by comparing H-YBOCS-M scores to other measures administered. Item clustering was examined with confirmatory and exploratory factor analyses. The H-YBOCS-M demonstrated good internal consistency, interrater and test-retest reliability, and sensitivity to symptom change with treatment. Construct validity was supported by significantly higher correlations with scores on other measures of hypochondriasis than with nonhypochondriacal measures. Improvement over time in response to treatment correlated with improvement both on measures of hypochondriasis and on measures of somatization, depression, anxiety, and functional status. Confirmatory factor analysis did not show adequate fit for a three-factor model. Exploratory factor analysis revealed a five-factor solution with the first two factors consistent with the separation of the H-YBOCS-M items into the subscales of illness-related avoidance and compulsions. H-YBOCS-M appears to be valid, reliable, and appropriate as an outcome measure for treatment studies of illness anxiety. Study results highlight "avoidance" as a key feature of illness anxiety, with potentially important nosologic and treatment implications. © 2012 Wiley Periodicals, Inc.
Chin, Kelly M; Gomberg-Maitland, Mardi; Channick, Richard N; Cuttica, Michael J; Fischer, Aryeh; Frantz, Robert P; Hunsche, Elke; Kleinman, Leah; McConnell, John W; McLaughlin, Vallerie V; Miller, Chad E; Zamanian, Roham T; Zastrow, Michael S; Badesch, David B
2018-04-26
Disease-specific patient-reported outcome (PRO) instruments are important in assessing the impact of disease and treatment. PAH-SYMPACT® is the first questionnaire for quantifying pulmonary arterial hypertension (PAH) symptoms and impacts developed following the 2009 FDA PRO guidance; previous qualitative research with PAH patients supported its initial content validity. Content finalization and psychometric validation were conducted using data from SYMPHONY, a single-arm, 16-week study with macitentan 10 mg in US patients with PAH. Item performance, Rasch, and factor analyses were used to select final item content of the PRO and define its domain structure. Internal consistency, test-retest reliability, known-group and construct validity, sensitivity to change, and influence of oxygen on item performance were evaluated. Data from 278 patients (79% female, mean age 60 years) were analyzed. Following removal of redundant/misfitting items, the final questionnaire has 11 symptom items across 2 domains (cardiopulmonary and cardiovascular symptoms) and 11 impact items across 2 domains (physical and cognitive/emotional impacts). Differential item function analysis confirmed PRO scoring is unaffected by oxygen use. For all 4 domains, internal consistency reliability was high (Cronbach's alpha >0.80) and scores were highly reproducible in stable patients (intra-class correlation coefficient 0.84-0.94). Correlations with CAMPHOR and SF-36 were moderate-to-high (|r| = 0.34-0.80). The questionnaire differentiated well between patients with different disease severity levels, and was sensitive to improvements in clinician- and patient-reported disease severity. The PAH-SYMPACT® is a brief, disease-specific PRO instrument possessing good psychometric properties which can be administered in clinical practice and clinical studies. Copyright © 2018. Published by Elsevier Inc.
Glauser, Gaétan; Grund, Baptiste; Gassner, Anne-Laure; Menin, Laure; Henry, Hugues; Bromirski, Maciej; Schütz, Frédéric; McMullen, Justin; Rochat, Bertrand
2016-03-15
A paradigm shift is underway in the field of quantitative liquid chromatography-mass spectrometry (LC-MS) analysis thanks to the arrival of recent high-resolution mass spectrometers (HRMS). The capability of HRMS to perform sensitive and reliable quantifications of a large variety of analytes in HR-full scan mode is showing that it is now realistic to perform quantitative and qualitative analysis with the same instrument. Moreover, HR-full scan acquisition offers a global view of sample extracts and allows retrospective investigations as virtually all ionized compounds are detected with a high sensitivity. In time, the versatility of HRMS together with the increasing need for relative quantification of hundreds of endogenous metabolites should promote a shift from triple-quadrupole MS to HRMS. However, a current "pitfall" in quantitative LC-HRMS analysis is the lack of HRMS-specific guidance for validated quantitative analyses. Indeed, false positive and false negative HRMS detections are rare, albeit possible, if inadequate parameters are used. Here, we investigated two key parameters for the validation of LC-HRMS quantitative analyses: the mass accuracy (MA) and the mass-extraction-window (MEW) that is used to construct the extracted-ion-chromatograms. We propose MA-parameters, graphs, and equations to calculate rational MEW width for the validation of quantitative LC-HRMS methods. MA measurements were performed on four different LC-HRMS platforms. Experimentally determined MEW values ranged between 5.6 and 16.5 ppm and depended on the HRMS platform, its working environment, the calibration procedure, and the analyte considered. The proposed procedure provides a fit-for-purpose MEW determination and prevents false detections.
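To make the two parameters discussed above concrete, the sketch below computes a mass accuracy in ppm and the m/z bounds of a mass-extraction window. It assumes the MEW is given as a total width in ppm centred on the theoretical m/z; this is an illustrative convention, not necessarily the authors' exact definition or their proposed equations, and the function names and numeric values are hypothetical.

```python
def mass_accuracy_ppm(measured_mz, theoretical_mz):
    """Mass accuracy (MA) as the relative m/z error in parts per million."""
    return (measured_mz - theoretical_mz) / theoretical_mz * 1e6


def extraction_window(theoretical_mz, mew_ppm):
    """Lower and upper m/z bounds of a window of total width mew_ppm.

    Assumption: the window is centred on the theoretical m/z, so each
    side extends by half of the total ppm width.
    """
    half_width = theoretical_mz * (mew_ppm / 2) * 1e-6
    return theoretical_mz - half_width, theoretical_mz + half_width


# Hypothetical analyte at m/z 500 measured at 500.001, with a 10 ppm window.
ma = mass_accuracy_ppm(500.001, 500.0)
low, high = extraction_window(500.0, 10.0)
print(f"MA = {ma:.2f} ppm, window = [{low:.4f}, {high:.4f}]")
```

A window narrower than the platform's real mass error risks cutting off part of the analyte signal (false negatives), while an overly wide window admits interfering ions, which is the trade-off a fit-for-purpose MEW aims to balance.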
Skritskaya, Natalia A.; Carson-Wong, Amanda R.; Moeller, James R.; Shen, Sa; Barsky, Arthur J.; Fallon, Brian A.
2012-01-01
Background Clinician-administered measures to assess severity of illness anxiety and response to treatment are few. The authors evaluated a modified version of the hypochondriasis-Y-BOCS (H-YBOCS-M), a 19-item, semistructured, clinician-administered instrument designed to rate severity of illness-related thoughts, behaviors, and avoidance. Methods The scale was administered to 195 treatment-seeking adults with DSM-IV hypochondriasis. Test–retest reliability was assessed in a subsample of 20 patients. Interrater reliability was assessed by 27 interviews independently rated by four raters. Sensitivity to change was evaluated in a subsample of 149 patients. Convergent and discriminant validity was examined by comparing H-YBOCS-M scores to other measures administered. Item clustering was examined with confirmatory and exploratory factor analyses. Results The H-YBOCS-M demonstrated good internal consistency, interrater and test–retest reliability, and sensitivity to symptom change with treatment. Construct validity was supported by significantly higher correlations with scores on other measures of hypochondriasis than with nonhypochondriacal measures. Improvement over time in response to treatment correlated with improvement both on measures of hypochondriasis and on measures of somatization, depression, anxiety, and functional status. Confirmatory factor analysis did not show adequate fit for a three-factor model. Exploratory factor analysis revealed a five-factor solution with the first two factors consistent with the separation of the H-YBOCS-M items into the subscales of illness-related avoidance and compulsions. Conclusions H-YBOCS-M appears to be valid, reliable, and appropriate as an outcome measure for treatment studies of illness anxiety. Study results highlight “avoidance” as a key feature of illness anxiety, with potentially important nosologic and treatment implications. PMID:22504935
Development and Initial Validation of the Self-Assessed Lupus Damage Index Questionnaire (LDIQ)
Costenbader, Karen H.; Khamashta, Munther; Ruiz-Garcia, Silvia; Perez-Rodriguez, Maria Teresa; Petri, Michelle; Elliott, Jennifer; Manzi, Susan; Karlson, Elizabeth W.; Turner-Stokes, Tabitha; Bermas, Bonnie; Coblyn, Jonathan; Massarotti, Elena; Schur, Peter; Fraser, Patricia; Navarro, Iris; Hanly, John G.; Shaver, Timothy S.; Katz, Robert S.; Chakravarty, Eliza; Fortin, Paul R.; Sanchez, Martha L.; Liu, Jigna; Michaud, Kaleb; Alarcón, Graciela S.; Wolfe, Frederick
2010-01-01
Purpose The SLICC Damage Index (SDI) is a validated instrument for assessing organ damage in systemic lupus erythematosus (SLE). Trained physicians must complete it, limiting utility where this is impossible. Methods We developed and pilot-tested a self-assessed organ damage instrument, the Lupus Damage Index Questionnaire (LDIQ), in 37 SLE subjects and 7 physicians. After refinement, 569 English-speaking SLE subjects and 14 rheumatologists from 11 international SLE clinics participated in validation. Subjects and physicians completed instruments separately. We calculated sensitivity, specificity, Spearman correlations and agreement, using the SDI as gold standard. 605 SLE participants in the community-based National Data Bank for Rheumatic Diseases (NDB) study completed the LDIQ and we assessed correlations with outcome and disability measures. Results Mean LDIQ score was 3.3 (0-16) and mean SDI score was 1.5 (0-9). LDIQ had a moderately high correlation with SDI (Spearman r=0.50, p<0.001). Specificities of individual LDIQ items were >80%, except for neuropathy. Sensitivities were variable and lowest for damage with <1% prevalence. Agreement between SDI and LDIQ was > 85% for all but neuropathy, reduced renal function, deforming arthritis and alopecia. In the NDB, LDIQ correlated well with comorbidity index (r=0.45), SF-36 physical component scale (0.43), Medical Research Council dyspnea scale (0.40), disability (0.37) and SLE Activity Questionnaire score (0.37). Conclusions The LDIQ’s metric properties are good compared to the SDI. It has construct validity and correlations with health assessments similar to the SDI. The LDIQ should allow expansion of SLE research. Its ultimate value will be determined in longitudinal studies. PMID:20391512
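Sensitivity, specificity and percent agreement against a gold standard, as reported above, all derive from a 2x2 table of paired yes/no ratings. The sketch below shows the arithmetic only; the function name and the example ratings are hypothetical and are not data from the study.

```python
def diagnostic_metrics(reference, test):
    """Compare binary test results with a binary reference standard.

    reference, test: equal-length sequences of truthy/falsy values,
    one pair per subject (reference = gold standard).
    """
    pairs = list(zip(reference, test))
    tp = sum(1 for r, t in pairs if r and t)          # true positives
    fn = sum(1 for r, t in pairs if r and not t)      # false negatives
    fp = sum(1 for r, t in pairs if not r and t)      # false positives
    tn = sum(1 for r, t in pairs if not r and not t)  # true negatives

    def ratio(num, den):
        # Undefined when the denominator is empty (e.g. no positives).
        return num / den if den else None

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
        "agreement": ratio(tp + tn, len(pairs)),
    }


if __name__ == "__main__":
    # Hypothetical ratings for ten subjects (1 = damage present).
    physician = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
    self_report = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
    print(diagnostic_metrics(physician, self_report))
```

Returning `None` for an empty denominator reflects why sensitivity estimates are unstable for very rare items: with few true positives, one misclassified subject changes the ratio substantially.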
NASA Astrophysics Data System (ADS)
Burke, Benjamin P.; Baghdadi, Neazar; Kownacka, Alicja E.; Nigam, Shubhanchi; Clemente, Gonçalo S.; Al-Yassiry, Mustafa M.; Domarkas, Juozas; Lorch, Mark; Pickles, Martin; Gibbs, Peter; Tripier, Raphaël; Cawthorne, Christopher; Archibald, Stephen J.
2015-09-01
The commercial availability of combined magnetic resonance imaging (MRI)/positron emission tomography (PET) scanners for clinical use has increased demand for easily prepared agents which offer signal or contrast in both modalities. Herein we describe a new class of silica coated iron-oxide nanorods (NRs) coated with polyethylene glycol (PEG) and/or a tetraazamacrocyclic chelator (DO3A). Studies of the coated NRs validate their composition and confirm their properties as in vivo T2 MRI contrast agents. Radiolabelling studies with the positron emitting radioisotope gallium-68 (t1/2 = 68 min) demonstrate that, in the presence of the silica coating, the macrocyclic chelator was not required for preparation of highly stable radiometal-NR constructs. In vivo PET-CT and MR imaging studies show the expected high liver uptake of gallium-68 radiolabelled nanorods with no significant release of gallium-68 metal ions, validating our innovation to provide a novel simple method for labelling of iron oxide NRs with a radiometal in the absence of a chelating unit that can be used for high sensitivity liver imaging. Electronic supplementary information (ESI) available. See DOI: 10.1039/c5nr02753e
Construct Validation of the Piers-Harris Children's Self Concept Scale.
ERIC Educational Resources Information Center
Franklin, Melvin R., Jr.; And Others
1981-01-01
Results indicated that the Piers-Harris Children's Self Concept Scale demonstrates both convergent and discriminant validity in an assessment of a relatively stable and internally consistent construct. (Author/BW)
Developing a measure of medication-related quality of life for people with polypharmacy.
Tseng, Hsu-Min; Lee, Chia-Hui; Chen, Yin-Jen; Hsu, Hsiang-Hao; Huang, Li-Yueh; Huang, Jing-Long
2016-05-01
To develop a measure of medication-related quality of life (MRQoL) and to validate the measure in a hospital-based population of patients with polypharmacy. The Medication-Related Quality of Life Scale version 1.0 (MRQoLS-v1.0) included 14 items developed on the basis of interviews with elderly patients with polypharmacy, defined as taking five or more medications simultaneously. This scale was tested in 219 outpatients (99 with polypharmacy and 120 without polypharmacy). Two measures were used to establish construct validity: the Psychological Distress Checklist, for convergent validity, and the Medication Adherence Behavior Scale (MABS), for discriminant validity. The 14-item scale was found to be both reliable and valid. Internal consistency reliability evaluated using Cronbach's alpha for this scale was 0.91. Scores on the MRQoLS-v1.0 correlated statistically significantly and negatively with those on the Psychological Distress Checklist. Discriminant validity was demonstrated by low correlation with MABS, indicating that the MRQoLS-v1.0 measured concepts different from medication adherence. Significant differences in the MRQoLS-v1.0 between patients with polypharmacy and those without polypharmacy provided evidence for known-group validity. The study presents a psychometric evaluation of a measure used to assess MRQoL of patients with polypharmacy. The instrument is practical to administer in clinics and provides a valuable adjunct to the outcome measurement for patients with polypharmacy. Further research on the sensitivity of this instrument to medication change in multi-medicated patients is warranted.
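Cronbach's alpha, the internal-consistency statistic reported above, compares the sum of the item variances with the variance of the total score. The sketch below applies the standard formula to a small hypothetical response matrix; the function name and the data are illustrative and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items score matrix.

    responses: list of rows, one row per respondent, one column per item.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(responses[0])  # number of items
    item_vars = [variance([row[i] for row in responses]) for i in range(k)]
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical answers of four respondents to a two-item scale.
    data = [[1, 2], [2, 1], [3, 4], [4, 3]]
    print(round(cronbach_alpha(data), 2))  # 0.75
```

The function needs at least two items and two respondents, and a total score that varies across respondents; otherwise the variances or the ratio are undefined.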
Sensitivity study and parameter optimization of OCD tool for 14nm finFET process
NASA Astrophysics Data System (ADS)
Zhang, Zhensheng; Chen, Huiping; Cheng, Shiqiu; Zhan, Yunkun; Huang, Kun; Shi, Yaoming; Xu, Yiping
2016-03-01
Optical critical dimension (OCD) measurement has been widely demonstrated as an essential metrology method for monitoring advanced IC processes at the 90 nm technology node and beyond. However, the rapidly shrinking critical dimensions of semiconductor devices and the increasing complexity of the manufacturing process bring more challenges to OCD. The measurement precision of OCD technology relies heavily on the optical hardware configuration, the spectral types, and the inherent interactions between the incident light and various materials with various topological structures; sensitivity analysis and parameter optimization are therefore critical in OCD applications. This paper presents a method for seeking the optimum sensitive measurement configuration to enhance the metrology precision and reduce the noise impact to the greatest extent. In this work, the sensitivity of different types of spectra was investigated across a series of hardware configurations of incidence angles and azimuth angles, so that the optimum hardware measurement configuration and spectrum parameter can be identified. FinFET structures at the 14 nm technology node were constructed to validate the algorithm. This method provides guidance for estimating the measurement precision before measuring actual device features and will be beneficial for OCD hardware configuration.
A Tightly Regulated Genetic Selection System with Signaling-Active Alleles of Phytochrome B.
Hu, Wei; Lagarias, J Clark
2017-01-01
Selectable markers derived from plant genes circumvent the potential risk of antibiotic/herbicide-resistance gene transfer into neighboring plant species, endophytic bacteria, and mycorrhizal fungi. Toward this goal, we have engineered and validated signaling-active alleles of phytochrome B (eYHB) as plant-derived selection marker genes in the model plant Arabidopsis (Arabidopsis thaliana). By probing the relationship of construct size and induction conditions to optimal phenotypic selection, we show that eYHB-based alleles are robust substitutes for antibiotic/herbicide-dependent marker genes as well as surprisingly sensitive reporters of off-target transgene expression. © 2017 American Society of Plant Biologists. All Rights Reserved.
Self-efficacy: a means of identifying problems in nursing education and career progress.
Harvey, V; McMurray, N
1994-10-01
Two nursing self-efficacy scales (academic and clinical) were developed and refined for use in identifying problems in progress in undergraduate nurses. Emergent factors within each scale contained items representing important aspects of nursing education. Both measures showed good internal consistency, test-retest reliability, and construct validity. Sensitivity to content and focus of tuition at time of completion was shown with some changes in factor structure over samples of first year nursing students. Academic self-efficacy (but not clinical self-efficacy) was predictive of course withdrawal. Applications to nursing education, progress in pursuing a nursing career and attrition are discussed.
A Tightly Regulated Genetic Selection System with Signaling-Active Alleles of Phytochrome B
2017-01-01
Selectable markers derived from plant genes circumvent the potential risk of antibiotic/herbicide-resistance gene transfer into neighboring plant species, endophytic bacteria, and mycorrhizal fungi. Toward this goal, we have engineered and validated signaling-active alleles of phytochrome B (eYHB) as plant-derived selection marker genes in the model plant Arabidopsis (Arabidopsis thaliana). By probing the relationship of construct size and induction conditions to optimal phenotypic selection, we show that eYHB-based alleles are robust substitutes for antibiotic/herbicide-dependent marker genes as well as surprisingly sensitive reporters of off-target transgene expression. PMID:27881727
Jokovic, Aleksandra; Locker, David; Guyatt, Gordan
2006-01-01
Background The Child Perceptions Questionnaire for children aged 11 to 14 years (CPQ11–14) is a 37-item measure of oral-health-related quality of life (OHRQoL) encompassing four domains: oral symptoms, functional limitations, emotional and social well-being. To facilitate its use in clinical settings and population-based health surveys, it was shortened to 16 and 8 items. Item impact and stepwise regression methods were used to produce each version. This paper describes the developmental process, compares the discriminative properties of the resulting four short-forms and evaluates their precision relative to the original CPQ11–14. Methods The item impact method used data from the CPQ11–14 item reduction study to select the questions with the highest impact scores in each domain. The regression method, where the dependent variable was the overall CPQ11–14 score and the independent variables its individual questions, was applied to the data collected in the validity study for the CPQ11–14. The measurement properties (i.e. criterion validity, construct validity, internal consistency reliability and test-retest reliability) of all 4 short-forms were evaluated using the data from the validity and reliability studies for the CPQ11–14. Results All short forms detected substantial variability in children's OHRQoL. The mean scores on the two 16-item questionnaires were almost identical, while on the two 8-item questionnaires they differed by only one score point. The mean scores standardized to 0–100 were higher on the short forms than the original CPQ11–14 (p < 0.001). There were strong significant correlations between all short-form scores and CPQ11–14 scores (0.87–0.98; p < 0.001). 
Hypotheses concerning construct validity were confirmed: the short-forms' scores were highest in the oro-facial, lower in the orthodontic and lowest in the paediatric dentistry group; all short-form questionnaires were positively correlated with the ratings of oral health and overall well-being, with the correlation coefficient being higher for the latter. The relative validity coefficients were 0.85 to 1.18. Cronbach's alpha and intraclass correlation coefficients ranged 0.71–0.83 and 0.71–0.77, respectively. Conclusion All short forms demonstrated excellent criterion validity and good construct validity. The reliability coefficients exceeded standards for group-level comparisons. However, these are preliminary findings based on the convenience sampling and further testing in replicated studies involving clinical and general samples of children in various settings is necessary to establish measurement sensitivity and discriminative properties of these questionnaires. PMID:16423298
Mickley, Manfred; Renner, Gerolf
2015-01-01
Do Current German-Language Intelligence Tests Take into Consideration the Special Needs of Children with Disabilities? A review of 23 German intelligence test manuals shows that test authors do not exclude the use of their tests for children with disabilities. However, these special groups play a minor role in the construction, standardization, and validation of intelligence tests. There is insufficient discussion of which construct-irrelevant requirements may reduce the validity of a test or which individual test adaptations are allowed or recommended. Intelligence testing of children with disabilities needs more empirical evidence on the objectivity, reliability, and validity of the assessment procedures employed. Future test construction and validation should systematically analyze construct-irrelevant variance in item format and the special needs of handicapped children, and should give guidance on useful test adaptations.
Hadi, Azlihanis Abdul; Naing, Nyi Nyi; Daud, Aziah; Nordin, Rusli
2006-11-01
This study was conducted to assess the reliability and construct validity of the Malay version of the Job Content Questionnaire (JCQ) among secondary school teachers in Kota Bharu, Kelantan. A total of 68 teachers consented to participate in the study and were administered the Malay version of the JCQ. Reliability was determined using Cronbach's alpha for internal consistency whilst construct validity was assessed using factor analysis. Cronbach's alpha coefficients were 0.75 for decision latitude, 0.50 for psychological job demand and 0.84 for social support. Factor analysis showed three meaningful common factors that could explain the construct of Karasek's demand-control-social support model. The study suggests the JCQ scales are reliable and valid tools for assessing job stress in school teachers.
Morizot, Julien
2014-10-01
While there are a number of short personality trait measures that have been validated for use with adults, few are specifically validated for use with adolescents. To trust such measures, it must be demonstrated that they have adequate construct validity. According to the view of construct validity as a unifying form of validity requiring the integration of different complementary sources of information, this article reports the evaluation of content, factor, convergent, and criterion validities as well as reliability of adolescents' self-reported personality traits. Moreover, this study sought to address an inherent potential limitation of short personality trait measures, namely their limited conceptual breadth. In this study, starting with items from a known measure, after the language-level was adjusted for use with adolescents, items tapping fundamental primary traits were added to determine the impact of added conceptual breadth on the psychometric properties of the scales. The resulting new measure was named the Big Five Personality Trait Short Questionnaire (BFPTSQ). A group of expert judges considered the items to have adequate content validity. Using data from a community sample of early adolescents, the results confirmed the factor validity of the Big Five structure in adolescence as well as its measurement invariance across genders. More important, the added items did improve the convergent and criterion validities of the scales, but did not negatively affect their reliability. This study supports the construct validity of adolescents' self-reported personality traits and points to the importance of conceptual breadth in short personality measures. © The Author(s) 2014.
Philips, Zoë; Whynes, David K; Avis, Mark
2006-02-01
This paper describes an experiment to test the construct validity of contingent valuation, by eliciting women's valuations for the NHS cervical cancer screening programme. It is known that, owing to low levels of knowledge of cancer and screening in the general population, women over-estimate both the risk of disease and the efficacy of screening. The study is constructed as a randomised experiment, in which one group is provided with accurate information about cervical cancer screening, whilst the other is not. The first hypothesis supporting construct validity, that controls who perceive greater benefits from screening will offer higher valuations, is substantiated. Both groups are then provided with objective information on an improvement to the screening programme, and are asked to value the improvement as an increment to their original valuations. The second hypothesis supporting construct validity, that controls who perceive the benefits of the programme to be high already will offer lower incremental valuations, is also substantiated. Copyright 2005 John Wiley & Sons, Ltd.
Statistically Controlling for Confounding Constructs Is Harder than You Think
Westfall, Jacob; Yarkoni, Tal
2016-01-01
Social scientists often seek to demonstrate that a construct has incremental validity over and above other related constructs. However, these claims are typically supported by measurement-level models that fail to consider the effects of measurement (un)reliability. We use intuitive examples, Monte Carlo simulations, and a novel analytical framework to demonstrate that common strategies for establishing incremental construct validity using multiple regression analysis exhibit extremely high Type I error rates under parameter regimes common in many psychological domains. Counterintuitively, we find that error rates are highest—in some cases approaching 100%—when sample sizes are large and reliability is moderate. Our findings suggest that a potentially large proportion of incremental validity claims made in the literature are spurious. We present a web application (http://jakewestfall.org/ivy/) that readers can use to explore the statistical properties of these and other incremental validity arguments. We conclude by reviewing SEM-based statistical approaches that appropriately control the Type I error rate when attempting to establish incremental validity. PMID:27031707
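The mechanism described above can be illustrated with a small Monte Carlo sketch. It is not the authors' simulation: the data-generating model, every parameter value and the function names are assumptions chosen for illustration, and the significance test uses a normal approximation to the t distribution. The outcome depends only on latent construct T1; a second construct T2 merely correlates with T1. Both are observed through noisy measures x1 and x2, and the code counts how often x2 appears to predict the outcome after controlling for x1.

```python
import math
import random
from statistics import NormalDist


def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    sab = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    saa = sum((x - ma) ** 2 for x in a)
    sbb = sum((y - mb) ** 2 for y in b)
    return sab / math.sqrt(saa * sbb)


def false_positive_rate(n=300, reliability=0.6, latent_r=0.7, beta=0.5,
                        reps=200, alpha=0.05, seed=0):
    """Share of replications in which x2 looks 'incrementally valid'.

    Assumed model (illustrative): y = beta * T1 + noise, so T2 has no
    effect of its own; corr(T1, T2) = latent_r; x1 and x2 measure T1 and
    T2 with the given reliability.
    """
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    # Error SD chosen so that var(T) / var(T + e) equals the reliability.
    err_sd = math.sqrt((1 - reliability) / reliability)
    rejections = 0
    for _ in range(reps):
        t1 = [rng.gauss(0, 1) for _ in range(n)]
        t2 = [latent_r * t + math.sqrt(1 - latent_r ** 2) * rng.gauss(0, 1)
              for t in t1]
        y = [beta * t + rng.gauss(0, 1) for t in t1]
        x1 = [t + err_sd * rng.gauss(0, 1) for t in t1]
        x2 = [t + err_sd * rng.gauss(0, 1) for t in t2]
        r_y1, r_y2, r_12 = pearson(y, x1), pearson(y, x2), pearson(x1, x2)
        # Testing x2's coefficient in y ~ x1 + x2 is equivalent to testing
        # the partial correlation of y and x2 given x1.
        partial = (r_y2 - r_y1 * r_12) / math.sqrt(
            (1 - r_y1 ** 2) * (1 - r_12 ** 2))
        t_stat = partial * math.sqrt((n - 3) / (1 - partial ** 2))
        if abs(t_stat) > z_crit:  # normal approximation, adequate for large n
            rejections += 1
    return rejections / reps


if __name__ == "__main__":
    print("perfect measurement:", false_positive_rate(reliability=1.0))
    print("reliability 0.6, n=1000:", false_positive_rate(n=1000, reps=100))
```

Under these assumptions the rejection rate stays near the nominal level when measurement is perfect and climbs well above it when reliability is moderate and the sample is large, which is the qualitative pattern the abstract reports.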
Zhang, Tan; Chen, Ang
2017-01-01
Based on the job demands-resources model, the study developed and validated an instrument that measures physical education teachers' job demands-resources perception. Expert review established content validity with the average item rating of 3.6/5.0. Construct validity and reliability were determined with a teacher sample (n = 397). Exploratory factor analysis established a five-dimension construct structure matching the theoretical construct deliberated in the literature. The composite reliability scores for the five dimensions range from .68 to .83. Validity coefficients (intraclass correlational coefficients) are .69 for job resources items and .82 for job demands items. Inter-scale correlational coefficients range from -.32 to .47. Confirmatory factor analysis confirmed the construct validity with high dimensional factor loadings (ranging from .47 to .84 for job resources scale and from .50 to .85 for job demands scale) and adequate model fit indexes (root mean square error of approximation = .06). The instrument provides a tool to measure physical education teachers' perception of their working environment.
Physics Metacognition Inventory Part II: Confirmatory factor analysis and Rasch analysis
NASA Astrophysics Data System (ADS)
Taasoobshirazi, Gita; Bailey, MarLynn; Farley, John
2015-11-01
The Physics Metacognition Inventory was developed to measure physics students' metacognition for problem solving. In one of our earlier studies, an exploratory factor analysis provided evidence of preliminary construct validity, revealing six components of students' metacognition when solving physics problems including knowledge of cognition, planning, monitoring, evaluation, debugging, and information management. The college students' scores on the inventory were found to be reliable and related to students' physics motivation and physics grade. However, the results of the exploratory factor analysis indicated that the questionnaire could be revised to improve its construct validity. The goal of this study was to revise the questionnaire and establish its construct validity through a confirmatory factor analysis. In addition, a Rasch analysis was applied to the data to better understand the psychometric properties of the inventory and to further evaluate the construct validity. Results indicated that the final, revised inventory is a valid, reliable, and efficient tool for assessing student metacognition for physics problem solving.
Zhang, Tan; Chen, Ang
2017-01-01
Based on the job demands–resources model, the study developed and validated an instrument that measures physical education teachers’ job demands–resources perception. Expert review established content validity with the average item rating of 3.6/5.0. Construct validity and reliability were determined with a teacher sample (n = 397). Exploratory factor analysis established a five-dimension construct structure matching the theoretical construct deliberated in the literature. The composite reliability scores for the five dimensions range from .68 to .83. Validity coefficients (intraclass correlational coefficients) are .69 for job resources items and .82 for job demands items. Inter-scale correlational coefficients range from −.32 to .47. Confirmatory factor analysis confirmed the construct validity with high dimensional factor loadings (ranging from .47 to .84 for job resources scale and from .50 to .85 for job demands scale) and adequate model fit indexes (root mean square error of approximation = .06). The instrument provides a tool to measure physical education teachers’ perception of their working environment. PMID:29200808
Pike, Nancy A; Poulsen, Marie K; Woo, Mary A
Cognitive deficits are common, long-term sequelae in children and adolescents with congenital heart disease (CHD) who have undergone surgical palliation. However, there is a lack of a validated brief cognitive screening tool appropriate for the outpatient setting for adolescents with CHD. One candidate instrument is the Montreal Cognitive Assessment (MoCA) questionnaire. The purpose of the research was to validate scores from the MoCA against the General Memory Index (GMI) of the Wide Range Assessment of Memory and Learning, 2nd Edition (WRAML2), a widely accepted measure of cognition/memory, in adolescents and young adults with CHD. We administered the MoCA and the WRAML2 to 156 adolescents and young adults aged 14-21 (80 youth with CHD and 76 healthy controls who were gender and age matched). Spearman's rank order correlations were used to assess concurrent validity. To assess construct validity, the Mann-Whitney U test was used to compare differences in scores in youth with CHD and the healthy control group. Receiver operating characteristic curves were created and area under the curve, sensitivity, specificity, positive predictive value, and negative predictive value were also calculated. The median MoCA score was 23 (range 15-29) in the CHD group versus 28 (range 22-30) in healthy controls (p < .001). With the screening cutoff scores at <26 points for the MoCA and 85 for GMI (<1 SD, M = 100, SD = 15), the CHD versus healthy control groups showed sensitivity of .96 and specificity of .67 versus sensitivity of .75 and specificity of .90, respectively, in the detection of cognitive deficits. A cutoff score of 26 on the MoCA was optimal in the CHD group; a cutoff of 25 had similar properties except for a lower negative predictive value. The area under the receiver operating characteristic curve for the MoCA was 0.84 (95% CI [0.75, 0.93], p < .001) and 0.84 (95% CI [0.62, 1.00], p = .02) for the CHD and controls, respectively. 
Scores on the MoCA were valid for screening to detect cognitive deficits in adolescents and young adults aged 14-21 with CHD when a cutoff score of 26 is used to differentiate youth with and without significant cognitive impairment. Future studies are needed in other adolescent disease groups with known cognitive deficits and healthy populations to explore the generalizability of validity of MoCA scores in adolescents and young adults.
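The area under a receiver operating characteristic curve can be computed directly from two groups of scores through its link with the Mann-Whitney statistic: it equals the probability that a randomly chosen impaired subject scores lower than a randomly chosen unimpaired one, counting ties as one half. The sketch below assumes, as for the MoCA, that lower scores indicate impairment; the function names and the example scores are hypothetical, not study data.

```python
def auc_lower_is_positive(impaired, unimpaired):
    """AUC when LOWER scores indicate the condition.

    Equals P(impaired score < unimpaired score) + 0.5 * P(tie),
    i.e. the Mann-Whitney U statistic divided by the number of pairs.
    """
    wins = 0.0
    for p in impaired:
        for q in unimpaired:
            if p < q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(impaired) * len(unimpaired))


def screen_at_cutoff(impaired, unimpaired, cutoff):
    """Sensitivity and specificity when scores below `cutoff` screen positive."""
    sensitivity = sum(1 for s in impaired if s < cutoff) / len(impaired)
    specificity = sum(1 for s in unimpaired if s >= cutoff) / len(unimpaired)
    return sensitivity, specificity


if __name__ == "__main__":
    # Hypothetical screening scores, not data from the study.
    impaired = [20, 22, 25]
    unimpaired = [24, 27, 28]
    print(auc_lower_is_positive(impaired, unimpaired))
    print(screen_at_cutoff(impaired, unimpaired, 26))
```

Scanning `screen_at_cutoff` over candidate cutoffs traces out the points of the ROC curve, which is one way to compare neighbouring cutoffs such as 25 and 26.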
The validation of a home food inventory.
Fulkerson, Jayne A; Nelson, Melissa C; Lytle, Leslie; Moe, Stacey; Heitzler, Carrie; Pasch, Keryn E
2008-11-04
Home food inventories provide an efficient method for assessing home food availability; however, few are validated. The present study's aim was to develop and validate a home food inventory that is easily completed by research participants in their homes and includes a comprehensive range of both healthful and less healthful foods that are associated with obesity. A home food inventory (HFI) was developed and tested with two samples. Sample 1 included 51 adult participants and six trained research staff who independently completed the HFI in participants' homes. Sample 2 included 342 families in which parents completed the HFI and the Diet History Questionnaire (DHQ) and students completed three 24-hour dietary recall interviews. HFI items assessed 13 major food categories as well as two categories assessing ready-access to foods in the kitchen and the refrigerator. An obesogenic household food availability score was also created. To assess criterion validity, participants' and research staffs' assessment of home food availability were compared (staff = gold standard). Criterion validity was evaluated with kappa, sensitivity, and specificity. Construct validity was assessed with correlations of five HFI major food category scores with servings of the same foods and associated nutrients from the DHQ and dietary recalls. Kappa statistics for all 13 major food categories and the two ready-access categories ranged from 0.61 to 0.83, indicating substantial agreement. Sensitivity ranged from 0.69 to 0.89, and specificity ranged from 0.86 to 0.95. Spearman correlations between staff and participant major food category scores ranged from 0.71 to 0.97. Correlations between the HFI scores and food group servings and nutrients on the DHQ (parents) were all significant (p < .05) while about half of associations between the HFI and dietary recall interviews (adolescents) were significant (p < .05). 
The obesogenic home food availability score was significantly associated (p < .05) with energy intake of both parents and adolescents. This new home food inventory is valid, participant-friendly, and may be useful for community-based behavioral nutrition and obesity prevention research. The inventory builds on previous measures by including a wide range of healthful and less healthful foods rather than foods targeted for a specific intervention.
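Cohen's kappa, used above to quantify agreement between participants and trained staff, corrects raw agreement for the agreement expected by chance from each rater's marginal frequencies. A minimal sketch follows; the function name and the example ratings are hypothetical, not study data.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    """
    n = len(rater_a)
    observed = sum(1 for a, b in zip(rater_a, rater_b) if a == b) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement from the product of the raters' marginal proportions.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Hypothetical presence/absence ratings of eight food items.
    participant = [1, 1, 0, 0, 1, 0, 1, 0]
    staff = [1, 1, 0, 0, 1, 0, 0, 1]
    print(cohen_kappa(participant, staff))  # 0.5
```

Kappa is undefined when chance agreement equals 1, for example when both raters assign the same single category to every item.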
Garfjeld Roberts, Patrick; Guyver, Paul; Baldwin, Mathew; Akhtar, Kash; Alvand, Abtin; Price, Andrew J; Rees, Jonathan L
2017-02-01
To assess the construct and face validity of ArthroS, a passive haptic VR simulator. A secondary aim was to evaluate the novel performance metrics produced by this simulator. Two groups of 30 participants, each divided into novice, intermediate or expert based on arthroscopic experience, completed three separate tasks on either the knee or shoulder module of the simulator. Performance was recorded using 12 automatically generated performance metrics and video footage of the arthroscopic procedures. The videos were blindly assessed using a validated global rating scale (GRS). Participants completed a survey about the simulator's realism and training utility. This new simulator demonstrated construct validity of its tasks when evaluated against a GRS (p ≤ 0.003 in all cases). Regarding its automatically generated performance metrics, established outputs such as time taken (p ≤ 0.001) and instrument path length (p ≤ 0.007) also demonstrated good construct validity. However, two-thirds of the proposed 'novel metrics' the simulator reports could not distinguish participants based on arthroscopic experience. Face validity assessment rated the simulator as a realistic and useful tool for trainees, but the passive haptic feedback (a key feature of this simulator) was rated as less realistic. The ArthroS simulator has good task construct validity based on established objective outputs, but some of the novel performance metrics could not distinguish between levels of surgical experience. The passive haptic feedback of the simulator also needs improvement. If simulators could offer automated and validated performance feedback, this would facilitate improvements in the delivery of training by allowing trainees to practise and self-assess.
Bhandari, T R; Dangal, G; Sarma, P S; Kutty, V R
2014-01-01
Women's autonomy is one of the predictors of maternal health care service utilization. This study aimed to construct and validate a scale for measuring women's autonomy with relevance to developing countries. We conducted a study for construction and validation of a scale in Rupandehi and further validated it in Kapilvastu districts of Nepal. Initially, we administered a 24-item preliminary scale and finalized a 23-item scale using psychometric tests. After defining the construct of women's autonomy, we pooled 194 items and selected 24 items to develop a preliminary scale. The scale development process followed several steps, i.e., definition of the construct, generation of an item pool, pretesting, analysis of psychometric tests, and further validation. The new scale was strongly supported by Cronbach's alpha (0.84), test-retest Pearson correlation (0.87), average content validity ratio (0.8), and overall agreement (Kappa value of the items, 0.83); all values were found satisfactory. From factor analysis, we selected 23 items for the final scale, which show good convergent and discriminant validity. From the preliminary draft, we removed one item; the remaining 23 items loaded on five factors. All five factors had single-loading items after suppressing absolute coefficient values less than 0.45, and the average coefficient of each factor was more than 0.60. Similarly, the factors and loaded items had good convergent and discriminant validity, which further showed the strong measurement capacity of the scale. The new scale is a reliable tool for assessing women's autonomy in developing countries. We recommend further use and validation of the scale to ensure its measurement capacity.
Hickman, Ronald L; Clochesy, John M; Hetland, Breanna; Alaamri, Marym
2017-04-01
There are limited reliable and valid measures of the patient-provider interaction among adults with hypertension. Therefore, the purpose of this report is to describe the construct validity and reliability of the Questionnaire on the Quality of Physician-Patient Interaction (QQPPI) in community-dwelling adults with hypertension. A convenience sample of 109 participants with hypertension was recruited and administered the QQPPI at baseline and 8 weeks later. The exploratory factor analysis established that a 12-item, 2-factor structure for the QQPPI was valid in this sample. The modified QQPPI proved to have sufficient internal consistency and test-retest reliability. The modified QQPPI is a valid and reliable measure of the provider-patient interaction, a construct posited to impact self-management, in adults with hypertension.
Innstrand, Siw Tone; Christensen, Marit; Undebakke, Kirsti Godal; Svarva, Kyrre
2015-12-01
The aim of the present paper is to present and validate a Knowledge-Intensive Work Environment Survey Target (KIWEST), a questionnaire developed for assessing the psychosocial factors among people in knowledge-intensive work environments. The construct validity and reliability of the measurement model were tested on a representative sample of 3066 academic and administrative staff working at one of the largest universities in Norway. Confirmatory factor analysis provided initial support for the convergent validity and internal consistency of the 30-construct KIWEST measurement model. However, discriminant validity tests indicated that some of the constructs might overlap to some degree. Overall, the KIWEST measure showed promising psychometric properties as a psychosocial work environment measure. © 2015 the Nordic Societies of Public Health.
Soleimani, Mohammad Ali; Yaghoobzadeh, Ameneh; Bahrami, Nasim; Sharif, Saeed Pahlevan; Sharif Nia, Hamid
2016-10-01
In this study, 398 Iranian cancer patients completed the 15-item Templer's Death Anxiety Scale (TDAS). Tests of internal consistency, principal components analysis, and confirmatory factor analysis were conducted to assess the internal consistency and factorial validity of the Persian TDAS. The construct reliability statistic and average variance extracted were also calculated to measure construct reliability, convergent validity, and discriminant validity. Principal components analysis indicated a 3-component solution, which was generally supported in the confirmatory analysis. However, acceptable cutoffs for construct reliability, convergent validity, and discriminant validity were not fulfilled for the three subscales that were derived from the principal component analysis. This study demonstrated both the advantages and potential limitations of using the TDAS with Persian-speaking cancer patients.
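The construct reliability and average variance extracted (AVE) statistics mentioned above are usually computed from standardized factor loadings. The following is a minimal sketch under stated assumptions: the loadings are hypothetical (none are reported in the abstract), and the cutoffs in the comments (construct reliability of at least .70, AVE of at least .50, AVE greater than the squared inter-construct correlation) are common conventions rather than values taken from this study.

```python
def average_variance_extracted(loadings):
    """AVE: mean of the squared standardized loadings of one construct."""
    return sum(l * l for l in loadings) / len(loadings)


def construct_reliability(loadings):
    """Composite (construct) reliability from standardized loadings.

    CR = (sum of loadings)^2 / ((sum of loadings)^2 + sum of error variances),
    where each error variance is taken as 1 - loading^2.
    """
    total = sum(loadings)
    error = sum(1 - l * l for l in loadings)
    return total * total / (total * total + error)


def discriminant_ok(ave_a, ave_b, correlation):
    """Fornell-Larcker style check: both AVEs exceed the squared correlation."""
    shared = correlation * correlation
    return ave_a > shared and ave_b > shared


# Hypothetical loadings for two subscales (illustration only).
subscale_a = [0.72, 0.65, 0.58, 0.70]
subscale_b = [0.55, 0.60, 0.48]

for name, loadings in (("A", subscale_a), ("B", subscale_b)):
    cr = construct_reliability(loadings)
    ave = average_variance_extracted(loadings)
    # Conventional (assumed) cutoffs: CR >= .70 and AVE >= .50.
    print(name, round(cr, 3), round(ave, 3), cr >= 0.70, ave >= 0.50)
```

A subscale can pass the reliability cutoff while failing the AVE cutoff, which is one way a factor solution can look acceptable in a confirmatory analysis and still fall short on convergent validity.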
Smith, Gregory T.; McCarthy, Denis M.; Zapolski, Tamika C. B.
2010-01-01
The authors argue for a significant shift in how clinical psychology researchers conduct construct validation and theory validation tests. They argue that sound theory and validation tests can best be conducted on measures of unidimensional or homogeneous constructs. Hierarchical organizations of such constructs are useful descriptively and theoretically, but higher order composites do not refer to definable psychological processes. Application of this perspective to the approach of the Diagnostic and Statistical Manual of Mental Disorders to describing psychopathology calls into doubt the traditional use of the syndromal approach, in which single scores reflect the presence of multidimensional disorders. For many forms of psychological dysfunction, this approach does not appear optimal and may need to be discarded. The authors note that their perspective represents a straightforward application of existing psychometric theory, they demonstrate the practical value of adopting this perspective, and they provide evidence that this shift is already under way among clinical researchers. Description in terms of homogeneous dimensions provides improved validity, utility, and parsimony. In contrast, the use of composite diagnoses can retard scientific progress and hamper clinicians' efforts to understand and treat dysfunction. PMID:19719340
Calibration of the Dutch-Flemish PROMIS Pain Behavior item bank in patients with chronic pain.
Crins, M H P; Roorda, L D; Smits, N; de Vet, H C W; Westhovens, R; Cella, D; Cook, K F; Revicki, D; van Leeuwen, J; Boers, M; Dekker, J; Terwee, C B
2016-02-01
The aims of the current study were to calibrate the item parameters of the Dutch-Flemish PROMIS Pain Behavior item bank using a sample of Dutch patients with chronic pain and to evaluate cross-cultural validity between the Dutch-Flemish and the US PROMIS Pain Behavior item banks. Furthermore, reliability and construct validity of the Dutch-Flemish PROMIS Pain Behavior item bank were evaluated. The 39 items in the bank were completed by 1042 Dutch patients with chronic pain. To evaluate unidimensionality, a one-factor confirmatory factor analysis (CFA) was performed. A graded response model (GRM) was used to calibrate the items. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was evaluated. Reliability of the item bank was also examined, and construct validity was studied using several legacy instruments, e.g. the Roland Morris Disability Questionnaire. CFA supported the unidimensionality of the Dutch-Flemish PROMIS Pain Behavior item bank (CFI = 0.960, TLI = 0.958). The data also fit the GRM, and the items demonstrated good coverage across the pain behavior construct (threshold parameters range: -3.42 to 3.54). Analysis showed good cross-cultural validity (only six DIF items), reliability (Cronbach's α = 0.95) and construct validity (all correlations ≥0.53). The Dutch-Flemish PROMIS Pain Behavior item bank was found to have good cross-cultural validity, reliability and construct validity. The development of the Dutch-Flemish PROMIS Pain Behavior item bank will serve as the basis for Dutch-Flemish PROMIS short forms and computer adaptive testing (CAT). © 2015 European Pain Federation - EFIC®
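To illustrate what calibrating an item under a graded response model involves, the sketch below computes response-category probabilities for one polytomous item in the standard logistic form of the GRM. The discrimination value and the thresholds are hypothetical and are not taken from the item bank; only the general shape of the model is assumed.

```python
import math


def grm_category_probs(theta, discrimination, thresholds):
    """Category probabilities for one item under a graded response model.

    theta          -- latent trait level of the respondent
    discrimination -- item slope (a), assumed positive
    thresholds     -- ascending threshold parameters (b_1 < b_2 < ...)

    P(X >= k) follows a logistic curve in theta for each threshold; the
    probability of a single category is the difference between adjacent
    cumulative probabilities.
    """
    cumulative = [1.0]
    for b in thresholds:
        cumulative.append(1.0 / (1.0 + math.exp(-discrimination * (theta - b))))
    cumulative.append(0.0)
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]


# Hypothetical item with four response categories (three thresholds).
probs = grm_category_probs(theta=0.5, discrimination=1.8, thresholds=[-1.0, 0.0, 1.5])
print([round(p, 3) for p in probs])
```

Thresholds spread widely across the trait scale, as in the range reported above, mean that at least some items remain informative for respondents with very low or very high levels of the construct.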
Engagement DEOCS 4.1 Construct Validity Summary
2017-08-01
Engagement DEOCS 4.1 Construct Validity Summary DEFENSE EQUAL OPPORTUNITY MANAGEMENT INSTITUTE DIRECTORATE OF...increasingly popular construct in industry and research. Indeed, management literature suggests employee engagement is the key to an organization’s...definition was drawn upon to inform the creation of a definition and measure of engagement that was then adapted using subject matter expert (SME)1
ERIC Educational Resources Information Center
Joosten, Annette V.; Bundy, Anita C.
2008-01-01
Construct validity of the Motivation Assessment Scale (MAS) (Durand, Crimmins, The Motivation Assessment Scale 1988) was studied using Rasch analysis data from 67 children (246 MASs), with dual diagnosis of autism and intellectual disability or with intellectual disability only. Results failed to support the proposed unidimensional construct or…
ERIC Educational Resources Information Center
Setari, Anthony Philip
2016-01-01
The purpose of this study was to construct a holistic education school evaluation tool using Montessori Erdkinder principles, and begin the validation process of examining the proposed tool. This study addresses a vital need in the holistic education community for a school evaluation tool. The tool construction process included using Erdkinder…
Teacher Efficacy and Preservice Teachers: A Construct Validation.
ERIC Educational Resources Information Center
Kushner, Susan N.
A construct validation of a modified version of a teacher efficiency scale was conducted to establish its use with preservice teachers. The scale adapted by A. E. Woolfolk and W. K. Hoy from one constructed by S. Gibson and M. H. Dembo, which contained 12 personal efficacy (PE) and 6 general teaching efficacy (TE) items, was further modified for…
ERIC Educational Resources Information Center
Jarjoura, David; Hartman-Stein, Paula; Speight, Joan; Reuter, Jeanette
1999-01-01
Examined the reliability and construct validity in an older adult population (n=149 older adults and their informants) of scores on the Behavioral Competence Inventory (BCI) (P. Hartman-Stein). Results indicate that scores on the BCI's seven scales show adequate internal consistencies and represent seven overlapping but distinct constructs in this…
Nia, Hamid Sharif; Sharif, Saeed Pahlevan; Froelicher, Erika Sivarajan; Boyle, Christopher; Goudarzian, Amir Hossein; Yaghoobzadeh, Ameneh; Oskouie, Fatemeh
2018-04-01
The aim of this study was to validate a Persian version of the Cardiac Depression Scale (CDS) in Iranian patients with acute myocardial infarction (AMI). The CDS was forward translated from English into Persian and back-translated to English. Validity was assessed using face, content, and construct validity. Cronbach's alpha (α), theta (θ), and McDonald's omega coefficient were also used to evaluate reliability. Construct validity of the scale showed two factors with eigenvalues greater than one. Cronbach's α, θ, McDonald's omega, and construct reliability were greater than .70. The Persian version of the CDS has a two-factor structure (i.e., death anxiety and life satisfaction) and has acceptable reliability and validity. Therefore, the validated instrument can be used in future studies to assess depression in Iranian patients with AMI.
ERIC Educational Resources Information Center
Huelsman, Timothy J.; Gagnon, Sandra Glover; Kidder-Ashley, Pamela; Griggs, Marissa Swaim
2014-01-01
Research Findings: Child temperament is an important construct, but its measurement has been marked by a number of weaknesses that have diminished the frequency with which it is assessed in practice. We address this problem by presenting the results of a quantitative construct validation study. We calculated validity indices by hypothesizing the…
Construction of Valid and Reliable Test for Assessment of Students
ERIC Educational Resources Information Center
Osadebe, P. U.
2015-01-01
The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
ERIC Educational Resources Information Center
Eleje, Lydia I.; Esomonu, Nkechi P. M.
2018-01-01
A test to measure achievement in quantitative economics among secondary school students was developed and validated in this study. The test is made up of 20 multiple choice test items constructed based on quantitative economics sub-skills. Six research questions guided the study. Preliminary validation was done by two experienced teachers in…
An Evaluation of the Validity and Reliability of a Food Behavior Checklist Modified for Children
ERIC Educational Resources Information Center
Branscum, Paul; Sharma, Manoj; Kaye, Gail; Succop, Paul
2010-01-01
Objective: The objective of this study was to report the construct validity and internal consistency reliability of the Food Behavior Checklist modified for children (FBC-MC), with low-income, Youth Expanded Food and Nutrition Education Program (EFNEP)-eligible children. Methods: Using a cross-sectional research design, construct validity was…
ERIC Educational Resources Information Center
Livingstone, Holly A.; Day, Arla L.
2005-01-01
Despite the popularity of the concept of emotional intelligence (EI), there is much controversy around its definition, measurement, and validity. Therefore, the authors examined the construct and criterion-related validity of an ability-based EI measure (Mayer Salovey Caruso Emotional Intelligence Test [MSCEIT]) and a mixed-model EI measure…
ERIC Educational Resources Information Center
Shriver, Mark D.; Frerichs, Lynae J.; Williams, Melissa; Lancaster, Blake M.
2013-01-01
Direct observation is often considered the "gold standard" for assessing the function, frequency, and intensity of problem behavior. Currently, the literature investigating the construct validity of direct observation conducted in the clinic setting reveals conflicting results. Previous studies on the construct validity of clinic-based…
Construction and Evaluation of Reliability and Validity of Reasoning Ability Test
ERIC Educational Resources Information Center
Bhat, Mehraj A.
2014-01-01
This paper is based on the construction and evaluation of the reliability and validity of a reasoning ability test for secondary school students. In this paper an attempt was made to evaluate validity and reliability and to determine the appropriate standards to interpret the results of the reasoning ability test. The test includes 45 items to measure six types…
Hazing DEOCS 4.1 Construct Validity Summary
2017-08-01
Hazing DEOCS 4.1 Construct Validity Summary DEFENSE EQUAL OPPORTUNITY MANAGEMENT INSTITUTE DIRECTORATE OF...the analysis. Tables 4 – 6 provide additional information regarding the descriptive statistics and reliability of the Hazing items. Table 7 provides
Sensitivity-Uncertainty Based Nuclear Criticality Safety Validation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brown, Forrest B.
2016-09-20
These are slides from a seminar given to the University of Mexico Nuclear Engineering Department. Whisper is a statistical analysis package developed to support nuclear criticality safety validation. It uses the sensitivity profile data for an application as computed by MCNP6 along with covariance files for the nuclear data to determine a baseline upper-subcritical-limit for the application. Whisper and its associated benchmark files are developed and maintained as part of MCNP6, and will be distributed with all future releases of MCNP6. Although sensitivity-uncertainty methods for NCS validation have been under development for 20 years, continuous-energy Monte Carlo codes such as MCNP could not determine the required adjoint-weighted tallies for sensitivity profiles. The recent introduction of the iterated fission probability method into MCNP led to the rapid development of sensitivity analysis capabilities for MCNP6 and the development of Whisper. Sensitivity-uncertainty based methods represent the future for NCS validation – making full use of today’s computer power to codify past approaches based largely on expert judgment. Validation results are defensible, auditable, and repeatable as needed with different assumptions and process models. The new methods can supplement, support, and extend traditional validation approaches.
Ruch, Willibald; Heintz, Sonja
2017-01-01
How strongly does humor (i.e., the construct-relevant content) in the Humor Styles Questionnaire (HSQ; Martin et al., 2003) determine the responses to this measure (i.e., construct validity)? Also, how much does humor influence the relationships of the four HSQ scales, namely affiliative, self-enhancing, aggressive, and self-defeating, with personality traits and subjective well-being (i.e., criterion validity)? The present paper answers these two questions by experimentally manipulating the 32 items of the HSQ to only (or mostly) contain humor (i.e., construct-relevant content) or to substitute the humor content with non-humorous alternatives (i.e., only assessing construct-irrelevant context). Study 1 (N = 187) showed that the HSQ affiliative scale was mainly determined by humor, self-enhancing and aggressive were determined by both humor and non-humorous context, and self-defeating was primarily determined by the context. This suggests that humor is not the primary source of the variance in three of the HSQ scales, thereby limiting their construct validity. Study 2 (N = 261) showed that the relationships of the HSQ scales to the Big Five personality traits and subjective well-being (positive affect, negative affect, and life satisfaction) were consistently reduced (personality) or vanished (subjective well-being) when the non-humorous contexts in the HSQ items were controlled for. For the HSQ self-defeating scale, the pattern of relationships to personality was also altered, supporting a positive rather than a negative view of the humor in this humor style. The present findings thus call for a reevaluation of the role that humor plays in the HSQ (construct validity) and in the relationships to personality and well-being (criterion validity). PMID:28473794
Nessen, Thomas; Demmelmaier, Ingrid; Nordgren, Birgitta; Opava, Christina H
2015-01-01
The aim of the present study was to investigate aspects of reliability and validity of the Exercise Self-Efficacy Scale (ESES-S) in a rheumatoid arthritis (RA) population. A total of 244 people with RA participating in a physical activity study were included. The six-item ESES-S, exploring confidence in performing exercise, was assessed for test-retest reliability over 4-6 months, and for internal consistency. Construct validity was investigated through correlations with similar and other constructs. An intraclass correlation coefficient (ICC) of 0.59 (95% CI 0.37-0.73) was found for 84 participants with stable health perceptions between measurement occasions. Cronbach's alpha coefficients of 0.87 and 0.89 were found at the first and second measurements. Corrected item-total correlations for single ESES-S items ranged between 0.53 and 0.73. Construct convergent validity for the ESES-S was partly confirmed by correlations with health-enhancing physical activity and outcome expectations respectively (Pearson's r = 0.18, p < 0.01). Construct divergent validity was confirmed by the absence of correlations with age or gender. No floor or ceiling effects were found for ESES-S. The results indicate that the ESES-S has moderate test-retest reliability and respectable internal consistency in people with RA. Construct validity was partially supported in the present sample. Further research on construct validity of the ESES-S is recommended. Physical exercise is crucial for management of symptoms and co-morbidity in rheumatoid arthritis. Self-efficacy for exercise is important to address in rehabilitation as it regulates exercise motivation and behavior. Measurement properties of self-efficacy scales need to be assessed in specific populations and different languages.
Valente, Ana Rita S; Hall, Andreia; Alvelos, Helena; Leahy, Margaret; Jesus, Luis M T
2018-04-12
The appropriate use of language in context depends on the speaker's pragmatic language competencies. A coding system was used to develop a specific, adult-focused questionnaire, The Assessment of Language Use in Social Contexts for Adults, to be self-administered by adults who stutter and adults who do not stutter, with three categories: precursors, basic exchanges, and extended literal/non-literal discourse. This paper presents the content validity, item analysis, reliability coefficients and evidence of construct validity of the instrument. Content validity analysis was based on a two-stage process: first, 11 pragmatic questionnaires were assessed to identify items that probe each pragmatic competency and to create the first version of the instrument; second, items were assessed qualitatively by an expert panel composed of adults who stutter and controls, and quantitatively and qualitatively by an expert panel composed of clinicians. A pilot study was conducted with five adults who stutter and five controls to analyse items and calculate reliability. Construct validity evidence was obtained using the hypothesized relationships method and factor analysis with 28 adults who stutter and 28 controls. Concerning content validity, the questionnaires assessed up to 13 pragmatic competencies. Qualitative and quantitative analysis revealed ambiguities in item construction. Disagreement between experts was resolved through item modification. The pilot study showed that the instrument presented internal consistency and temporal stability. Significant differences between adults who stutter and controls and different response profiles revealed the instrument's underlying construct. The instrument is reliable and presented evidence of construct validity.
Validation of the Short Gambling Harm Screen (SGHS): A Tool for Assessment of Harms from Gambling.
Browne, Matthew; Goodwin, Belinda C; Rockloff, Matthew J
2018-06-01
It is common for jurisdictions tasked with minimising gambling-related harm to conduct problem gambling prevalence studies for the purpose of monitoring the impact of gambling on the community. However, given that both public health theory and empirical findings suggest that harms can occur without individuals satisfying clinical criteria of addiction, there is a recognized conceptual disconnect between the prevalence of clinical problem gamblers, and aggregate harm to the community. Starting with an initial item pool of 72 specific harms caused by problematic gambling, our aim was to develop a short gambling harms scale (SGHS) to screen for the presence and degree of harm caused by gambling. An Internet panel of 1524 individuals who had gambled in the last year completed a 72-item checklist, along with the Personal Wellbeing Index, the PGSI, and other measures. We selected 10 items for the SGHS, with the goals of maximising sensitivity and construct coverage. Psychometric analysis suggests very strong reliability, homogeneity and unidimensionality. Non-zero responses on the SGHS were associated with a large decrease in personal wellbeing, with wellbeing decreasing linearly with the number of harms indicated. We conclude that weighted SGHS scores can be aggregated at the population level to yield a sensitive and valid measure of gambling harm.
Prenatal expectations in Mexican American women: development of a culturally sensitive measure.
Gress-Smith, Jenna L; Roubinov, Danielle S; Tanaka, Rika; Crnic, Keith; Gonzales, Nancy; Enders, Craig; Luecken, Linda J
2013-08-01
Prenatal expectations describe various domains a woman envisions in preparation for her role as a new mother and influence how women transition into the maternal role. Although the maternal role is strongly influenced by the prevailing familial and sociocultural context, research characterizing prenatal expectations in ethnic minority and low-income women is lacking. As part of the largest growing minority group in the USA, Latina mothers represent an important group to study. Two hundred and ten low-income Mexican American women were administered the Prenatal Experiences Scale for Mexican Americans (PESMA) that was adapted to capture specific cultural aspects of prenatal expectations. Measures of current support, prenatal depressive symptoms, and other sociodemographic characteristics were also completed to assess validity. Exploratory factor analysis identified three underlying factors of prenatal expectations: paternal support, family support, and maternal role fulfillment. Associations among these subscales and demographic and cultural variables were conducted to characterize women who reported higher and lower levels of expectations. The PESMA demonstrated good concurrent validity when compared to measures of social support, prenatal depressive symptoms, and other sociodemographic constructs. A culturally sensitive measure of prenatal expectations is an important step towards a better understanding of how Mexican American women transition to the maternal role and identify culturally specific targets for interventions to promote maternal health.
Durão, Solange; Kredo, Tamara; Volmink, Jimmy
2015-06-01
To develop, assess, and maximize the sensitivity of a search strategy to identify diet and nutrition trials in PubMed using relative recall. We developed a search strategy to identify diet and nutrition trials in PubMed. We then constructed a gold standard reference set to validate the identified trials using the relative recall method. Relative recall was calculated by dividing the number of references from the gold standard our search strategy identified by the total number of references in the gold standard. Our gold standard comprised 298 trials, derived from 16 included systematic reviews. The initial search strategy identified 242 of 298 references, with a relative recall of 81.2% [95% confidence interval (CI): 76.3%, 85.5%]. We analyzed titles and abstracts of the 56 missed references for possible additional terms. We then modified the search strategy accordingly. The relative recall of the final search strategy was 88.6% (95% CI: 84.4%, 91.9%). We developed a search strategy to identify diet and nutrition trials in PubMed with a high relative recall (sensitivity). This could be useful for establishing a nutrition trials register to support the conduct of future research, including systematic reviews. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
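The relative recall calculation described above is a simple proportion, and can be sketched as follows using the counts reported for the initial search strategy (242 of 298 gold-standard references). The confidence interval method is an assumption: the abstract does not say how its interval was computed, so the Wilson score interval used here is only illustrative and may differ slightly from the reported bounds.

```python
import math


def relative_recall(retrieved_from_gold, gold_total):
    """Share of gold-standard references that the search strategy retrieved."""
    return retrieved_from_gold / gold_total


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a proportion (assumed method, see lead-in)."""
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half_width, centre + half_width


# Counts reported in the abstract for the initial search strategy.
recall = relative_recall(242, 298)
low, high = wilson_interval(242, 298)
print(f"relative recall = {recall:.1%}, approx. 95% CI = {low:.1%} to {high:.1%}")
```

The 56 missed references (298 minus 242) are the ones the authors screened for additional search terms before rerunning the calculation on the modified strategy.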
Noh, Wonjung; Seomun, Gyeongae
2015-06-01
This study was conducted to develop key performance indicators (KPIs) for home care nursing (HCN) based on a balanced scorecard, and to construct a performance prediction model of strategic objectives using the Bayesian Belief Network (BBN). This methodological study included four steps: establishment of KPIs, performance prediction modeling, development of a performance prediction model using BBN, and simulation of a suggested nursing management strategy. An HCN expert group and a staff group participated. The content validity index was analyzed using STATA 13.0, and BBN was analyzed using HUGIN 8.0. We generated a list of KPIs composed of 4 perspectives, 10 strategic objectives, and 31 KPIs. In the validity test of the performance prediction model, the factor with the greatest variance for increasing profit was maximum cost reduction of HCN services. The factor with the smallest variance for increasing profit was a minimum image improvement for HCN. During sensitivity analysis, the probability of the expert group did not affect the sensitivity. Furthermore, simulation of a 10% image improvement predicted the most effective way to increase profit. KPIs of HCN can estimate financial and non-financial performance. The performance prediction model for HCN will be useful to improve performance.
Stress-Related Alterations of Visceral Sensation: Animal Models for Irritable Bowel Syndrome Study
Mulak, Agata; Taché, Yvette
2011-01-01
Stressors of different psychological, physical or immune origin play a critical role in the pathophysiology of irritable bowel syndrome participating in symptoms onset, clinical presentation as well as treatment outcome. Experimental stress models applying a variety of acute and chronic exteroceptive or interoceptive stressors have been developed to target different periods throughout the lifespan of animals to assess the vulnerability, the trigger and perpetuating factors determining stress influence on visceral sensitivity and interactions within the brain-gut axis. Recent evidence points towards adequate construct and face validity of experimental models developed with respect to animals' age, sex, strain differences and specific methodological aspects such as non-invasive monitoring of visceromotor response to colorectal distension as being essential in successful identification and evaluation of novel therapeutic targets aimed at reducing stress-related alterations in visceral sensitivity. Underlying mechanisms of stress-induced modulation of visceral pain involve a combination of peripheral, spinal and supraspinal sensitization based on the nature of the stressors and dysregulation of descending pathways that modulate nociceptive transmission or stress-related analgesic response. PMID:21860814
Translation and validation of the Cardiac Depression Scale to Arabic.
Papasavvas, T; Al-Amin, H; Ghabrash, H F; Micklewright, D
2016-08-01
The Cardiac Depression Scale (CDS) has been designed to measure depressive symptoms in patients with heart disease. There is no Arabic version of the CDS. We translated and validated the CDS in an Arabic sample of patients with heart disease. Forward and back translation of the CDS was followed by assessment of cultural relevance and content validity. The Arabic version of the CDS (A-CDS) and the Arabic version of the Hospital Anxiety and Depression Scale (A-HADS) were then administered to 260 Arab in-patients with heart disease from 18 Arabic countries. Construct validity was assessed using exploratory factor analysis with polychoric correlations. Internal consistency was assessed using ordinal reliability alpha and item-to-factor polychoric correlations. Concurrent validity was assessed using Pearson's correlation coefficient between the A-CDS and the depression subscale of the A-HADS (A-HADS-D). Cultural relevance and content validity of the A-CDS were satisfactory. Exploratory factor analysis revealed three robust factors, without cross-loadings, that formed a single dimension. Internal consistency was high (ordinal reliability alpha for the total scale and the three factors were .94, .91, .86, and .87, respectively; item-to-factor correlations ranged from .77 to .91). Concurrent validity was high (r=.72). The A-CDS demonstrated a closer to normal distribution of scores than the A-HADS-D. Sensitivity and specificity of the A-CDS were not objectively assessed. The A-CDS appears to be a valid and reliable instrument to measure depressive symptoms in a representative sample of Arab in-patients with heart disease. Copyright © 2016 Elsevier B.V. All rights reserved.
Wakeling, Helen C
2007-09-01
This study examined the reliability and validity of the Social Problem-Solving Inventory--Revised (SPSI-R; D'Zurilla, Nezu, & Maydeu-Olivares, 2002) with a population of incarcerated sexual offenders. An availability sample of 499 adult male sexual offenders was used. The SPSI-R had good reliability measured by internal consistency and test-retest reliability, and adequate validity. Construct validity was determined via factor analysis. An exploratory factor analysis extracted a two-factor model. This model was then tested against the theory-driven five-factor model using confirmatory factor analysis. The five-factor model was selected as the better fitting of the two, and confirmed the model according to social problem-solving theory (D'Zurilla & Nezu, 1982). The SPSI-R had good convergent validity; significant correlations were found between SPSI-R subscales and measures of self-esteem, impulsivity, and locus of control. SPSI-R subscales were, however, found to significantly correlate with a measure of socially desirable responding. This finding is discussed in relation to recent research suggesting that impression management may not invalidate self-report measures (e.g. Mills & Kroner, 2005). The SPSI-R was sensitive to sexual offender intervention, with problem-solving improving from pre- to post-treatment in both rapists and child molesters. The study concludes that the SPSI-R is a reasonably internally valid and appropriate tool to assess problem-solving in sexual offenders. However, future research should cross-validate the SPSI-R with other behavioural outcomes to examine the external validity of the measure. Furthermore, future research should utilise a control group to determine treatment impact.
Validity and Reliability Testing of an e-learning Questionnaire for Chemistry Instruction
NASA Astrophysics Data System (ADS)
Guspatni, G.; Kurniawati, Y.
2018-04-01
The aim of this paper is to examine the validity and reliability of a questionnaire used to evaluate e-learning implementation in chemistry instruction. Forty-eight questionnaires were completed by students who had studied chemistry through an e-learning system. The questionnaire consisted of 20 indicators evaluating students’ perceptions of using e-learning. Parametric testing was used because the data were assumed to follow a normal distribution. Item validity was examined through item-total correlation using Pearson’s formula, while reliability was assessed with Cronbach’s alpha. Moreover, convergent validity was assessed to see whether the indicators building a factor shared the same underlying theoretical construct. Validity testing revealed 19 valid indicators, and reliability testing yielded a Cronbach’s alpha of .886. Factor analysis showed that the questionnaire consisted of five factors, each with indicators building the same construct. This article shows the importance of factor analysis for obtaining a construct-valid questionnaire before it is used as a research instrument.
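The two statistics named in this abstract, item-total correlation and Cronbach's alpha, can be sketched in a few lines. This is only an illustration, not the authors' procedure: the response matrix is hypothetical, the function names are invented here, and the item-total correlation is the uncorrected form (each item is included in the total), which the abstract does not specify.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(rows[0])                                   # number of items
    item_var = sum(variance(col) for col in zip(*rows))
    total_var = variance([sum(r) for r in rows])       # variance of total scores
    return k / (k - 1) * (1 - item_var / total_var)


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def item_total_correlations(rows):
    """Uncorrected correlation of each item with the total score."""
    totals = [sum(r) for r in rows]
    return [pearson(col, totals) for col in zip(*rows)]


# Hypothetical Likert responses: 5 respondents x 4 items (not data from the study).
responses = [
    [4.0, 5.0, 4.0, 3.0],
    [3.0, 3.0, 2.0, 3.0],
    [5.0, 5.0, 5.0, 4.0],
    [2.0, 1.0, 2.0, 2.0],
    [4.0, 4.0, 3.0, 5.0],
]
print(cronbach_alpha(responses))
print(item_total_correlations(responses))
```

An item whose correlation with the total falls below the critical value chosen by the analyst would be flagged as invalid; the threshold used in the study is not given in the abstract.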
The Importance of Considering Clinical Utility in the Construction of a Diagnostic Manual.
Mullins-Sweatt, Stephanie N; Lengel, Gregory J; DeShong, Hilary L
2016-01-01
The development of major diagnostic manuals primarily has been guided by construct validity rather than clinical utility. The purpose of this article is to summarize recent research and theory examining the importance of clinical utility when constructing and evaluating a diagnostic manual. We suggest that construct validity is a necessary but not sufficient criterion for diagnostic constructs. This article discusses components of clinical utility and how these have applied to the current and forthcoming diagnostic manuals. Implications and suggestions for future research are provided.
Leenaars, Lindsey; Lester, David
2007-02-01
In a sample of 117 undergraduates, helplessness scores and the discrepancy scores on a measure of perfectionism predicted depression scores, providing evidence for construct validity for the hopelessness, helplessness, and haplessness scales.
Construct validity of the Moral Development Scale for Professionals (MDSP).
Söderhamn, Olle; Bjørnestad, John Olav; Skisland, Anne; Cliffordson, Christina
2011-01-01
The aim of this study was to investigate the construct validity of the Moral Development Scale for Professionals (MDSP) using structural equation modeling. The instrument is a 12-item self-report instrument, developed in the Scandinavian cultural context and based on Kohlberg's theory. A hypothesized simplex structure model underlying the MDSP was tested through structural equation modeling. Validity was also tested as the proportion of respondents older than 20 years that reached the highest moral level, which according to the theory should be small. A convenience sample of 339 nursing students with a mean age of 25.3 years participated. Results confirmed the simplex model structure, indicating that MDSP reflects a moral construct empirically organized from low to high. A minority of respondents >20 years of age (13.5%) scored more than 80% on the highest moral level. The findings support the construct validity of the MDSP and the stages and levels in Kohlberg's theory.
Belone, Lorenda; Lucero, Julie E; Duran, Bonnie; Tafoya, Greg; Baker, Elizabeth A; Chan, Domin; Chang, Charlotte; Greene-Moton, Ella; Kelley, Michele A; Wallerstein, Nina
2016-01-01
A national community-based participatory research (CBPR) team developed a conceptual model of CBPR partnerships to understand the contribution of partnership processes to improved community capacity and health outcomes. With the model primarily developed through academic literature and expert consensus building, we sought community input to assess face validity and acceptability. Our research team conducted semi-structured focus groups with six partnerships nationwide. Participants validated and expanded on existing model constructs and identified new constructs based on "real-world" praxis, resulting in a revised model. Four cross-cutting constructs were identified: trust development, capacity, mutual learning, and power dynamics. By empirically testing the model, we found community face validity and capacity to adapt the model to diverse contexts. We recommend partnerships use and adapt the CBPR model and its constructs, for collective reflection and evaluation, to enhance their partnering practices and achieve their health and research goals. © The Author(s) 2014.
NASA Astrophysics Data System (ADS)
Widodo, W.; Sudibyo, E.; Sari, D. A. P.
2018-04-01
This study aims to develop student worksheets for higher education that apply integrated science learning to issues of motion in humans. The worksheets guide students in solving problems about human movement, for which they must integrate their knowledge of biology, physics, and chemistry. The worksheets were validated by three experts in integrated natural science, particularly the topic of human movement. The aspects validated were the feasibility of the content, the construction, and the language. The validity of each aspect was rated on a Likert scale: 4.00 for very good, 3.00 for good, 2.00 for moderate, and 1.00 for poor validity. The data showed that the validity of each aspect was in the range of good to very good (3.33 to 3.67 for the content aspect, 2.33 to 4.00 for the construction aspect, and 3.33 to 4.00 for the language aspect). However, one part of the construction aspect needed improvement. Overall, the student worksheets can be applied in the classroom after some revisions based on suggestions from the validators.
Vaegter, Katarina Kebbon; Lakic, Tatevik Ghukasyan; Olovsson, Matts; Berglund, Lars; Brodin, Thomas; Holte, Jan
2017-03-01
To construct a prediction model for live birth after in vitro fertilization/intracytoplasmic sperm injection (IVF/ICSI) treatment and single-embryo transfer (SET) after 2 days of embryo culture. Prospective observational cohort study. University-affiliated private infertility center. SET in 8,451 IVF/ICSI treatments in 5,699 unselected consecutive couples during 1999-2014. A total of 100 basal patient characteristics and treatment data were analyzed for associations with live birth after IVF/ICSI (adjusted for repeated treatments) and subsequently combined for prediction model construction. Live birth rate (LBR) and performance of live birth prediction model. Embryo score, treatment history, ovarian sensitivity index (OSI; number of oocytes/total dose of FSH administered), female age, infertility cause, endometrial thickness, and female height were all independent predictors of live birth. A prediction model (training data set; n = 5,722) based on these variables showed moderate discrimination, but predicted LBR with high accuracy in subgroups of patients, with LBR estimates ranging from <10% to >40%. Outcomes were similar in an internal validation data set (n = 2,460). Based on 100 variables prospectively recorded during a 15-year period, a model for live birth prediction after strict SET was constructed and showed excellent calibration in internal validation. For the first time, female height qualified as a predictor of live birth after IVF/ICSI. Copyright © 2016 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Vitola, E S; Bau, C H D; Salum, G A; Horta, B L; Quevedo, L; Barros, F C; Pinheiro, R T; Kieling, C; Rohde, L A; Grevet, E H
2017-03-01
There are still uncertainties about the psychometric validity of the DSM-5 attention deficit hyperactivity disorder (ADHD) criteria for use in the adult population. We aim to describe the adult ADHD phenotype, to test the psychometric properties of the DSM-5 ADHD criteria, and to calculate the resulting prevalence in a population-based sample of adults in their thirties. A cross-sectional evaluation using the DSM-5 ADHD criteria was carried out in 3574 individuals from the 1982 Pelotas Birth Cohort. Through receiver operating characteristic curve, latent, and regression analyses, we obtained parameters on construct and discriminant validity. In addition, prevalence rates were calculated for different sets of criteria. The latent analysis suggested that the adult ADHD phenotype is constituted mainly by inattentive symptoms. Inattention symptoms were also the symptoms most associated with impairment. The best cut-off for diagnosis was four symptoms, but sensitivity and specificity for this cut-off were low. ADHD prevalence rates were 2.1% for DSM-5 ADHD criteria and 5.8% for ADHD disregarding the age-of-onset criterion. The bi-dimensional ADHD structure proposed by the DSM demonstrated both construct and discriminant validity problems when used in the adult population, since inattention is a much more relevant feature in the adult phenotype. The use of the DSM-5 criteria results in a higher prevalence of ADHD than that obtained with DSM-IV, and prevalence would increase almost threefold when considering the current ADHD syndrome. These findings suggest a need for further refinement of the criteria for use in the adult population.
Kutlay, Sehim; Kuçukdeveci, Ayse A; Elhan, Atilla H; Yavuzer, Gunes; Tennant, Alan
2007-02-28
Assessment of cognitive impairment with a valid cognitive screening tool is essential in neurorehabilitation. The aim of this study was to test the reliability and validity of the Turkish-adapted version of the Middlesex Elderly Assessment of Mental State (MEAMS) among acquired brain injury patients in Turkey. Some 155 patients with acquired brain injury admitted for rehabilitation were assessed by the adapted version of MEAMS at admission and discharge. Reliability was tested by internal consistency, intra-class correlation coefficient (ICC) and person separation index; internal construct validity by Rasch analysis; external construct validity by associations with physical and cognitive disability (FIM); and responsiveness by Effect Size. Reliability was found to be good with Cronbach's alpha of 0.82 at both admission and discharge; and likewise an ICC of 0.80. Person separation index was 0.813. Internal construct validity was good by fit of the data to the Rasch model (mean item fit -0.178; SD 1.019). Items were substantially free of differential item functioning. External construct validity was confirmed by expected associations with physical and cognitive disability. Effect size was 0.42 compared with 0.22 for cognitive FIM. The reliability and validity of the Turkish version of MEAMS as a cognitive impairment screening tool in acquired brain injury has been demonstrated.
Simulation-based training for prostate surgery.
Khan, Raheej; Aydin, Abdullatif; Khan, Muhammad Shamim; Dasgupta, Prokar; Ahmed, Kamran
2015-10-01
To identify and review the currently available simulators for prostate surgery and to explore the evidence supporting their validity for training purposes. A review of the literature between 1999 and 2014 was performed. The search terms included a combination of urology, prostate surgery, robotic prostatectomy, laparoscopic prostatectomy, transurethral resection of the prostate (TURP), simulation, virtual reality, animal model, human cadavers, training, assessment, technical skills, validation and learning curves. Furthermore, relevant abstracts from the American Urological Association, European Association of Urology, British Association of Urological Surgeons and World Congress of Endourology meetings, between 1999 and 2013, were included. Only studies related to prostate surgery simulators were included; studies regarding other urological simulators were excluded. A total of 22 studies that carried out a validation study were identified. Five validated models and/or simulators were identified for TURP, one for photoselective vaporisation of the prostate, two for holmium enucleation of the prostate, three for laparoscopic radical prostatectomy (LRP) and four for robot-assisted surgery. Of the TURP simulators, all five have demonstrated content validity, three face validity and four construct validity. The GreenLight laser simulator has demonstrated face, content and construct validities. The Kansai HoLEP Simulator has demonstrated face and content validity whilst the UroSim HoLEP Simulator has demonstrated face, content and construct validity. All three animal models for LRP have been shown to have construct validity whilst the chicken skin model was also content valid. Only two robotic simulators were identified with relevance to robot-assisted laparoscopic prostatectomy, both of which demonstrated construct validity. 
A wide range of different simulators are available for prostate surgery, including synthetic bench models, virtual-reality platforms, animal models, human cadavers, distributed simulation and advanced training programmes and modules. The currently validated simulators can be used by healthcare organisations to provide supplementary training sessions for trainee surgeons. Further research should be conducted to validate simulated environments, to determine which simulators have greater efficacy than others and to assess the cost-effectiveness of the simulators and the transferability of skills learnt. With surgeons investigating new possibilities for easily reproducible and valid methods of training, simulation offers great scope for implementation alongside traditional methods of training. © 2014 The Authors BJU International © 2014 BJU International Published by John Wiley & Sons Ltd.
The Resilience Scale for Adults: Construct Validity and Measurement in a Belgian Sample
ERIC Educational Resources Information Center
Hjemdal, Odin; Friborg, Oddgeir; Braun, Stephanie; Kempenaers, Chantal; Linkowski, Paul; Fossion, Pierre
2011-01-01
The Resilience Scale for Adults (RSA) was developed and has been extensively validated in Norwegian samples. The purpose of this study was to explore the construct validity of the Resilience Scale for Adults in a French-speaking Belgian sample and test measurement invariance between the Belgian and a Norwegian sample. A Belgian student sample (N =…
ERIC Educational Resources Information Center
McGill, D. A.; van der Vleuten, C. P. M.; Clarke, M. J.
2013-01-01
Supervisor assessments are critical for both formative and summative assessment in the workplace. Supervisor ratings remain an important source of such assessment in many educational jurisdictions even though there is ambiguity about their validity and reliability. The aims of this evaluation are to explore the: (1) construct validity of ward-based…
ERIC Educational Resources Information Center
Sawaki, Yasuyo
2007-01-01
This is a construct validation study of a second language speaking assessment that reported a language profile based on analytic rating scales and a composite score. The study addressed three key issues: score dependability, convergent/discriminant validity of analytic rating scales and the weighting of analytic ratings in the composite score.…
ERIC Educational Resources Information Center
Hatala, Rose; Cook, David A.; Brydges, Ryan; Hawkins, Richard
2015-01-01
In order to construct and evaluate the validity argument for the Objective Structured Assessment of Technical Skills (OSATS), based on Kane's framework, we conducted a systematic review. We searched MEDLINE, EMBASE, CINAHL, PsycINFO, ERIC, Web of Science, Scopus, and selected reference lists through February 2013. Working in duplicate, we selected…
Buntragulpoontawee, Montana; Phutrit, Suphatha; Tongprasert, Siam; Wongpakaran, Tinakon; Khunachiva, Jeeranan
2018-03-27
This study evaluated additional psychometric properties of the Thai version of the Disabilities of the Arm, Shoulder and Hand questionnaire (DASH-TH), including test-retest reliability, construct validity, and internal consistency, in patients with carpal tunnel syndrome. To determine construct validity, the Thai EuroQOL questionnaire (EQ-5D-5L) was also administered in order to examine convergent and divergent validity. Fifty patients completed both questionnaires. The DASH-TH showed excellent test-retest reliability (intraclass correlation coefficient = 0.811) and internal consistency (Cronbach's alpha = 0.911). The exploratory factor analysis yielded a six-factor solution, while the confirmatory factor analysis indicated that the hypothesized model adequately fit the data, with a comparative fit index of 0.967 and a Tucker-Lewis index of 0.964. The related subscales of the DASH-TH and the Thai EQ-5D-5L were significantly correlated, indicating the DASH-TH's convergent and discriminant validity. The DASH-TH demonstrated good reliability, internal consistency, construct validity, and multidimensionality in assessing upper extremity function in carpal tunnel syndrome patients.
Yen, Po-Yin; Sousa, Karen H; Bakken, Suzanne
2014-10-01
In a previous study, we developed the Health Information Technology Usability Evaluation Scale (Health-ITUES), which is designed to support customization at the item level. Such customization matches the specific tasks/expectations of a health IT system while retaining comparability at the construct level, and provides evidence of its factorial validity and internal consistency reliability through exploratory factor analysis. In this study, we advanced the development of Health-ITUES to examine its construct validity and predictive validity. The health IT system studied was a web-based communication system that supported nurse staffing and scheduling. Using Health-ITUES, we conducted a cross-sectional study to evaluate users' perception toward the web-based communication system after system implementation. We examined Health-ITUES's construct validity through first and second order confirmatory factor analysis (CFA), and its predictive validity via structural equation modeling (SEM). The sample comprised 541 staff nurses in two healthcare organizations. The CFA (n=165) showed that a general usability factor accounted for 78.1%, 93.4%, 51.0%, and 39.9% of the explained variance in 'Quality of Work Life', 'Perceived Usefulness', 'Perceived Ease of Use', and 'User Control', respectively. The SEM (n=541) supported the predictive validity of Health-ITUES, explaining 64% of the variance in intention for system use. The results of CFA and SEM provide additional evidence for the construct and predictive validity of Health-ITUES. The customizability of Health-ITUES has the potential to support comparisons at the construct level, while allowing variation at the item level. We also illustrate application of Health-ITUES across stages of system development. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Development and validation study of the Smartphone Overuse Screening Questionnaire.
Lee, Han-Kyeong; Kim, Ji-Hae; Fava, Maurizio; Mischoulon, David; Park, Jae-Hyun; Shim, Eun-Jung; Lee, Eun-Ho; Lee, Ji Hyeon; Jeon, Hong Jin
2017-11-01
The aim of this study was to develop a screening questionnaire that could distinguish individuals at high risk of smartphone overuse from casual users. The reliability, validity, and diagnostic ability of the Smartphone Overuse Screening Questionnaire (SOS-Q) were evaluated. Preliminary items were assessed online by 50 addiction experts, and 28 questions were selected. A total of 158 subjects recruited from six community centers for internet addiction participated in this study. The SOS-Q, Young's internet addiction scale, Korean scale for internet addiction, and Smartphone Scale for Smartphone Addiction (S-Scale) were used to assess the concurrent validity. Construct validity was supported by a six-factor model using an exploratory factor analysis. The internal consistency and the item-total correlations were favorable (α = 0.95, r = 0.35-0.81). The test-retest reliability was moderate (r = 0.70). The SOS-Q showed superior concurrent validity, with the highest correlation being with the S-Scale (r = 0.76). Receiver operating characteristic curve analysis revealed an area under the curve of 0.877. A cut-off point of 49 effectively identified the high-risk group for addiction, with a sensitivity of 0.81 and a specificity of 0.86. Overall, the current study supports the use of the SOS-Q as both a primary and supplementary measurement tool in a variety of settings. Copyright © 2017 Elsevier B.V. All rights reserved.
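As a rough illustration of how sensitivity and specificity follow from a screening cut-off such as the one reported above, the sketch below classifies hypothetical scores against a threshold. The scores and labels are invented, the function name is hypothetical, and treating a score at or above the cut-off as positive is an assumption, since the abstract does not say whether the boundary is inclusive.

```python
def sensitivity_specificity(scores, labels, cutoff):
    """Return (sensitivity, specificity) when score >= cutoff counts as positive.

    labels: 1 for a true high-risk case, 0 for a non-case.
    """
    tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
    fn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 1)
    tn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 0)
    fp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 0)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical questionnaire totals and reference classifications (not study data).
scores = [30, 45, 49, 52, 60, 40, 55, 48]
labels = [0, 0, 1, 1, 1, 0, 1, 1]

sens, spec = sensitivity_specificity(scores, labels, cutoff=49)
print(sens, spec)
```

Repeating the calculation over every candidate cut-off and plotting sensitivity against 1 - specificity gives the receiver operating characteristic curve whose area the abstract reports.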
Alves, Vinicius M.; Muratov, Eugene; Fourches, Denis; Strickland, Judy; Kleinstreuer, Nicole; Andrade, Carolina H.; Tropsha, Alexander
2015-01-01
Repetitive exposure to a chemical agent can induce an immune reaction in inherently susceptible individuals that leads to skin sensitization. Although many chemicals have been reported as skin sensitizers, there have been very few rigorously validated QSAR models with defined applicability domains (AD) that were developed using a large group of chemically diverse compounds. In this study, we have aimed to compile, curate, and integrate the largest publicly available dataset related to chemically-induced skin sensitization, use this data to generate rigorously validated QSAR models for skin sensitization, and employ these models as a virtual screening tool for identifying putative sensitizers among environmental chemicals. We followed best practices for model building and validation implemented with our predictive QSAR workflow using the random forest modeling technique in combination with SiRMS and Dragon descriptors. The Correct Classification Rate (CCR) for QSAR models discriminating sensitizers from non-sensitizers was 71–88% when evaluated on several external validation sets, within a broad AD, with positive (for sensitizers) and negative (for non-sensitizers) predicted rates of 85% and 79%, respectively. When compared to the skin sensitization module included in the OECD QSAR toolbox as well as to the skin sensitization model in publicly available VEGA software, our models showed a significantly higher prediction accuracy for the same sets of external compounds as evaluated by Positive Predicted Rate, Negative Predicted Rate, and CCR. These models were applied to identify putative chemical hazards in the ScoreCard database of possible skin or sense organ toxicants as primary candidates for experimental validation. PMID:25560674
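The three external-validation metrics named in this abstract can be computed from a binary confusion matrix. The sketch below assumes the usual definition of the Correct Classification Rate as the mean of sensitivity and specificity (balanced accuracy) and reads the positive and negative predicted rates as positive and negative predictive values; the counts and the function name are hypothetical.

```python
def classification_rates(tp, fn, tn, fp):
    """Return CCR, positive predicted rate and negative predicted rate.

    tp/fn: sensitizers predicted correctly / incorrectly.
    tn/fp: non-sensitizers predicted correctly / incorrectly.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    ccr = 0.5 * (sensitivity + specificity)   # assumed definition: balanced accuracy
    ppr = tp / (tp + fp)                      # share of predicted sensitizers that are real
    npr = tn / (tn + fn)                      # share of predicted non-sensitizers that are real
    return ccr, ppr, npr


# Hypothetical external validation set (not data from the study).
ccr, ppr, npr = classification_rates(tp=40, fn=10, tn=45, fp=5)
print(ccr, ppr, npr)
```

Averaging the two class-wise rates keeps the score from being dominated by whichever class is more common in the validation set.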
The Validity of Conscientiousness Is Overestimated in the Prediction of Job Performance.
Kepes, Sven; McDaniel, Michael A
2015-01-01
Sensitivity analyses refer to investigations of the degree to which the results of a meta-analysis remain stable when conditions of the data or the analysis change. To the extent that results remain stable, one can refer to them as robust. Sensitivity analyses are rarely conducted in the organizational science literature. Despite conscientiousness being a valued predictor in employment selection, sensitivity analyses have not been conducted with respect to meta-analytic estimates of the correlation (i.e., validity) between conscientiousness and job performance. To address this deficiency, we reanalyzed the largest collection of conscientiousness validity data in the personnel selection literature and conducted a variety of sensitivity analyses. Publication bias analyses demonstrated that the validity of conscientiousness is moderately overestimated (by around 30%; a correlation difference of about .06). The misestimation of the validity appears to be due primarily to suppression of small effect sizes in the journal literature. These inflated validity estimates result in an overestimate of the dollar utility of personnel selection by millions of dollars and should be of considerable concern for organizations. The fields of management and applied psychology seldom conduct sensitivity analyses. Through the use of sensitivity analyses, this paper documents that the existing literature overestimates the validity of conscientiousness in the prediction of job performance. Our data show that effect sizes from journal articles are largely responsible for this overestimation.
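One simple form of sensitivity analysis for a meta-analysis is a leave-one-out check: the pooled estimate is recomputed with each study removed in turn to see how stable it is. The sketch below is not the publication bias analysis used in the paper; it is a minimal fixed-effect pooling of correlations through Fisher's z with weights n - 3, applied to hypothetical (r, n) pairs, with function names invented for the example.

```python
import math


def pooled_r(studies):
    """Fixed-effect pooled correlation from (r, n) pairs via Fisher's z."""
    num = sum((n - 3) * math.atanh(r) for r, n in studies)
    den = sum(n - 3 for _, n in studies)
    return math.tanh(num / den)


def leave_one_out(studies):
    """Pooled correlation with each study omitted in turn."""
    return [pooled_r(studies[:i] + studies[i + 1:]) for i in range(len(studies))]


# Hypothetical validity coefficients and sample sizes (not data from the paper).
studies = [(0.10, 40), (0.30, 60), (0.20, 80), (0.25, 120)]

print(pooled_r(studies))
for (r, n), estimate in zip(studies, leave_one_out(studies)):
    print(f"without r={r:.2f}, n={n}: pooled r = {estimate:.3f}")
```

If dropping any single study shifts the pooled value substantially, the overall estimate would not be considered robust in the sense described above.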
The Validity of Conscientiousness Is Overestimated in the Prediction of Job Performance
2015-01-01
Introduction: Sensitivity analyses refer to investigations of the degree to which the results of a meta-analysis remain stable when conditions of the data or the analysis change. To the extent that results remain stable, one can refer to them as robust. Sensitivity analyses are rarely conducted in the organizational science literature. Despite conscientiousness being a valued predictor in employment selection, sensitivity analyses have not been conducted with respect to meta-analytic estimates of the correlation (i.e., validity) between conscientiousness and job performance. Methods: To address this deficiency, we reanalyzed the largest collection of conscientiousness validity data in the personnel selection literature and conducted a variety of sensitivity analyses. Results: Publication bias analyses demonstrated that the validity of conscientiousness is moderately overestimated (by around 30%; a correlation difference of about .06). The misestimation of the validity appears to be due primarily to suppression of small effect sizes in the journal literature. These inflated validity estimates result in an overestimate of the dollar utility of personnel selection by millions of dollars and should be of considerable concern for organizations. Conclusion: The fields of management and applied psychology seldom conduct sensitivity analyses. Through the use of sensitivity analyses, this paper documents that the existing literature overestimates the validity of conscientiousness in the prediction of job performance. Our data show that effect sizes from journal articles are largely responsible for this overestimation. PMID:26517553
Bullying DEOCS 4.1 Construct Validity Summary
2017-08-01
Bullying DEOCS 4.1 Construct Validity Summary DEFENSE EQUAL OPPORTUNITY MANAGEMENT INSTITUTE DIRECTORATE OF...excessive or abusive use of water; the forced consumption of food, alcohol, drugs, or any other substance; and degrading or damaging the person or his
Esbensen, A J; Hoffman, E K; Stansberry, E; Shaffer, R
2018-04-01
There is a need for rigorous measures of sleep in children with Down syndrome as sleep is a substantial problem in this population and there are barriers to obtaining the gold standard polysomnography (PSG). PSG is cost-prohibitive when measuring treatment effects in some clinical trials, and children with Down syndrome may not cooperate with undergoing a PSG. Minimal information is available on the validity of alternative methods of assessing sleep in children with Down syndrome, such as actigraphy and parent ratings. Our study examined the concurrent and convergent validity of different measures of sleep, including PSG, actigraphy and parent reports of sleep among children with Down syndrome. A clinic (n = 27) and a community (n = 47) sample of children with Down syndrome were examined. In clinic, children with Down syndrome wore an actigraph watch during a routine PSG. In the community, children with Down syndrome wore an actigraph watch for a week at home at night as part of a larger study on sleep and behaviour. Their parent completed ratings of the child's sleep during that same week. Actigraph watches demonstrated convergent validity with PSG when measuring a child with Down syndrome's total amount of sleep time, total wake time after sleep onset and sleep period efficiency. In contrast, actigraph watches demonstrated poor correlations with parent reports of sleep, and with PSG when measuring the total time in bed and total wake episodes. Actigraphy, PSG and parent ratings of sleep demonstrated poor concurrent validity with clinical diagnosis of obstructive sleep apnoea. Our current data suggest that actigraph watches demonstrate convergent validity and are sensitive to measuring certain sleep constructs (duration, efficiency) in children with Down syndrome. However, parent reports, such as the Children's Sleep Habits Questionnaire, may be measuring other sleep constructs. 
These findings highlight the importance of selecting measures of sleep related to target concerns. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
He, Jinbo; Zhu, Hong; Luo, Xingwei; Cai, Taisheng; Wu, Siyao; Lu, Yao
2016-06-01
The Impact of Weight on Quality of Life for Kids (IWQOL-Kids) is the first self-report questionnaire for assessing weight-related quality of life for youth. However, there is no Chinese version of IWQOL-Kids. Thus, the objective of this research was to translate IWQOL-Kids into Mandarin and evaluate its psychometric properties in a large school-based sample. The total sample included 2282 participants aged 11-18 years old, including 1703 non-overweight, 386 overweight and 193 obese students. IWQOL-Kids was translated and culturally adapted by following the international guidelines for instrument linguistic validation procedures. The psychometric evaluation included internal consistency, test-retest reliability, exploratory factor analysis (EFA), confirmatory factor analysis (CFA), convergent validity and discriminant validity. Cronbach's α for the Chinese version of IWQOL-Kids (IWQOL-Kids-C) was 0.956 and ranged from 0.891 to 0.927 for subscales. IWQOL-Kids-C showed a test-retest coefficient of 0.937 after 2 weeks and ranged from 0.847 to 0.903 for subscales. The original four-factor model was reproduced by EFA after seven iterations, accounting for 69.28% of the total variance. CFA demonstrated that the four-factor model had good fit indices with comparative fit index = 0.92, normed fit index = 0.91, goodness of fit index = 0.86, root mean square error of approximation = 0.07 and root mean square residual = 0.03. Convergent validity and discriminant validity were demonstrated with higher correlations between similar constructs and lower correlations between dissimilar constructs of IWQOL-Kids-C and PedsQL™ 4.0. The significant differences were found across the body mass index groups, and IWQOL-Kids-C had higher effect sizes than PedsQL™4.0 when comparing non-overweight and obese groups, supporting the sensitivity of IWQOL-Kids-C. 
IWQOL-Kids-C is a satisfactory, valid and reliable instrument to assess weight-related quality of life for Chinese children and adolescents aged 11-18 years old. © The Author 2015. Published by Oxford University Press on behalf of Faculty of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Validation of a multi-criteria evaluation model for animal welfare.
Martín, P; Czycholl, I; Buxadé, C; Krieter, J
2017-04-01
The aim of this paper was to validate an alternative multi-criteria evaluation system to assess animal welfare on farms based on the Welfare Quality® (WQ) project, using an example of welfare assessment of growing pigs. This alternative methodology aimed to be more transparent for stakeholders and more flexible than the methodology proposed by WQ. The WQ assessment protocol for growing pigs was implemented to collect data on different farms in Schleswig-Holstein, Germany. In total, 44 observations were carried out. The aggregation system proposed in the WQ protocol follows a three-step aggregation process: measures are aggregated into criteria, criteria into principles and principles into an overall assessment. This study focussed on the first two steps of the aggregation. Multi-attribute utility theory (MAUT) was used to produce a value of welfare for each criterion and principle. The utility functions and the aggregation function were constructed in two separate steps. The MACBETH (Measuring Attractiveness by a Categorical-Based Evaluation Technique) method was used for utility function determination and the Choquet integral (CI) was used as an aggregation operator. The WQ decision-makers' preferences were fitted in order to construct the utility functions and to determine the CI parameters. The validation of the MAUT model was divided into two steps: first, the results of the model were compared with the results of the WQ project at criteria and principle level; second, a sensitivity analysis of the model was carried out to demonstrate the relative importance of welfare measures in the different steps of the multi-criteria aggregation process. Using the MAUT, similar results were obtained to those obtained when applying the WQ protocol aggregation methods, both at criteria and principle level. Thus, this model could be implemented to produce an overall assessment of animal welfare in the context of the WQ protocol for growing pigs.
Furthermore, this methodology could also be used as a framework to produce an overall assessment of welfare for other livestock species. Two main findings were obtained from the sensitivity analysis: first, a limited number of measures had a strong influence on improving or worsening the level of welfare at criteria level; second, the MAUT model was not very sensitive to an improvement in or a worsening of single welfare measures at principle level. The use of weighted sums and the conversion of disease measures into ordinal scores should be reconsidered.
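The Choquet integral named above as the aggregation operator can be illustrated with a minimal sketch. The function follows the standard discrete definition; the measure names, utility values and capacity values are hypothetical and are not taken from the study, whose fitted CI parameters are not reported in the abstract.

```python
from typing import Dict, FrozenSet


def choquet_integral(utilities: Dict[str, float],
                     capacity: Dict[FrozenSet[str], float]) -> float:
    """Discrete Choquet integral of `utilities` with respect to `capacity`.

    `capacity` maps coalitions of measures (frozensets of names) to values
    in [0, 1]; it should be monotone and give 1.0 to the full set.
    """
    # Order the measures from lowest to highest utility.
    ordered = sorted(utilities, key=utilities.get)
    total = 0.0
    previous = 0.0
    for index, name in enumerate(ordered):
        # Coalition of all measures whose utility is at least the current one.
        coalition = frozenset(ordered[index:])
        total += (utilities[name] - previous) * capacity[coalition]
        previous = utilities[name]
    return total


# Hypothetical utilities for three welfare measures (0 = worst, 1 = best).
utilities = {"measure_a": 0.2, "measure_b": 0.6, "measure_c": 0.9}

# Hypothetical capacity; only the coalitions needed for this ordering are listed.
capacity = {
    frozenset({"measure_a", "measure_b", "measure_c"}): 1.0,
    frozenset({"measure_b", "measure_c"}): 0.7,
    frozenset({"measure_c"}): 0.3,
}

criterion_score = choquet_integral(utilities, capacity)
print(round(criterion_score, 2))
```

When the capacity is additive (each coalition's value is the sum of its members' weights), this reduces to an ordinary weighted sum; a non-additive capacity is what lets the operator express interaction between measures.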
2013-01-01
Background Antibiotic overuse is a global public health issue influenced by several factors, of which some are parent-related psychosocial factors that can only be measured using valid and reliable psychosocial measurement instruments. The PAPA scale was developed to measure these factors and the content validity of this instrument was assessed. Aim This study further validated the recently developed instrument in terms of (1) face validity and (2) construct validity, including deciding the number and nature of factors, and item selection. Methods Questionnaires were self-administered to parents of children between the ages of 0 and 12 years old. Parents were conveniently recruited from schools’ parental meetings in the Eastern Province, Saudi Arabia. Face validity was assessed with regard to questionnaire clarity and unambiguity. Construct validity and item selection processes were conducted using exploratory factor analysis. Results Parallel analysis and exploratory factor analysis using principal axis factoring produced six factors in the developed instrument: knowledge and beliefs, behaviours, sources of information, adherence, awareness about antibiotic resistance, and parents’ perception regarding doctors’ prescribing behaviours. Reliability was assessed (Cronbach’s alpha = 0.78), which indicates that the instrument is reliable. Conclusion The ‘factors’ produced in this study coincide with the constructs contextually identified in the development phase of other instruments used to study antibiotic use. However, no other study considering perceptions of antibiotic use has gone beyond content validation of such instruments. This study is the first to assess the construct validity of the factors underlying perceptions regarding antibiotic use in any population and in parents in particular. PMID:23497151
Mukherjee, Sharmila; Aneja, Satinder; Russell, Paul S S; Gulati, Sheffali; Deshmukh, Vaishali; Sagar, Rajesh; Silberberg, Donald; Bhutani, Vinod K; Pinto, Jennifer M; Durkin, Maureen; Pandey, Ravindra M; Nair, M K C; Arora, Narendra K
2014-06-01
To develop and validate the INCLEN Diagnostic Tool for Attention Deficit Hyperactivity Disorder (INDT-ADHD). Diagnostic test evaluation by cross-sectional design. Tertiary care pediatric centers. 156 children aged 65-117 months. After randomization, INDT-ADHD and Conners 3 Parent Rating Scale (C3PS) were administered, followed by an expert evaluation by DSM-IV-TR diagnostic criteria. Psychometric evaluation of diagnostic accuracy, validity (construct, criterion and convergent) and internal consistency. INDT-ADHD had 18 items that quantified symptoms and impairment. Attention deficit hyperactivity disorder was identified in 57, 87 and 116 children by expert evaluation, INDT-ADHD and C3PS, respectively. Psychometric parameters of INDT-ADHD for differentiating attention deficit hyperactivity disorder from normal children were: sensitivity 87.7%, specificity 97.2%, positive predictive value 98.0% and negative predictive value 83.3%, whereas for differentiating it from other neuro-developmental disorders they were 87.7%, 42.9%, 58.1% and 79.4%, respectively. Internal consistency was 0.91. INDT-ADHD has a 4-factor structure explaining 60.4% of the variance. Convergent validity with the Conners Parent Rating Scale was moderate (r = 0.73, P = 0.001). INDT-ADHD is suitable for diagnosing attention deficit hyperactivity disorder in Indian children between the ages of 6 and 9 years.
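The four accuracy figures quoted above all derive from a 2x2 table of test result against reference diagnosis. The sketch below shows the arithmetic. The cell counts are an assumption: the abstract does not report them, and they were back-calculated here because they reproduce the reported percentages and are consistent with the reported totals (57 expert-diagnosed cases, 87 INDT-ADHD positives, 156 children).

```python
def diagnostic_metrics(tp: int, fn: int, fp: int, tn: int) -> dict:
    """Return sensitivity, specificity, PPV and NPV from 2x2 cell counts."""
    return {
        "sensitivity": tp / (tp + fn),  # true positives among all with the condition
        "specificity": tn / (tn + fp),  # true negatives among all without the condition
        "ppv": tp / (tp + fp),          # true positives among all test positives
        "npv": tn / (tn + fn),          # true negatives among all test negatives
    }


# Assumed (back-calculated) counts, not stated in the abstract.
adhd_vs_normal = diagnostic_metrics(tp=50, fn=7, fp=1, tn=35)
adhd_vs_other_ndd = diagnostic_metrics(tp=50, fn=7, fp=36, tn=27)

for label, metrics in [("ADHD vs normal", adhd_vs_normal),
                       ("ADHD vs other NDD", adhd_vs_other_ndd)]:
    print(label, {name: round(100 * value, 1) for name, value in metrics.items()})
```

Sensitivity is identical in both comparisons because the same 57 expert-diagnosed children form the positive group; only the comparison group changes, which is why specificity and the predictive values differ.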
Development and Validation Study of the Internet Overuse Screening Questionnaire
Lee, Han-Kyeong; Lee, Hae-Woo; Han, Joo Hyun; Park, Subin; Ju, Seok-Jin; Choi, Kwanwoo; Lee, Ji Hyeon; Jeon, Hong Jin
2018-01-01
Objective Concerns over behavioral and emotional problems caused by excessive internet usage have grown. This study aimed to develop and standardize a questionnaire that can efficiently identify at-risk internet users through their internet usage habits. Methods Participants (n=158) were recruited at six I-will-centers located in Seoul, South Korea. From the initial 36-item questionnaire pool, 28 preliminary items were selected through expert evaluation and panel discussions. The construct validity, internal consistency, and concurrent validity were examined. We also conducted Receiver Operating Characteristic (ROC) curve analysis to assess the diagnostic ability of the Internet Overuse Screening-Questionnaire (IOS-Q). Results The exploratory factor analysis yielded a five-factor structure. Four factors with 17 items remained after items that had unclear factor loadings were removed. The Cronbach’s alpha for the IOS-Q total score was 0.91, and test-retest reliability was 0.72. The correlation between Young’s internet addiction scale and K-scale supported concurrent validity. ROC analysis showed that the IOS-Q has superior diagnostic ability, with an Area Under the Curve of 0.87. At the cut-off point of 25.5, the sensitivity was 0.93 and specificity was 0.86. Conclusion Overall, this study supports the use of the IOS-Q for internet addiction research and for screening high-risk individuals. PMID:29669406
ARDAKANI, Abolfazl; SEGHATOLESLAM, Tahereh; HABIL, Hussain; JAMEEI, Fahimeh; RASHID, Rusdi; ZAHIRODIN, Alireza; MOTLAQ, Farid; MASJIDI ARANI, Abbas
2016-01-01
Background: Given that validity is the foundation of psychological assessment, evidence for the construct validity of such scales is needed to support clinicians in evaluating psychiatric morbidity in psychiatric and psychosomatic settings. Methods: This comparative cross-sectional study aimed to investigate the construct validity of the Malaysian version of the GHQ-28 and the SCL-90-R. The sample comprised 660 individuals including diabetics, drug dependents, and normal population. The research scales were administered to the participants. Convergent and discriminant validity of both scales were investigated by Confirmatory Factor Analysis (CFA) using AMOS. The Pearson correlation coefficient was used to assess the relationship between the two scales. Results: The internal consistency of the GHQ-28 and SCL-90-R was highly acceptable, and confirmatory factor analysis confirmed the convergent validity of both scales. The results of this study revealed that the construct validity of the GHQ-28 was acceptable, whereas the discriminant validity of the SCL-90-R was not adequate. According to the Pearson correlation coefficient, the relationships between three common subscales of the GHQ-28 and SCL-90-R were significantly positive: somatization (r=0.671, P<0.01), anxiety (r=0.728, P<0.01), and depression (r=0.660, P<0.01). Conclusions: This study replicated the construct of the Malaysian version of the GHQ-28, yet failed to support the nine-factor structure of the SCL-90-R. Therefore, the multidimensionality of the SCL-90-R for clinical purposes is questionable, and it may be better used as a unitary measure for assessing and screening mental disorders. Further research needs to be carried out to confirm this finding. PMID:27252914
Kwan, Yu Heng; Fong, Warren Weng Seng; Lui, Nai Lee; Yong, Si Ting; Cheung, Yin Bun; Malhotra, Rahul; Østbye, Truls; Thumboo, Julian
2016-12-01
The Short Form 36 Health Survey (SF-36) is a popular health-related quality of life (HrQoL) tool. However, few studies have assessed its psychometric properties in patients with spondyloarthritis (SpA). We therefore aimed to assess the reliability and validity of the SF-36 in patients with SpA in Singapore. Cross-sectional data from a registry of 196 SpA patients recruited from a dedicated tertiary referral clinic in Singapore from 2011 to 2014 were used. Analyses were guided by the COnsensus-based Standards for the selection of health Measurement INstruments framework. Internal consistency reliability was assessed using Cronbach's alpha. Construct validity was assessed through 33 a priori hypotheses by correlations of the eight subscales and two summary scores of the SF-36 with other health outcomes. Known-group construct validity was assessed by comparison of the means of the subscales and summary scores of the SF-36 of SpA patients and the general population of Singapore using Student's t tests. Among 196 patients (155 males (79.0%), median (range) age: 36 (17-70), 166 Chinese (84.6%)), SF-36 scales showed high internal consistency ranging from 0.88 to 0.90. Convergent construct validity was supported as shown by fulfillment of all hypotheses. Divergent construct validity was supported, as SF-36 MCS was not associated with PGA, pain and HAQ. Known-group construct validity showed SpA patients had lower scores of 3.8-12.5 when compared to the general population at p < 0.001. This study supports the SF-36 as a valid and reliable measure of HrQoL for use in patients with SpA at a single time point.
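Cronbach's alpha, used above as the measure of internal consistency, can be computed from a respondents-by-items score matrix. The sketch below uses the usual formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores). The response data are hypothetical and unrelated to the SF-36 registry data.

```python
from statistics import variance
from typing import List, Sequence


def cronbach_alpha(responses: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha for rows of respondents and columns of items."""
    n_items = len(responses[0])
    if n_items < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item (column) across respondents.
    item_variances: List[float] = [
        variance([row[j] for row in responses]) for j in range(n_items)
    ]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical answers from five respondents to a three-item scale.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [3, 3, 4],
    [5, 5, 5],
]

alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```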
Sureshkumar, Premala; Cumming, Robert G; Craig, Jonathan C
2006-06-01
We describe the validity and reliability of a questionnaire designed to determine frequency, severity and risk factors of urinary tract infection and daytime urinary incontinence in primary school-age children. Based on published validated questionnaires and advice from content experts, a questionnaire was developed and piloted in children attending outpatient clinics. Construct validity for parent report of frequency and severity of daytime urinary incontinence was tested by comparison with a daily accident diary in 52 primary school children, and criterion validity of parent report for UTI was verified by comparison with the reference standard (urine culture) in 100 primary school children. Test-retest reliability of the questionnaire was assessed in 106 children from primary schools. There was excellent agreement between the questionnaire and accident diary in severity (weighted kappa 0.94, 95% confidence intervals 0.85 to 1.03) and frequency of daytime urinary incontinence (0.88, 0.7 to 1.0). Parents reported urinary tract infection in 15% of children, compared to a positive urine culture in 8% (sensitivity 100% and specificity 68.5%). Test-retest reliability of the questionnaire was excellent (mean k 0.78, range 0.61 to 1.00). Parents overreport UTI by about 2-fold but can recall frequency and severity of daytime urinary incontinence well during a 3-month period. The developed questionnaire is a valid tool to estimate frequency, severity and risk factors of daytime urinary incontinence and UTI in primary school children.
Construct Validation Theory Applied to the Study of Personality Dysfunction
Zapolski, Tamika C. B.; Guller, Leila; Smith, Gregory T.
2013-01-01
The authors review theory validation and construct validation principles as related to the study of personality dysfunction. Historically, personality disorders have been understood to be syndromes of heterogeneous symptoms. The authors argue that the syndrome approach to description results in diagnoses of unclear meaning and constrained validity. The alternative approach of describing personality dysfunction in terms of homogeneous dimensions of functioning avoids the problems of the syndromal approach and has been shown to provide more valid description and diagnosis. The authors further argue that description based on homogeneous dimensions of personality function/dysfunction is more useful, because it provides direct connections to validated treatments. PMID:22321263
Integrating Validity Theory with Use of Measurement Instruments in Clinical Settings
Kelly, P Adam; O'Malley, Kimberly J; Kallen, Michael A; Ford, Marvella E
2005-01-01
Objective To present validity concepts in a conceptual framework useful for research in clinical settings. Principal Findings We present a three-level decision rubric for validating measurement instruments, to guide health services researchers step-by-step in gathering and evaluating validity evidence within their specific situation. We address construct precision, the capacity of an instrument to measure constructs it purports to measure and differentiate from other, unrelated constructs; quantification precision, the reliability of the instrument; and translation precision, the ability to generalize scores from an instrument across subjects from the same or similar populations. We illustrate with specific examples, such as an approach to validating a measurement instrument for veterans when prior evidence of instrument validity for this population does not exist. Conclusions Validity should be viewed as a property of the interpretations and uses of scores from an instrument, not of the instrument itself: how scores are used and the consequences of this use are integral to validity. Our advice is to liken validation to building a court case, including discovering evidence, weighing the evidence, and recognizing when the evidence is weak and more evidence is needed. PMID:16178998
NASA Astrophysics Data System (ADS)
Skala, Melissa C.; Crow, Matthew J.; Wax, Adam; Izatt, Joseph A.
2009-02-01
Molecular imaging is a powerful tool for investigating disease processes and potential therapies in both in vivo and in vitro systems. However, high resolution molecular imaging has been limited to relatively shallow penetration depths that can be accessed with microscopy. Optical coherence tomography (OCT) is an optical analogue to ultrasound with relatively good penetration depth (1-2 mm) and resolution (~1-10 μm). We have developed and characterized photothermal OCT as a molecular contrast mechanism that allows for high resolution molecular imaging at deeper penetration depths than microscopy. Our photothermal system consists of an amplitude-modulated heating beam that spatially overlaps with the focused spot of the sample arm of a spectral-domain OCT microscope. Validation experiments in tissue-like phantoms containing gold nanospheres that absorb at 532 nm revealed a sensitivity of 14 parts per million nanospheres (weight/weight) in a tissue-like environment. The nanospheres were then conjugated to anti-EGFR, and molecular targeting was confirmed in cells that over-express EGFR (MDA-MB-468) and cells that express low levels of EGFR (MDA-MB-435). Molecular imaging in three-dimensional tissue constructs was confirmed with a significantly lower photothermal signal (p<0.0001) from the constructs composed of cells that express low levels of EGFR compared to the over-expressing cell constructs (300% signal increase). This technique could potentially augment confocal and multiphoton microscopy as a method for deep-tissue, depth-resolved molecular imaging with relatively high resolution and target sensitivity, without photobleaching or cytotoxicity.
Substance versus style: a new look at social desirability in motivating contexts.
Smith, D Brent; Ellingson, Jill E
2002-04-01
Although there is an emerging consensus that social desirability does not meaningfully affect criterion-related validity, several researchers have reaffirmed the argument that social desirability degrades the construct validity of personality measures. Yet, most research demonstrating the adverse consequences of faking for construct validity uses a fake-good instruction set. The consequence of such a manipulation is to exacerbate the effects of response distortion beyond what would be expected under realistic circumstances (e.g., an applicant setting). The research reported in this article was designed to assess these issues by using real-world contexts not influenced by artificial instructions. Results suggest that response distortion has little impact on the construct validity of personality measures used in selection contexts.
Farhan, Bilal; Soltani, Tandis; Do, Rebecca; Perez, Claudia; Choi, Hanul; Ghoniem, Gamal
2018-05-02
Endoscopic injection of urethral bulking agents is an office procedure that is used to treat stress urinary incontinence secondary to internal sphincteric deficiency. Validation studies are an important part of simulator evaluation and are considered an important step in establishing the effectiveness of simulation-based training. The endoscopic needle injection (ENI) simulator has not been formally validated, although it has been used widely at University of California, Irvine. We aimed to assess the face, content, and construct validity of the UC, Irvine ENI simulator. Dissected female porcine bladders were mounted in a modified Hysteroscopy Diagnostic Trainer. Using routine endoscopic equipment for this procedure with video monitoring, 6 urologists (experts group) and 6 urology trainees (novice group) completed urethral bulking agent injections on a total of 12 bladders using the ENI simulator. Face and content validity were assessed using a structured quantitative survey rating the realism. Construct validity was assessed by comparing the performance, time of the procedure, and the occlusive (anatomical and functional) evaluations between the experts and novices. Trainees also completed a postprocedure feedback survey. Effective injections were evaluated by measuring the retrograde urethral opening pressure, visual cystoscopic coaptation, and postprocedure gross anatomic examination. All 12 participants felt the simulator was a good training tool and should be used as an essential part of urology training (face validity). The ENI simulator showed good face and content validity, with average scores of 3.9/5 and 3.8/5 among the experts and the novices, respectively. Content validity evaluation showed that most aspects of the simulator were adequately realistic (mean Likert scores 3.9-3.8/5). However, the bladder does not bleed and is sometimes thin.
Experts significantly outperformed novices (p < 0.001) across all measures of performance, thereby establishing construct validity. The ENI simulator shows face, content and construct validity, although a few aspects of the simulator were not very realistic (e.g., bleeding). This study provides a basis for the future formal validation of this simulator and for its continued use in endourology training. Copyright © 2018 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
The Nature of Science Instrument-Elementary (NOSI-E): the end of the road?
Peoples, Shelagh M; O'Dwyer, Laura M
2014-01-01
This research continues prior work published in this journal (Peoples, O'Dwyer, Shields and Wang, 2013). The first paper described the scale development, psychometric analyses and part-validation of a theoretically-grounded Rasch-based instrument, the Nature of Science Instrument-Elementary (NOSI-E). The NOSI-E was designed to measure elementary students' understanding of the Nature of Science (NOS). In the first paper, evidence was provided for three of the six validity aspects (content, substantive and generalizability) needed to support the construct validity of the NOSI-E. The research described in this paper examines two additional validity aspects (structural and external). The purpose of this study was to determine which of three competing internal models provides reliable, interpretable, and responsive measures of students' understanding of NOS. One postulate is that the NOS construct is unidimensional; alternatively, the NOS construct is composed of five independent unidimensional constructs (the consecutive approach). Lastly, the NOS construct may be multidimensional and composed of five inter-related but separate dimensions. The vast body of evidence supported the claim that the NOS construct is multidimensional. Measures from the multidimensional model were positively related to student science achievement and students' perceptions of their classroom environment; this provided supporting evidence for the external validity aspect of the NOS construct. As US science education moves toward students learning science through engaging in authentic scientific practices and building learning progressions (NRC, 2012), it will be important to assess whether this new approach to teaching science is effective, and the NOSI-E may be used as a measure of the impact of this reform.
Jones, Andrew; Button, Emily; Rose, Abigail K; Robinson, Eric; Christiansen, Paul; Di Lemma, Lisa; Field, Matt
2016-03-01
Motivation to drink alcohol can be measured in the laboratory using an ad-libitum 'taste test', in which participants rate the taste of alcoholic drinks whilst their intake is covertly monitored. Little is known about the construct validity of this paradigm. The objective of this study was to investigate variables that may compromise the validity of this paradigm and its construct validity. We re-analysed data from 12 studies from our laboratory that incorporated an ad-libitum taste test. We considered time of day and participants' awareness of the purpose of the taste test as potential confounding variables. We examined whether gender, typical alcohol consumption, subjective craving, scores on the Alcohol Use Disorders Identification Test and perceived pleasantness of the drinks predicted ad-libitum consumption (construct validity). We included 762 participants (462 female). Participant awareness and time of day were not related to ad-libitum alcohol consumption. Males drank significantly more alcohol than females (p < 0.001), and individual differences in typical alcohol consumption (p = 0.04), craving (p < 0.001) and perceived pleasantness of the drinks (p = 0.04) were all significant predictors of ad-libitum consumption. We found little evidence that time of day or participant awareness influenced alcohol consumption. The construct validity of the taste test was supported by relationships between ad-libitum consumption and typical alcohol consumption, craving and pleasantness ratings of the drinks. The ad-libitum taste test is a valid method for the assessment of alcohol intake in the laboratory.
Vatan, Sevginar; Ertaş, Sedar; Lester, David
2011-04-01
In a sample of 100 Turkish psychiatric patients with diagnoses of anxiety disorders, Lester's Helplessness, Hopelessness, and Haplessness inventory had moderate estimates of internal consistency, test-retest reliability, and construct validity.
Evaluating Evidence for Conceptually Related Constructs Using Bivariate Correlations
ERIC Educational Resources Information Center
Swank, Jacqueline M.; Mullen, Patrick R.
2017-01-01
The article serves as a guide for researchers in developing evidence of validity using bivariate correlations, specifically construct validity. The authors outline the steps for calculating and interpreting bivariate correlations. Additionally, they provide an illustrative example and discuss the implications.
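As an illustration of the kind of calculation such a guide covers, the sketch below computes a Pearson bivariate correlation between two sets of scale scores. It is not taken from the article; the scores are hypothetical, and the function uses the standard product-moment formula.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson product-moment correlation between paired scores."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must be paired and contain at least two scores")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products of deviations from the means.
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations for each variable.
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cross / math.sqrt(ss_x * ss_y)


# Hypothetical scores of five respondents on two conceptually related scales.
scale_a = [1, 2, 3, 4, 5]
scale_b = [2, 4, 5, 4, 5]

r = pearson_r(scale_a, scale_b)
print(round(r, 3), round(r ** 2, 3))  # correlation and proportion of shared variance
```

Squaring r gives the proportion of variance the two measures share, which is often easier to interpret than r itself when judging whether two constructs are related as expected.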
Dimensionality and construct validity of the Perceptions of Organizational Politics Scale (POPS).
DOT National Transportation Integrated Search
1992-02-01
This study examined the dimensionality and construct validity of Kacmar and Ferris (1991) Perceptions of Organizational Politics Scale (POPS), which is comprised of 3 subscales: "General Political Behavior," "Going Along to Get Ahead," and "Pay and P...
Rater reliability and construct validity of a mobile application for posture analysis
Szucs, Kimberly A.; Brown, Elena V. Donoso
2018-01-01
[Purpose] Measurement of posture is important for those with a clinical diagnosis as well as researchers aiming to understand the impact of faulty postures on the development of musculoskeletal disorders. A reliable, cost-effective and low tech posture measure may be beneficial for research and clinical applications. The purpose of this study was to determine rater reliability and construct validity of a posture screening mobile application in healthy young adults. [Subjects and Methods] Pictures of subjects were taken in three standing positions. Two raters independently digitized the static standing posture image twice. The app calculated posture variables, including sagittal and coronal plane translations and angulations. Intra- and inter-rater reliability were calculated using the appropriate ICC models for complete agreement. Construct validity was determined through comparison of known groups using repeated measures ANOVA. [Results] Intra-rater reliability ranged from 0.71 to 0.99. Inter-rater reliability was good to excellent for all translations. ICCs were stronger for translations versus angulations. The construct validity analysis found that the app was able to detect the change in the four variables selected. [Conclusion] The posture mobile application has demonstrated strong rater reliability and preliminary evidence of construct validity. This application may have utility in clinical and research settings. PMID:29410561
ERIC Educational Resources Information Center
Gwaltney, Kevin Dale
2012-01-01
This effort: 1) establishes an autonomy definition uniquely tailored for teaching, 2) validates a nationally generalizable teacher autonomy construct, 3) demonstrates that the model describes and explains the autonomy levels of particular teacher groups, and 4) verifies the construct can represent teacher autonomy in other empirical models. The…
Computer Literacy and the Construct Validity of a High-Stakes Computer-Based Writing Assessment
ERIC Educational Resources Information Center
Jin, Yan; Yan, Ming
2017-01-01
One major threat to validity in high-stakes testing is construct-irrelevant variance. In this study we explored whether the transition from a paper-and-pencil to a computer-based test mode in a high-stakes test in China, the College English Test, has brought about variance irrelevant to the construct being assessed in this test. Analyses of the…
van Loon, Johannes P A M; Van Dierendonck, Machteld C
2015-12-01
Although recognition of equine pain has been studied extensively over the past decades there is still need for improvement in objective identification of pain in horses with acute colic. This study describes scale construction and clinical applicability of the Equine Utrecht University Scale for Composite Pain Assessment (EQUUS-COMPASS) and the Equine Utrecht University Scale for Facial Assessment of Pain (EQUUS-FAP) in horses with acute colic. A cohort follow-up study was performed using 50 adult horses (n = 25 with acute colic, n = 25 controls). Composite pain scores were assessed by direct observations, Visual Analog Scale (VAS) scores were assessed from video clips. Colic patients were assessed at arrival, and on the first and second mornings after arrival. Both the EQUUS-COMPASS and EQUUS-FAP scores showed high inter-observer reliability (ICC = 0.98 for EQUUS-COMPASS, ICC = 0.93 for EQUUS-FAP, P <0.001), while a moderate inter-observer reliability for the VAS scores was found (ICC = 0.63, P <0.001). The cut-off value for differentiation between healthy and colic horses for the EQUUS-COMPASS was 5, and for differentiation between conservatively treated and surgically treated or euthanased patients it was 11. For the EQUUS-FAP, cut-off values were 4 and 6, respectively. Internal sensitivity and specificity were good for both EQUUS-COMPASS (sensitivity 95.8%, specificity 84.0%) and EQUUS-FAP (sensitivity 87.5%, specificity 88.0%). The use of the EQUUS-COMPASS and EQUUS-FAP enabled repeated and objective scoring of pain in horses with acute colic. A follow-up study with new patients and control animals will be performed to further validate the constructed scales that are described in this study. Copyright © 2015 Elsevier Ltd. All rights reserved.
Bahrdt, C; Krech, A B; Wurz, A; Wulff, D
2010-03-01
For years, an increasing number and diversity of genetically modified plants has been grown on a commercial scale. The need for detection and identification of these genetically modified organisms (GMOs) calls for broad and at the same time flexible high throughput testing methods. Here we describe the development and validation of a hexaplex real-time polymerase chain reaction (PCR) screening assay covering more than 100 approved GMOs containing at least one of the GMO targets of the assay. The assay comprises detection systems for Cauliflower Mosaic Virus 35S promoter, Agrobacterium tumefaciens NOS terminator, Figwort Mosaic Virus 34S promoter and two construct-specific sequences present in novel genetically modified soybean and maize that lack common screening elements. Additionally a detection system for an internal positive control (IPC) indicating the presence or absence of PCR inhibiting substances was included. The six real-time PCR systems were allocated to five detection channels showing no significant crosstalk between the detection channels. As part of an extensive validation, a limit of detection (LOD(abs)) ≤ ten target copies was proven in hexaplex format. A sensitivity ≤ ten target copies of each GMO detection system was still shown in highly asymmetric target situations in the presence of 1,000 copies of all other GMO targets of each detection channel. Furthermore, the applicability to a broad sample spectrum and reliable indication of inhibition by the IPC system was demonstrated. The presented hexaplex assay offers sensitive and reliable detection of GMOs in processed and unprocessed food, feed and seed samples with high efficiency.