[Development of competency to stand trial rating scale in offenders with mental disorders].
Chen, Xiao-Bing; Cai, Wei-Xiong
2013-04-01
To develop a competency to stand trial rating scale for offenders with mental disorders, in accordance with the Chinese legal system. Proceeding from the relevant legal elements, 15 items were extracted and formulated into a preliminary instrument named the competency to stand trial rating scale in offenders with mental disorders. The item analysis covered six aspects: critical ratio, item-total correlation, corrected item-total correlation, alpha value if item deleted, communalities of items, and factor loading. A logistic regression equation and the cut-off score from the ROC curve were used to explore diagnostic efficiency. The critical ratios of the extreme groups were 18.390-46.763; item-total correlations, 0.639-0.952; corrected item-total correlations, 0.582-0.944; communalities of items, 0.377-0.916; and factor loadings, 0.614-0.957. Seven items were included in the regression equation, and the accuracy of the back-substitution test was 96.0%. A score of 33 was determined as the cut-off score from the fitted ROC curve, and the agreement rate with expert opinion was 95.8%. The sensitivity and specificity were 0.938 and 0.966, respectively, while the positive and negative likelihood ratios were 27.67 and 0.06, respectively. All items satisfied the requirements of the homogeneity test, and the rating scale has a reasonable construct and excellent diagnostic efficiency.
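The likelihood ratios reported above follow from sensitivity and specificity by the standard definitions, LR+ = sensitivity / (1 - specificity) and LR- = (1 - sensitivity) / specificity. The short sketch below applies those definitions to the rounded values from the abstract; the function name is illustrative and not taken from the study. With the rounded inputs the positive ratio comes out near 27.6 rather than the reported 27.67, a gap that is presumably due to the authors working from unrounded values.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for a binary diagnostic test."""
    lr_pos = sensitivity / (1.0 - specificity)
    lr_neg = (1.0 - sensitivity) / specificity
    return lr_pos, lr_neg


# Rounded values as reported in the abstract.
lr_pos, lr_neg = likelihood_ratios(0.938, 0.966)
print(f"LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}")
```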
Trust in Leadership DEOCS 4.1 Construct Validity Summary
2017-08-01
Item Corrected Item- Total Correlation Cronbach’s Alpha if Item Deleted Four-point Scale Items I can depend on my immediate supervisor to meet...1974) were used to assess the fit between the data and the factor. The BTS hypothesizes that the correlation matrix is an identity matrix. The...to reject the null hypothesis that the correlation matrix is an identity, and to conclude that the factor analysis is an appropriate method to
Validation of general job satisfaction in the Korean Labor and Income Panel Study.
Park, Shin Goo; Hwang, Sang Hee
2017-01-01
The purpose of this study is to assess the validity and reliability of general job satisfaction (JS) in the Korean Labor and Income Panel Study (KLIPS). We used the data from the 17th wave (2014) of the nationwide KLIPS, which selected a representative panel sample of Korean households and individuals aged 15 or older residing in urban areas. We included in this study 7679 employed subjects (4529 males and 3150 females). The general JS instrument consisted of five items rated on a scale from 1 (strongly disagree) to 5 (strongly agree). The general JS reliability was assessed using the corrected item-total correlation and Cronbach's alpha coefficient. The validity of general JS was assessed using confirmatory factor analysis (CFA) and Pearson's correlation. The corrected item-total correlations ranged from 0.736 to 0.837. Therefore, no items were removed. Cronbach's alpha for general JS was 0.925, indicating excellent internal consistency. The CFA of the general JS model showed a good fit. Pearson's correlation coefficients for convergent validity showed moderate or strong correlations. The results obtained in our study confirm the validity and reliability of general JS.
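As a rough illustration of the two reliability statistics named above, the sketch below computes Cronbach's alpha and corrected item-total correlations (each item correlated with the sum of the remaining items) for a small response matrix. The data are invented for the example and are not from KLIPS; the function names are likewise illustrative.

```python
from statistics import mean, variance


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def cronbach_alpha(rows):
    """Cronbach's alpha; rows are respondents, columns are items."""
    k = len(rows[0])
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def corrected_item_total(rows):
    """Correlate each item with the total of all other items."""
    k = len(rows[0])
    result = []
    for j in range(k):
        item = [r[j] for r in rows]
        rest = [sum(r) - r[j] for r in rows]  # total without item j
        result.append(pearson(item, rest))
    return result


# Hypothetical 1-5 ratings: 6 respondents x 5 items.
responses = [
    [4, 5, 4, 4, 5],
    [2, 2, 3, 2, 2],
    [3, 3, 3, 4, 3],
    [5, 5, 4, 5, 5],
    [1, 2, 1, 2, 1],
    [3, 4, 3, 3, 4],
]
alpha = cronbach_alpha(responses)
citc = corrected_item_total(responses)
print(f"alpha = {alpha:.3f}")
print("corrected item-total:", [round(r, 3) for r in citc])
```

Removing the item from the total before correlating avoids inflating the coefficient, which matters most for short scales such as a five-item instrument.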
Methods for Linking Item Parameters.
1981-08-01
within and across data sets; all proportion-correct distributions were quite platykurtic. Biserial item-total correlations had relatively consistent...would produce a distribution of a parameters which had a larger mean and standard deviation, was more positively skewed, and was somewhat more platykurtic
Goossens, Joline; Verhaeghe, Sofie; Van Hecke, Ann; Barrett, Geraldine; Delbaere, Ilse; Beeckman, Dimitri
2018-01-01
To evaluate the psychometric properties of the Dutch version of the London Measure of Unplanned Pregnancy (LMUP) in women with pregnancies ending in birth. A two-phase psychometric evaluation design was used. Phase I comprised the translation from English into Dutch and pretesting with 6 women using cognitive interviews. In phase II, the reliability and validity of the Dutch version of the LMUP were assessed in 517 women who had recently given birth. Reliability (internal consistency) was assessed using Cronbach's alpha, inter-item correlations, and corrected item-total correlations. Construct validity was assessed using principal components analysis and hypothesis testing. Exploratory Mokken scale analysis was carried out. 517 women aged 15-45 completed the Dutch version of the LMUP. Reliability testing showed acceptable internal consistency (alpha = 0.74, positive inter-item correlations between all items, all corrected item-total correlations >0.20). Validity testing confirmed the unidimensional structure of the scale and all hypotheses were confirmed. The overall Loevinger's H coefficient was 0.57, representing a 'strong' scale. The Dutch version of the LMUP is a reliable and valid measure that can be used in the Dutch-speaking population in Belgium to assess pregnancy planning. Future research is necessary to assess the stability of the Dutch version of the LMUP, and to evaluate its psychometric properties in women with abortions.
Park, Jong Cook; Kim, Kwang Sig
2012-03-01
The reliability of a test is determined by the characteristics of its items. Item analysis can be performed using classical test theory and item response theory. The purpose of the study was to compare the discrimination indices with item response theory using the Rasch model. Thirty-one 4th-year medical school students participated in the clinical course written examination, which included 22 A-type items and 3 R-type items. The point biserial correlation coefficient (C(pbs)) was compared to the method of extreme groups (D), the biserial correlation coefficient (C(bs)), the item-total correlation coefficient (C(it)), and the corrected item-total correlation coefficient (C(cit)). The Rasch model was applied to estimate item difficulty and examinee ability and to calculate item fit statistics using joint maximum likelihood. The explanatory power (r²) for C(pbs) decreased in the following order: C(cit) (1.00), C(it) (0.99), C(bs) (0.94), and D (0.45). Item difficulty ranged from -0.82 to 0.80 logits (standard error 0.37 to 0.76), and examinee ability ranged from -3.69 to 3.19 logits (standard error 0.45 to 1.03). Items 9 and 23 had outfit ≥1.3. Students 1, 5, 7, 18, 26, 30, and 32 had fit ≥1.3. C(pbs), C(cit), and C(it) are good discrimination parameters. The Rasch model can estimate the item difficulty parameter and the examinee ability parameter with standard errors. The fit statistics can identify bad items and unpredictable examinee responses.
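Two of the classical discrimination indices compared above can be sketched briefly: the point-biserial correlation between a 0/1 item and the total score, and the extreme-group index D (proportion correct in the top group minus proportion correct in the bottom group). The 27% group size is a common convention assumed here, not something stated in the abstract, and the scores are invented for illustration.

```python
from statistics import mean, pstdev


def point_biserial(item, total):
    """Point-biserial correlation of a 0/1 item with a continuous score."""
    ones = [t for i, t in zip(item, total) if i == 1]
    zeros = [t for i, t in zip(item, total) if i == 0]
    p = len(ones) / len(item)  # proportion answering correctly
    return (mean(ones) - mean(zeros)) / pstdev(total) * (p * (1 - p)) ** 0.5


def extreme_group_d(item, total, fraction=0.27):
    """Discrimination index D using the top and bottom score groups."""
    n = max(1, round(fraction * len(item)))
    order = sorted(range(len(item)), key=lambda i: total[i], reverse=True)
    upper = [item[i] for i in order[:n]]
    lower = [item[i] for i in order[-n:]]
    return mean(upper) - mean(lower)


# Hypothetical data: one item and total scores for 10 examinees.
item = [1, 1, 0, 1, 1, 0, 1, 0, 1, 0]
total = [24, 23, 21, 20, 19, 17, 15, 14, 12, 10]

r_pbs = point_biserial(item, total)
d_index = extreme_group_d(item, total)
print(f"point-biserial = {r_pbs:.3f}, D = {d_index:.3f}")
```

D uses only the tails of the score distribution, which is one plausible reason it tracks the correlation-based indices less closely in a small class.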
Schinka, J A
1995-02-01
Individual scale characteristics and the inventory structure of the Personality Assessment Inventory (PAI; Morey, 1991) were examined by conducting internal consistency and factor analyses of item and scale score data from a large group (N = 301) of alcohol-dependent patients. Alpha coefficients, mean inter-item correlations, and corrected item-total scale correlations for the sample paralleled values reported by Morey for a large clinical sample. Minor differences in the scale factor structure of the inventory from Morey's clinical sample were found. Overall, the findings support the use of the PAI in the assessment of personality and psychopathology of alcohol-dependent patients.
Maizura, Husna; Masilamani, Retneswari; Aris, Tahir
2009-04-01
This small, cross-sectional study assessed the reliability of 3 scales from the Job Content Questionnaire (JCQ), namely decision latitude, psychological job demand, and social support, in a group of office workers in a multinational company in Kuala Lumpur. A universal sample of 30 white-collar workers from a department of the company self-administered on-site the English version of the JCQ, comprising 21 core items selected from the full recommended version of 49 items. Reliability (internal consistency) was evaluated using Cronbach's alpha coefficients for each scale. Corrected item-total correlations were reported for every item. Cronbach's alpha coefficients were acceptable for decision latitude (.76) and social support (.79) but slightly lower for psychological job demand (.64). Values for all item-total correlations for all 3 scales were greater than .3. In conclusion, this study suggests that the JCQ is a reliable scale for assessing job stress in this group of workers.
Dimensions of vegetable parenting practices among preschoolers.
Baranowski, Tom; Chen, Tzu-An; O'Connor, Teresia; Hughes, Sheryl; Beltran, Alicia; Frankel, Leslie; Diep, Cassandra; Baranowski, Janice C
2013-10-01
The objective of this study was to determine the factor structure of 31 effective and ineffective vegetable parenting practices used by parents of preschool children based on three theoretically proposed factors: responsiveness, control and structure. The methods employed included both corrected item-total correlations and confirmatory factor analysis. Acceptable fit was obtained only when effective and ineffective parenting practices were analyzed separately. Among effective items the model included one second order factor (effectiveness) and the three proposed first order factors. The same structure was found among ineffective items, but it required that correlated paths be specified among items. A theoretically specified three-factor structure was obtained among the 31 vegetable parenting practice items, but the items likely to be effective and those likely to be ineffective had to be analyzed separately. Research is needed on how these parenting practice factors predict child vegetable intake. Copyright © 2013 Elsevier Ltd. All rights reserved.
Learning Style Scales: a valid and reliable questionnaire.
Abdollahimohammad, Abdolghani; Ja'afar, Rogayah
2014-01-01
Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF) with oblimin rotation, a scree plot test, and parallel analysis (PA). The reliability of LSS was tested using Cronbach's α, corrected item-total correlation, and test-retest. Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach's α was >0.70 for all subscales in both study populations. The corrected item-total correlations were >0.30 for the items in each component. The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
Development of Attitudes Toward Homosexuality Scale for Indians (AHSI).
Ahuja, Kanika K
2017-01-01
Attitudes toward homosexuality vary across cultures, with the legal and societal position being rather complicated in India. This study describes the process of developing and validating a Likert-type scale to assess attitudes toward homosexuality among heterosexuals. Phase 1 describes the development of the scale. Items were written based on thematic analysis of narratives generated from 50 college students and reviewing existing scales. After administering the 70-item scale to 68 participants, item analysis yielded 20 statements with item-total correlations over .70. Cronbach's alpha was .97. In Phase 2, the 20-item Attitudes Toward Homosexuality Scale for Indians (AHSI) was administered to 142 participants. Analysis yielded a corrected split-half correlation of .91. Further, AHSI discriminated between women and men; between liberal arts and STEM/business students; and those who reported interpersonal contact with gay men and lesbian women and those who did not. The scale has satisfactory reliability and shows promising construct validity.
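The "corrected split-half correlation" mentioned above is conventionally obtained with the Spearman-Brown formula, which steps the correlation between two half-tests up to the reliability of the full-length test: r_full = 2 r_half / (1 + r_half). The sketch below assumes that convention, since the abstract does not name the correction used; the half-test correlation of 0.80 is a made-up input.

```python
def spearman_brown(r_half):
    """Step a half-test correlation up to full-test reliability."""
    return 2.0 * r_half / (1.0 + r_half)


# Hypothetical correlation between odd-item and even-item half scores.
r_half = 0.80
r_full = spearman_brown(r_half)
print(f"split-half r = {r_half:.2f}, corrected = {r_full:.3f}")
```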
Bermúdez-de-Alvear, Rosa M; Gálvez-Ruiz, Pablo; Martínez-Arquero, A Ginés; Rando-Márquez, Sara; Fernández-Contreras, Elena
2018-06-11
This study aimed to analyze the psychometric properties of the Spanish version of the Voice Activity and Participation Profile (SVAPP) questionnaire. A randomized, cross-sectional sampling strategy with controls was used. Two samples with a total of 169 participants were analyzed, specifically 61 men (mean age 37.02) and 108 women (mean age 37.78). Of these participants, 112 were patients and 57 were controls. The instrument was submitted to reliability (internal consistency and corrected item-total correlations) and reproducibility analyses. Validation assessment was based on construct validity, convergent validity, discriminant validity, and concurrent validity. The global internal consistency was excellent (Cronbach's α = 0.976), corrected item-total correlations were satisfactory and ranged from 0.63 to 0.89, and factor loadings were above 0.50. The different subscales showed good internal consistency (alpha coefficients ranged from 0.830 to 0.956) and test-retest values were consistently associated. The exploratory factor analysis evidenced a strongly defined five-factor internal structure, with factor loadings ranging from 0.51 to 0.86. Convergent validity demonstrated that all subscales and scores were very strongly correlated (Pearson r above 0.735) and significantly associated. The discriminant validity analysis showed that the SVAPP had good specificity to distinguish dysphonic from healthy voice subjects. Concurrent validity with the Voice Handicap Index Spanish version (SVHI) showed very strong correlations between total scores, and between the SVHI total score and the SVAPP Daily and Social Communication subscales; correlations between the subscales of both tests were strong; only between the SVAPP Work and SVHI Physical sections were correlations moderate. The findings of the present study demonstrated evidence for the reliability and validity of the SVAPP questionnaire, and provided insight into the implications of voice disorders for Spanish patients' quality of life.
However, further investigations are required. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
[Development of critical thinking skill evaluation scale for nursing students].
You, So Young; Kim, Nam Cho
2014-04-01
To develop a Critical Thinking Skill Test for Nursing Students. The construct concepts were drawn from a literature review and in-depth interviews with hospital nurses, and surveys were conducted among students (n=607) from nursing colleges. The data were collected from September 13 to November 23, 2012 and analyzed using SAS version 9.2. The analyses included the KR 20 coefficient for reliability, the difficulty index, the discrimination index, item-total correlation, and the known-group technique for validity. Four domains and 27 skills were identified and 35 multiple choice items were developed. Thirty multiple choice items which had scores higher than .80 on the content validity index were selected for the pretest. From the analysis of the pretest data, a modified set of 30 items was selected for the main test. In the main test, the KR 20 coefficient was .70 and the corrected item-total correlations ranged from .11 to .38. There was a statistically significant difference between the two academic systems (p=.001). The developed instrument is the first critical thinking skill test reflecting nursing perspectives in hospital settings and is expected to be used as a tool that contributes to improving the critical thinking ability of nursing students.
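For dichotomously scored multiple choice items, the KR 20 coefficient named above is usually computed as k/(k-1) × (1 - Σpq / σ²), where p is the proportion answering an item correctly, q = 1 - p, and σ² is the variance of total scores. The sketch below uses the population variance so that it is consistent with pq as the item variance; that choice and the tiny data set are assumptions made for illustration only.

```python
from statistics import pvariance


def kr20(rows):
    """Kuder-Richardson 20 for 0/1 item scores (rows = examinees)."""
    n = len(rows)
    k = len(rows[0])
    # Sum of p*q over items, where p is the proportion correct.
    sum_pq = 0.0
    for j in range(k):
        p = sum(r[j] for r in rows) / n
        sum_pq += p * (1 - p)
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum_pq / total_var)


# Hypothetical scores: 6 examinees x 4 items.
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]
kr = kr20(scores)
print(f"KR-20 = {kr:.3f}")
```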
[Analysis of parental knowledge and care in childhood fever].
Pérez-Conesa, Maria-Cristina; Sánchez Pina, Inés; Ridao Manonellas, Saida; Tormo Esparza, Antoni; García Hernando, Verónica; López Fernández, Marta
2017-10-01
To describe parental knowledge and care of fever in children under 2 years of age, and to relate these data to socio-demographic characteristics. Cross-sectional, correlational, multicenter study. Five Primary Care teams in Barcelona. Parents of children under 2 years who attended to receive a vaccine included in the routine pediatric vaccination schedule. A total of 311 subjects participated. The main variables were 9 knowledge items and 8 items on care or management of fever, obtained by adapting the questionnaire by Chiappini et al. (2012). 69.8% showed correct care/management of fever. 3.9% answered all knowledge items correctly. The knowledge score was lower in people with no education (P=.03); it was higher in Europe and South America and lowest in Asia and Africa (P<.001). 100% of patients that had chronic problems answered all fever care items correctly (P=.03). It is important to note that the correlation between the knowledge and management scores was positive (rho=0.15, P=.008). Correct care of fever was observed despite low knowledge. A good strategy to promote correct care of the febrile child is health education with up-to-date information adapted to parents, focusing on the differences between ethnic groups because they seem to have inaccurate beliefs about fever. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Stoll, Kathrin; Hauck, Yvonne; Downe, Soo; Edmonds, Joyce; Gross, Mechthild M; Malott, Anne; McNiven, Patricia; Swift, Emma; Thomson, Gillian; Hall, Wendy A
2016-06-01
Assessment of childbirth fear, in advance of pregnancy, and early identification of modifiable factors contributing to fear can inform public health initiatives and/or school-based educational programming for the next generation of maternity care consumers. We developed and evaluated a short fear of birth scale that incorporates the most common dimensions of fear reported by men and women prior to pregnancy, fear of: labour pain, being out of control and unable to cope with labour and birth, complications, and irreversible physical damage. University students in six countries (Australia, Canada, England, Germany, Iceland, and the United States, n = 2240) participated in an online survey to assess their fears and attitudes about birth. We report internal consistency reliability, corrected-item-to-total correlations, factor loadings and convergent and discriminant validity of the new scale. The Childbirth Fear - Prior to Pregnancy (CFPP) scale showed high internal consistency across samples (α > 0.86). All corrected-item-to total correlations exceeded 0.45, supporting the uni-dimensionality of the scale. Construct validity of the CFPP was supported by a high correlation between the new scale and a two-item visual analogue scale that measures fear of birth (r > 0.6 across samples). Weak correlations of the CFPP with scores on measures that assess related psychological states (anxiety, depression and stress) support the discriminant validity of the scale. The CFPP is a short, reliable and valid measure of childbirth fear among young women and men in six countries who plan to have children. Copyright © 2016 Elsevier B.V. All rights reserved.
Effectiveness of health management departments of universities that train health managers in Turkey.
Karagoz, Sevgul; Balci, Ali
2007-01-01
This research aimed to examine the effectiveness of the health management departments of universities which train health managers in Turkey. The study compares, for lecturers and students, nine variables of organisational effectiveness. These nine dimensions are derived from Cameron (1978; 1981; 1986). Factor analysis was used to validate the scale developed by the researcher. For internal consistency and reliability, the Cronbach Alpha reliability coefficient and item total correlation were applied. A questionnaire was administered to a total of 207 people in health management departments in Turkey. In analysis of the data, descriptive statistics and the t-test were used. According to our research findings, at individual university level, lecturers found their departments more effective than did their students. The highest effectiveness was perceived at Baskent University, a private university. The best outcome was achieved for 'organisational health', and 'the ability to acquire resources' achieved the lowest outcome. Effectiveness overall was found to be moderate. Copyright (c) 2006 John Wiley & Sons, Ltd.
The Effects of Methods of Imputation for Missing Values on the Validity and Reliability of Scales
ERIC Educational Resources Information Center
Cokluk, Omay; Kayri, Murat
2011-01-01
The main aim of this study is the comparative examination of the factor structures, corrected item-total correlations, and Cronbach-alpha internal consistency coefficients obtained by different methods used in imputation for missing values in conditions of not having missing values, and having missing values of different rates in terms of testing…
van Dijk, Inge; Scholten Meilink Lenferink, Nick; Lucassen, Peter L B J; Mercer, Stewart W; van Weel, Chris; Olde Hartman, Tim C; Speckens, Anne E M
2017-02-01
Empathy is an essential skill in doctor-patient communication with positive effects on compliance, patient satisfaction and symptom duration. There are no validated patient-rated empathy measures available in Dutch. To investigate the validity and reliability of a Dutch version of the Consultation and Relational Empathy (CARE) Measure, a widely used 10-item patient-rated questionnaire of physician empathy. After translation and back translation, the Dutch CARE Measure was distributed among patients from 19 general practitioners in 5 primary care centers. Tests of internal reliability and validity included Cronbach's alpha, item total correlations and factor analysis. Seven items of the QUality Of care Through the patient's Eyes (QUOTE) questionnaire assessing 'affective performance' of the physician were included in factor analysis and used to investigate convergent validity. Of the 800 distributed questionnaires, 655 (82%) were returned. Acceptability and face validity were supported by a low number of 'does not apply' responses (range 0.2%-11.9%). Internal reliability was high (Cronbach's alpha 0.974). Corrected item total correlations were at a minimum of 0.837. Factor analysis on the 10 items of the CARE Measure and 7 QUOTE items resulted in two factors (Eigenvalue > 1), the first containing the CARE Measure items and the second containing the QUOTE items. Convergent construct validity between the CARE Measure and QUOTE was confirmed with a modest positive correlation (r = 0.34, n = 654, P < 0.001). The findings support the preliminary validity and reliability of the Dutch CARE Measure. Future research is required to investigate divergent validity and discriminant ability between doctors. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
A Psychometric Evaluation of the Threadgold Communication Tool for Persons with Dementia
Strøm, Benedicte Sørensen; Engedal, Knut; Grov, Ellen-Karine
2016-01-01
Background: The objective of this study was to investigate the psychometric properties of the Threadgold Communication Tool (TCT). Method: Internal consistency reliability was measured using Cronbach's α coefficient and inter-item correlation. Test-retest was performed to examine the instrument's stability. Exploratory principal component analysis (PCA) with oblimin rotation was carried out to evaluate construct validity. Finally, the score on each item of the TCT was correlated with the person's Mini Mental State Examination (MMSE) and Barthel Index of activities of daily living scores. Results: A total of 51 persons participated, with a mean age of 86.7 (SD 6.6) years, of whom 46 were women with moderate-to-severe dementia [mean MMSE score 7.5 (SD 6.7)]. There were two measurement points 2 weeks apart. The results showed a satisfactory level for internal consistency and a high test-retest reliability (r = 0.76). The corrected item-total correlation ranged between 0.50 and 0.87, and a two-factor structure was revealed at the PCA. ‘Vocalizing’ seemed to measure another aspect of communication and was the only item which was negatively loaded. Conclusion: Despite the low sample size in this study, the results revealed the TCT as a reliable and valid instrument, suitable for measuring communication among people with dementia. We suggest clarifying the understanding of ‘vocalizing’ before considering removing it from the scale. PMID: 27239188
Rask, Marie; Oscarsson, Marie; Ludwig, Neil; Swahnberg, Katarina
2017-04-04
Cervical dysplasia is a precancerous condition, which has been shown to create anxiety in women. To be able to investigate these women's health-related quality of life, a disease-specific instrument is required. There does not seem to be a Swedish version of an instrument to screen for this specific disease. Therefore, this study aims to translate and cross-culturally adapt the Functional Assessment of Chronic Illness Therapy - Cervical Dysplasia (FACIT-CD) into a Swedish context and evaluate its linguistic validity and reliability. The Functional Assessment of Chronic Illness Therapy (FACIT) translation methodology was used, which consists of several steps including pilot testing of the FACIT-CD instrument through cognitive debriefing interviews. Ten women diagnosed with cervical dysplasia participated in the cognitive debriefing interviews. The internal consistency reliability of the Swedish FACIT-CD was estimated by Cronbach's alpha coefficient. Homogeneity of the items was evaluated by corrected item-total correlations. The sample consists of 34 women who were diagnosed with cervical dysplasia. The translation and cross-cultural adaptation went smoothly without any problems for the majority of the items. The cognitive debriefing interviews indicated that the Swedish FACIT-CD consists of relevant items, is easy to understand and complete, and has unambiguous and comprehensive response categories. The translation and cross-cultural adaptation resulted in a Swedish FACIT-CD, which is conceptually and semantically equivalent to the English version and linguistically valid. The total scale of the Swedish FACIT-CD exhibited good internal consistency reliability with a Cronbach's alpha coefficient of 0.84, and all of the subscales exhibited acceptable value between 0.71 and 0.81 except the Relationships subscale, which had a value of 0.67. Finally, all but four items exceeded the acceptable level for the corrected item-total correlations of ≥ 0.20. 
The Swedish FACIT-CD is conceptually and semantically equivalent to the English version and linguistically valid; further, it exhibits good internal consistency reliability.
Xiao, Yu-Ying; Li, Ting; Xiao, Lin; Wang, Su-Wei; Wang, Si-Qi; Wang, Han-Xiao; Wang, Bei-Bei; Gao, Yu-Lin
2017-02-01
Professional attitude is of great importance for nursing professionals in modern society. To develop an effective educational program for student nurses in China, an appropriate instrument is required for the assessment of their professional attitude. To assess the validity and reliability of the Chinese version of the Instrument of Professional Attitude for Student Nurses (IPASN). The original version of the IPASN was translated using the Brislin model (translation, back translation, cultural adaptation and pilot study) with authorization from the developer. A total of 681 nursing students were chosen by stratified convenience sampling to assess construct validity using exploratory factor analysis (EFA). In addition, item analysis, Cronbach's alpha coefficients and test-retest reliability were used to test the psychometric properties in this part. A separate sample of 204 nursing undergraduate trainees was selected by cluster convenience sampling to confirm the structure using confirmatory factor analysis (CFA). Corrected item-total correlations were between 0.33 and 0.69 and alpha-if-item-deleted values were between 0.906 and 0.913, indicating that no item should be deleted. The Cronbach alpha value was 0.91 for the total scale, and the Cronbach alpha coefficients for the subscales ranged from 0.67 to 0.89. Test-retest reliability estimated from the intraclass correlation coefficient (ICC) was 0.74 (P<0.05). Differences in item scores between the high-score group (the top 27%) and the low-score group (the bottom 27%) were significant (P<0.001), indicating that the item discrimination ability was good. Seven subscales (contribution to increase of scientific information load, autonomy, community service, continuous education, to promote professional development, cooperation and theory guiding practice) were identified in EFA and confirmed in CFA, and explained 65.5% of the total variance. 
It indicated that the Chinese version of IPASN was valid and reliable for the evaluation of nursing students' professional attitude. Copyright © 2016 Elsevier Ltd. All rights reserved.
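The "alpha if item deleted" check used above recomputes Cronbach's alpha with each item left out in turn; an item whose removal raises alpha noticeably is a candidate for deletion. The sketch below shows the idea on invented ratings in which the last item runs against the others. Neither the data nor the function names come from the IPASN study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha; rows are respondents, columns are items."""
    k = len(rows[0])
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def alpha_if_item_deleted(rows):
    """Alpha recomputed after dropping each item in turn."""
    k = len(rows[0])
    return [
        cronbach_alpha([[v for i, v in enumerate(r) if i != j] for r in rows])
        for j in range(k)
    ]


# Hypothetical ratings: the fifth item is keyed in the opposite direction.
ratings = [
    [4, 5, 4, 4, 2],
    [2, 2, 3, 2, 4],
    [3, 3, 3, 4, 3],
    [5, 5, 4, 5, 1],
    [1, 2, 1, 2, 5],
    [3, 4, 3, 3, 3],
]
alpha_full = cronbach_alpha(ratings)
alpha_drop = alpha_if_item_deleted(ratings)
print(f"alpha (all items) = {alpha_full:.3f}")
for j, a in enumerate(alpha_drop, start=1):
    print(f"alpha without item {j} = {a:.3f}")
```

In the abstract the deleted-item values (0.906 to 0.913) stay close to the full-scale alpha of 0.91, which is the pattern expected when no single item is dragging reliability down.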
The construct validity of the Bem Sex-Role Inventory for heterosexual and gay men.
Chung, Y B
1995-01-01
This study examined the construct validity of the Bem Sex-Role Inventory (BSRI; Bem, 1978) for heterosexual and gay men. Sixty heterosexual and 63 gay male participants were recruited through networking and advertisements. These two groups were of equivalent age, socioeconomic background, race, student status, and educational level. They completed the Lifestyle Questionnaire assessing sexual orientation and the BSRI assessing sex-role orientation. The internal consistency and discriminant validity of the BSRI scales were examined by corrected item-total correlations, coefficient alphas, inter-scale correlations, and factor analysis. Results suggested that the BSRI was equally valid for heterosexual and gay men, and the psychometric data reported in the BSRI Manual (Bem, 1981) were essentially replicated. However, the short-form BSRI is recommended for use with male respondents because of the problematic non-short-form Femininity items.
Fitting the Rasch Model to Account for Variation in Item Discrimination
ERIC Educational Resources Information Center
Weitzman, R. A.
2009-01-01
Building on the Kelley and Gulliksen versions of classical test theory, this article shows that a logistic model having only a single item parameter can account for varying item discrimination, as well as difficulty, by using item-test correlations to adjust incorrect-correct (0-1) item responses prior to an initial model fit. The fit occurs…
Erhart, M; Hagquist, C; Auquier, P; Rajmil, L; Power, M; Ravens-Sieberer, U
2010-07-01
This study compares item reduction analysis based on classical test theory (maximizing Cronbach's alpha - approach A), with analysis based on the Rasch Partial Credit Model item-fit (approach B), as applied to children and adolescents' health-related quality of life (HRQoL) items. The reliability and structural, cross-cultural and known-group validity of the measures were examined. Within the European KIDSCREEN project, 3019 children and adolescents (8-18 years) from seven European countries answered 19 HRQoL items of the Physical Well-being dimension of a preliminary KIDSCREEN instrument. The Cronbach's alpha and corrected item total correlation (approach A) were compared with infit mean squares and the Q-index item-fit derived according to a partial credit model (approach B). Cross-cultural differential item functioning (DIF ordinal logistic regression approach), structural validity (confirmatory factor analysis and residual correlation) and relative validity (RV) for socio-demographic and health-related factors were calculated for approaches (A) and (B). Approach (A) led to the retention of 13 items, compared with 11 items with approach (B). The item overlap was 69% for (A) and 78% for (B). The correlation coefficient of the summated ratings was 0.93. The Cronbach's alpha was similar for both versions [0.86 (A); 0.85 (B)]. Both approaches selected some items that are not strictly unidimensional and items displaying DIF. RV ratios favoured (A) with regard to socio-demographic aspects. Approach (B) was superior in RV with regard to health-related aspects. Both types of item reduction analysis should be accompanied by additional analyses. Neither of the two approaches was universally superior with regard to cultural, structural and known-group validity. However, the results support the usability of the Rasch method for developing new HRQoL measures for children and adolescents.
Buck, Harleah G; Harkness, Karen; Ali, Muhammad Usman; Carroll, Sandra L; Kryworuchko, Jennifer; McGillion, Michael
2017-04-01
Caregivers (CGs) contribute important assistance with heart failure (HF) self-care, including daily maintenance, symptom monitoring, and management. Until CGs' contributions to self-care can be quantified, it is impossible to characterize it, account for its impact on patient outcomes, or perform meaningful cost analyses. The purpose of this study was to conduct psychometric testing and item reduction on the recently developed 34-item Caregiver Contribution to Heart Failure Self-care (CACHS) instrument using classical and item response theory methods. Fifty CGs (mean age 63 years ±12.84; 70% female) recruited from a HF clinic completed the CACHS in 2014, and the results were evaluated using classical test theory and item response theory. Items would be deleted for low (<.05) or high (>.95) endorsement, low (<.3) or high (>.7) corrected item-total correlations, significant pairwise correlation coefficients, floor or ceiling effects, relatively low latent trait and item information function levels (<1.5 and p > .5), and differential item functioning. After analysis, 14 items were excluded, resulting in a 20-item instrument (self-care maintenance eight items; monitoring seven items; and management five items). Most items demonstrated moderate to high discrimination (median 2.13, minimum .77, maximum 5.05), and appropriate item difficulty (-2.7 to 1.4). Internal consistency reliability was excellent (Cronbach α = .94, average inter-item correlation = .41) with no ceiling effects. The newly developed 20-item version of the CACHS is supported by rigorous instrument development and represents a novel instrument to measure CGs' contribution to HF self-care. © 2016 Wiley Periodicals, Inc.
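The first two deletion rules quoted above (endorsement and corrected item-total correlation) can be written as a small screening function. This is a sketch under assumptions: `endorsement` is taken to be the proportion of respondents endorsing the item, the function name is hypothetical, and the remaining criteria (pairwise correlations, floor and ceiling effects, information functions, DIF) are not modelled.

```python
def screen_item(endorsement, citc,
                endorse_range=(0.05, 0.95), citc_range=(0.3, 0.7)):
    """Return the reasons an item would be flagged for deletion (empty list = keep)."""
    reasons = []
    if endorsement < endorse_range[0]:
        reasons.append("low endorsement")
    elif endorsement > endorse_range[1]:
        reasons.append("high endorsement")
    if citc < citc_range[0]:
        reasons.append("low item-total correlation")
    elif citc > citc_range[1]:
        reasons.append("high item-total correlation")
    return reasons
```

For example, `screen_item(0.97, 0.2)` flags both high endorsement and a low item-total correlation, while `screen_item(0.5, 0.5)` returns an empty list.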
The development of an instrument to assess chemistry perceptions
NASA Astrophysics Data System (ADS)
Wells, Raymond R.
The instrument, developed in this study, attempted to correct the deficiencies of previous instruments. Statements of belief and opinion can be validly included under the construct of chemistry perceptions. Further, statements that might be better characterized as science attitudes, math attitudes, or attitudes toward a specific course or program were not included. Eliminating statements of math anxiety and test anxiety ensured that responses to statements of anxiety were perceptions of anxiety solely related to chemistry. The results of the expert judges' responses to the Validation of Proposed Perception Statements forms were detailed to establish construct and content validity. The nature of Likert scale construction and calculation of internal consistency also supported the validity of the instrument. A pilot Chemistry Perception Questionnaire (CPQ) was then constructed based on agreement of the appropriate subscale and mean importance of the perception statements. The pilot CPQ results were subjected to an item analysis based on three sets of statistics: the frequency of each response and the percentage of respondents making each response for each perception statement, the mean and standard deviations for each item, and the item discrimination index which correlated the item scores with the subscale scores. With no zero or negative correlations to the subscale scores, it was not necessary to replace any of the perception statements contained in the pilot instrument. Therefore, the piloted Chemistry Perception Questionnaire became the final instrument. Factor analysis confirmed the multidimensionality of the instrument. The instrument was administered twice with a separation interval of approximately one month in order to perform a test-retest reliability analysis. One hundred and forty-one pairs were matched and results detailed. The correlation between forms, for the total instrument, was 0.9342.
The mean coefficient alpha, for the total instrument, was 0.9495. With test-retest correlations and alphas exceeding 0.70 for all seven subscales and the total instrument, it was determined that the Chemistry Perception Questionnaire instrument achieved reasonably high reliability estimations.
Laksmiastuti, Sri Ratna; Budiardjo, Sarworini Bagio; Sutadi, Heriandi
2017-06-01
Caries risk in children can be predicted by identifying caries risk factors, an important measure that contributes to a better understanding of the patient's cariogenic profile. Identification can be done by clinical examination and by questionnaire. This study was designed to validate a questionnaire for predicting caries risk in children. The study was conducted on 62 pairs of mothers and their children, aged between 3 and 5 years. The questionnaire consists of 10 questions concerning mothers' attitude and knowledge about oral health. Reliability and validity testing was based on Cronbach's alpha and correlation coefficient values. All questions were reliable (Cronbach's alpha = 0.873) and valid (corrected item-total correlation > 0.4). Five questions on mothers' attitude toward oral health and five questions on mothers' knowledge of oral health are reliable and valid for predicting caries risk in children.
Disruptive behaviors in the classroom: initial standardization data on a new teacher rating scale.
Burns, G L; Owen, S M
1990-10-01
This study presents initial standardization data on the Sutter-Eyberg Student Behavior Inventory (SESBI), a teacher-completed measure of disruptive classroom behaviors. SESBIs were completed on 1116 children in kindergarten through fifth grade in a rural eastern Washington school district. Various analyses (Cronbach's alpha, corrected item-total correlations, average interitem correlations, principal components analyses) indicated that the SESBI provides a homogeneous measure of disruptive behaviors. Support was also found for three factors within the scale (e.g., overt aggression, oppositional behavior, and attentional difficulties). While the child's age did not have a significant effect on the SESBI, the child's gender did have a significant effect on scale scores as well as on most of the items, with males being rated more problematic than females. The SESBI was also able to discriminate between children in treatment for behavioral problems or learning disabilities and children not in treatment.
Feedback-related brain activity predicts learning from feedback in multiple-choice testing.
Ernst, Benjamin; Steinhauser, Marco
2012-06-01
Different event-related potentials (ERPs) have been shown to correlate with learning from feedback in decision-making tasks and with learning in explicit memory tasks. In the present study, we investigated which ERPs predict learning from corrective feedback in a multiple-choice test, which combines elements from both paradigms. Participants worked through sets of multiple-choice items of a Swahili-German vocabulary task. Whereas the initial presentation of an item required the participants to guess the answer, corrective feedback could be used to learn the correct response. Initial analyses revealed that corrective feedback elicited components related to reinforcement learning (FRN), as well as to explicit memory processing (P300) and attention (early frontal positivity). However, only the P300 and early frontal positivity were positively correlated with successful learning from corrective feedback, whereas the FRN was even larger when learning failed. These results suggest that learning from corrective feedback crucially relies on explicit memory processing and attentional orienting to corrective feedback, rather than on reinforcement learning.
Development of a refractive error quality of life scale for Thai adults (the REQ-Thai).
Sukhawarn, Roongthip; Wiratchai, Nonglak; Tatsanavivat, Pyatat; Pitiyanuwat, Somwung; Kanato, Manop; Srivannaboon, Sabong; Guyatt, Gordon H
2011-08-01
To develop a scale for measuring refractive error quality of life (QOL) for Thai adults. The full survey comprised 424 respondents from 5 medical centers in Bangkok and from 3 medical centers in Chiangmai, Songkla and KhonKaen provinces. Participants were emmetropes and persons with refractive correction with visual acuity of 20/30 or better. An item reduction process was employed, combining 3 methods: expert opinion, the impact method and the item-total correlation method. Classical reliability testing and validity testing, including convergent, discriminative and construct validity, were performed. The developed questionnaire comprised 87 items in 6 dimensions: 1) quality of vision, 2) visual function, 3) social function, 4) psychological function, 5) symptoms and 6) refractive correction problems. It uses a 5-level Likert scale. The Cronbach's alpha coefficients of its dimensions ranged from 0.756 to 0.979. All validity tests showed the scale to be valid. The construct validity was validated by confirmatory factor analysis. A short version of the questionnaire, comprising 48 items with good reliability and validity, was also developed. This is the first validated instrument for measuring refractive error quality of life for Thai adults that was developed with strong research methodology and a large sample size.
Liegl, Gregor; Rose, Matthias; Correia, Helena; Fischer, H Felix; Kanlidere, Sibel; Mierke, Annett; Obbarius, Alexander; Nolte, Sandra
2018-01-01
To translate the PROMIS Physical Function (PF) item bank version 1.2 into German and to investigate psychometric properties of resulting full bank and seven derived short forms. Cross-sectional psychometric study. Inpatient and outpatient clinics of the Department of Psychosomatic Medicine at Charité-Universitätsmedizin Berlin, Germany. A total of 10 adult patients with various chronic diseases participated in cognitive debriefing interviews. The final item bank was administered to n = 266 adult patients with a broad range of medical conditions. Patient-reported outcome assessment as part of routine care. PROMIS v1.2 PF bank; MOS SF-36 PF scale (PF-10). Cross-cultural adaptation of the item bank followed established guidelines. For the final German translation, the corrected item-total correlations ranged from 0.44 to 0.84. Cronbach's alpha was high for each PROMIS PF short form (α = 0.88-0.96). The full PROMIS PF bank and most short forms correlated highly with the SF-36 PF-10 (r = 0.85-0.90), with the exception of PROMIS Upper Extremity (r = 0.64). PROMIS Upper Extremity showed ceiling effects and lower agreement with the full bank than other short forms. Unidimensionality was supported for all PROMIS PF measures using traditional factor analysis and nonparametric item response theory. The German PROMIS PF bank was found to be conceptually equivalent to the English version and fulfilled the psychometric requirements for use of short forms in clinical practice. Future studies should pay particular attention to samples with upper extremity functional limitations to further investigate the dimensional structure of PF as conceptualized according to PROMIS.
Elders Health Empowerment Scale: Spanish adaptation and psychometric analysis.
Serrani Azcurra, Daniel Jorge Luis
2014-01-01
Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. Important as the concept is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish-speaking people. The objective was to translate and adapt the Health Empowerment Scale (HES) for a sample of Spanish-speaking older adults and to perform its psychometric validation. The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and psychometric analyses were conducted on 648 urban-dwelling seniors. The HES had an acceptable internal consistency with a Cronbach's α of 0.89. The convergent validity was supported by significant Pearson's correlation coefficients between the HES total and item scores and the General Self Efficacy Scale (r = 0.77), Swedish Rheumatic Disease Empowerment Scale (r = 0.69) and Making Decisions Empowerment Scale (r = 0.70). Construct validity was evaluated using item analysis, the split-half test and corrected item-to-total correlation coefficients, with good internal consistency (α > 0.8). The content validity was supported by Scale and Item Content Validity Index of 0.98 and 1.0, respectively. The HES had acceptable face validity and reliability coefficients, which, added to its ease of administration and users' unbiased comprehension, could make it a suitable tool for evaluating elders' outpatient empowerment-based medical education programs.
Elders Health Empowerment Scale
2014-01-01
Introduction: Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. Important as the concept is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish-speaking people. Objective: Translate and adapt the Health Empowerment Scale (HES) for a sample of Spanish-speaking older adults and perform its psychometric validation. Methods: The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and psychometric analyses were conducted on 648 urban-dwelling seniors. Results: The HES had an acceptable internal consistency with a Cronbach's α of 0.89. The convergent validity was supported by significant Pearson's correlation coefficients between the HES total and item scores and the General Self Efficacy Scale (r = 0.77), Swedish Rheumatic Disease Empowerment Scale (r = 0.69) and Making Decisions Empowerment Scale (r = 0.70). Construct validity was evaluated using item analysis, the split-half test and corrected item-to-total correlation coefficients, with good internal consistency (α > 0.8). The content validity was supported by Scale and Item Content Validity Index of 0.98 and 1.0, respectively. Conclusions: The HES had acceptable face validity and reliability coefficients, which, added to its ease of administration and users' unbiased comprehension, could make it a suitable tool for evaluating elders' outpatient empowerment-based medical education programs. PMID:25767307
Nessen, Thomas; Demmelmaier, Ingrid; Nordgren, Birgitta; Opava, Christina H
2015-01-01
The aim of the present study was to investigate aspects of reliability and validity of the Exercise Self-Efficacy Scale (ESES-S) in a rheumatoid arthritis (RA) population. A total of 244 people with RA participating in a physical activity study were included. The six-item ESES-S, exploring confidence in performing exercise, was assessed for test-retest reliability over 4-6 months, and for internal consistency. Construct validity investigated correlation with similar and other constructs. An intraclass correlation coefficient (ICC) of 0.59 (95% CI 0.37-0.73) was found for 84 participants with stable health perceptions between measurement occasions. Cronbach's alpha coefficients of 0.87 and 0.89 were found at the first and second measurements. Corrected item-total correlations for single ESES-S items ranged between 0.53 and 0.73. Construct convergent validity for the ESES-S was partly confirmed by correlations with health-enhancing physical activity and outcome expectations respectively (Pearson's r = 0.18, p < 0.01). Construct divergent validity was confirmed by the absence of correlations with age or gender. No floor or ceiling effects were found for ESES-S. The results indicate that the ESES-S has moderate test-retest reliability and respectable internal consistency in people with RA. Construct validity was partially supported in the present sample. Further research on construct validity of the ESES-S is recommended. Physical exercise is crucial for management of symptoms and co-morbidity in rheumatoid arthritis. Self-efficacy for exercise is important to address in rehabilitation as it regulates exercise motivation and behavior. Measurement properties of self-efficacy scales need to be assessed in specific populations and different languages.
Development of a Facebook Addiction Scale.
Andreassen, Cecilie Schou; Torsheim, Torbjørn; Brunborg, Geir Scott; Pallesen, Ståle
2012-04-01
The Bergen Facebook Addiction Scale (BFAS), initially a pool of 18 items, three reflecting each of the six core elements of addiction (salience, mood modification, tolerance, withdrawal, conflict, and relapse), was constructed and administered to 423 students together with several other standardized self-report scales (Addictive Tendencies Scale, Online Sociability Scale, Facebook Attitude Scale, NEO-FFI, BIS/BAS scales, and Sleep questions). Within each of the six addiction elements, the item with the highest corrected item-total correlation was retained in the final scale. The factor structure of the scale was good (RMSEA = .046, CFI = .99) and coefficient alpha was .83. The 3-week test-retest reliability coefficient was .82. The scores converged with scores for other scales of Facebook activity. Also, they were positively related to Neuroticism and Extraversion, and negatively related to Conscientiousness. High scores on the new scale were associated with delayed bedtimes and rising times.
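The selection rule described above, keeping one item per element by corrected item-total correlation, can be sketched as below. The item identifiers and correlation values used in the usage note are invented for illustration and are not BFAS data; `pick_best_per_element` is a hypothetical helper name.

```python
def pick_best_per_element(items):
    """items: iterable of (item_id, element, citc) tuples.
    Returns {element: item_id} for the item with the highest corrected
    item-total correlation within each element (first one wins on ties)."""
    best = {}
    for item_id, element, citc in items:
        if element not in best or citc > best[element][1]:
            best[element] = (item_id, citc)
    return {element: item_id for element, (item_id, _) in best.items()}
```

With hypothetical input such as `[("q1", "salience", 0.41), ("q2", "salience", 0.58), ("q4", "tolerance", 0.62)]`, the function returns `{"salience": "q2", "tolerance": "q4"}`. Applied to an 18-item pool with three items per element, this yields a six-item scale.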
Ozturk, Erhan Arif; Kocer, Bilge Gonenli; Umay, Ebru; Cakci, Aytul
2018-06-07
The objectives of the present study were to translate and cross-culturally adapt the English version of the Parkinson Fatigue Scale into Turkish, to evaluate its psychometric properties, and to compare them with those of other language versions. A total of 144 patients with idiopathic Parkinson disease were included in the study. The Turkish version of the Parkinson Fatigue Scale was evaluated for data quality, scaling assumptions, acceptability, reliability, and validity. The questionnaire response rate was 100% for both test and retest. The percentage of missing data was zero for items, and the percentage of computable scores was 100%. Floor and ceiling effects were absent. The Parkinson Fatigue Scale provides acceptable internal consistency (Cronbach's alpha was 0.974 for the first test and 0.964 for the retest, and corrected item-to-total correlations ranged from 0.715 to 0.906) and test-retest reliability (Cohen's kappa coefficients ranged from 0.632 to 0.786 for individual items, and the intraclass correlation coefficient was 0.887 for the overall Parkinson Fatigue Scale Score). An exploratory factor analysis of the items revealed a single factor explaining 71.7% of variance. The goodness-of-fit statistics for the one-factorial confirmatory factor analysis were Tucker Lewis index = 0.961, comparative fit index = 0.971 and root mean square error of approximation = 0.077 for a single factor. The average Parkinson Fatigue Scale Score was correlated significantly with sociodemographic data, clinical characteristics and scores of rating scales. The Turkish version of the Parkinson Fatigue Scale seems to be culturally well adapted and to have good psychometric properties. The scale can be used in further studies to assess fatigue in patients with Parkinson's disease.
Hockenberry, S L; Billingham, R E
1987-12-01
Two hundred twenty-five [corrected] respondents (109 [corrected] heterosexuals and 116 [corrected] homosexuals) completed a survey containing a 20-item Boyhood Gender Conformity Scale (BGCS). This scale was largely composed of edited and abridged gender items from Part A of Freund et al.'s Feminine Gender Identity Scale (FGIS-A) and Whitam's "childhood indicators." The combined scale was developed in an attempt to obtain a reliable, valid, and potent discriminating instrument for accurately classifying adult male respondents for sexual orientation on the basis of their reported boyhood gender conformity or nonconforming behavior and identity. In addition, 33% of these respondents were administered the original FGIS-A and Whitam inventory during a 2-week test-retest analysis conducted to determine the validity and reliability of the new instrument. All the original items significantly discriminated between heterosexual and homosexual respondents. From these a 13-item function and a 5-item function proved to be the most powerful discriminators between the two groups. Significant correlations between each of the three scales and a very high test-retest correlation coefficient supported the reliability and validity assumption for the BGCS. The conclusion was made that the five-item function (playing with boys, preferring [corrected] boys' games, imagining self as sports figure, reading adventure and sports stories, considered a "sissy") was the most potent and parsimonious discriminator among adult males for sexual orientation. It was similarly noted that the absence of masculine behaviors and traits appeared to be a more powerful predictor of later homosexual orientation than the traditionally feminine or cross-sexed traits and behaviors.
Measuring Decision-Making During Thyroidectomy: Validity Evidence for a Web-Based Assessment Tool.
Madani, Amin; Gornitsky, Jordan; Watanabe, Yusuke; Benay, Cassandre; Altieri, Maria S; Pucher, Philip H; Tabah, Roger; Mitmaker, Elliot J
2018-02-01
Errors in judgment during thyroidectomy can lead to recurrent laryngeal nerve injury and other complications. Despite the strong link between patient outcomes and intraoperative decision-making, methods to evaluate these complex skills are lacking. The purpose of this study was to develop objective metrics to evaluate advanced cognitive skills during thyroidectomy and to obtain validity evidence for them. An interactive online learning platform was developed ( www.thinklikeasurgeon.com ). Trainees and surgeons from four institutions completed a 33-item assessment, developed based on a cognitive task analysis and expert Delphi consensus. Sixteen items required subjects to make annotations on still frames of thyroidectomy videos, and accuracy scores were calculated based on an algorithm derived from experts' responses ("visual concordance test," VCT). Seven items were short answer (SA), requiring users to type their answers, and scores were automatically calculated based on their similarity to a pre-populated repertoire of correct responses. Test-retest reliability, internal consistency, and correlation of scores with self-reported experience and training level (novice, intermediate, expert) were calculated. Twenty-eight subjects (10 endocrine surgeons and otolaryngologists, 18 trainees) participated. There was high test-retest reliability (intraclass correlation coefficient = 0.96; n = 10) and internal consistency (Cronbach's α = 0.93). The assessment demonstrated significant differences between novices, intermediates, and experts in total score (p < 0.01), VCT score (p < 0.01) and SA score (p < 0.01). There was high correlation between total case number and total score (ρ = 0.95, p < 0.01), between total case number and VCT score (ρ = 0.93, p < 0.01), and between total case number and SA score (ρ = 0.83, p < 0.01). 
This study describes the development of novel metrics and provides validity evidence for an interactive Web-based platform to objectively assess decision-making during thyroidectomy.
Classical Item Analysis Using Latent Variable Modeling: A Note on a Direct Evaluation Procedure
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2011-01-01
A directly applicable latent variable modeling procedure for classical item analysis is outlined. The method allows one to point and interval estimate item difficulty, item correlations, and item-total correlations for composites consisting of categorical items. The approach is readily employed in empirical research and as a by-product permits…
Ekim, Ayfer; Hecan, Melis; Oren, Serkan
Childhood chronic diseases have a great impact, including physiological, social and financial burdens, on parents. The concept of "caregiver burden" is gaining importance to understand the effects of allergic diseases and plan family-centered strategies. The purpose of this study was to examine the psychometric properties of the Caregiver Burden Index (CBI) in Turkish mothers of children with allergies. The participants of this methodological study were 213 mothers of children with allergies aged between 6 and 12 years. Construct validity was evaluated through factor analysis and reliability was evaluated through internal consistency and item-total correlation. In reliability analysis, the overall Cronbach's alpha value (0.85) demonstrated a high level of reliability. The corrected item-total correlation varied between 0.63 and 0.84. Exploratory factor analysis showed that a 3-factor structure explained 73.6% of the total variance. This study indicated that the CBI is a valid and reliable tool to assess the caregiver burden of mothers of Turkish children with allergies. The results of this study contribute to the development and implementation of evidence based models of care that address the caregiver burden needs of parents whose children have allergies. Copyright © 2017 Elsevier Inc. All rights reserved.
Legal Professionals' Knowledge of Eyewitness Testimony in China: A Cross-Sectional Survey
Jiang, Lina; Luo, Dahua
2016-01-01
Purpose: To examine legal professionals’ knowledge of a wide range of factors that affect eyewitness accuracy in China. Methods: A total of 812 participants, including 210 judges, 244 prosecutors, 202 police officers, and 156 defense attorneys, were asked to respond to 12 statements about eyewitness testimony and 3 basic demographic questions (i.e., gender, age, and prior experience). Results: Although the judges and the defense attorneys had a somewhat higher number of correct responses than the other two groups, all groups showed limited knowledge of eyewitness testimony. In addition, the participants’ responses to only four items (i.e., weapon focus, attitude and expectations, child suggestibility, and the impact of stress) were roughly unanimous within the four legal professional groups. Legal professionals’ gender showed no significant correlations with their knowledge of eyewitness testimony. Prior experiences were significantly and negatively correlated with the item on the knowledge of forgetting curve among judges but positively correlated with two items (i.e., attitudes and exposure time) among defense attorneys and with 4 statements (i.e., the knowledge of attitudes and expectations, impact of stress, child witness accuracy, and exposure time) among prosecutors. Conclusions: The findings suggest that knowledge of the factors that influence eyewitness accuracy must be more effectively communicated to legal professionals in the future. PMID:26828933
Nicolais, Christina J; Bernstein, Ruth; Riekert, Kristin A; Quittner, Alexandra L
2018-02-01
Cystic fibrosis (CF) is a life-shortening, burdensome disease requiring complex knowledge to manage the disease. Significant gaps in knowledge have been documented for parents, which may lead to unintentionally poor adherence and insufficient transfer of treatment responsibility from parents to adolescents. There are no current, validated measures of parent knowledge for this population and there are no measures that assess the knowledge required for day-to-day behavioral management of CF. We assessed the psychometric properties of the parent version of the Knowledge of Disease Management-Cystic Fibrosis measure (KDM-CF-P) using data from iCARE (I Change Adherence and Raise Expectations), a randomized control adherence intervention trial. A total of 196 parents in the iCARE standard care/control arm completed 35 items assessing their knowledge of disease management at their 12-month study visit, prior to beginning the intervention. Items were eliminated from the measure if they met the threshold for ceiling effects, were deemed clinically irrelevant, or did not correlate well with their intended scale. Item-to-total correlations, confirmatory factor analysis, discriminant function, reliability, and convergent validity were calculated. The KDM-CF-P (19 items) demonstrated internal consistency of KR20 = 0.60 on each scale and a two-scale structure. Convergent validity for knowledge scores was found with maternal education, family income, and type of medical insurance. Parents correctly answered approximately 85% of items on the KDM-CF-P. The KDM-CF-P psychometrics support a two-scale measure with clinical utility. It is useful for assessing gaps in knowledge that can be remediated through individualized, tailored interventions. © 2017 Wiley Periodicals, Inc.
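The KR20 coefficient reported above is the Kuder-Richardson formula 20, the dichotomous-item counterpart of Cronbach's alpha. A minimal sketch, assuming responses are coded 0/1 with one list per respondent (the function name and data layout are illustrative, not taken from the study):

```python
from statistics import pvariance

def kr20(rows):
    """Kuder-Richardson 20 for 0/1 item responses.
    KR20 = k/(k-1) * (1 - sum(p_j * q_j) / variance of total scores),
    where p_j is the proportion answering item j correctly and q_j = 1 - p_j."""
    n = len(rows)
    k = len(rows[0])
    p = [sum(r[j] for r in rows) / n for j in range(k)]
    sum_pq = sum(pj * (1 - pj) for pj in p)
    # Population variance keeps the total-score variance consistent with p*q.
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum_pq / total_var)
```

The coefficient is undefined when every respondent has the same total score, since the total-score variance is then zero.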
de Chastelaine, Marianne; Mattson, Julia T.; Wang, Tracy H.; Donley, Brian E.; Rugg, Michael D.
2016-01-01
Using fMRI, subsequent memory effects (greater activity for later remembered than later forgotten study items) predictive of associative encoding were compared across samples of young, middle-aged and older adults (total n = 136). During scanning, participants studied visually presented word pairs. In a later test phase, they discriminated between studied pairs, ‘rearranged’ pairs (items studied on different trials) and new pairs. Subsequent memory effects were identified by contrasting activity elicited by study pairs that went on to be correctly judged intact or incorrectly judged rearranged. Effects in the hippocampus were age-invariant and positively correlated across participants with associative memory performance. Subsequent memory effects in the right IFG were greater in the older than the young group. In older participants only, both left and, in contrast to prior reports, right IFG subsequent memory effects correlated positively with memory performance. We suggest that the IFG is especially vulnerable to age-related decline in functional integrity, and that the relationship between encoding-related activity in right IFG and memory performance depends on the experimental context. PMID:27143433
[Severe intimate partner violence risk prediction scale-revised].
Echeburúa, Enrique; Amor, Pedro Javier; Loinaz, Ismael; de Corral, Paz
2010-11-01
The aim of this study was to describe the psychometric properties of the Severe Intimate Partner Violence Risk Prediction Scale and to revise it in order to weight the 20 items according to their discriminant capacity and to solve the missing item problem. The sample for this study consisted of 450 male batterers who were reported to the police station. The victims were classified as high-risk (18.2%), moderate-risk (45.8%) and low-risk (36%), depending on the cutoff scores in the original scale. Internal consistency (Cronbach's alpha=.72) and interrater reliability (r=.73) were acceptable. The point biserial correlation coefficient between each item and the corrected total score of the 20-item scale was calculated to determine the most discriminative items, which were associated with the context of intimate partner violence in the last month, with the male batterer's profile and with the victim's vulnerability. A revised scale (EPV-R) with new cutoff scores and indications on how to deal with the missing items was proposed in accordance with these results. This easy-to-use tool appears to be suitable to the requirements of criminal justice professionals and is intended for use in safety planning. Implications of these results for further research are discussed.
Item Selection and Pre-equating with Empirical Item Characteristic Curves.
ERIC Educational Resources Information Center
Livingston, Samuel A.
An empirical item characteristic curve shows the probability of a correct response as a function of the student's total test score. These curves can be estimated from large-scale pretest data. They enable test developers to select items that discriminate well in the score region where decisions are made. A similar set of curves can be used to…
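An empirical item characteristic curve of the kind described above can be estimated by grouping examinees by total score and taking the proportion who answered the item correctly in each group. The sketch below assumes 0/1 scored responses and an unsmoothed estimate; real pretest data would usually need smoothing or score-band grouping, which is not shown, and the function name is hypothetical.

```python
from collections import defaultdict

def empirical_icc(responses, item):
    """responses: list of 0/1 lists, one per examinee.
    Returns {total_score: proportion answering `item` correctly},
    ordered by increasing total score."""
    counts = defaultdict(lambda: [0, 0])  # total score -> [n correct, n examinees]
    for row in responses:
        total = sum(row)
        counts[total][0] += row[item]
        counts[total][1] += 1
    return {score: correct / n for score, (correct, n) in sorted(counts.items())}
```

An item that discriminates well near a decision point shows a steep rise in this curve around the corresponding total score.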
ERIC Educational Resources Information Center
Jin, Ying; Myers, Nicholas D.; Ahn, Soyeon
2014-01-01
Previous research has demonstrated that differential item functioning (DIF) methods that do not account for multilevel data structure could result in too frequent rejection of the null hypothesis (i.e., no DIF) when the intraclass correlation coefficient (ρ) of the studied item was the same as the ρ of the total score. The current study extended…
Reliability of adapted version of Italian Label tobacco Impact Index for the adolescent: ALII.
Guerra, F; Mannocci, A; Colamesta, V; De Luca, G; Fiore, M; Firenze, A; Ferrara, M; Langiano, E; De Vito, E; Bonaccorsi, G; La Torre, G
2017-01-01
The aim of this study is to assess the reliability of the Adolescent Label Impact Index (ALII), an adolescent-adapted version of the Italian LII for tobacco product warnings. A sample of students aged 13-15 years was considered. The ALII consists of 4 items: salience, harm, quitting and forgo. The questionnaire was self-administered to study participants twice, with 3 days between administrations (T1 and T2), to measure reliability. Internal consistency was assessed using Cronbach's alpha and Corrected Item-Total Correlations (CITC), and test-retest reliability using Pearson's correlation. Cronbach's alpha ranged from 0.625 at T1 to 0.715 at T2. "Salience" was the item with the lowest CITC value (0.281). Pearson's coefficient was r=0.909 (p<0.001). The instrument is low in cost and easy to administer and analyse in a setting of people aged 13-15 years. The ALII showed acceptable consistency and excellent stability over time. However, attention has to be paid when the ALII is administered to non-smoking teens and to those who have never seen tobacco product labels, to allow an appropriate interpretation of the data collected.
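Cronbach's alpha and the corrected item-total correlation (CITC) reported above follow standard formulas: alpha = k/(k-1) × (1 − Σ item variances / variance of the total score), and CITC is the Pearson correlation between an item and the sum of the remaining items. The sketch below assumes complete data; the function names and the response matrix are hypothetical, not the study's data.

```python
from statistics import mean, variance

def pearson(x, y):
    """Pearson correlation between two equally long sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def cronbach_alpha(rows):
    """rows: one list of item scores per respondent (no missing values)."""
    k = len(rows[0])
    item_var = sum(variance(col) for col in zip(*rows))
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_var / total_var)

def corrected_item_total(rows):
    """Correlate each item with the total of the other items."""
    result = []
    for j, col in enumerate(zip(*rows)):
        rest = [sum(r) - r[j] for r in rows]  # total score excluding item j
        result.append(pearson(col, rest))
    return result

# Hypothetical responses: 5 respondents x 4 items.
responses = [
    [1, 2, 1, 2],
    [2, 2, 3, 2],
    [3, 3, 3, 4],
    [4, 4, 5, 4],
    [5, 4, 5, 5],
]
alpha = cronbach_alpha(responses)
citc = corrected_item_total(responses)
```

Excluding the item from the total is what makes the correlation "corrected": it avoids inflating the coefficient by correlating an item with itself, which matters most for short scales such as a 4-item index.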
Ng, S S W; Lo, A W Y; Leung, T K S; Chan, F S M; Wong, A T Y; Lam, R W T; Tsang, D K Y
2014-03-01
Quality of life outcomes are useful in the assessment of mental and social wellbeing and for informed health care decision-making, especially in the choice of interventions in psychiatric rehabilitation. In its original form, the Warwick-Edinburgh Mental Well-being Scale (WEMWBS) is a proven reliable and valid tool for assessing quality of life in normal adults, but not in adults from Asian countries. A shortened 7-item version of WEMWBS (SWEMWBS) with good internal construct validity was used for this study. The present study describes the translation of WEMWBS from English to Chinese and its validation in a sample of Chinese-speaking patient population. Participants included patients admitted to the inpatient units, and those attending the day hospital and outpatient units of the Kowloon Hospital (n = 126). Translation was performed using the multiple forward and backward translation protocol. Patients also completed the 5-item World Health Organization Well-being Index (WHO5) questionnaire. A case therapist completed the Brief Psychiatric Rating Scale within 2 days. A total of 20 patients were selected for test-retest measurements performed after 2 weeks. The sample displayed a normal distribution of the Chinese version of SWEMWBS (C-SWEMWBS) scores (mean ± standard deviation, 23.16 ± 5.39; skewness, -0.068; kurtosis, -0.355). Internal reliability coefficient (Cronbach's alpha) for C-SWEMWBS was 0.89 which was consistent with that of English version. The corrected item-total correlation was high with Spearman's rank correlation coefficients ranging from 0.57 (item 6) to 0.75 (item 5). Good test-retest reliability was observed (r = 0.677; p = 0.001). Principal components factor analysis identified a single component (eigenvalues, 4.28; 61.1% variance), similar to the English version. Scores of C-SWEMWBS were positively correlated with the scores of WHO5 (r = 0.49; p < 0.001), suggesting good concurrent validity. 
A few item scores, including 'feeling useful', 'dealing with problems well', and 'able to make decisions', as well as the total score, were significantly correlated with diagnostic groups (p < 0.05). Education and diagnosis of mental illness were valid predictors for C-SWEMWBS (F = 5.41; p = 0.01). There were no effects due to age and gender. The C-SWEMWBS showed high levels of internal consistency and reliability against accepted criteria. It is short, acceptable, and culturally meaningful to clients with mental illness. Further large-scale studies in normal subjects and varied patient groups are recommended to generalise the findings.
Relapse Risk Assessment for Schizophrenia Patients (RASP): A New Self-Report Screening Tool.
Velligan, Dawn; Carpenter, William; Waters, Heidi C; Gerlanc, Nicole M; Legacy, Susan N; Ruetsch, Charles
2018-01-01
The Relapse Assessment for Schizophrenia Patients (RASP) was developed as a six-question self-report screener that measures indicators of Increased Anxiety and Social Isolation to assess patient stability and predict imminent relapse. This paper describes the development and psychometric characteristics of the RASP. The RASP and Positive and Negative Syndrome Scale (PANSS) were administered to patients with schizophrenia (n=166) three separate times. Chart data were collected on a subsample of patients (n=81). Psychometric analyses of RASP included tests of reliability, construct validity, and concurrent validity of items. Factors from RASP were correlated with subscales from PANSS (sensitivity to change and criterion validity [agreement between RASP and evidence of relapse]). Test-retest reliability returned modest to strong agreement at the item level and strong agreement at the questionnaire level. RASP showed good item response curves and internal consistency for the total instrument and within each of the two subscales (Increased Anxiety and Social Isolation). RASP Total Score and subscales showed good concurrent validity when correlated with PANSS Total Score, Positive, Excitement, and Anxiety subscales. RASP correctly predicted relapse in 67% of cases, with good specificity and negative predictive power and acceptable positive predictive power and sensitivity. The reliability and validity data presented support the use of RASP in settings where addition of a brief self-report assessment of relapse risk among patients with schizophrenia may be of benefit. Ease of use and scoring, and the ability to administer without clinical supervision allows for routine administration and assessment of relapse risk.
Psychometric properties of the Italian version of the Cognitive Reserve Scale (I-CRS).
Altieri, Manuela; Siciliano, Mattia; Pappacena, Simona; Roldán-Tapia, María Dolores; Trojano, Luigi; Santangelo, Gabriella
2018-05-04
The original definition of cognitive reserve (CR) refers to the individual differences in cognitive performance after brain damage or pathology. Several proxies were proposed to evaluate CR (education, occupational attainment, premorbid IQ, leisure activities). Recently, some scales were developed to measure CR taking into account several cognitively stimulating activities. The aim of this study is to adapt the Cognitive Reserve Scale (I-CRS) for the Italian population and to explore its psychometric properties. I-CRS was administered to 547 healthy participants, ranging from 18 to 89 years old, along with neuropsychological and behavioral scales to evaluate cognitive functioning, depressive symptoms, and apathy. Cronbach's α, corrected item-total correlations, and the inter-item correlation matrix were calculated to evaluate the psychometric properties of the scale. Linear regression analysis was performed to build a correction grid of the I-CRS according to demographic variables. Correlational analyses were performed to explore the relationships between I-CRS and neuropsychological and behavioral scales. We found that age, sex, and education influenced the I-CRS score. Young adults and adults obtained higher I-CRS scores than elderly adults; women and participants with high educational attainment scored higher on I-CRS than men and participants with low education. I-CRS score correlated poorly with cognitive and depression scale scores, but moderately with apathy scale scores. I-CRS showed good psychometric properties and seemed to be a useful tool to assess CR in every adult life stage. Moreover, our findings suggest that apathy rather than depressive symptoms may interfere with the building of CR across the lifespan.
Dourado, Marcia C N; Mograbi, Daniel C; Santos, Raquel L; Sousa, Maria Fernanda B; Nogueira, Marcela L; Belfort, Tatiana; Landeira-Fernandez, Jesus; Laks, Jerson
2014-01-01
Despite the growing understanding of the conceptual complexity of awareness, there currently exists no instrument for assessing different domains of awareness in dementia. In the current study, the psychometric properties of a multidimensional awareness scale, the Assessment Scale of Psychosocial Impact of the Diagnosis of Dementia (ASPIDD), are explored in a sample of 201 people with dementia and their family caregivers. Cronbach's alpha was high (α = 0.87), indicating excellent internal consistency. The mean of corrected item-total correlation coefficients was moderate. ASPIDD presented a four-factor solution with a well-defined structure: awareness of activities of daily living, cognitive functioning and health condition, emotional state, and social functioning and relationships. Functional disability was positively correlated with total ASPIDD, unawareness of activities of daily living, cognitive functioning, and with emotional state. Caregiver burden was correlated with total ASPIDD scores and unawareness of cognitive functioning. The results suggest that ASPIDD is indeed a multidimensional scale, providing a reliable measure of awareness of disease in dementia. Further studies should explore the risk factors associated with different dimensions of awareness in dementia.
Lin, Chung-Ying; Lee, Tsung-Ying; Sun, Zih-Jie; Yang, Yi-Ching; Wu, Jin-Shang; Ou, Huang-Tz
2017-08-23
Although numerous health-related quality of life (HRQoL) instruments are available for patients with diabetes, the length of these measures may limit their feasibility in routine practice. Also, these measures do not distinguish items for generic and diabetes-specific HRQoL. This study aimed to develop a diabetes-specific quality of life questionnaire module (DMQoL) to be used in conjunction with the World Health Organization Quality of Life scale brief version (WHOQOL-BREF). One hundred seventeen patients with diabetes were enrolled from a medical center in Taiwan. The item content of DMQoL was constructed based on an extensive review of existing HRQoL instruments for diabetes, expert discussions and patient interviews. A series of psychometric tests were conducted to ensure the reliability and validity of DMQoL. The WHOQOL-BREF served as an existing HRQoL measure for construct validity testing. The response scale of DMQoL was adopted from the 5-point Likert scale of WHOQOL-BREF. A total of 10 items without ceiling or floor effects were selected from 20 items. Exploratory factor analysis (EFA) with parallel analysis and Rasch analysis concluded that the 10 items were embedded in the same underlying concept. The corrected item-total correlations and factor loadings from EFA were all above 0.4. The internal consistency of the 10 items was satisfactory (Cronbach's α = 0.84). The DMQoL total score was moderately correlated with that of WHOQOL-BREF (r = 0.48, p < 0.001). The known-group validity showed that patients with HbA1c ≤ 7% had significantly higher mean scores of DMQoL than did those with HbA1c > 8% (3.66 ± 0.47 vs. 3.41 ± 0.53; p = 0.037). The DMQoL, with only 10 items, is sensitive to changes in diabetes progression in early phases (e.g., glycemic changes).
The combination of WHOQOL-BREF and DMQoL provides a comprehensive picture of overall HRQoL in patients with diabetes and enhances the instrument's ability to detect clinically meaningful changes in diabetes.
The Dutch Activity Card Sort institutional version was reproducible, but biased against women.
Jong, A M; van Nes, F A; Lindeboom, R
2012-01-01
To examine the reproducibility of the institutional version of the Dutch Activity Card Sort (ACS-NL) and the possible presence of gender bias. Older rehabilitation inpatients (N = 52) were included. Intra- and inter-rater agreement for the ACS-NL total and subscale scores was examined by intraclass correlations (ICC), and agreement of individual items by the κ coefficient. Gender bias was examined by the proportion of men and women selecting an ACS item. ICC for inter-rater agreement of the ACS total score ranged between 0.78 and 0.87; ICC for intra-rater agreement ranged between 0.79 and 0.89. Median inter-rater κ for ACS-NL items was 0.72 (interquartile range, 0.62-0.80). The inter-rater agreement (κ = 0.43) and intra-rater agreement (κ = 0.39) for the five most important activities were lower. Twenty ACS activities favoured men and seven activities favoured women. As a result, men scored systematically higher on the ACS-NL than women. Logistic regression analysis correcting for activity engagement level confirmed our findings. The reproducibility of the ACS-NL was high. The ACS-NL institutional version score may be biased in favour of men.
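The item-level κ coefficient used above can be sketched for two raters as Cohen's kappa, (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e the agreement expected by chance from each rater's marginal frequencies. The abstract does not state which κ variant was used, so Cohen's unweighted kappa is an assumption here, and the ratings are hypothetical.

```python
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same cases."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    categories = freq_a.keys() | freq_b.keys()
    # Chance agreement: product of the two raters' marginal proportions.
    expected = sum(freq_a[c] * freq_b[c] for c in categories) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical ratings: whether each of 8 patients selected one activity card.
rater_1 = ["yes", "yes", "no", "no", "yes", "no", "yes", "no"]
rater_2 = ["yes", "yes", "no", "yes", "yes", "no", "no", "no"]
kappa = cohen_kappa(rater_1, rater_2)
```

Here the raters agree on 6 of 8 cases (0.75) and chance agreement is 0.5, so kappa is 0.5, noticeably lower than the raw agreement.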
Haring, Catharina M; Cools, Bernadette M; van der Meer, Jos Wm; Postma, Cornelis T
2014-04-08
Many practicing physicians lack skills in physical examination. It is not known whether physical examination skills already show deficiencies after an early phase of clinical training. At the end of the internal medicine clerkship students are expected to be able to perform a general physical examination in every new patient encounter. In a previous study, the basic physical examination items that should standardly be performed were set by consensus. The aim of the current observational study was to assess whether medical students were able to correctly perform a general physical examination regarding completeness as well as technique at the end of the clerkship internal medicine. One hundred students who had just finished their clerkship internal medicine were asked to perform a general physical examination on a standardized patient as they had learned during the clerkship. They were recorded on camera. Frequency of performance of each component of the physical examination was counted. Adequacy of performance was determined as either correct or incorrect or not assessable using a checklist of short descriptions of each physical examination component. A reliability analysis was performed by calculation of the intra class correlation coefficient for total scores of five physical examinations rated by three trained physicians and for their agreement on performance of all items. Approximately 40% of the agreed standard physical examination items were not performed by the students. Students put the most emphasis on examination of general parameters, heart, lungs and abdomen. Many components of the physical examination were not performed as was taught during precourses. Intra-class correlation was high for total scores of the physical examinations 0.91 (p <0.001) and for agreement on performance of the five physical examinations (0.79-0.92 p <0.001). 
In conclusion, performance of the general physical examination was already below expectation at the end of the internal medicine clerkship. Possible causes and suggestions for improvement are discussed.
Student performance of the general physical examination in internal medicine: an observational study
2014-01-01
Background Many practicing physicians lack skills in physical examination. It is not known whether physical examination skills already show deficiencies after an early phase of clinical training. At the end of the internal medicine clerkship students are expected to be able to perform a general physical examination in every new patient encounter. In a previous study, the basic physical examination items that should standardly be performed were set by consensus. The aim of the current observational study was to assess whether medical students were able to correctly perform a general physical examination regarding completeness as well as technique at the end of the clerkship internal medicine. Methods One hundred students who had just finished their clerkship internal medicine were asked to perform a general physical examination on a standardized patient as they had learned during the clerkship. They were recorded on camera. Frequency of performance of each component of the physical examination was counted. Adequacy of performance was determined as either correct or incorrect or not assessable using a checklist of short descriptions of each physical examination component. A reliability analysis was performed by calculation of the intra class correlation coefficient for total scores of five physical examinations rated by three trained physicians and for their agreement on performance of all items. Results Approximately 40% of the agreed standard physical examination items were not performed by the students. Students put the most emphasis on examination of general parameters, heart, lungs and abdomen. Many components of the physical examination were not performed as was taught during precourses. Intra-class correlation was high for total scores of the physical examinations 0.91 (p <0.001) and for agreement on performance of the five physical examinations (0.79-0.92 p <0.001). 
Conclusions In conclusion, performance of the general physical examination was already below expectation at the end of the internal medicine clerkship. Possible causes and suggestions for improvement are discussed. PMID:24712683
ERIC Educational Resources Information Center
Kim, Hyung Jin; Brennan, Robert L.; Lee, Won-Chan
2017-01-01
In equating, when common items are internal and scoring is conducted in terms of the number of correct items, some pairs of total scores ("X") and common-item scores ("V") can never be observed in a bivariate distribution of "X" and "V"; these pairs are called "structural zeros." This simulation…
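Why structural zeros arise can be shown with a short sketch. With number-correct scoring and internal common items, the total score X equals V plus the score on the non-common items, so a pair (X, V) is impossible whenever V exceeds X or X − V exceeds the number of non-common items. The function name and test lengths below are hypothetical.

```python
def structural_zeros(n_total, n_common):
    """List (X, V) pairs that can never occur when the n_common anchor
    items are part of the n_total-item test and scores are number correct."""
    n_unique = n_total - n_common  # items that are not common items
    zeros = []
    for x in range(n_total + 1):
        for v in range(n_common + 1):
            # X - V is the score on non-common items, which must lie
            # between 0 and n_unique inclusive.
            if v > x or x - v > n_unique:
                zeros.append((x, v))
    return zeros

# Hypothetical 5-item test containing 2 internal common items.
zeros = structural_zeros(n_total=5, n_common=2)
```

In this toy case 6 of the 18 cells in the bivariate distribution are structural zeros, all in the two corners of the table.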
Helvik, Anne-Sofie; Engedal, Knut; Skancke, Randi H; Selbæk, Geir
2011-10-01
Few psychometric studies of the Hospital Anxiety and Depression Scale (HADS) have been performed with clinical samples of elderly individuals. The participants were 484 elderly (65-101 years, 241 men) patients in an acute medical unit. The HADS, the Montgomery-Åsberg Depression Rating Scale (MADRS) and questionnaires assessing quality of life, functional impairment, and cognitive function were used. The psychometric evaluation of the HADS included the following analyses: 1) the internal construct validity by means of principal component analysis followed by an oblique rotation and corrected item-total correlation; 2) the internal consistency reliability by means of the alpha coefficient (Cronbach's) and 3) concurrent validity by means of Spearman's rho. We found a two-factor solution explaining 45% of the variance. Six of seven items loaded adequately (≥0.40) on the HADS-A subscale (item 7 did not) and five of seven items loaded adequately on the HADS-D subscale (items 8 and 10 did not). Cronbach's alpha for the HADS-A and HADS-D subscales was 0.78 and 0.71, respectively. The correlation between HADS-D and the MADRS, a measure of the concurrent validity, was 0.51. The HADS appears to differentiate well between depression and anxiety. The internal consistency of the HADS in a sample of elderly persons was as satisfactory as it is in samples with younger persons. In contrast to younger samples, item 8 ("I feel as if I have slowed down") did not load adequately on the HADS-D subscale. This may be attributed to the way elderly people experience and describe their symptoms.
Opposing effects of negative emotion on amygdalar and hippocampal memory for items and associations
Horner, Aidan J.; Hørlyck, Lone D.; Burgess, Neil
2016-01-01
Although negative emotion can strengthen memory of an event it can also result in memory disturbances, as in post-traumatic stress disorder (PTSD). We examined the effects of negative item content on amygdalar and hippocampal function in memory for the items themselves and for the associations between them. During fMRI, we examined encoding and retrieval of paired associates made up of all four combinations of neutral and negative images. At test, participants were cued with an image and, if recognised, had to retrieve the associated (target) image. The presence of negative images increased item memory but reduced associative memory. At encoding, subsequent item recognition correlated with amygdala activity, while subsequent associative memory correlated with hippocampal activity. Hippocampal activity was reduced by the presence of negative images, during encoding and correct associative retrieval. In contrast, amygdala activity increased for correctly retrieved negative images, even when cued by a neutral image. Our findings support a dual representation account, whereby negative emotion up-regulates the amygdala to strengthen item memory but down-regulates the hippocampus to weaken associative representations. These results have implications for the development and treatment of clinical disorders in which diminished associations between emotional stimuli and their context contribute to negative symptoms, as in PTSD. PMID:26969864
The Effect of SSM Grading on Reliability When Residual Items Have No Discriminating Power.
ERIC Educational Resources Information Center
Kane, Michael T.; Moloney, James M.
Gilman and Ferry have shown that when the student's score on a multiple choice test is the total number of responses necessary to get all items correct, substantial increases in reliability can occur. In contrast, similar procedures giving partial credit on multiple choice items have resulted in relatively small gains in reliability. The analysis…
Development and Validation of the Spanish Numeracy Understanding in Medicine Instrument.
Jacobs, Elizabeth A; Walker, Cindy M; Miller, Tamara; Fletcher, Kathlyn E; Ganschow, Pamela S; Imbert, Diana; O'Connell, Maria; Neuner, Joan M; Schapira, Marilyn M
2016-11-01
The Spanish-speaking population in the U.S. is large and growing and is known to have lower health literacy than the English-speaking population. Less is known about the health numeracy of this population due to a lack of health numeracy measures in Spanish. We aimed to develop and validate a short and easy to use measure of health numeracy for Spanish-speaking adults: the Spanish Numeracy Understanding in Medicine Instrument (Spanish-NUMi). Items were generated based on qualitative studies in English- and Spanish-speaking adults and translated into Spanish using a group translation and consensus process. Candidate items for the Spanish NUMi were selected from an eight-item validated English Short NUMi. Differential Item Functioning (DIF) analysis was conducted to evaluate equivalence between English and Spanish items. Cronbach's alpha was computed as a measure of reliability and a Pearson's correlation was used to evaluate the association between test scores and the Spanish Test of Functional Health Literacy (S-TOFHLA) and education level. Two hundred and thirty-two Spanish-speaking Chicago residents were included in the study. The study population was diverse in age, gender, and level of education and 70% reported Mexico as their country of origin. Two items of the English eight-item Short NUMi demonstrated DIF and were dropped. The resulting six-item test had a Cronbach's alpha of 0.72, a range of difficulty using classical test statistics (percent correct: 0.48 to 0.86), and adequate discrimination (item-total score correlation: 0.34-0.49). Scores were positively correlated with print literacy as measured by the S-TOFHLA (r = 0.67; p < 0.001) and varied as predicted across grade level; mean scores for up to eighth grade, ninth through twelfth grade, and some college experience or more, respectively, were 2.48 (SD ± 1.64), 4.15 (SD ± 1.45), and 4.82 (SD ± 0.37).
The Spanish NUMi is a reliable and valid measure of important numerical concepts used in communicating health information.
Durning, Steven J; Dong, Ting; Artino, Anthony R; van der Vleuten, Cees; Holmboe, Eric; Schuwirth, Lambert
2015-08-01
An ongoing debate exists in the medical education literature regarding the potential benefits of pattern recognition (non-analytic reasoning), actively comparing and contrasting diagnostic options (analytic reasoning) or using a combination approach. Studies have not, however, explicitly explored faculty's thought processes while tackling clinical problems through the lens of dual process theory to inform this debate. Further, these thought processes have not been studied in relation to the difficulty of the task or other potential mediating influences such as personal factors and fatigue, which could also be influenced by personal factors such as sleep deprivation. We therefore sought to determine which reasoning process(es) were used when answering clinically oriented multiple-choice questions (MCQs) and if these processes differed based on the dual process theory characteristics: accuracy, reading time and answering time as well as psychometrically determined item difficulty and sleep deprivation. We performed a think-aloud procedure to explore faculty's thought processes while taking these MCQs, coding think-aloud data based on reasoning process (analytic, nonanalytic, guessing or combination of processes) as well as word count, number of stated concepts, reading time, answering time, and accuracy. We also included questions regarding amount of work in the recent past. We then conducted statistical analyses to examine the associations between these measures such as correlations between frequencies of reasoning processes and item accuracy and difficulty. We also observed the total frequencies of different reasoning processes when questions were answered correctly and incorrectly. Regardless of whether the questions were classified as 'hard' or 'easy', non-analytical reasoning led to the correct answer more often than to an incorrect answer.
Significant correlations were found between self-reported recent number of hours worked with think-aloud word count and number of concepts used in the reasoning but not item accuracy. When all MCQs were included, 19 % of the variance of correctness could be explained by the frequency of expression of these three think-aloud processes (analytic, nonanalytic, or combined). We found evidence to support the notion that the difficulty of an item in a test is not a systematic feature of the item itself but is always a result of the interaction between the item and the candidate. Use of analytic reasoning did not appear to improve accuracy. Our data suggest that individuals do not apply either System 1 or System 2 but instead fall along a continuum with some individuals falling at one end of the spectrum.
ERIC Educational Resources Information Center
Nevid, Jeffrey S.; Cheney, Brianna; Thompson, Clarissa
2015-01-01
Students in an introductory psychology class rated their level of confidence in their answers to exam questions on four multiple-choice exams through the course of a semester. Correlations between confidence judgments and accuracy (correct vs. incorrect) at the individual item level showed modest but significant relationships for item sets scaled…
Argimon-Pallàs, Josep M; Flores-Mateo, Gemma; Jiménez-Villa, Josep; Pujol-Ribera, Enriqueta; Foz, Gonçal; Bundó-Vidiella, Magda; Juncosa, Sebastià; Fuentes-Bellido, Cruz M; Pérez-Rodríguez, Belén; Margalef-Pallarès, Francesc; Villafafila-Ferrero, Rosa; Forès-Garcia, Dolors; Roman-Martínez, Josep; Vilert-Garroga, Esther
2009-02-24
There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP) curricula with objective outcome measures. The Fresno test is an instrument that evaluates most EBP steps with high reliability and validity in the original English version. The present study aims to translate the Fresno questionnaire into Spanish and subsequently validate it to ensure the equivalence of the Spanish version against the English original. The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU). Participants will be: (a) tutors of Family Medicine residents (expert group); (b) Family Medicine residents in their second year of the Family Medicine training program (novice group), and (c) Family Medicine physicians (intermediate group). The questionnaire will be administered before and after an educational intervention. The educational intervention will consist of four interactive half-day sessions designed to develop the knowledge and skills required for EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using the Kappa coefficient for qualitative items and the intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. The validation of the Fresno questionnaire in different languages will enable wider use of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.
Mental health in primary care: an evaluation using the Item Response Theory.
Rocha, Hugo André da; Santos, Alaneir de Fátima Dos; Reis, Ilka Afonso; Santos, Marcos Antônio da Cunha; Cherchiglia, Mariângela Leal
2018-01-01
OBJECTIVE To determine the items of the Brazilian National Program for Improving Access and Quality of Primary Care that better evaluate the capacity to provide mental health care. METHODS This is a cross-sectional study carried out using the Graded Response Model of the Item Response Theory using secondary data from the second cycle of the National Program for Improving Access and Quality of Primary Care, which evaluates 30,523 primary care teams in the period from 2013 to 2014 in Brazil. The internal consistency, correlation between items, and correlation between items and the total score were tested using the Cronbach's alpha, Spearman's correlation, and point biserial coefficients, respectively. The assumptions of unidimensionality and local independence of the items were tested. Word clouds were used as one way to present the results. RESULTS The items with the greatest ability to discriminate were scheduling of the agenda according to risk stratification, keeping of records of the most serious cases of users in psychological distress, and provision of group care. The items that required a higher level of mental health care in the parameter of location were the provision of any type of group care and the provision of educational and mental health promotion activities. Total Cronbach's alpha coefficient was 0.87. The items that obtained the highest correlation with total score were the recording of the most serious cases of users in psychological distress and scheduling of the agenda according to risk stratification. The final scores obtained oscillated between -2.07 (minimum) and 1.95 (maximum). CONCLUSIONS There are important aspects in the discrimination of the capacity to provide mental health care by primary health care teams: risk stratification for care management, follow-up of the most serious cases, group care, and preventive and health promotion actions.
2017-03-15
61 BC .17 .91 .54 BC .28 .86 .60 AI .23 .77 .45 AI .07 .78 .50 T2 N = 5,199; S2 N = 8,164 5 Subtest-Level Analyses Descriptive ...GS) to .90 (IC) for Form S2. The reliability for both Forms S1 and S2 was .83. The highest item-total correlations for both Forms T1 and S1 were...The lowest item-total correlations for T1 and S1 occurred for VA (T1, .37; S1, .27). VA also had the lowest correlation for T2 and S2; VA (T2, .37
Laser Raman detection for oral cancer based on a Gaussian process classification method
NASA Astrophysics Data System (ADS)
Du, Zhanwei; Yang, Yongjian; Bai, Yuan; Wang, Lijun; Zhang, Chijun; Chen, He; Luo, Yusheng; Su, Le; Chen, Yong; Li, Xianchang; Zhou, Xiaodong; Jia, Jun; Shen, Aiguo; Hu, Jiming
2013-06-01
Oral squamous cell carcinoma is the most common neoplasm of the oral cavity. The incidence rate accounts for 80% of total oral cancer and shows an upward trend in recent years. It has a high degree of malignancy and is difficult to detect in terms of differential diagnosis, as a consequence of which the timing of treatment is always delayed. In this work, Raman spectroscopy was adopted to differentially diagnose oral squamous cell carcinoma and oral gland carcinoma. In total, 852 entries of raw spectral data which consisted of 631 items from 36 oral squamous cell carcinoma patients, 87 items from four oral gland carcinoma patients and 134 items from five normal people were collected by utilizing an optical method on oral tissues. The probability distribution of the datasets corresponding to the spectral peaks of the oral squamous cell carcinoma tissue was analyzed and the experimental result showed that the data obeyed a normal distribution. Moreover, the distribution characteristic of the noise was also in compliance with a Gaussian distribution. A Gaussian process (GP) classification method was utilized to distinguish the normal people and the oral gland carcinoma patients from the oral squamous cell carcinoma patients. The experimental results showed that all the normal people could be recognized. 83.33% of the oral squamous cell carcinoma patients could be correctly diagnosed and the remaining ones would be diagnosed as having oral gland carcinoma. For the classification process of oral gland carcinoma and oral squamous cell carcinoma, the correct ratio was 66.67% and the erroneously diagnosed percentage was 33.33%. The total sensitivity was 80% and the specificity was 100% with the Matthews correlation coefficient (MCC) set to 0.447 213 595. Considering the numerical results above, the application prospects and clinical value of this technique are significantly impressive.
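The sensitivity, specificity, and Matthews correlation coefficient quoted in the abstract above are all functions of a 2x2 confusion matrix. The following is a minimal sketch of those formulas only. The counts are hypothetical and are not taken from the study, and the function names are illustrative rather than the authors' code.

```python
import math


def sensitivity(tp: int, fn: int) -> float:
    """Share of true positives among all actual positives."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Share of true negatives among all actual negatives."""
    return tn / (tn + fp)


def matthews_corrcoef(tp: int, fp: int, tn: int, fn: int) -> float:
    """Matthews correlation coefficient from the four confusion-matrix cells."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # conventional value when a marginal total is zero
    return (tp * tn - fp * fn) / denom


# Hypothetical counts for illustration only (not the study's data).
tp, fp, tn, fn = 8, 0, 5, 2
print(sensitivity(tp, fn), specificity(tn, fp), matthews_corrcoef(tp, fp, tn, fn))
```

Note that the MCC is computed from the data like any other statistic; it is not a parameter that is set in advance.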
McAlinden, Colm; Pesudovs, Konrad; Moore, Jonathan E
2010-11-01
To develop an instrument to measure subjective quality of vision: the Quality of Vision (QoV) questionnaire. A 30-item instrument was designed with 10 symptoms rated in each of three scales (frequency, severity, and bothersome). The QoV was completed by 900 subjects in groups of spectacle wearers, contact lens wearers, and those having had laser refractive surgery, intraocular refractive surgery, or eye disease and investigated with Rasch analysis and traditional statistics. Validity and reliability were assessed by Rasch fit statistics, principal components analysis (PCA), person separation, differential item functioning (DIF), item targeting, construct validity (correlation with visual acuity, contrast sensitivity, total root mean square [RMS] higher order aberrations [HOA]), and test-retest reliability (two-way random intraclass correlation coefficients [ICC] and 95% repeatability coefficients [R(c)]). Rasch analysis demonstrated good precision, reliability, and internal consistency for all three scales (mean square infit and outfit within 0.81-1.27; PCA >60% variance explained by the principal component; person separation 2.08, 2.10, and 2.01 respectively; and minimal DIF). Construct validity was indicated by strong correlations with visual acuity, contrast sensitivity and RMS HOA. Test-retest reliability was evidenced by a minimum ICC of 0.867 and a minimum 95% R(c) of 1.55 units. The QoV Questionnaire consists of a Rasch-tested, linear-scaled, 30-item instrument on three scales providing a QoV score in terms of symptom frequency, severity, and bothersome. It is suitable for measuring QoV in patients with all types of refractive correction, eye surgery, and eye disease that cause QoV problems.
Doig, Emmah; Prescott, Sarah; Fleming, Jennifer; Cornwell, Petrea; Kuipers, Pim
2016-01-01
To examine the internal reliability and test-retest reliability of the Client-Centeredness of Goal Setting (C-COGS) scale. The C-COGS scale was administered to 42 participants with acquired brain injury after completion of multidisciplinary goal planning. Internal reliability of scale items was examined using item-partial total correlations and Cronbach's α coefficient. The scale was readministered within a 1-mo period to a subsample of 12 participants to examine test-retest reliability by calculating exact and close percentage agreement for each item. After examination of item-partial total correlations, test items were revised. The revised items demonstrated stronger internal consistency than the original items. Preliminary evaluation of test-retest reliability was fair, with an average exact percent agreement across all test items of 67%. Findings support the preliminary reliability of the C-COGS scale as a tool to evaluate and promote client-centered goal planning in brain injury rehabilitation. Copyright © 2016 by the American Occupational Therapy Association, Inc.
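Exact and close percentage agreement between two administrations of the same item can be sketched as below. This assumes that "close" means ratings within one scale point of each other, which the abstract does not state. The ratings and the function name are hypothetical.

```python
def percent_agreement(first, second, tolerance=0):
    """Percentage of paired ratings that differ by at most `tolerance` points.

    tolerance=0 gives exact agreement; tolerance=1 is used here as an
    assumed definition of "close" agreement.
    """
    if len(first) != len(second) or not first:
        raise ValueError("ratings must be paired and non-empty")
    hits = sum(1 for a, b in zip(first, second) if abs(a - b) <= tolerance)
    return 100.0 * hits / len(first)


# Hypothetical test and retest ratings for one item.
test_ratings = [3, 4, 2, 5]
retest_ratings = [3, 5, 2, 3]
print(percent_agreement(test_ratings, retest_ratings))               # exact
print(percent_agreement(test_ratings, retest_ratings, tolerance=1))  # close
```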
DOE Office of Scientific and Technical Information (OSTI.GOV)
Santi, Peter Angelo; Cutler, Theresa Elizabeth; Favalli, Andrea
In order to improve the accuracy and capabilities of neutron multiplicity counting, additional quantifiable information is needed in order to address the assumptions that are present in the point model. Extracting and utilizing higher order moments (Quads and Pents) from the neutron pulse train represents the most direct way of extracting additional information from the measurement data to allow for an improved determination of the physical properties of the item of interest. The extraction of higher order moments from a neutron pulse train required the development of advanced dead time correction algorithms which could correct for dead time effects in all of the measurement moments in a self-consistent manner. In addition, advanced analysis algorithms have been developed to address specific assumptions that are made within the current analysis model, namely that all neutrons are created at a single point within the item of interest, and that all neutrons that are produced within an item are created with the same energy distribution. This report will discuss the current status of implementation and initial testing of the advanced dead time correction and analysis algorithms that have been developed in an attempt to utilize higher order moments to improve the capabilities of correlated neutron measurement techniques.
CTTITEM: SAS macro and SPSS syntax for classical item analysis.
Lei, Pui-Wa; Wu, Qiong
2007-08-01
This article describes the functions of a SAS macro and an SPSS syntax that produce common statistics for conventional item analysis including Cronbach's alpha, item difficulty index (p-value or item mean), and item discrimination indices (D-index, point biserial and biserial correlations for dichotomous items and item-total correlation for polytomous items). These programs represent an improvement over the existing SAS and SPSS item analysis routines in terms of completeness and user-friendliness. To promote routine evaluations of item qualities in instrument development of any scale, the programs are available at no charge for interested users. The program codes along with a brief user's manual that contains instructions and examples are downloadable from suen.ed.psu.edu/-pwlei/plei.htm.
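For readers without SAS or SPSS, the core statistics named in the abstract above can be sketched in plain Python. This is not the CTTITEM code. It is an illustrative sketch that assumes complete data (no missing responses) and items with non-zero variance, and all function names and the response matrix are hypothetical.

```python
from statistics import mean, variance


def pearson(x, y):
    """Pearson correlation; undefined (division by zero) for constant input."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def cronbach_alpha(rows):
    """Cronbach's alpha; rows are respondents, columns are items."""
    k = len(rows[0])
    item_var = sum(variance(col) for col in zip(*rows))
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_var / total_var)


def corrected_item_total(rows):
    """Correlation of each item with the total score excluding that item."""
    totals = [sum(r) for r in rows]
    return [
        pearson(col, [t - v for t, v in zip(totals, col)])
        for col in zip(*rows)
    ]


def item_means(rows):
    """Item mean; for 0/1 items this is the difficulty index (p-value)."""
    return [mean(col) for col in zip(*rows)]


# Hypothetical dichotomous responses: 5 respondents by 4 items.
responses = [
    [1, 1, 0, 1],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 0, 1],
]
print(cronbach_alpha(responses))
print(corrected_item_total(responses))
print(item_means(responses))
```

For dichotomous items the corrected item-total correlation computed this way is a point-biserial correlation, since that is the Pearson correlation between a binary and a continuous variable.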
Costa, Daniel L da Conceição; Barbosa, Veronica S; Requena, Guaraci; Shavitt, Roseli G; Pereira, Carlos A de Bragança; Diniz, Juliana B
2017-10-01
We aimed to investigate which items of the Yale-Brown Obsessive-Compulsive Severity Scale best discriminate the reduction in total scores in obsessive-compulsive disorder patients after 4 and 12 weeks of pharmacological treatment. Data from 112 obsessive-compulsive disorder patients who received fluoxetine (⩽80 mg/day) for 12 weeks were included. Improvement indices were built for each Yale-Brown Obsessive-Compulsive Severity Scale item at two timeframes: from baseline to week 4 and from baseline to week 12. Indices for each item were correlated with the total scores for obsessions and compulsions and then ranked by correlation coefficient. A correlation coefficient ⩾0.7 was used to identify items that contributed significantly to reducing obsessive-compulsive disorder severity. At week 4, the distress items reached the threshold of 0.7 for improvement on the obsession and compulsion subscales although, contrary to our expectations, there was greater improvement in the control items than in the distress items. At week 12, there was greater improvement in the time, interference, and control items than in the distress items. The use of fluoxetine led first to reductions in distress and increases in control over symptoms before affecting the time spent on, and interference from, obsessions and compulsions. Resistance did not correlate with overall improvement. Understanding the pathway of improvement with pharmacological treatment in obsessive-compulsive disorder may provide clues about how to optimize the effects of medication.
Pick-N Multiple Choice-Exams: A Comparison of Scoring Algorithms
ERIC Educational Resources Information Center
Bauer, Daniel; Holzer, Matthias; Kopp, Veronika; Fischer, Martin R.
2011-01-01
To compare different scoring algorithms for Pick-N multiple correct answer multiple-choice (MC) exams regarding test reliability, student performance, total item discrimination and item difficulty. Data from six 3rd year medical students' end of term exams in internal medicine from 2005 to 2008 at Munich University were analysed (1,255 students,…
Exploration and confirmation of the latent variable structure of the Jefferson scale of empathy
LaNoue, Marianna
2014-01-01
Objectives: To reaffirm the underlying components of the JSE by using exploratory factor analysis (EFA), and to confirm its latent variable structure by using confirmatory factor analysis (CFA). Methods: Research participants included 2,612 medical students who entered Jefferson Medical College between 2002 and 2012. This sample was divided into two groups: matriculants between 2002 and 2007 (n=1,380) and between 2008 and 2012 (n=1,232). Data for 2002-2007 matriculants were subjected to EFA (principal component factor extraction), and data for matriculants of 2008-2012 were used for CFA (structural equation modeling, and root mean square error of approximation). Results: The EFA resulted in three factors: “perspective-taking,” “compassionate care” and “walking in patient’s shoes,” replicating the 3-factor model reported in most of the previous studies. The CFA showed that the 3-factor model was an acceptable fit, thus confirming the latent variable structure that emerged in the EFA. Corrected item-total score correlations for the total sample were all positive and statistically significant, ranging from 0.13 to 0.61 with a median of 0.44 (p<0.01). The item discrimination effect size indices (contrasting item mean scores for the top-third versus bottom-third JSE scorers) ranged from 0.50 to 1.4, indicating that the differences in item mean scores between top and bottom scorers on the JSE were of practical importance. Cronbach’s alpha coefficient of the JSE for the total sample was 0.80, ranging from 0.75 to 0.84 for matriculants of different years. Conclusions: Findings provided further support for underlying constructs of the JSE, adding to its credibility. PMID:25341215
Gubbels, Jessica S; Sleddens, Ester Fc; Raaijmakers, Lieke Ch; Gies, Judith M; Kremers, Stef Pj
2016-08-01
To develop and validate a questionnaire to measure food-related and activity-related practices of child-care staff, based on existing, validated parenting practices questionnaires. A selection of items from the Comprehensive Feeding Practices Questionnaire (CFPQ) and the Preschooler Physical Activity Parenting Practices (PPAPP) questionnaire was made to include items most suitable for the child-care setting. The converted questionnaire was pre-tested among child-care staff during cognitive interviews and pilot-tested among a larger sample of child-care staff. Factor analyses with Varimax rotation and internal consistencies were used to examine the scales. Spearman correlations, t tests and ANOVA were used to examine associations between the scales and staff's background characteristics (e.g. years of experience, gender). Child-care centres in the Netherlands. The qualitative pre-test included ten child-care staff members. The quantitative pilot test included 178 child-care staff members. The new questionnaire, the Child-care Food and Activity Practices Questionnaire (CFAPQ), consists of sixty-three items (forty food-related and twenty-three activity-related items), divided over twelve scales (seven food-related and five activity-related scales). The CFAPQ scales are to a large extent similar to the original CFPQ and PPAPP scales. The CFAPQ scales show sufficient internal consistency with Cronbach's α ranging between 0·53 and 0·96, and average corrected item-total correlations within acceptable ranges (0·30-0·89). Several of the scales were significantly associated with child-care staff's background characteristics. Scale psychometrics of the CFAPQ indicate it is a valid questionnaire that assesses child-care staff's practices related to both food and activities.
Salathé, Cornelia Rolli; Trippolini, Maurizio Alen; Terribilini, Livio Claudio; Oliveri, Michael; Elfering, Achim
2018-06-01
Purpose To develop a multidimensional scale to assess psychosocial beliefs, the Yellow Flag Questionnaire (YFQ), aimed at guiding interventions for workers with chronic musculoskeletal (MSK) pain. Methods Phase 1 consisted of item selection based on literature search, item development and expert consensus rounds. In phase 2, items were reduced by calculating a quality score per item, using structural equation modeling and confirmatory factor analysis on data from 666 workers. In phase 3, Cronbach's α and Pearson correlation coefficients were computed to compare the YFQ score with disability, anxiety, depression and self-efficacy, based on data from 253 injured workers. Regressions of YFQ total score on disability, anxiety, depression and self-efficacy were calculated. Results After phase 1, the YFQ included 116 items and 15 domains. Further reductions of items in phase 2 by applying the item quality criteria reduced the total to 48 items. Phase factor analysis with structural equation modeling confirmed 32 items in seven domains: activity, work, emotions, harm & blame, diagnosis beliefs, co-morbidity and control. Cronbach's α was 0.91 for the total score and between 0.49 and 0.81 for the seven domain scores. Correlations of the YFQ total score with disability, anxiety, depression and self-efficacy were .58, .66, .73 and -.51, respectively. After controlling for age and gender, the YFQ total score explained between 27% and 53% (R2) of the variance in disability, anxiety, depression and self-efficacy. Conclusions The YFQ, a multidimensional screening scale, is recommended for assessing psychosocial beliefs of workers with chronic MSK pain. Further evaluation of measurement properties such as test-retest reliability, responsiveness and prognostic validity is warranted.
Kim, Hee-Ju
2017-03-01
This study aimed to evaluate the reliability and validity of the Korean version of the Mini-Sleep Questionnaire-Insomnia in Korean college students. A total of 470 students from six nursing colleges in South Korea participated in the study. The translation and linguistic validation of the Mini-Sleep Questionnaire-Insomnia was performed based on guidelines. The Pittsburgh Sleep Quality Index and the Perceived Stress Scale were used to validate the measure. Cronbach α, item-total correlation for internal consistency reliability and intraclass correlation coefficient for test-retest reliability were evaluated. Exploratory factor analysis for construct validity, Pearson's correlation with the Pittsburgh Sleep Quality Index and the Perceived Stress Scale for concurrent validity, and the receiver operating characteristic curve for predictive validity were assessed. The 4-item Mini-Sleep Questionnaire-Insomnia had a Cronbach α of .69 and the item-total correlations were higher than .30. Cronbach α increased to .73 if the item assessing the use of sleeping pills and tranquilizers was deleted. This item had marked skewness and kurtosis issues. Factor analysis indicated unidimensionality, explaining 53.0% of the total variance. The measure showed high test-retest reliability (i.e., intraclass correlation coefficient = .84), acceptable concurrent validity (r with the Pittsburgh Sleep Quality Index = .69; r with the Perceived Stress Scale = .31) and predictive validity [area under curve = .85; 95% confidence interval (0.81, 0.90)]. The Mini-Sleep Questionnaire-Insomnia showed acceptable reliability and validity. Yet, the limited distribution in sleep medications warrants further evaluations in the clinical population. Copyright © 2017. Published by Elsevier B.V.
Negahban, Hossein; Hessam, Masumeh; Tabatabaei, Saeid; Salehi, Reza; Sohani, Soheil Mansour; Mehravar, Mohammad
2014-01-01
The aim was to culturally translate and validate the Persian lower extremity functional scale (LEFS) in a heterogeneous sample of outpatients with lower extremity musculoskeletal disorders (n = 304). This is a prospective methodological study. After a standard forward-backward translation, psychometric properties were assessed in terms of test-retest reliability, internal consistency, construct validity, dimensionality, and ceiling or floor effects. The acceptable level of intraclass correlation coefficient >0.70 and Cronbach's alpha coefficient >0.70 was obtained for the Persian LEFS. Correlations between Persian LEFS and Short-Form 36 Health Survey (SF-36) subscales of Physical Health component (rs range = 0.38-0.78) were higher than correlations between Persian LEFS and SF-36 subscales of Mental Health component (rs range = 0.15-0.39). A corrected item-total correlation of >0.40 (Spearman's rho) was obtained for all items of the Persian LEFS. Horn's parallel analysis detected a total of two factors. No ceiling or floor effects were detected for the Persian LEFS. The Persian version of the LEFS is a reliable and valid instrument that can be used to measure functional status in Persian-speaking patients with different musculoskeletal disorders of the lower extremity. Implications for Rehabilitation The Persian lower extremity functional scale (LEFS) is a reliable, internally consistent and valid instrument, with no ceiling or floor effects, to determine functional status of heterogeneous patients with musculoskeletal disorders of the lower extremity. The Persian version of the LEFS can be used in clinical and research settings to measure function in Iranian patients with different musculoskeletal disorders of the lower extremity.
Bervoets, Liene; Van Noten, Caroline; Van Roosbroeck, Sofie; Hansen, Dominique; Van Hoorenbeeck, Kim; Verheyen, Els; Van Hal, Guido; Vankerckhoven, Vanessa
2014-01-01
This study was designed to validate the Dutch Physical Activity Questionnaires for Children (PAQ-C) and Adolescents (PAQ-A). After adjustment of the original Canadian PAQ-C and PAQ-A (i.e. translation/back-translation and evaluation by expert committee), content validity of both PAQs was assessed and calculated using item-level (I-CVI) and scale-level (S-CVI) content validity indexes. Inter-item and inter-rater reliability of 196 PAQ-C and 95 PAQ-A filled in by both children or adolescents and their parent, were evaluated. Inter-item reliability was calculated by Cronbach's alpha (α) and inter-rater reliability was examined by percent observed agreement and weighted kappa (κ). Concurrent validity of PAQ-A was examined in a subsample of 28 obese and 16 normal-weight children by comparing it with concurrently measured physical activity using a maximal cardiopulmonary exercise test for the assessment of peak oxygen uptake (VO2 peak). For both PAQs, I-CVI ranged 0.67-1.00. S-CVI was 0.89 for PAQ-C and 0.90 for PAQ-A. A total of 192 PAQ-C and 94 PAQ-A were fully completed by both child and parent. Cronbach's α was 0.777 for PAQ-C and 0.758 for PAQ-A. Percent agreement ranged 59.9-74.0% for PAQ-C and 51.1-77.7% for PAQ-A, and weighted κ ranged 0.48-0.69 for PAQ-C and 0.51-0.68 for PAQ-A. The correlation between total PAQ-A score and VO2 peak - corrected for age, gender, height and weight - was 0.516 (p = 0.001). Both PAQs have an excellent content validity, an acceptable inter-item reliability and a moderate to good strength of inter-rater agreement. In addition, total PAQ-A score showed a moderate positive correlation with VO2 peak. Both PAQs have an acceptable to good reliability and validity, however, further validity testing is recommended to provide a more complete assessment of both PAQs.
Mental health in primary care: an evaluation using the Item Response Theory
da Rocha, Hugo André; dos Santos, Alaneir de Fátima; Reis, Ilka Afonso; Santos, Marcos Antônio da Cunha; Cherchiglia, Mariângela Leal
2018-01-01
ABSTRACT OBJECTIVE To determine the items of the Brazilian National Program for Improving Access and Quality of Primary Care that better evaluate the capacity to provide mental health care. METHODS This is a cross-sectional study carried out using the Graded Response Model of the Item Response Theory using secondary data from the second cycle of the National Program for Improving Access and Quality of Primary Care, which evaluates 30,523 primary care teams in the period from 2013 to 2014 in Brazil. The internal consistency, correlation between items, and correlation between items and the total score were tested using the Cronbach’s alpha, Spearman’s correlation, and point biserial coefficients, respectively. The assumptions of unidimensionality and local independence of the items were tested. Word clouds were used as one way to present the results. RESULTS The items with the greatest ability to discriminate were scheduling of the agenda according to risk stratification, keeping of records of the most serious cases of users in psychological distress, and provision of group care. The items that required a higher level of mental health care in the parameter of location were the provision of any type of group care and the provision of educational and mental health promotion activities. Total Cronbach’s alpha coefficient was 0.87. The items that obtained the highest correlation with total score were the recording of the most serious cases of users in psychological distress and scheduling of the agenda according to risk stratification. The final scores obtained oscillated between -2.07 (minimum) and 1.95 (maximum). CONCLUSIONS There are important aspects in the discrimination of the capacity to provide mental health care by primary health care teams: risk stratification for care management, follow-up of the most serious cases, group care, and preventive and health promotion actions. PMID:29489992
A Study of Developing an Environmental Attitude Scale for Primary School Students
ERIC Educational Resources Information Center
Artvinli, Eyup; Demir, Zulfiye Melis
2018-01-01
The aim of this research is to develop an instrument that measures environmental attitudes of third grade students. The study was completed in six stages: creating scale items, content validity study, item total and remaining item correlation study, determining item discrimination, determining construct validity study and examining the internal…
Measuring genetic knowledge: a brief survey instrument for adolescents and adults.
Fitzgerald-Butt, S M; Bodine, A; Fry, K M; Ash, J; Zaidi, A N; Garg, V; Gerhardt, C A; McBride, K L
2016-02-01
Basic knowledge of genetics is essential for understanding genetic testing and counseling. The lack of a written, English language, validated, published measure has limited our ability to evaluate genetic knowledge of patients and families. Here, we begin the psychometric analysis of a true/false genetic knowledge measure. The 18-item measure was completed by parents of children with congenital heart defects (CHD) (n = 465) and adolescents and young adults with CHD (age: 15-25, n = 196) with a mean total correct score of 12.6 [standard deviation (SD) = 3.5, range: 0-18]. Utilizing exploratory factor analysis, we determined that one to three correlated factors, or abilities, were captured by our measure. Through confirmatory factor analysis, we determined that the two factor model was the best fit. Although it was necessary to remove two items, the remaining items exhibited adequate psychometric properties in a multidimensional item response theory analysis. Scores for each factor were computed, and a sum-score conversion table was derived. We conclude that this genetic knowledge measure discriminates best at low knowledge levels and is therefore well suited to determine a minimum adequate amount of genetic knowledge. However, further reliability testing and validation in diverse research and clinical settings is needed. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Faculty Performance on the Genomic Nursing Concept Inventory.
Read, Catherine Y; Ward, Linda D
2016-01-01
To use the newly developed Genomic Nursing Concept Inventory (GNCI) to evaluate faculty understanding of foundational genomic concepts, explore relative areas of strength and weakness, and compare the results with those of a student sample. An anonymous online survey instrument consisting of demographic or background items and the 31 multiple-choice questions that make up the GNCI was completed by 495 nursing faculty from across the United States in the fall of 2014. Total GNCI score and scores on four subcategories (genome basics, mutations, inheritance, genomic health) were calculated. Relationships between demographic or background variables and total GNCI score were explored. The mean score on the GNCI was 14.93 (SD = 5.31), or 48% correct; topical category scores were highest on the inheritance and genomic health items (59% and 58% correct, respectively), moderate on the mutations items (54% correct), and lowest on the genome basics items (33% correct). These results are strikingly similar to those of a recent study of nursing students. Factors associated with a higher total score on the GNCI included higher self-rated proficiency with genetic/genomic content, having a doctoral degree, having taken a genetics course for academic credit or continuing education, and having taught either a stand-alone genetic/genomic course or lecture content as part of nursing or related course. Self-rated proficiency with genetic/genomic content was fair or poor (70%), with only 7% rating their proficiency as very good or excellent. Faculty knowledge of foundational genomic concepts is similar to that of the students they teach and weakest in the areas related to basic science information. Genomics is increasingly relevant in all areas of clinical nursing practice, and the faculty charged with educating the next generation of nurses must understand foundational concepts. 
Faculty need to be proactive in seeking out relevant educational programs that include basic genetic/genomic concepts. © 2015 Sigma Theta Tau International.
A validation study of public health knowledge, skills, social responsibility and applied learning.
Vackova, Dana; Chen, Coco K; Lui, Juliana N M; Johnston, Janice M
2018-06-22
To design and validate a questionnaire to measure medical students' Public Health (PH) knowledge, skills, social responsibility and applied learning as indicated in the four domains recommended by the Association of Schools & Programmes of Public Health (ASPPH). A cross-sectional study was conducted to develop an evaluation tool for PH undergraduate education through item generation, reduction, refinement and validation. The 74 preliminary items derived from the existing literature were reduced to 55 items based on expert panel review which included those with expertise in PH, psychometrics and medical education, as well as medical students. Psychometric properties of the preliminary questionnaire were assessed as follows: frequency of endorsement for item variance; principal component analysis (PCA) with varimax rotation for item reduction and factor estimation; Cronbach's Alpha, item-total correlation and test-retest validity for internal consistency and reliability. PCA yielded five factors: PH Learning Experience (6 items); PH Risk Assessment and Communication (5 items); Future Use of Evidence in Practice (6 items); Recognition of PH as a Scientific Discipline (4 items); and PH Skills Development (3 items), explaining 72.05% variance. Internal consistency and reliability tests were satisfactory (Cronbach's Alpha ranged from 0.87 to 0.90; item-total correlation > 0.59). Lower paired test-retest correlations reflected instability in a social science environment. An evaluation tool for community-centred PH education has been developed and validated. The tool measures PH knowledge, skills, social responsibilities and applied learning as recommended by the internationally recognised Association of Schools & Programmes of Public Health (ASPPH).
Meert, Kathleen L; Templin, Thomas N; Michelson, Kelly N; Morrison, Wynne E; Hackbarth, Richard; Custer, Joseph R; Schim, Stephanie M; Briller, Sherylyn H; Thurston, Celia S
2012-11-01
To evaluate the reliability and validity of the Bereaved Parent Needs Assessment, a new instrument to measure parents' needs and need fulfillment around the time of their child's death in the pediatric intensive care unit. We hypothesized that need fulfillment would be negatively related to complicated grief and positively related to quality of life during bereavement. Cross-sectional survey. Five U.S. children's hospital pediatric intensive care units. Parents (n = 121) bereaved in a pediatric intensive care unit 6 months earlier. Surveys included the 68-item Bereaved Parent Needs Assessment, the Inventory of Complicated Grief, and the abbreviated version of the World Health Organization Quality of Life questionnaire. Each Bereaved Parent Needs Assessment item described a potential need and was rated on two scales: 1) a 5-point rating of importance (1 = not at all important, 5 = very important) and 2) a 5-point rating of fulfillment (1 = not at all met, 5 = completely met). Three composite scales were computed: 1) total importance (percentage of all needs rated ≥4 for importance), 2) total fulfillment (percentage of all needs rated ≥4 for fulfillment), and 3) percent fulfillment (percentage of important needs that were fulfilled). Internal consistency reliability was assessed by Cronbach's α and Spearman-Brown-corrected split-half reliability. Generalized estimating equations were used to test predictions between composite scales and the Inventory of Complicated Grief and World Health Organization Quality of Life questionnaire. Two items had mean importance ratings <3, and 55 had mean ratings >4. Reliability of composite scores ranged from 0.92 to 0.94. Total fulfillment was negatively correlated with Inventory of Complicated Grief (r = -.29; p < .01) and positively correlated with World Health Organization Quality of Life questionnaire (r = .21; p < .05). Percent fulfillment was also significantly correlated with both outcomes. 
Adjusting for parent's age, education, and loss of an only child, percent fulfillment remained significantly correlated with Inventory of Complicated Grief but not with World Health Organization Quality of Life questionnaire. The Bereaved Parent Needs Assessment demonstrated reliability and validity to assess the needs of parents bereaved in the pediatric intensive care unit. Meeting parents' needs around the time of their child's death may promote adjustment to loss.
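The three composite scales defined in the abstract above, together with the Spearman-Brown correction for split-half reliability, can be written out as a short sketch. The ratings are hypothetical and the function names are illustrative. The cutoff of 4 follows the definitions given in the abstract.

```python
def spearman_brown(r_half: float) -> float:
    """Step up a split-half correlation to full-length reliability."""
    return 2 * r_half / (1 + r_half)


def composite_scores(importance, fulfillment, cutoff=4):
    """Return (total importance, total fulfillment, percent fulfillment).

    importance and fulfillment are paired 1-5 ratings, one pair per need.
    Percent fulfillment is the share of important needs that were fulfilled.
    """
    n = len(importance)
    important = [i >= cutoff for i in importance]
    fulfilled = [f >= cutoff for f in fulfillment]
    total_importance = 100.0 * sum(important) / n
    total_fulfillment = 100.0 * sum(fulfilled) / n
    n_important = sum(important)
    if n_important == 0:
        percent_fulfillment = float("nan")  # no important needs to fulfill
    else:
        both = sum(1 for i, f in zip(important, fulfilled) if i and f)
        percent_fulfillment = 100.0 * both / n_important
    return total_importance, total_fulfillment, percent_fulfillment


# Hypothetical ratings for four needs from one parent.
print(composite_scores([5, 4, 3, 5], [4, 2, 5, 5]))
print(spearman_brown(0.5))
```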
Ernstmann, Nicole; Halbach, Sarah; Kowalski, Christoph; Pfaff, Holger; Ansmann, Lena
2017-04-01
Studies addressing the organizational contexts of care that may help increase patients' ability to cope with a disease and to navigate the health care system are still rare. In particular, instruments that assess such organizational efforts from the patients' perspective are lacking. The aim of our study was to develop a survey instrument assessing organizational health literacy (HL) from the patients' perspective, i.e., health care organizations' responsiveness to patients' individual needs. A pool of 30 items was developed by a group of experts based on a literature review. The items were developed, tested and prioritized according to their importance in 11 semi-structured interviews and cognitive think-aloud interviews with cancer patients. The resulting 16 items were rated in a standardized postal survey involving a total of N=453 colon and breast cancer patients treated in cancer centers in Germany. An exploratory factor analysis, a confirmatory factor analysis and structural equation modelling were conducted. Item properties were analyzed. Of the patients, 83.2 % were diagnosed with breast cancer and 16.8 % with colon cancer. The patients' mean age was 61 (26-88), and 89.4 % were female. The most common comorbidities were hypertension (34.0 %) and cardiovascular disease (11.0 %). The final prediction model included nine items measuring the degree of health literacy-sensitivity of communication. The model showed an acceptable model fit. The nine items showed corrected item-total correlations between .622 and .762 and item difficulties between 0.77 and 0.87. Cronbach's α was .912. In a comprehensive development process, the original item pool comprising several aspects of organizational HL was reduced to a one-dimensional scale. The instrument measures an important aspect of organizational HL, i.e., the degree of health literacy-sensitivity of communication (HL-COM). 
HL-COM was found to impact patient enablement, mediated through the support by physicians. Future research will have to test these associations in the context of other diseases or institutions. Copyright © 2017. Published by Elsevier GmbH.
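The item properties reported above (corrected item-total correlations and item difficulties) can be sketched as follows. This is an illustration only: the helper names and the response matrix are hypothetical, and the difficulty formula for rating-scale items, (mean - minimum) / (maximum - minimum), is an assumption, since the abstract does not say how difficulty was computed.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length lists (both must vary)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def corrected_item_total(responses):
    """Correlate each item with the sum of the remaining items."""
    k = len(responses[0])
    result = []
    for j in range(k):
        item = [row[j] for row in responses]
        rest = [sum(row) - row[j] for row in responses]  # total without item j
        result.append(pearson(item, rest))
    return result


def item_difficulty(responses, low, high):
    """Assumed rating-scale difficulty: item mean rescaled to the 0-1 range."""
    n = len(responses)
    k = len(responses[0])
    return [(sum(row[j] for row in responses) / n - low) / (high - low)
            for j in range(k)]


# Hypothetical data: rows are respondents, columns are items on a 1-5 scale.
responses = [[1, 2, 1], [2, 3, 2], [3, 4, 3], [4, 5, 4]]
r_it = corrected_item_total(responses)
difficulty = item_difficulty(responses, low=1, high=5)
```

Removing the item from the total before correlating avoids inflating the coefficient through the item's overlap with itself, which is why the corrected value is usually the one reported.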
Pan, Jia-Yan; Ye, Shengquan; Ng, Petrus
2016-01-01
The present study validated the combined version of the 8-item Automatic Thought Questionnaire (ATQ) and 10 positive items from the ATQ-revised among Chinese university students. A total of 412 Mainland Chinese university students were recruited in Hong Kong by an online survey. A 14-item Chinese ATQ was derived via item analysis. Satisfactory internal consistency reliability and good split-half reliability were obtained. Exploratory and confirmatory factor analysis revealed a 3-correlated-factor solution for the Chinese ATQ: negative thought, positive thought (emotional), and positive thought (cognitive). The negative ATQ subscale score was positively correlated with negative affect, and negatively correlated with positive affect and life satisfaction. The two positive ATQ subscale scores were negatively correlated with negative affect, and positively correlated with positive affect and life satisfaction. The 14-item ATQ is a valid and reliable instrument for measuring automatic thoughts in the Chinese context of Hong Kong. © 2015 Wiley Periodicals, Inc.
Weech-Maldonado, Robert; Carle, Adam; Weidmer, Beverly; Hurtado, Margarita; Ngo-Metzger, Quyen; Hays, Ron D
2012-09-01
There is a need for reliable and valid measures of cultural competence (CC) from the patient's perspective. This paper evaluates the reliability and validity of the Consumer Assessment of Healthcare Providers and Systems (CAHPS) CC item set. Using 2008 survey data, we assessed the internal consistency of the CAHPS CC scales using Cronbach's α and examined the validity of the measures using exploratory and confirmatory factor analysis, multitrait scaling analysis, and regression analysis. The sample was a random stratified sample (based on race/ethnicity and language) of 991 enrollees, younger than 65 years, from 2 Medicaid managed care plans in California and New York. The measures were the CAHPS CC item set after excluding screener items and ratings. Confirmatory factor analysis (Comparative Fit Index=0.98, Tucker Lewis Index=0.98, and Root Mean Square Error of Approximation=0.06) provided support for a 7-factor structure: Doctor Communication--Positive Behaviors, Doctor Communication--Negative Behaviors, Doctor Communication--Health Promotion, Doctor Communication--Alternative Medicine, Shared Decision-Making, Equitable Treatment, and Trust. Item-total correlations (corrected for item overlap) for the 7 scales exceeded 0.40. Exploratory factor analysis showed support for 1 additional factor: Access to Interpreter Services. Internal consistency reliability estimates ranged from 0.58 (Alternative Medicine) to 0.92 (Positive Behaviors) and were 0.70 or higher for 4 of the 8 composites. All composites were positively and significantly associated with the overall doctor rating. The CAHPS CC 26-item set demonstrates adequate measurement properties and can be used as a supplemental item set to the CAHPS Clinician and Group Surveys in assessing culturally competent care from the patient's perspective.
Psychometrics of the Zarit Burden Interview in Caregivers of Patients With Heart Failure.
Al-Rawashdeh, Sami Y; Lennie, Terry A; Chung, Misook L
Identification of family caregivers who are burdened by the caregiving experience is vital to prevention of poor outcomes associated with caregiving. The Zarit Burden Interview (ZBI), a well-known measure of caregiving burden in caregivers of patients with dementia, has been used without being validated in caregivers of patients with heart failure (HF). The purpose of this study is to examine the reliability and validity of the ZBI in caregivers of patients with HF. A total of 124 primary caregivers of patients with HF completed survey questionnaires. Caregiving burden was measured by the ZBI. Reliability was examined using Cronbach's α and item-total/item-item correlations. Convergent validity was examined using correlations with the Oberst Caregiving Burden Scale. Construct validity was demonstrated by exploratory factor analysis and known hypothesis testing (ie, the hypothesis of the association between caregiving burden and depressive symptoms). Cronbach's α for the ZBI was .921. The ZBI had good item-total (r = 0.395-0.764) and item-item (mean r = 0.365) correlations. Significant correlations between the ZBI and the Oberst Caregiving Burden Scale (r = 0.466 for the caregiving time subscale and 0.583 for the caregiving task difficulty subscale; P < .001 for both) supported convergent validity. Four factors were identified (ie, consequences of caregiving, patient's dependence, exhaustion with caregiving and uncertainty, and guilt and fear for the patient's future) using factor analysis, which are consistent with previous studies. Caregivers with high burden scores had significantly higher depressive symptoms than did caregivers with lower burden scores (7.0 ± 6.8 vs 3.1 ± 4.3; P < .01). The findings provide evidence that the ZBI is a reliable and valid measure for assessing burden in caregivers of patients with HF.
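The reliability statistics named in this abstract, Cronbach's α and the mean item-item correlation, can be sketched as below. The function names and the two-item response matrix are hypothetical and do not come from the ZBI data; sample variances (n - 1 denominator) are used.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, variance


def cronbach_alpha(responses):
    """Cronbach's alpha; rows are respondents, columns are items."""
    k = len(responses[0])
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def pearson(x, y):
    """Pearson correlation of two equal-length lists (both must vary)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def mean_inter_item_correlation(responses):
    """Average Pearson correlation over all distinct item pairs."""
    k = len(responses[0])
    columns = [[row[j] for row in responses] for j in range(k)]
    return mean(pearson(columns[a], columns[b])
                for a, b in combinations(range(k), 2))


# Hypothetical data: four respondents, two items.
responses = [[1, 2], [2, 1], [3, 3], [4, 4]]
alpha = cronbach_alpha(responses)
mean_r = mean_inter_item_correlation(responses)
```

For this toy matrix the two items have equal variance, so α equals 2r / (1 + r) with r = 0.8, i.e. 8/9.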
[Validation of the Copenhagen Burnout Inventory to assess professional burnout in Spain].
Molinero Ruiz, Emilia; Basart Gómez-Quintero, Helena; Moncada Lluis, Salvador
2013-01-01
The Copenhagen Burnout Inventory (CBI) is a public domain questionnaire measuring the degree of psychological fatigue experienced in three subdimensions of Burnout: personal (PB), work-related (WB), and client-related Burnout (CB). The study aimed to examine the acceptability, reliability and construct validity of the Spanish version of CBI. The study population consisted of 479 workers of educational centers, social work centres, healthcare centres and workers within the industry sector. Data were collected in 2009 through a self-administered questionnaire including the three CBI scales, sixteen scales of psychosocial work environment (COPSOQ ISTAS21) and perceived general and mental health and vitality (SF-36). Response rate was 78.7%. The three scales have an inter-item correlation average between 0.42 and 0.60 and a corrected item-total correlation between 0.49 and 0.83. The internal consistency of the three scales had Cronbach's α values of 0.90 for PB, 0.83 for WB and 0.82 for CB. Burnout was related to both psychosocial work environment and wellbeing measures in the expected direction and intensity. The items of the three scales show good discrimination capacity, good consistency and homogeneity. The three CBI scales have an acceptable internal consistency reliability index, slightly higher in PB. The discrimination capacity of the scales is verified through the discrimination index and the different levels between occupations and activities. These results demonstrate that the Spanish version of the CBI is a reliable and valid instrument for measuring Burnout.
Prenatal knowledge and informational priorities of pregnant adolescents.
Smith, P B; Levenson, P M; Morrow, J R
1985-01-01
One hundred and forty-six indigent pregnant adolescents (12 to 18 years of age) were asked to complete a questionnaire concerning their prenatal care priorities (Scale I) and their knowledge of correct perinatal behaviors (Scale II). On Scale I, over 75% of teens considered parenting skills, infant care, and diet extremely important. On Scale II correctly answered items focused on the need to avoid substance abuse and smoking during pregnancy, visit the doctor, and eat balanced meals. The mean number of correct answers, however, was only 11.8 out of a total possible scale of 18 items. Less than 50% correctly answered statements about the effects of weight gain and other health behaviors on risk for high blood pressure and toxemia, safety of laxatives during pregnancy, possibility of becoming pregnant again before resuming menstruation, and the safety of various physical activities. Performance on both knowledge and health priority scales showed correct health information was limited to basic concrete facts. Abstract and technical aspects of health care did not appear to be easily assimilated.
Assessment of the psychometrics of a PROMIS item bank: self-efficacy for managing daily activities
Hong, Ickpyo; Li, Chih-Ying; Romero, Sergio; Gruber-Baldini, Ann L.; Shulman, Lisa M.
2017-01-01
Purpose: The aim of this study is to investigate the psychometrics of the Patient-Reported Outcomes Measurement Information System self-efficacy for managing daily activities item bank. Methods: The item pool was field tested on a sample of 1087 participants via internet (n = 250) and in-clinic (n = 837) surveys. All participants reported having at least one chronic health condition. The 35 item pool was investigated for dimensionality (confirmatory factor analyses, CFA and exploratory factor analysis, EFA), item-total correlations, local independence, precision, and differential item functioning (DIF) across gender, race, ethnicity, age groups, data collection modes, and neurological chronic conditions (McFadden pseudo-R² less than 10 %). Results: The item pool met two of the four CFA fit criteria (CFI = 0.952 and SRMR = 0.07). EFA analysis found a dominant first factor (eigenvalue = 24.34) and the ratio of first to second eigenvalue was 12.4. The item pool demonstrated good item-total correlations (0.59–0.85) and acceptable internal consistency (Cronbach’s alpha = 0.97). The item pool maintained its precision (reliability over 0.90) across a wide range of theta (3.70), and there was no significant DIF. Conclusion: The findings indicated the item pool has sound psychometric properties and the test items are eligible for development of computerized adaptive testing and short forms. PMID:27048495
Assessment of the psychometrics of a PROMIS item bank: self-efficacy for managing daily activities.
Hong, Ickpyo; Velozo, Craig A; Li, Chih-Ying; Romero, Sergio; Gruber-Baldini, Ann L; Shulman, Lisa M
2016-09-01
The aim of this study is to investigate the psychometrics of the Patient-Reported Outcomes Measurement Information System self-efficacy for managing daily activities item bank. The item pool was field tested on a sample of 1087 participants via internet (n = 250) and in-clinic (n = 837) surveys. All participants reported having at least one chronic health condition. The 35 item pool was investigated for dimensionality (confirmatory factor analyses, CFA and exploratory factor analysis, EFA), item-total correlations, local independence, precision, and differential item functioning (DIF) across gender, race, ethnicity, age groups, data collection modes, and neurological chronic conditions (McFadden pseudo-R² less than 10 %). The item pool met two of the four CFA fit criteria (CFI = 0.952 and SRMR = 0.07). EFA analysis found a dominant first factor (eigenvalue = 24.34) and the ratio of first to second eigenvalue was 12.4. The item pool demonstrated good item-total correlations (0.59-0.85) and acceptable internal consistency (Cronbach's alpha = 0.97). The item pool maintained its precision (reliability over 0.90) across a wide range of theta (3.70), and there was no significant DIF. The findings indicated the item pool has sound psychometric properties and the test items are eligible for development of computerized adaptive testing and short forms.
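The eigenvalue figures quoted in this abstract can be unpacked with simple arithmetic. The sketch below is illustrative: the function names are hypothetical, and the variance share assumes the EFA was run on a correlation matrix, so that the eigenvalues sum to the number of items (35). The abstract does not state this, so the derived numbers are indicative only.

```python
def second_eigenvalue(first_eigenvalue, first_to_second_ratio):
    """Back out the second eigenvalue from the first and the reported ratio."""
    return first_eigenvalue / first_to_second_ratio


def first_factor_share(first_eigenvalue, n_items):
    """Share of total variance on the first factor.

    Assumes a correlation-matrix analysis, where eigenvalues sum to n_items.
    """
    return first_eigenvalue / n_items


second = second_eigenvalue(24.34, 12.4)   # roughly 1.96
share = first_factor_share(24.34, 35)     # roughly 0.70 under the assumption above
```

A first-to-second eigenvalue ratio this large is the kind of evidence commonly read as support for treating an item pool as essentially unidimensional.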
Osterode, Wolf; Schranz, Sandra; Jordakieva, Galateja
2018-03-21
Mental and physical stress is common in physicians during night shifts. Neurocognitive effects of sleep deprivation as well as alterations in hormonal and metabolic parameters have previously been described. The aim of this crossover study was to evaluate the effects of night-shift work with partial sleep deprivation on steroid hormone excretion and possible associations with mood, sleep characteristics and cognitive functions in physicians. In total, 34 physicians (mean age 42 ± 8.5 years, 76.5% male) from different departments of the General Hospital of Vienna, Austria, were randomly assigned to two conditions: a regular day shift (8 h on duty, condition 1) and a continuous day-night shift (24 h on duty, condition 2). In both conditions, physicians collected a 24 h urine sample for steroid hormone concentration analysis and further completed psychological tests, including the sleep questionnaire (SF-A), the questionnaire for mental state (MDBF) and the computer-assisted visual memory test (FVW) before and at the end of their shifts, respectively. Although mean sleep deprivation during night shift was relatively small (~1.5 h), the impairment in participants' mental state was high in all three dimensions (mood, vigilance and agitation, p ≤ 0.001). Sleep quality (SQ), feeling of being recovered after sleep and mental balance decreased (p ≤ 0.001), whereas mental exhaustion increased (p < 0.05). Moreover, we could show a nearly linear relationship between most of these self-rating items. In visual memory testing (FVW), participants made significantly more mistakes after night shift (p = 0.011); the change was mostly in incorrectly identified items and not in correctly identified ones. SQ and falsely identified items were negatively correlated, whereas SQ and time of reaction were positively associated. It is assumed that after night shift, a tendency exists to make faster wrong decisions. SQ did not influence correctly identified items in FVW. 
In contrast to previous investigations, we found that only excretion rates for pregnanetriol and androsterone/etiocholanolone ratios (p < 0.05, respectively) were slightly reduced in 24-h urine samples after night shift. A considerable stimulation of the adrenocortical axis could not be affirmed. In general, dehydroepiandrosterone (DHEA) was negatively associated with the sense of recreation after sleep and with the time of reaction and positively correlated with correctly identified items in the FVW test. These results, on the one hand, are in line with previous findings indicating that stress and sleep deprivation suppress gonadal steroids, but, on the other hand, do not imply significant adrenocortical-axis stimulation (e.g. an increase of cortisol) during the day-night shift.
Koydemir, Selda; Demir, Ayhan
2007-06-01
The purpose of the study was to report initial data on the psychometric properties of the Brief Fear of Negative Evaluation Scale. The scale was applied to a nonclinical sample of 250 (137 women, 113 men) Turkish undergraduate students selected randomly from Middle East Technical University. Their mean age was 20.4 yr. (SD= 1.9). The factor structure of the Turkish version, its criterion validity, and internal reliability coefficients were assessed. Although maximum likelihood factor analysis initially indicated that the scale had only one factor, a forced two-factor solution accounted for more variance (61%) in scale scores than a single factor. The straightforward items loaded on the first factor, and the reverse-coded items loaded on the second factor. The total score was significantly positively correlated with scores on the Revised Cheek and Buss Shyness Scale and significantly negatively correlated with scores on the Rosenberg Self-Esteem Scale. Factor 1 (straightforward items) correlated more highly with both Shyness and Self-esteem than Factor 2 (reverse-coded items). Internal consistency estimate was .94 for the Total scores, .91 for the Factor 1 (straightforward items), and .87 for the Factor 2 (reverse-coded items). No sex differences were evident for Fear of Negative Evaluation.
2013-01-01
Background: Quality of life (QOL) is an important outcome measure in the treatment of heroin addiction. The Taiwan version of the World Health Organization Quality of Life assessment (WHOQOL-BREF [TW]) has been developed and studied in various groups, but not specifically in a population of injection drug users. The aim of this study was to analyze the psychometric properties of the WHOQOL-BREF (TW) in a sample of injection drug users undergoing methadone maintenance treatment. Methods: A total of 553 participants were interviewed and completed the instrument. Item-response distributions, internal consistency, corrected item-domain correlation, criterion-related validity, and construct validity through confirmatory factor analysis were evaluated. Results: The frequency distribution of the 4 domains of the WHOQOL-BREF (TW) showed no floor or ceiling effects. The instrument demonstrated adequate internal consistency (Cronbach’s alpha coefficients were higher than 0.7 across the 4 domains) and all items had acceptable correlation with the corresponding domain scores (r = 0.32-0.73). Correlations (p < 0.01) of the 4 domains with the 2 benchmark items assessing overall QOL and general health were supportive of criterion-related validity. Confirmatory factor analysis yielded marginal goodness-of-fit between the 4-domain model and the sample data. Conclusions: The hypothesized WHOQOL-BREF measurement model was appropriate for the injection drug users after some adjustments. Despite different patterns found in the confirmatory factor analysis, the findings overall suggest that the WHOQOL-BREF (TW) is a reliable and valid measure of QOL among injection drug users and can be utilized in future treatment outcome studies. The factor structure provided by the study also helps to understand the QOL characteristics of the injection drug users in Taiwan. However, more research is needed to examine its test-retest reliability and sensitivity to changes due to treatment. PMID:24325611
Ages & Stages Questionnaire–Brazil–2011
Santana, Cristina M. T.; Filgueiras, Alberto; Landeira-Fernandez, J.
2015-01-01
Introduction. Professionals who assess early childhood development benefit greatly from reliable developmental screening measures. The Ages & Stages Questionnaire was adapted for Brazil in 2010 and named ASQ-BR. Modifications to some items were required to improve the instrument’s psychometric properties. The present study modified the ASQ-BR to verify whether those changes improved its psychometric characteristics. Method. This study assessed 67 522 children from 972 public day care centers and preschools. Changes in items were made considering Cronbach’s α and item-to-total correlations. Reliability, dimensionality, and item-to-total correlations were calculated. Results. Regarding dimensionality, 86.2% of the scales in ASQ-BR-2011 were unidimensional. Internal consistency improved from 2010 to 2011: α increased in 53.8% of the scales, decreased in 41.2%, and remained the same in 5.0%. Finally, 65.2% of the modified items showed improvement. Conclusions. Overall, the instrument’s psychometrics improved from 2010 to 2011, especially in the personal/social domain. However, it still leaves room for improvement in future studies. PMID:27335984
Portuguese Medical Students' Knowledge and Attitudes Towards Homosexuality.
Lopes, Lucas; Gato, Jorge; Esteves, Manuel
2016-11-01
Lesbian, gay, bisexual and transgender people still face discrimination in healthcare environments and physicians often report lack of knowledge on this population's specific healthcare needs. In fact, recommendations have been put forward to include lesbian, gay, bisexual and transgender health in medical curricula. This study aimed to explore factors associated with medical students' knowledge and attitudes towards homosexuality in different years of the medical course. An anonymous online-based questionnaire was sent to all medical students enrolled at the Faculty of Medicine - University of Porto, Portugal, in December 2015. The questionnaire included socio-demographic questions, the Multidimensional Scale of Attitudes Toward Lesbians and Gay Men (27 items) and a Homosexuality Knowledge Questionnaire (17 items). Descriptive statistics, ANOVAs, Chi-square tests and Pearson's correlations were used in the analysis. A total of 489 completed responses was analyzed. Male gender, religiosity and absence of lesbian, gay or bisexual friends were associated with more negative attitudes towards homosexuality. Attitudinal scores did not correlate with advanced years in medical course or contact with lesbian, gay or bisexual patients. Students aiming to pursue technique-oriented specialties presented higher scores in the 'Modern Heterosexism' subscale than students seeking patient-oriented specialties. Although advanced years in medical course correlated significantly with higher knowledge scores, items related with lesbian, gay or bisexual health showed the lowest percentage of correct answers. There seems to be a lack of exploration of medical students' personal attitudes towards lesbians and gay men, and also a lack of knowledge on lesbian, gay or bisexual specific healthcare needs. This study highlights the importance of inclusive undergraduate curriculum development in order to foster quality healthcare.
Estimating Premorbid Cognitive Abilities in Low-Educated Populations
Apolinario, Daniel; Brucki, Sonia Maria Dozzi; Ferretti, Renata Eloah de Lucena; Farfel, José Marcelo; Magaldi, Regina Miksian; Busse, Alexandre Leopold; Jacob-Filho, Wilson
2013-01-01
Objective: To develop an informant-based instrument that would provide a valid estimate of premorbid cognitive abilities in low-educated populations. Methods: A questionnaire was drafted by focusing on the premorbid period with a 10-year time frame. The initial pool of items was submitted to classical test theory and a factorial analysis. The resulting instrument, named the Premorbid Cognitive Abilities Scale (PCAS), is composed of questions addressing educational attainment, major lifetime occupation, reading abilities, reading habits, writing abilities, calculation abilities, use of widely available technology, and the ability to search for specific information. The validation sample was composed of 132 older Brazilian adults from the following three demographically matched groups: normal cognitive aging (n = 72), mild cognitive impairment (n = 33), and mild dementia (n = 27). The scores of a reading test and a neuropsychological battery were adopted as construct criteria. Post-mortem inter-informant reliability was tested in a sub-study with two relatives from each deceased individual. Results: All items presented good discriminative power, with corrected item-total correlation varying from 0.35 to 0.74. The summed score of the instrument presented high correlation coefficients with global cognitive function (r = 0.73) and reading skills (r = 0.82). Cronbach's alpha was 0.90, showing optimal internal consistency without redundancy. The scores did not decrease across the progressive levels of cognitive impairment, suggesting that the goal of evaluating the premorbid state was achieved. The intraclass correlation coefficient was 0.96, indicating excellent inter-informant reliability. Conclusion: The instrument developed in this study has shown good properties and can be used as a valid estimate of premorbid cognitive abilities in low-educated populations. The applicability of the PCAS, both as an estimate of premorbid intelligence and cognitive reserve, is discussed. 
PMID:23555894
Adhi, Mohammad Idrees; Aly, Syed Moyn
2018-04-01
To find differences between One-Correct and One-Best multiple-choice questions with relation to student scores, post-exam item analyses results and student perception. This comparative cross-sectional study was conducted at the Dow University of Health Sciences, Karachi, from November 2010 to April 2011, and comprised medical students. Data was analysed using SPSS 18. Of the 207 participants, 16(7.7%) were boys and 191(92.3%) were girls. The mean score in Paper I was 18.62±4.7, while in Paper II it was 19.58±6.1. One-Best multiple-choice questions performed better than One-Correct. There was no statistically significant difference in the mean scores of the two papers or in the difficulty indices. Difficulty and discrimination indices correlated well in both papers. Cronbach's alpha of paper I was 0.584 and that of paper II was 0.696. Point-biserial values were better for paper II than for paper I. Most students expressed dissatisfaction with paper II. One-Best multiple-choice questions showed better scores, higher reliability, better item performance and correlation values.
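Two of the post-exam item statistics mentioned in this abstract, the difficulty index and the point-biserial value, can be sketched for a single multiple-choice item as follows. The function names and the four-examinee data are hypothetical; the sketch assumes the common definitions (difficulty index as the proportion answering correctly, point-biserial as the Pearson correlation between the 0/1 item score and the total score), which the abstract does not spell out.

```python
from math import sqrt


def difficulty_index(item_scores):
    """Proportion of examinees who answered the item correctly (scores are 0/1)."""
    return sum(item_scores) / len(item_scores)


def point_biserial(item_scores, total_scores):
    """Pearson correlation between a 0/1 item score and the total test score."""
    n = len(item_scores)
    mx = sum(item_scores) / n
    my = sum(total_scores) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(item_scores, total_scores))
    sxx = sum((a - mx) ** 2 for a in item_scores)
    syy = sum((b - my) ** 2 for b in total_scores)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: four examinees, one item, and their total test scores.
item = [1, 1, 0, 0]
totals = [4, 3, 2, 1]
p = difficulty_index(item)
r_pb = point_biserial(item, totals)
```

A higher point-biserial means that examinees who answered the item correctly also tended to score higher overall, which is the sense in which such values are described as "better" for one paper than the other.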
Tang, Jennifer Yee-Man; Ho, Andy Hau-Yan; Luo, Hao; Wong, Gloria Hoi-Yan; Lau, Bobo Hi-Po; Lum, Terry Yat-Sang; Cheung, Karen Siu-Lan
2016-09-01
The present study aimed to develop and validate a Cantonese short version of the Zarit Burden Interview (CZBI-Short) for Hong Kong Chinese dementia caregivers. The 12-item Zarit Burden Interview (ZBI) was translated into spoken Cantonese and back-translated by two bilingual research assistants and face validated by a panel of experts. Five hundred Chinese dementia caregivers showing signs of stress reported their burden using the translated ZBI and rated their depressive symptoms, overall health, and care recipients' physical functioning and behavioral problems. The factor structure of the translated scale was identified using principal component analysis and confirmatory factor analysis; internal consistency and item-total correlations were assessed; and concurrent validity was tested by correlating the ZBI with depressive symptoms, self-rated health, and care recipients' physical functioning and behavioral problems. The principal component analysis resulted in 11 items loading on a three-factor model comprising role strain, self-criticism, and negative emotion, which accounted for 59% of the variance. The confirmatory factor analysis supported the three-factor model (CZBI-Short) that explained 61% of the total variance. Cronbach's alpha (0.84) and item-total correlations (rho = 0.39-0.71) indicated that the CZBI-Short had good reliability. The CZBI-Short showed correlations with depressive symptoms (r = 0.50), self-rated health (r = -0.26) and care recipients' physical functioning (r = 0.18-0.26) and disruptive behaviors (r = 0.36). The 12-item CZBI-Short is a concise, reliable, and valid instrument to assess burden in Chinese dementia caregivers in clinical and social care settings.
Translation and validation of the Rhinosinusitis Disability Index for use in Nigeria.
Asoegwu, C N; Nwawolo, C C; Okubadejo, N U
2017-07-01
The Rhinosinusitis Disability Index (RSDI) is a validated and reliable measure of severity of chronic rhinosinusitis. The objective of this study was to translate and validate the instrument for use in Nigeria. This is a methodological study. The participants were 71 patients with chronic rhinosinusitis attending two Otolaryngology clinics in Lagos, Nigeria. Using standardized methods and trained translators, the RSDI was translated to vernacular (Yoruba language) and back-translated to culturally appropriate English. Data analysis comprised assessment of the item quality, content validity and internal consistency of the back-translated Rhinosinusitis Disability Index (bRSDI), and correlation to the original RSDI. Content validity (floor and ceiling effects) showed 0% floor and ceiling effects for the total scores, 0% ceiling effects for all domains and floor effect for physical domain, and 9.9 and 8.5% floor effects for functional and emotional domains, respectively. The mean item-own correlation for physical domain was 0.54 ± 0.08, 0.72 ± 0.08 for functional domain and 0.74 ± 0.07 for emotional domain. All domain item-own correlations were higher than item-other domain correlations. The total Cronbach's alpha was 0.936 and was higher than 0.70 for all the domains, representing good internal consistency. Pearson correlation analysis showed strong correlation of RSDI to bRSDI (total score 0.881; p = 0.000, and domain subscores-physical: 0.788; p = 0.000, functional: 0.830; p = 0.000, and emotional: 0.888; p = 0.000). The back-translated Rhinosinusitis Disability Index shows good face and content validity with good internal consistency while correlating linearly and significantly with the original Rhinosinusitis Disability Index and is recommended for use in Nigeria.
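The floor and ceiling effects reported in this abstract can be illustrated with a short sketch. It assumes the usual definition, the percentage of respondents scoring at the lowest or highest possible score; the function name, score range, and data are hypothetical and are not taken from the RSDI study.

```python
def floor_ceiling_effects(scores, min_score, max_score):
    """Return (floor %, ceiling %): share of respondents at the scale extremes."""
    n = len(scores)
    floor_pct = 100.0 * sum(1 for s in scores if s == min_score) / n
    ceiling_pct = 100.0 * sum(1 for s in scores if s == max_score) / n
    return floor_pct, ceiling_pct


# Hypothetical domain scores on a 0-10 range.
floor_pct, ceiling_pct = floor_ceiling_effects([0, 0, 5, 10, 7], 0, 10)
```

In this made-up sample two of five respondents sit at the minimum and one at the maximum, giving a 40% floor effect and a 20% ceiling effect.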
Data Visualization of Item-Total Correlation by Median Smoothing
ERIC Educational Resources Information Center
Yu, Chong Ho; Douglas, Samantha; Lee, Anna; An, Min
2016-01-01
This paper aims to illustrate how data visualization could be utilized to identify errors prior to modeling, using an example with multi-dimensional item response theory (MIRT). MIRT combines item response theory and factor analysis to identify a psychometric model that investigates two or more latent traits. While it may seem convenient to…
Khorramdel, Lale; von Davier, Matthias
2014-01-01
This study shows how to address the problem of trait-unrelated response styles (RS) in rating scales using multidimensional item response theory. The aim is to test and correct data for RS in order to provide fair assessments of personality. Expanding on an approach presented by Böckenholt (2012), observed rating data are decomposed into multiple response processes based on a multinomial processing tree. The data come from a questionnaire consisting of 50 items of the International Personality Item Pool measuring the Big Five dimensions administered to 2,026 U.S. students with a 5-point rating scale. It is shown that this approach can be used to test if RS exist in the data and that RS can be differentiated from trait-related responses. Although the extreme RS appear to be unidimensional after exclusion of only 1 item, a unidimensional measure for the midpoint RS is obtained only after exclusion of 10 items. Both RS measurements show high cross-scale correlations and item response theory-based (marginal) reliabilities. Cultural differences could be found in giving extreme responses. Moreover, it is shown how to score rating data to correct for RS after being proved to exist in the data.
Gopichandran, Vijayaprasad; Wouters, Edwin; Chetlapalli, Satish Kumar
2015-05-03
Trust in physicians is the unwritten covenant between the patient and the physician that the physician will do what is in the best interest of the patient. This forms the undercurrent of all healthcare relationships. Several scales exist for assessment of trust in physicians in developed healthcare settings, but to our knowledge none of these have been developed in a developing country context. The objective was to develop and validate a new trust in physician scale for a developing country setting. Dimensions of trust in physicians, which were identified in a previous qualitative study in the same setting, were used to develop a scale. This scale was administered among 616 adults selected from urban and rural areas of Tamil Nadu, south India, using a multistage sampling cross-sectional survey method. The individual items were analysed using a classical test approach as well as item response theory. Cronbach's α was calculated and the item to total correlation of each item was assessed. After testing for unidimensionality and absence of local dependence, a 2-parameter logistic Samejima's graded response model was fit and item characteristics assessed. Competence, assurance of treatment, respect for the physician and loyalty to the physician were important dimensions of trust. A total of 31 items were developed using these dimensions. Of these, 22 were selected for final analysis. The Cronbach's α was 0.928. The item to total correlations were acceptable for all the 22 items. The item response analysis revealed good item characteristic curves and item information for all the items. Based on the item parameters and item information, a final 12 item scale was developed. The scale performs optimally in the low to moderate trust range. The final 12 item trust in physician scale has a good construct validity and internal consistency. Published by the BMJ Publishing Group Limited.
Gopichandran, Vijayaprasad; Wouters, Edwin; Chetlapalli, Satish Kumar
2015-01-01
Trust in physicians is the unwritten covenant between the patient and the physician that the physician will do what is in the best interest of the patient. This forms the undercurrent of all healthcare relationships. Several scales exist for assessment of trust in physicians in developed healthcare settings, but to our knowledge none of these have been developed in a developing country context. Objectives: To develop and validate a new trust in physician scale for a developing country setting. Methods: Dimensions of trust in physicians, which were identified in a previous qualitative study in the same setting, were used to develop a scale. This scale was administered among 616 adults selected from urban and rural areas of Tamil Nadu, south India, using a multistage sampling cross-sectional survey method. The individual items were analysed using a classical test approach as well as item response theory. Cronbach's α was calculated and the item to total correlation of each item was assessed. After testing for unidimensionality and absence of local dependence, a 2-parameter logistic Samejima's graded response model was fit and item characteristics assessed. Results: Competence, assurance of treatment, respect for the physician and loyalty to the physician were important dimensions of trust. A total of 31 items were developed using these dimensions. Of these, 22 were selected for final analysis. The Cronbach's α was 0.928. The item to total correlations were acceptable for all the 22 items. The item response analysis revealed good item characteristic curves and item information for all the items. Based on the item parameters and item information, a final 12 item scale was developed. The scale performs optimally in the low to moderate trust range. Conclusions: The final 12 item trust in physician scale has a good construct validity and internal consistency. PMID:25941182
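For readers unfamiliar with the graded response model named in this abstract, the sketch below shows how the model turns one discrimination parameter and a set of ordered thresholds into response-category probabilities. It illustrates the standard logistic form of the model only; the function name and the parameter values are hypothetical and are not estimates from the study.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities under a logistic graded response model.

    theta: latent trait level (for example, trust).
    a: item discrimination.
    thresholds: increasing boundary locations b_1 < ... < b_m.
    Returns m + 1 probabilities, one per ordered response category.
    """
    # Cumulative probability of responding in category k or higher.
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


# Hypothetical item with four response categories (three thresholds).
probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-1.0, 0.0, 1.2])
print([round(p, 3) for p in probs])
```

Plotting these probabilities over a range of theta values gives the item characteristic curves mentioned in the abstract.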
Procedural justice and layoff survivors' commitment: a quantitative review.
Grubb, W Lee
2006-10-01
Layoffs are common in today's organizations. Most studies that have examined the correlation between procedural justice and the organizational commitment of layoff survivors have yielded positive correlations, but the magnitude of the correlations varies widely. This study is the first to estimate the population correlation and to identify the primary sources that cause variation in the correlation across studies. The results indicated that justice and commitment correlations can always be expected to be positive. Based on a total sample size of 9080 individuals, the estimated mean population correlation was .34. Variation was primarily explained by attributes of the justice measure: multiple-item scales and scales composed of both interactional and procedural justice items yielded higher correlations than single-item measures. Therefore, it is important that employers recognize the substantial assuaging effect that procedural and interactional justice can have on survivors' organizational commitment.
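The abstract does not state how the mean population correlation was estimated. One common approach in quantitative reviews is a sample-size-weighted mean of the study correlations, sketched below as an assumption rather than as the author's method. The study values in the example are hypothetical.

```python
def weighted_mean_correlation(studies):
    """Sample-size-weighted mean of study correlations.

    studies: list of (sample_size, correlation) pairs.
    """
    total_n = sum(n for n, _ in studies)
    if total_n <= 0:
        raise ValueError("total sample size must be positive")
    return sum(n * r for n, r in studies) / total_n


# Hypothetical studies (sample size, r); not the studies in the review.
example_studies = [(100, 0.20), (300, 0.40)]
print(round(weighted_mean_correlation(example_studies), 3))
```

Weighting by sample size lets larger studies contribute more to the pooled estimate than smaller ones.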
The neurocognitive basis of borrowed context information.
O'Neill, Meagan; Diana, Rachel A
2017-06-01
Falsely remembered items can be accompanied by episodic context retrieval. This finding is difficult to explain because there is no episode that binds the remembered item to the experimenter-controlled context features. The current study examines the neural correlates of false context retrieval when the context features can be traced to encoding episodes of semantically-similar items. Our neuroimaging results support a "dissociated source" mechanism for context borrowing in false memory. We found that parahippocampal cortex (PHc) activation, thought to indicate context retrieval, was greater during trials that involved context borrowing (an incorrect, but plausible source decision) than during baseline correct context retrieval. In contrast, hippocampal activation, thought to indicate retrieval of an episodic binding, was stronger during correct source retrieval than during context borrowing. Vivid context retrieval during false recollection experiences was also indicated by increased activation in visual perceptual regions for context borrowing as compared to other incorrect source judgments. The pattern of findings suggests that context borrowing can arise when unusually strong activation of a semantically-related item's contextual features drives relatively weak retrieval of the associated episodic binding with failure to confirm the item information within that binding. This dissociated source retrieval mechanism suggests that context-driven episodic retrieval does not necessarily lead to retrieval of specific item details. That is, source information can be retrieved in the absence of item memory. Copyright © 2017 Elsevier Ltd. All rights reserved.
Skarzynski, Piotr H; Raj-Koziak, Danuta; J Rajchel, Joanna; Pilka, Adam; Wlodarczyk, Andrzej W; Skarzynski, Henryk
2017-10-01
To describe how the Tinnitus Handicap Inventory (THI) was translated into Polish (THI-POL) and to present psychometric data on how well it performed in a clinical population of tinnitus sufferers. The original version of THI was adapted into Polish. The reliability of THI-POL was investigated using test-retest, Cronbach's alpha, endorsement rate and item-total correlation. Construct validity and convergent validity were also assessed based on confirmatory factor analysis, inter-item correlation and Pearson product-moment correlations using subscale A (Tinnitus) of the Tinnitus and Hearing Survey (THS-POL); divergent validity was checked using subscale B (Hearing) of THS-POL. A group of 167 adults filled in THI-POL twice over their three-day hospitalisation period. Test-retest reliability for the total THI-POL scores was strong (r = 0.91). Cronbach's alpha coefficient for the total score was high (r = 0.95), confirming the questionnaire's stability. Confirmatory factor analysis (CFA) and inter-item correlation did not confirm the three-factor model. Convergent validity from the Tinnitus subscale of THS showed a positive strong (r = 0.75) correlation. Divergent validity showed only a moderate correlation. All analyses were statistically significant (p < 0.01). THI-POL is a valid and reliable self-administered tool, which allows the overall tinnitus handicap of Polish-speaking patients to be effectively assessed.
Sun, Ning; Li, Qiu-Jie; Lv, Dong-Mei; Lu, Gui-Zhi; Lin, Ping; An, Xue-Mei
2014-10-01
The present study was conducted to evaluate the psychometric properties of a newly adapted Chinese version of an instrument designed to measure structural empowerment among staff nurses. Structural empowerment has been shown to be important to nurses in Western cultures, but its importance in China is unknown. A convenience sample of 650 staff nurses was selected from six hospitals in Harbin, China. After linguistic adaptation using the forward-backward translation method, the 19-item Conditions of Work Effectiveness Questionnaire-II (CWEQ-II-CV) was answered by participants. Content validity, Cronbach's alpha, item-to-total correlation and exploratory factor analysis were used to assess the reliability and validity of the translated instrument. In the factor analysis, a six-factor solution was found to be reasonable with the sub-dimensions of structural empowerment that included support (three items), resources (three items), information (three items), opportunity (three items), formal power (three items) and informal power (four items). Cronbach's alpha coefficient for the total instrument was 0.92 and ranged from 0.68 to 0.86 in the six subscales. The item-to-total correlation coefficients ranged from 0.48 to 0.80. The findings also gave support for content validity. Evidence was found to support the reliability and validity of the CWEQ-II-CV scale that measures the quality of the work environment for nurses from a structural empowerment perspective. The translated version of CWEQ-II-CV can provide an effective evaluation tool for structural empowerment in the Chinese nursing workplace. © 2013 John Wiley & Sons Ltd.
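Cronbach's alpha, reported above for the total instrument and its subscales, can be computed from the item variances and the variance of the total score. The following is a minimal sketch of the usual formula; the ratings are hypothetical and the function name is illustrative.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items table of scores.

    rows: list of per-respondent lists, one score per item.
    """
    k = len(rows[0])
    if k < 2:
        raise ValueError("at least two items are required")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    # Sample variance of the summed total score.
    total_var = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Hypothetical ratings from five respondents on a three-item subscale.
ratings = [[4, 5, 4], [2, 3, 2], [5, 5, 4], [1, 2, 1], [3, 3, 3]]
print(round(cronbach_alpha(ratings), 3))
```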
Deepak, Kishore K; Al-Umran, Khalid Umran; Al-Sheikh, Mona H; Dkoli, B V; Al-Rubaish, Abdullah
2015-01-01
The functionality of distracters in a multiple choice question plays a very important role. We examined the frequency and impact of functioning and non-functioning distracters on psychometric properties of 5-option items in clinical disciplines. We analyzed item statistics of 1115 multiple choice questions from 15 summative assessments of undergraduate medical students and classified the items into five groups by their number of non-functioning distracters. We analyzed the effect of varying degrees of non-functionality, ranging from 0 to 4, on test reliability, difficulty index, discrimination index and point biserial correlation. The non-functionality of distracters inversely affected the test reliability and quality of items in a predictable manner. The non-functioning distracters made the items easier and lowered the discrimination index significantly. Three non-functional distracters in a 5-option MCQ significantly affected all psychometric properties (p < 0.5). The corrected point biserial correlation revealed that the items with 3 functional options were psychometrically as effective as 5-option items. Our study reveals that a multiple choice question with 3 functional options is the lowermost limit of item format that has adequate psychometric properties. Tests containing items with fewer functioning options have significantly lower reliability. Distracter function analysis and revision of non-functioning distracters can serve as important methods to improve the psychometrics and reliability of assessment.
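The abstract does not say how a distracter was classified as non-functioning. A frequently used convention treats a distracter as non-functioning when fewer than 5% of examinees choose it, and the sketch below applies that convention as an explicit assumption. The option labels and response counts are hypothetical.

```python
from collections import Counter


def count_nonfunctioning_distracters(responses, options, key, threshold=0.05):
    """Count distracters chosen by less than `threshold` of examinees.

    responses: chosen option label per examinee.
    options: all option labels of the item.
    key: label of the correct option.
    threshold: assumed 5% cut-off; not taken from the study.
    """
    n = len(responses)
    if n == 0:
        raise ValueError("responses must not be empty")
    counts = Counter(responses)
    return sum(
        1 for opt in options
        if opt != key and counts[opt] / n < threshold
    )


# Hypothetical 5-option item answered by 100 examinees; "A" is correct.
answers = ["A"] * 70 + ["B"] * 20 + ["C"] * 7 + ["D"] * 3
print(count_nonfunctioning_distracters(answers, "ABCDE", "A"))
```

Items can then be grouped by this count (0 to 4 for a 5-option item), which mirrors the five groups described in the abstract.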
Derakhshandeh, Zahra; Amini, Mitra; Kojuri, Javad; Dehbozorgian, Marziyeh
2018-01-01
Clinical reasoning is one of the most important skills in the process of training a medical student to become an efficient physician. Assessment of the reasoning skills in a medical school program is important to direct students' learning. One of the tests for measuring the clinical reasoning ability is Clinical Reasoning Problems (CRPs). The major aim of this study is to measure psychometric qualities of CRPs and define the correlation between this test and the routine MCQ in the cardiology department of Shiraz medical school. This was a descriptive study conducted on all cardiology residents of Shiraz Medical School. The study population consisted of 40 residents in 2014. The routine CRPs and the MCQ tests were designed based on similar objectives and were carried out simultaneously. Reliability, item difficulty, item discrimination, and correlation between each item and the total score of CRPs were all measured with Excel and SPSS software to check the psychometric properties of the CRPs test. Furthermore, we calculated the correlation between the CRPs test and the MCQ test. The mean differences of CRPs test scores between residents' academic years [second, third and fourth year] were also evaluated by one-way analysis of variance (ANOVA) using SPSS software (version 20) (α = 0.05). The mean and standard deviation of the score in CRPs was 10.19 ± 3.39 out of 20; in MCQ, it was 13.15 ± 3.81 out of 20. Item difficulty was in the range of 0.27-0.72; item discrimination was 0.30-0.75, with question No. 3 being the exception (0.24). The correlation between each item and the total score of CRPs was 0.26-0.87; the correlation between the CRPs test and the MCQ test was 0.68 (p<0.001). The reliability of the CRPs was 0.72 as calculated using Cronbach's alpha. The mean score of CRPs differed among residents by academic year, and this difference was statistically significant (p<0.001).
The results of the present investigation revealed that CRPs could be a reliable test for measuring clinical reasoning in residents. It can be included in cardiology residency assessment programs.
Argentinean adaptation of the Social Skills Inventory IHS-Del-Prette.
Olaz, Fabián Orlando; Medrano, Leonardo; Greco, María Eugenia; Del Prette, Zilda Aparecida Pereira
2009-11-01
We present the results of the adaptation of the IHS-Del-Prette (Inventario de Habilidades Sociales, in English, Social Skills Inventory) to a sample of Argentinean college students. Firstly, we addressed the backward translation and carried out an equivalence study of the Portuguese and Spanish versions of the scale. The results showed the two versions were equivalent, as we obtained correlations lower than .50 in only 5 items. Secondly, we performed item analysis by calculating discrimination indexes and item-total correlations. Results indicated that the items are sensitive to differentiate between high and low social-skill groups. Exploratory factor analysis carried out with a sample of 602 college students yielded five factors that explained 26.5% of the total variance, although our data did not completely match the original factor structure. We also obtained moderate alpha values for the subscales, but high reliability for the total scale. Lastly, group differences between males and females are presented to provide evidence of validity. We discuss the implications of the results and present future lines of inquiry.
Mowrer, Robert R; Parker, Keesha N
2004-12-01
In a 2002 publication, Mowrer and McCarver reported weak but significant correlations (r = .24) between scores on the Multicultural Perspective Index and scores on Neugarten, Havighurst, and Tobin's 1961 Life Satisfaction Index-A and the Life Satisfaction Scale developed in 1985 by Diener, Emmons, Larsen, and Griffin. Using 382 undergraduate students, the present study reduced the Index from 42 to 29 items based on each item's correlation with total items. An additional 104 undergraduate students then completed the modified 29-item version, Rosenberg's Self-esteem Scale, Cheek and Buss's Shyness Scale, the Self-rating Depression Scale by Zung, and the Neugarten, et al. Life Satisfaction Index-A. Scores on the modified Index were negatively correlated with those on the Depression and Shyness scales and positively correlated with scores on the Self-esteem and Life Satisfaction scales (p < .05).
Age-related differences in idiom production in adulthood
Conner, P. S.; Hyun, J.; O’Connor Wells, B.; Anema, I.; Goral, M.; Monéreau-Merry, M.; Rubino, D.; Kuckuk, R.; Obler, L. K.
2013-01-01
To investigate whether idiom production was vulnerable to age-related difficulties, we asked forty younger (ages 18-30) and forty older healthy adults (ages 60-85) to produce idiomatic expressions in a story-completion task. Younger adults produced significantly more correct idiom responses (73%) than older adults (60%) did. When older adults generated partially correct responses, they were less likely than younger participants to eventually produce the complete target idiom (Old: 32 % / Young: 70%); first-word cues after initial failure to retrieve an idiom resulted in more correct idioms for older (24%) than younger (15%) participants. Correlations between age and idiom correctness were positive for the Young group, and negative for the Older group, suggesting mastery of familiar idioms continues into adulthood. Within each group, scores on the Boston Naming Test correlated with performance on the idiom task. Findings for retrieving idiomatic expressions are thus similar to those for retrieving lexical items. PMID:21728830
de Pinho, Lucinéia; Moura, Paulo Henrique Tolentino; Silveira, Marise Fagundes; de Botelho, Ana Cristina Carvalho; Caldeira, Antônio Prates
2013-07-18
In light of its epidemic proportions in developed and developing countries, obesity is considered a serious public health issue. In order to increase knowledge concerning the ability of health care professionals in caring for obese adolescents and adopt more efficient preventive and control measures, a questionnaire was developed and validated to assess non-dietitian health professionals regarding their Knowledge of Nutrition in Obese Adolescents (KNOA). The development and evaluation of a questionnaire to assess the knowledge of primary care practitioners with respect to nutrition in obese adolescents was carried out in five phases, as follows: 1) definition of study dimensions; 2) development of 42 questions and preliminary evaluation of the questionnaire by a panel of experts; 3) characterization and selection of primary care practitioners (35 dietitians and 265 non-dietitians) and measurement of questionnaire criteria by contrasting the responses of dietitians and non-dietitians; 4) reliability assessment by question exclusion based on item difficulty (too easy and too difficult for non-dietitian practitioners), item discrimination, internal consistency and reproducibility index determination; and 5) scoring the completed questionnaires. Dietitians obtained higher scores than non-dietitians (Mann-Whitney U test, P < 0.05), confirming the validity of the questionnaire criteria. Items were discriminated by correlating the score for each item with the total score, using a minimum of 0.2 as a correlation coefficient cutoff value. Item difficulty was controlled by excluding questions answered correctly by more than 90% of the non-dietitian subjects (too easy) or by less than 10% of them (too difficult). The final questionnaire contained 26 of the original 42 questions, increasing Cronbach's α value from 0.788 to 0.807. Test-retest agreement between respondents was classified as good to very good (Kappa test, >0.60).
The KNOA questionnaire developed for primary care practitioners is a valid, consistent and suitable instrument that can be applied over time, making it a promising tool for developing and guiding public health policies.
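The item-exclusion rules stated in this abstract (an item-total correlation of at least 0.2, and a proportion correct between 10% and 90%) can be sketched as a simple screening function. The thresholds come from the abstract; the function names, the handling of edge cases and the response matrix are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either variable is constant."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    denom = math.sqrt(sxx * syy)
    return sxy / denom if denom > 0 else 0.0


def screen_items(responses, min_r=0.2, min_p=0.10, max_p=0.90):
    """Return indices of items that pass both screening rules.

    responses: list of per-respondent lists of 0/1 item scores.
    """
    n = len(responses)
    totals = [sum(row) for row in responses]
    kept = []
    for j in range(len(responses[0])):
        item = [row[j] for row in responses]
        proportion_correct = sum(item) / n
        # Exclude items that are too easy or too difficult.
        if proportion_correct < min_p or proportion_correct > max_p:
            continue
        # Keep items whose score correlates enough with the total score.
        if pearson(item, totals) >= min_r:
            kept.append(j)
    return kept


# Hypothetical answers from six respondents to four questions.
answers = [
    [1, 1, 1, 0], [1, 1, 1, 0], [1, 1, 0, 0],
    [1, 0, 1, 1], [1, 0, 0, 1], [1, 0, 0, 1],
]
print(screen_items(answers))
```

In the study the difficulty rule was applied to the non-dietitian respondents, so a faithful replication would pass only that subgroup's answers to the difficulty check.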
Ambrosio, Leire; Portillo, Mari Carmen; Rodríguez-Blázquez, Carmen; Rodriguez-Violante, Mayela; Castrillo, Juan Carlos Martínez; Arillo, Víctor Campos; Garretto, Nélida Susana; Arakaki, Tomoko; Dueñas, Marcos Serrano; Álvarez, Mario; Ibáñez, Ivonne Pedroso; Carvajal, Ana; Martínez-Martín, Pablo
2016-01-01
Understanding how a person lives with a chronic illness, such as Parkinson’s disease (PD), is necessary to provide individualized care, and professionals' role in person-centered care at clinical and community levels is paramount. The present study aimed to analyze the psychometric properties of the Living with Chronic Illness-PD Scale (EC-PC) in a wide Spanish-speaking population with PD. An international cross-sectional study with retest was carried out with 324 patients from four Latin American countries and Spain. Feasibility, acceptability, scaling assumptions, reliability, precision, and construct validity were tested. The study included 324 patients, with age (mean±s.d.) 66.67±10.68 years. None of the EC-PC items had missing values and all acceptability parameters fulfilled the standard criteria. Around two-thirds of the items (61.54%) met scaling assumptions standards. Concerning internal consistency, Cronbach’s alpha values were 0.68–0.88; item-total correlation was >0.30, except for two items; item homogeneity index was >0.30, and inter-item correlation values were 0.14–0.76. Intraclass correlation coefficient for EC-PC stability was 0.76 and standard error of measurement (s.e.m.) for precision was 8.60 (for an EC-PC s.d.=18.57). EC-PC presented strong correlation with social support (rS=0.61) and moderate correlation with life satisfaction (rS=0.46). Weak and negligible correlations were found with the other scales. Internal validity correlations ranged from 0.46 to 0.78. EC-PC total scores were significantly different for each severity level based on Hoehn and Yahr and Clinical Impression of Severity Index, but not for Patient Global Impression of Severity. The EC-PC has satisfactory acceptability, reliability, precision, and validity to evaluate living with PD. PMID:28725703
Tayama, Jun; Ogawa, Sayaka; Takeoka, Atsushi; Kobayashi, Masakazu; Shirabe, Susumu
2017-01-01
Obesity has become a serious social problem in industrialized countries in recent years. Clinically, although the evaluation of dietary behavior abnormalities is as important as any method of risk assessment for obesity, almost all the existing scales have many items and may therefore pose numerous practical clinical difficulties. In this study, we aimed to prepare a short questionnaire to assess the dietary behavior abnormalities related to obesity. A total of 1032 individuals aged 20 to 59 years participated in the present study. Using item response theory (IRT), we selected the items for a short version from among the 30 items of the Sakata Eating Behavior Scale (EBS), which is widely used in Japan. As a result of the IRT-based analysis on the original 30-item version, 7 items were adopted as the short version. The correlation between the total score of the original EBS and the EBS short form was extremely high (r = 0.93, P = .001). In examining the criterion validity, for all participants (n = 1032), males (n = 516), and females (n = 516), the correlation coefficients between the total score of the EBS short form and body mass index (BMI) were r = 0.26, r = 0.28, and r = 0.28, respectively. In the receiver operating characteristic (ROC) analysis, performed with obesity (BMI > 25 kg/m2) as the dependent variable, the area under the curve was significantly higher for the 7-item version than for the total score of the original items (P = .0005). In conclusion, the 7-item EBS short form was created. Furthermore, it was found that the EBS short form is a reliable and valid measure that can be used as an indicator of obesity in both clinical and research settings. PMID:29049248
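The area under the ROC curve used in this abstract equals the probability that a randomly chosen case (here, an obese participant) scores higher than a randomly chosen non-case, with ties counted as one half. A minimal sketch of that pairwise calculation follows; the scores are hypothetical and the function is illustrative, not the authors' analysis code.

```python
def auc(scores_cases, scores_noncases):
    """Area under the ROC curve from pairwise score comparisons."""
    if not scores_cases or not scores_noncases:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for case in scores_cases:
        for noncase in scores_noncases:
            if case > noncase:
                wins += 1.0      # case ranked above non-case
            elif case == noncase:
                wins += 0.5      # tie counts as half
    return wins / (len(scores_cases) * len(scores_noncases))


# Hypothetical short-form scores for obese and non-obese participants.
print(round(auc([5, 7, 9], [3, 5, 6]), 3))
```

An area of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups.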
Paz, Sylvia H; Spritzer, Karen L; Morales, Leo S; Hays, Ron D
2013-03-29
To evaluate the equivalence of the PROMIS® wave 1 physical functioning item bank, by age (50 years or older versus 18-49). A total of 114 physical functioning items with 5 response choices were administered to English- (n=1504) and Spanish-language (n=640) adults. Item frequencies, means and standard deviations, item-scale correlations, and internal consistency reliability were estimated. Differential Item Functioning (DIF) by age was evaluated. Thirty of the 114 items were flagged for DIF based on an R-squared of 0.02 or above criterion. The expected total score was higher for those respondents who were 18-49 than those who were 50 or older. Those who were 50 years or older versus 18-49 years old with the same level of physical functioning responded differently to 30 of the 114 items in the PROMIS® physical functioning item bank. This study yields essential information about the equivalence of the physical functioning items in older versus younger individuals.
Self-correction in biomedical publications and the scientific impact.
Gasparyan, Armen Yuri; Ayvazyan, Lilit; Akazhanov, Nurbek A; Kitas, George D
2014-02-01
To analyze mistakes and misconduct in multidisciplinary and specialized biomedical journals. We conducted searches through PubMed to retrieve errata, duplicate, and retracted publications (as of January 30, 2014). To analyze publication activity and citation profiles of countries, multidisciplinary, and specialized biomedical journals, we referred to the latest data from the SCImago Journal and Country Rank database. Total number of indexed articles and values of the h-index of the fifty most productive countries and multidisciplinary journals were recorded and linked to the number of duplicate and retracted publications in PubMed. Our analysis found 2597 correction items. A striking increase in the number of corrections appeared in 2013, which is mainly due to 871 (85.3%) corrections from PLOS One. The number of duplicate publications was 1086. Articles frequently published in duplicate were reviews (15.6%), original studies (12.6%), and case reports (7.6%), whereas top three retracted articles were original studies (10.1%), randomized trials (8.8%), and reviews (7%). A strong association existed between the total number of publications across countries and duplicate (rs=0.86, P<0.0001) and retracted items (rs=0.812, P<0.0001). A similar trend was found between country-based h-index values and duplicate and retracted publications. The study suggests that the intensified self-correction in biomedicine is due to the attention of readers and authors, who spot errors in their hub of evidence-based information. Digitization and open access confound the staggering increase in correction notices and retractions.
Xiao, Yuan-mei; Wang, Zhi-ming; Wang, Mian-zhen; Lan, Ya-jia
2005-06-01
To test the reliability and validity of two mental workload assessment scales, i.e. the subjective workload assessment technique (SWAT) and the NASA task load index (NASA-TLX). One thousand two hundred and sixty-eight mental workers were sampled from various kinds of occupations, such as scientific research, education, administration and medicine, with randomized cluster sampling. The re-test reliability, split-half reliability, Cronbach's alpha coefficient and correlation coefficients between item score and total score were adopted to test the reliability. The test of validity included structure validity. The re-test reliability coefficients of these two scales and their items ranged from 0.516 to 0.753 (P < 0.01), indicating the two scales had good re-test reliability; the split-half reliability of SWAT was 0.645, its Cronbach's alpha coefficient was more than 0.80, and all the correlation coefficients between its item scores and total score were more than 0.70; as for NASA-TLX, both the split-half reliability and Cronbach's alpha coefficient were more than 0.80, and the correlation coefficients between its item scores and total score were all more than 0.60 (P < 0.01) except for the performance item. Both scales had good internal consistency. The Pearson correlation coefficient between the two scales was 0.492 (P < 0.01), implying the results of the two scales had good consistency. Factor analysis showed that the two scales had good structure validity. Both SWAT and NASA-TLX have good reliability and validity and may be used as valid tools to assess mental workload in China after being revised properly.
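Split-half reliability, one of the coefficients reported above, is usually obtained by correlating scores on two halves of a scale and stepping the result up with the Spearman-Brown formula. The abstract does not describe how the halves were formed or whether the correction was applied, so the odd-even split and the correction in this sketch are assumptions, and the ratings are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length score lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to full-test length."""
    return 2.0 * r_half / (1.0 + r_half)


def split_half_reliability(rows):
    """Odd-even split-half reliability with Spearman-Brown correction.

    rows: list of per-respondent lists, one score per item.
    """
    first_half = [sum(row[0::2]) for row in rows]   # items 1, 3, 5, ...
    second_half = [sum(row[1::2]) for row in rows]  # items 2, 4, 6, ...
    return spearman_brown(pearson(first_half, second_half))


# Hypothetical ratings from five respondents on a six-item scale.
ratings = [
    [3, 4, 3, 4, 2, 3], [1, 2, 1, 1, 2, 1], [5, 4, 5, 5, 4, 4],
    [2, 2, 3, 2, 3, 3], [4, 5, 4, 4, 5, 5],
]
print(round(split_half_reliability(ratings), 3))
```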
Item Reliabilities for a Family of Answer-Until-Correct (AUC) Scoring Rules.
ERIC Educational Resources Information Center
Kane, Michael T.; Moloney, James M.
The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…
Reliability and Validity of a Turkish version of the Prenatal Breastfeeding Self-Efficacy Scale.
Aydin, Ayse; Pasinlioglu, Turkan
2018-05-18
This study aims to conduct a reliability and validity study of the Turkish version of the "Prenatal Breastfeeding Self-Efficacy Scale", which determines pregnant women's perception of breastfeeding self-efficacy in the prenatal period. This methodological research was carried out between December 2014 and May 2016 in maternity clinics of the Erzurum Nene Hatun Maternity Hospital and Atatürk University Research Hospital. The study population consisted of pregnant women admitted to the specified clinics for prenatal controls. The study was carried out with 326 pregnant women who met the inclusion criteria and agreed to participate in the research, without any sample selection. The "Personal Information Form" and the "Prenatal Breastfeeding Self-Efficacy Scale - Turkish Form" were used for data collection. The data were collected by the face-to-face interview method and analyzed with SPSS 18 software. In the validity-reliability analysis of the scale, language and content validity, exploratory factor analysis, Cronbach's Alpha coefficient, item-total score correlation, and test-retest methods were used. Linguistic validity was verified by translation and back-translation of the Prenatal Breastfeeding Self-Efficacy Scale; the necessary corrections were then made according to expert recommendations to ensure content validity. The exploratory factor analysis, performed to determine the construct validity of the scale, yielded a single-factor structure with factor loadings in the appropriate range (0.30-0.76). In the internal consistency analysis of the scale, Cronbach's Alpha was 0.86, the item-total score correlations were between 0.23 and 0.65, and no item was removed from the scale. In order to test the time-invariance of the scale, the test-retest correlation was computed and found to be 0.94. The relationship between the two applications was determined to be statistically significant (p < 0.001).
The Turkish version of the Prenatal Breastfeeding Self-Efficacy Scale was evaluated in Turkish women and found to be a valid and reliable measurement instrument. Copyright © 2018 Elsevier Ltd. All rights reserved.
Psychological distress in an incarcerated juvenile population.
Lyu, Shu-Yu; Chi, Ying-Chen; Farabee, David; Tsai, Liang-Ting; Lee, Ming-Been; Lo, Feng-En; Morisky, Donald E
2015-11-01
This study sought to examine the prevalence and correlates of psychological distress among incarcerated youth in Taiwan using the 5-item Brief Symptom Rating Scale (BSRS-5). This cross-sectional census survey study was conducted in 2007 among all the juveniles incarcerated in 23 correctional institutions (n = 1505) in Taiwan using a self-administered anonymous questionnaire. Of the total 1505 participants, 1363 completed the questionnaire (91% response rate). We excluded 494 participants as they were aged either over 17 years or under 12 years. Psychological distress was measured among the final 869 participants using the BSRS-5. Psychological distress was defined as a total score of at least 6 out of 20. Those identified as having psychological distress were then pooled into a case group and compared with control participants without psychological distress. The prevalence of psychological distress was 44.1%. Among the case group, sleep disturbance (36.8%) had the highest prevalence of severe or very severe symptoms, followed by depression (34.7%), and hostility (27.9%). Multivariate logistic regression analysis revealed that correlates of psychological distress included the following: being female; having a poor self-rated health status; having joined a gang; having experienced life disturbances prior to the current imprisonment; and having ever had a smoking habit. Significant sex differences were found for both the overall BSRS-5, as well as for each individual item of the BSRS-5. Treatment programs and interventions should be carefully tailored to address the mental health needs of juvenile inmates in a sex-specific manner using a multifaceted approach. Copyright © 2014. Published by Elsevier B.V.
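The case definition in this abstract (a BSRS-5 total of at least 6 out of 20) can be written as a short scoring rule. The sketch assumes each of the five items is rated 0 to 4, which is inferred from the 20-point maximum rather than stated in the abstract; the function names are hypothetical.

```python
def bsrs5_total(item_scores):
    """Sum the five BSRS-5 item ratings (each assumed to be 0-4)."""
    if len(item_scores) != 5:
        raise ValueError("BSRS-5 requires exactly five item scores")
    if any(not 0 <= s <= 4 for s in item_scores):
        raise ValueError("each item score must be between 0 and 4")
    return sum(item_scores)


def has_psychological_distress(item_scores, cutoff=6):
    """Apply the cut-off from the abstract: total score of at least 6."""
    return bsrs5_total(item_scores) >= cutoff


# Hypothetical respondents.
print(has_psychological_distress([1, 1, 1, 1, 2]))
print(has_psychological_distress([1, 1, 1, 1, 1]))
```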
On the application of copula in modeling maintenance contract
NASA Astrophysics Data System (ADS)
Iskandar, B. P.; Husniah, H.
2016-02-01
This paper deals with the application of copulas to maintenance contracts for a nonrepayable item. Failures of the item are modeled using a two-dimensional approach that considers both the age and the usage of the item, which requires a bivariate distribution to model failures. When the item fails, it is restored by corrective maintenance (CM) in the form of minimal repair. CM can be outsourced to an external agent or done in house. The decision problem for the owner is to find the maximum total profit, whereas the agent must determine the optimal price of the contract. We obtain mathematical models of the decision problems for both the owner and the agent using a Nash game theory formulation.
Chabrera, Carolina; Areal, Joan; Font, Albert; Caro, Mónica; Bonet, Marta; Zabalegui, Adelaida
2015-01-01
The aim of this study is to develop a Spanish version of the Satisfaction With Decision scale (SWDs) and analyse the psychometric properties of validity and reliability. An observational, descriptive study and validation of a tool to measure satisfaction with the decision. Urology, Radiation oncology, and Medical oncology Departments of the Hospital Universitari Germans Trias i Pujol, Institut Català d'Oncologia and the Institut Oncològic del Vallès - Hospital General de Catalunya. A total of 170 participants diagnosed with prostate cancer, and who could read and write in Spanish and gave their informed consent. A translation, back-translation and cross-cultural adaptation to Spanish was performed on the SWDs. The content validity, criterion validity, construct validity and reliability (internal consistency and stability) of the Spanish version were evaluated. The SWDs contains 6 items with 5-item Likert scales. A Spanish version (ESD) was obtained that was linguistically and conceptually equivalent to the original version. Criterion validity, the ESD correlated with "satisfaction with the decision" using a linear analogue scale, was significant (r=0.63, P<.01) for all items. The factorial analysis showed a unique dimension to explain 82.08% of the variance. The ESD showed excellent results in terms of internal consistency (Cronbach alpha=0.95) and good test-retest reliability with intraclass correlation coefficient of 0.711. The ESD is a validated Spanish scale to measure the satisfaction with the decisions taken in health, and demonstrates a correct validity and reliability. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Development and psychometric testing of the Cancer Knowledge Scale for Elders.
Su, Ching-Ching; Chen, Yuh-Min; Kuo, Bo-Jein
2009-03-01
To develop the Cancer Knowledge Scale for Elders and test its validity and reliability. The number of elders suffering from cancer is increasing. To facilitate cancer prevention behaviours among elders, they need to be educated about cancer. Prior to designing a programme that would respond to the special needs of elders, understanding the cancer-related knowledge within this population was necessary. However, extensive review of the literature revealed a lack of appropriate instruments for measuring cancer-related knowledge. A valid and reliable cancer knowledge scale for elders is necessary. A non-experimental methodological design was used to test the psychometric properties of the Cancer Knowledge Scale for Elders. Item analysis was first performed to screen out items that had low corrected item-total correlation coefficients. Construct validity was examined with a principal component method of exploratory factor analysis. Cancer-related health behaviour was used as the criterion variable to evaluate criterion-related validity. Internal consistency reliability was assessed by the KR-20. Stability was determined by two-week test-retest reliability. The factor analysis yielded a four-factor solution accounting for 49.5% of the variance. For criterion-related validity, cancer knowledge was positively correlated with cancer-related health behaviour (r = 0.78, p < 0.001). The KR-20 coefficients were 0.85, 0.76, 0.79 and 0.67 for the four factors and 0.87 for the total scale. Test-retest reliability over a two-week period was 0.83 (p < 0.001). This study provides evidence for content validity, construct validity, criterion-related validity, internal consistency and stability of the Cancer Knowledge Scale for Elders. The results show that this scale is an easy-to-use instrument for elders and has adequate validity and reliability. The scale can be used as an assessment instrument when implementing cancer education programmes for elders. It can also be used to evaluate the effects of education programmes.
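The KR-20 coefficient reported above applies to items scored right or wrong. As a hedged illustration, the standard formula KR-20 = k/(k-1) * (1 - sum(p_i * q_i) / variance of total scores) can be sketched as follows. The response matrix is invented for the example, and the use of the population variance is an assumption; some texts use the sample variance, and the abstract does not say which the authors used.

```python
from statistics import pvariance
from typing import Sequence


def kr20(responses: Sequence[Sequence[int]]) -> float:
    """Kuder-Richardson 20 for dichotomous (0/1) items.

    responses: one row per respondent, one column per item.
    """
    n = len(responses)
    k = len(responses[0])
    if k < 2:
        raise ValueError("KR-20 needs at least two items")
    totals = [sum(row) for row in responses]
    var_total = pvariance(totals)  # assumption: population variance
    if var_total == 0:
        raise ValueError("total scores have zero variance")
    # Sum of p*q over items, where p is the proportion answering correctly.
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n
        sum_pq += p * (1 - p)
    return k / (k - 1) * (1 - sum_pq / var_total)


# Hypothetical answers from four respondents to three items.
answers = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(kr20(answers))
```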
Excellent reliability of the Hamilton Depression Rating Scale (HDRS-21) in Indonesia after training.
Istriana, Erita; Kurnia, Ade; Weijers, Annelies; Hidayat, Teddy; Pinxten, Lucas; de Jong, Cor; Schellekens, Arnt
2013-09-01
The Hamilton Depression Rating Scale (HDRS) is the most widely used depression rating scale worldwide. Reliability of HDRS has been reported mainly from Western countries. The current study tested the reliability of HDRS ratings among psychiatric residents in Indonesia, before and after HDRS training. The hypotheses were that: (i) prior to the training reliability of HDRS ratings is poor; and (ii) HDRS training can improve reliability of HDRS ratings to excellent levels. Furthermore, we explored cultural validity at item level. Videotaped HDRS interviews were rated by 30 psychiatric residents before and after 1 day of HDRS training. Based on a gold standard rating, percentage correct ratings and deviation from the standard were calculated. Correct ratings increased from 83% to 99% at item level and from 70% to 100% for the total rating. The average deviation from the gold standard rating improved from 0.07 to 0.02 at item level and from 2.97 to 0.46 for the total rating. HDRS assessment by psychiatric trainees in Indonesia without prior training is unreliable. A short, evidence-based HDRS training improves reliability to near perfect levels. The outlined training program could serve as a template for HDRS trainings. HDRS items that may be less valid for assessment of depression severity in Indonesia are discussed. Copyright © 2013 Wiley Publishing Asia Pty Ltd.
Preparation breeds success: Brain activity predicts remembering.
Herron, Jane E; Evans, Lisa H
2018-05-09
Successful retrieval of episodic information is thought to involve the adoption of memory states that ensure that stimulus events are treated as episodic memory cues (retrieval mode) and which can bias retrieval toward specific memory contents (retrieval orientation). The neural correlates of these memory states have been identified in many neuroimaging studies, yet critically there is no direct evidence that they facilitate retrieval success. We cued participants before each test item to prepare to complete an episodic (retrieve the encoding task performed on the item at study) or a non-episodic task. Our design allowed us to separate event-related potentials (ERPs) elicited by the preparatory episodic cue according to the accuracy of the subsequent memory judgment. We predicted that a correlate of retrieval orientation should be larger in magnitude preceding correct source judgments than that preceding source errors. This hypothesis was confirmed. Preparatory ERPs at bilateral frontal sites were significantly more positive-going when preceding correct source judgments than when preceding source errors or correct responses in a non-episodic baseline task. Furthermore this effect was not evident prior to recognized items associated with incorrect source judgments. This pattern of results indicates a direct contribution of retrieval orientation to the recovery of task-relevant information and highlights the value of separating preparatory neural activity at retrieval according to subsequent memory accuracy. Moreover, at a more general level this work demonstrates the important role of pre-stimulus processing in ecphory, which has remained largely neglected to date. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Cadavid, Sara; Beato, Maria Soledad
2016-01-01
Memory researchers have long been captivated by the nature of memory distortions and have made efforts to identify the neural correlates of true and false memories. However, the underlying mechanisms of avoiding false memories by correctly rejecting related lures remain underexplored. In this study, we employed a variant of the Deese/Roediger-McDermott paradigm to explore neural signatures of committing and avoiding false memories. ERPs were obtained for True recognition, False recognition, Correct rejection of new items, and, more importantly, Correct rejection of related lures. With these ERP data, early-frontal, left-parietal, and late right-frontal old/new effects (associated with familiarity, recollection, and monitoring processes, respectively) were analysed. Results indicated that there were similar patterns for True and False recognition in all three old/new effects analysed in our study. Also, False recognition and Correct rejection of related lures activities seemed to share common underlying familiarity-based processes. The ERP similarities between False recognition and Correct rejection of related lures disappeared when recollection processes were examined because only False recognition presented a parietal old/new effect. This finding supported the view that actual false recollections underlie false memories, providing evidence consistent with previous behavioural research and with most ERP and neuroimaging studies. Later, with the onset of monitoring processes, False recognition and Correct rejection of related lures waveforms presented, again, clearly dissociated patterns. Specifically, False recognition and True recognition showed more positive-going patterns than the Correct rejection of related lures signal and the Correct rejection of new items signature. Since False recognition and Correct rejection of related lures triggered familiarity-recognition processes, our results suggest that deciding which items are studied is based more on recollection processes, which are later supported by monitoring processes. Results are discussed in terms of Activation-Monitoring Framework and Fuzzy-Trace Theory, the most prominent explanatory theories of false memory raised with the Deese/Roediger-McDermott paradigm. PMID:27711125
Development of a work addiction scale.
Andreassen, Cecilie Schou; Griffiths, Mark D; Hetland, Jørn; Pallesen, Ståle
2012-06-01
Research into excessive work has gained increasing attention over the last 20 years. Terms such as "workaholism," "work addiction," and "excessive work" have been used interchangeably. Given the increase in empirical research, this study presents the development of the Bergen Work Addiction Scale (BWAS), a new psychometrically validated scale for the assessment of work addiction. A pool of 14 items, with two reflecting each of seven core elements of addiction (i.e., salience, mood modification, tolerance, withdrawal, conflict, relapse, and problems), was initially constructed. The items were then administered to two samples, one recruited by a web survey following a television broadcast about workaholism (n = 11,769) and one comprising participants in the second wave of a longitudinal internet-based survey about working life (n = 368). The items with the highest corrected item-total correlation from within each of the seven addiction elements were retained in the final scale. The assumed one-factor solution of the refined seven-item scale was acceptable (root mean square error of approximation = 0.077, Comparative Fit Index = 0.96, Tucker-Lewis Index = 0.95) and the internal reliability coefficients for the two samples were 0.84 and 0.80, respectively. The scores of the BWAS converged with scores on other workaholism scales, except for a Work Enjoyment subscale. A suggested cut-off for categorization of workaholics showed good discriminative ability in terms of working hours, leadership position, and subjective health complaints. It is concluded that the BWAS has good psychometric properties. © 2012 The Authors. Scandinavian Journal of Psychology © 2012 The Scandinavian Psychological Associations.
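The item-retention rule described above (keep, within each addiction element, the item with the highest corrected item-total correlation) can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' procedure: the corrected correlation is taken here as the Pearson correlation between an item and the sum of all other items, and the response matrix and element groupings are hypothetical.

```python
from math import sqrt
from typing import List, Sequence, Tuple


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; raises ZeroDivisionError if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def corrected_item_total(responses: Sequence[Sequence[float]]) -> List[float]:
    """Correlate each item with the total of the remaining items."""
    k = len(responses[0])
    result = []
    for j in range(k):
        item = [row[j] for row in responses]
        rest = [sum(row) - row[j] for row in responses]  # total without item j
        result.append(pearson(item, rest))
    return result


def best_item_per_element(r_values: Sequence[float],
                          elements: Sequence[Tuple[int, ...]]) -> List[int]:
    """For each group of item indices, keep the index with the highest r."""
    return [max(group, key=lambda j: r_values[j]) for group in elements]


# Hypothetical data: four respondents, three items.
rows = [[1, 1, 2], [2, 2, 1], [3, 3, 2], [4, 4, 1]]
r = corrected_item_total(rows)
print([round(v, 3) for v in r])
print(best_item_per_element(r, [(0, 2), (1, 2)]))
```

Removing the item from the total before correlating avoids inflating the coefficient with the item's correlation with itself, which is the point of the "corrected" variant.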
Eren, Nurhan
2014-12-01
In this study, we aimed to develop two reliable and valid assessment instruments for investigating the level of difficulties mental health workers experience while working with patients with personality disorders and the attitudes they develop toward these patients. The research was carried out based on the general screening model. The study sample consisted of 332 mental health workers in several mental health clinics of Turkey, with a certain amount of experience in working with personality disorders, who were selected with a random assignment method. In order to collect data, the Personal Information Questionnaire, the Difficulty of Working with Personality Disorders Scale (PD-DWS), and the Attitudes Towards Patients with Personality Disorders Scale (PD-APS), which are being examined for reliability and validity, were applied. To determine construct validity, the Adjective Check List, Maslach Burnout Inventory, and State and Trait Anxiety Inventory were used. Exploratory factor analysis was used for investigating the structural validity, and Cronbach alpha, Spearman-Brown, and Guttman Split-Half reliability analyses were utilized to examine the reliability. Also, item reliability and validity computations were carried out by investigating the corrected item-total correlations and discriminative indexes of the items in the scales. For the PD-DWS, the KMO value was .946; the Bartlett sphericity test was significant (p<.001); the computed test-retest reliability coefficient was .702; and the Cronbach alpha value of the total test score was .952. For the PD-APS, the KMO value was .925; the Bartlett sphericity test was significant (p<.001); the computed reliability coefficient based on continuity was .806; and the Cronbach alpha value of the total test score was .913. Analyses on both scales were based on total scores. It was found that the PD-DWS and PD-APS have good psychometric properties, measure the structure that is being investigated, are compatible with other scales, have high levels of internal reliability between their items, and are consistent across time. Therefore, it was concluded that both scales are valid and reliable instruments.
Yanagisawa, Ayumi; Sudo, Noriko; Amitani, Yukiko; Caballero, Yuko; Sekiyama, Makiko; Mukamugema, Christine; Matsuoka, Takuya; Imanishi, Hiroaki; Sasaki, Takayo; Matsuda, Hirotaka
2016-01-01
This study aimed to develop and evaluate the validity of a food frequency questionnaire (FFQ) for rural Rwandans. Since our FFQ was developed to assess malnutrition, it measured energy, protein, vitamin A, and iron intakes only. We collected 260 weighed food records (WFRs) from a total of 162 Rwandans. Based on the WFR data, we developed a tentative FFQ and examined the food list by percent contribution to energy and nutrient intakes. To assess the validity, nutrient intakes estimated from the FFQ were compared with those calculated from three-day WFRs by correlation coefficient and cross-classification for 17 adults. Cumulative contributions of the 18-item FFQ to the total intakes of energy and nutrients reached nearly 100%. Crude and energy-adjusted correlation coefficients ranged from −0.09 (vitamin A) to 0.58 (protein) and from −0.19 (vitamin A) to 0.68 (iron), respectively. About 50%–60% of the participants were classified into the same tertile. Our FFQ provided acceptable validity for energy and iron intakes and could rank Rwandan adults in eastern rural area correctly according to their energy and iron intakes. PMID:27429558
Reidpath, Daniel D; Ahmadi, Keivan
2014-01-01
Measures of household socio-economic position (SEP) are widely used in health research. There exist a number of approaches to their measurement, with Principal Components Analysis (PCA) applied to a basket of household assets being one of the most common. PCA, however, carries a number of assumptions about the distribution of the data which may be untenable, and alternative, non-parametric, approaches may be preferred. Mokken scale analysis is a non-parametric, item response theory approach to scale development which appears never to have been applied to household asset data. A Mokken scale can be used to rank order items (measures of wealth) as well as households. Using data on household asset ownership from a national sample of 4,154 consenting households in the World Health Survey from Vietnam, 2003, we construct two measures of household SEP. Seventeen items asking about assets, and utility and infrastructure use were used. Mokken Scaling and PCA were applied to the data. A single item measure of total household expenditure is used as a point of contrast. An 11 item scale, out of the 17 items, was identified that conformed to the assumptions of a Mokken Scale. All the items in the scale were identified as strong items (Hi > .5). Two PCA measures of SEP were developed as a point of contrast. One PCA measure was developed using all 17 available asset items, the other used the reduced set of 11 items identified in the Mokken scale analysis. The Mokken Scale measure of SEP and the 17 item PCA measure had a very high correlation (r = .98), and they both correlated moderately with total household expenditure: r = .59 and r = .57 respectively. In contrast the 11 item PCA measure correlated moderately with the Mokken scale (r = .68), and weakly with the total household expenditure (r = .18). The Mokken scale measure of household SEP performed at least as well as PCA, and outperformed the PCA measure developed with the 11 items used in the Mokken scale. Unlike PCA, Mokken scaling carries no assumptions about the underlying shape of the distribution of the data, and can be used simultaneously to order household SEP and items. The approach, however, has not been tested with data from other countries and remains an interesting, but under-researched approach. PMID:25126103
Dennis, Michael L; Chan, Ya-Fen; Funk, Rodney R
2006-01-01
The Global Appraisal of Individual Needs (GAIN) is a 1-2 hour standardized biopsychosocial instrument that integrates clinical and research assessment for people presenting to substance abuse treatment. The GAIN - Short Screener (GSS) is a 3-5 minute screener to quickly identify those who would have a disorder based on the full 60-120 minute GAIN and to triage the problem and kind of intervention they are likely to need along four dimensions (internalizing disorders, externalizing disorders, substance disorders, and crime/violence). Data were collected from 6,177 adolescents and 1,805 adults as part of 77 studies in three dozen locations around the United States that used the GAIN. For both adolescents and adults, the 20-item total disorder screener (TDScr) and its four 5-item sub-screeners (internalizing disorders, externalizing disorders, substance disorders, and crime/violence) have good internal consistency (alpha of .96 on total screener) and are highly correlated (r = .84 to .94) with the 123-item longer scales in the full GAIN. The GSS also does well in terms of its receiver operator characteristics (90% or more under the curve in all analyses) and has clinical decision-making cut points with excellent sensitivity (90% or more) for identifying people with a disorder and excellent specificity (92% or more) for correctly ruling out people who did not have a disorder. The GSS has good potential as an efficient screener for identifying people with co-occurring disorders across multiple systems and routing them to the right services and more detailed assessments.
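Sensitivity and specificity at a screening cut point, as reported above, follow directly from a 2x2 table of screener result against the reference diagnosis. The sketch below shows the arithmetic; the counts are hypothetical and chosen only to reproduce the 90% and 92% thresholds quoted in the abstract, and the class and field names are illustrative.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    """Screener result versus reference diagnosis (all counts hypothetical)."""
    true_pos: int   # screened positive, disorder present
    false_neg: int  # screened negative, disorder present
    true_neg: int   # screened negative, disorder absent
    false_pos: int  # screened positive, disorder absent

    def sensitivity(self) -> float:
        """Share of people with the disorder whom the screener flags."""
        return self.true_pos / (self.true_pos + self.false_neg)

    def specificity(self) -> float:
        """Share of people without the disorder whom the screener rules out."""
        return self.true_neg / (self.true_neg + self.false_pos)


counts = ConfusionCounts(true_pos=90, false_neg=10, true_neg=92, false_pos=8)
print(counts.sensitivity(), counts.specificity())
```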
Correlation between physical anomaly and behavioral abnormalities in Down syndrome
Bhattacharyya, Ranjan; Sanyal, Debasish; Roy, Krishna; Bhattacharyya, Sumita
2010-01-01
Objective: The minor physical anomaly (MPA) is believed to reflect abnormal development of the CNS. The aim is to find incidence of MPA and its behavioral correlates in Down syndrome and to compare these findings with the other causes of intellectual disability and normal population. Materials and Methods: One-hundred and forty intellectually disabled people attending a tertiary care set-up and from various NGOs are included in the study. The age-matched group from normal population was also studied for comparison. MPA are assessed by using Modified Waldrop scale and behavioral abnormality by Diagnostic assessment scale for severely handicapped (DASH II scale). Results: The Down syndrome group had significantly more MPA than other two groups and most of the MPA is situated in the global head region. There is strong correlation (P < 0.001) between the various grouped items of Modified Waldrop scale. Depression subscale is correlated with anomalies in the hands (P < 0.001), feet and Waldrop total items (P < 0.005). Mania item of DASH II scale is related with anomalies around the eyes (P < 0.001). Self-injurious behavior and total Waldrop score is negatively correlated with global head. Conclusion: Down syndrome group has significantly more MPA and a pattern of correlation between MPA and behavioral abnormalities exists which necessitates a large-scale study. PMID:21559153
Georgieva-Zhostova, Spaska; Kolev, Ognyan I; Stambolieva, Katerina
2014-09-01
The aim of the present study was the translation, cross-cultural adaptation and validation of the Dizziness Handicap Inventory in Bulgarian language (DHI-BG). Ninety-seven vestibular patients (19 men and 78 women, mean age 45.08 ± 13.85 years) took part in the investigation. All participants were asked to fill in the DHI-BG. Internal consistency was estimated using Cronbach's alpha and item-total correlation, reproducibility by calculating Bland-Altman's limits of agreement and intraclass correlation coefficients (ICCs). Associations were estimated by Spearman's correlation coefficients. The Cronbach's alpha for the total score, functional, physical and emotional subscales of DHI-BG were 0.88, 0.75, 0.72 and 0.81. The floor and ceiling effects of the DHI-BG total scale were evaluated with respect to the limits of agreement which were ±9.4-14.53 points. Intraclass correlation coefficients (ICCs) for all scale and subscales were higher than the recommended value of 0.75 and determined good test-retest reliability. The range of items correlation for DHI-BG was from 0.27 (item 12) to 0.72 (item 3). No significant differences were observed in the Cronbach's alpha coefficients between the DHI-BG and the original version, the German and Italian versions of the questionnaire. The most significant difference was observed in comparison with the German version of DHI. Construct validity presented a moderate correlation between Romberg coefficients and DHI-BG scores and strong correlation between all scores of DHI and the self-perceived disability. The results suggest that DHI-BG scores show a good discriminative validity between groups with different levels of self-assessed disability. The Bulgarian version of the DHI is a reliable and valid tool in assessing the impact of dizziness on the quality of life in Bulgarian vestibular patients.
Jung, Hee-Yeon; Kim, Jong-Hoon; Ahn, Yong-Min; Kim, Seong-Chan; Hwang, Samuel S; Kim, Yong-Sik
2005-01-01
The Liverpool University Neuroleptic Side-Effect Rating Scale (LUNSERS) was examined for its usefulness as a subjective measure of drug-induced parkinsonism and akathisia. Eighty-three subjects were assessed using the LUNSERS, the Simpson-Angus Scale (SAS) and the Barnes Akathisia Rating Scale (BARS), before and after a 6-week treatment with olanzapine. Significant correlations were found between the changes in scores of parkinsonism items of LUNSERS and SAS. The changes in scores of akathisia item (restlessness), extrapyramidal side effects (EPS) subscale and psychic side-effects subscale of LUNSERS were significantly correlated with those of the BARS. 'Shakiness', one item of the EPS subscale of LUNSERS, correctly classified between parkinsonism and non-parkinsonism groups with 81.0% accuracy. A combination of four items included in EPS and psychic side-effect subscales of LUNSERS identified akathisia and non-akathisia groups with 76.2% accuracy. These results suggest that the EPS and psychic side-effect subscales of LUNSERS may be useful in screening for drug-induced parkinsonism and akathisia. Copyright (c) 2004 John Wiley & Sons, Ltd.
Relationship between cognitive and non-cognitive symptoms of delirium.
Rajlakshmi, Aarya Krishnan; Mattoo, Surendra Kumar; Grover, Sandeep
2013-04-01
To study the relationship between the cognitive and the non-cognitive symptoms of delirium. Eighty-four patients who were referred to psychiatry liaison services and met DSM-IV-TR criteria for delirium were assessed using the Delirium Rating Scale-Revised-98 (DRS-R-98) and the Cognitive Test for Delirium (CTD). The mean DRS-R-98 severity score was 17.19 and the mean DRS-R-98 total score was 23.36. The mean total score on CTD was 11.75. The mean scores on CTD were highest for comprehension (3.47) and lowest for vigilance (1.71). Poor attention was associated with significantly higher motor retardation and higher DRS-R-98 severity scores minus the attention scores. There were no significant differences between those with and without poor attention. Higher attention deficits were associated with higher dysfunction on all other domains of cognition on CTD. There was significant correlation between cognitive functions as assessed on CTD and total DRS-R-98 score, DRS-R-98 severity score and DRS-R-98 severity score without the attention item score. However, few correlations emerged between the CTD domain and total scores and the cognitive symptom total score of DRS-R-98 (items 9-13) or the non-cognitive symptom total score of DRS-R-98 (items 1-8). Our study suggests that in delirium, cognitive deficits are quite prevalent and correlate with overall severity of delirium. Attention deficit is a core symptom of delirium. Copyright © 2012 Elsevier B.V. All rights reserved.
Responsiveness of a Brief Measure of Lung Cancer Screening Knowledge.
Housten, Ashley J; Lowenstein, Lisa M; Leal, Viola B; Volk, Robert J
2016-12-14
Our aim was to examine the responsiveness of a lung cancer screening brief knowledge measure (LCS-12). Eligible participants were aged 55-80 years, current smokers or had quit within 15 years, and English speaking. They completed a baseline pretest survey, viewed a lung cancer screening video-based patient decision aid, and then filled out a follow-up posttest survey. We performed a paired samples t-test, calculated effect size, and calculated absolute and relative percent improvement for each item. Participants (n = 30) were primarily White (63%) with less than a college degree (63%), and half were female (50%). Mean age was 61.5 years (standard deviation [SD] = 4.67) and average smoking history was 30.4 pack-years (range = 4.6-90.0). Mean score on the 12-item measure increased from 47.3% correct on the pretest to 80.3% correct on the posttest (mean pretest score = 5.67 vs. mean posttest score = 9.63; mean score difference = 3.97, SD = 2.87, 95% CI = 2.90, 5.04). Total knowledge scores improved significantly and were responsive to the decision aid intervention (paired samples t-test = 7.57, p < .001; Cohen's effect size = 1.59; standard response mean [SRM] = 1.38). All individual items were responsive, yet two items had lower absolute responsiveness than the others (item 8: "Without screening, is lung cancer often found at a later stage when cure is less likely?" pretest correct = 83.3% vs. posttest = 96.7%, responsiveness = 13.4%; and item 10: "Can a CT scan find lung disease that is not cancer?" pretest correct = 80.0% vs. posttest = 93.3%, responsiveness = 13.3%). The LCS-12 knowledge measure may be a useful outcome measure of shared decision making for lung cancer screening.
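Several of the responsiveness statistics above can be recomputed from the reported summary values alone (n = 30, mean difference = 3.97, SD of the differences = 2.87). The sketch below does this under two assumptions: the paired t statistic is the mean difference divided by its standard error, and the 95% interval uses the two-sided critical value of about 2.045 for 29 degrees of freedom, hard-coded here instead of looked up. Cohen's effect size is not recomputed because the baseline SD it requires is not given in the abstract. The function name is illustrative.

```python
from math import sqrt


def paired_summary(n: int, mean_diff: float, sd_diff: float, t_crit: float) -> dict:
    """Paired t statistic, standardized response mean and CI from summary values."""
    se = sd_diff / sqrt(n)  # standard error of the mean difference
    return {
        "t": mean_diff / se,
        "srm": mean_diff / sd_diff,  # standardized response mean
        "ci": (mean_diff - t_crit * se, mean_diff + t_crit * se),
    }


# Values reported in the abstract; t_crit is the assumed critical value, df = 29.
result = paired_summary(n=30, mean_diff=3.97, sd_diff=2.87, t_crit=2.045)
print(round(result["t"], 2), round(result["srm"], 2),
      tuple(round(v, 2) for v in result["ci"]))
```

Working from rounded inputs, the recomputed values agree with the reported t, SRM and confidence limits to within rounding.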
Hamilton, Clayon B; Chesworth, Bert M
2013-11-01
The original 20-item Upper Extremity Functional Index (UEFI) has not undergone Rasch validation. The purpose of this study was to determine whether Rasch analysis supports the UEFI as a measure of a single construct (ie, upper extremity function) and whether a Rasch-validated UEFI has adequate reproducibility for individual-level patient evaluation. This was a secondary analysis of data from a repeated-measures study designed to evaluate the measurement properties of the UEFI over a 3-week period. Patients (n=239) with musculoskeletal upper extremity disorders were recruited from 17 physical therapy clinics across 4 Canadian provinces. Rasch analysis of the UEFI measurement properties was performed. If the UEFI did not fit the Rasch model, misfitting patients were deleted, items with poor response structure were corrected, and misfitting items and redundant items were deleted. The impact of differential item functioning on the ability estimate of patients was investigated. A 15-item modified UEFI was derived to achieve fit to the Rasch model where the total score was supported as a measure of upper extremity function only. The resultant UEFI-15 interval-level scale (0-100, worst to best state) demonstrated excellent internal consistency (person separation index=0.94) and test-retest reliability (intraclass correlation coefficient [2,1]=.95). The minimal detectable change at the 90% confidence interval was 8.1. Patients who were ambidextrous or bilaterally affected were excluded to allow for the analysis of differential item functioning due to limb involvement and arm dominance. Rasch analysis did not support the validity of the 20-item UEFI. However, the UEFI-15 was a valid and reliable interval-level measure of a single dimension: upper extremity function. Rasch analysis supports using the UEFI-15 in physical therapist practice to quantify upper extremity function in patients with musculoskeletal disorders of the upper extremity.
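The abstract reports a minimal detectable change at 90% confidence (MDC90) without stating how it was computed. A minimal sketch follows, assuming the commonly used formula MDC = z × √2 × SEM with SEM = SD × √(1 − ICC); the SD of 15.0 is a hypothetical value chosen for illustration, since the abstract does not report one.

```python
import math

def sem_from_icc(sd, icc):
    """Standard error of measurement from a sample SD and a reliability coefficient."""
    return sd * math.sqrt(1 - icc)

def mdc(sem, z=1.645):
    """Minimal detectable change; z = 1.645 corresponds to 90% confidence."""
    return z * math.sqrt(2) * sem

# Hypothetical SD (not from the study) combined with the reported ICC of .95
example_sem = sem_from_icc(15.0, 0.95)
example_mdc90 = mdc(example_sem)
print(round(example_mdc90, 2))
```

The √2 term reflects that a change score involves measurement error at two time points.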
Chesworth, Bert M.
2013-01-01
Background The original 20-item Upper Extremity Functional Index (UEFI) has not undergone Rasch validation. Objective The purpose of this study was to determine whether Rasch analysis supports the UEFI as a measure of a single construct (ie, upper extremity function) and whether a Rasch-validated UEFI has adequate reproducibility for individual-level patient evaluation. Design This was a secondary analysis of data from a repeated-measures study designed to evaluate the measurement properties of the UEFI over a 3-week period. Methods Patients (n=239) with musculoskeletal upper extremity disorders were recruited from 17 physical therapy clinics across 4 Canadian provinces. Rasch analysis of the UEFI measurement properties was performed. If the UEFI did not fit the Rasch model, misfitting patients were deleted, items with poor response structure were corrected, and misfitting items and redundant items were deleted. The impact of differential item functioning on the ability estimate of patients was investigated. Results A 15-item modified UEFI was derived to achieve fit to the Rasch model where the total score was supported as a measure of upper extremity function only. The resultant UEFI-15 interval-level scale (0–100, worst to best state) demonstrated excellent internal consistency (person separation index=0.94) and test-retest reliability (intraclass correlation coefficient [2,1]=.95). The minimal detectable change at the 90% confidence interval was 8.1. Limitations Patients who were ambidextrous or bilaterally affected were excluded to allow for the analysis of differential item functioning due to limb involvement and arm dominance. Conclusion Rasch analysis did not support the validity of the 20-item UEFI. However, the UEFI-15 was a valid and reliable interval-level measure of a single dimension: upper extremity function. 
Rasch analysis supports using the UEFI-15 in physical therapist practice to quantify upper extremity function in patients with musculoskeletal disorders of the upper extremity. PMID:23813086
Kim, Hee-Ju; Abraham, Ivo
2017-01-01
Evidence is needed on the clinicometric properties of single-item or short measures as alternatives to comprehensive measures. We examined whether two single-item fatigue measures (i.e., Likert scale, numeric rating scale) or a short fatigue measure were comparable to a comprehensive measure in reliability (i.e., internal consistency and test-retest reliability) and validity (i.e., convergent, concurrent, and predictive validity) in Korean young adults. For this quantitative study, we selected the Functional Assessment of Chronic Illness Therapy-Fatigue for the comprehensive measure and the Profile of Mood States-Brief, Fatigue subscale for the short measure; and constructed two single-item measures. A total of 368 students from four nursing colleges in South Korea participated. We used Cronbach's alpha and item-total correlation for internal consistency reliability and intraclass correlation coefficient for test-retest reliability. We assessed Pearson's correlation with a comprehensive measure for convergent validity, with perceived stress level and sleep quality for concurrent validity and the receiver operating characteristic curve for predictive validity. The short measure was comparable to the comprehensive measure in internal consistency reliability (Cronbach's alpha=0.81 vs. 0.88); test-retest reliability (intraclass correlation coefficient=0.66 vs. 0.61); convergent validity (r with comprehensive measure=0.79); concurrent validity (r with perceived stress=0.55, r with sleep quality=0.39) and predictive validity (area under curve=0.88). Single-item measures were not comparable to the comprehensive measure. A short fatigue measure exhibited similar levels of reliability and validity to the comprehensive measure in Korean young adults. Copyright © 2016 Elsevier Ltd. All rights reserved.
DERAKHSHANDEH, ZAHRA; AMINI, MITRA; KOJURI, JAVAD; DEHBOZORGIAN, MARZIYEH
2018-01-01
Introduction: Clinical reasoning is one of the most important skills in the process of training a medical student to become an efficient physician. Assessment of the reasoning skills in a medical school program is important to direct students’ learning. One of the tests for measuring the clinical reasoning ability is Clinical Reasoning Problems (CRPs). The major aim of this study is to measure psychometric qualities of CRPs and define correlation between this test and routine MCQ in cardiology department of Shiraz medical school. Methods: This study was a descriptive study conducted on all cardiology residents of Shiraz Medical School. The study population consisted of 40 residents in 2014. The routine CRPs and the MCQ tests were designed based on similar objectives and were carried out simultaneously. Reliability, item difficulty, item discrimination, and correlation between each item and the total score of CRPs were all measured with Excel and SPSS software to check the psychometric properties of the CRPs test. Furthermore, we calculated the correlation between CRPs test and MCQ test. The mean differences of CRPs test score between residents’ academic year [second, third and fourth year] were also evaluated by one-way analysis of variance (ANOVA) using SPSS software (version 20) (α=0.05). Results: The mean and standard deviation of score in CRPs was 10.19±3.39 out of 20; in MCQ, it was 13.15±3.81 out of 20. Item difficulty was in the range of 0.27-0.72; item discrimination was 0.30-0.75 with question No.3 being the exception (that was 0.24). The correlation between each item and the total score of CRP was 0.26-0.87; the correlation between CRPs test and MCQ test was 0.68 (p<0.001). The reliability of the CRPs was 0.72 as calculated by using Cronbach's alpha. The mean score of CRPs was different among residents based on their academic year and this difference was statistically significant (p<0.001). 
Conclusion: The results of the present investigation revealed that CRPs could be a reliable test for measuring clinical reasoning in residents. It can be included in cardiology residency assessment programs. PMID:29344528
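Item difficulty and item discrimination, as reported in this record, are classical test statistics. The sketch below shows one conventional way to compute them, assuming dichotomously scored items (1 = correct, 0 = incorrect) and the upper/lower 27% group convention for discrimination; the abstract does not state which formulas were used, and the data are hypothetical.

```python
def item_difficulty(item_scores):
    """Proportion of examinees who answered the item correctly."""
    return sum(item_scores) / len(item_scores)

def discrimination_index(item_scores, total_scores, fraction=0.27):
    """Proportion correct in the top-scoring group minus that in the bottom group."""
    n = len(total_scores)
    k = max(1, round(n * fraction))  # size of each extreme group
    order = sorted(range(n), key=lambda i: total_scores[i])
    lower, upper = order[:k], order[-k:]
    p_upper = sum(item_scores[i] for i in upper) / k
    p_lower = sum(item_scores[i] for i in lower) / k
    return p_upper - p_lower

# Hypothetical data for ten examinees (not from the study)
item = [1, 0, 1, 1, 0, 1, 0, 1, 1, 0]
totals = [15, 6, 12, 17, 8, 14, 5, 11, 16, 9]
print(item_difficulty(item), discrimination_index(item, totals))
```

In this toy data set every high scorer answers the item correctly and every low scorer misses it, so the discrimination index reaches its maximum.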
Self-correction in biomedical publications and the scientific impact
Gasparyan, Armen Yuri; Ayvazyan, Lilit; Akazhanov, Nurbek A.; Kitas, George D.
2014-01-01
Aim To analyze mistakes and misconduct in multidisciplinary and specialized biomedical journals. Methods We conducted searches through PubMed to retrieve errata, duplicate, and retracted publications (as of January 30, 2014). To analyze publication activity and citation profiles of countries, multidisciplinary, and specialized biomedical journals, we referred to the latest data from the SCImago Journal & Country Rank database. Total number of indexed articles and values of the h-index of the fifty most productive countries and multidisciplinary journals were recorded and linked to the number of duplicate and retracted publications in PubMed. Results Our analysis found 2597 correction items. A striking increase in the number of corrections appeared in 2013, which is mainly due to 871 (85.3%) corrections from PLOS One. The number of duplicate publications was 1086. Articles frequently published in duplicate were reviews (15.6%), original studies (12.6%), and case reports (7.6%), whereas top three retracted articles were original studies (10.1%), randomized trials (8.8%), and reviews (7%). A strong association existed between the total number of publications across countries and duplicate (rs = 0.86, P < 0.001) and retracted items (rs = 0.812, P < 0.001). A similar trend was found between country-based h-index values and duplicate and retracted publications. Conclusion The study suggests that the intensified self-correction in biomedicine is due to the attention of readers and authors, who spot errors in their hub of evidence-based information. Digitization and open access confound the staggering increase in correction notices and retractions. PMID:24577829
Leombruni, Paolo; Loera, Barbara; Miniotti, Marco; Zizzi, Francesca; Castelli, Lorys; Torta, Riccardo
2015-10-01
A steady increase in the number of patients requiring end-of-life care has been observed during the last decades. The assessment of healthcare students' attitudes toward end-of-life care is an important step in their curriculum, as it provides information about their disposition to practice palliative medicine. The Frommelt Attitude Toward Care of the Dying Scale (FATCOD-B) was developed to detect such a disposition, but its psychometric properties are yet to be clearly defined. A convenience sample of 608 second-year medical students participated in our study in the 2012/2013 and 2013/2014 academic years. All participants completed the FATCOD-B. The sample was randomly divided in two subsamples. In the item analysis, reliability (Cronbach's α), internal consistency (item-total correlations), and an exploratory factor analysis (EFA) were conducted using the first subsample (n = 300). Using the second subsample (n = 308), confirmatory factor analysis (CFA) was performed using the robust ML method in the Lisrel program. Reliability for all items was 0.699. Item-total correlations, ranging from 0.03 to 0.39, were weak. EFA identified a two-dimensional orthogonal solution, explaining 20% of total variance. CFA upheld the two-dimensional model, but the loadings on the dimensions and their respective indicators were weak and equal to zero for certain items. The findings of the present study suggest that the FATCOD-B measures a two-dimensional construct and that several items seem in need of revision. Future research oriented toward building a revised version of the scale should pay attention to item ambiguity and take particular care to distinguish among items that concern emotions and beliefs related to end-of-life care, as well as their subjects (e.g., the healthcare provider, the patient, his family).
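Cronbach's α and item-total correlations recur throughout these records. A minimal sketch of the standard definitions follows, using only the Python standard library; the "corrected" variant correlates each item with the sum of the remaining items. The response matrix is hypothetical and the function names are illustrative.

```python
from statistics import mean, variance

def cronbach_alpha(rows):
    """rows: one list of item scores per respondent."""
    k = len(rows[0])
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def corrected_item_total(rows, j):
    """Correlation of item j with the total of all other items."""
    item = [r[j] for r in rows]
    rest = [sum(r) - r[j] for r in rows]
    return pearson(item, rest)

# Hypothetical responses: five respondents, three items
rows = [[3, 4, 3], [2, 2, 3], [4, 5, 4], [1, 2, 2], [5, 4, 5]]
print(cronbach_alpha(rows), [corrected_item_total(rows, j) for j in range(3)])
```

Removing the item from its own total avoids the inflation that occurs when an item is correlated with a sum that already contains it.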
The Pieper-Zulkowski pressure ulcer knowledge test.
Pieper, Barbara; Zulkowski, Karen
2014-09-01
To describe the development and initial testing of the Pieper-Zulkowski Pressure Ulcer Knowledge Test (PZ-PUKT). Cross-sectional, instrument testing. Hospital association pressure ulcer educational program conference. Pressure ulcer research and guidelines from the last 5 years were examined for test item content. The initial PZ-PUKT had 115 items; response options were "true," "false," and "don't know." Registered nurses (N = 108) were randomly divided into 2 groups to take either the 60 prevention/risk and staging items or the 55 wound description items. Analyses of these responses resulted in 72 items, which were administered in total to a second cohort of 98 nurses for reliability. Cronbach's α was .80 for the 72-item PZ-PUKT. Cronbach's α values for the subscales were as follows: staging, .67; wound description, .64; and prevention/risk, .56. The mean correct scores were as follows: total, 80%; prevention, 77%; staging, 86%; and wound description, 77%. Nurses with wound care certification scored significantly higher on the PZ-PUKT than did nurses with other clinical certifications or nurses who lacked certification. The PZ-PUKT has updated content about pressure ulcer prevention/risk, staging, and wound description. Reliability values are highest for the total test. Further use of the instrument in diverse settings will add to reliability testing and may provide direction for determination of a passing cutoff score.
NASA Astrophysics Data System (ADS)
Nieminen, Pasi; Savinainen, Antti; Viiri, Jouni
2010-07-01
This study investigates students’ ability to interpret multiple representations consistently (i.e., representational consistency) in the context of the force concept. For this purpose we developed the Representational Variant of the Force Concept Inventory (R-FCI), which makes use of nine items from the 1995 version of the Force Concept Inventory (FCI). These original FCI items were redesigned using various representations (such as motion map, vectorial and graphical), yielding 27 multiple-choice items concerning four central concepts underpinning the force concept: Newton’s first, second, and third laws, and gravitation. We provide some evidence for the validity and reliability of the R-FCI; this analysis is limited to the student population of one Finnish high school. The students took the R-FCI at the beginning and at the end of their first high school physics course. We found that students’ (n=168) representational consistency (whether scientifically correct or not) varied considerably depending on the concept. On average, representational consistency and scientifically correct understanding increased during the instruction, although in the post-test only a few students performed consistently both in terms of representations and scientifically correct understanding. We also compared students’ (n=87) results of the R-FCI and the FCI, and found that they correlated quite well.
D'Antoni, Anthony V; DiLandro, Anthony C; Chusid, Eileen D; Trepal, Michael J
2012-01-01
In 2010, the New York College of Podiatric Medicine general anatomy course was redesigned to emphasize clinical anatomy. Over a 2-year period, United States Medical Licensing Examination (USMLE)-style items were used in lecture assessments with two cohorts of students (N = 200). Items were single-best-answer and extended-matching formats. Psychometric properties of items and assessments were evaluated, and anonymous student post-course surveys were administered. Mean grades for each assessment were recorded over time and compared between cohorts using analysis of variance. Correlational analyses were used to investigate the relationship between final course grades and lecture examinations. Post-course survey response rates for the cohorts were 71 of 97 (73%) and 81 of 103 (79%). The USMLE-style items had strong psychometric properties. Point biserial correlations were 0.20 and greater, and the range of students answering the items correctly was 25% to 75%. Examinations were highly reliable, with Kuder-Richardson 20 coefficients of 0.71 to 0.76. Students (>80%) reported that single-best-answer items were easier than extended-matching items. Students (>76%) believed that the items on the quizzes/examinations were similar to those found on USMLE Step 1. Most students (>84%) believed that they would do well on the anatomy section of their boards (American Podiatric Medical Licensing Examination [APMLE] Part I). Students valued USMLE-style items. These data, coupled with the psychometric data, suggest that USMLE-style items can be successfully incorporated into a basic science course in podiatric medical education. Outcomes from students who recently took the APMLE Part I suggest that incorporation of USMLE-style items into the general anatomy course was a successful measure and prepared them well.
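The Kuder-Richardson 20 coefficient cited in this record is a reliability estimate for dichotomously scored tests. A minimal sketch of the textbook formula follows, assuming items scored 1 (correct) or 0 (incorrect) and the population variance of total scores; the data are hypothetical and the abstract does not describe the authors' computation.

```python
from statistics import pvariance

def kr20(rows):
    """rows: one list of 0/1 item scores per examinee."""
    n, k = len(rows), len(rows[0])
    # Proportion answering each item correctly
    p = [sum(r[j] for r in rows) / n for j in range(k)]
    sum_pq = sum(pj * (1 - pj) for pj in p)
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum_pq / total_var)

# Hypothetical scores: six examinees, four items
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]
print(round(kr20(scores), 3))
```

KR-20 is the special case of Cronbach's α for binary items, where each item variance reduces to p(1 − p).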
Pitchford, Melanie; Ball, Linden J.; Hunt, Thomas E.; Steel, Richard
2017-01-01
We report a study examining the role of ‘cognitive miserliness’ as a determinant of poor performance on the standard three-item Cognitive Reflection Test (CRT). The cognitive miserliness hypothesis proposes that people often respond incorrectly on CRT items because of an unwillingness to go beyond default, heuristic processing and invest time and effort in analytic, reflective processing. Our analysis (N = 391) focused on people’s response times to CRT items to determine whether predicted associations are evident between miserly thinking and the generation of incorrect, intuitive answers. Evidence indicated only a weak correlation between CRT response times and accuracy. Item-level analyses also failed to demonstrate predicted response-time differences between correct analytic and incorrect intuitive answers for two of the three CRT items. We question whether participants who give incorrect intuitive answers on the CRT can legitimately be termed cognitive misers and whether the three CRT items measure the same general construct. PMID:29099840
Abstracts of ARI Research Publications, FY 1978
1980-09-01
initial item pool, 49 items were identified as having significant item-to-total-score correlations and were statistically determined to address a...failing. Differences among the three groups on main gun performance measures and the previous experience of gunners were not statistically significant...forms of the noncognitive coding speed test; and (d) a second field administration to derive norms and other statistical characteristics of the new
Development of and Field-Test Results for the CAHPS PCMH Survey
Scholle, Sarah Hudson; Vuong, Oanh; Ding, Lin; Fry, Stephanie; Gallagher, Patricia; Brown, Julie A.; Hays, Ron D.; Cleary, Paul D.
2017-01-01
Objective To develop and evaluate survey questions that assess processes of care relevant to Patient-Centered Medical Homes (PCMHs). Research Design We convened expert panels, reviewed evidence on effective care practices and existing surveys, elicited broad public input, and conducted cognitive interviews and a field test to develop items relevant to PCMHs that could be added to the CAHPS® Clinician & Group (CG-CAHPS) 1.0 Survey. Surveys were tested using a two-contact mail protocol in 10 adult and 33 pediatric practices (both private and community health centers) in Massachusetts. A total of 4,875 completed surveys were received (overall response rate of 25%). Analyses We calculated the rate of valid responses for each item. We conducted exploratory factor analyses and estimated item-to-total correlations, individual and site level reliability, and correlations among proposed multi-item composites. Results Ten items in four new domains (Comprehensiveness, Information, Self-Management Support, and Shared Decision-Making) and four items in two existing domains (Access and Coordination of Care) were selected to be supplemental items to be used in conjunction with the adult CG-CAHPS 1.0 survey. For the child version, four items in each of two new domains (Information and Self-Management Support) and five items in existing domains (Access, Comprehensiveness-Prevention, Coordination of Care) were selected. Conclusions This study provides support for the reliability and validity of new items to supplement the CG-CAHPS 1.0 survey to assess aspects of primary care that are important attributes of Patient-Centered Medical Homes. PMID:23064272
Is Your Neighborhood Designed to Support Physical Activity? A Brief Streetscape Audit Tool.
Sallis, James F; Cain, Kelli L; Conway, Terry L; Gavand, Kavita A; Millstein, Rachel A; Geremia, Carrie M; Frank, Lawrence D; Saelens, Brian E; Glanz, Karen; King, Abby C
2015-09-03
Macro level built environment factors (eg, street connectivity, walkability) are correlated with physical activity. Less studied but more modifiable microscale elements of the environment (eg, crosswalks) may also affect physical activity, but short audit measures of microscale elements are needed to promote wider use. This study evaluated the relation of a 15-item neighborhood environment audit tool with a full version of the tool to assess neighborhood design on physical activity in 4 age groups. From the 120-item Microscale Audit of Pedestrian Streetscapes (MAPS) measure of street design, sidewalks, and street crossings, we developed the 15-item version (MAPS-Mini) on the basis of associations with physical activity and attribute modifiability. As a sample of a likely walking route, MAPS-Mini was conducted on a 0.25-mile route from participant residences toward the nearest nonresidential destination for children (n = 758), adolescents (n = 897), younger adults (n = 1,655), and older adults (n = 367). Active transportation and leisure physical activity were measured with age-appropriate surveys, and accelerometers provided objective physical activity measures. Mixed-model regressions were conducted for each MAPS item and a total environment score, adjusted for demographics, participant clustering, and macrolevel walkability. Total scores of MAPS-Mini and the 120-item MAPS correlated at r = .85. Total microscale environment scores were significantly related to active transportation in all age groups. Items related to active transport in 3 age groups were presence of sidewalks, curb cuts, street lights, benches, and buffer between street and sidewalk. The total score was related to leisure physical activity and accelerometer measures only in children. The MAPS-Mini environment measure is short enough to be practical for use by community groups and planning agencies and is a valid substitute for the full version that is 8 times longer.
Cho, Hyun; Kwon, Min; Choi, Ji-Hye; Lee, Sang-Kyu; Choi, Jung Seok; Choi, Sam-Wook; Kim, Dai-Jin
2014-09-01
This study was conducted to develop and validate a standardized self-diagnostic Internet addiction (IA) scale based on the diagnosis criteria for Internet Gaming Disorder (IGD) in the Diagnostic and Statistical Manual of Mental Disorder, 5th edition (DSM-5). Items based on the IGD diagnosis criteria were developed using items of the previous Internet addiction scales. Data were collected from a community sample. The data were divided into two sets, and confirmatory factor analysis (CFA) was performed repeatedly. The model was modified after discussion with professionals based on the first CFA results, after which the second CFA was performed. The internal consistency reliability was generally good. The items that showed significantly low correlation values based on the item-total correlation of each factor were excluded. After the first CFA was performed, some factors and items were excluded. Seven factors and 26 items were prepared for the final model. The second CFA results showed good general factor loading, Squared Multiple Correlation (SMC) and model fit. The model fit of the final model was good, but some factors were very highly correlated. It is recommended that some of the factors be refined through further studies. Copyright © 2014. Published by Elsevier Ltd.
Pedraza, Otto; Graff-Radford, Neill R.; Smith, Glenn E.; Ivnik, Robert J.; Willis, Floyd B.; Petersen, Ronald C.; Lucas, John A.
2010-01-01
Scores on the Boston Naming Test (BNT) are frequently lower for African American when compared to Caucasian adults. Although demographically-based norms can mitigate the impact of this discrepancy on the likelihood of erroneous diagnostic impressions, a growing consensus suggests that group norms do not sufficiently address or advance our understanding of the underlying psychometric and sociocultural factors that lead to between-group score discrepancies. Using item response theory and methods to detect differential item functioning (DIF), the current investigation moves beyond comparisons of the summed total score to examine whether the conditional probability of responding correctly to individual BNT items differs between African American and Caucasian adults. Participants included 670 adults age 52 and older who took part in Mayo's Older Americans and Older African Americans Normative Studies. Under a 2-parameter logistic IRT framework and after correction for the false discovery rate, 12 items were shown to demonstrate DIF. Six of these 12 items (“dominoes,” “escalator,” “muzzle,” “latch,” “tripod,” and “palette”) were also identified in additional analyses using hierarchical logistic regression models and represent the strongest evidence for race/ethnicity-based DIF. These findings afford a finer characterization of the psychometric properties of the BNT and expand our understanding of between-group performance. PMID:19570311
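The 2-parameter logistic (2PL) model mentioned in this record gives the probability of a correct response as a function of ability, item discrimination, and item difficulty. The sketch below illustrates the standard 2PL item response function and how DIF can appear as different item parameters across groups at the same ability level; all parameter values are hypothetical and are not estimates from the study.

```python
import math

def p_correct_2pl(theta, a, b):
    """2PL item response function: a = discrimination, b = difficulty."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical parameters for one item in two groups (illustration only)
theta = 0.0                                         # same ability level
p_reference = p_correct_2pl(theta, a=1.2, b=-0.5)   # item easier for this group
p_focal = p_correct_2pl(theta, a=1.2, b=0.5)        # item harder for this group
print(round(p_reference, 3), round(p_focal, 3))
```

When examinees of equal ability have different probabilities of answering correctly, the item is said to function differentially; here the difference arises only from the difficulty parameter, which corresponds to uniform DIF.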
Han, Kihwan; Oh, Sangho; Choi, Jaehoon; Park, Sang Woo
2018-05-01
Alar transfixion sutures are commonly used for vestibular web correction. The purpose of this study was to evaluate the long-term results of the use of alar transfixion sutures in patients with a unilateral cleft lip nasal deformity using photogrammetric analysis. The study included 42 patients who were divided into child and adult groups. A total of 4 measurement items were evaluated from a basal view by photogrammetry using standardized clinical photographic techniques preoperatively, immediately postoperatively, 3 months postoperatively, and 6 months postoperatively. When the preoperative and last postoperative values were compared, no significant changes in any measurement items were noted in the adult group. In the child group, the proportional index (the ratio of the cleft side to the noncleft side) of the alar slope line inclination was significantly increased, but other measurement items showed no significant change. When the measurement items were compared between time points, no significant changes in any measurement items were noted in the adult group. In the child group, the proportional indexes of the alar length, the width between the subnasale and the alare, and the webbing degree were significantly decreased immediately postoperatively compared with the preoperative values. However, these significant changes were diminished at 3 months postoperatively. The proportional index of the alar slope line inclination was significantly increased at 3 months postoperatively compared with the preoperative value, but the significant change was diminished at 6 months postoperatively. The alar transfixion suture procedure is not effective for correcting a vestibular web and alar-facial groove.
de Brouwer, B J M; Kaljouw, M J; Kramer, M; Schmalenberg, C; van Achterberg, T
2014-03-01
Translate the Essentials of Magnetism II© (EOMII; Dutch Nurses' Association, Utrecht, The Netherlands) and assess its psychometric properties in a culture different from its origin. The EOMII, developed in the USA, measures the extent to which organizations/units provide healthy, productive and satisfying work environments. As many healthcare organizations are facing difficulties in attracting and retaining staff nurses, the EOMII provides the opportunity to assess the health and effectiveness of work environments. A three-phased (respectively N = 13, N = 74 and N = 2542) combined descriptive and correlational design was undertaken to translate the EOMII and evaluate its validity and psychometric qualities for Dutch hospitals (December 2009-January 2010). We performed forward-backward translation, face and content validation via cross-sectional survey research, and semi-structured interviews on relevance, clarity, and recognizability of the instrument's items. Psychometric testing included principal component analysis using varimax rotation, item-total statistics, and reliability in terms of internal consistency (Cronbach's α) for the total scale and its subscales. Face validity was confirmed. Items were recognizable, relevant and clear. Confirmatory factor analysis indicated that five of eight subscales formed clear factors. Three original subscales contained two factors. Item-total correlations ranged from 0.43 to 0.83. One item correlated weakly (0.24) with its subscale. Cronbach's α for the entire scale was 0.92 and ranged from 0.58 to 0.92 for eight subscales. Dutch-translated EOMII (D-EOMII) demonstrated acceptable reliability and validity for assessing hospital staff nurses' work environment. The D-EOMII can be useful and effective in identifying areas in which change is needed for a hospital to pursue an excellent work environment that attracts and retains well-qualified nurses. © 2013 International Council of Nurses.
Pugh, Stephanie L.; Wyatt, Gwen; Wong, Raimond K. W.; Sagar, Stephen M.; Yueh, Bevan; Singh, Anurag K.; Yao, Min; Nguyen-Tan, Phuc Felix; Yom, Sue S.; Cardinale, Francis S.; Sultanem, Khalil; Hodson, D. Ian; Krempl, Greg A.; Chavez, Ariel; Yeh, Alexander M.; Bruner, Deborah W.
2016-01-01
Context The 15-item University of Washington Quality of Life questionnaire – Radiation Therapy Oncology Group (RTOG) modification (UW-QOL-RTOG modification) has been used in several trials of head and neck cancer conducted by NRG Oncology such as RTOG 9709, RTOG 9901, RTOG 0244, and RTOG 0537. Objectives This study is an exploratory factor analysis (EFA) to establish validity and reliability of the instrument subscales. Methods EFA on the UW-QOL-RTOG modification was conducted using baseline data from NRG Oncology's RTOG 0537, a trial of acupuncture-like transcutaneous electrical nerve stimulation in treating radiation-induced xerostomia. Cronbach's α coefficient was calculated to measure reliability; correlation with the University of Michigan Xerostomia Related Quality of Life Scale (XeQOLS) was used to evaluate concurrent validity; and correlations between consecutive time points were used to assess test-retest reliability. Results The 15-item EFA of the modified tool resulted in 11 items split into 4 factors: mucus, eating, pain, and activities. Cronbach's α ranged from 0.71 to 0.93 for the factors and total score, consisting of all 11 items. There were strong correlations (ρ≥0.60) between consecutive time points and between total score and the XeQOLS total score (ρ>0.65). Conclusion The UW-QOL-RTOG modification is a valid tool that can be used to assess symptom burden of head and neck cancer patients receiving radiation therapy or those who have recently completed radiation. The modified tool has acceptable reliability, concurrent validity, and test-retest reliability in this patient population, as well as the advantage of having been shortened from 15 to 11 items. PMID:27899312
Cross-cultural comparisons of the Mini-mental State Examination between Japanese and U.S. cohorts
Meguro, Kenichi; Ishii, Hiroshi; Yamaguchi, Satoshi; Saxton, Judith A.; Ganguli, Mary
2009-01-01
Background The Mini-mental State Examination (MMSE) is widely used in Japan and the U.S.A. for cognitive screening in the clinical setting and in epidemiological studies. A previous Japanese community study reported distributions of the MMSE total score very similar to that of the U.S.A. Methods Data were obtained from the Monongahela Valley Independent Elder's Study (MoVIES), a representative sample of community-dwelling elderly people aged 65 and older living near Pittsburgh, U.S.A., and from the Tajiri Project, with similar aims in Tajiri, Japan. We examined item-by-item distributions of the MMSE between two cohorts, comparing (1) percentage of correct answers for each item within each cohort, and (2) relative difficulty of each item measured by Item Characteristic Curve analysis (ICC), which estimates log odds of obtaining a correct answer adjusted for the remaining MMSE items, demographic variables (age, gender, education) and interactions of demographic variables and cohort. Results Median MMSE scores were very similar between the two samples within the same education groups. However, the relative difficulty of each item differed substantially between the two cohorts. Specifically, recall and auditory comprehension were easier for the Tajiri group, but reading comprehension and sentence construction were easier for the MoVIES group. Conclusions Our results reaffirm the importance of validation and examination of thresholds in each cohort to be studied when a common instrument is used as a dementia screening tool or for defining cognitive impairment. PMID:18925977
Validation of the Arabic Version of the Infant Feeding Intentions Scale Among Lebanese Women.
Yehya, Nadine; Tamim, Hani; Shamsedine, Lama; Ayash, Soumaya; Abdel Khalek, Lama; Abou Ezzi, Amanda; Nabulsi, Mona
2017-05-01
The Infant Feeding Intentions (IFI) scale was shown to reliably measure maternal intentions to initiate breastfeeding and continue exclusive breastfeeding until 1, 3, or 6 months in English and Spanish but not in Arab contexts. Research aim: This study aimed to validate an Arabic version of the IFI scale (IFI-A) and examine its ability to predict exclusive breastfeeding at 1, 3, or 6 months in pregnant Lebanese women. The internal consistency reliability and construct validity of the IFI-A scale were tested on 50 pregnant women (Group 1), whereas its predictive ability was tested on 196 pregnant women (Group 2), who were surveyed monthly about their infants' nutrition method until 6 months. The IFI-A scale's Cronbach's alpha internal consistency reliability is .82. Its corrected item-total correlations ranged from .26 for Item 2 ("at least give breastfeeding a try") to .86 for Item 4 ("will be exclusively breastfeeding at 3 months"). Exploratory factor analysis revealed that it is unidimensional. IFI-A scores correlated significantly with exclusive breastfeeding duration in Group 1 ( r = .624; p = .001) and with participants' breastfeeding attitude ( r = .390; p < .001) and previous breastfeeding duration ( r = .237; p = .011) in Group 2, thus confirming its external construct validity. In adjusted analysis, the IFI-A scale predicted exclusive breastfeeding at 3 months, albeit weakly (odds ratio = 1.16; 95% confidence interval [0.99, 1.36]), but not at 1 or 6 months. The IFI-A scale is a reliable and valid tool to assess maternal feeding intentions and predict exclusive breastfeeding at 3 months in the Arab context. Further studies are needed in other Arab contexts to confirm our findings.
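The corrected item-total correlations reported above are commonly computed by correlating each item with the sum of the remaining items, so that an item is not correlated with itself. The sketch below assumes that convention; the helper names and the `demo` data are hypothetical and are not the IFI-A responses.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation; returns NaN if either variable has no variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / sqrt(sxx * syy)

def corrected_item_total(rows):
    """For each item, correlate it with the total of all OTHER items."""
    result = []
    for j in range(len(rows[0])):
        item = [r[j] for r in rows]
        rest = [sum(r) - r[j] for r in rows]   # total score minus this item
        result.append(pearson(item, rest))
    return result

# Hypothetical data: items 0 and 1 agree, item 2 runs against them
demo = [[1, 1, 3], [2, 2, 1], [3, 3, 2], [4, 4, 1]]
r = corrected_item_total(demo)
```

A low value, such as the .26 reported for Item 2, indicates an item that tracks the rest of the scale only weakly.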
Support for an auto-associative model of spoken cued recall: evidence from fMRI.
de Zubicaray, Greig; McMahon, Katie; Eastburn, Mathew; Pringle, Alan J; Lorenz, Lina; Humphreys, Michael S
2007-03-02
Cued recall and item recognition are considered the standard episodic memory retrieval tasks. However, only the neural correlates of the latter have been studied in detail with fMRI. Using an event-related fMRI experimental design that permits spoken responses, we tested hypotheses from an auto-associative model of cued recall and item recognition [Chappell, M., & Humphreys, M. S. (1994). An auto-associative neural network for sparse representations: Analysis and application to models of recognition and cued recall. Psychological Review, 101, 103-128]. In brief, the model assumes that cues elicit a network of phonological short term memory (STM) and semantic long term memory (LTM) representations distributed throughout the neocortex as patterns of sparse activations. This information is transferred to the hippocampus which converges upon the item closest to a stored pattern and outputs a response. Word pairs were learned from a study list, with one member of the pair serving as the cue at test. Unstudied words were also intermingled at test in order to provide an analogue of yes/no recognition tasks. Compared to incorrectly rejected studied items (misses) and correctly rejected (CR) unstudied items, correctly recalled items (hits) elicited increased responses in the left hippocampus and neocortical regions including the left inferior prefrontal cortex (LIPC), left mid lateral temporal cortex and inferior parietal cortex, consistent with predictions from the model. This network was very similar to that observed in yes/no recognition studies, supporting proposals that cued recall and item recognition involve common rather than separate mechanisms.
van der Vaart, Rosalie; Drossaert, Constance
2017-01-24
With the digitization of health care and the wide availability of Web-based applications, a broad set of skills is essential to properly use such facilities; these skills are called digital health literacy or eHealth literacy. Current instruments to measure digital health literacy focus only on information gathering (Health 1.0 skills) and do not pay attention to interactivity on the Web (Health 2.0). To measure the complete spectrum of Health 1.0 and Health 2.0 skills, including actual competencies, we developed a new instrument. The Digital Health Literacy Instrument (DHLI) measures operational skills, navigation skills, information searching, evaluating reliability, determining relevance, adding self-generated content, and protecting privacy. Our objective was to study the distributional properties, reliability, content validity, and construct validity of the DHLI's self-report scale (21 items) and to explore the feasibility of an additional set of performance-based items (7 items). We used a paper-and-pencil survey among a sample of the general Dutch population, stratified by age, sex, and educational level (T1; N=200). The survey consisted of the DHLI, sociodemographics, Internet use, health status, health literacy and the eHealth Literacy Scale (eHEALS). After 2 weeks, we asked participants to complete the DHLI again (T2; n=67). Cronbach alpha and intraclass correlation analysis between T1 and T2 were used to investigate reliability. Principal component analysis was performed to determine content validity. Correlation analyses were used to determine the construct validity. Respondents (107 female and 93 male) ranged in age from 18 to 84 years (mean 46.4, SD 19.0); 23.0% (46/200) had a lower educational level. Internal consistencies of the total scale (alpha=.87) and the subscales (alpha range .70-.89) were satisfactory, except for protecting privacy (alpha=.57). Distributional properties showed an approximately normal distribution. 
Test-retest analysis was satisfactory overall (total scale intraclass correlation coefficient=.77; subscale intraclass correlation coefficient range .49-.81). The performance-based items did not together form a single construct (alpha=.47) and should be interpreted individually. Results showed that more complex skills were reflected in a lower number of correct responses. Principal component analysis confirmed the theoretical structure of the self-report scale (76% explained variance). Correlations were as expected, showing significant relations with age (ρ=-.41, P<.001), education (ρ=.14, P=.047), Internet use (ρ=.39, P<.001), health-related Internet use (ρ=.27, P<.001), health status (ρ range .17-.27, P<.001), health literacy (ρ=.31, P<.001), and the eHEALS (ρ=.51, P<.001). This instrument can be accepted as a new self-report measure to assess digital health literacy, using multiple subscales. Its performance-based items provide an indication of actual skills but should be studied and adapted further. Future research should examine the acceptability of this instrument in other languages and among different populations. ©Rosalie van der Vaart, Constance Drossaert. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 24.01.2017.
van der Vaart, Rosalie
2017-01-01
Background With the digitization of health care and the wide availability of Web-based applications, a broad set of skills is essential to properly use such facilities; these skills are called digital health literacy or eHealth literacy. Current instruments to measure digital health literacy focus only on information gathering (Health 1.0 skills) and do not pay attention to interactivity on the Web (Health 2.0). To measure the complete spectrum of Health 1.0 and Health 2.0 skills, including actual competencies, we developed a new instrument. The Digital Health Literacy Instrument (DHLI) measures operational skills, navigation skills, information searching, evaluating reliability, determining relevance, adding self-generated content, and protecting privacy. Objective Our objective was to study the distributional properties, reliability, content validity, and construct validity of the DHLI’s self-report scale (21 items) and to explore the feasibility of an additional set of performance-based items (7 items). Methods We used a paper-and-pencil survey among a sample of the general Dutch population, stratified by age, sex, and educational level (T1; N=200). The survey consisted of the DHLI, sociodemographics, Internet use, health status, health literacy and the eHealth Literacy Scale (eHEALS). After 2 weeks, we asked participants to complete the DHLI again (T2; n=67). Cronbach alpha and intraclass correlation analysis between T1 and T2 were used to investigate reliability. Principal component analysis was performed to determine content validity. Correlation analyses were used to determine the construct validity. Results Respondents (107 female and 93 male) ranged in age from 18 to 84 years (mean 46.4, SD 19.0); 23.0% (46/200) had a lower educational level. Internal consistencies of the total scale (alpha=.87) and the subscales (alpha range .70-.89) were satisfactory, except for protecting privacy (alpha=.57). 
Distributional properties showed an approximately normal distribution. Test-retest analysis was satisfactory overall (total scale intraclass correlation coefficient=.77; subscale intraclass correlation coefficient range .49-.81). The performance-based items did not together form a single construct (alpha=.47) and should be interpreted individually. Results showed that more complex skills were reflected in a lower number of correct responses. Principal component analysis confirmed the theoretical structure of the self-report scale (76% explained variance). Correlations were as expected, showing significant relations with age (ρ=–.41, P<.001), education (ρ=.14, P=.047), Internet use (ρ=.39, P<.001), health-related Internet use (ρ=.27, P<.001), health status (ρ range .17-.27, P<.001), health literacy (ρ=.31, P<.001), and the eHEALS (ρ=.51, P<.001). Conclusions This instrument can be accepted as a new self-report measure to assess digital health literacy, using multiple subscales. Its performance-based items provide an indication of actual skills but should be studied and adapted further. Future research should examine the acceptability of this instrument in other languages and among different populations. PMID:28119275
Validation of the Spanish Version of the COPD-Q Questionnaire on COPD Knowledge.
Puente-Maestu, Luis; Chancafe-Morgan, Jorge; Calle, Myriam; Rodríguez-Hermosa, Juan L; Malo de Molina, Rosa; Ortega-González, Ángel; Fuster, Antonia; Márquez-Martín, Eduardo; Marcos, Pedro J; Ramírez, Laura; Ray, Shaunta'; Franks, Andrea
2016-01-01
Although recognition of the importance of educating chronic obstructive pulmonary disease (COPD) patients has grown in recent years, their understanding of this disease is not being measured due to a lack of specific instruments. The aim of this study was to validate the COPD-Q questionnaire, a 13-item instrument for determining COPD knowledge. The COPD-Q was translated and backtranslated, and subsequently submitted to logic and content validation by a group of COPD experts and 8 COPD patients. Reliability was studied in an independent group of 59 patients with severe COPD seen in the pulmonology ward or clinics of 6 hospitals in Spain (Andalusia, Baleares, Castilla-La Mancha, Galicia and Madrid). This sample was also used for other internal and external validations. The mean age of the group was approximately 70 years and their health awareness was low-to-medium. The number of correct answers was 8.3 (standard deviation: 1.9), median 8, range 3-13. Floor and ceiling effects were 0% and 1.5%, respectively. Internal consistency of the questionnaire was good (Cronbach's alpha=0.85) and reliability was also high, with a kappa coefficient >0.6 for all items and an intraclass correlation coefficient of 0.84 for the total score. The 13-item COPD-Q is a valid, applicable and reliable instrument for determining patients' knowledge of COPD. Copyright © 2014 SEPAR. Published by Elsevier Espana. All rights reserved.
Vonderlin, Eva; Ropeter, Anna; Pauen, Sabina
2012-09-01
The Infant Behavior Questionnaire Revised (IBQ-R; Gartstein & Rothbart, 2003) is one of the most common parent-report instruments for assessing infant temperament. This study evaluated the psychometric properties of a German version. We studied item characteristics, internal consistency, and descriptive statistics for all 14 scales in a sample of 7- to 9-month-old infants and their mothers (N = 119). Factor analysis was conducted to identify higher-order relationships between the scales. Item analysis showed mixed corrected item-total correlations. Internal consistencies were all moderate to high. Results of the factor analysis confirmed the two dimensions of Surgency/Extraversion and Negative Affectivity, whereas the dimension Orienting/Regulation was not replicated. In contrast to the American sample, activity level in the German sample loaded on the factor Negative Affectivity. The scales low intensity pleasure and soothability, which loaded on factor Orienting/Regulation in the original version, showed substantial loadings on both dimensions Surgency/Extraversion and Negative Affectivity (inverted), whereas the scale duration of orienting was located on the factor Surgency/Extraversion. The German version of the IBQ-R provides a satisfying instrument for investigating infant temperament. However, further work is needed to improve the methodological quality of the questionnaire. Further research should especially focus on the factor structure of infant temperament. We suggest developing a shorter version and testing it with a larger and more diverse sample.
Assessing Patients’ Experiences with Communication Across the Cancer Care Continuum
Mazor, Kathleen M.; Street, Richard L.; Sue, Valerie M.; Williams, Andrew E.; Rabin, Borsika A.; Arora, Neeraj K.
2016-01-01
Objective To evaluate the relevance, performance and potential usefulness of the Patient Assessment of cancer Communication Experiences (PACE) items. Methods Items focusing on specific communication goals related to exchanging information, fostering healing relationships, responding to emotions, making decisions, enabling self-management, and managing uncertainty were tested via a retrospective, cross-sectional survey of adults who had been diagnosed with cancer. Analyses examined response frequencies, inter-item correlations, and coefficient alpha. Results A total of 366 adults were included in the analyses. Relatively few selected “Does Not Apply”, suggesting that items tap relevant communication experiences. Ratings of whether specific communication goals were achieved were strongly correlated with overall ratings of communication, suggesting item content reflects important aspects of communication. Coefficient alpha was ≥.90 for each item set, indicating excellent reliability. Variations in the percentage of respondents selecting the most positive response across items suggest results can identify strengths and weaknesses. Conclusion The PACE items tap relevant, important aspects of communication during cancer care, and may be useful to cancer care teams desiring detailed feedback. PMID:26979476
Gul, Asiye; Andsoy, Isil Isik; Ozkaya, Birgul; Zeydan, Ayten
2017-06-01
Nurses' knowledge of pressure ulcer (PU) prevention and management is an important first step in the provision of optimal care. To evaluate PU prevention/risk, staging, and wound description knowledge, a descriptive, cross-sectional survey was conducted among nurses working in an acute care Turkish hospital. The survey instrument was a modified and translated version of the Pieper Pressure Ulcer Knowledge Test (PUKT), and its validity and reliability were established. Nurses completed a Personal Characteristics Form, including sociodemographic information and exposure to educational presentations and information about and experience with PUs, followed by the 49-item modified PUKT which includes 33 prevention/risk items, 9 staging items, and 7 wound description items. All items are true/false questions with an I don't know option (scoring: minimum 0, maximum 49). Correct answers received 1 point and incorrect/unknown answers received 0 points. The paper-pencil questionnaires were distributed by 2 researchers to all nurses in the participating hospital and completed by those willing to be included. Responses were analyzed using descriptive statistics. Pearson's correlation test was used to examine the relationship between quantitative variables, and mean scores were compared using the Mann-Whitney U and Kruskal-Wallis tests. Among the 308 participating nurses (mean age 29.5 ± 8.1 [range 19-56] years) most were women (257, 83.4%) with 7.3 ± 7.8 (range 1-36) years of experience. The mean knowledge score for the entire sample was 29.7 ± 6.7 (range 8-42). The overall percentage of correct answers was 60.6% to 61.8% for PU prevention/risk assessment, 60% for wound description, and 56.6% for PU staging. Knowledge scores were significantly (P <.05) higher for participants who attended at least 1 lecture/conference/course on PUs in the last year, read articles/books about PUs, cared for patients with PUs, or believed their patients were at risk for PU development. 
Most participants (180, 58.4%) scored 60% or more correct; 8 (2.6%) correctly answered 80% or more of the items. The lowest number of correct answers was for the item, "Bunny boots and gel pads relieve pressure on the heels" (22, 7.1%). The results of this study suggest education and experience caring for patients who are at risk for or have a PU affect nurses' knowledge. This study, and additional research examining nurse knowledge, will help the development of much-needed education programs.
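The scoring rule described above (1 point per correct answer, 0 for an incorrect or "I don't know" answer, maximum 49) can be sketched as follows. The function name, the five-item answer key, and the responses are hypothetical; only the 49-item maximum and the mean score of 29.7 come from the abstract.

```python
def score_test(responses, answer_key):
    """1 point per correct answer; incorrect or "don't know" answers score 0."""
    return sum(1 for given, correct in zip(responses, answer_key)
               if given == correct)

# Hypothetical 5-item excerpt; "?" stands for the "I don't know" option
key = ["T", "F", "T", "T", "F"]
answers = ["T", "?", "T", "F", "F"]
points = score_test(answers, key)

# Reported mean score expressed as a share of the 49-item maximum
mean_pct = round(29.7 / 49 * 100, 1)
```

Expressed this way, the reported mean of 29.7 points corresponds to about 60.6% of the maximum, consistent with the 60.6% figure given in the abstract.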
Validation of mothers' reports of dietary intake by four to seven year-old children.
Basch, C E; Shea, S; Arliss, R; Contento, I R; Rips, J; Gutin, B; Irigoyen, M; Zybert, P
1990-01-01
The validity of mothers' recall of four to seven year-old children's diet was assessed among 46 first generation Latino immigrant families from the Dominican Republic by comparing intake recalled by the mother to unobtrusive home observations of children. Correlations were moderate to high for calories and for most nutrients. There were no differences in mean intake of total calories or in intake of most macronutrients and micronutrients assessed. At least two-thirds of the children in the lowest (or highest) quintile based on home observations were correctly classified into the lowest or second lowest (or highest) quintiles based on mother's reports for calories and most nutrients. For all food items that were both observed and reported, 51 percent of reported portion sizes were equivalent to observed portion sizes, 15.5 percent were smaller, and 33.5 percent were larger. There was fair to good agreement on the number of food items eaten, with the exception of vegetables. Mothers' recall appears to be useful for classifying children by intake of calories, macronutrients and micronutrients, but provides a somewhat less accurate measure of actual foods eaten, portion sizes, and nutrient levels consumed. PMID:2240296
Krumm, Sabine; Kivisaari, Sasa L; Monsch, Andreas U; Reinhardt, Julia; Ulmer, Stephan; Stippich, Christoph; Kressig, Reto W; Taylor, Kirsten I
2017-05-01
The parietal lobe is important for successful recognition memory, but its role is not yet fully understood. We investigated the parietal lobes' contribution to immediate paired-associate memory and delayed item-recognition memory separately for hits (targets) and correct rejections (distractors). We compared the behavioral performance of 56 patients with known parietal and medial temporal lobe dysfunction (i.e. early Alzheimer's Disease) to 56 healthy control participants in an immediate paired and delayed single item object memory task. Additionally, we performed voxel-based morphometry analyses to investigate the functional-neuroanatomic relationships between performance and voxel-based estimates of atrophy in whole-brain analyses. Behaviorally, all participants performed better identifying targets than rejecting distractors. The voxel-based morphometry analyses associated atrophy in the right ventral parietal cortex with fewer correct responses to familiar items (i.e. hits) in the immediate and delayed conditions. Additionally, medial temporal lobe integrity correlated with better performance in rejecting distractors, but not in identifying targets, in the immediate paired-associate task. Our findings suggest that the parietal lobe critically supports successful immediate and delayed target recognition memory, and that the ventral aspect of the parietal cortex and the medial temporal lobe may have complementary preferences for identifying targets and rejecting distractors, respectively, during recognition memory. Copyright © 2017. Published by Elsevier Inc.
2013-01-01
Background In light of its epidemic proportions in developed and developing countries, obesity is considered a serious public health issue. In order to increase knowledge concerning the ability of health care professionals in caring for obese adolescents and to adopt more efficient preventive and control measures, a questionnaire was developed and validated to assess non-dietitian health professionals regarding their Knowledge of Nutrition in Obese Adolescents (KNOA). Methods The development and evaluation of a questionnaire to assess the knowledge of primary care practitioners with respect to nutrition in obese adolescents was carried out in five phases, as follows: 1) definition of study dimensions; 2) development of 42 questions and preliminary evaluation of the questionnaire by a panel of experts; 3) characterization and selection of primary care practitioners (35 dietitians and 265 non-dietitians) and measurement of questionnaire criteria by contrasting the responses of dietitians and non-dietitians; 4) reliability assessment by question exclusion based on item difficulty (too easy and too difficult for non-dietitian practitioners), item discrimination, internal consistency and reproducibility index determination; and 5) scoring the completed questionnaires. Results Dietitians obtained higher scores than non-dietitians (Mann–Whitney U test, P < 0.05), confirming the validity of the questionnaire criteria. Items were discriminated by correlating the score for each item with the total score, using a minimum of 0.2 as a correlation coefficient cutoff value. Item difficulty was controlled by excluding questions answered correctly by more than 90% of the non-dietitian subjects (too easy) or by less than 10% of them (too difficult). The final questionnaire contained 26 of the original 42 questions, increasing Cronbach’s α value from 0.788 to 0.807. Test-retest agreement between respondents was classified as good to very good (Kappa test, >0.60). 
Conclusion The KNOA questionnaire developed for primary care practitioners is a valid, consistent and suitable instrument that can be applied over time, making it a promising tool for developing and guiding public health policies. PMID:23865564
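The item-exclusion rules described in this record (drop items answered correctly by more than 90% or fewer than 10% of respondents, then keep items whose item-total correlation is at least 0.2) can be sketched as below. The function names and the `demo` matrix are hypothetical; the three thresholds are the ones stated in the abstract, and the order in which the two filters are applied is an assumption.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation; returns NaN if either variable has no variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / sqrt(sxx * syy)

def select_items(rows, min_p=0.10, max_p=0.90, min_r=0.2):
    """rows: 0/1 scores, one list per respondent. Returns kept item indices."""
    n = len(rows)
    totals = [sum(r) for r in rows]
    kept = []
    for j in range(len(rows[0])):
        item = [r[j] for r in rows]
        difficulty = sum(item) / n           # proportion answering correctly
        if difficulty > max_p or difficulty < min_p:
            continue                         # too easy or too difficult
        if pearson(item, totals) >= min_r:   # item discrimination check
            kept.append(j)
    return kept

# Hypothetical 5 respondents x 5 items (illustration only)
demo = [
    [1, 0, 1, 1, 0],
    [1, 0, 1, 1, 0],
    [1, 0, 1, 0, 1],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 1],
]
kept = select_items(demo)
```

In this toy matrix, item 0 is dropped as too easy, item 1 as too difficult, and item 4 for weak discrimination, leaving items 2 and 3.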
Assessing adolescents' personality with the NEO PI-R.
De Fruyt, F; Mervielde, I; Hoekstra, H A; Rolland, J P
2000-12-01
The suitability of the Revised NEO Personality Inventory (NEO PI-R) to assess adolescents' personality traits was investigated in an unselected heterogeneous sample of 469 adolescents aged 12 to 17 years. They were further administered the Hierarchical Personality Inventory for Children (HiPIC) to allow an examination of convergent and discriminant validity. The adult NEO PI-R factor structure proved to be highly replicable in the sample of adolescents, with all facet scales primarily loading on the expected factors, independent of the age group. Domain and facet internal consistency coefficients were comparable to those obtained in adult samples, with less than 12% of the items showing corrected item-facet correlations below absolute value .20. Although, in general, adolescents reported few difficulties with the comprehensibility of the items, they tended to report more problems with the Openness to Ideas (O5) and Openness to Values (O6) items. Correlations between NEO PI-R and HiPIC scales underscored the convergent and discriminant validity of the NEO facets and HiPIC scales. It was concluded that the NEO PI-R in its present form is useful for assessing adolescents' traits at the primary level, but additional research is necessary to infer the most appropriate facet level structure.
Conjunctive and Disjunctive Extensions of the Least Squares Distance Model of Cognitive Diagnosis
ERIC Educational Resources Information Center
Dimitrov, Dimiter M.; Atanasov, Dimitar V.
2012-01-01
Many models of cognitive diagnosis, including the "least squares distance model" (LSDM), work under the "conjunctive" assumption that a correct item response occurs when all latent attributes required by the item are correctly performed. This article proposes a "disjunctive" version of the LSDM under which the correct item response occurs when "at…
The Effect of Guessing on Item Reliability under Answer-Until-Correct Scoring
ERIC Educational Resources Information Center
Kane, Michael; Moloney, James
1978-01-01
The answer-until-correct (AUC) procedure requires that examinees respond to a multi-choice item until they answer it correctly. Using a modified version of Horst's model for examinee behavior, this paper compares the effect of guessing on item reliability for the AUC procedure and the zero-one scoring procedure. (Author/CTM)
Trunk control test as an early predictor of stroke rehabilitation outcome.
Franchignoni, F P; Tesio, L; Ricupero, C; Martino, M T
1997-07-01
The aim of this study was to investigate the construct and predictive validity of the Trunk Control Test (TCT) in postacute stroke patients by comparing TCT scores at admission and discharge with the Functional Independence Measure (FIM) scores. Forty-nine patients participated in the study. The TCT examines four movements: rolling from a supine position to the weak side (T1) and to the strong side (T2), sitting up from a lying-down position (T3), and sitting balance (T4). The FIM is an 18-item scale (13 motor [motFIM] and 5 cognitive [cognFIM]) used to determine the level of dependence of patients in daily life. Thirty-six patients (73%) increased their TCT overall score at discharge. The TCT item-total correlations were high, both at admission and discharge (P < .0001). The individual TCT items were intercorrelated. Furthermore, the homogeneity of the TCT was confirmed by a high Cronbach's index. High correlations were found between admission and discharge scores in the different tests (TCT, FIM, and motFIM; P < .0001) and between TCT at admission and FIM (P < .0001) and motFIM (P < .0001) at admission. TCT at admission alone explained 71% of the variance in motFIM at discharge. The TCT showed a good sensitivity to change in assessing recovery of stroke patients. The high item-total correlation and Cronbach's alpha value of the TCT suggest that there is one homogeneous construct underlying the item list. The TCT construct validity was confirmed by the correlation between this test and the FIM scores. TCT at admission predicted motFIM at discharge even better than motFIM at admission alone. Possibly, the TCT captures basic motor skills that foreshadow the recovery of more complex behavioral skills described by the FIM.
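The statement that TCT at admission alone explained 71% of the variance in motFIM at discharge can be related to a correlation coefficient. The sketch below assumes a simple linear regression with a single predictor, where the proportion of variance explained equals the squared Pearson correlation; the function name is hypothetical, and the sign of the correlation cannot be recovered from the variance explained alone.

```python
from math import sqrt

def implied_abs_correlation(variance_explained):
    """|r| implied by R^2 in a one-predictor linear regression."""
    if not 0.0 <= variance_explained <= 1.0:
        raise ValueError("variance explained must lie between 0 and 1")
    return sqrt(variance_explained)

abs_r = implied_abs_correlation(0.71)   # 71% of variance, from the abstract
```

Under that assumption, 71% of variance explained corresponds to an absolute correlation of roughly 0.84 between the two scores.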
Psychometric assessment of a scale to measure bonding workplace social capital
Tsutsumi, Akizumi; Inoue, Akiomi; Odagiri, Yuko
2017-01-01
Objectives Workplace social capital (WSC) has attracted increasing attention as an organizational and psychosocial factor related to worker health. This study aimed to assess the psychometric properties of a newly developed WSC scale for use in work environments, where bonding social capital is important. Methods We assessed the psychometric properties of a newly developed 6-item scale to measure bonding WSC using two data sources. Participants were 1,650 randomly selected workers who completed an online survey. Exploratory factor analyses were conducted. We examined the item–item and item–total correlations, internal consistency, and associations between scale scores and a previous 8-item measure of WSC. We evaluated test–retest reliability by repeating the survey with 900 of the respondents 2 weeks later. The overall scale reliability was quantified by an intraclass coefficient and the standard error of measurement. We evaluated convergent validity by examining the association with several relevant workplace psychosocial factors using a dataset from workers employed by an electrical components company (n = 2,975). Results The scale was unidimensional. The item–item and item–total correlations ranged from 0.52 to 0.78 (p < 0.01) and from 0.79 to 0.89 (p < 0.01), respectively. Internal consistency was good (Cronbach’s α coefficient: 0.93). The correlation with the 8-item scale indicated high criterion validity (r = 0.81) and the scale showed high test–retest reliability (r = 0.74, p < 0.01). The intraclass coefficient and standard error of measurement were 0.74 (95% confidence intervals: 0.71–0.77) and 4.04 (95% confidence intervals: 1.86–6.20), respectively. Correlations with relevant workplace psychosocial factors showed convergent validity. Conclusions The results confirmed that the newly developed WSC scale has adequate psychometric properties. PMID:28662058
A Shorter Short Version of Barron's Ego Strength Scale
ERIC Educational Resources Information Center
Kelly, William E.; Daughtry, Don
2018-01-01
This study developed an abbreviated form of Barron's (1953) Ego Strength Scale for use in research among college student samples. A version of Barron's scale was administered to 100 undergraduate college students. Using item-total score correlations and internal consistency, the scale was reduced to 18 items (Es18). The Es18 possessed adequate…
Tepe, Rodger; Tepe, Chabha
2015-03-01
To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
Tepe, Rodger; Tepe, Chabha
2015-01-01
Objective To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. Methods In this test–retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. Results The IL self-efficacy survey demonstrated good reliability (test–retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test–retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). Conclusions This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments. PMID:25517736
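Two of the statistics reported for the knowledge test, KR-20 and the point-biserial item discrimination, can be sketched for right/wrong (0/1) items. The sketch assumes the usual KR-20 formula k/(k−1) · (1 − Σ p·q / variance of total scores), computed with the population variance, and computes the point-biserial as the Pearson correlation between a binary item and the total score. Function names and the `demo` data are hypothetical.

```python
from math import sqrt
from statistics import pvariance

def kr20(rows):
    """rows: 0/1 scores, one list per examinee."""
    k, n = len(rows[0]), len(rows)
    p = [sum(r[j] for r in rows) / n for j in range(k)]   # proportion correct
    pq_sum = sum(pj * (1 - pj) for pj in p)
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - pq_sum / total_var)

def point_biserial(item, totals):
    """Pearson correlation between a 0/1 item and the total scores."""
    n = len(item)
    mx, my = sum(item) / n, sum(totals) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(item, totals))
    sxx = sum((a - mx) ** 2 for a in item)
    syy = sum((b - my) ** 2 for b in totals)
    return sxy / sqrt(sxx * syy)

# Hypothetical 5 examinees x 4 items (illustration only)
demo = [[1, 1, 1, 0], [1, 1, 0, 0], [1, 0, 0, 0], [1, 1, 1, 1], [0, 0, 0, 0]]
reliability = kr20(demo)
rpb_first = point_biserial([r[0] for r in demo], [sum(r) for r in demo])
```

KR-20 is the special case of Cronbach's α for dichotomous items, which is why the record reports α for the self-efficacy survey and KR20 for the knowledge test.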
Outcome-based self-assessment on a team-teaching subject in the medical school
Cho, Sa Sun
2014-01-01
We investigated why students received worse grades in gross anatomy and how the teaching method could be improved, since gaps between teaching and learning had emerged under the recently changed integrated curriculum. General characteristics of the students and exploratory factors used to test validity were compared between 2011 and 2012. Students were asked to complete a short Likert-scale survey. Although the percentage of acceptable items was similar between professors, professor C preferred questions with adequate item discrimination and inappropriate item difficulty, whereas professor Y preferred questions with adequate item discrimination and appropriate item difficulty, a statistically significant difference (P<0.01). The survey revealed that 26.5% of all students gave up on professor Y's gross anatomy exam, irrespective of year. These results suggest that students' academic achievement was affected by the corrected item difficulty rather than by item discrimination. Therefore, professors in a team-taught subject should reach a consensus on item difficulty, together with proper teaching methods. PMID:25548724
Sex knowledge and attitudes of moderately retarded males.
Edmonson, B; Wish, J
1975-09-01
In semistructured interview sessions, 18 moderately retarded men undergoing deinstitutional training were questioned to determine their understanding of pictures of homosexual embrace, masturbation, dating, marriage, intercourse, pregnancy, childbirth, drunkenness, and their knowledge of anatomical terminology. The frequencies of various response categories revealed a range of comprehension, the lowest answering only 10 percent correctly, the median answering 28 percent correctly, and only 1 subject correctly answering as many as one-half of the items. Correct conceptual responses significantly correlated with WAIS Full Scale and Verbal IQs and were also significantly related to the Adaptive Behavior Scale domains of Language, Socialization, and Responsibility. Serious errors of fact and conceptual confusion, though most prevalent in responses by the low comprehenders, were found in at least some responses by all of the men.
Dykes, Patricia C; Hurley, Ann; Cashen, Margaret; Bakken, Suzanne; Duffy, Mary E
2007-01-01
The use of health information technology (HIT) for the support of communication processes and data and information access in acute care settings is a relatively new phenomenon. A means of evaluating the impact of HIT in hospital settings is needed. The purpose of this research was to design and psychometrically evaluate the Impact of Health Information Technology scale (I-HIT). I-HIT was designed to measure the perception of nurses regarding the ways in which HIT influences interdisciplinary communication and workflow patterns and nurses' satisfaction with HIT applications and tools. Content for a 43-item tool was derived from the literature, and supported theoretically by the Coiera model and by nurse informaticists. Internal consistency reliability analysis using Cronbach's alpha was conducted on the 43-item scale to initiate the item reduction process. Items with an item total correlation of less than 0.35 were removed, leaving a total of 29 items. Item analysis, exploratory principal component analysis and internal consistency reliability using Cronbach's alpha were used to confirm the 29-item scale. Principal components analysis with Varimax rotation produced a four-factor solution that explained 58.5% of total variance (general advantages, information tools to support information needs, information tools to support communication needs, and workflow implications). Internal consistency of the total scale was 0.95 and ranged from 0.80-0.89 for four subscales. I-HIT demonstrated psychometric adequacy and is recommended to measure the impact of HIT on nursing practice in acute care settings.
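The item-reduction step described above (dropping items whose item-total correlation falls below 0.35, then re-checking Cronbach's alpha) can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the abstract does not say whether the corrected or uncorrected item-total correlation was used, so the sketch assumes the corrected form, and the function names and the small rating matrix are hypothetical.

```python
from statistics import mean, pvariance

def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def cronbach_alpha(rows):
    """Cronbach's alpha for rows of item scores (one row per respondent)."""
    k = len(rows[0])
    item_vars = [pvariance([r[j] for r in rows]) for j in range(k)]
    total_var = pvariance([sum(r) for r in rows])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

def corrected_item_total(rows, j):
    """Correlation of item j with the sum of all other items."""
    item = [r[j] for r in rows]
    rest = [sum(r) - r[j] for r in rows]
    return pearson(item, rest)

def items_to_keep(rows, cutoff=0.35):
    """Indices of items whose corrected item-total correlation meets the cutoff."""
    return [j for j in range(len(rows[0]))
            if corrected_item_total(rows, j) >= cutoff]

# Hypothetical ratings: 5 respondents x 3 items (not I-HIT data).
rows = [
    [1, 1, 5],
    [2, 2, 4],
    [3, 3, 1],
    [4, 4, 3],
    [5, 5, 2],
]
kept = items_to_keep(rows)
reduced = [[r[j] for j in kept] for r in rows]
print(kept, cronbach_alpha(reduced))
```

In practice the correlations are usually recomputed after each removal, because dropping one item changes every remaining item's rest-score.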
Ali, Amira Mohammed; Ahmed, Anwar; Sharaf, Amira; Kawakami, Norito; Abdeldayem, Samia M; Green, Joseph
2017-12-01
This study aimed to examine the validity of the Arabic version of the Depression Anxiety Stress Scale-21 (DASS-21) in 149 illicit drug users. We calculated α coefficient, inter-item and item-total correlations, coefficients of reproducibility and scalability (CR and CS), item difficulty and discrimination indices. The DASS-21 had an acceptable reliability; but values of the CR and the CS were less than acceptable. Items varied in difficulty and discrimination; some items are candidates for elimination. The DASS-21 is a probabilistic and not a deterministic measure of distress; it has problematic items and needs further investigations. Copyright © 2017 Elsevier B.V. All rights reserved.
The Utrecht questionnaire (U-CEP) measuring knowledge on clinical epidemiology proved to be valid.
Kortekaas, Marlous F; Bartelink, Marie-Louise E L; de Groot, Esther; Korving, Helen; de Wit, Niek J; Grobbee, Diederick E; Hoes, Arno W
2017-02-01
Knowledge on clinical epidemiology is crucial to practice evidence-based medicine. We describe the development and validation of the Utrecht questionnaire on knowledge on Clinical epidemiology for Evidence-based Practice (U-CEP); an assessment tool to be used in the training of clinicians. The U-CEP was developed in two formats: two sets of 25 questions and a combined set of 50. The validation was performed among postgraduate general practice (GP) trainees, hospital trainees, GP supervisors, and experts. Internal consistency, internal reliability (item-total correlation), item discrimination index, item difficulty, content validity, construct validity, responsiveness, test-retest reliability, and feasibility were assessed. The questionnaire was externally validated. Internal consistency was good with a Cronbach alpha of 0.8. The median item-total correlation and mean item discrimination index were satisfactory. Both sets were perceived as relevant to clinical practice. Construct validity was good. Both sets were responsive but failed on test-retest reliability. One set took 24 minutes and the other 33 minutes to complete, on average. External GP trainees had comparable results. The U-CEP is a valid questionnaire to assess knowledge on clinical epidemiology, which is a prerequisite for practicing evidence-based medicine in daily clinical practice. Copyright © 2016 Elsevier Inc. All rights reserved.
Carciofo, Richard; Yang, Jiaoyan; Song, Nan; Du, Feng; Zhang, Kan
2016-01-01
The 44-item and 10-item Big Five Inventory (BFI) personality scales are widely used, but there is a lack of psychometric data for Chinese versions. Eight surveys (total N = 2,496, aged 18–82), assessed a Chinese-language BFI-44 and/or an independently translated Chinese-language BFI-10. Most BFI-44 items loaded strongly or predominantly on the expected dimension, and values of Cronbach's alpha ranged .698-.807. Test-retest coefficients ranged .694-.770 (BFI-44), and .515-.873 (BFI-10). The BFI-44 and BFI-10 showed good convergent and discriminant correlations, and expected associations with gender (females higher for agreeableness and neuroticism), and age (older age associated with more conscientiousness and agreeableness, and also less neuroticism and openness). Additionally, predicted correlations were found with chronotype (morningness positive with conscientiousness), mindfulness (negative with neuroticism, positive with conscientiousness), and mind wandering/daydreaming frequency (negative with conscientiousness, positive with neuroticism). Exploratory analysis found that the Self-discipline facet of conscientiousness positively correlated with morningness and mindfulness, and negatively correlated with mind wandering/daydreaming frequency. Furthermore, Self-discipline was found to be a mediator in the relationships between chronotype and mindfulness, and chronotype and mind wandering/daydreaming frequency. Overall, the results support the utility of the BFI-44 and BFI-10 for Chinese-language big five personality research. PMID:26918618
The Impact of Pediatric Brachial Plexus Injury on Families
Allgier, Allison; Overton, Myra; Welge, Jeffrey; Mehlman, Charles T.
2015-01-01
Purpose: To determine the impact on families of children with brachial plexus injuries in order to best meet their clinical and social needs. Methods: Our cross-sectional study included families with children between the ages of 1 and 18 with birth or non-neonatal brachial plexus injuries (BPI). The consenting parent or guardian completed a demographic questionnaire and the validated Impact on Family Scale during a single assessment. Total scores can range from 0 to 100, with higher scores indicating a greater impact on the family. Factor analysis and item-total correlations were used to examine structure, individual items, and dimensions of family impact. Results: One hundred two caregivers participated. Overall, families perceived impact across various dimensions from having a child with a BPI. Total family impact was 43. The 2 individual items correlating most strongly with the overall total score were from the financial dimension of the Impact on Family Scale. The strongest demographic relationship was traveling nationally for care and treatment of the BPI. Severity of injury was marginally correlated with impact on the family. Parent-child agreement about the severity of the illness was relatively high. Conclusion: Caretakers of children with a BPI perceived impact on their families in the form of personal strain, family/social factors, financial stress, and mastery. A multidisciplinary clinical care team should address the various realms of impact on family throughout the course of treatment. Level of Evidence: II, Prognostic. PMID:25936738
King's Parkinson's disease pain scale, the first scale for pain in PD: An international validation.
Chaudhuri, K Ray; Rizos, A; Trenkwalder, C; Rascol, O; Pal, S; Martino, D; Carroll, C; Paviour, D; Falup-Pecurariu, C; Kessel, B; Silverdale, M; Todorova, A; Sauerbier, A; Odin, P; Antonini, A; Martinez-Martin, P
2015-10-01
Pain is a key unmet need and a major aspect of non-motor symptoms of Parkinson's disease (PD). No specific validated scales exist to identify and grade the various types of pain in PD. We report an international, cross-sectional, open, multicenter, one-point-in-time evaluation with retest study of the first PD-specific pain scale, the King's PD Pain Scale. Its seven domains include 14 items, each item scored by severity (0-3) multiplied by frequency (0-4), resulting in a subscore of 0 to 12, with a total possible score range from 0 to 168. One hundred seventy-eight PD patients with otherwise unexplained pain (age [mean ± SD], 64.38 ± 11.38 y [range, 29-85]; 62.92% male; duration of disease, 5.40 ± 4.93 y) and 83 nonspousal non-PD controls, matched by age (64.25 ± 11.10 y) and sex (61.45% males) were studied. No missing data were noted, and a floor effect was observed in all domains. The difference between mean and median King's PD Pain Scale total score was less than 10% of the maximum observed value. Skewness was marginally high (1.48 for patients). Factor analysis showed four factors in the King's PD Pain Scale, explaining 57% of the variance (Kaiser-Meyer-Olkin, 0.73; sphericity test). Cronbach's alpha was 0.78, item-total correlation mean value 0.40, and item homogeneity 0.22. Correlation coefficients of the King's PD Pain Scale domains and total score with other pain measures were high. Correlation with the Scale for Outcomes in PD-Motor, Non-Motor Symptoms Scale total score, and quality of life measures was high. The King's PD Pain Scale seems to be a reliable and valid scale for grade rating of various types of pain in PD. © 2015 International Parkinson and Movement Disorder Society.
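The scoring rule stated in this abstract (severity 0-3 multiplied by frequency 0-4 for each of 14 items) can be written out as a short sketch. The function names are hypothetical, and because the abstract does not say how the 14 items are distributed across the seven domains, only item and total scores are computed here.

```python
N_ITEMS = 14  # number of items stated in the abstract

def item_score(severity, frequency):
    """Item subscore: severity (0-3) times frequency (0-4), giving 0-12."""
    if not 0 <= severity <= 3:
        raise ValueError("severity must be between 0 and 3")
    if not 0 <= frequency <= 4:
        raise ValueError("frequency must be between 0 and 4")
    return severity * frequency

def total_score(ratings):
    """Total score from a list of 14 (severity, frequency) pairs, giving 0-168."""
    if len(ratings) != N_ITEMS:
        raise ValueError("expected 14 (severity, frequency) pairs")
    return sum(item_score(s, f) for s, f in ratings)

# The maximum rating on every item reproduces the stated upper bound.
print(total_score([(3, 4)] * N_ITEMS))  # 168
```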
Bläsing, Lena; Goebel, Gerhard; Flötzinger, Uta; Berthold, Anke; Kröner-Herwig, Birgit
2010-07-01
The purpose of this study was to analyse the Questionnaire on Hypersensitivity to Sound (GUF; Nelting & Finlayson, 2004) and to improve its validity based on the analysis of intercorrelations (single item level) with other methods of assessing hyperacusis (uncomfortable loudness level, individual loudness function, self-rated severity of hyperacusis). Subjects were 91 inpatients with tinnitus and hyperacusis. The GUF showed good reliability (alpha = .92). The factorial structure of the questionnaire reported by Nelting et al (2002) was not completely supported by the evidence in this study. The total score and the single items showed small to moderate correlations with the other modes of measuring hyperacusis. Evidence for convergent and discriminant validity was found, but overall the results corroborate the conceptual heterogeneity of the construct hyperacusis and its dependency on the assessment method. Four items of the GUF with particularly low correlations were excluded from the questionnaire. The revised GUF total score showed slightly, but not statistically significantly, higher convergent and discriminant validity.
Neural Correlates of Encoding Within- and Across-Domain Inter-Item Associations
Park, Heekyeong; Rugg, Michael D.
2012-01-01
The neural correlates of the encoding of associations between pairs of words, pairs of pictures, and word-picture pairs were compared. The aims were to determine first, whether the neural correlates of associative encoding vary according to study material and second, whether encoding of across- versus within-material item pairs is associated with dissociable patterns of hippocampal and perirhinal activity, as predicted by the ‘domain dichotomy’ hypothesis of medial temporal lobe (MTL) function. While undergoing fMRI scanning, subjects (n = 24) were presented with the three classes of study pairs, judging which of the denoted objects fit into the other. Outside of the scanner, subjects then undertook an associative recognition task, discriminating between intact study pairs, rearranged pairs comprising items that had been presented on different study trials, and unstudied item pairs. The neural correlates of successful associative encoding – subsequent associative memory effects – were operationalized as the difference in activity between study pairs correctly judged intact versus pairs incorrectly judged rearranged on the subsequent memory test. Pair type-independent subsequent memory effects were evident in the left inferior frontal gyrus (IFG) and the hippocampus. Picture-picture pairs elicited material-selective effects in regions of fusiform cortex that were also activated to a greater extent on picture trials than word trials, while word-word pairs elicited material-selective subsequent memory effects in left lateral temporal cortex. Contrary to the domain-dichotomy hypothesis, neither hippocampal nor perirhinal subsequent memory effects differed depending on whether they were elicited by within- versus across-material study pairs. 
It is proposed that the left IFG plays a domain-general role in associative encoding, that associative encoding can also be facilitated by enhanced processing in material-selective cortical regions, and that the hippocampus and perirhinal cortex contribute equally to the formation of inter-item associations regardless of whether the items belong to the same or to different processing domains. PMID:21254802
Objective and Subjective Cancer Knowledge Among Faith-Based Chinese Adults.
Hou, Su-I; Liu, Ling Jie
2017-10-01
This study compared cancer knowledge between church-going younger and older Chinese adults. Hou's 8-item validated cancer screening knowledge test (CSKT) and a new 14-item cancer warning signs test (CWST) were used to assess objective knowledge. Subjective knowledge was measured by one overall 5-point Likert scale item. A total of 372 Taiwanese and Chinese Americans from nine churches participated. Although there were no significant differences by age on either the CSKT scores (younger = 5.89 vs. older = 5.71; p = .297) or the CWST (younger = 6.27 vs. older = 5.86; p = .245), subjective knowledge was higher among older Chinese adults (younger = 2.44 vs. older = 3.05, p < .001). Older Chinese adults were also more likely to identify cancer warning signs correctly, while younger adults were more likely to identify false warning signs correctly. Results have implications for tailoring cancer knowledge type (subjective vs. objective) and content domain (screening vs. warning signs). Findings can help health educators better understand cancer education needs among Chinese adults.
Multivariate analysis of fears in dental phobic patients according to a reduced FSS-II scale.
Hakeberg, M; Gustafsson, J E; Berggren, U; Carlsson, S G
1995-10-01
This study analyzed and assessed dimensions of a questionnaire developed to measure general fears and phobias. A previous factor analysis among 109 dental phobics had revealed a five-factor structure with 22 items and an explained total variance of 54%. The present study analyzed the same material using a multivariate statistical procedure (LISREL) to reveal structural latent variables. The LISREL analysis, based on the correlation matrix, yielded a chi-square of 216.6 with 195 degrees of freedom (P = 0.138) and showed a model with seven latent variables. One was a general fear factor correlated to all 22 items. The other six factors concerned "Illness & Death" (5 items), "Failures & Embarrassment" (5 items), "Social situations" (5 items), "Physical injuries" (4 items), "Animals & Natural phenomena" (4 items). One item (opposite sex) was included in both "Failures & Embarrassment" and "Social situations". The last factor, "Social interaction", combined all the items in "Failures & Embarrassment" and "Social situations" (9 items). In conclusion, this multivariate statistical analysis (LISREL) revealed and confirmed a factor structure similar to our previous study, but added two important dimensions not shown with a traditional factor analysis. This reduced FSS-II version measures general fears and phobias and may be used on a routine clinical basis as well as in dental phobia research.
Assessing patients' experiences with communication across the cancer care continuum.
Mazor, Kathleen M; Street, Richard L; Sue, Valerie M; Williams, Andrew E; Rabin, Borsika A; Arora, Neeraj K
2016-08-01
To evaluate the relevance, performance and potential usefulness of the Patient Assessment of cancer Communication Experiences (PACE) items. Items focusing on specific communication goals related to exchanging information, fostering healing relationships, responding to emotions, making decisions, enabling self-management, and managing uncertainty were tested via a retrospective, cross-sectional survey of adults who had been diagnosed with cancer. Analyses examined response frequencies, inter-item correlations, and coefficient alpha. A total of 366 adults were included in the analyses. Relatively few selected Does Not Apply, suggesting that items tap relevant communication experiences. Ratings of whether specific communication goals were achieved were strongly correlated with overall ratings of communication, suggesting item content reflects important aspects of communication. Coefficient alpha was ≥.90 for each item set, indicating excellent reliability. Variations in the percentage of respondents selecting the most positive response across items suggest results can identify strengths and weaknesses. The PACE items tap relevant, important aspects of communication during cancer care, and may be useful to cancer care teams desiring detailed feedback. The PACE is a new tool for eliciting patients' perspectives on communication during cancer care. It is freely available online for practitioners, researchers and others. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Dueck, Amylou C; Mendoza, Tito R; Mitchell, Sandra A; Reeve, Bryce B; Castro, Kathleen M; Rogak, Lauren J; Atkinson, Thomas M; Bennett, Antonia V; Denicoff, Andrea M; O'Mara, Ann M; Li, Yuelin; Clauser, Steven B; Bryant, Donna M; Bearden, James D; Gillis, Theresa A; Harness, Jay K; Siegel, Robert D; Paul, Diane B; Cleeland, Charles S; Schrag, Deborah; Sloan, Jeff A; Abernethy, Amy P; Bruner, Deborah W; Minasian, Lori M; Basch, Ethan
2015-11-01
To integrate the patient perspective into adverse event reporting, the National Cancer Institute developed a patient-reported outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE). To assess the construct validity, test-retest reliability, and responsiveness of PRO-CTCAE items. A total of 975 adults with cancer undergoing outpatient chemotherapy and/or radiation therapy enrolled in this questionnaire-based study between January 2011 and February 2012. Eligible participants could read English and had no clinically significant cognitive impairment. They completed PRO-CTCAE items on tablet computers in clinic waiting rooms at 9 US cancer centers and community oncology practices at 2 visits 1 to 6 weeks apart. A subset completed PRO-CTCAE items during an additional visit 1 business day after the first visit. Primary comparators were clinician-reported Eastern Cooperative Oncology Group Performance Status (ECOG PS) and the European Organisation for Research and Treatment of Cancer Core Quality of Life Questionnaire (QLQ-C30). A total of 940 of 975 (96.4%) and 852 of 940 (90.6%) participants completed PRO-CTCAE items at visits 1 and 2, respectively. At least 1 symptom was reported by 938 of 940 (99.8%) participants. Participants' median age was 59 years; 57.3% were female, 32.4% had a high school education or less, and 17.1% had an ECOG PS of 2 to 4. All PRO-CTCAE items had at least 1 correlation in the expected direction with a QLQ-C30 scale (111 of 124, P<.05 for all). Stronger correlations were seen between PRO-CTCAE items and conceptually related QLQ-C30 domains. Scores for 94 of 124 PRO-CTCAE items were higher in the ECOG PS 2 to 4 vs 0 to 1 group (58 of 124, P<.05 for all). Overall, 119 of 124 items met at least 1 construct validity criterion. Test-retest reliability was 0.7 or greater for 36 of 49 prespecified items (median [range] intraclass correlation coefficient, 0.76 [0.53-.96]). 
Correlations between PRO-CTCAE item changes and corresponding QLQ-C30 scale changes were statistically significant for 27 prespecified items (median [range] r=0.43 [0.10-.56]; all P≤.006). Evidence demonstrates favorable validity, reliability, and responsiveness of PRO-CTCAE in a large, heterogeneous US sample of patients undergoing cancer treatment. Studies evaluating other measurement properties of PRO-CTCAE are under way to inform further development of PRO-CTCAE and its inclusion in cancer trials.
Mouthon, L; Rannou, F; Bérezné, A; Pagnoux, C; Arène, J‐P; Foïs, E; Cabane, J; Guillevin, L; Revel, M; Fermanian, J; Poiraudeau, S
2007-01-01
Objective: To develop and assess the reliability and construct validity of a scale assessing disability involving the mouth in systemic sclerosis (SSc). Methods: We generated a 34‐item provisional scale from mailed responses of patients (n = 74), expert consensus (n = 10) and literature analysis. A total of 71 other SSc patients were recruited. The test–retest reliability was assessed using the intraclass correlation coefficient and divergent validity using the Spearman correlation coefficient. Factor analysis followed by varimax rotation was performed to assess the factorial structure of the scale. Results: The item reduction process retained 12 items with 5 levels of answers (total score range 0–48). The mean total score of the scale was 20.3 (SD 9.7). The test–retest reliability was 0.96. Divergent validity was confirmed for global disability (Health Assessment Questionnaire (HAQ), r = 0.33), hand function (Cochin Hand Function Scale, r = 0.37), inter‐incisor distance (r = −0.34), handicap (McMaster‐Toronto Arthritis questionnaire (MACTAR), r = 0.24), depression (Hospital Anxiety and Depression (HAD); HADd, r = 0.26) and anxiety (HADa, r = 0.17). Factor analysis extracted 3 factors with eigenvalues of 4.26, 1.76 and 1.47, explaining 63% of the variance. These 3 factors could be clinically characterised. The first factor (5 items) represents handicap induced by the reduction in mouth opening, the second (5 items) handicap induced by sicca syndrome and the third (2 items) aesthetic concerns. Conclusion: We propose a new scale, the Mouth Handicap in Systemic Sclerosis (MHISS) scale, which has excellent reliability and good construct validity, and specifically assesses disability involving the mouth in patients with SSc. PMID:17502364
Development and testing of a cancer appetite and symptom questionnaire.
Halliday, V; Porock, D; Arthur, A; Manderson, C; Wilcock, A
2012-06-01
Poor appetite and weight loss are common in patients with cancer, contributing to an increase in morbidity and mortality. Early identification of those at greatest risk is problematic. The Council on Nutrition Appetite Questionnaire (CNAQ) is short and easy to use, although it is not specific to cancer populations. The present study aimed to build on the CNAQ to develop a cancer appetite and symptom questionnaire (CASQ) for predicting weight loss in patients with cancer. The content validity of the CNAQ was assessed by an expert panel (n = 41) using the content validity index (CVI). The resulting CASQ was tested for reliability among patients receiving radiotherapy (n = 34). Predictive validity of the CASQ was determined in patients with lung or upper gastrointestinal cancer (n = 185), comparing CASQ scores (possible range 0-48) recorded at baseline with percentage weight change after 12 weeks. In all but one CNAQ item, the CVI was above the minimum level of agreement (>0.70). Comments from expert panel members led to minor modifications and the introduction of new items resulting in the 12-item CASQ. The intraclass correlation coefficient of the CASQ was 0.80 [95% confidence interval (CI) = 0.68-0.92] and the difference between total scores at two time points was -0.20 (95% CI = -1.21 to 0.80). The optimum cut-off point of the instrument to predict >10% weight loss was 29/30 (area under curve = 0.75; sensitivity 71%, specificity 66%, positive predictive value 19%, negative predictive value 95%) [Correction added on 30 April 2012, after first online publication: in the preceding sentence, <10% was corrected to >10%]. The CASQ can predict weight loss among patients with lung and upper gastrointestinal cancer. Acknowledgment of the low positive predictive value is needed if the instrument is to be used within clinical practice. © 2012 The Authors. Journal of Human Nutrition and Dietetics © 2012 The British Dietetic Association Ltd.
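The abstract's caution about the low positive predictive value follows from how predictive values depend on prevalence as well as on sensitivity and specificity. The sketch below applies the standard Bayes-rule formulas to the reported sensitivity (71%) and specificity (66%). The prevalence of 10% is an assumption chosen for illustration only, since the abstract does not report the proportion of patients who lost more than 10% of their weight; the function name is hypothetical.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a test applied at a given prevalence."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    false_pos = (1 - specificity) * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv

# Sensitivity and specificity are from the abstract; prevalence is ASSUMED.
ppv, npv = predictive_values(0.71, 0.66, 0.10)
print(f"PPV = {ppv:.3f}, NPV = {npv:.3f}")
```

With an outcome this uncommon, most positive screens are false positives even at moderate specificity, which is why a high negative predictive value can coexist with a low positive one.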
Screening for oral health literacy in an urban dental clinic
Atchison, Kathryn A.; Gironda, Melanie W.; Messadi, Diana; Der-Martirosian, Claudia
2013-01-01
Objective: Studies show that the average person fails to understand and use health care related materials to their full potential. The goal of this study was to evaluate a health literacy instrument based on the Rapid Estimate of Adult Literacy in Medicine (REALM) that incorporates dental and medical terms into one 84-item Rapid Estimate of Adult Literacy in Medicine and Dentistry (REALM-D) measure and determine its association with patient characteristics of a culturally diverse dental clinic population. Methods: An 84-item dental/medical health literacy word list and a 48-item health beliefs and attitudes survey were provided to a sample of 200 adult patients seeking treatment for the first time at an oral diagnosis clinic located in a large urban medical center in Los Angeles, California. Results: Of the total sample, 154 participants read all of list 1 correctly, 141 read list 2 correctly, and only 38 read list 3 correctly. Nonwhite participants had significantly lower REALM-D scores at each level of difficulty, as well as lower total scale scores, compared to white participants. Participants who reported that English was not their main language had significantly lower REALM-D scores. REALM-D scores also varied significantly by level of education: as level of education increased, oral health literacy increased. At a bivariate level, race, education, and English as a main language were predictive of health literacy, and they remained predictive in a regression model. An interaction between education and English as a main language was significant. Conclusions: The REALM-D is an effective instrument for use by medical and dental clinicians in detecting differences among people of different backgrounds and among those for whom English is not their primary language. PMID:20545829
McCulloch, Katie; Pastorek, Nicholas J; Miller, Brian I; Romesser, Jennifer; Linck, John; Sim, Anita H; Troyanskaya, Maya; Maestas, Kacey Little
2015-01-01
The Department of Veterans Affairs is encouraging administration of the Mayo-Portland Adaptability Inventory-4 Participation Index (M2PI) to identify long-term psychosocial outcomes of Operation Enduring Freedom (OEF), Operation Iraqi Freedom (OIF), and Operation New Dawn (OND) Veterans with a history of traumatic brain injury (TBI). To evaluate clinician and Veteran interrater reliability and how response validity influences M2PI item ratings. A total of 122 OEF/OIF/OND Veterans who reported a history consistent with mild TBI during deployment and were referred for neuropsychological evaluation following Comprehensive TBI Evaluation. Interrater reliability study. M2PI; Minnesota Multiphasic Personality Inventory-2 Symptom Validity Scale (FBS). Veterans reported greater perceived restrictions than clinicians across all M2PI items and total score. Interrater correlations ranged from rs = 0.27 (residence) to rs = 0.58 (money management) across items, with a total score correlation of rs = 0.60. When response bias was indicated, both Veterans and clinicians reported greater participation restrictions than those reported by Veterans without evidenced response bias. Low interrater correlation is consistent with previous findings. As ratings of clinicians and Veterans should not be interpreted as equivalent, documenting the rater's identity is important for interpretation. Using objective indicators of functional outcome may assist clinician raters, particularly when self-report may be biased.
Suetsugu, Yoshiko; Honjo, Shuji; Ikeda, Mari; Kamibeppu, Kiyoko
2015-07-01
The purpose of this study was to develop the Japanese version of the Postpartum Bonding Questionnaire (PBQ) to gather data on Japanese mothers for comparison with other cultures and to examine the scale structure of the PBQ among Japanese mothers. We administered the PBQ to a cross-section of 244 mothers 4 weeks after delivery and again 2 weeks later to 199 mothers as a retest to examine reliability. We used exploratory factor analysis to evaluate the factor structure of the PBQ. Correlations with the Mother-to-Infant Bonding Scale (MIBS), the Maternal Attachment Inventory (MAI), Edinburgh Postnatal Depression Scale (EPDS), and sociodemographic variables were calculated for validation. The 14-item version of the PBQ extracted by exploratory analysis consisted of four factors: 'impaired bonding', 'rejection and anger', 'anxiety about care', and 'lack of affection'. We found significant correlations of the total scores of the PBQ and the 14-item version of the PBQ positively with the MIBS and negatively with the MAI. Moderate significant correlations with total scores were also found with the EPDS. Total scores for primiparous and depressed mothers were higher than those for multiparous mothers and mothers without depression. The results of this study demonstrated the reliability and validity of the PBQ and the 14-item version of the PBQ in Japanese mothers 4 weeks after delivery. Copyright © 2015. Published by Elsevier Inc.
Evaluating the Quality of Life of Glaucoma Patients Using the State-Trait Anxiety Inventory.
Otori, Yasumasa; Takahashi, Genichiro; Urashima, Mitsuyoshi; Kuwayama, Yasuaki
2017-11-01
To evaluate anxiety felt by glaucoma patients. In total, 472 glaucoma patients responded to a questionnaire on anxiety, subjective symptoms, and vision-related quality of life (VR-QOL) associated with glaucoma. Anxiety was evaluated using the State-Trait Anxiety Inventory (STAI), state anxiety (STAI-State) subscale along with our novel questionnaire, assessing visual function and subjective symptoms, specialized for glaucoma. VR-QOL was evaluated using 5 subitems from the 25-item National Eye Institute Visual Function Questionnaire (VFQ-25). Adherence to ophthalmic antiglaucoma agents was confirmed. As indexes of visual function, corrected visual acuity (measured by eye chart), mean deviation (MD) score (measured with static perimetry), and 4 thresholds at the center of vision were determined. Stages were classified according to the Aulhorn Classification. From the STAI-State scores, the prevalence of anxiety in glaucoma patients was evaluated. We analyzed the correlation between the STAI-State and VFQ-25, anxiety, subjective symptoms, adherence, and visual function indexes. In total, 78% of glaucoma patients experienced at least an intermediate level of anxiety. The STAI-State correlated significantly with anxiety and subjective symptoms as measured by our novel questionnaire, particularly for questions "current anxiety about loss of vision" and "current anxiety in life" (r=0.468 and 0.500; both P<0.0001). However, STAI-State correlated weakly with VFQ-25, and not at all with visual function indexes and adherence. Many glaucoma patients feel anxiety. The STAI-State is correlated with the VR-QOL and anxiety in glaucoma patients, making it useful for understanding the anxiety present in glaucoma patients.
Judging in Rhythmic Gymnastics at Different Levels of Performance.
Leandro, Catarina; Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta
2017-12-01
This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach's alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges' difficulty scores, the Kendall's coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach's alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level.
Judging in Rhythmic Gymnastics at Different Levels of Performance
Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta
2017-01-01
This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach’s alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges’ difficulty scores, the Kendall’s coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach’s alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level. PMID:29339996
DOE Office of Scientific and Technical Information (OSTI.GOV)
Calhoun, L.D.
A 15-step flowchart model was applied to the construction of a 20-item long form and a 6-item short form of the scale. Both scales were field-tested on 829 respondents representing a diverse range of subjects: high school juniors and seniors, nuclear engineering students, pre-service teachers, and members of a citizens action group. Both scales are available for immediate use. The 20-item scale appears to be reliable, content valid, and construct valid. Content validity was examined through factor analysis and the use of two separate juries of nuclear experts. Construct validity was examined by application of the known-groups approach. Scale reliability and homogeneity were evidenced by a 0.93 coefficient alpha, a range of positive inter-item correlations of 0.15 to 0.73, and a range of adjusted item-total correlations of 0.46 to 0.80. The 20-item scale also has evaluative quality; means ranged from 2.80 to 3.70. Content validity for the 6-item scale was examined by a jury of nuclear experts. An obtained coefficient alpha of 0.82 and a range of inter-item correlations of 0.51 to 0.72 suggest the scale is reliable and homogeneous. The 6-item short form also appears to have evaluative quality; means ranged from 2.37 to 3.18.
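The homogeneity statistics quoted in this record (coefficient alpha and adjusted item-total correlations) can be illustrated with a minimal sketch. It assumes the standard textbook definitions, since the record does not describe how the values were computed; the function names and the response matrix are hypothetical and the data are made up for illustration.

```python
from math import sqrt

def variance(xs):
    """Sample variance with an n - 1 denominator."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))

def cronbach_alpha(rows):
    """Coefficient alpha; rows holds one list of item scores per respondent."""
    k = len(rows[0])
    items = list(zip(*rows))            # transpose: one tuple per item
    totals = [sum(r) for r in rows]     # total score per respondent
    return k / (k - 1) * (1 - sum(variance(i) for i in items) / variance(totals))

def corrected_item_total(rows):
    """Correlate each item with the sum of the *other* items (item removed from total)."""
    result = []
    for j in range(len(rows[0])):
        item = [r[j] for r in rows]
        rest = [sum(r) - r[j] for r in rows]
        result.append(pearson(item, rest))
    return result

# Hypothetical responses: 5 respondents x 3 items (illustrative data only)
responses = [[1, 2, 1], [2, 2, 3], [3, 4, 3], [4, 4, 5], [5, 5, 4]]
alpha = cronbach_alpha(responses)
r_it = corrected_item_total(responses)
```

Removing the item from the total before correlating is what distinguishes the adjusted (corrected) item-total correlation from the plain one, which is inflated because the item is correlated with itself.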
Development of a brief measure of college stress: the college student stress scale.
Feldt, Ronald C
2008-06-01
The study included assessment of the psychometric properties of an 11-item measure of perceived stress and control in 273 first-year college students. Results indicated good internal consistency and stability over a 5-week interval, and the total score was highly correlated with another measure of perceived stress. Principal components analysis with varimax rotation indicated two possible factors which explained 55% of the variance. However, given the small number of items and low internal consistency of the second factor (alpha=.60), use of the Total score is recommended.
Clinimetric Testing of the Comprehensive Cervical Dystonia Rating Scale
Comella, C. L.; Perlmutter, J.S.; Jinnah, H. A.; Waliczek, T. A.; Rosen, A. R.; Galpern, W. R.; Adler, C. H.; Barbano, R. L.; Factor, S. A.; Goetz, C.G.; Jankovic, J.; Reich, S. G.; Rodriguez, R. L.; Severt, W. L.; Zurowski, M.; Fox, S. H.; Stebbins, G.T.
2016-01-01
Objective To test the clinimetric properties of the Comprehensive Cervical Dystonia Rating Scale. Background This is a modular scale with modifications of the Toronto Western Spasmodic Torticollis Rating Scale (composed of three subscales assessing motor severity, disability and pain) now referred to as the revised Toronto Western Spasmodic Torticollis Scale-2; a newly developed psychiatric screening instrument; and the Cervical Dystonia Impact Profile-58 as a quality of life measure. Methods Ten dystonia experts rated subjects with cervical dystonia using the comprehensive scale. Clinimetric techniques assessed each module of the scale for reliability, item correlation and factor structure. Results There were 208 cervical dystonia patients (73% women, age 59±10 years, duration 15±12 years). The internal consistency of the motor severity subscale was acceptable (Cronbach’s alpha = 0.57). Item to total correlations showed that elimination of items with low correlations (<0.20) increased alpha to 0.71. Internal consistency estimates for the subscales for disability and pain were 0.88 and 0.95 respectively. The psychiatric screening scale had a Cronbach’s alpha of 0.84 and satisfactory item to total correlations. When the subscales of the Toronto Western Spasmodic Torticollis Scale-2 were combined with the psychiatric screening scale, Cronbach's alpha was 0.88, and construct validity assessment demonstrated four rational factors: motor, disability, pain and psychiatric disorders. The Cervical Dystonia Impact Profile-58 had an alpha of 0.98 and its construction was validated through a confirmatory factor analysis. Conclusions The modules of the Comprehensive Cervical Dystonia Rating Scale are internally consistent with a logical factor structure. PMID:26971359
Mackus, Marlou; Kruijff, Deborah de; Otten, Leila S; Kraneveld, Aletta D; Garssen, Johan; Verster, Joris C
2017-04-12
Altered immune functioning has been demonstrated in individuals with autism spectrum disorder (ASD). The current study explores the relationship between perceived immune functioning and experiencing ASD traits in healthy young adults. N = 410 students from Utrecht University completed a survey on immune functioning and autistic traits. In addition to a 1-item perceived immune functioning rating, the Immune Function Questionnaire (IFQ) was completed to assess perceived immune functioning. The Dutch translation of the Autism-Spectrum Quotient (AQ) was completed to examine variation in autistic traits, including the domains "social insights and behavior", "difficulties with change", "communication", "phantasy and imagination", and "detail orientation". The 1-item perceived immune functioning score did not significantly correlate with the total AQ score. However, a significant negative correlation was found between perceived immune functioning and the AQ subscale "difficulties with change" (r = -0.119, p = 0.019). In women, 1-item perceived immune functioning correlated significantly with the AQ subscales "difficulties with change" (r = -0.149, p = 0.029) and "communication" (r = -0.145, p = 0.032). In men, none of the AQ subscales significantly correlated with 1-item perceived immune functioning. In conclusion, a modest relationship between perceived immune functioning and several autistic traits was found.
Do healthier foods cost more in Saudi Arabia than less healthier options?
Gosadi, Ibrahim M.; Alshehri, Muner A.; Alawad, Saud H.
2016-01-01
Objectives: To investigate whether healthy foods in Saudi Arabia cost more compared with less healthy options. Method: This is a cross-sectional study conducted in Riyadh, Saudi Arabia during June and July 2015. The study targeted well-known market chains in the city of Riyadh. The selection of food items was purposive to include healthy and less healthy food items in each category. Price, caloric value, salt, fat, sugar, and fiber contents for each food item were collected. To test for the correlation between nutritional contents and average price, Spearman’s correlation coefficients were calculated. The Mann-Whitney U test was used to test for the presence of average price difference between healthy and less healthy food items. Results: A total of 162 food items were collected. Sixty-six food items were classified as healthy compared with 96 less healthier options. The calculated correlation coefficients indicate an association between increased cost of food with increased caloric values (0.649 p=0.0000001), increased fat content (0.610 p=0.0000003), and increased salt contents (0.273 p=0.001). Prices of food items with higher fiber contents showed a weaker association (0.191 p=0.015). The overall average cost of healthy food was approximately 10 Saudi riyals cheaper than less healthy food (p=0.000001). Conclusion: The findings of the study suggest that the cost of healthy food is lower than that of less healthy items in the Saudi market. PMID:27570859
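Spearman's correlation, used in this record to relate nutritional content to price, is the Pearson correlation of the ranks. The sketch below assumes average ranks for ties (the record does not say how ties were handled); the function names and the five food items are hypothetical, not data from the study.

```python
from math import sqrt

def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # extend j over the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result

def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    rx, ry = ranks(x), ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    return cov / sqrt(sum((a - mx) ** 2 for a in rx) * sum((b - my) ** 2 for b in ry))

# Hypothetical food items: calories per serving and price (illustrative only)
calories = [50, 120, 250, 400, 520]
prices = [3.0, 2.5, 6.0, 9.5, 8.0]
rho = spearman(calories, prices)
```

Because only ranks enter the calculation, the coefficient is unaffected by the skewed price distributions that are common in market data.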
Guan, Wei; Cai, Wei-Xiong; Huang, Fu-Yin; Wu, Jia-Sheng
2009-10-01
To explore the application of the Diminished Criminal Responsibility Rating Scale (DCRRS) to offenders with mental retardation. The DCRRS was applied to 121 offenders with mental retardation, who were divided into three groups according to the degree of their diminished criminal responsibility. Rating scores differed significantly among the three groups (mild group 22.12 ± 4.69, moderate group 25.50 ± 5.48, major group 27.59 ± 5.69), and 17 items correlated well with the total score of the scale, with correlation coefficients from 0.289 to 0.665. Six factors were extracted by factor analysis, explaining 69.392% of the variance. The DCRRS has reasonable items, and its total score can distinguish among the three degrees of diminished criminal responsibility in offenders with mental retardation.
Development of the Attributed Dignity Scale.
Jacelon, Cynthia S; Dixon, Jane; Knafl, Kathleen A
2009-07-01
A sequential, multi-method approach to instrument development beginning with concept analysis, followed by (a) item generation from qualitative data, (b) review of items by expert and lay person panels, (c) cognitive appraisal interviews, (d) pilot testing, and (e) evaluating construct validity was used to develop a measure of attributed dignity in older adults. The resulting positively scored, 23-item scale has three dimensions: Self-Value, Behavioral Respect-Self, and Behavioral Respect-Others. Item-total correlations in the pilot study ranged from 0.39 to 0.85. Correlations between the Attributed Dignity Scale (ADS) and both Rosenberg's Self-Esteem Scale (0.17) and Crowne and Marlowe's Social Desirability Scale (0.36) were modest and in the expected direction, indicating attributed dignity is a related but independent concept. Next steps include testing the ADS with a larger sample to complete factor analysis, test-retest stability, and further study of the relationships between attributed dignity and other concepts.
Aloba, Olutayo; Olabisi, Oluseyi; Aloba, Tolulope
2016-01-01
The 10-item Connor-Davidson Resilience Scale (CD-RISC) has demonstrated satisfactory psychometric properties as a measure of resilience in all the previous studies conducted in developed countries. The objective of this study was to explore the psychometric characteristics of the 10-item CD-RISC among student nurses in southwestern Nigeria. This descriptive cross-sectional study involved a total of 449 student nurses who completed the 10-item CD-RISC in addition to measures of self-esteem, depression, religiosity, and psychological distress. The scale demonstrated adequate reliability (Cronbach's α = .81) and satisfactory validity with significant correlations with the measures of self-esteem, depression, religiosity, and psychological distress. Factor analyses revealed that resilience was best explained by a two-factor construct. The scale is a valid measure of resilience among Nigerian student nurses. © The Author(s) 2016.
Bootstrap Standard Errors for Maximum Likelihood Ability Estimates When Item Parameters Are Unknown
ERIC Educational Resources Information Center
Patton, Jeffrey M.; Cheng, Ying; Yuan, Ke-Hai; Diao, Qi
2014-01-01
When item parameter estimates are used to estimate the ability parameter in item response models, the standard error (SE) of the ability estimate must be corrected to reflect the error carried over from item calibration. For maximum likelihood (ML) ability estimates, a corrected asymptotic SE is available, but it requires a long test and the…
Scales for assessing self-efficacy of nurses and assistants for preventing falls
Dykes, Patricia C.; Carroll, Diane; McColgan, Kerry; Hurley, Ann C.; Lipsitz, Stuart R.; Colombo, Lisa; Zuyev, Lyubov; Middleton, Blackford
2011-01-01
Aim This paper is a report of the development and testing of the Self-Efficacy for Preventing Falls Nurse and Assistant scales. Background Patient falls and fall-related injuries are traumatic ordeals for patients, family members and providers, and carry a toll for hospitals. Self-efficacy is an important factor in determining actions persons take and levels of performance they achieve. Performance of individual caregivers is linked to the overall performance of hospitals. Scales to assess nurses and certified nursing assistants’ self-efficacy to prevent patients from falling would allow for targeting resources to increase SE, resulting in improved individual performance and ultimately decreased numbers of patient falls. Method Four phases of instrument development were carried out to (1) generate individual items from eight focus groups (four each nurse and assistant conducted in October 2007), (2) develop prototype scales, (3) determine content validity during a second series of four nurse and assistant focus groups (January 2008) and (4) conduct item analysis, paired t-tests, Student’s t-tests and internal consistency reliability to refine and confirm the scales. Data were collected during February–December, 2008. Results The 11-item Self-Efficacy for Preventing Falls Nurse had an alpha of 0·89 with all items in the range criterion of 0·3–0·7 for item total correlation. The 8-item Self-Efficacy for Preventing Falls Assistant had an alpha of 0·74 and all items had item total correlations in the 0·3–0·7 range. Conclusions The Self-Efficacy for Preventing Falls Nurse and Self-Efficacy for Preventing Falls Assistant scales demonstrated psychometric adequacy and are recommended to measure bedside staff’s self-efficacy beliefs in preventing patient falls. PMID:21073506
Revision and validation of a scale to assess pregnancy stress.
Chen, Chung-Hey
2015-03-01
Pregnancy is a potentially stressful event. Prenatal stress alters maternal endocrine and immune systems, has been implicated in the etiology of prenatal complications or postnatal psychiatric disorders, and may adversely affect fetal health. The 30-item Pregnancy Stress Rating Scale (PSRS), initially developed in 1983 by Chen and colleagues, is the only measure to date designed specifically to evaluate prenatal stress. The purpose of this study was to reconsider and revise the 30-item PSRS and validate the new PSRS. A cross-sectional design was used. Adding new items of pregnancy stress generated from clinical experience and expert recommendations resulted in a 40-item revised PSRS that was more reflective of current social conditions. Three hundred pregnant women, recruited from the antenatal clinic of a medical center in southern Taiwan, completed the revised PSRS to assess its internal consistency, test-retest reliability, construct validity, and convergent and discriminate validity. The final 36-item PSRS (PSRS36) was derived by deleting four items with relatively low item-total correlation coefficients or factor loadings. The resultant 36-item scale showed good internal consistency (α = .92) and 2-week test-retest reliability (r = .82). Factor analysis confirmed construct validity and suggested five prenatal stress dimensions, which explained 52.17% of the total variance. Convergent and discriminate validities were indicated by significant correlations among the PSRS36, Perceived Stress Scale, and Interpersonal Support Evaluation List. The PSRS36 is a psychometrically sound and practical tool for nurses and other healthcare providers to assess prenatal stress and to examine intervention protocols in Taiwanese prenatal women. More research is recommended to determine whether the PSRS36 may be used in other racial-ethnic groups.
Psychometric properties of the Spanish version of the Resilience Scale.
Heilemann, MarySue V; Lee, Kathryn; Kury, Felix Salvador
2003-01-01
The purpose of this study is to test the reliability and validity of a Spanish translation of the Resilience Scale (RS), which was originally created in English by Wagnild and Young (1993). A team of bilingual, bicultural translators participated in the translation process to enhance the linguistic accuracy and cultural appropriateness of the Spanish translation. As part of the convenience sample of 315 women of Mexican descent who participated in the larger study, data from 147 women who preferred to read and write in Spanish were used in this analysis. The English version of the RS consists of a 17-item "Personal Competence" subscale and an 8-item "Acceptance of Self and Life" subscale for a total of 25 items. However, two items had low item-total loadings and were removed to form a modified 23-item RS. The exploratory principal components factor analysis, varimax rotation, and subsequent goodness of fit indices were ambivalent on whether a one or two-factor solution was appropriate, but the chi-square difference test clearly demonstrated that the two-factor solution of the Spanish version was more useful in explaining variance than a one-factor solution. Internal consistency reliability was estimated with Cronbach's alpha (alpha = 0.93) which was acceptable for the 23-item RS as well as its subscales. Construct validity was demonstrated by a significant positive correlation between resilience and life satisfaction (r = 0.36; p < 0.001), and a significant negative correlation between resilience and depressive symptoms (r = -0.29; p < 0.01). This analysis ultimately supports the appropriateness of the modified 23-item Spanish translation of the RS and its subscales in a sample of urban, low-income women of Mexican descent in the U.S.
Stubenrouch, Fabienne E; Pieterse, Arwen H; Falkenberg, Rijan; Santema, T Katrien B; Stiggelbout, Anne M; van der Weijden, Trudy; Aarts, J Annemijn W M; Ubbink, Dirk T
2016-06-01
The 12-item "observing patient involvement" (OPTION(12))-instrument is commonly used to assess the extent to which healthcare providers involve patients in health-related decision-making. The five-item version (OPTION(5)) claims to be a more efficient measure. In this study we compared the Dutch versions of the OPTION-instruments in terms of inter-rater agreement and correlation in outpatient doctor-patient consultations in various settings, to learn if we can safely switch to the shorter OPTION(5)-instrument. Two raters coded 60 audiotaped vascular surgery and oncology patient consultations using OPTION(12) and OPTION(5). Unweighted Cohen's kappa was used to compute inter-rater agreement on item-level. The association between the total scores of the two OPTION-instruments was investigated using Pearson's correlation coefficient (r) and a Bland & Altman plot. After fine-tuning the OPTION-manuals, inter-rater agreement for OPTION(12) and OPTION(5) was good to excellent (kappa range 0.69-0.85 and 0.63-0.72, respectively). Mean total scores were 23.7 (OPTION(12); SD=7.8) and 39.3 (OPTION(5); SD=12.7). Correlation between the total scores was high (r=0.71; p=0.01). OPTION(5) scored systematically higher with a wider range than OPTION(12). Both OPTION-instruments had a good inter-rater agreement and correlated well. OPTION(5) seems to differentiate better between various levels of patient involvement. The OPTION(5)-instrument is recommended for clinical application. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
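The two agreement statistics named in this record, unweighted Cohen's kappa and the Bland and Altman comparison, can be sketched as follows. This assumes the standard definitions (chance-corrected agreement, and mean difference with 1.96 SD limits of agreement); the function names and all ratings are hypothetical, not the study's data.

```python
from collections import Counter
from statistics import mean, stdev

def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    n = len(rater_a)
    p_obs = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    # agreement expected by chance from each rater's marginal frequencies
    p_exp = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    return (p_obs - p_exp) / (1 - p_exp)

def bland_altman(x, y):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [xi - yi for xi, yi in zip(x, y)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

# Hypothetical item-level codes from two raters (illustrative only)
rater1 = [0, 1, 2, 1, 0, 2, 1, 1]
rater2 = [0, 1, 2, 0, 0, 2, 1, 2]
kappa = cohen_kappa(rater1, rater2)

# Hypothetical total scores from a short and a long instrument
short_form = [10, 12, 14, 16]
long_form = [8, 11, 11, 14]
bias, lower, upper = bland_altman(short_form, long_form)
```

A nonzero bias with narrow limits would correspond to one instrument scoring systematically higher than the other, which is the pattern the record reports; a high Pearson correlation alone cannot reveal that.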
Vaughn, Kalif E; Rawson, Katherine A; Pyc, Mary A
2013-12-01
A wealth of previous research has established that retrieval practice promotes memory, particularly when retrieval is successful. Although successful retrieval promotes memory, it remains unclear whether successful retrieval promotes memory equally well for items of varying difficulty. Will easy items still outperform difficult items on a final test if all items have been correctly recalled equal numbers of times during practice? In two experiments, normatively difficult and easy Lithuanian-English word pairs were learned via test-restudy practice until each item had been correctly recalled a preassigned number of times (from 1 to 11 correct recalls). Despite equating the numbers of successful recalls during practice, performance on a delayed final cued-recall test was lower for difficult than for easy items. Experiment 2 was designed to diagnose whether the disadvantage for difficult items was due to deficits in cue memory, target memory, and/or associative memory. The results revealed a disadvantage for the difficult versus the easy items only on the associative recognition test, with no differences on cue recognition, and even an advantage on target recognition. Although successful retrieval enhanced memory for both difficult and easy items, equating retrieval success during practice did not eliminate normative item difficulty differences.
Suggested posthypnotic amnesia in psychiatric patients and normals.
Frischholz, Edward J; Lipman, Laurie S; Braun, Bennett G; Sachs, Roberta
2015-01-01
The present study examined both quantitative and qualitative hypnotizability differences among four psychiatric patient groups (dissociative disorder (n = 17), schizophrenic (n = 13), mood disorder (n = 14), and anxiety disorder (n = 14) patients), and normals (college students (n = 63)). Dissociative disorder patients earned significantly higher corrected total scores on the Stanford Hypnotic Susceptibility Scale, Form C (mean = 7.94), than all other groups. Likewise, dissociative disorder patients initially recalled significantly fewer items when the posthypnotic amnesia suggestion was in effect (mean = .41) and reversed significantly more items when the suggestion was canceled (mean = 3.82) than all other groups. In contrast, schizophrenic patients recalled significantly fewer items when the amnesia suggestion was in effect (mean = 1.85) and reversed significantly fewer items when it was canceled (mean = .77) than the remaining groups. This qualitative difference between schizophrenic patients and the other groups on the suggested posthypnotic amnesia item was observed even though there were no significant quantitative differences between groups in overall hypnotic responsivity.
NASA Astrophysics Data System (ADS)
Chen, Hua-cai; Chen, Xing-dan; Lu, Yong-jun; Cao, Zhi-qiang
2006-01-01
Near-infrared (NIR) reflectance spectroscopy was used to develop a fast determination method for total ginsenosides in ginseng (Panax ginseng) powder. The spectra were analyzed with the multiplicative signal correction (MSC) correlation method. The spectral regions best correlated with the total ginsenoside content were 1660 nm~1880 nm and 2230 nm~2380 nm. NIR calibration models for ginsenosides were built with multiple linear regression (MLR), principal component regression (PCR), and partial least squares (PLS) regression, respectively. The results showed that the calibration model built with PLS combined with MSC over the optimal spectral region was the best one. The correlation coefficient and the root mean square error of correction validation (RMSEC) of the best calibration model were 0.98 and 0.15%, respectively. The optimal spectral region for calibration was 1204 nm~2014 nm. The results suggest that using NIR to rapidly determine the total ginsenoside content in ginseng powder is feasible.
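The MSC preprocessing step mentioned in this record can be sketched as below. This is an assumption-laden illustration: it takes MSC to mean the usual procedure of regressing each spectrum on the mean spectrum and removing the fitted offset and slope, which the record does not spell out. The function name `msc` and the four-point spectra are hypothetical.

```python
def msc(spectra):
    """Regress each spectrum on the mean spectrum and remove offset and slope.

    spectra: list of equal-length lists of absorbance values.
    Returns (corrected spectra, reference spectrum).
    """
    n_wl = len(spectra[0])
    # reference = mean spectrum across samples, per wavelength
    ref = [sum(s[j] for s in spectra) / len(spectra) for j in range(n_wl)]
    ref_mean = sum(ref) / n_wl
    sxx = sum((r - ref_mean) ** 2 for r in ref)
    corrected = []
    for s in spectra:
        s_mean = sum(s) / n_wl
        # least-squares fit: s ~ intercept + slope * ref
        slope = sum((r - ref_mean) * (v - s_mean) for r, v in zip(ref, s)) / sxx
        intercept = s_mean - slope * ref_mean
        corrected.append([(v - intercept) / slope for v in s])
    return corrected, ref

# Hypothetical spectra: one underlying shape with different offsets and gains
base = [0.1, 0.4, 0.9, 0.5]
spectra = [[a + b * v for v in base] for a, b in [(0.0, 1.0), (0.2, 1.5), (-0.1, 0.5)]]
corrected, ref = msc(spectra)
```

In this toy case every spectrum is an exact affine copy of the same shape, so the corrected spectra all collapse onto the reference; with real spectra the residual differences are what the PLS, PCR, or MLR model is then fitted to.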
Windmann, Sabine; Hill, Holger
2014-10-01
Performance on tasks requiring discrimination of at least two stimuli can be viewed either from an objective perspective (referring to actual stimulus differences), or from a subjective perspective (corresponding to participant's responses). Using event-related potentials recorded during an old/new recognition memory test involving emotionally laden and neutral words studied either blockwise or randomly intermixed, we show here how the objective perspective (old versus new items) yields late effects of blockwise emotional item presentation at parietal sites that the subjective perspective fails to find, whereas the subjective perspective ("old" versus "new" responses) is more sensitive to early effects of emotion at anterior sites than the objective perspective. Our results demonstrate the potential advantage of dissociating the subjective and the objective perspective onto task performance (in addition to analyzing trials with correct responses), especially for investigations of illusions and information processing biases, in behavioral and cognitive neuroscience studies. Copyright © 2014 Elsevier Inc. All rights reserved.
Wu, Hua-hong; Li, Hui; Gao, Qian
2013-05-30
The quality of life of children with short stature has rarely been studied in China, so we explored these children's quality of life and the psychometric properties of the Chinese version of the Pediatric Quality of Life Inventory 4.0 (PedsQL4.0) Generic Core Scales among children with short stature. A total of 201 children aged 8 to 18 years from the short stature clinic and other clinics of the Capital Institute of Pediatrics took part in this study. The questionnaires included demographic information and the PedsQL4.0 Generic Core Scales. According to their height, we divided the children into three groups (short stature, normal short, and normal) and then compared scale scores by height category. Moreover, we analyzed the reliability and validity of the PedsQL4.0 Generic Core Scales in these 201 children. The child self-report total PedsQL mean scores for the short stature, normal short and normal groups were 77.77 ± 9.69, 83.50 ± 8.56 and 87.36 ± 7.23; the parent-proxy total PedsQL mean scores were 77.62 ± 10.50, 82.69 ± 8.35 and 84.91 ± 9.96, respectively. For both child self-reports and parent proxy-reports, the Cronbach's α coefficients of the total scale, psychosocial health and social functioning ranged between 0.74 and 0.80; they ranged between 0.51 and 0.66 in the other dimensions. For child self-reports, the correlation coefficients of 17 items' scores (of 23 items in total) with the scores of the dimensions they belong to were above 0.5, with the highest 0.759; the other 6 items' correlation coefficients were below 0.5, with the lowest 0.280. For parent proxy-reports, the correlation coefficients of 19 items' scores with the scores of the dimensions they belong to were above 0.5, with the highest 0.793; the other 4 items' coefficients were below 0.5, with the lowest 0.243. As measured by the PedsQL4.0 Generic Core Scales, the quality of life of children with short stature is worse than that of their normal peers, and their quality of life was positively related to their stature.
Vigliecca, Nora Silvana
2017-11-09
To study the relationship between the caregiver's perception about the patient's impairment in spontaneous speech, according to an item of four questions administered by semi-structured interview, and the patient's performance in the Brief Aphasia Evaluation (BAE). 102 right-handed patients with focal brain lesions of different types and location were examined. BAE is a valid and reliable instrument to assess aphasia. The caregiver's perception was correlated with the item of spontaneous speech, the total score and the three main factors of the BAE: Expression, Comprehension and Complementary factors. The precision (sensitivity/ specificity) about the caregiver's perception of the patient's spontaneous speech was analyzed with reference to the presence or absence of disorder, according to the professional, on the BAE item of spontaneous speech. The studied correlation was satisfactory, being greater (higher than 80%) for the following indicators: the item of spontaneous speech, the Expression factor and the total score of the scale; the correlation was a little smaller (higher than 70%) for the Comprehension and Complementary factors. Comparing two cut-off points that evaluated the precision of the caregiver's perception, satisfactory results were observed in terms of sensitivity and specificity (>70%) with likelihood ratios higher than three. By using the median as the cut-off point, more satisfactory diagnostic discriminations were obtained. Interviewing the caregiver specifically on the patient's spontaneous speech, in an abbreviated form, provides relevant information for the aphasia diagnosis.
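The likelihood ratios reported in the abstract above follow directly from sensitivity and specificity. A minimal sketch, assuming the standard definitions LR+ = sensitivity / (1 - specificity) and LR- = (1 - sensitivity) / specificity; the function name and the example values are hypothetical illustrations, not figures taken from the study.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for a binary test.

    Both inputs are proportions; specificity must be strictly between
    0 and 1 so that neither ratio divides by zero.
    """
    if not 0.0 <= sensitivity <= 1.0:
        raise ValueError("sensitivity must lie in [0, 1]")
    if not 0.0 < specificity < 1.0:
        raise ValueError("specificity must lie strictly between 0 and 1")
    lr_positive = sensitivity / (1.0 - specificity)
    lr_negative = (1.0 - sensitivity) / specificity
    return lr_positive, lr_negative


# Hypothetical example values (assumed, not from the abstract).
lr_pos, lr_neg = likelihood_ratios(0.75, 0.80)
print(f"LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}")
```

With these assumed inputs the positive likelihood ratio exceeds three, which is the kind of threshold the abstract refers to.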
Heinik, J; Werner, P; Lin, R
1999-01-01
The testament definition scale (TDS) is a specifically designed six-item scale aimed at measuring the respondent's capacity to define "testament." We assessed the reliability and validity of this new short scale in 31 community-dwelling cognitively impaired elderly patients. Interrater reliability for the six items ranged from .87 to .97. The interrater reliability for the total score was .77. Significant correlations were found between the TDS score and the Mini-Mental State Examination (MMSE) and the Cambridge Cognitive Examination scores (r = .71 and .72 respectively, p = .001). Criterion validity yielded significantly different means for subjects with MMSE scores of 24-30 and 0-23: mean 3.9 and 1.6 respectively (t(20) = 4.7, p = .001). Using a cutoff point of 0-2 vs. 3+, 79% of the subjects were correctly classified as severely cognitively impaired, with only 8.3% false positives, and a positive predictive value of 94%. Thus, TDS was found both reliable and valid. This scale, however, is not synonymous with testamentary capacity. The discussion deals with the methodological limitations of this study, and highlights the practical as well as the theoretical relevance of TDS. Future studies are warranted to elucidate the relationships between TDS and existing legal requirements of testamentary capacity.
Study on the Validity and Reliability of Melbourne Decision Making Scale in Turkey
ERIC Educational Resources Information Center
Çolakkadioglu, Oguzhan; Deniz, M. Engin
2015-01-01
This study analyzes the validity and reliability of the Melbourne Decision Making Questionnaire (MDMQ). The sample consisted of 650 university students. The structural validity of the MDMQ, as well as correlations among its sub-scales, measure-bound validity, internal consistency, item total correlations and test-retest reliability coefficients…
Kostuj, Tanja; Stief, Felix; Hartmann, Kirsten Anna; Schaper, Katharina; Arabmotlagh, Mohammad; Baums, Mike H; Meurer, Andrea; Krummenauer, Frank; Lieske, Sebastian
2018-01-01
Objective After cross-cultural adaption for the German translation of the Ankle-Hindfoot Scale of the American Orthopaedic Foot and Ankle Society (AOFAS-AHS) and agreement analysis with the Foot Function Index (FFI-D), the following gait analysis study using the Oxford Foot Model (OFM) was carried out to show which of the two scores better correlates with objective gait dysfunction. Design and participants Results of the AOFAS-AHS and FFI-D, as well as data from three-dimensional gait analysis were collected from 20 patients with mild to severe ankle and hindfoot pathologies. Kinematic and kinetic gait data were correlated with the results of the total AOFAS scale and FFI-D as well as the results of those items representing hindfoot function in the AOFAS-AHS assessment. With respect to the foot disorders in our patients (osteoarthritis and prearthritic conditions), we correlated the total range of motion (ROM) in the ankle and subtalar joints as identified by the OFM with values identified during clinical examination ‘translated’ into score values. Furthermore, reduced walking speed, reduced step length and reduced maximum ankle power generation during push-off were taken into account and correlated to gait abnormalities described in the scores. An analysis of correlations with CIs between the FFI-D and the AOFAS-AHS items and the gait parameters was performed by means of the Jonckheere-Terpstra test; furthermore, exploratory factor analysis was applied to identify common information structures and thereby redundancy in the FFI-D and the AOFAS-AHS items. Results Objective findings for hindfoot disorders, namely a reduced ROM, in the ankle and subtalar joints, respectively, as well as reduced ankle power generation during push-off, showed a better correlation with the AOFAS-AHS total score—as well as AOFAS-AHS items representing ROM in the ankle, subtalar joints and gait function—compared with the FFI-D score. 
Factor analysis, however, could not identify FFI-D items consistently related to these three indicator parameters (pain, disability and function) found in the AOFAS-AHS. Furthermore, factor analysis did not support stratification of the FFI-D into two subscales. Conclusions The AOFAS-AHS showed a good agreement with objective gait parameters and is therefore better suited to evaluate disability and functional limitations of patients suffering from foot and ankle pathologies compared with the FFI-D. PMID:29626046
Weeks, Clinton S; Humphreys, Michael S; Cornwell, T Bettina
2018-02-01
Brands engaged in sponsorship of events commonly have objectives that depend on consumer memory for the sponsor-event relationship (e.g., sponsorship awareness). Consumers however, often misattribute sponsorships to nonsponsor competitor brands, indicating erroneous memory for these relationships. The current research uses an item and relational memory framework to reveal sponsor brands may inadvertently foster this misattribution when they communicate relational linkages to events. Effects can be explained via differential roles of communicating item information (information that supports processing item distinctiveness) versus relational information (information that supports processing relationships among items) in contributing to memory outcomes. Experiment 1 uses event-cued brand recall to show that correct memory retrieval is best supported by communicating relational information when sponsorship relationships are not obvious (low congruence). In contrast, correct retrieval is best supported by communicating item information when relationships are obvious (high congruence). Experiment 2 uses brand-cued event recall to show that, against conventional marketing recommendations, relational information increases misattribution, whereas item information guards against misattribution. Results suggest sponsor brands must distinguish between item and relational communications to enhance correct retrieval and limit misattribution. Methodologically, the work shows that choice of cueing direction is critical in differentially revealing patterns of correct and incorrect retrieval with pair relationships. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Development of the Abbreviated Masculine Gender Role Stress Scale
Swartout, Kevin M.; Parrott, Dominic J.; Cohn, Amy M.; Hagman, Brett T.; Gallagher, Kathryn E.
2014-01-01
Data gathered from six independent samples (n = 1,729) that assessed masculine gender role stress in college and community males were aggregated and used to determine the reliability and validity of an abbreviated version of the Masculine Gender Role Stress Scale (MGRS scale). The 15 items with the highest item-to-total scale correlations were used to create an abbreviated MGRS scale. Psychometric properties of each of the 15 items were examined with Item Response Theory (IRT) analysis, using the discrimination and threshold parameters. IRT results showed that the abbreviated scale may hold promise at capturing the same amount of information as the full 40-item scale. Relative to the 40-item scale, the total score of the abbreviated MGRS scale demonstrated comparable convergent validity using the measurement domains of masculine identity, hyper-masculinity, trait anger, anger expression, and alcohol involvement. An abbreviated MGRS scale may be recommended for use in clinical practice and research settings to reduce cost, time, and patient/participant burden. Additionally, IRT analyses identified items with higher discrimination and threshold parameters that may be used to screen for problematic gender role stress in men who may be seen in routine clinical or medical practice. PMID:25528163
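To make the discrimination and threshold parameters mentioned above concrete, here is a minimal sketch. It assumes a two-parameter logistic (2PL) model for a dichotomous item; the abstract does not state which IRT model was fitted (a graded model is plausible for rating-scale items), so the function names and parameter values below are illustrative assumptions only.

```python
import math


def p_endorse(theta, a, b):
    """2PL probability of endorsing an item.

    theta: latent trait level, a: discrimination, b: threshold (location).
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at trait level theta: a^2 * P * (1 - P)."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)


# Hypothetical items: same threshold, different discrimination.
for a in (1.0, 2.0):
    print(a, p_endorse(1.0, a, 1.0), item_information(1.0, a, 1.0))
```

Under this model an item is most informative near its threshold, and a more discriminating item contributes more information there, which is why high-discrimination items with high thresholds are natural candidates for screening at the upper end of the trait.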
Development of the Abbreviated Masculine Gender Role Stress Scale.
Swartout, Kevin M; Parrott, Dominic J; Cohn, Amy M; Hagman, Brett T; Gallagher, Kathryn E
2015-06-01
Data gathered from 6 independent samples (n = 1,729) that assessed masculine gender role stress in college and community males were aggregated and used to determine the reliability and validity of an abbreviated version of the Masculine Gender Role Stress (MGRS) Scale. The 15 items with the highest item-to-total scale correlations were used to create an abbreviated MGRS Scale. Psychometric properties of each of the 15 items were examined with item response theory (IRT) analysis, using the discrimination and threshold parameters. IRT results showed that the abbreviated scale may hold promise at capturing the same amount of information as the full 40-item scale. Relative to the 40-item scale, the total score of the abbreviated MGRS Scale demonstrated comparable convergent validity using the measurement domains of masculine identity, hypermasculinity, trait anger, anger expression, and alcohol involvement. An abbreviated MGRS Scale may be recommended for use in clinical practice and research settings to reduce cost, time, and patient/participant burden. Additionally, IRT analyses identified items with higher discrimination and threshold parameters that may be used to screen for problematic gender role stress in men who may be seen in routine clinical or medical practice. (c) 2015 APA, all rights reserved.
Validation of Gujarati Version of ABILOCO-Kids Questionnaire.
Diwan, Shraddha; Diwan, Jasmin; Patel, Pankaj; Bansal, Ankita B
2015-10-01
ABILOCO-Kids is a measure of locomotion ability for children with cerebral palsy (CP) aged 6 to 15 years and is available in English and French. The aim was to validate the Gujarati version of the ABILOCO-Kids questionnaire for use in clinical research on the Gujarati population. The ABILOCO-Kids questionnaire was translated into Gujarati from English using the forward-backward-forward method. To ensure face and content validity of the Gujarati version using the group consensus method, each item was examined by a group of experts with a mean experience of 24.62 years in the field of paediatrics and paediatric physiotherapy. Each item was analysed for content, meaning, wording, format, ease of administration and scoring. Each item was scored by the expert group as accepted, rejected or accepted with modification. The procedure was continued until 80% consensus was reached for all items. Concurrent validity was examined in 55 children with cerebral palsy (6-15 years) of all Gross Motor Function Classification System (GMFCS) levels and all clinical types by correlating the ABILOCO-Kids score with the Gross Motor Function Measure (GMFM) and the GMFCS. In phase 1 of validation, 16 items were accepted as they were, 22 items were accepted with modification, and 3 items went on to phase 2 validation. For concurrent validity, a highly significant positive correlation was found between the ABILOCO-Kids score and the total GMFM (r=0.713, p<0.005), and a highly significant negative correlation was found with the GMFCS (r= -0.778, p<0.005). The Gujarati version of the ABILOCO-Kids questionnaire has good face and content validity as well as concurrent validity and can be used to measure caregiver-reported locomotion ability in children with CP.
Rescorla, Leslie A; Achenbach, Thomas M; Ivanova, Masha Y; Harder, Valerie S; Otten, Laura; Bilenberg, Niels; Bjarnadottir, Gudrun; Capron, Christiane; De Pauw, Sarah S W; Dias, Pedro; Dobrean, Anca; Döpfner, Manfred; Duyme, Michel; Eapen, Valsamma; Erol, Nese; Esmaeili, Elaheh Mohammad; Ezpeleta, Lourdes; Frigerio, Alessandra; Fung, Daniel S S; Gonçalves, Miguel; Guðmundsson, Halldór; Jeng, Suh-Fang; Jusiené, Roma; Ah Kim, Young; Kristensen, Solvejg; Liu, Jianghong; Lecannelier, Felipe; Leung, Patrick W L; Machado, Bárbara César; Montirosso, Rosario; Ja Oh, Kyung; Ooi, Yoon Phaik; Plück, Julia; Pomalima, Rolando; Pranvera, Jetishi; Schmeck, Klaus; Shahini, Mimoza; Silva, Jaime R; Simsek, Zeynep; Sourander, Andre; Valverde, José; van der Ende, Jan; Van Leeuwen, Karla G; Wu, Yen-Tzu; Yurdusen, Sema; Zubrick, Stephen R; Verhulst, Frank C
2011-01-01
International comparisons were conducted of preschool children's behavioral and emotional problems as reported on the Child Behavior Checklist for Ages 1½-5 by parents in 24 societies (N = 19,850). Item ratings were aggregated into scores on syndromes; Diagnostic and Statistical Manual of Mental Disorders-oriented scales; a Stress Problems scale; and Internalizing, Externalizing, and Total Problems scales. Effect sizes for scale score differences among the 24 societies ranged from small to medium (3-12%). Although societies differed greatly in language, culture, and other characteristics, Total Problems scores for 18 of the 24 societies were within 7.1 points of the omnicultural mean of 33.3 (on a scale of 0-198). Gender and age differences, as well as gender and age interactions with society, were all very small (effect sizes < 1%). Across all pairs of societies, correlations between mean item ratings averaged .78, and correlations between internal consistency alphas for the scales averaged .92, indicating that the rank orders of mean item ratings and internal consistencies of scales were very similar across diverse societies.
Rescorla, Leslie A.; Achenbach, Thomas M.; Ivanova, Masha Y.; Harder, Valerie S.; Otten, Laura; Bilenberg, Niels; Bjarnadottir, Gudrun; Capron, Christiane; De Pauw, Sarah S. W.; Dias, Pedro; Dobrean, Anca; Döpfner, Manfred; Duyme, Michel; Eapen, Valsamma; Erol, Nese; Esmaeili, Elaheh Mohammad; Ezpeleta, Lourdes; Frigerio, Alessandra; Fung, Daniel S. S.; Gonçalves, Miguel; Guđmundsson, Halldór; Jeng, Suh-Fang; Jusiené, Roma; Kim, Young Ah; Kristensen, Solvejg; Liu, Jianghong; Lecannelier, Felipe; Leung, Patrick W. L.; Machado, Bárbara César; Montirosso, Rosario; Oh, Kyung Ja; Ooi, Yoon Phaik; Plück, Julia; Pomalima, Rolando; Pranvera, Jetishi; Schmeck, Klaus; Shahini, Mimoza; Silva, Jaime R.; Simsek, Zeynep; Sourander, Andre; Valverde, José; van der Ende, Jan; Van Leeuwen, Karla G.; Wu, Yen-Tzu; Yurdusen, Sema; Zubrick, Stephen R.; Verhulst, Frank C.
2014-01-01
International comparisons were conducted of preschool children’s behavioral and emotional problems as reported on the Child Behavior Checklist for Ages 1½–5 by parents in 24 societies (N =19,850). Item ratings were aggregated into scores on syndromes; Diagnostic and Statistical Manual of Mental Disorders–oriented scales; a Stress Problems scale; and Internalizing, Externalizing, and Total Problems scales. Effect sizes for scale score differences among the 24 societies ranged from small to medium (3–12%). Although societies differed greatly in language, culture, and other characteristics, Total Problems scores for 18 of the 24 societies were within 7.1 points of the omnicultural mean of 33.3 (on a scale of 0–198). Gender and age differences, as well as gender and age interactions with society, were all very small (effect sizes <1%). Across all pairs of societies, correlations between mean item ratings averaged .78, and correlations between internal consistency alphas for the scales averaged .92, indicating that the rank orders of mean item ratings and internal consistencies of scales were very similar across diverse societies. PMID:21534056
NASA Astrophysics Data System (ADS)
Isik, Hakan
This study is premised on the fact that student conceptions of optics appear to be unrelated to student characteristics of gender, age, years since high school graduation, or previous academic experiences. This study investigated the relationships between student characteristics and student performance on image formation test items and the changes in student conceptions of optics after an introductory inquiry-based physics course. Data were collected from 39 college students who were involved in an inquiry-based physics course teaching topics of geometrical optics. Student data concerning characteristics and previous experiences with optics and mathematics were collected. Student understanding of optics knowledge for pinholes, plane mirrors, refraction, and convex lenses was assessed with the Test of Image Formation with Light-Ray Tracing instrument. Total scale and subscale scores representing the optics instrument content were derived from student pretest and posttest responses. The types of knowledge needed to answer each optics item correctly were categorized as situational, conceptual, procedural, and strategic knowledge. These types of knowledge were associated with student correct and incorrect responses to each item to explain the existence of, and changes in, student scientific and naive conceptions. Correlation and stepwise multiple regression analyses were conducted to identify the student characteristics and academic experiences that significantly predicted scores on the subscales of the test. The results showed that student experience with calculus was a significant predictor of student performance on the total scale as well as on the refraction subscale of the Test of Image Formation with Light-Ray Tracing. A combination of student age and previous academic experience with precalculus was a significant predictor of student performance on the pretest pinhole subscale. 
The student characteristic of years since high school graduation significantly predicted the gain in student scores on pinhole and plane-mirror items from the pretest to the posttest, with the most recent high school graduates doing better. Multivariate and univariate analyses of variance of the Test of Image Formation with Light-Ray Tracing pinhole scale and individual item changes from the pretest to the posttest resulted in statistically significant mean differences between total scores as well as between various individual pinhole items. There were no significant changes for individual plane-mirror items from pretest to posttest. Results revealed that there is a perceivable relationship between student optics-content knowledge and the types of knowledge required by items. At the pretest, the most wrong responses were selected on items requiring situational knowledge and the fewest wrong responses were selected on items requiring procedural knowledge. 
Student selection of wrong options for each item revealed the following naive optics conceptions: pinholes do not create reversed images (pretest), size and sharpness of pinhole images are related to the focus of a pinhole camera (pretest and posttest); propagation of light rays are interpreted as being radial rather than directional (pretest and posttest); no conception of image formation and observation for parallel mirrors (pretest and posttest), the place of an image depends on the position of the observer (pretest and posttest), a plane mirror reflects the images of the objects placed at one side of the mirror and the observers who were positioned at the other side of the mirror can see them (pretest and posttest); applying the law of reflection to plane mirrors without considering the variations in angles of incidence and reflection (pretest and posttest), and image observation is confused with the image formation in mirrors placed perpendicular to one another (pretest and posttest). Future research should focus on the acquisition, development, and identification of reliable measures of optics concepts, processes, types of knowledge, and specific optics understanding (i.e., pinhole, plane-mirror). Future research should focus on the identification of the more critical concepts such as changes in size and sharpness of pinhole images, image observation, image formation in general, and image formation and observation in parallel mirrors. Future research can be conducted with a larger set of participants so as to compare different instructional methods and address instructional deficiencies using more efficient statistical methods. Comparative studies can be conducted to investigate the relations of various instructional strategies on student conceptions of optics.
[Relationship between occupational stress and mental health in offshore oil platform workers].
Wu, Hongtao; Xiao, Taiqin; Zou, Jianfang; Shan, Yongle; Li, Zijian
2014-02-01
To investigate the relationship between occupational stress and mental health in offshore oil platform workers and to provide a scientific basis for protection of their mental health. A total of 768 workers on offshore oil platforms were surveyed with the Occupational Stress Inventory Revised Edition and the Symptom Check List-90 (SCL-90). The total score of the Occupational Role Questionnaire (ORQ) for the workers (160.27±24.63) was significantly lower than the national norm (166.52±27.01) (P < 0.01); the total score of the Personal Strain Questionnaire (PSQ) (101.96±19.8) was significantly higher than the national norm (92.45±17.33) (P < 0.01). The total score of the Personal Resource Questionnaire (PRQ) for the workers was not significantly different from the national norm (P > 0.05), but the items of recreation, social support, and rational/cognitive showed significant differences (P < 0.05). The total score of the SCL-90 was positively correlated with all items of the ORQ and PSQ (P < 0.01) and negatively correlated with all items of the PRQ (P < 0.01). Multiple stepwise regression analysis showed that current work seniority, education background, drinking, role overload, role insufficiency, role ambiguity, responsibility, physical environment, and rational/cognitive conduct affected the SCL-90 score (P < 0.05). The mental health of workers on offshore oil platforms is related to occupational stress; role overload, role ambiguity, physical environment, and rational/cognitive conduct, among other factors, are closely associated with the workers' mental health.
Pérez, Cynthia M; del Carmen Santos, María; Torres, Aurinés; Grana, Carlos; Albizu-García, Carmen
2015-09-01
Given the heavy burden of hepatitis C virus (HCV) and human immunodeficiency virus (HIV) infections in correctional facilities, we examined knowledge about these infections among case workers and correctional officers in penal institutions in Puerto Rico. We used data from a cross-sectional study of state prisons, commissioned by the Puerto Rico Department of Correction and Rehabilitation, to assess knowledge about HCV and HIV (10 items each) among 256 case workers and correctional officers from 18 penal institutions selected in the prison system. Total scores for each scale ranged from 0 to 10 points, with higher scores reflecting more knowledge. Of 256 participants, 64.8% were males, 39.6% were aged 30-39 years, and 70.3% were case workers. The percentage of correct responses for knowledge items ranged from 8.5% to 97.0% for HCV infection and from 38.7% to 99.6% for HIV infection. The vast majority (>96%) of participants knew that injection drug users should be tested for HCV infection and that sharing of needle injection equipment and multiple sex partners increase the risk of HIV infection. However, misconceptions about routes of transmission for these viral infections were found, with larger gaps in knowledge for HCV infection. Mean knowledge scores for HCV and HIV infections were 4.20±0.17 and 6.95±0.22, respectively, being significantly (p<0.05) higher for case workers. The findings about HCV and HIV knowledge in an important segment of the correctional system staff support the urgent need for increasing educational opportunities for correctional staff.
Celik, Selda; Pinar, Rukiye
2016-09-01
To examine the psychometric properties of a Turkish version of the Diabetes Fear of Injecting and Self-testing Questionnaire (D-FISQ). Forward-backward translation of the D-FISQ from English into Turkish was conducted. Original English and translated forms were examined by a panel group. Validity was investigated using content, confirmatory factor analysis, and divergent validity. Reliability was assessed using Cronbach α values, item-total correlations, and intraclass correlations. The sample comprised 350 patients with diabetes. Data were analyzed using SPSS 15.0 for Windows and LISREL 8. The content validity index for the panel members was .90, which indicated perfect content validity; items in D-FISQ were clear, concise, readable, and distinct. Confirmatory factor analysis confirmed the original construct of the D-FISQ. All items had factor loadings higher than the recommended level of .40. The D-FISQ scores were discriminated by the level of anxiety. Reliability results were also satisfactory. Cronbach α values were within ideal limits. Item-total correlation coefficient ranged from .72 to .86. In terms of test-retest reliability, intraclass correlation coefficient was found to be over .90. D-FISQ is a valid and reliable questionnaire in assessing needle-prick fear among Turkish patients with diabetes. We recommend performing the Turkish D-FISQ in determining and screening patients with diabetes who have fear related to self-insulin injection and finger-prick test. Thus, health care professionals should be aware of the potential consequences of injection fear such as insulin misuse and poor self-monitoring of blood glucose, which may have unfavorable effects on optimal diabetes management. Copyright © 2016. Published by Elsevier B.V.
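The two internal-consistency statistics named in the abstract above, Cronbach's α and the item-total correlation, can be sketched as follows. This is a minimal illustration assuming the usual formulas (α = k/(k-1) · (1 - Σ item variances / variance of total score), and the corrected item-total correlation as the Pearson correlation of each item with the sum of the remaining items). The function names and the toy response matrix are hypothetical and are not data from the study.

```python
from statistics import mean, variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    item_columns = list(zip(*rows))
    sum_item_var = sum(variance(col) for col in item_columns)
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum_item_var / total_var)


def pearson(x, y):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def corrected_item_total(rows):
    """Correlation of each item with the total score of the other items."""
    k = len(rows[0])
    result = []
    for j in range(k):
        item = [r[j] for r in rows]
        rest = [sum(r) - r[j] for r in rows]  # total excluding item j
        result.append(pearson(item, rest))
    return result


# Hypothetical toy data: 5 respondents x 3 items (assumed for illustration).
toy = [[1, 2, 1], [2, 2, 3], [3, 4, 3], [4, 4, 5], [5, 5, 4]]
print(cronbach_alpha(toy))
print(corrected_item_total(toy))
```

Excluding the item from its own total avoids inflating the correlation, which matters most for short scales.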
Park, Young-Min; Lee, Bun-Hee; Lee, Seung-Hwan
2014-04-01
There is some evidence that low lipid levels cause suicide in depressed patients. The purpose of this study was to identify whether low serum lipid levels are associated with suicide ideation or are correlated with central serotonin function. Auditory processing for the loudness dependence of auditory evoked potentials (LDAEP) was measured in 73 outpatients with major depressive disorder. The Hamilton Depression Rating Scale (HAMD) and the Beck Depression Inventory (BDI) were administered on the same day as measurement of the LDAEP. In addition, serum levels of total cholesterol, low-density lipoprotein (LDL), high-density lipoprotein (HDL), and triglyceride (TG) levels were measured. All subjects had received antidepressant monotherapy. The depressed subjects were divided into those with and without suicide ideation according to the score for HAMD item 3 or BDI item 9. TG levels differed significantly between the two groups, whereas body mass index (BMI), total cholesterol, LDL, HDL, and LDAEP did not. The scores for HAMD item 3 and BDI item 9 were negatively correlated with TG levels (p=0.045 and 0.026, respectively). The LDAEP was negatively correlated with TG levels (p=0.012). Although there was tendency toward a negative correlation between the LDAEP and serum LDL, it did not reach statistical significance (p=0.068). The cross-sectional design of this study means that baseline serum lipid levels were not measured. The findings of this study revealed a relationship between TG and suicide ideation that is independent of both BMI and body weight. Furthermore, serum lipid levels were associated with central serotonergic activity, as assessed using the LDAEP. Copyright © 2014 Elsevier B.V. All rights reserved.
Validity and reliability of the Persian version of mobile phone addiction scale.
Mazaheri, Maryam Amidi; Karbasi, Mojtaba
2014-02-01
Given the large number of mobile phone users, especially among college students in Iran, addiction to mobile phones is attracting increasing concern. There is an urgent need for a reliable and valid instrument to measure this phenomenon. This study examines the validity and reliability of the Persian version of the mobile phone addiction scale (MPAIS) in college students. This methodological study was done at Isfahan University of Medical Sciences. One thousand one hundred and eighty students were selected by convenience sampling. The English version of the MPAI questionnaire was translated into Persian with the approach of Jones et al. (Challenges in language, culture, and modality: Translating English measures into American Sign Language. Nurs Res 2006; 55: 75-81). Its reliability was tested by Cronbach's alpha and its dimensionality validity was evaluated using Pearson correlation coefficients with other measures of mobile phone use and IAT. Construct validity was evaluated using Exploratory subscale analysis. A Cronbach's alpha of 0.86 was obtained for the total PMPAS; for subscale 1 (eight items) it was 0.84, for subscale 2 (five items) 0.81, and for subscale 3 (two items) 0.77. There were significantly positive correlations between the score of PMPAS and IAT (r = 0.453, P < 0.001) and other measures of mobile phone use. Principal component subscale analysis yielded a three-subscale structure (inability to control craving; feeling anxious and lost; mood improvement) that accounted for 60.57% of total variance. The results for discriminant validity showed that all items' correlations with their related subscale were greater than 0.5 and correlations with unrelated subscales were less than 0.5. Considering the lack of a valid and reliable questionnaire for measuring addiction to the mobile phone, PMPAS could be a suitable instrument for measuring mobile phone addiction in future research.
Cil Akinci, Ayse; Pinar, Rukiye
2014-02-01
To investigate the validity and reliability of the Caregiver Burden Scale in family members who provide primary care for haemodialysis patients. In Turkey, there is a need for a multi-dimensional instrument to evaluate the caregiver burden in people who provide care for patients with chronic diseases. A methodological study. The study sample consisted of 161 family members who provide primary care for haemodialysis patients. The forward-backward translation method was used to develop the Turkish Caregiver Burden Scale. The reliability was based on internal consistency investigated by Cronbach's alpha and item-total correlation. The factorial construct validity of the scale was tested with confirmatory factor analysis. By means of convergent and divergent validity, correlation between Caregiver Burden Scale and 36-Item Short Form Health Survey (SF-36) and correlation between Caregiver Burden Scale and the Maslach Burnout Scale were investigated. Cronbach's alpha and item-total correlation results suggested that there was good internal reliability. We found five underlying factors similar to the original scale's five-factor solution. The confirmatory factor analysis five-factor model represented an acceptable fit. Factor loadings were significant, with standardised loadings ranging from 0·43 to 0·81. By means of divergent validity, all sub-dimension scores and the total score of the Caregiver Burden Scale were negatively correlated with the SF-36, whereas there was a positive correlation with the emotional exhaustion and depersonalisation subscales of the Maslach Burnout Scale as expected. These results suggest that the Caregiver Burden Scale is a reliable and valid instrument which can be used with confidence in Turkish caregivers for haemodialysis patients to screen caregiver burden. The burden experienced by people who provide care for patients with chronic diseases can be evaluated with the Caregiver Burden Scale. Additionally, the Caregiver Burden Scale can be used in the evaluation of the effectiveness of attempts to decrease caregiver burden. © 2012 Blackwell Publishing Ltd.
Validation of Turkish version of brief negative symptom scale.
Polat Nazlı, Irmak; Ergül, Ceylan; Aydemir, Ömer; Chandhoke, Swati; Üçok, Alp; Gönül, Ali Saffet
2016-11-01
Negative symptoms in schizophrenia have been assessed by many instruments. However, a current consensus on these symptoms has been built and new tools, such as the Brief Negative Symptom Scale (BNSS), have been developed. This study aimed to evaluate the reliability and validity of the Turkish version of BNSS. The scale was translated into Turkish and back-translated into English. After the approval of the translation, 75 schizophrenia patients were interviewed with BNSS, Positive and Negative Syndrome Scale (PANSS), Calgary Depression Scale for Schizophrenia (CDSS) and Extrapyramidal Symptom Rating Scale (ESRS). Reliability and validity analyses were then performed. In the reliability analysis, the Cronbach's alpha coefficient was 0.96 and item-total score correlation coefficients were between 0.655 and 0.884. The intraclass correlation coefficient was 0.665. The inter-rater reliability was 0.982 (p < 0.0001). In the validity analysis, the total score of BNSS-TR was correlated with PANSS Total Score, Positive Symptoms Subscale, Negative Symptoms Subscale, and General Psychopathology Subscale. CDSS and ESRS were not correlated with BNSS-TR. The factor structure of the scale consisted of the same items as in the original version. Our study confirms that the Turkish version of BNSS is an applicable tool for the evaluation of negative symptoms in schizophrenia.
Parr, Jeremy R; De Jonge, Maretha V; Wallace, Simon; Pickles, Andrew; Rutter, Michael L; Le Couteur, Ann S; van Engeland, Herman; Wittemeyer, Kerstin; McConachie, Helen; Roge, Bernadette; Mantoulan, Carine; Pedersen, Lennart; Isager, Torben; Poustka, Fritz; Bolte, Sven; Bolton, Patrick; Weisblatt, Emma; Green, Jonathan; Papanikolaou, Katerina; Baird, Gillian; Bailey, Anthony J
2015-10-01
Clinical genetic studies confirm the broader autism phenotype (BAP) in some relatives of individuals with autism, but there are few standardized assessment measures. We developed three BAP measures (informant interview, self-report interview, and impression of interviewee observational scale) and describe the development strategy and findings from the interviews. International Molecular Genetic Study of Autism Consortium data were collected from families containing at least two individuals with autism. Comparison of the informant and self-report interviews was restricted to samples in which the interviews were undertaken by different researchers from that site (251 UK informants, 119 from the Netherlands). Researchers produced vignettes that were rated blind by others. Retest reliability was assessed in 45 participants. Agreement between live scoring and vignette ratings was very high. Retest stability for the interviews was high. Factor analysis indicated a first factor comprising social-communication items and rigidity (but not other repetitive domain items), and a second factor comprised mainly of reading and spelling impairments. Whole scale Cronbach's alphas were high for both interviews. The correlation between interviews for factor 1 was moderate (adult items 0.50; childhood items 0.43); Kappa values for between-interview agreement on individual items were mainly low. The correlations between individual items and total score were moderate. The inclusion of several factor 2 items lowered the overall Cronbach's alpha for the total set. Both interview measures showed good reliability and substantial stability over time, but the findings were better for factor 1 than factor 2. We recommend factor 1 scores be used for characterising the BAP. © 2015 The Authors Autism Research published by Wiley Periodicals, Inc. on behalf of International Society for Autism Research.
Development of knowledge tests for multi-disciplinary emergency training: a review and an example.
Sørensen, J L; Thellesen, L; Strandbygaard, J; Svendsen, K D; Christensen, K B; Johansen, M; Langhoff-Roos, P; Ekelund, K; Ottesen, B; Van Der Vleuten, C
2015-01-01
The literature is sparse on written test development in a post-graduate multi-disciplinary setting. Developing and evaluating knowledge tests for use in multi-disciplinary post-graduate training is challenging. The objective of this study was to describe the process of developing and evaluating a multiple-choice question (MCQ) test for use in a multi-disciplinary training program in obstetric-anesthesia emergencies. A multi-disciplinary working committee with 12 members representing six professional healthcare groups and another 28 participants were involved. Recurrent revisions of the MCQ items were undertaken followed by a statistical analysis. The MCQ items were developed stepwise, including decisions on aims and content, followed by testing for face and content validity, construct validity, item-total correlation, and reliability. To obtain acceptable content validity, 40 out of originally 50 items were included in the final MCQ test. The MCQ test was able to distinguish between levels of competence, and good construct validity was indicated by a significant difference in the mean score between consultants and first-year trainees, as well as between first-year trainees and medical and midwifery students. Evaluation of the item-total correlation analysis in the 40 items set revealed that 11 items needed re-evaluation, four of which addressed content issues in local clinical guidelines. A Cronbach's alpha of 0.83 for reliability was found, which is acceptable. Content and construct validity and reliability were acceptable. The presented template for the development of this MCQ test could be useful to others when developing knowledge tests and may enhance the overall quality of test development. © 2014 The Acta Anaesthesiologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
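The item-total correlation step described above (used to flag MCQ items for re-evaluation) can be sketched as follows. This is a minimal illustration, assuming the common "corrected" variant in which each item is correlated with the sum of the remaining items. The 0/1 response matrix and the cut-off of 0.2 are hypothetical; the abstract does not state the criterion the authors applied.

```python
from math import sqrt

def pearson(x, y):
    """Plain Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def corrected_item_total(rows):
    """Correlate each item with the total of all *other* items."""
    totals = [sum(r) for r in rows]
    result = []
    for j in range(len(rows[0])):
        item = [r[j] for r in rows]
        rest = [t - v for t, v in zip(totals, item)]   # total without item j
        result.append(pearson(item, rest))
    return result

# Hypothetical responses: 6 examinees x 4 MCQ items (1 = correct, 0 = wrong).
responses = [
    [1, 1, 1, 0],
    [1, 1, 1, 0],
    [1, 1, 0, 1],
    [0, 1, 0, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
r_values = corrected_item_total(responses)

CUTOFF = 0.2  # assumed threshold, for illustration only
flagged = [j for j, r in enumerate(r_values) if r < CUTOFF]
print(flagged)
```

In this toy matrix the fourth item runs against the rest of the test, so it is the kind of item such a screen would send back for content review.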
[Psychometric Characteristics of the Clinical Nursing Mentors' Behavior Scale].
Zhao, Rong; Chen, Yan-Hua; Yu, Hui-Ting; Xiao, Lu; Wen, Jing; Yeh, Tzu-Pei
2017-08-01
The behavior of mentors impacts the quality and experience of nursing students who are studying in clinical placement. Accurately assessing the behavior of mentors is fundamental to training, regulating, guiding, and improving their behavior and quality of teaching. To test the validity and reliability of the Clinical Nursing Mentors' Behavior Scale (CNMBS) among mentors. This study included three stages. During the first stage, seven Chinese experts were invited to evaluate content validity. During the second stage, the test-retest reliability was examined with 63 mentors. During the third stage, a cross-sectional study was conducted. Seven hundred and sixty-six nursing mentors from five hospitals in Beijing, Shenzhen, and Sichuan completed the survey either online or in hard copy form. The data collected from the questionnaire were analyzed using item analysis, construct validity, internal consistency and discriminant validity, with the results used to determine the psychometric characteristics of the CNMBS. The content validity index for the CNMBS was .91. The intra-class correlation coefficient was .89; the range of the item discrimination critical ratio was 9.42-22.43 (p < .001), and the item-total correlation was .35- .70 (p < .001). The three factors of "guiding personal growth", "promoting professional development", and "providing psychosocial support" and a total of 23 items were identified, with item factor loadings ranging from .51 to .79. The three factors explained 50.99% of total variance. The internal consistency of the CNMBS earned a Cronbach's α coefficient of .92, while those of the three subscales were .89, .86 and .75, respectively. The Clinical Nursing Mentors' Behavior Scale demonstrated high validity and reliability, supporting the CNMBS as a valid tool for assessing the teaching behavior of mentors.
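The "item discrimination critical ratio" reported above is typically an extreme-groups comparison: respondents are ranked by total score, and the item means of the highest and lowest scorers are compared with a t statistic. The sketch below assumes the conventional 27% cut for the extreme groups and an unpooled-variance t statistic; neither detail is stated in the abstract, and the data are invented.

```python
from math import sqrt
from statistics import mean, variance

def critical_ratio(rows, item, fraction=0.27):
    """t statistic comparing one item between high and low total scorers.

    fraction=0.27 is an assumed convention, not taken from the study.
    """
    ranked = sorted(rows, key=sum)                 # ascending total score
    g = max(2, round(len(rows) * fraction))        # size of each extreme group
    low = [r[item] for r in ranked[:g]]
    high = [r[item] for r in ranked[-g:]]
    se = sqrt(variance(high) / g + variance(low) / g)
    return (mean(high) - mean(low)) / se

# Hypothetical data: 10 respondents x 3 items on a 1-5 scale.
data = [
    [1, 2, 1], [2, 1, 2], [2, 2, 2], [3, 3, 2], [3, 2, 3],
    [3, 3, 3], [4, 3, 3], [4, 4, 5], [5, 4, 4], [5, 5, 5],
]
cr = critical_ratio(data, item=0)
print(round(cr, 2))
```

The function will raise a `ZeroDivisionError` if the item has no variance in either extreme group, which a production version would need to handle.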
Vastamäki, Heidi; Vastamäki, Martti; Laimi, Katri; Saltychev, Michail
2017-07-01
Poorly functioning work environments may lead to dissatisfaction for the employees and financial loss for the employers. The Job Content Questionnaire (JCQ) was designed to measure social and psychological characteristics of work environments. To investigate the factor construct of the Finnish 14-item version of JCQ when applied to professional orchestra musicians. In a cross-sectional survey, the questionnaire was sent by mail to 1550 orchestra musicians and students. 630 responses were received. Full data were available for 590 respondents (response rate 38%). The questionnaire also contained questions on demographics, job satisfaction, health status, health behaviors, and intensity of playing music. Confirmatory factor analysis of the 2-factor model of JCQ was conducted. Of the 5 estimates, JCQ items in the "job demand" construct, the "conflicting demands" (question 5) explained most of the total variance in this construct (79%) demonstrating almost perfect correlation of 0.63. In the construct of "job control," "opinions influential" (question 10) demonstrated a perfect correlation index of 0.84 and the items "little decision freedom" (question 14) and "allows own decisions" (question 6) showed substantial correlations of 0.77 and 0.65. The 2-factor model of the Finnish 14-item version of JCQ proposed in this study fitted the observed data well. The "conflicting demands," "opinions influential," "little decision freedom," and "allows own decisions" items demonstrated the strongest correlations with latent factors, suggesting that in a population similar to the studied one, especially these items should be taken into account when observed in the response of a population.
Pompilus, Farrah; Burgess, Somali; Hudgens, Stacie; Banderas, Benjamin; Daniels, Selena
2015-12-01
Facial lines or wrinkles are among the most visible signs of aging, and minimally invasive cosmetic procedures are becoming increasingly popular. The aim of this study was to develop and validate the Facial Line Satisfaction Questionnaire (FLSQ) for use in adults with upper facial lines (UFL). A literature review, concept elicitation interviews (n = 33), and cognitive debriefing interviews (n = 23) of adults with UFL were conducted to develop the FLSQ. The FLSQ comprises Baseline and Follow-up versions and was field-tested with 150 subjects in a US observational study designed to assess its psychometric performance. Analyses included acceptability (item and scale distribution [i.e. missingness, floor, and ceiling effects]), reliability, and validity (including concurrent validity). In total, 69 concepts were elicited during patient interviews. Following cognitive debriefing interviews, the FLSQ-Baseline version included 11 items and the Follow-up version included 13 items. Response rates for the FLSQ were 100% and 73% at baseline and follow-up, respectively; no items had excessive missing data. Questionnaire scale scores were normally distributed. Most domain scores demonstrated good internal consistency reliability (Cronbach's α ≥ 0.70). Most items within their respective domains exhibited good convergent (item-scale correlations > 0.40) and discriminant (items had higher correlation with their hypothesized scales than other scales) validity. Concurrent validity correlation coefficients of the FLSQ domain scores with the associated concurrent measures were acceptable (range: r = 0.40-0.70). Six FLSQ items demonstrated reliability and validity as stand-alone items outside their domains. The FLSQ is a valid questionnaire for assessing treatment expectations, satisfaction, impact, and preference in adults with UFL. © 2015 The Authors. Journal of Cosmetic Dermatology Published by Wiley Periodicals, Inc.
Johnson, Jeffrey D; Rugg, Michael D
2006-02-03
Retrieval orientation refers to the differential processing of retrieval cues according to the type of information sought from memory (e.g., words vs. pictures). In the present study, event-related potentials (ERPs) were employed to investigate whether the neural correlates of differential retrieval orientations are sensitive to the specificity of the retrieval demands of the test task. In separate study-test phases, subjects encoded lists of intermixed words and pictures, and then undertook one of two retrieval tests, in both of which the retrieval cues were exclusively words. In the recognition test, subjects performed 'old/new' discriminations on the test items, and old items corresponded to only one class of studied material (words or pictures). In the exclusion test, old items corresponded to both classes of study material, and subjects were required to respond 'old' only to test items corresponding to a designated class of material. Thus, demands for retrieval specificity were greater in the exclusion test than during recognition. ERPs elicited by correctly classified new items in the two types of test were contrasted according to whether words or pictures were the sought-for material. Material-dependent ERP effects were evident in both tests, but the effects onset earlier and offset later in the exclusion test. The findings suggest that differential processing of retrieval cues, and hence the adoption of differential retrieval orientations, varies according to the specificity of the retrieval goal.
van Rooij, Antonius J; Van Looy, Jan; Billieux, Joël
2017-07-01
Some people have serious problems controlling their Internet and video game use. The DSM-5 now includes a proposal for 'Internet Gaming Disorder' (IGD) as a condition in need of further study. Various studies aim to validate the proposed diagnostic criteria for IGD and multiple new scales have been introduced that cover the suggested criteria. Using a structured approach, we demonstrate that IGD might be better interpreted as a formative construct, as opposed to the current practice of conceptualizing it as a reflective construct. Incorrectly approaching a formative construct as a reflective one causes serious problems in scale development, including: (i) incorrect reliance on item-to-total scale correlation to exclude items and incorrectly relying on indices of inter-item reliability that do not fit the measurement model (e.g., Cronbach's α); (ii) incorrect interpretation of composite or mean scores that assume all items are equal in contributing value to a sum score; and (iii) biased estimation of model parameters in statistical models. We show that these issues are impacting current validation efforts through two recent examples. A reinterpretation of IGD as a formative construct has broad consequences for current validation efforts and provides opportunities to reanalyze existing data. We discuss three broad implications for current research: (i) composite latent constructs should be defined and used in models; (ii) item exclusion and selection should not rely on item-to-total scale correlations; and (iii) existing definitions of IGD should be enriched further. © 2016 The Authors. Psychiatry and Clinical Neurosciences © 2016 Japanese Society of Psychiatry and Neurology.
Moser, Debra K; Riegel, Barbara; McKinley, Sharon; Doering, Lynn V; Meischke, Hendrika; Heo, Seongkum; Lennie, Terry A; Dracup, Kathleen
2009-01-01
Perceived control is a construct with important theoretical and clinical implications for healthcare providers, yet practical application of the construct in research and clinical practice awaits development of an easily administered instrument to measure perceived control with evidence of reliability and validity. To test the psychometric properties of the Control Attitudes Scale-Revised (CAS-R) using a sample of 3,396 individuals with coronary heart disease, 513 patients with acute myocardial infarction, and 146 patients with heart failure. Analyses were done separately in each patient group. Reliability was assessed using Cronbach's alpha to determine internal consistency, and item homogeneity was assessed using item-total and interitem correlations. Validity was examined using principal component analysis and testing hypotheses about known associations. Cronbach's alpha values for the CAS-R in patients with coronary heart disease, acute myocardial infarction, and heart failure were all greater than .70. Item-total and interitem correlation coefficients for all items were acceptable in the groups. In factor analyses, the same single factor was extracted in all groups, and all items were loaded moderately or strongly to the factor in each group. As hypothesized in the final construct validity test, in all groups, patients with higher levels of perceived control had less depression and less anxiety compared with those of patients who had lower levels of perceived control. This study provides evidence of the reliability and validity of the 8-item CAS-R as a measure of perceived control in patients with cardiac illness and provides important insight into a key patient construct.
Shou, Juan; Ren, Limin; Wang, Haitang; Yan, Fei; Cao, Xiaoyun; Wang, Hui; Wang, Zhiliang; Zhu, Shanzhu; Liu, Yao
2016-04-01
The 12-item Short-Form Health Survey (SF-12) is the abridged practical version of SF-36. This cross-sectional study aimed to assess the reliability and validity of SF-12 for the health status of the Chinese community elderly population. The Chinese community elderly people in Xujiahui district of Shanghai were investigated. The internal consistency reliability was assessed using Cronbach's alpha and split-half reliability coefficients. Construct validity was analyzed using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Spearman's correlation coefficient (ρ) was used for the evaluation of criterion, convergent, and discriminant validity with Spearman's ρ ≥ 0.4 as satisfactory. Comparisons of the SF-12 summary scores among populations that differed in demographics were performed for discriminant validity. A total of 1343 individuals aged ≥60 and <85 years old (response rate: 91.3%) were analyzed. The Cronbach's α value (0.910) and the split-half reliability coefficient (0.812) reflected satisfactory internal consistency reliability of SF-12. EFA extracted a two-factor model (physical and mental health). About 60.7% of the total variance was explained by the two factors. CFA showed that the two-factor solution provided a good fit to the data. Good convergent validity and discriminant validity of SF-12 were shown by the correlation analyses (Spearman's ρ > 0.4) and the comparisons of the SF-12 summary scores among populations (P < 0.05). SF-12 summary scores were significantly correlated with the SF-36 summary scores (Spearman's ρ > 0.4, P < 0.05). In conclusion, SF-12 had satisfactory reliability and validity in measuring health status of the Chinese community elderly population in Xujiahui district of Shanghai.
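The split-half reliability coefficient mentioned above can be illustrated with a short Python sketch. It assumes an odd/even split of the items and the Spearman-Brown step-up formula, which is one common way to compute the coefficient; the abstract does not say which split or correction the authors used, and the data below are invented.

```python
from math import sqrt

def pearson(x, y):
    """Plain Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def split_half_reliability(rows):
    """Odd/even split-half reliability with Spearman-Brown correction."""
    first = [sum(r[0::2]) for r in rows]    # items 1, 3, 5, ...
    second = [sum(r[1::2]) for r in rows]   # items 2, 4, 6, ...
    r_half = pearson(first, second)
    return 2 * r_half / (1 + r_half)        # step up to full test length

# Hypothetical data: 5 respondents x 4 items.
answers = [
    [1, 2, 1, 2],
    [2, 2, 3, 3],
    [3, 4, 3, 4],
    [4, 4, 5, 5],
    [5, 5, 4, 5],
]
reliability = split_half_reliability(answers)
print(round(reliability, 3))
```

The result depends on how the items are divided, which is one reason Cronbach's alpha is usually reported alongside it.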
Psychometric properties of the Exercise Benefits/Barriers Scale in Mexican elderly women
Enríquez-Reyna, María Cristina; Cruz-Castruita, Rosa María; Ceballos-Gurrola, Oswaldo; García-Cadena, Cirilo Humberto; Hernández-Cortés, Perla Lizeth; Guevara-Valtier, Milton Carlos
2017-01-01
ABSTRACT Objective: analyze and assess the psychometric properties of the subscales in the Spanish version of the Exercise Benefits/Barriers Scale in an elderly population in the Northeast of Mexico. Method: methodological study. The sample consisted of 329 elderly associated with one of the five public centers for senior citizens in the metropolitan area of Northeast Mexico. The psychometric properties included the assessment of the Cronbach's alpha coefficient, the Kaiser Meyer Olkin coefficient, the inter-item correlation, exploratory and confirmatory factor analysis. Results: in the principal components analysis, two components were identified based on the 43 items in the scale. The item-total correlation coefficient of the exercise benefits subscale was good. Nevertheless, the coefficient for the exercise barriers subscale revealed inconsistencies. The reliability and validity were acceptable. The confirmatory factor analysis revealed that the elimination of items improved the goodness of fit of the baseline scale, without affecting its validity or reliability. Conclusion: the Exercise Benefits/Barriers subscale presented satisfactory psychometric properties for the Mexican context. A 15-item short version is presented with factorial structure, validity and reliability similar to the complete scale. PMID:28591306
Gasquet, Isabelle; Villeminot, Sylvie; Estaquio, Carla; Durieux, Pierre; Ravaud, Philippe; Falissard, Bruno
2004-08-04
Few questionnaires on outpatients' satisfaction with hospital exist. All have been constructed without giving enough room for the patient's point of view in the validation procedure. The main objective was to develop, according to psychometric standards, a self-administered generic outpatient questionnaire exploring opinion on quality of hospital care. First, a qualitative phase was conducted to generate items and identify domains using the critical incident analysis technique and a literature review. A list of easily comprehensible non-redundant items was defined using the Delphi technique and a pilot study on outpatients. This phase involved outpatients, patient association representatives and experts. The second step was a quantitative validation phase comprising a multicenter study in 3 hospitals, 10 departments and 1007 outpatients. It was designed to select items, identify dimensions, and measure reliability and internal and concurrent validity. Patients were randomized according to the place of questionnaire completion (hospital v. home) (participation rate = 65%). Third, a mail-back study on 2 departments and 248 outpatients was conducted to replicate the validation (participation rate = 57%). The result was a 27-item questionnaire comprising 4 subscales (appointment making, reception facilities, waiting time and consultation with the doctor). The factorial structure was satisfactory (loading >0.50 on each subscale for all items, except one item). Interscale correlations ranged from 0.42 to 0.59, and Cronbach alpha coefficients ranged from 0.79 to 0.94. All item-scale correlations were higher than 0.40. Test-retest intraclass coefficients ranged from 0.69 to 0.85. A unidimensional 9-item version was produced by selection of one third of the items within each subscale with the strongest loading on the principal component and the best item-scale correlation corrected for overlap. Factors related to satisfaction level independent from departments were age, previous consultations in the department and satisfaction with life. Completion at hospital immediately after consultation led to an overestimation of satisfaction. No satisfaction score differences existed between spontaneous respondents and patients responding after reminder(s). Good estimation of patient opinion on hospital consultation performance was obtained with these questionnaires. When comparing performances between departments or the same department over time, scores need to be adjusted on 3 variables that influence satisfaction independently from department. Completion of the questionnaire at home is preferable to completion in the consultation facility, and reminders are not necessary to produce non-biased data.
[Validating the Spanish version of the Nursing Activities Score].
Sánchez-Sánchez, M M; Arias-Rivera, S; Fraile-Gamo, M P; Thuissard-Vasallo, I J; Frutos-Vivar, F
2015-01-01
Validating workload scores ensures that they are appropriate for the purpose for which they were developed. To validate the Nursing Activities Score (NAS) Spanish version. Observational and prospective study. 1,045 patients who were admitted to a medical-surgical unit and a serious burns unit in 2006 were included. The nurse in charge assessed patient workloads by Nine Equivalent of Nursing Manpower use Score and NAS. To assess the internal consistency of the measurements of NAS, item-test correlations, Cronbach's α and Cronbach's α corrected by omitting each of the items were calculated. The intraobserver and interobserver reliability were assessed with the intraclass correlation coefficient by viewing recordings and Kappa (interobserver reliability) was estimated. For the analysis of internal validity, a factorial principal components analysis was performed. Convergent validity was assessed using the Spearman correlation coefficient values obtained from the Nine Equivalent of Nursing Manpower use Score and Spanish-NAS scales. For internal consistency, 164 questionnaires were analysed and a Cronbach's α of 0.373 was calculated. The intraclass correlation coefficient for intraobserver reliability estimate was 0.837 (95% CI: 0.466-0.950) and 0.662 (95% CI: 0.033-0.882) for interobserver reliability. The estimated kappa was 0.371. For internal validity, exploratory factor analysis showed that the first item explained 58.9% of the variance of the questionnaire. For convergent validity 1006 questionnaires were included and a Spearman correlation coefficient of 0.746 was observed. The psychometric properties of Spanish-NAS are acceptable. Copyright © 2014 Elsevier España, S.L.U. y SEEIUC. All rights reserved.
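The kappa statistic reported above measures agreement between raters beyond what chance would produce. The following sketch assumes unweighted Cohen's kappa for two raters assigning categorical codes; the abstract does not specify the kappa variant, and the two rating vectors are hypothetical.

```python
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same cases."""
    n = len(rater_a)
    # Observed agreement: share of cases where both raters give the same code.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement under independence, from each rater's marginals.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    categories = set(rater_a) | set(rater_b)
    p_expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    return (p_observed - p_expected) / (1 - p_expected)

# Hypothetical codes from two observers on 10 cases (1 = activity scored).
obs_1 = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
obs_2 = [1, 0, 0, 0, 1, 0, 1, 1, 1, 0]
kappa = cohen_kappa(obs_1, obs_2)
print(round(kappa, 2))
```

Here the raters agree on 8 of 10 cases, but half of that agreement is expected by chance, so kappa is noticeably lower than the raw agreement rate.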
An Analysis of the Individual Effects of Sex Bias.
ERIC Educational Resources Information Center
Smith, Richard M.
Most attempts to correct for the presence of biased test items in a measurement instrument have been either to remove the items or to adjust the scores to correct for the bias. Using the Rasch Dichotomous Response Model and the independent ability estimates derived from three sets of items, those which favor females, those which favor males, and…
Oral health literacy and knowledge among patients who are pregnant for the first time.
Hom, Jacqueline M; Lee, Jessica Y; Divaris, Kimon; Baker, A Diane; Vann, William F
2012-09-01
The authors conducted an observational cohort study to determine the levels of and examine the associations of oral health literacy (OHL) and oral health knowledge in low-income patients who were pregnant for the first time. An analytic sample of 119 low-income patients who were pregnant for the first time completed a structured 30-minute, in-person interview conducted by two trained interviewers in seven counties in North Carolina. The authors measured OHL by means of a dental word recognition test and assessed oral health knowledge by administering a six-item knowledge survey. The authors found that OHL scores were distributed normally (mean [standard deviation], 16.4 [5.0]). The percentage of correct responses for each oral health knowledge item ranged from 45 to 98 percent. The results of bivariate analyses showed that there was a positive correlation between OHL and oral health knowledge (P < .01). Higher OHL levels were associated with correct responses to two of the knowledge items (P < .01). OHL was low in the study sample. There was a significant association between OHL and oral health knowledge. Low OHL levels and, thereby, low levels of oral health knowledge, might affect health outcomes for both the mother and child. Tailoring messages to appropriate OHL levels might improve knowledge.
Oren, Besey; Zengin, Neriman; Yildiz, Nebahat
2016-01-01
OBJECTIVE: This study aimed to test the validity and reliability of a version of the tool developed in Sri Lanka in 2011 to assess patient perceptions of the quality of nursing care and related hospital services created for use with Turkish patients. METHODS: This methodological study was conducted between November 2013 and November 2014 after obtaining ethical approval and organizational permission. Data was collected during discharge from 180 adult patients who were hospitalized for at least 3 days at a medical school hospital located in Istanbul. After language validation, validity and reliability analyses of the scale were conducted. Content validity, content validity index (CVI), construct validity, and exploratory factor analysis were assessed and examined, and reliability was tested using the Cronbach’s alpha coefficient and item-total correlations. RESULTS: Mean CVI was found to be 0.95, which is above expected value. Exploratory factor analysis revealed 4 factors with eigenvalues above 1, which explained 82.4% of total variance in the Turkish version of the tool to measure patient perceptions of nursing care and other hospital services. Factor loading for each item was ≥.40. Cronbach’s alpha coefficient of sub-dimensions and total scale were found to be 0.84-0.98 and 0.98, respectively. Item-total correlations ranged from 0.56 to 0.83 for the entire group, which was above expected values. CONCLUSION: The Turkish version of the scale to assess patient perceptions of the quality of nursing care and related hospital services, which comprised 4 sub-dimensions and 36 items, was found to be valid and reliable for use with the Turkish population. PMID:28275750
Canady, Renée B; Stommel, Manfred; Holzman, Claudia
2009-01-01
This study investigated the appropriateness of using the CES-D scale for comparing depressive symptoms among pregnant women of different races. Black and White women were matched on education, age, Medicaid status, and marital status-living arrangements. The matching procedure yielded a study sample of 375 in each ethnic group. Using a confirmatory factor analysis, the fit of several factor models for the CES-D was evaluated. One CES-D item, "everything was an effort", showed a low item-total correlation (0.04 among blacks, 0.22 among whites) and was excluded from further analysis. After imposing the constraints of equal factor loadings and factor covariance across both groups, a two-factor model with 19 CES-D items provided a good fit. Only the loading for the "was happy" item displayed a small difference between the two groups. Furthermore, the correlations between the original 20-item and the unbiased 18-item scales were r = 0.994 for Whites and r = 0.992 for Blacks. The results suggest that the 20-item CES-D can be used to compare depressive symptoms in White and Black pregnant women without introducing significant ethnic-racial bias in the measurement of these symptoms.
Rapid assessment of tinnitus-related psychological distress using the Mini-TQ.
Hiller, Wolfgang; Goebel, Gerhard
2004-01-01
The aim of this study was to develop an abridged version of the Tinnitus Questionnaire (TQ) to be used as a quick tool for the assessment of tinnitus-related psychological distress. Data from 351 inpatients and 122 outpatients with chronic tinnitus were used to analyse item statistics and psychometric properties. Twelve items with an optimal combination of high item-total correlations, reliability and sensitivity in assessing changes were selected for the Mini-TQ. Correlation with the full TQ was >0.90, and test-retest reliability was 0.89. Validity was confirmed by associations with general psychological symptom patterns. Treatment effects indicated by the Mini-TQ were slightly greater than those indicated by the full TQ. The Mini-TQ is recommended as a psychometrically approved and solid tool for rapid and economical assessment of subjective tinnitus distress.
Correlation between remnant inferior turbinate volume and symptom severity of empty nose syndrome.
Hong, Hye Ran; Jang, Yong Ju
2016-06-01
Empty nose syndrome (ENS) is an iatrogenic disorder caused by turbinate reduction procedures, which results in considerable nasal dysfunction and severely impaired quality of life. However, there is a lack of data that explains the relationship between the degree of turbinate reduction and subjective symptoms. The aim of this study was to evaluate the effects of remnant inferior turbinate volume on symptom severity. We retrospectively analyzed data from 34 patients who were diagnosed with ENS. All patients underwent computed tomography scanning and completed the SNOT-25 questionnaire. The control group consisted of 10 patients with pituitary adenoma who did not have any sinonasal symptoms or abnormalities. The inferior turbinate volumes were compared between groups, and the correlation between inferior turbinate volumes (ITVs) and Sino-Nasal Outcome Test-25 (SNOT-25) was also evaluated. The ENS group presented with a significantly smaller inferior turbinate volume than the control group (P < 0.001). The overall SNOT-25 score demonstrated no statistically significant correlation with anterior, posterior, or total ITV (P > 0.05, respectively). Among the various items on SNOT-25, a high dryness score was significantly correlated with a smaller total inferior turbinate volume (P = 0.030). Facial pain was significantly correlated with smaller anterior ITV (P = 0.011). In addition, patients who had smaller posterior inferior turbinate volume demonstrated higher scores on specific SNOT-25 items. A smaller inferior turbinate volume is significantly associated with specific SNOT-25 items in ENS patients. 4. Laryngoscope, 126:1290-1295, 2016. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.
T56. AN EXPLORATORY ANALYSIS CONVERTING SCORES BETWEEN THE PANSS AND BNSS
Kott, Alan; Daniel, David
2018-01-01
Abstract Background The Brief Negative Symptom Scale is a relatively new instrument designed specifically to measure the negative symptoms in schizophrenia. Recently more clinical trials include the BNSS scale as a secondary or exploratory outcome, typically along with the PANSS. In the current analysis we aimed at establishing the equations that would allow conversion between the BNSS scale total score and the PANSS negative subscale and PANSS negative factors score as well as conversion equations between the expressive deficits and avolition/apathy factors of the scales. (Kirkpatrick, 2011; Strauss, 2012) Methods Data from 518 schizophrenia clinical trials subjects with both PANSS and BNSS data available were used. Regression analyses predicting the BNSS total score with the PANSS negative subscale score, and the BNSS total score with the PANSS Negative factor (NFS) score were performed on data from all subjects. Regression analyses predicting the BNSS avolition/apathy factor (items 1, 2, 3, 5, 6, 7, and 8) with the PANSS avolition/apathy factor (items N2, N4 and G16) and the BNSS expressive deficits factor (items 4, 9, 10, 11, 12, and 13) with the expressive deficits factor (items N1, N3, N6, G5, G7, and G13) of the PANSS were performed on a sample of 318 subjects with individual BNSS item scores available. In addition to estimating the equations we also calculated the Pearson's correlations between the scales. Results The PANSS and BNSS avolition/apathy factors were highly correlated (r=0.70) as were the expressive deficit factors (r=0.83).
The following equations predicting the BNSS total score were obtained from regression analyses performed on 2,560 data points: BNSS_total = -11.64 + 2.10 * PANSS_negative_subscale; BNSS_total = -9.26 + 2.11 * PANSS_NFS. The following equations predicting the BNSS factor scores from the PANSS factor scores were obtained from regression analyses performed on 1,634 data points: BNSS_avolition/apathy = -2.40 + 2.38 * PANSS_avolition/apathy; BNSS_expressive_deficit_factor = -4.21 + 1.27 * PANSS_expressive_deficit_factor. Discussion The BNSS differs from the PANSS negative factor because it addresses all five currently recognized domains of negative symptoms including anhedonia and attempts to differentiate anticipatory from consummatory states. In our analysis we have replicated the strong correlation between the BNSS total score and PANSS negative subscale and newly identified strong correlations between the BNSS total score and NFS as well as strong correlations between the avolition/apathy and expressive deficit factors of the BNSS and the PANSS scales. (Kirkpatrick, 2011) The provided equations offer a useful tool allowing researchers and clinicians to easily convert the data between the instruments for reasons such as pooling data from multiple trials using one of the instruments or interpreting results within the context of previously conducted research. They also offer a framework for risk-based monitoring to identify data deviating from the expected relationship and allow for a targeted exploration of the causes of such a disagreement. The data used for analysis included not only subjects with predominantly negative symptoms but also acutely psychotic subjects and subjects in stable conditions, therefore allowing the results to be generalized across the majority of schizophrenic subjects. This post-hoc analysis is exploratory.
We plan to further explore the potential utility of equations addressing the relationships among schizophrenia measures of symptom severity in an iterative manner with larger datasets.
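As an illustration only, the four regression equations reported in the abstract above can be written as small conversion functions. The sketch below is in Python; the function and parameter names are hypothetical and chosen for readability, and only the intercepts and slopes come from the abstract. Because the authors describe the analysis as exploratory, the outputs should be read as rough estimates rather than validated conversions.

```python
# Sketch of the PANSS -> BNSS conversion equations quoted in the abstract.
# Function and parameter names are hypothetical; only the coefficients
# are taken from the reported regression results.


def bnss_total_from_negative_subscale(panss_negative_subscale: float) -> float:
    """Estimate the BNSS total score from the PANSS negative subscale score."""
    return -11.64 + 2.10 * panss_negative_subscale


def bnss_total_from_nfs(panss_nfs: float) -> float:
    """Estimate the BNSS total score from the PANSS Negative factor (NFS) score."""
    return -9.26 + 2.11 * panss_nfs


def bnss_avolition_from_panss(panss_avolition_apathy: float) -> float:
    """Estimate the BNSS avolition/apathy factor from the PANSS counterpart."""
    return -2.40 + 2.38 * panss_avolition_apathy


def bnss_expressive_from_panss(panss_expressive_deficit: float) -> float:
    """Estimate the BNSS expressive deficit factor from the PANSS counterpart."""
    return -4.21 + 1.27 * panss_expressive_deficit


if __name__ == "__main__":
    # Hypothetical input score, used only to show the call pattern.
    print(round(bnss_total_from_negative_subscale(20), 2))
```

A monitoring use of the kind the abstract mentions would compare an observed BNSS score with the value predicted here and flag large differences; the size of an acceptable difference is not given in the source.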
García-Pérez, Miguel A.; Alcalá-Quintana, Rocío
2016-01-01
Hoekstra et al. (Psychonomic Bulletin & Review, 2014, 21:1157–1164) surveyed the interpretation of confidence intervals (CIs) by first-year students, master students, and researchers with six items expressing misinterpretations of CIs. They asked respondents to answer all items, computed the number of items endorsed, and concluded that misinterpretation of CIs is robust across groups. Their design may have produced this outcome artifactually for reasons that we describe. This paper discusses first the two interpretations of CIs and, hence, why misinterpretation cannot be inferred from endorsement of some of the items. Next, a re-analysis of Hoekstra et al.'s data reveals some puzzling differences between first-year and master students that demand further investigation. For that purpose, we designed a replication study with an extended questionnaire including two additional items that express correct interpretations of CIs (to compare endorsement of correct vs. nominally incorrect interpretations) and we asked master students to indicate which items they would have omitted had they had the option (to distinguish deliberate from uninformed endorsement caused by the forced-response format). Results showed that incognizant first-year students endorsed correct and nominally incorrect items identically, revealing that the two item types are not differentially attractive superficially; in contrast, master students were distinctively more prone to endorsing correct items when their uninformed responses were removed, although they admitted to nescience more often than might have been expected. Implications for teaching practices are discussed. PMID:27458424
NASA Astrophysics Data System (ADS)
Campbell, Erin Roberts
The process of chemical education should facilitate students' construction of meaningful conceptual structures about the concepts and processes of chemistry. It is evident, however, that students at all levels possess concepts that are inconsistent with currently accepted scientific views. The purpose of this study was to examine undergraduate chemistry students' conceptions of atomic structure, chemical bonding and molecular structure. A diagnostic instrument to evaluate students' conceptions of atomic and molecular structure was developed by the researcher. The instrument incorporated multiple-choice items and reasoned explanations based upon relevant literature and a categorical summarization of student responses (Treagust, 1988, 1995). A covalent bonding and molecular structure diagnostic instrument developed by Peterson and Treagust (1989) was also employed. The ex post facto portion of the study examined the conceptual understanding of undergraduate chemistry students using descriptive statistics to summarize the results obtained from the diagnostic instruments. In addition to the descriptive portion of the study, a total score for each student was calculated based on the combination of correct and incorrect choices made for each item. A comparison of scores obtained on the diagnostic instruments by the upper and lower classes of undergraduate students was made using a t-test. This study also examined an axiomatic assumption that an understanding of atomic structure is important in understanding bonding and molecular structure. A Pearson correlation coefficient, r, was calculated to provide a measure of the strength of this association. Additionally, this study gathered information regarding expectations of undergraduate chemistry students' understanding held by the chemical community. Two questionnaires were developed with items based upon the propositional knowledge statements used in the development of the diagnostic instruments.
Subgroups of items from the questionnaires were formed from the combination of items found to measure different aspects of a specific topic area using a reliability analysis. Average scores for the subgroups were compared to results obtained by students on the diagnostic instrument targeting the same topic area. There were no significant differences of the scores on both of the diagnostic instruments between the levels of undergraduate chemistry students. There were, however, significant differences on certain items of the diagnostic instruments between upper and lower class students. Additionally, misconceptions were identified within all levels of these undergraduate students that corresponded to previous results reported in the literature. A significant relationship was found to exist between the scores obtained on the two diagnostic instruments, as well as strong correlations between specific items and the total scores of the instruments. Response to the expectations questionnaires revealed no differences between the chemical industry and chemical academia, but did provide information concerning the chemical community's expectations of undergraduate chemistry students. Results indicate that undergraduate students majoring in chemistry have conceptions that are inconsistent with currently accepted scientific views. The findings also support the hypothesis that an understanding of the general structure of the atom and the roles played by electrons in molecular bonding and structure is important to an understanding of chemical properties and behavior.
People--things and data--ideas: bipolar dimensions?
Tay, Louis; Su, Rong; Rounds, James
2011-07-01
We examined a longstanding assumption in vocational psychology that people-things and data-ideas are bipolar dimensions. Two minimal criteria for bipolarity were proposed and examined across 3 studies: (a) The correlation between opposite interest types should be negative; (b) after correcting for systematic responding, the correlation should be greater than -.40. In Study 1, a meta-analysis using 26 interest inventories with a sample size of 1,008,253 participants showed that meta-analytic correlations between opposite RIASEC (realistic, investigative, artistic, social, enterprising, conventional) types ranged from -.03 to .18 (corrected meta-analytic correlations ranged from -.23 to -.06). In Study 2, structural equation models (SEMs) were fit to the Interest Finder (IF; Wall, Wise, & Baker, 1996) and the Interest Profiler (IP; Rounds, Smith, Hubert, Lewis, & Rivkin, 1999) with sample sizes of 13,939 and 1,061, respectively. The correlations of opposite RIASEC types were positive, ranging from .17 to .53. No corrected correlation met the criterion of -.40 except for investigative-enterprising (r = -.67). Nevertheless, a direct estimate of the correlation between data-ideas end poles using targeted factor rotation did not reveal bipolarity. Furthermore, bipolar SEMs fit substantially worse than a multiple-factor representation of vocational interests. In Study 3, a two-way clustering solution on IF and IP respondents and items revealed a substantial number of individuals with interests in both people and things. We discuss key theoretical, methodological, and practical implications such as the structure of vocational interests, interpretation and scoring of interest measures for career counseling, and expert RIASEC ratings of occupations.
Cappelleri, J C; Althof, S E; Siegel, R L; Shpilsky, A; Bell, S S; Duttagupta, S
2004-02-01
Development and validation of a patient-reported measure of psychosocial variables in men with erectile dysfunction (ED) is described. Literature review, focus groups, and medical specialists identified 86 potential items. Redundant, ambiguous, or low item-to-total correlation items were removed. Data from 98 men reporting diagnosed ED and 94 controls assisted in final item selection and psychometric evaluation. Treatment responsiveness was evaluated in 93 men with ED in a 10-week open-label trial of sildenafil citrate (Viagra). The 14 chosen items resolved into two domains: Sexual Relationship (eight items) and Confidence (six items), the latter comprising Self-Esteem (four items) and Overall Relationship (two items) subscales. The resulting Self-Esteem And Relationship (SEAR) questionnaire demonstrated validity and reliability. The intervention study demonstrated responsiveness to beneficial treatment with significant improvement in scores (P=0.0001). The SEAR questionnaire possesses strong psychometric properties that support its validity and reliability for measuring sexual relationship, confidence, and particularly self-esteem.
[Development of a cell phone addiction scale for korean adolescents].
Koo, Hyun Young
2009-12-01
This study was done to develop a cell phone addiction scale for Korean adolescents. The process included construction of a conceptual framework, generation of initial items, verification of content validity, selection of secondary items, preliminary study, and extraction of final items. The participants were 577 adolescents in two middle schools and three high schools. Item analysis, factor analysis, criterion related validity, and internal consistency were used to analyze the data. Twenty items were selected for the final scale, and categorized into 3 factors explaining 55.45% of total variance. The factors were labeled as withdrawal/tolerance (7 items), life dysfunction (6 items), and compulsion/persistence (7 items). The scores for the scale were significantly correlated with self-control, impulsiveness, and cell phone use. Cronbach's alpha coefficient for the 20 items was .92. Scale scores identified students as cell phone addicted, heavy users, or average users. The above findings indicate that the cell phone addiction scale has good validity and reliability when used with Korean adolescents.
Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.
Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E
2011-08-15
Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.
Psychometric Properties of the Persian Version of the Simple Shoulder Test (SST) Questionnaire.
Ebrahimzadeh, Mohammad H; Vahedi, Ehsan; Baradaran, Aslan; Birjandinejad, Ali; Seyyed-Hoseinian, Seyyed-Hadi; Bagheri, Farshid; Kachooei, Amir Reza
2016-10-01
To validate the Persian version of the Simple Shoulder Test (SST) in patients with shoulder joint problems. Following Beaton's guideline, translation and back translation were conducted. We reached a consensus on the Persian version of the SST. To test the face validity in a pilot study, the Persian SST was administered to 20 individuals with shoulder joint conditions. We enrolled 148 consecutive patients with shoulder problems to fill in the Persian SST, a shoulder-specific measure, the Oxford Shoulder Score (OSS), and two general measures, the DASH and SF-36. To measure the test-retest reliability, 42 patients were randomly asked to fill in the Persian SST for the second time after one week. Cronbach's alpha coefficient was used to demonstrate internal consistency over the 12 items of the Persian SST. ICC for the total questionnaire was 0.61, showing good and acceptable test-retest reliability. ICC for individual items ranged from 0.32 to 0.79. The total Cronbach's alpha was 0.84, showing good internal consistency over the 12 items of the Persian SST. Validity testing showed strong correlation between SST and OSS and DASH. The correlation with OSS was positive while the correlation with DASH scores was negative. The correlation was also good to strong with all physical and most mental subscales of the SF-36. The correlation coefficient was higher with DASH and OSS compared with SF-36. The Persian version of the SST was found to be a valid and reliable instrument for shoulder joint pain and function assessment in the Iranian population.
Osman, Augustine; Wong, Jane L; Bagge, Courtney L; Freedenthal, Stacey; Gutierrez, Peter M; Lozano, Gregorio
2012-12-01
We conducted two studies to examine the dimensions, internal consistency reliability estimates, and potential correlates of the Depression Anxiety Stress Scales-21 (DASS-21; Lovibond & Lovibond, 1995). Participants in Study 1 included 887 undergraduate students (363 men and 524 women, aged 18 to 35 years; mean [M] age = 19.46, standard deviation [SD] = 2.17) recruited from two public universities to assess the specificity of the individual DASS-21 items and to evaluate estimates of internal consistency reliability. Participants in a follow-up study (Study 2) included 410 students (168 men and 242 women, aged 18 to 47 years; M age = 19.65, SD = 2.88) recruited from the same universities to further assess factorial validity and to evaluate potential correlates of the original DASS-21 total and scale scores. Item bifactor and confirmatory factor analyses revealed that a general factor accounted for the greatest proportion of common variance in the DASS-21 item scores (Study 1). In Study 2, the fit statistics showed good fit for the bifactor model. In addition, the DASS-21 total scale score correlated more highly with scores on a measure of mixed depression and anxiety than with scores on the proposed specific scales of depression or anxiety. Coefficient omega estimates for the DASS-21 scale scores were good. Further investigations of the bifactor structure and psychometric properties of the DASS-21, specifically its incremental and discriminant validity, using known clinical groups are needed. © 2012 Wiley Periodicals, Inc.
Ross, Robert S.; Smolen, Andrew; Curran, Tim; Nyhus, Erika
2018-01-01
A critical problem for developing personalized treatment plans for cognitive disruptions is the lack of understanding how individual differences influence cognition. Recognition memory is one cognitive ability that varies from person to person and that variation may be related to different genetic phenotypes. One gene that may impact recognition memory is the monoamine oxidase A gene (MAO-A), which influences the transcription rate of MAO-A. Examination of how MAO-A phenotypes impact behavioral and event-related potentials (ERPs) correlates of recognition memory may help explain individual differences in recognition memory performance. Therefore, the current study uses electroencephalography (EEG) in combination with genetic phenotyping of the MAO-A gene to determine how well-characterized ERP components of recognition memory, the early frontal old/new effect, left parietal old/new effect, late frontal old/new effect, and the late posterior negativity (LPN) are impacted by MAO-A phenotype during item and source memory. Our results show that individuals with the MAO-A phenotype leading to increased transcription have lower response sensitivity during both item and source memory. Additionally, during item memory the left parietal old/new effect is not present due to increased ERP amplitude for correct rejections. The results suggest that MAO-A phenotype changes EEG correlates of recognition memory and influences how well individuals differentiate between old and new items. PMID:29487517
[Development and validation of the Korean patient safety culture scale for nursing homes].
Yoon, Sook Hee; Kim, Byungsoo; Kim, Se Young
2013-06-01
The purpose of this study was to develop a tool to evaluate patient safety culture in nursing homes and to test its validity and reliability. A preliminary tool was developed through focus group interviews, content validity tests, and a pilot study. A nationwide survey was conducted from February to April, 2011, using self-report questionnaires. Participants were 982 employees in nursing homes. Data were analyzed using Cronbach's alpha, item analysis, factor analysis, and multitrait/multi-item analysis. From the results of the analysis, 27 final items were selected from 49 items on the preliminary tool. Items with low correlation with the total scale were excluded. The 4 factors sorted by factor analysis contributed 63.4% of the variance in the total scale. The factors were labeled as leadership, organizational system, working attitude, and management practice. Cronbach's alpha for internal consistency was .95 and the range for the 4 factors was from .86 to .93. The results of this study indicate that the Korean Patient Safety Culture Scale has reliability and validity and is suitable for evaluation of patient safety culture in Korean nursing homes.
van der Eijk, Cees; Rose, Jonathan
2015-01-01
This paper undertakes a systematic assessment of the extent to which factor analysis identifies the correct number of latent dimensions (factors) when applied to ordered-categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a range of conditions such as the underlying population distribution, the number of items, the level of random error, and characteristics of items and item-sets. Each of these datasets is factor analysed in a variety of ways that are frequently used in the extant literature, or that are recommended in current methodological texts. These include exploratory factor retention heuristics such as Kaiser's criterion, Parallel Analysis and a non-graphical scree test, and (for exploratory and confirmatory analyses) evaluations of model fit. These analyses are conducted on the basis of Pearson and polychoric correlations. We find that, irrespective of the particular mode of analysis, factor analysis applied to ordered-categorical survey data very often leads to over-dimensionalisation. The magnitude of this risk depends on the specific way in which factor analysis is conducted, the number of items, the properties of the set of items, and the underlying population distribution. The paper concludes with a discussion of the consequences of over-dimensionalisation, and a brief mention of alternative modes of analysis that are much less prone to such problems. PMID:25789992
Vallone, Donna; Allen, Jane A; Clayton, Richard R; Xiao, Haijun
2007-10-01
To assess the reliability and validity of the Brief Sensation Seeking Scale BSSS-4 by race/ethnicity. Six waves of nationally representative, cross-sectional, Legacy Media Tracking Survey (LMTS) data. Analyses are based on a sample size of 24 328 individuals. Response rates for the individual survey administrations range from 60% to 30%. Data were collected by telephone, from April 2001 to January 2004. Youth, aged 12-17 years, who completed the LMTS. Sensation seeking was measured using the four-item scale, BSSS-4, published by Stephenson et al. in 2003. A series of items from the LMTS was used to measure youth intention to smoke and smoking behavior. Mean sensation seeking scores increased as the risk for established smoking increased. African American youth who are open to smoking or have experimented with cigarettes had lower mean sensation seeking scores than their white and Hispanic counterparts. Coefficient alpha and average corrected item-total correlations suggest that the BSSS-4 is a less reliable measure of sensation seeking for African American youth compared to white and Hispanic youth. The BSSS-4 is a useful tool for identifying youth at risk for smoking; however, it is less reliable and valid for African American youth compared with other youth. Future research should investigate whether other existing sensation seeking scales are equally reliable and valid across race/ethnicity, and whether an alternative scale could or should be developed that would measure sensation seeking more effectively among African American youth.
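Coefficient alpha and corrected item-total correlations, the two reliability statistics named in the abstract above, can be computed with a few lines of code. The sketch below is a minimal Python illustration using the standard textbook definitions; it is not the authors' analysis code, the function names are hypothetical, and the four-item response matrix is made-up example data rather than LMTS data.

```python
# Minimal sketch: Cronbach's alpha and corrected item-total correlations.
# Input is a list of respondents, each a list of item scores.
# All names and the example data are hypothetical.
from math import sqrt
from statistics import mean, variance


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.

    Raises ZeroDivisionError if either sequence has zero variance.
    """
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ssx = sum((a - mx) ** 2 for a in x)
    ssy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(ssx * ssy)


def cronbach_alpha(rows):
    """alpha = k/(k-1) * (1 - sum of item variances / variance of totals)."""
    k = len(rows[0])
    items = list(zip(*rows))  # one tuple of scores per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_var_sum / total_var)


def corrected_item_total(rows):
    """Correlate each item with the sum of the remaining items."""
    items = list(zip(*rows))
    totals = [sum(r) for r in rows]
    result = []
    for item in items:
        rest = [t - v for t, v in zip(totals, item)]  # total without this item
        result.append(pearson(item, rest))
    return result


if __name__ == "__main__":
    # Hypothetical responses of five people to a four-item, 5-point scale.
    responses = [
        [4, 5, 4, 3],
        [2, 2, 3, 2],
        [5, 4, 5, 5],
        [1, 2, 1, 2],
        [3, 3, 4, 3],
    ]
    print(cronbach_alpha(responses))
    print(corrected_item_total(responses))
```

Comparing these statistics across subgroups, as the abstract describes, would mean calling the functions separately on each subgroup's responses; the "corrected" form removes the item from the total so that an item is not correlated with itself.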
Panah, Sara Hojat; Baharlouie, Hamze; Rezaeian, Zahra Sadat; Hawker, Gilian
2016-01-01
The present study aimed to translate and evaluate the reliability and validity of the Persian version of the 11-item Intermittent and Constant Osteoarthritis Pain (ICOAP) measure in Iranian subjects with Knee Osteoarthritis (KOA). The ICOAP questionnaire was translated according to the Manufacturers Alliance for Productivity and Innovation (MAPI) protocol. The procedure consisted of forward and backward translation, as well as the assessment of the psychometric properties of the Persian version of the questionnaire. A sample of 230 subjects with KOA was asked to complete the Persian versions of ICOAP and Knee injury and Osteoarthritis Outcome Score (KOOS). The ICOAP was readministered to forty subjects five days after the first visit. Test-retest reliability was assessed using Intraclass Correlation Coefficient (ICC), and internal consistency was assessed by Cronbach's alpha and item-total correlation. The correlation between ICOAP and KOOS was determined using Spearman's correlation coefficient. Subjects found the Persian-version of the ICOAP to be clear, simple, and unambiguous, confirming its face validity. Spearman correlations between ICOAP total and subscale scores with KOOS scores were between 0.5 and 0.7, confirming construct validity. Cronbach's alpha, used to assess internal consistency, was 0.89, 0.93, and 0.92 for constant pain, intermittent pain, and total pain scores, respectively. The ICC was 0.90 for constant pain and 0.91 for the intermittent pain and total pain score. The Persian version of the ICOAP is a reliable and valid outcome measure that can be used in Iranian subjects with KOA.
General practitioners' knowledge and concern about electromagnetic fields.
Berg-Beckhoff, Gabriele; Breckenkamp, Jürgen; Larsen, Pia Veldt; Kowall, Bernd
2014-12-01
Our aim is to explore general practitioners' (GPs') knowledge about EMF, and to assess whether different knowledge structures are related to the GPs' concern about EMF. Random samples were drawn from lists of GPs in Germany in 2008. Knowledge about EMF was assessed by seven items. A latent class analysis was conducted to identify latent structures in GPs' knowledge. Further, the GPs' concern about EMF health risk was measured using a score comprising six items. The association between GPs' concern about EMF and their knowledge was analysed using multiple linear regression. In total 435 (response rate 23.3%) GPs participated in the study. Four groups were identified by the latent class analysis: 43.1% of the GPs gave mainly correct answers; 23.7% of the GPs answered low frequency EMF questions correctly; 19.2% answered only the questions relating EMF with health risks, and 14.0% answered mostly "don't know". There was no association between GPs' latent knowledge classes or between the number of correct answers given by the GPs and their EMF concern, whereas the number of incorrect answers was associated with EMF concern. Greater EMF concern in subjects with more incorrect answers suggests paying particular attention to misconceptions regarding EMF in risk communication.
Item Type and Gender Differences on the Mental Rotations Test
ERIC Educational Resources Information Center
Voyer, Daniel; Doyle, Randi A.
2010-01-01
This study investigated gender differences on the Mental Rotations Test (MRT) as a function of item and response types. Accordingly, 86 male and 109 female undergraduate students completed the MRT without time limits. Responses were coded as reflecting two correct (CC), one correct and one wrong (CW), two wrong (WW), one correct and one blank…
Reliability of the Melbourne assessment of unilateral upper limb function.
Randall, M; Carlin, J B; Chondros, P; Reddihough, D
2001-11-01
This study examines the reliability of the Melbourne Assessment of Unilateral Upper Limb Function: a quantitative test of quality of movement in children with neurological impairment. The assessment was administered to 20 children aged from 5 to 16 years (mean age 9 years 10 months, SD 2 years 10 months) who had various types and degrees of cerebral palsy (CP). The performances of the 20 children during assessment were videotaped for subsequent scoring by 15 occupational therapists. Scores were analyzed for internal consistency of test items, inter- and intrarater reliability of scorings of the same videotapes, and test-retest reliability using repeat videotaping. Results revealed very high internal consistency of test items (alpha=0.96), moderate to high agreement both within and between raters for all test items (intraclass correlations of at least 0.7) apart from item 16 (hand to mouth and down), and high interrater reliability (0.95) and intrarater reliability (0.97) for total test scores. Test-retest results revealed moderate to high intrarater reliability for item totals (mean of 0.83 and 0.79) for each rater and high reliability for test totals (0.98 and 0.97). These findings indicate that the Melbourne Assessment of Unilateral Upper Limb Function is a reliable tool for measuring the quality of unilateral upper-limb movement in children with CP.
Validation of Gujarati Version of ABILOCO-Kids Questionnaire
Diwan, Jasmin; Patel, Pankaj; Bansal, Ankita B.
2015-01-01
Background: ABILOCO-Kids is a measure of locomotion ability for children with cerebral palsy (CP) aged 6 to 15 years and is available in English and French. Aim: To validate the Gujarati version of the ABILOCO-Kids questionnaire for use in clinical research on the Gujarati population. Materials and Methods: The ABILOCO-Kids questionnaire was translated from English into Gujarati using the forward-backward-forward method. To ensure face and content validity of the Gujarati version using the group consensus method, each item was examined by a group of experts with a mean experience of 24.62 years in the field of paediatrics and paediatric physiotherapy. Each item was analysed for content, meaning, wording, format, ease of administration and scoring. Each item was scored by the expert group as accepted, rejected or accepted with modification. The procedure was continued until 80% consensus was reached for all items. Concurrent validity was examined in 55 children with cerebral palsy (6-15 years) of all Gross Motor Functional Classification System (GMFCS) levels and all clinical types by correlating the ABILOCO-Kids score with the Gross Motor Functional Measure (GMFM) and GMFCS. Results: In phase 1 of validation, 16 items were accepted as they were, 22 items were accepted with modification and 3 items went on to phase 2 validation. For concurrent validity, a highly significant positive correlation was found between the ABILOCO-Kids score and total GMFM (r=0.713, p<0.005), and a highly significant negative correlation with GMFCS (r=-0.778, p<0.005). Conclusion: The Gujarati translated version of the ABILOCO-Kids questionnaire has good face and content validity as well as concurrent validity, and can be used to measure caregiver-reported locomotion ability in children with CP. PMID:26557603
Adult perceptions of dental fluorosis and select dental conditions-an Asian perspective.
Nair, Rahul; Chuang, Janice Cheah Ping; Lee, Pauline Shih Jia; Leo, Song Jie; Yang, Naomi QiYue; Yee, Robert; Tong, Huei Jinn
2016-04-01
To compare lay people's perceptions with regard to various levels of dental fluorosis and select dental defects versus normal dentition. Adults rated digitally created photographs showing lips (without retraction) and teeth depicting the following conditions: no apparent aesthetic defects (normal, Thylstrup-Fejerskov score 0, TF0), 6 levels of fluorosis (TF1-6), carious lesions (two cavitated and one noncavitated), malocclusions (Class II, Class III, anterior open bite and greater spacing), extrinsic staining and an incisal chip. The photographs were displayed on colour-calibrated iPads™. Participants used a self-administered questionnaire to rate their perceptions on (Item 1) how normal teeth were, (Item 2) how attractive the teeth were, (Item 3) need to seek correction of teeth, (Item 4) how well the person took care of their teeth and (Item 5) whether the person was born like this. Data from Item 5 were excluded due to low reliability. Ratings for Item 1 showed that TF1-4 were similar to or significantly better than TF0. For Item 2, TF1 and TF4 were significantly better than TF0, with TF2 and TF3 being similar. For Item 3, there was significantly lower need to seek correction with TF2 and TF4 versus TF0, whereas TF1 and TF3 were similar to TF0. TF5 and TF6 were rated significantly lower than TF0 for Item 1 and Item 2, and significantly higher for Item 3 (need to seek correction). Ratings for Item 4 were similar, with TF1, TF2 and TF4 being rated significantly higher than TF0, and TF5 and TF6 being rated lower. Cavitated caries and staining were generally perceived as being significantly less favourable than TF6, with higher need to seek correction as well. Noncavitated carious lesion and incisal chip were rated similar to TF0. Cavitated carious lesions were rated aesthetically similar or significantly worse than TF0 and TF6.
Severe fluorosis (TF5 and 6) was perceived to be less aesthetically pleasing and received higher ratings for need to seek correction than normal teeth. Mild-to-moderate fluorosis (TF1-4) showed similar or better aesthetic perceptions and similar or lower need to seek correction, when compared to normal teeth (TF0). Easily visible cavitated dental caries was rated worse than teeth with severe fluorosis (TF6) and normal teeth (TF0). © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Jo, Min-Woo; Lee, Hyeon-Jeong; Kim, Soo Young; Kim, Seon-Ha; Chang, Hyejung; Ahn, Jeonghoon; Ock, Minsu
2017-01-01
Few attempts have been made to develop a generic health-related quality of life (HRQoL) instrument and to examine its validity and reliability in Korea. We aimed to do this in our present study. After a literature review of existing generic HRQoL instruments, a focus group discussion, in-depth interviews, and expert consultations, we selected 30 tentative items for a new HRQoL measure. These items were evaluated by assessing their ceiling effects, difficulty, and redundancy in the first survey. To validate the HRQoL instrument that was developed, known-groups validity and convergent/discriminant validity were evaluated and its test-retest reliability was examined in the second survey. Of the 30 items originally assessed for the HRQoL instrument, four were excluded due to high ceiling effects and six were removed due to redundancy. We ultimately developed a HRQoL instrument with a reduced number of 20 items, known as the Health-related Quality of Life Instrument with 20 items (HINT-20), incorporating physical, mental, social, and positive health dimensions. The results of the HINT-20 for known-groups validity were poorer in women, the elderly, and those with a low income. For convergent/discriminant validity, the correlation coefficients of items (except vitality) in the physical health dimension with the physical component summary of the Short Form 36 version 2 (SF-36v2) were generally higher than the correlations of those items with the mental component summary of the SF-36v2, and vice versa. Regarding test-retest reliability, the intraclass correlation coefficient of the total HINT-20 score was 0.813 (p<0.001). A novel generic HRQoL instrument, the HINT-20, was developed for the Korean general population and showed acceptable validity and reliability.
Correlated environmental corrections in TOPEX/POSEIDON, with a note on ionospheric accuracy
NASA Technical Reports Server (NTRS)
Zlotnicki, V.
1994-01-01
Estimates of the effectiveness of an altimetric correction, and interpretation of sea level variability as a response to atmospheric forcing, both depend upon assuming that residual errors in altimetric corrections are uncorrelated among themselves and with residual sea level, or knowing the correlations. Not surprisingly, many corrections are highly correlated since they involve atmospheric properties and the ocean surface's response to them. The full corrections (including their geographically varying time mean values) show correlations between electromagnetic bias (mostly the height of wind waves) and either atmospheric pressure or water vapor of -40%, and between atmospheric pressure and water vapor of 28%. In the more commonly used collinear differences (after removal of the geographically varying time mean), atmospheric pressure and wave height show a -30% correlation, atmospheric pressure and water vapor a -10% correlation, both pressure and water vapor a 7% correlation with residual sea level, and a bit surprisingly, ionospheric electron content and wave height a 15% correlation. Only the ocean tide is totally uncorrelated with other corrections or residual sea level. The effectiveness of three ionospheric corrections (TOPEX dual-frequency, a smoothed version of the TOPEX dual-frequency, and Doppler orbitography and radiopositioning integrated by satellite (DORIS)) is also evaluated in terms of their reduction in variance of residual sea level. Smooth (90-200 km along-track) versions of the dual-frequency altimeter ionosphere perform best both globally and within 20 deg in latitude from the equator. The noise variance in the 1/s TOPEX ionospheric samples is approximately (11 mm) squared, about the same as noise in the DORIS-based correction; however, the latter has its error over scales of order 10^3 km. Within 20 deg of the equator, the DORIS-based correction adds (14 mm) squared to the residual sea level variance.
NASA Astrophysics Data System (ADS)
Hao, Wenrui; Lu, Zhenzhou; Li, Luyi
2013-05-01
In order to explore the contributions by correlated input variables to the variance of the output, a novel interpretation framework of importance measure indices is proposed for a model with correlated inputs, which includes the indices of the total correlated contribution and the total uncorrelated contribution. The proposed indices accurately describe the connotations of the contributions by the correlated input to the variance of output, and they can be viewed as the complement and correction of the interpretation about the contributions by the correlated inputs presented in "Estimation of global sensitivity indices for models with dependent variables, Computer Physics Communications, 183 (2012) 937-946". Both of them contain the independent contribution by an individual input. Taking the general form of quadratic polynomial as an illustration, the total correlated contribution and the independent contribution by an individual input are derived analytically, from which the components and their origins of both contributions of correlated input can be clarified without any ambiguity. In the special case that no square term is included in the quadratic polynomial model, the total correlated contribution by the input can be further decomposed into the variance contribution related to the correlation of the input with other inputs and the independent contribution by the input itself, and the total uncorrelated contribution can be further decomposed into the independent part by interaction between the input and others and the independent part by the input itself. Numerical examples are employed and their results demonstrate that the derived analytical expressions of the variance-based importance measure are correct, and the clarification of the correlated input contribution to model output by the analytical derivation is very important for expanding the theory and solutions of uncorrelated input to those of the correlated one.
Martinez-Martin, Pablo; Rodriguez-Blazquez, Carmen; Alvarez-Sanchez, Mario; Arakaki, Tomoko; Bergareche-Yarza, Alberto; Chade, Anabel; Garretto, Nelida; Gershanik, Oscar; Kurtis, Monica M; Martinez-Castrillo, Juan Carlos; Mendoza-Rodriguez, Amelia; Moore, Henry P; Rodriguez-Violante, Mayela; Singer, Carlos; Tilley, Barbara C; Huang, Jing; Stebbins, Glenn T; Goetz, Christopher G
2013-01-01
The Movement Disorder Society-UPDRS (MDS-UPDRS) was published in 2008, showing satisfactory clinimetric results and has been proposed as the official benchmark scale for Parkinson's disease. The present study, based on the official MDS-UPDRS Spanish version, performed the first independent testing of the scale and adds information on its clinimetric properties. The cross-culturally adapted MDS-UPDRS Spanish version showed a comparative fit index ≥ 0.90 for each part (I-IV) relative to the English-language version and was accepted as the Official MDS-UPDRS Spanish version. Data from this scale, applied with other assessments to Spanish-speaking Parkinson's disease patients in five countries, were analyzed for an independent and complementary clinimetric evaluation. In total, 435 patients were included. Missing data were negligible and moderate floor effect (30 %) was found for Part IV. Cronbach's α index ranged between 0.79 and 0.93 and only five items did not reach the 0.30 threshold value of item-total correlation. Test-retest reliability was adequate with only two sub-scores of the item 3.17, Rest tremor amplitude, reaching κ values lower than 0.60. The intraclass correlation coefficient was higher than 0.85 for the total score of each part. Correlation of the MDS-UPDRS parts with other measures for related constructs was high (≥ 0.60) and the standard error of measurement lower than one-third baseline standard deviation for all subscales. Results confirm those of the original study and add information on scale reliability, construct validity, and precision. The MDS-UPDRS Spanish version shows satisfactory clinimetric characteristics.
van der Meulen, Mirja W; Boerebach, Benjamin C M; Smirnova, Alina; Heeneman, Sylvia; Oude Egbrink, Mirjam G A; van der Vleuten, Cees P M; Arah, Onyebuchi A; Lombarts, Kiki M J M H
2017-01-01
Multisource feedback (MSF) instruments are used to, and must feasibly, provide reliable and valid data on physicians' performance from multiple perspectives. The "INviting Co-workers to Evaluate Physicians Tool" (INCEPT) is a multisource feedback instrument used to evaluate physicians' professional performance as perceived by peers, residents, and coworkers. In this study, we report on the validity, reliability, and feasibility of the INCEPT. The performance of 218 physicians was assessed by 597 peers, 344 residents, and 822 coworkers. Using explorative and confirmatory factor analyses, multilevel regression analyses between narrative and numerical feedback, item-total correlations, interscale correlations, Cronbach's α and generalizability analyses, the psychometric qualities and feasibility of the INCEPT were investigated. For all respondent groups, three factors were identified, although constructed slightly differently: "professional attitude," "patient-centeredness," and "organization and (self)-management." Internal consistency was high for all constructs (Cronbach's α ≥ 0.84 and item-total correlations ≥ 0.52). Confirmatory factor analyses indicated acceptable to good fit. Further validity evidence was given by the associations between narrative and numerical feedback. For reliable total INCEPT scores, three peer, two resident and three coworker evaluations were needed; for subscale scores, evaluations of three peers, three residents and three to four coworkers were sufficient. The INCEPT instrument provides physicians with performance feedback in a valid and reliable way. The number of evaluations to establish reliable scores is achievable in a regular clinical department. When interpreting feedback, physicians should consider that respondent groups' perceptions differ as indicated by the different item clustering per performance factor.
Mahmoudian, Saeid; Shahmiri, Elaheh; Rouzbahani, Masoumeh; Jafari, Zahra; Keyhani, Mohammad; Rahimi, Farzad; Mahmoudian, Guiti; Akbarvand, Leila; Barzegar, Gholamreza; Farhadi, Mohammad
2011-01-01
Tinnitus is a debilitating condition that is widespread yet difficult to successfully diagnose and treat. This symptom can seriously affect the individual's quality of life. The aim of the current study was to compose and validate a Persian version of the Tinnitus Handicap Inventory (THI-P). The linguistic validation of the original version of THI into the Persian version (THI-P) included translation, back translation and data gathering. The THI-P was administered to 112 tinnitus subjects. Age, gender, medical history and tinnitus characteristics were recorded as baseline information. All participants complained of chronic unilateral or bilateral subjective idiopathic tinnitus lasting for at least 6 months before consulting about their tinnitus. There was no significant difference between gender, age, hearing impairment and total score and subscales of THI-P. Pearson product-moment correlations revealed adequate test-retest reliability for the THI-P (r = 0.96). Cronbach's alpha coefficient indicated adequate internal stability of the THI-P (r = 0.943), with a total item correction varying between r = 0.939 and r = 0.944, indicating its reproducibility. The present study proved the internal consistency/coherency of the Persian version of THI (THI-P). This provides satisfactory application in clinical/research environments.
Colman, John A.; Waldron, Marcus C.; Breault, Robert F.; Lent, Robert M.
1999-01-01
Total mercury and methylmercury were measured in 4 reservoir cores and 12 wetland cores from Sudbury River. The distribution of total mercury and methylmercury in these cores was evaluated to determine the potential for total mercury and methylmercury transport from reservoir and wetlands sediments to the water column. Concentrations of methylmercury were corrected for an analytical artifact introduced during the separation distillation used in the analysis procedure. Corrected methylmercury concentrations correlated with total mercury concentrations in bulk sediment from below the top layers of reservoir and wetland cores; methylmercury concentrations at the top layers of cores were relatively high, however, and were not correlated with total mercury concentrations. Concentrations of methylmercury in pore water were positively correlated with methylmercury concentrations in the bulk sediment. High concentrations of total mercury and methylmercury in sediment (73 and 0.047 micrograms per gram dry-weight basis, respectively) contributed less to the water column in the reservoir than in the wetlands, probably because of burial by low concentration sediment and differences in the processes available to transport mercury from the sediments to the water in the reservoirs, as compared to the wetlands.
von Wyl, Agnes; Toggweiler, Stephan; Zollinger, Ruedi
2017-01-01
The Health of the Nation Outcome Scales for Children and Adolescents (HoNOSCA), in use worldwide, is a 13-item measure assessing the biopsychosocial severity of mental health problems in children and adolescents. This article introduces the authorized German-language version of HoNOSCA, the HoNOSCA-D, and examines and discusses its psychometric properties based on a clinical sample of 1,533 children and adolescents aged 4;0 to 17;11 years. For the HoNOSCA-D total score (severity of mental health problems), internal consistency (Cronbach's alpha) was 0.63. The discriminative power of the items ranged from 0.07 to 0.44; the average interitem correlation was 0.11. Due to this stochastic independence, calculation of a total severity index is acceptable. Using factor analysis, the principal axis factoring and varimax rotation resulted in a four-factor structure, which with a Kaiser-Meyer-Olkin measure of sampling adequacy of 0.684 explained 30.62% of total variance. The convergent correlations with the German-language parent report version of the Strengths and Difficulties Questionnaire were as expected and showed a medium effect size. Gender and age differences in the HoNOSCA-D total score were small. Regarding the 13 items gender and age differences were negligible to medium. The highest severity was found for schizophrenia and psychotic disorders, followed by affective disorders and social behavior disorders. Overall, validity of HoNOSCA-D was clearly supported.
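The alpha of 0.63 and the average interitem correlation of 0.11 reported above can be related through the standardized-alpha formula, alpha = k·r̄ / (1 + (k − 1)·r̄). The sketch below is only a plausibility check under the assumption that this formula applies; the abstract does not say whether the reported alpha is the raw or the standardized coefficient, and the function name is hypothetical.

```python
def standardized_alpha(n_items, mean_interitem_r):
    """Standardized Cronbach's alpha from the mean inter-item correlation."""
    return n_items * mean_interitem_r / (1 + (n_items - 1) * mean_interitem_r)


# 13 items and a mean inter-item correlation of 0.11, as given in the abstract.
print(round(standardized_alpha(13, 0.11), 2))
```

With 13 items and r̄ = 0.11 the formula gives roughly 0.62, which is in the same range as the reported 0.63; a small gap is expected if the published value is the raw (covariance-based) alpha.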
Development and validation of a new tool to measure Iranian pregnant women's empowerment.
Borghei, N S; Taghipour, A; Roudsari, R Latifnejad; Keramat, A
2016-03-15
Empowering pregnant women improves their health and reduces maternal mortality, but there is a lack of suitable tools to measure women's empowerment in some cultures. This study aimed to design and validate a questionnaire for measuring the dimensions of empowerment among Iranian pregnant women. After a literature review, and face and content validity testing, a 38-item questionnaire was developed and tested on a sample of 161 pregnant women. Factor analysis grouped the items into 3 subscales: educational empowerment (e.g. prenatal training), autonomy (e.g. financial independency and mental ability) and sociopolitical empowerment (e.g. involvement in social and political activities). Criterion validity testing showed a strong positive correlation of the total scale and subscales scores with the Kameda and the Spritzer empowerment scales. Cronbach alpha was 0.92 for total empowerment. A total of 32 items remained in the Self-Structured Pregnancy Empowerment Questionnaire, which is a valid new tool to measure the dimensions of pregnant women's empowerment.
Social contagion of correct and incorrect information in memory.
Rush, Ryan A; Clark, Steven E
2014-01-01
The present study examines how discussion between individuals regarding a shared memory affects their subsequent individual memory reports. In three experiments pairs of participants recalled items from photographs of common household scenes, discussed their recall with each other, and then recalled the items again individually. Results showed that after the discussion, individuals recalled more correct items and more incorrect items, with very small non-significant increases, or no change, in recall accuracy. The information people were exposed to during the discussion was generally accurate, although not as accurate as individuals' initial recall. Individuals incorporated correct exposure items into their subsequent recall at a higher rate than incorrect exposure items. Participants who were initially more accurate became less accurate, and initially less-accurate participants became more accurate as a result of their discussion. Comparisons to no-discussion control groups suggest that the effects were not simply the product of repeated recall opportunities or self-cueing, but rather reflect the transmission of information between individuals.
[A test to measure the degree of knowledge on food and nutrition at the onset of elementary school].
Ivanovic Marincovich, D; Castro Gómez, C G; Ivanovic Marincovich, R
1997-06-01
The objective of this work was to design a test to measure the degree of knowledge on food and nutrition in school-age children from elementary first and second grades. A graphic instrument was designed according to the psychological child development and was based on the specific objectives pursued by the curriculum programs of the Ministry of Education. The test was developed around the following topics through 15 items: Area 1: Basic Concepts on Food and Nutrition (9 items) and Area 2: Food, Personal and Environmental Hygiene (9 items). The test was pilot tested on 103 school-age children of both grades (1:1), of both sexes (1:1), belonging to Peñalolén and Las Condes counties from Chile's Metropolitan Region and from high and low socioeconomic status (SES) (1:1), measured through Graffar's Modified Method. The final version of the test was applied in a representative sample of 1,482 school-age children from Chile's Metropolitan Region from elementary first and second grades during 1986-1987. Content validity was assured by a team of judges and by the curriculum programs. Reliability was assessed by the Spearman correlation with the Spearman-Brown correction. Item-test consistency was determined by the Pearson correlation coefficient. Data were processed by the statistical analysis system (SAS) package. Results showed that the reliability coefficient was 0.84 and item-test consistency was equal to or above 0.25 in all items. It can be concluded that this test can be useful to determine the degree of knowledge on food and nutrition at the onset of elementary school, both in Chile and in other countries.
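The Spearman-Brown correction mentioned above steps a half-test correlation up to an estimate of full-length reliability, r_full = 2·r_half / (1 + r_half). The sketch below shows the general prophecy formula and its inverse. The function names are hypothetical, and the back-calculation from 0.84 assumes that the reported coefficient is the corrected split-half value, which the abstract does not state explicitly.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Predicted reliability when test length is multiplied by length_factor."""
    return length_factor * r_half / (1 + (length_factor - 1) * r_half)


def half_test_correlation(r_full):
    """Inverse of the split-half case: half-test r implied by a corrected value."""
    return r_full / (2 - r_full)


# Illustrative values only; 0.5 is not taken from the study.
print(round(spearman_brown(0.5), 3))
# Half-test correlation that would correspond to a corrected value of 0.84.
print(round(half_test_correlation(0.84), 3))
```

The correction is needed because a split-half correlation describes two tests of half the length, and shorter tests are less reliable than the full instrument.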
Senaha, Mirna Lie Hosogi; Brucki, Sonia Maria Dozzi; Nitrini, Ricardo
2010-01-01
Although language rehabilitation in patients with primary progressive aphasia (PPA) is recommended, rehabilitation studies in this clinical syndrome are scarce. Specifically, in relation to semantic dementia (SD), few studies have shown the possibility of lexical relearning. Objective: To analyze the effectiveness of rehabilitation for lexical reacquisition in SD. Methods: Three SD patients were submitted to training for lexical reacquisition based on principles of errorless learning. Comparisons between naming performance of treated items (pre and post-training) and non-treated items of the Boston Naming Test (BNT) were made. Results: All patients improved their performance in naming treated words after intervention. However, decline in performance in naming of non-treated items was observed. Case 1 named zero items at baseline while her performance post-training was 29.4% correct responses without cueing, and 90.7% correct with and without cueing. Case 2 named 6.9% of items correctly at baseline and his performance in post-training was 52.9% without cueing and 87.3% with and without cueing. Case 3 named zero items at baseline and his performance in post-training was 100% correct responses without cueing. Considering the performance in naming the non-treated items of the BNT, the percentages of correct responses in the first evaluation and in the re-evaluation, respectively, were: 16.7% and 8.3% (case 1; 14 month-interval); 26.7% and 11.6% (case 2; 18 month-interval) and 11.6% and 8.3% (case 3; 6 month-interval). Conclusions: The reacquisition of lost vocabulary may be possible in SD despite progressive semantic deterioration. PMID:29213703
Translation and linguistic validation of the Composite Autonomic Symptom Score COMPASS 31.
Pierangeli, Giulia; Turrini, Alessandra; Giannini, Giulia; Del Sorbo, Francesca; Calandra-Buonaura, Giovanna; Guaraldi, Pietro; Bacchi Reggiani, Maria Letizia; Cortelli, Pietro
2015-10-01
The aim of our study was to translate and to do a linguistic validation of the Composite Autonomic Symptom Score COMPASS 31. COMPASS 31 is a self-assessment instrument including 31 items assessing six domains of autonomic functions: orthostatic intolerance, vasomotor, secretomotor, gastrointestinal, bladder, and pupillomotor functions. This questionnaire has been created by the Autonomic group of the Mayo Clinic from two previous versions: the Autonomic Symptom Profile (ASP) composed of 169 items and the following COMPASS with 72 items selected from the ASP. We translated the questionnaire by means of a standardized forward and back-translation procedure. Thirty-six subjects, 25 patients with autonomic failure of different aetiologies and 11 healthy controls, filled in the COMPASS 31 twice, 4 ± 1 weeks apart, once in Italian and once in English in a randomized order. The test-retest showed a significant correlation between the Italian and the English versions as total score. The evaluation of single domains by means of Pearson correlation when applicable or by means of Spearman test showed a significant correlation between the English and the Italian COMPASS 31 version for all clinical domains except the vasomotor one for the lack of scoring. The comparison between the patients with autonomic failure and healthy control groups showed significantly higher total scores in patients with respect to controls, confirming the high sensitivity of COMPASS 31 in revealing autonomic symptoms.
Psychometric properties of the Symptom Status Questionnaire-Heart Failure.
Heo, Seongkum; Moser, Debra K; Pressler, Susan J; Dunbar, Sandra B; Mudd-Martin, Gia; Lennie, Terry A
2015-01-01
Many patients with heart failure (HF) experience physical symptoms, poor health-related quality of life (HRQOL), and high rates of hospitalization. Physical symptoms are associated with HRQOL and are major antecedents of hospitalization. However, reliable and valid physical symptom instruments have not been established. Therefore, this study examined the psychometric properties of the Symptom Status Questionnaire-Heart Failure (SSQ-HF) in patients with HF. Data on symptoms using the SSQ-HF were collected from 249 patients (aged 61 years, 67% male, 45% in New York Heart Association functional class III/IV). Internal consistency reliability was assessed using Cronbach's α. Item homogeneity was assessed using item-total and interitem correlations. Construct validity was assessed using factor analysis and testing hypotheses on known relationships. Data on depressive symptoms (Beck Depression Inventory II), HRQOL (Minnesota Living With Heart Failure Questionnaire), and event-free survival were collected to test known relationships. Internal consistency reliability was supported: Cronbach's α was .80. Item-total correlation coefficients and interitem correlation coefficients were acceptable. Factor analysis supported the construct validity of the instrument. More severe symptoms were associated with more depressive symptoms, poorer HRQOL, and more risk for hospitalization, emergency department visit, or death, controlling for covariates. The findings of this study support the reliability and validity of the SSQ-HF. Clinicians and researchers can use this instrument to assess physical symptoms in patients with HF.
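Item homogeneity, as described above, is usually checked with the corrected item-total correlation: each item is correlated with the sum of the remaining items, so that the item does not inflate its own total. The following is a minimal standard-library sketch of that idea. The function names and the score matrix are hypothetical, and the abstract does not say whether the corrected or the uncorrected variant was used.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def corrected_item_total(scores):
    """Correlate each item with the total of all *other* items.

    scores: list of rows, one row per respondent, one column per item.
    """
    correlations = []
    for j in range(len(scores[0])):
        item = [row[j] for row in scores]
        rest = [sum(row) - row[j] for row in scores]  # total without item j
        correlations.append(pearson(item, rest))
    return correlations


# Hypothetical illustration data: 4 respondents, 3 items.
example_scores = [
    [1, 2, 3],
    [2, 3, 4],
    [3, 4, 5],
    [4, 3, 2],
]
print([round(r, 3) for r in corrected_item_total(example_scores)])
```

A threshold such as 0.30 is often applied to these values, but the cut-off is a convention and varies between studies.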
Item Difficulty Modeling of Paragraph Comprehension Items
ERIC Educational Resources Information Center
Gorin, Joanna S.; Embretson, Susan E.
2006-01-01
Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…
Kliem, Sören; Lohmann, Anna; Mößle, Thomas; Brähler, Elmar
2018-04-25
The Beck Hopelessness Scale (BHS) has been the most frequently used instrument for the measurement of hopelessness in the past 40 years. Only recently has it officially been translated into German. The psychometric properties and factor structure of the BHS have been cause for intensive debate in the past. Based on a representative sample of the German population (N = 2450), item analysis including item sensitivity, item-total correlation and item difficulty was performed. Confirmatory factor analyses (CFA) for several factor solutions from the literature were performed. Multiple group factor analysis was performed to assess measurement invariance. Construct validity was assessed via the replication of well-established correlations with concurrently assessed measures. Most items exhibited adequate properties. Items #4, #8 and #13 exhibited poor item characteristics; each of these items had previously received negative evaluations in international studies. A one-dimensional factor solution, favorable for the calculation and interpretation of a sum score, was regarded as adequate. A bi-factor model with one content factor and two method factors (defined by positive/negative item coding) resulted in an excellent model fit. Cronbach's alpha in the current sample was .87. Hopelessness, as measured by the BHS, significantly correlated in the expected direction with suicidal ideation (r = .36), depression (r = .53) and life satisfaction (r = -.53). Strict measurement invariance could be established regarding gender and depression status. Due to limited research regarding the interpretation of fit indices with dichotomous data, interpretation of CFA results needs to remain tentative. The BHS is a valid measure of hopelessness in various subgroups of the general population. Future research could aim at replicating these findings using item response theory and cross-cultural samples. A one-dimensional bi-factor model seems appropriate even in a non-clinical population.
Saltychev, Mikhail; Bärlund, Esa; Laimi, Katri
2018-03-01
The aim of this study was to assess the correlation between pain severity measured on a numeric rating scale and restrictions of functioning measured with the WHO Disability Assessment Schedule (WHODAS 2.0). This was a cross-sectional study of 1207 patients with musculoskeletal pain conditions. Correlation was assessed using Spearman's and Pearson tests. Although all the Spearman's rank correlations between WHODAS 2.0 items and pain severity were statistically significant, they were mostly weak, with only a few moderate associations for 'S2 household responsibilities', 'S8 washing', 'S9 dressing', and 'S12 day-to-day work'. The correlation between the WHODAS 2.0 total score and pain severity was also moderate: 0.41 [95% confidence interval (CI): 0.36-0.45] for average pain and 0.42 (95% CI: 0.37-0.46) for worst pain. The correlation between the WHODAS 2.0 total score and pain level was also assessed using Pearson's product-moment correlation, yielding figures that were similar to Spearman's correlation: 0.42 (P<0.0001, 95% CI: 0.37-0.46) for average pain and 0.39 (P<0.0001, 95% CI: 0.34-0.44) for worst pain. Among patients with chronic musculoskeletal pain, the correlation between pain severity measured by numeric rating scale and functioning level measured by WHODAS 2.0 was weak to moderate, with slightly stronger associations in physical domains of functioning.
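The study above reports both Spearman's rank correlation and Pearson's product-moment correlation for the same pair of scores. Spearman's coefficient is Pearson's coefficient computed on ranks (with tied values sharing their average rank), which is why the two agree closely when the relationship is roughly linear. The sketch below illustrates this with the standard library only; the function names and data are hypothetical and unrelated to the study's patients.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def average_ranks(values):
    """1-based ranks; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho as Pearson's r on average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical monotonic but non-linear relationship.
x = [1, 2, 3, 4]
y = [1, 4, 9, 16]
print(round(pearson(x, y), 3), round(spearman(x, y), 3))
```

On monotonic but curved data, Spearman's coefficient reaches 1 while Pearson's stays below it, so similar values for both, as in the abstract, suggest an approximately linear association.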
Identification and topographic localization of metallic foreign bodies by metal detector.
Muensterer, Oliver J; Joppich, Ingolf
2004-08-01
Exact localization of ingested metal objects is necessary to guide therapy. This study prospectively evaluates the accuracy of foreign body (FB) identification and localization by metal detector (MTD) in a systematic topographic fashion. Patients who presented after an alleged or witnessed metal FB ingestion were scanned with an MTD. In case of a positive signal, the location was recorded in a topographic diagram, and radiographs were obtained. The diagnostic accuracy of the MTD scan for FB identification and topographic localization was determined by chi-square analysis, and concordance was calculated by the McNemar test and expressed as kappa. A total of 70 MTD examinations were performed on 65 patients (age 6 months to 16 years); 5 patients were scanned twice on different days. The majority had swallowed coins and button batteries (n = 41). Of these, 29 items were correctly identified, and 11 of 12 were correctly ruled out (coins and button batteries: sensitivity, 100% [95% confidence interval (CI) 95% to 100%]; specificity, 91.7% [95% CI 76% to 100%], kappa = 0.94). When all metallic objects were included, 41 of 46 were correctly identified, and 22 of 24 were correctly ruled out (sensitivity, 89.1% [95% CI 80% to 98%]; specificity, 91.7% [95% CI 81% to 100%], kappa = 0.78). Five miscellaneous objects were not identified (sensitivity for items other than coins and button batteries 71% [95% CI 49% to 92%], kappa = 0.56). Localization by MTD was correct in 30 of 41 identified objects (73%). The error rates of junior and senior pediatric surgery residents did not differ significantly (P = .82). Ingested coins and button batteries can be safely and accurately found by metal detector. For these indications, the MTD is a radiation-free diagnostic alternative to conventional radiographs. Other items, however, cannot be ruled out reliably by MTD. In these cases, radiographic imaging is still indicated.
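The sensitivity, specificity and kappa values in this abstract can be recomputed from the counts it states. The sketch below assumes the radiograph is the reference standard and derives the 2x2 cells from the text (for all objects: 41 of 46 identified, 22 of 24 ruled out; for coins and button batteries: 29 of 29 identified, 11 of 12 ruled out). The function name `diagnostic_summary` is an illustrative choice, and the kappa shown is the unweighted Cohen's kappa, which may or may not be exactly how the authors computed theirs.

```python
def diagnostic_summary(tp, fn, fp, tn):
    """Return (sensitivity, specificity, Cohen's kappa) for a 2x2 table.

    tp/fn: reference-positive cases detected / missed by the index test.
    fp/tn: reference-negative cases flagged / correctly ruled out.
    """
    n = tp + fn + fp + tn
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    observed = (tp + tn) / n
    # Chance agreement from the marginal totals of both methods.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    kappa = (observed - expected) / (1 - expected)
    return sensitivity, specificity, kappa


# Counts derived from the abstract (70 examinations in total).
all_objects = diagnostic_summary(tp=41, fn=5, fp=2, tn=22)
coins_batteries = diagnostic_summary(tp=29, fn=0, fp=1, tn=11)

print([round(v, 2) for v in all_objects])      # [0.89, 0.92, 0.78]
print([round(v, 2) for v in coins_batteries])  # [1.0, 0.92, 0.94]
```

Both sets of rounded values agree with the figures reported in the abstract (89.1%, 91.7%, kappa 0.78 and 100%, 91.7%, kappa 0.94).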
Kostuj, Tanja; Stief, Felix; Hartmann, Kirsten Anna; Schaper, Katharina; Arabmotlagh, Mohammad; Baums, Mike H; Meurer, Andrea; Krummenauer, Frank; Lieske, Sebastian
2018-04-05
After cross-cultural adaption for the German translation of the Ankle-Hindfoot Scale of the American Orthopaedic Foot and Ankle Society (AOFAS-AHS) and agreement analysis with the Foot Function Index (FFI-D), the following gait analysis study using the Oxford Foot Model (OFM) was carried out to show which of the two scores better correlates with objective gait dysfunction. Results of the AOFAS-AHS and FFI-D, as well as data from three-dimensional gait analysis were collected from 20 patients with mild to severe ankle and hindfoot pathologies. Kinematic and kinetic gait data were correlated with the results of the total AOFAS scale and FFI-D as well as the results of those items representing hindfoot function in the AOFAS-AHS assessment. With respect to the foot disorders in our patients (osteoarthritis and prearthritic conditions), we correlated the total range of motion (ROM) in the ankle and subtalar joints as identified by the OFM with values identified during clinical examination 'translated' into score values. Furthermore, reduced walking speed, reduced step length and reduced maximum ankle power generation during push-off were taken into account and correlated to gait abnormalities described in the scores. An analysis of correlations with CIs between the FFI-D and the AOFAS-AHS items and the gait parameters was performed by means of the Jonckheere-Terpstra test; furthermore, exploratory factor analysis was applied to identify common information structures and thereby redundancy in the FFI-D and the AOFAS-AHS items.
Objective findings for hindfoot disorders, namely a reduced ROM in the ankle and subtalar joints, respectively, as well as reduced ankle power generation during push-off, showed a better correlation with the AOFAS-AHS total score (as well as with AOFAS-AHS items representing ROM in the ankle, subtalar joints and gait function) than with the FFI-D score. Factor analysis, however, could not identify FFI-D items consistently related to these three indicator parameters (pain, disability and function) found in the AOFAS-AHS. Furthermore, factor analysis did not support stratification of the FFI-D into two subscales. The AOFAS-AHS showed a good agreement with objective gait parameters and is therefore better suited to evaluate disability and functional limitations of patients suffering from foot and ankle pathologies compared with the FFI-D.
Lam-Figueroa, Nelly; Contreras-Pulache, Hans; Mori-Quispe, Elizabeth; Nizama-Valladolid, Martín; Gutiérrez, César; Hinostroza-Camposano, Williams; Reyes, Erasmo Torrejón; Hinostroza-Camposano, Richard; Coaquira-Condori, Elizabeth; Hinostroza-Camposano, Willy David
2011-01-01
To develop and validate an instrument to assess the Internet Addiction (IA) phenomenon in adolescents of Metropolitan Lima. We performed an observational analytical study, including a sample of 248 high school adolescent students. In order to evaluate IA, we constructed the questionnaire "Scale for Internet Addiction of Lima" (SIAL), which assesses symptoms and dysfunctional characteristics. The resulting items were submitted to experts' judgment, finally obtaining an 11-item scale. The mean age was 14 years old. The psychometric analysis of the instrument showed a Cronbach's alpha coefficient of 0.84, with values of item-total correlation ranging from 0.45 to 0.59. The dimensional analysis yielded a two-dimensional structure that explained up to 50.7% of the total variance. The bi-dimensional data analysis revealed a significant association (p<0.001) between Dimension I (symptoms of IA) and the weekly time spent on the Internet, male sex, past history of bad behavior in school and plans for the future. Dimension II (dysfunction due to IA) had a significant association with past history of bad behavior, plans for the future (p<0.001) and missing school without valid reasons. The SIAL showed a good internal consistency, with moderate and significant inter-item correlations. The findings show that addiction has a dynamic role, which evidences a problem generated in family patterns and inadequate social networks.
Cain, Kelli L; Gavand, Kavita A; Conway, Terry L; Geremia, Carrie M; Millstein, Rachel A; Frank, Lawrence D; Saelens, Brian E; Adams, Marc A; Glanz, Karen; King, Abby C; Sallis, James F
2017-06-01
Macroscale built environment factors (e.g., street connectivity) are correlated with physical activity. Less-studied but more modifiable microscale elements (e.g., sidewalks) may also influence physical activity, but shorter audit measures of microscale elements are needed to promote wider use. This study evaluated the relation of an abbreviated 54-item streetscape audit tool with multiple measures of physical activity in four age groups. We developed a 54-item version from the original 120-item Microscale Audit of Pedestrian Streetscapes (MAPS). Audits were conducted on 0.25-0.45 mile routes from participant residences toward the nearest nonresidential destination for children (N=758), adolescents (N=897), younger adults (N=1,655), and older adults (N=367). Active transport and leisure physical activity were measured with surveys, and objective physical activity was measured with accelerometers. Items to retain from original MAPS were selected primarily by correlations with physical activity. Mixed linear regression analyses were conducted for MAPS-Abbreviated summary scores, adjusting for demographics, participant clustering, and macroscale walkability. MAPS-Abbreviated and original MAPS total scores correlated r=.94. The MAPS-Abbreviated tool was related similarly to physical activity outcomes as the original MAPS. Destinations and land use, streetscape and walking path characteristics, and overall total scores were significantly related to active transport in all age groups. Street crossing characteristics were related to active transport in children and older adults. Aesthetics and social characteristics were related to leisure physical activity in children and younger adults, and cul-de-sacs were related with physical activity in youth. Total scores were related to accelerometer-measured physical activity in children and older adults. MAPS-Abbreviated is a validated observational measure for use in research.
The length and related cost of implementation has been cited as a barrier to use of microscale instruments, so availability of this shorter validated measure could lead to more widespread use of streetscape audits in health research.
Blasimann, Angela; Dauphinee, Sharon Wood; Staal, J Bart
2014-12-01
Clinical measurement. To translate and cross-culturally adapt the Hip disability and Osteoarthritis Outcome Score (HOOS) from English into German, and to study its psychometric properties in patients after hip surgery. There is no specific hip questionnaire in German that not only measures symptoms and function but also contains items about hip-related quality of life. The translation and cross-cultural adaptation involved forward translation, harmonization, cognitive debriefing, back translation, and comparison to the original HOOS following international guidelines. The German version was tested in 51 Swiss inpatients 8 weeks after different types of hip surgery, mainly total hip replacement. The mean age of the participants was 62.5 years, and the age range was from 27 to 87 years. Thirty (58.8%) of the participants were women. Internal consistency and test-retest reliability were estimated using Cronbach alpha and intraclass correlation coefficients for agreement. For construct validity, total scores of the German HOOS were correlated with those of the Western Ontario and McMaster Universities Osteoarthritis Index. The HOOS was also compared to the Medical Outcomes Study 36-Item Short-Form Health Survey. Cronbach alpha values for all German HOOS subscales were between .87 and .93. For test-retest reliability, the intraclass correlation coefficient for agreement was 0.85 for the total scores of the German HOOS. The Spearman rho for the Medical Outcomes Study 36-Item Short-Form Health Survey physical functioning subscale compared to the sum of all HOOS subscales was 0.71, and that for the Medical Outcomes Study 36-Item Short-Form Health Survey physical component summary was 0.97. The German HOOS has demonstrated adequate reliability and validity. Use of the German HOOS is recommended for assessment of patients after hip surgery, with the proviso that additional psychometric testing should be done in future research.
Cain, Kelli L.; Gavand, Kavita A.; Conway, Terry L.; Geremia, Carrie M.; Millstein, Rachel A.; Frank, Lawrence D.; Saelens, Brian E.; Adams, Marc A.; Glanz, Karen; King, Abby C.; Sallis, James F.
2017-01-01
Purpose Macroscale built environment factors (e.g., street connectivity) are correlated with physical activity. Less-studied but more modifiable microscale elements (e.g., sidewalks) may also influence physical activity, but shorter audit measures of microscale elements are needed to promote wider use. This study evaluated the relation of an abbreviated 54-item streetscape audit tool with multiple measures of physical activity in four age groups. Methods We developed a 54-item version from the original 120-item Microscale Audit of Pedestrian Streetscapes (MAPS). Audits were conducted on 0.25-0.45 mile routes from participant residences toward the nearest nonresidential destination for children (N=758), adolescents (N=897), younger adults (N=1,655), and older adults (N=367). Active transport and leisure physical activity were measured with surveys, and objective physical activity was measured with accelerometers. Items to retain from original MAPS were selected primarily by correlations with physical activity. Mixed linear regression analyses were conducted for MAPS-Abbreviated summary scores, adjusting for demographics, participant clustering, and macroscale walkability. Results MAPS-Abbreviated and original MAPS total scores correlated r=.94. The MAPS-Abbreviated tool was related similarly to physical activity outcomes as the original MAPS. Destinations and land use, streetscape and walking path characteristics, and overall total scores were significantly related to active transport in all age groups. Street crossing characteristics were related to active transport in children and older adults. Aesthetics and social characteristics were related to leisure physical activity in children and younger adults, and cul-de-sacs were related with physical activity in youth. Total scores were related to accelerometer-measured physical activity in children and older adults. Conclusion MAPS-Abbreviated is a validated observational measure for use in research.
The length and related cost of implementation has been cited as a barrier to use of microscale instruments, so availability of this shorter validated measure could lead to more widespread use of streetscape audits in health research. PMID:29270361
Development of the Serenity Scale.
Roberts, K T; Aspy, C B
1993-01-01
Serenity is a sustained inner peace. Nurses can use knowledge about serenity to help clients cope with harsh circumstances. The Serenity Scale is a 40-item self-report, summated scale that evaluates clients' serenity status. Critical attributes, identified by serenity experts, served as the theoretical framework. Sixty-five items were given to 542 male and female subjects age 20 to 95 (73% Caucasians and 27% minority) from varying income and educational levels yielding an alpha of .93. Forty items (SS.V2) were extracted for further analysis. The alpha coefficient was .92 with item-to-total correlations ranging from .25 to .67. Item means ranged from 2.6-3.7 (grand mean = 3.4). A principal components factor analysis with varimax rotation revealed nine factors explaining 58.2% of the variance. Limitations are that SS.V2 has not been tested with an independent sample and subjects with low educational levels had difficulty with some items.
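Several abstracts in this collection, including the one above, report Cronbach's alpha together with item-to-total correlations. The following is a minimal sketch of both statistics, assuming responses are stored as one list per respondent with one score per item. The function names and the `responses` matrix are illustrative assumptions and are unrelated to the Serenity Scale data. The item-total correlation shown is the corrected form, in which each item is correlated with the sum of the remaining items.

```python
from math import sqrt


def sample_variance(values):
    """Unbiased sample variance (divides by n - 1)."""
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / (n - 1)


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cronbach_alpha(rows):
    """alpha = k/(k-1) * (1 - sum of item variances / variance of totals)."""
    k = len(rows[0])
    item_variances = sum(sample_variance(col) for col in zip(*rows))
    total_variance = sample_variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_variances / total_variance)


def corrected_item_total(rows):
    """Correlate each item with the sum of all other items."""
    correlations = []
    for j in range(len(rows[0])):
        item = [row[j] for row in rows]
        rest = [sum(row) - row[j] for row in rows]
        correlations.append(pearson(item, rest))
    return correlations


# Hypothetical responses: 6 respondents x 4 items on a 1-5 scale.
responses = [
    [4, 5, 4, 3],
    [2, 2, 3, 2],
    [5, 4, 5, 4],
    [3, 3, 2, 3],
    [1, 2, 1, 2],
    [4, 4, 5, 5],
]
print("alpha:", round(cronbach_alpha(responses), 3))
print("item-rest r:", [round(r, 3) for r in corrected_item_total(responses)])
```

Excluding the item from its own total avoids inflating the correlation, which matters most for short scales where a single item makes up a large share of the sum.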
Psychometric evaluation of the fatigue severity scale for use in chronic hepatitis C.
Kleinman, L; Zodet, M W; Hakim, Z; Aledort, J; Barker, C; Chan, K; Krupp, L; Revicki, D
2000-01-01
Evidence exists demonstrating that infection with hepatitis C virus impairs health-related quality of life, but less is known about the effect of fatigue, a common symptom, on everyday life. The psychometric properties of the fatigue severity scale (FSS) were explored to determine suitability as an outcome measure in clinical trials. The FSS includes nine items developed to measure disabling fatigue and a visual analog scale (VAS) to measure overall fatigue. Using baseline data from three clinical trials (n = 1225) involving chronic hepatitis C patients, scaling and psychometric characteristics of the FSS were assessed. The SF-36 was also used in the trials. Item response theory analysis demonstrated that the FSS items can be placed along a single homogenous domain, fatigue. Internal consistency reliability was 0.94. Test-retest reliability was 0.82 for the total score and 0.80 for the VAS. The total score and the VAS were significantly correlated with the SF-36 vitality subscale (r = -0.76 and r = -0.76 respectively). Correlations with other SF-36 subscales were moderate (r = -0.46 to r = -0.67, all p < 0.0001). In summary, the FSS possesses good psychometric properties.
Ypofanti, Maria; Zisi, Vasiliki; Zourbanos, Nikolaos; Mouchtouri, Barbara; Tzanne, Pothiti; Theodorakis, Yannis; Lyrakos, Georgios
2015-09-30
Goldberg's International Personality Item Pool (IPIP) big-five personality factor markers currently lack validating evidence. The structure of the 50-item IPIP was examined in two different adult samples (total N=811), in each case justifying a 5-factor solution, with only minor discrepancies. Age differences were comparable to previous findings using other inventories. One sample (N=193) also completed another personality measure (the TIPI Short Form). Conscientiousness, extraversion and emotional stability/neuroticism scales of the IPIP were highly correlated with those of the TIPI (r=0.62 to 0.65, P=0.01). Agreeableness and Intellect/Openness scales correlated less strongly (r=0.54 and 0.58 respectively, P=0.01). The IPIP scales have good internal consistency (α=0.88) and relate strongly to major dimensions of personality assessed by the two questionnaires.
Development of the Brazilian brief version of the Diabetes Quality of Life Measure (DQOL-Brazil-8).
Brasil, Fábio; Brasil, Andreia Mara Brolezzi; e Souza, Rodrigo Augusto de Paula; Pontarolo, Roberto; Correr, Cassyano Januário
2015-01-01
To provide for Brazil, through the selection of items of the Brazilian version of the Diabetes Quality of Life Measure (DQOL-Brazil), a concise instrument. This is a cross-sectional study in which the DQOL-Brazil was administered to 150 type 1 diabetic patients and 146 type 2 diabetic patients. The items of the instrument were selected according to the analysis of the principal components and Spearman's correlations with treatment satisfaction, glycated hemoglobin level, and Nottingham Health Profile. From a total of 44 items, only 8 were selected to compose the summary instrument (DQOL-Brazil-8). The DQOL-Brazil-8 presented Spearman's correlation of 0.873 with the DQOL-Brazil and a Cronbach's alpha coefficient of 0.702. The Brazilian health professionals now have a brief tool for a fast application that preserves the best features of the full DQOL-Brazil.
Application of the diligence inventory in dental education.
Jasinevicius, T R; Bernard, H; Schuttenberg, E M
1998-04-01
The fifty-five-item Diligence Inventory for Higher Education (DI-HE) was applied to a new subject group--190 dental students. After item and factor analysis, a fifty-item (four subscale) inventory best reflected this group. The DI-HE's split half reliability was 0.81 (p < 0.001), the reliability coefficient for the pre- and post-test was 0.68 (p < 0.01), and the correlation coefficient alpha was 0.90. The DI-HE scores were high, with no statistical differences among the four classes. Overall, significant relationships were found between grade point averages (GPAs) and DI-HE total and subscale scores, with r values as high as 0.44. While female students' DI-HE scores were significantly higher (p = 0.023) than male students' scores, no correlations between DI-HE scores and GPAs for females were found. The results suggest that DI-HE may be useful for assessment purposes in professional education.
Development of a 12-item short version of the HIV stigma scale.
Reinius, Maria; Wettergren, Lena; Wiklander, Maria; Svedhem, Veronica; Ekström, Anna Mia; Eriksson, Lars E
2017-05-30
Valid and reliable instruments for the measurement of enacted, anticipated and internalised stigma in people living with HIV are crucial for mapping trends in the prevalence of HIV-related stigma and tracking the effectiveness of stigma-reducing interventions. Although longer instruments exist, e.g., the commonly used 40-item HIV Stigma Scale by Berger et al., a shorter instrument would be preferable to facilitate the inclusion of HIV stigma in more and broader surveys. Therefore, the aim of this work was to develop a substantially shorter, but still valid, version of the HIV Stigma Scale. Data from a psychometric evaluation of the Swedish 40-item HIV Stigma Scale were reanalysed to create a short version with 12 items (three from each of the four stigma subscales: personalised stigma, disclosure concerns, concerns with public attitudes and negative self-image). The short version of the HIV stigma scale was then psychometrically tested using data from a national survey investigating stigma and quality of life among people living with HIV in Sweden (n = 880, mean age 47.9 years, 26% female). The hypothesized factor structure of the proposed short version was replicated in exploratory factor analysis without cross loadings and confirmatory factor analysis supported construct validity with high standardised effects (>0.7) of items on the intended scales. The χ² test was statistically significant (χ² = 154.2, df = 48, p < 0.001), but alternate fit measures indicated acceptable fit (comparative fit index: 0.963, Tucker-Lewis index: 0.950 and root mean square error of approximation: 0.071). Corrected item-total correlation coefficients were >0.4 for all items, with a variation indicating that the broadness of the concept of stigma had been captured. All but two aspects of HIV-related stigma that the instrument is intended to cover were captured by the selected items in the short version.
The aspects that did not lose any items were judged to have acceptable psychometric properties. The short version of the instrument showed higher floor and ceiling effects than the full-length scale, indicating a loss of sensitivity in the short version. Cronbach's α for the subscales were all >0.7. Although being less sensitive in measurement, the proposed 12-item short version of the HIV Stigma Scale has comparable psychometric properties to the full-length scale and may be used when a shorter instrument is needed.
New evidence of factor structure and measurement invariance of the SDQ across five European nations.
Ortuño-Sierra, Javier; Fonseca-Pedrero, Eduardo; Aritio-Solana, Rebeca; Velasco, Alvaro Moreno; de Luis, Edurne Chocarro; Schumann, Gunter; Cattrell, Anna; Flor, Herta; Nees, Frauke; Banaschewski, Tobias; Bokde, Arun; Whelan, Rob; Buechel, Christian; Bromberg, Uli; Conrod, Patricia; Frouin, Vincent; Papadopoulos, Dimitri; Gallinat, Juergen; Garavan, Hugh; Heinz, Andreas; Walter, Henrik; Struve, Maren; Gowland, Penny; Paus, Tomáš; Poustka, Luise; Martinot, Jean-Luc; Paillère-Martinot, Marie-Laure; Vetter, Nora C; Smolka, Michael N; Lawrence, Claire
2015-12-01
The main purpose of the present study was to analyse the internal structure and to test the measurement invariance of the Strengths and Difficulties Questionnaire (SDQ), self-reported version, in five European countries. The sample consisted of 3012 adolescents aged between 12 and 17 years (M = 14.20; SD = 0.83). The five-factor model (with correlated errors added), and the five-factor model (with correlated errors added) with the reverse-worded items allowed to cross-load on the Prosocial subscale, displayed adequate goodness-of-fit indices. Multi-group confirmatory factor analysis showed that the five-factor model (with correlated errors added) had partial strong measurement invariance by countries. A total of 11 of the 25 items were non-invariant across samples. The level of internal consistency of the Total difficulties score was 0.84, ranging between 0.69 and 0.78 for the SDQ subscales. The findings indicate that the SDQ's subscales need to be modified in various ways for screening emotional and behavioural problems in the five European countries that were analysed.
Reliability of the Adult Myopathy Assessment Tool in Individuals with Myositis
Harris-Love, Michael O.; Joe, Galen; Davenport, Todd E.; Koziol, Deloris; Rose, Kristen Abbett; Shrader, Joseph A.; Vasconcelos, Olavo M.; McElroy, Beverly; Dalakas, Marinos C.
2015-01-01
Objective The Adult Myopathy Assessment Tool (AMAT) is a 13-item performance-based battery developed to assess functional status and muscle endurance. The purpose of this study was to determine the intrarater and interrater reliability of the AMAT in adults with myositis. Methods Nineteen raters (13 physical therapists and 6 physicians) scored videotaped recordings of patients with myositis performing the AMAT for a total of 114 tests and 1,482 item observations per session. Raters rescored the AMAT test and item observations during a follow-up session (19 ± 6 days between scoring sessions). All raters completed a single, self-directed, electronic training module prior to the initial scoring session. Results Intrarater and interrater reliability correlation coefficients were .94 or greater for the AMAT Functional Subscale, Endurance Subscale, and Total score (all p < 0.02 for H0: ρ ≤ 0.75). All AMAT items had satisfactory intrarater agreement (Kappa statistics with Fleiss-Cohen weights, Kw = .57-1.00). Interrater agreement was acceptable for each AMAT item (K = .56-.89) except the sit up (K = .16). The standard error of measurement and 95% confidence interval range for the AMAT Total scores did not exceed 2 points across all observations (AMAT Total score range = 0-45). Conclusions The AMAT is a reliable, domain-specific assessment of functional status and muscle endurance for adult subjects with myositis. Results of this study suggest that physicians and physical therapists may reliably score the AMAT following a single training session. The AMAT Functional Subscale, Endurance Subscale, and Total score exhibit interrater and intrarater reliability suitable for clinical and research use. PMID:25201624
The hippocampus supports both recollection and familiarity when memories are strong
Smith, Christine N.; Wixted, John T.; Squire, Larry R.
2011-01-01
Recognition memory is thought to consist of two component processes – recollection and familiarity. It has been suggested that the hippocampus supports recollection, while adjacent cortex supports familiarity. However, the qualitative experiences of recollection and familiarity are typically confounded with a quantitative difference in memory strength (recollection > familiarity). Thus, the question remains whether the hippocampus might in fact support familiarity-based memories whenever they are as strong as recollection-based memories. We addressed this problem in a novel way using the Remember/Know procedure where we could explicitly match the confidence and accuracy of Remember and Know decisions. As in earlier studies, recollected items had higher accuracy and confidence than familiar items, and hippocampal activity was higher for recollected items than for familiar items. Furthermore hippocampal activity was similar for familiar items, misses, and correct rejections. When the accuracy and confidence of recollected and familiar items were matched, the findings were dramatically different. Hippocampal activity was now similar for recollected and familiar items. Importantly, hippocampal activity was also greater for familiar items than for misses or correct rejections (as well as for recollected items vs. misses or correct rejections). Our findings suggest that the hippocampus supports both recollection and familiarity when memories are strong. PMID:22049412
Powers, John H; Bacci, Elizabeth D; Guerrero, M Lourdes; Leidy, Nancy Kline; Stringer, Sonja; Kim, Katherine; Memoli, Matthew J; Han, Alison; Fairchok, Mary P; Chen, Wei-Ju; Arnold, John C; Danaher, Patrick J; Lalani, Tahaniyat; Ridoré, Michelande; Burgess, Timothy H; Millar, Eugene V; Hernández, Andrés; Rodríguez-Zulueta, Patricia; Smolskis, Mary C; Ortega-Gallegos, Hilda; Pett, Sarah; Fischer, William; Gillor, Daniel; Macias, Laura Moreno; DuVal, Anna; Rothman, Richard; Dugas, Andrea; Ruiz-Palacios, Guillermo M
2018-02-01
To assess the reliability, validity, and responsiveness of InFLUenza Patient-Reported Outcome (FLU-PRO©) scores for quantifying the presence and severity of influenza symptoms. An observational prospective cohort study of adults (≥18 years) with influenza-like illness in the United States, the United Kingdom, Mexico, and South America was conducted. Participants completed the 37-item draft FLU-PRO daily for up to 14 days. Item-level and factor analyses were used to remove items and determine factor structure. Reliability of the final tool was estimated using Cronbach α and intraclass correlation coefficients (2-day reliability). Convergent and known-groups validity and responsiveness were assessed using global assessments of influenza severity and return to usual health. Of the 536 patients enrolled, 221 influenza-positive subjects comprised the analytical sample. The mean age of the patients was 40.7 years, 60.2% were women, and 59.7% were white. The final 32-item measure has six factors/domains (nose, throat, eyes, chest/respiratory, gastrointestinal, and body/systemic), with a higher order factor representing symptom severity overall (comparative fit index = 0.92; root mean square error of approximation = 0.06). Cronbach α was high (total = 0.92; domain range = 0.71-0.87); test-retest reliability (intraclass correlation coefficient, day 1-day 2) was 0.83 for total scores and 0.57 to 0.79 for domains. Day 1 FLU-PRO domain and total scores were moderately to highly correlated (≥0.30) with Patient Global Rating of Flu Severity (except nose and throat). Consistent with known-groups validity, scores differentiated severity groups on the basis of global rating (total: F = 57.2, P < 0.001; domains: F = 8.9-67.5, P < 0.001). Subjects reporting return to usual health showed significantly greater (P < 0.05) FLU-PRO score improvement by day 7 than did those who did not, suggesting score responsiveness. 
Results suggest that FLU-PRO scores are reliable, valid, and responsive to change in influenza-positive adults.
External validity of the pediatric cardiac quality of life inventory
Marino, Bradley S.; Drotar, Dennis; Cassedy, Amy; Davis, Richard; Tomlinson, Ryan S.; Mellion, Katelyn; Mussatto, Kathleen; Mahony, Lynn; Newburger, Jane W.; Tong, Elizabeth; Cohen, Mitchell I.; Helfaer, Mark A.; Kazak, Anne E.; Wray, Jo; Wernovsky, Gil; Shea, Judy A.; Ittenbach, Richard
2012-01-01
Purpose The Pediatric Cardiac Quality of Life Inventory (PCQLI) is a disease-specific, health-related quality of life (HRQOL) measure for pediatric heart disease (HD). The purpose of this study was to demonstrate the external validity of PCQLI scores. Methods The PCQLI development site (Development sample) and six geographically diverse centers in the United States (Composite sample) recruited pediatric patients with acquired or congenital HD. Item response option variability, scores [Total (TS); Disease Impact (DI) and Psychosocial Impact (PI) subscales], patterns of correlation, and internal consistency were compared between samples. Results A total of 3,128 patients and parent participants (1,113 Development; 2,015 Composite) were analyzed. Response option variability patterns of all items in both samples were acceptable. Inter-sample score comparisons revealed no differences. Median item–total (Development, 0.57; Composite, 0.59) and item–subscale (Development, DI 0.58, PI 0.59; Composite, DI 0.58, PI 0.56) correlations were moderate. Subscale–subscale (0.79 for both samples) and subscale–total (Development, DI 0.95, PI 0.95; Composite, DI 0.95, PI 0.94) correlations and internal consistency (Development, TS 0.93, DI 0.90, PI 0.84; Composite, TS 0.93, DI 0.89, PI 0.85) were high in both samples. Conclusion PCQLI scores are externally valid across the US pediatric HD population and may be used for multi-center HRQOL studies. PMID:21188538
Development and Validation of a Quality-of-Life Instrument for Infantile Hemangiomas.
Chamlin, Sarah L; Mancini, Anthony J; Lai, Jin-Shei; Beaumont, Jennifer L; Cella, David; Adams, Denise; Drolet, Beth; Baselga, Eulalia; Frieden, Ilona J; Garzon, Maria; Holland, Kristin; Horii, Kimberly A; Lucky, Anne W; McCuaig, Catherine; Metry, Denise; Morel, Kimberly D; Newell, Brandon D; Nopper, Amy J; Powell, Julie; Siegel, Dawn; Haggstrom, Anita N
2015-06-01
Infantile hemangiomas (IH) are common tumors for which there is no validated disease-specific instrument to measure the quality of life in infants and their parents/caregivers during the critical first months of life. This study prospectively developed and validated a quality-of-life instrument for patients with IH and their parents/caregivers and correlated demographic and clinical features to the effects on the quality of life. A total of 220 parents/caregivers completed the 35-item Infantile Hemangioma Quality-of-Life (IH-QoL) instrument and provided demographic information. The dimensionality of the items was evaluated using factor analysis, with results suggesting four factors: child physical symptoms, child social interactions, parent emotional functioning, and parent psychosocial functioning. Each factor fit the Rasch measurement model with acceptable fit index (mean square <1.4) and demonstrated excellent internal consistency, with alpha ranging from 0.76 to 0.88. The final instrument consists of four scales with a total of 29 items. Content validity was verified by analyzing parents' responses to an open-ended question. Test-retest reliability at a 48-hour interval was supported by a total IH-QoL intraclass correlation coefficient of 0.84. Certain clinical characteristics of hemangioma, including those located on the head and neck, in the proliferative stage, and requiring treatment, are associated with a greater impact on QoL.
Protecting children: a survey of caregivers’ knowledge of Georgia’s child restraint laws
Strasser, Sheryl; Whorton, Laurie; Walpole, Amanda J; Beddington, Sarah
2010-01-01
Introduction The leading cause of injury and death among children in the United States is motor vehicle crashes. Even though restraint laws are in place and public awareness campaigns and educational interventions have increased, many children are still improperly restrained or not restrained at all. When correctly used, child restraints significantly reduce risk of injury or death. Methods The purpose of the study was to elicit caregiver baseline knowledge of car seat installation and regulation before receiving car seat education from certified technicians at Inspection Station events. Inspection Station is a program whereby staff assists parents in correctly positioning car seats in participants’ vehicles. Over an 8-week period, Safe Kids Cobb County Car Seat Technicians distributed a 16-item survey, with 10 knowledge-based questions and six demographic questions to Inspection Station participants. Descriptive statistics and t-tests were conducted to assess relationships between participant age, ethnicity, and gender with overall knowledge scores. Regression analysis was run to determine the association between participant education level and total child restraint knowledge. Results One hundred sixty-nine surveys were completed. Participant knowledge of vehicular child restraint ranged from 0% to 90% on all items. Only 29.6% of caregivers understood the proper tightness of the harness system. Less than half of the caregivers (43.8%) were aware of the Georgia law requiring children aged 6 years and younger to be in some type of child restraint. Only 43.2% of caregivers surveyed knew that children need to ride in a rear-facing child restraint until 1 year of age and 20 pounds. No significant correlations between participant knowledge and age were found. Statistically significant associations were found between total knowledge scores and education level, ethnicity, and gender. 
Discussion The results from this study describe baseline knowledge among a sample of participants at Inspection Station activities held in Cobb County, Georgia. These results can help inform tailoring of future programming so that the impact of enhanced health education/prevention messages for intended populations can be maximized and child injury risk related to improper restraints can be minimized. PMID:22312220
Lin, Chung-Ying; Wang, Jung-Der; Pai, Ming-Chyi; Ku, Li-Jung Elizabeth
To examine the psychometric properties of different short versions of the Zarit Burden Interview (ZBI), and to find an efficient and valid short version for clinical use among dementia caregivers. A total of 270 Taiwanese dementia caregivers filled out the full form of the ZBI, which contains 22 items. Using the 22-item ZBI, we applied confirmatory factor analysis (CFA) to calculate the fit indices of all proposed short versions with various items to determine useful short versions. Additional associations between each useful short version and informal care hours, as well as subjective financial situations, were examined to understand their concurrent validity. Based on the CFA results, three short versions of the ZBI performed excellently (4-item version: comparative fit index [CFI]=1.000, Tucker-Lewis index [TLI]=1.035, standardized root mean square residual [SRMR]=0.019, and root mean square error of approximation [RMSEA]=0.000; 8-item version: CFI=0.970, TLI=0.958, SRMR=0.045, and RMSEA=0.065; 12-item version: CFI=0.959, TLI=0.950, SRMR=0.053, and RMSEA=0.075). In addition, the 12-item ZBI, as compared with other versions, had a higher correlation with the number of informal care hours. The 12-item ZBI was also highly correlated with the original 22-item ZBI (r=0.952). We found the 12-item ZBI to be a promising measure for healthcare providers to assess the burden of dementia caregivers quickly and efficiently. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Compliance with universal precautions in correctional health care facilities.
Gershon, R R; Karkashian, C D; Vlahov, D; Kummer, L; Kasting, C; Green-McKenzie, J; Escamilla-Cejudo, J A; Kendig, N; Swetz, A; Martin, L
1999-03-01
There were three main objectives of this cross-sectional study of Maryland State correctional health care workers. The first was to evaluate compliance with work practices designed to minimize exposure to blood and body fluids; the second, to identify correlates of compliance with universal precautions (UPs); and the third was to determine the relationship, if any, between compliance and exposures. Of 216 responding health care workers, 34% reported overall compliance across all 15 items on a compliance scale. Rates for specific items were particularly low for use of certain types of personal protective equipment, such as protective eyewear (53.5%), face mask (47.2%) and protective clothing (33.9%). Compliance rates were highest for glove use (93.2%), waste disposal (89.8%), and sharps disposal (80.8%). Compliance rates were generally not associated with demographic factors, except for age; younger workers were more likely to be compliant with safe work practices than were older workers (P < 0.05). Compliance was positively associated with several work-related variables, including perceived safety climate (i.e., management's commitment to infection control and the overall safety program) and job satisfaction, and was found to be inversely associated with security-related work constraints, job/task factors, adverse working conditions, workplace discrimination, and perceived work stress. Bloodborne exposures were not uncommon; 13.8% of all respondents had at least one bloodborne exposure within the previous 6 months, and compliance was inversely related to blood and body fluid exposures. This study identified several potentially modifiable correlates of compliance, including factors unique to the correctional setting. Infection-control interventional strategies specifically tailored to these health care workers may therefore be most effective in reducing the risk of bloodborne exposures.
Pantalon, Michael V; Dziura, James; Li, Fang-Yong; Owens, Patricia H; O'Connor, Patrick G; D'Onofrio, Gail
2017-01-01
No psychometrically validated instrument for evaluating the extent to which interventionists correctly implement brief interventions designed to motivate treatment engagement for opioid use disorders has been reported in the literature. The objective of this study was to develop and examine the psychometric properties of the Brief Negotiation Interview (BNI) Adherence Scale for Opioid Use Disorders (BAS-O). In the context of a randomized controlled trial evaluating the efficacy of 3 models of emergency department care for opioid use disorders, the authors developed and subsequently examined the psychometric properties of the BAS-O, a 38-item scale that required raters to answer whether or not ("Yes" or "No") each of the critical actions of the BNI was correctly implemented by the research interventionist. BAS-O items pertained to the BNI's 4 steps: (1) Raise the Subject, (2) Provide Feedback, (3) Enhance Motivation, and (4) Negotiate and Advise. A total of 215 audio-recorded BNI and 88 control encounters were rated by 3 trained raters who were independent of the study team and blind to study hypotheses, treatment, and assignment. The results indicated the BAS-O has fair to excellent psychometric properties, in terms of good internal consistency, excellent interrater reliability, discriminant validity, and construct validity, and fair predictive validity. A 13-item, 2-factor solution accounted for nearly 80% of the variance, where factor 1 addressed "Autonomy and Planning" (7 items) and factor 2 addressed "Motivation and Problems" (6 items). However, predictive validity was found for only one of the BAS-O factor items (i.e., Telling patients that treatment will address a range of issues related to their opioid use disorder). 
This study suggests that the BAS-O is a psychometrically valid measure of adherence to the specialized BNI for motivating treatment engagement in patients with opioid use disorders, thus providing a brief (13-item), objective method of evaluating BNI skill performance.
The Tenacious Nature of Memory Binding for Arousing Negative Items
Novak, Deanna L.; Mather, Mara
2009-01-01
In two experiments, we investigated whether people are better or worse at updating memory for the location of emotional pictures than neutral pictures. We measured participants' memories for the locations of both arousing negative pictures and neutral pictures while manipulating practice (encountering the same event repeatedly) and interference (encountering the same picture in a different location). Memory for the context of emotional items was less likely to be corrected when erroneous and less likely to be correctly updated when the context changed. These results suggest that initial item-context binding is more tenacious for emotional items than for neutral items, even when such binding is incorrect. PMID:19744934
Navabi, Nader; Hashemipour, Maryam A; Roughani, Aida
2017-02-01
Oral cancer is a global health problem; however, many dentists lack the necessary skills, knowledge and capacity to diagnose oral cancers early. This study aimed to examine the validity and reliability of a Persian short-form version of a standardised questionnaire to assess dentists' knowledge, practice and attitudes towards oral cancer. This cross-sectional analytical study was carried out in May 2015 in Tehran, Iran. An original 39-item English-language questionnaire developed by Yellowitz et al. was translated into Persian using forward and backward translation methods. A total of 15 dental professionals were asked to assess the questionnaire for content validity. Based on their feedback, a 20-item short-form version was prepared, including six demographic, six knowledge, four attitude and four practice items. The translated short-form questionnaire was subsequently distributed to 973 general dental practitioners attending a dental conference in Tehran. Internal consistency and reliability were assessed with Cronbach's alpha coefficient and item-total correlation calculations. A total of 13 professionals and 313 general dentists participated in the study (response rates: 86.7% and 32.2%, respectively). After the elimination of six items (two knowledge, two attitude and two practice items), the validity and reliability of the questionnaire were confirmed. The final Persian 14-item version of the questionnaire had acceptable validity and internal consistency. These results indicate that researchers can use this translated short-form version to evaluate oral cancer knowledge, attitudes and practices among Persian-speaking dentists; this will allow for a comparison of data between different populations.
Wieland, Mark L; Nelson, Jonathan; Palmer, Tiffany; O'Hara, Connie; Weis, Jennifer A; Nigon, Julie A; Sia, Irene G
2013-01-01
Tuberculosis disproportionately affects immigrants and refugees to the United States. Upon arrival to the United States, many of these individuals attend adult education centers, but little is known about how to deliver tuberculosis health information at these venues. Therefore, the authors used a participatory approach to design and evaluate a tuberculosis education video in this setting. The authors used focus group data to inform the content of the video that was produced and delivered by adult learners and their teachers. The video was evaluated by learners for acceptability through 3 items with a 3-point Likert scale. Knowledge (4 items) and self-efficacy (2 items) about tuberculosis were evaluated before and after viewing the video. A total of 159 learners (94%) rated the video as highly acceptable. Knowledge about tuberculosis improved after viewing the video (56% correct vs. 82% correct; p <.001), as did tuberculosis-related self-efficacy (77% vs. 90%; p <.001). Adult education centers that serve large immigrant and refugee populations may be excellent venues for health education, and a video may be an effective tool to educate these populations. Furthermore, a participatory approach in designing health education materials may enhance the efficacy of these tools.
Wieland, Mark L.; Nelson, Jonathan; Palmer, Tiffany; O’Hara, Connie; Weis, Jennifer A.; Nigron, Julie A.; Sia, Irene G.
2012-01-01
Tuberculosis (TB) disproportionately affects immigrants and refugees to the United States. Upon arrival to the US, many of these individuals attend adult education centers, but little is known about how to deliver TB health information at these venues. Therefore, a participatory approach was used to design and evaluate a tuberculosis education video in this setting. Focus group data were used to inform the content of the video that was produced and delivered by adult learners and their teachers. The video was evaluated by learners for acceptability through 3 items with a 3-point Likert scale. Knowledge (4 items) and self-efficacy (2 items) about TB were evaluated before and after viewing the video. A total of 159 learners (94%) rated the video as highly acceptable. Knowledge about TB improved after viewing the video (56% correct vs. 82% correct; p<0.001), as did TB-related self-efficacy (77% vs. 90%; p<0.001). Adult education centers that serve large immigrant and refugee populations may be excellent venues for health education, and a video may be an effective tool to educate these populations. Furthermore, a participatory approach in designing health education materials may enhance the efficacy of these tools. PMID:23237382
Mauss, Daniel; Herr, Raphael M; Theorell, Töres; Angerer, Peter; Li, Jian
2018-01-01
The Demand Control Support Questionnaire (DCSQ) is an established self-reported tool to measure a stressful work environment. Validated German and English versions are however currently missing. The aim of this study was therefore to evaluate the psychometric properties of German and English versions of the DCSQ among white-collar employees in Switzerland and the US. This cross-sectional study was carried out on 499 employees in Switzerland and 411 in the US, respectively. The 17-item DCSQ with three scales assessed psychosocial stress at work (psychological demands, decision latitude, and social support at work). Depressive symptoms were measured by the 2-item Patient Health Questionnaire. Cronbach's α and item-total correlations tested the scale reliability (internal consistency). Construct validity of the questionnaire was examined using exploratory factor analysis (EFA). Logistic regressions estimated associations of each scale and job strain with depressive symptoms (criterion validity). In both samples, all DCSQ scales presented satisfactory internal consistency (Cronbach's α ≥ 0.72; item-total correlations ≥ 0.33), and EFA showed the 17 items loading on three factors, which is in line with the theoretically assumed structure of the DCSQ construct. Moreover, all three scales as well as high job strain were significantly associated with depressive symptoms. The associations were stronger in the US sample. The German and the English versions of the DCSQ seem to be reliable and valid instruments to measure psychosocial stress based on the job demand-control-support model in the workplace of white-collar employees in Switzerland and the US.
Towards operationalising internal distractibility (Mind Wandering) in adults with ADHD.
Biederman, Joseph; Fitzgerald, Maura; Uchida, Mai; Spencer, Thomas J; Fried, Ronna; Wicks, Jennifer; Saunders, Alexandra; Faraone, Stephen V
2017-12-01
To investigate whether specific symptoms of attention deficit hyperactivity disorder (ADHD) can help identify ADHD patients with mind wandering. Subjects were adults ages 18-55 of both sexes (n=41) who completed the Mind-Wandering Questionnaire (MWQ) and the ADHD module of the Schedule for Affective Disorders and Schizophrenia for School-Age Children Epidemiologic Version. We used Spearman's rank correlation and Pearson's χ2 analyses to examine associations between the ADHD module and the MWQ and receiver operating characteristic (ROC) analyses to evaluate the diagnostic efficiency of the ADHD module. Out of the three ADHD domains, the inattentive ADHD scores had the strongest association with the MWQ (total: r_s=0.34, df=39, p=0.03; inattentive: r_s=0.38, df=39, p=0.02; hyperactive: r_s=0.17, df=39, p=0.28). Correlation analyses between individual items on the ADHD module and the MWQ showed that two inattention items ('failure to pay attention to detail' and 'trouble following instructions') were positively associated with total scores on the MWQ (p=0.02). These two inattention items had the strongest association with the MWQ (r_s=0.45, df=38, p=0.004). ROC analyses showed that the combined score of the two significant inattention items had the highest efficiency (AUC=0.71) in classifying high-level mind wanderers as defined by scores greater than the median split on the MWQ. The combined score of the two inattention items best identified high-level mind wanderers. Results suggest a way to operationalise mind wandering using the symptoms of ADHD.
Psychometric Testing of the Self-Efficacy for Interdisciplinary Plans of Care Scale.
Molle, Elizabeth; Froman, Robin
2017-01-01
Computerized interdisciplinary plans of care have revitalized nurse-centric care plans into dynamic and meaningful electronic documents. To maximize the benefits of these documents, it is important to understand healthcare professionals' attitudes, specifically their confidence, for making computerized interdisciplinary care plans useful and meaningful documents. The purpose of the study was to test the psychometric properties of the Self-Efficacy for Interdisciplinary Plans of Care instrument intended to measure healthcare professionals' self-efficacy for using such documents. Content validity was assessed by an expert review panel. Content validity indices ranged from 0.75 to 1.00, with a scale CVI of 0.94. A sample of 389 healthcare providers completed the 14-item instrument. Principal axis factoring was used to assess factor structure. The exploratory factor analysis yielded a single-factor structure accounting for 71.76% of covariance. Cronbach internal consistency coefficient for the single factor solution was .97. The corrected item-total correlations ranged from 0.71 to 0.90. The coefficient of stability, during a 2-week period, with a subset of the sample (n = 38), was estimated at 0.82. The results of this study suggest that the Self-Efficacy for Interdisciplinary Plans of Care has sturdy reliability and validity for measuring the self-efficacy of healthcare providers to make computerized interdisciplinary plans of care meaningful and useful documents.
Statistical power as a function of Cronbach alpha of instrument questionnaire items.
Heo, Moonseong; Kim, Namhee; Faith, Myles S
2015-10-14
In countless clinical trials, outcome measurements rely on instrument questionnaire items, which often suffer from measurement error that in turn affects the statistical power of study designs. The Cronbach alpha or coefficient alpha, here denoted by C(α), can be used as a measure of internal consistency of parallel instrument items that are developed to measure a target unidimensional outcome construct. The scale score for the target construct is often represented by the sum of the item scores. However, power functions based on C(α) have been lacking for various study designs. We formulate a statistical model for parallel items to derive power functions as a function of C(α) under several study designs. To this end, we assume a fixed true score variance, as opposed to the usual fixed total variance. That assumption is critical and practically relevant to show that measurement error is inversely associated with inter-item correlation, and thus that greater C(α) is associated with greater statistical power. We compare the derived theoretical statistical power with empirical power obtained through Monte Carlo simulations for the following comparisons: one-sample comparison of pre- and post-treatment mean differences, two-sample comparison of pre-post mean differences between groups, and two-sample comparison of mean differences between groups. It is shown that C(α) is the same as a test-retest correlation of the scale scores of parallel items, which enables testing the significance of C(α). Closed-form power functions and sample size determination formulas are derived in terms of C(α) for all of the aforementioned comparisons. The power functions are shown to be increasing functions of C(α), regardless of the comparison of interest. The derived power functions are well validated by simulation studies that show that the magnitudes of theoretical power are virtually identical to those of the empirical power.
Regardless of research designs or settings, in order to increase statistical power, development and use of instruments with greater C(α), or equivalently with greater inter-item correlations, is crucial for trials that intend to use questionnaire items for measuring research outcomes. Further development of the power functions for binary or ordinal item scores and under more general item correlation structures reflecting more real-world situations would be a valuable future study.
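The link between C(α) and inter-item correlation described in this abstract can be illustrated with a minimal sketch. It is not the authors' code and does not reproduce their power functions. It assumes the textbook definition of coefficient alpha and, for parallel items with a common inter-item correlation rho, the standard Spearman-Brown form alpha = k·rho / (1 + (k − 1)·rho). The function names and the sample data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Coefficient alpha for a list of respondents, each a list of k item scores.

    Uses the textbook formula
        alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score)).
    """
    k = len(scores[0])
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def alpha_from_correlation(k, rho):
    """Alpha implied by k parallel items with common inter-item correlation rho
    (Spearman-Brown form; an assumption of this sketch, not taken from the abstract)."""
    return k * rho / (1 + (k - 1) * rho)


if __name__ == "__main__":
    # Hypothetical responses: 5 respondents, 3 items each.
    data = [[3, 4, 3], [2, 2, 3], [5, 4, 5], [1, 2, 1], [4, 5, 4]]
    print(round(cronbach_alpha(data), 3))
    # Alpha rises with the inter-item correlation for a fixed number of items.
    for rho in (0.2, 0.4, 0.6):
        print(rho, round(alpha_from_correlation(10, rho), 3))
```

Under these assumptions alpha increases with rho for a fixed number of items, which is consistent with the abstract's statement that greater C(α) is equivalent to greater inter-item correlation.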
Calculations with the quasirelativistic local-spin-density-functional theory for high-Z atoms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Guo, Y.; Whitehead, M.A.
1988-10-01
The generalized-exchange local-spin-density-functional theory (LSD-GX) with relativistic corrections of the mass velocity and Darwin terms has been used to calculate statistical total energies for the neutral atoms, the positive ions, and the negative ions for high-Z elements. The effect of the correlation and relaxation correction on the statistical total energy is discussed. Comparing the calculated results for the ionization potentials and electron affinities for the atoms (atomic number Z from 37 to 56 and 72 to 80) with experiment shows that for the atoms rubidium to barium both the LSD-GX and the quasirelativistic LSD-GX, with self-interaction correction, Gopinathan, Whitehead, and Bogdanovic's Fermi-hole parameters (Phys. Rev. A 14, 1 (1976)), and Vosko, Wilk, and Nusair's correlation correction (Can. J. Phys. 58, 1200 (1980)), are very good methods for calculating ionization potentials and electron affinities. For the atoms hafnium to mercury the relativistic effect has to be considered.
Koriat, Asher
2018-05-01
Can we tell whether our beliefs and judgments are correct or wrong? Results across many domains indicate that people are skilled at discriminating between correct and wrong answers, endorsing the former with greater confidence than the latter. However, it has not been realized that because of people's adaptation to reality, representative samples of items tend to favor the correct answer, yielding object-level accuracy (OLA) that is considerably better than chance. Across 16 experiments that used 2-alternative forced-choice items from several domains, the confidence/accuracy (C/A) relationship was positive for items with OLA >50%, but consistently negative across items with OLA <50%. A systematic sampling of items that covered the full range of OLA (0-100%) yielded a U-function relating confidence to OLA. The results imply that the positive C/A relationship that has been reported in many studies is an artifact of OLA being better than chance rather than representing a general ability to discriminate between correct and wrong responses. However, the results also support the ecological approach, suggesting that confidence is based on a frugal, "bounded" heuristic that has been specifically tailored to the ecological structure of the natural environment. This heuristic is used despite the fact that for items with OLA <50%, it yields confidence judgments that are counterdiagnostic of accuracy. Our ability to tell between correct and wrong judgments is confined to the probability structure of the world we live in. The results were discussed in terms of the contrast between systematic design and representative design. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
[Preliminary study on civil capacity rating scale for mental disabled patients].
Zhang, Qin-Ting; Pang, Yan-Xia; Cai, Wei-Xiong; Tang, Tao; Huang, Fu-Yin
2010-10-01
To create a civil capacity rating scale for mentally disabled patients and explore its feasibility in forensic psychiatric assessment. The civil capacity-related items were determined after discussion and consultation. The civil capacity rating scale for mentally disabled patients was established and the manual was created according to the logical sequence of the assessment. The rating scale was used during civil capacity assessment in four institutes. There were 14 items in the civil capacity rating scale for mentally disabled patients. Two hundred and two subjects were recruited and divided into three groups according to the experts' opinion on their civil capacities: full civil capacity, partial civil capacity and no civil capacity. The mean scores of the three groups were 2.32 ± 2.45, 11.62 ± 4.01 and 25.02 ± 3.90, respectively, and there were statistically significant differences among the groups. The Cronbach alpha of the rating scale was 0.9724, and in the split-half reliability test, the two halves of the rating scale were highly correlated (r = 0.9729, P = 0.000). The Spearman correlation coefficients between each item and the total score of the rating scale ranged from 0.643 to 0.882 (P = 0.000). There was good agreement between the conclusion according to the rating scale and the experts' opinion (kappa = 0.841, P = 0.000). In the discriminant analysis, 7 items were included in the discriminant equation, and 92.6% of subjects were classified into the correct groups using the equation. The civil capacity rating scale for mentally disabled patients has satisfactory reliability and validity. The rating scale can be used as an effective tool to grade civil capacity in forensic assessment.
Lafave, Mark R; Hiemstra, Laurie; Kerslake, Sarah
2016-08-01
Clinical management of patellofemoral (PF) instability is a challenge, particularly considering the number of variables that should be taken into consideration for treatment. Quality of life is an important measure to consider with this patient population. To factor analyze and reduce the total number of items in the Banff Patella Instability Instrument (BPII). Subsequent to the factor analysis, the new, item-reduced BPII 2.0 was tested for validity, reliability, and responsiveness. Cohort study (diagnosis); Level of evidence, 2. Quality of life was measured for PF instability patients (N = 223) through use of the original BPII at their initial consultation. Data from the BPII scores were used in a principal components analysis (PCA) to factor analyze and reduce the total number of items in the original BPII, to create a revised BPII 2.0. The BPII 2.0 underwent content validation (Cronbach alpha, patient interviews, and grade-level checking), construct validation (analysis of variance comparing the initial visit and the 6-, 12-, and 24-month postoperative visits, eta-square), convergent validation (Pearson r correlation to the original BPII), responsiveness testing (eta-square, anchor-based distribution testing), and reliability testing (intraclass correlation coefficient [ICC]). The BPII was successfully reduced from 32 to 23 items with excellent Cronbach alpha values in the new BPII 2.0: initial visit = 0.91; 6-month postoperative visit = 0.96; 12-month postoperative visit = 0.97; and 24-month postoperative visit = 0.76. Grade-level reading for all items was assessed as below grade 12. The BPII 2.0 was able to discriminate between all time periods with significant differences between groups (P < .05). Eta-square was 0.40, demonstrating a medium to large effect size. 
The BPII significantly correlated with the BPII 2.0 (0.82, 0.90, 0.90, and 0.94 at the initial visit and 6-, 12-, and 24-month postoperative visits, respectively), providing evidence of convergent validity. A significant correlation was found between the 7-point scale and 24-month postoperative BPII 2.0 scores, a sign of anchor-based responsiveness. ICC (2,k) was 0.97, indicating strong reliability. The BPII 2.0 is valid, reliable, and responsive for assessment of patients with PF instability, both surgically and nonsurgically treated. © 2016 The Author(s).
Validation of Single-Item Screening Measures for Provider Burnout in a Rural Health Care Network.
Waddimba, Anthony C; Scribani, Melissa; Nieves, Melinda A; Krupa, Nicole; May, John J; Jenkins, Paul
2016-06-01
We validated three single-item measures for emotional exhaustion (EE) and depersonalization (DP) among rural physician/nonphysician practitioners. We linked cross-sectional survey data (on provider demographics, satisfaction, resilience, and burnout) with administrative information from an integrated health care network (1 academic medical center, 6 community hospitals, 31 clinics, and 19 school-based health centers) in an eight-county underserved area of upstate New York. In total, 308 physicians and advanced-practice clinicians completed a self-administered, multi-instrument questionnaire (65.1% response rate). Significant proportions of respondents reported high EE (36.1%) and DP (9.9%). In multivariable linear mixed models, scores on EE/DP subscales of the Maslach Burnout Inventory were regressed on each single-item measure. The Physician Work-Life Study's single-item measure (classifying 32.8% of respondents as burning out/completely burned out) was correlated with EE and DP (Spearman's ρ = .72 and .41, p < .0001; Kruskal-Wallis χ² = 149.9 and 56.5, p < .0001, respectively). In multivariable models, it predicted high EE (but neither low EE nor low/high DP). EE/DP single items were correlated with parent subscales (Spearman's ρ = .89 and .81, p < .0001; Kruskal-Wallis χ² = 230.98 and 197.84, p < .0001, respectively). In multivariable models, the EE item predicted high/low EE, whereas the DP item predicted only low DP. Therefore, the three single-item measures tested varied in effectiveness as screeners for EE/DP dimensions of burnout. © The Author(s) 2015.
Nagpal, Jitender; Kumar, Arvind; Kakar, Sonia; Bhartia, Abhishek
2010-05-01
To develop a reliable and valid quality of life questionnaire for Indian patients with diabetes. A draft of 75 questions was prepared on the basis of expert opinion, focus group discussions, review of existing literature and detailed semi-structured interviews of patients with diabetes with the intention of including all aspects of diabetes-specific and quality of life considered relevant by patients and care providers to enable construct validity. A Stage 2 questionnaire was then prepared with 13 domains and 54 items (questions) after expert panel review for obvious irrelevance and duplication of issues. It was administered to 150 participants visiting a diabetes center at New Delhi. Factor analysis was done using the principal component method with varimax rotation. Reliability analysis was done by calculating Cronbach's alpha. For evaluating concordant validity the questionnaire was co-administered with DQL-CTQ to 30 participants. The discriminant validity of the questionnaire was tested using the t-test for metabolic control, co-morbidities, insulin use and gender. Using the principal component method, 8 domains were identified on the basis of an a priori hypothesis and the scree plot. These 8 domains explained 49.9% of the total variation. 34 items (questions) were selected to represent these domains on the basis of extraction communality, factor loading, inter-item and item-total correlations. The final questionnaire has an overall Cronbach's alpha value of 0.894 (subscale: 0.55 to 0.85), showing high internal consistency. The questionnaire showed good concordance (product moment correlation 0.724; p = 0.001; subscale correlation: 0.457 to 0.779) with the DQL-CTQ. The overall standardized questionnaire score showed good responsiveness to metabolic control and co-morbidities, establishing discriminant validity. The final version of the questionnaire with 8 domains and 34 items is a reliable and valid tool for assessment of quality of life of Indian patients with diabetes.
Martin, T P C; Moualed, D; Paul, A; Ronan, N; Tysome, J R; Donnelly, N P; Cook, R; Axon, P R
2015-04-01
The Cambridge Otology Quality of Life Questionnaire (COQOL) is a patient-recorded outcome measurement (PROM) designed to quantify the quality of life of patients attending otology clinics. Item-reduction model. A systematically designed long-form version (74 items) was tested with patient focus groups before being presented to adult otology patients (n = 137). Preliminary item analysis tested reliability, reducing the COQOL to 24 questions. This was then presented in conjunction with the SF-36 (V1) questionnaire to a total of 203 patients. Subsequently, these were re-presented at T + 3 months, and patients recorded whether they felt their condition had improved, deteriorated or remained the same. Non-responders were contacted by post. A correlation between COQOL scores and patient perception of change was examined to analyse content validity. Teaching hospital and university psychology department. Adult patients attending otology clinics with a wide range of otological conditions. Item reliability measured by item–total correlation, internal consistency and test–retest reliability. Validity measured by correlation between COQOL scores and patient-reported symptom change. Reliability: the COQOL showed excellent internal consistency at both initial presentation (α = 0.90) and 3 months later (α = 0.93). Validity: One-way analysis of variance showed a significant difference between groups reporting change and those reporting no change in quality of life (F(2, 80) = 5.866, P < 0.01). The COQOL is the first otology-specific PROM. Initial studies demonstrate excellent reliability and encouraging preliminary criterion validity: further studies will allow a deeper validation of the instrument.
The Development of Indonesian Online Game Addiction Questionnaire
Jap, Tjibeng; Tiatri, Sri; Jaya, Edo Sebastian; Suteja, Mekar Sari
2013-01-01
Online gaming is an increasingly popular source of entertainment for all ages, with relatively prevalent negative consequences. Addiction is a problem that has received much attention. This research aims to develop a measure of online game addiction for Indonesian children and adolescents. The Indonesian Online Game Addiction Questionnaire draws from earlier theories and research on the internet and game addiction. Its construction is further enriched by including findings from qualitative interviews and field observation to ensure appropriate expression of the items. The measure consists of 7 items with a 5-point Likert Scale. It is validated by testing 1,477 Indonesian junior and senior high school students from several schools in Manado, Medan, Pontianak, and Yogyakarta. The validation evidence is shown by item-total correlation and criterion validity. The Indonesian Online Game Addiction Questionnaire has good item-total correlation (ranging from 0.29 to 0.55) and acceptable reliability (α = 0.73). It is also moderately correlated with the participant's longest time record to play online games (r = 0.39; p<0.01), average days per week in playing online games (ρ = 0.43; p<0.01), average hours per day in playing online games (ρ = 0.41; p<0.01), and monthly expenditure for online games (ρ = 0.30; p<0.01). Furthermore, we created a clinical cut-off estimate by combining criteria and population norm. The clinical cut-off estimate showed that the score of 14 to 21 may indicate mild online game addiction, and the score of 22 and above may indicate online game addiction. Overall, the results show that the Indonesian Online Game Addiction Questionnaire has sufficient psychometric properties for research use, as well as limited clinical application. PMID:23560113
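The abstract above reports item-total correlations and Cronbach's α without showing how such figures are obtained. The following is a minimal sketch of the usual textbook definitions, assuming complete responses with no missing values. The function names and the toy scores are hypothetical and are not taken from the study.

```python
from statistics import mean, pvariance


def cronbach_alpha(rows):
    """Cronbach's alpha for rows of item scores (one row per respondent)."""
    k = len(rows[0])                      # number of items
    items = list(zip(*rows))              # transpose: one tuple per item
    item_var = sum(pvariance(col) for col in items)
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_var / total_var)


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def corrected_item_total(rows):
    """Correlate each item with the sum of the *other* items."""
    result = []
    for j in range(len(rows[0])):
        item = [r[j] for r in rows]
        rest = [sum(r) - r[j] for r in rows]   # total without item j
        result.append(pearson(item, rest))
    return result


# Hypothetical 5 respondents x 3 items on a 5-point scale (illustration only).
toy_scores = [[1, 2, 2], [2, 2, 3], [3, 4, 3], [4, 4, 5], [5, 5, 5]]
print(cronbach_alpha(toy_scores))
print(corrected_item_total(toy_scores))
```

The "corrected" variant removes the item from the total before correlating, which avoids inflating the coefficient on short scales such as a 7-item measure.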
Development of a Short Version of the New Brief Job Stress Questionnaire
INOUE, Akiomi; KAWAKAMI, Norito; SHIMOMITSU, Teruichi; TSUTSUMI, Akizumi; HARATANI, Takashi; YOSHIKAWA, Toru; SHIMAZU, Akihito; ODAGIRI, Yuko
2014-01-01
This study aimed to investigate the test-retest reliability and validity of a short version of the New Brief Job Stress Questionnaire (New BJSQ) whose scales have one item selected from a standard version. Based on the results from an anonymous web-based questionnaire of occupational health staff and personnel/labor staff, we selected higher-priority scales from the standard version. After selecting the one item with the highest item-total correlation coefficient from each scale, a 23-item questionnaire was developed. A nationally representative survey was administered to Japanese employees (n=1,633) to examine test-retest reliability and validity. Most scales (or items) showed modest but adequate levels of test-retest reliability (r>0.50). Furthermore, job demands and job resources scales (or items) were associated with mental and physical stress reactions while job resources scales (or items) were also associated with positive outcomes. These findings provide evidence that the short version of the New BJSQ is reliable and valid. PMID:24975108
Good, Meadow M; Korbly, Nicole; Kassis, Nadine C; Richardson, Monica L; Book, Nicole M; Yip, Sallis; Saguan, Docile; Gross, Carey; Evans, Janelle; Harvie, Heidi S; Sung, Vivian
2013-11-01
The objective of the study was to describe the basic knowledge about prolapse and attitudes regarding the uterus in women seeking care for prolapse symptoms. This was a cross-sectional study of English-speaking women presenting with prolapse symptoms. Patients completed a self-administered questionnaire that included 5 prolapse-related knowledge items and 6 benefit-of-uterus attitude items; higher scores indicated greater knowledge or more positive perception of the uterus. The data were analyzed using descriptive statistics and multiple linear regression. A total of 213 women were included. The overall mean knowledge score was 2.2 ± 1.1 (range, 0-5); 44% of the items were answered correctly. Participants correctly responded that surgery (79.8%), pessary (55.4%), and pelvic muscle exercises (34.3%) were prolapse treatment options. Prior evaluation by a female pelvic medicine and reconstructive surgery specialist (beta = 0.57, P = .001) and higher education (beta = 0.3, P = .07) was associated with a higher mean knowledge score. For attitude items, the overall mean score was 15.1 (4.7; range, 6-30). A total of 47.4% disagreed with the statement that the uterus is important for sex. The majority disagreed with the statement that the uterus is important for a sense of self (60.1%); that hysterectomy would make me feel less feminine (63.9%); and that hysterectomy would make me feel less whole (66.7%). Previous consultation with a female pelvic medicine and reconstructive surgery specialist was associated with a higher mean benefit of uterus score (beta = 1.82, P = .01). Prolapse-related knowledge is low in women seeking care for prolapse symptoms. The majority do not believe the uterus is important for body image or sexuality and do not believe that hysterectomy will negatively affect their sex lives. Copyright © 2013 Mosby, Inc. All rights reserved.
Cao, Shiqi; Liu, Ning; Han, Wuxiang; Zi, Yunpeng; Peng, Fan; Li, Lexiang; Fu, Qiwei; Chen, Yi; Zheng, Weijie; Qian, Qirong
2017-01-14
The Forgotten Joint Score (FJS) is a newly developed health-related quality of life (HRQoL) questionnaire designed to evaluate the awareness after total knee arthroplasty (TKA). This study cross-culturally adapted and psychometrically validated a simplified Chinese version of the FJS (SC-FJS). Cross-cultural adaptation was performed according to the internationally recognized guidelines. One-hundred and fifty participants who underwent primary TKA were recruited in this study. Cronbach's α and intra-class correlations were used to determine reliability. Construct validity was analyzed by evaluating the correlations between SC-FJS and the Knee Injury and Osteoarthritis Outcome Score (KOOS) and the short form (36) health survey (SF-36). Each of the 12 items was properly answered and correlated with the total score. SC-FJS had excellent reliability [Cronbach's α = 0.907, intra-class correlation coefficient (ICC) = 0.970, 95% CI 0.959-0.978]. Elimination of any single item did not reduce Cronbach's α below 0.80. SC-FJS had a high correlation with symptoms (0.67, p < 0.001) and pain (0.60, p < 0.001) domains of KOOS and social functioning (0.66, p < 0.001) domain of SF-36, and it also moderately correlated with function in daily living (0.53, p < 0.001) and function in sport and recreation (0.40, p < 0.001) domains of KOOS, and physical subscale of SF-36 (0.49-0.53, p < 0.001) but had a low (r = 0.20) or not significant (p > 0.05) correlation with mental subscale of SF-36. SC-FJS demonstrated excellent acceptability, internal consistency, reliability, and construct validity, which can be recommended for patients who underwent joint arthroplasty in Mainland China.
Böttcher, B; Fessler, S; Friedl, F; Toth, B; Walter, M H; Wildt, L; Riedl, D
2018-04-01
Patients with polycystic ovary syndrome (PCOS) report a decreased health-related quality of life (HRQOL) and higher levels of psychological distress. Validated questionnaires are necessary to assess the impact of PCOS on patients' lives. The aim of the present study was to evaluate the German "Polycystic Ovary Syndrome Questionnaire" (PCOSQ-G). The psychometric properties of the PCOSQ-G were investigated in PCOS patients with item-total correlation, internal consistency and test-retest reliability. Correlations with the Short-Form-36 Health Survey (SF-36) and the Hospital Anxiety and Depression Scale (HADS-D) were calculated to evaluate the validity of the PCOSQ-G. Discriminatory validity was investigated through a receiver operating characteristic curve and independent sample t tests compared with healthy controls. Good psychometric properties were found for most items. Acceptable to high internal consistency was found for the total score (α = 0.94-0.95) and all subscales (α = 0.70-0.97). High test-retest reliability was found for the total score (0.86) and all subscales (0.81-0.90). The validity analyses showed that the PCOSQ-G total score was positively correlated with both SF-36 summary scales and was negatively correlated with both HADS subscales. Patients reported significantly lower values for the PCOSQ-G total score (p < 0.001) and all subscales, and the PCOSQ-G discriminated well between patients and healthy controls (AUC = 0.81, p < 0.001). PCOSQ-G is a reliable and valid tool to assess the HRQOL in patients with PCOS and can be used in future clinical research. Patients with PCOS exhibited an impaired HRQOL, which indicates the need for psychosomatic counseling.
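An area under the ROC curve such as the AUC = 0.81 reported above can be read as the probability that a randomly chosen member of one group scores higher than a randomly chosen member of the other. The sketch below computes that quantity directly from two lists of scores. The function name is hypothetical and the example scores are invented. It does not reproduce the study's analysis, and for a scale on which patients score lower than controls the groups would simply be passed in the opposite order.

```python
def auc(higher_group, lower_group):
    """Probability that a score from higher_group exceeds one from lower_group.

    Ties count as half a win. This pairwise definition is equivalent to the
    area under the empirical ROC curve.
    """
    wins = 0.0
    for a in higher_group:
        for b in lower_group:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(higher_group) * len(lower_group))


# Invented scores for illustration only.
controls = [5.8, 6.1, 4.9, 6.5, 5.2]
patients = [4.1, 5.0, 3.7, 5.9, 4.4]
print(auc(controls, patients))
```

A value of 0.5 means the score separates the groups no better than chance, and 1.0 means perfect separation.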
Factors Associated with Patient Press Ganey Satisfaction Scores for Ophthalmology Patients.
Long, Chao; Tsay, Ellen L; Jacobo, Samuel A; Popat, Rita; Singh, Kuldev; Chang, Robert T
2016-02-01
To determine which metrics from the Press Ganey patient satisfaction survey best correlate with "likelihood to recommend" among patients in an academic tertiary medical center practice setting. Cross-sectional study. Over a 3-month period, patients presenting to an academic practice who agreed to participate were enrolled in the study if they met the following entry criteria: (1) age ≥18 years, (2) ability to read and speak English, and (3) followed in this practice between 4 months and 4 years. A total of 196 patients were recruited. A 26-item abridged version of the Press Ganey survey typically distributed to patients via mail or e-mail after visiting the Stanford University Hospital was administered privately to each eligible patient of 2 different attending clinics at the conclusion of his or her visit. The 26 survey items were not modified for the purposes of the study and were administered such that participants could not be individually identified. The arithmetic mean score for the item "Likelihood of your recommending our practice to others" was calculated by assigning a value (0-100) to the Likert value associated with survey responses and correlated with the 25 other items using the differences in the mean scores. Response to survey items graded on a 1 to 5 standard Likert scale. The weighted mean patient survey score for the "likelihood to recommend" item for the junior faculty member was 95.9% and for the senior faculty member was 94.5%, respectively. For the remaining 25 items, "Amount of time the care provider spent with you" (Diff[1-2]=1.03; P < 0.0001) and "Ease of scheduling your appointment" (Diff[1-2]=0.99; P < 0.0001) best correlated with likelihood to recommend. In contrast, "Friendliness/courtesy of the care provider" (Diff[1-2]=0.29; P = 0.0045) correlated least with likelihood to recommend. Stratification based on provider did not affect the study results. 
The perception of time spent with the practitioner and ease of appointment scheduling are the 2 variables that best correlate with patients recommending their ophthalmologists to other prospective patients. Copyright © 2016 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.
The Montgomery Åsberg and the Hamilton Ratings of Depression
Carmody, Thomas; Rush, A. John; Bernstein, Ira; Warden, Diane; Brannan, Stephen; Burnham, Daniel; Woo, Ada; Trivedi, Madhukar
2007-01-01
The 17-item Hamilton Rating Scale for Depression (HRSD17) and the Montgomery Åsberg Depression Rating Scale (MADRS) are two widely used clinician-rated symptom scales. A 6-item version of the HRSD (HRSD6) was created by Bech to address the psychometric limitations of the HRSD17. The psychometric properties of these measures were compared using classical test theory (CTT) and item response theory (IRT) methods. IRT methods were used to equate total scores on any two scales. Data from two distinctly different outpatient studies of nonpsychotic major depression: a 12-month study of highly treatment-resistant patients (n=233) and an 8-week acute phase drug treatment trial (n=985) were used for robustness of results. MADRS and HRSD6 items generally contributed more to the measurement of depression than HRSD17 items as shown by higher item-total correlations and higher IRT slope parameters. The MADRS and HRSD6 were unifactorial while the HRSD17 contained 2 factors. The MADRS showed about twice the precision in estimating depression as either the HRSD17 or HRSD6 for average severity of depression. An HRSD17 of 7 corresponded to an 8 or 9 on the MADRS and 4 on the HRSD6. The MADRS would be superior to the HRSD17 in the conduct of clinical trials. PMID:16769204
Čatipović, Marija; Marković, Martina; Grgurić, Josip
2018-04-27
Validating a questionnaire/instrument before proceeding to the field for data collection is important. An 18-item breastfeeding intention, 39-item attitude and 44-item knowledge questionnaire was validated in a Croatian sample of secondary-school students ( N = 277). For the intentions, principal component analysis (PCA) yielded a four-factor solution with 8 items explaining 68.3% of the total variance. Cronbach’s alpha (0.71) indicated satisfactory internal consistency. For the attitudes, PCA showed a seven-factor structure with 33 items explaining 58.41% of total variance. Cronbach’s alpha (0.87) indicated good internal consistency. There were 13 knowledge questions that were retained after item analysis, showing good internal consistency (KR20 = 0.83). In terms of criterion validity, the questionnaire differentiated between students who received breastfeeding education compared to students who were not educated in breastfeeding. Correlations between intentions and attitudes (r = 0.49), intentions and knowledge (r = 0.29), and attitudes and knowledge (r = 0.38) confirmed concurrent validity. The final instrument is reliable and valid for data collection on breastfeeding. Therefore, the instrument is recommended for evaluation of breastfeeding education programs aimed at upper-grade elementary and secondary school students.
Marković, Martina; Grgurić, Josip
2018-01-01
Background: Validating a questionnaire/instrument before proceeding to the field for data collection is important. Methods: An 18-item breastfeeding intention, 39-item attitude and 44-item knowledge questionnaire was validated in a Croatian sample of secondary-school students (N = 277). Results: For the intentions, principal component analysis (PCA) yielded a four-factor solution with 8 items explaining 68.3% of the total variance. Cronbach’s alpha (0.71) indicated satisfactory internal consistency. For the attitudes, PCA showed a seven-factor structure with 33 items explaining 58.41% of total variance. Cronbach’s alpha (0.87) indicated good internal consistency. There were 13 knowledge questions that were retained after item analysis, showing good internal consistency (KR20 = 0.83). In terms of criterion validity, the questionnaire differentiated between students who received breastfeeding education compared to students who were not educated in breastfeeding. Correlations between intentions and attitudes (r = 0.49), intentions and knowledge (r = 0.29), and attitudes and knowledge (r = 0.38) confirmed concurrent validity. Conclusions: The final instrument is reliable and valid for data collection on breastfeeding. Therefore, the instrument is recommended for evaluation of breastfeeding education programs aimed at upper-grade elementary and secondary school students. PMID:29702616
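The KR20 coefficient quoted above for the knowledge questions is the form of internal consistency used for items scored right or wrong. A minimal sketch of the standard Kuder-Richardson formula 20 follows, assuming every item is coded 1 (correct) or 0 (incorrect) and that no responses are missing. The function name and the toy answers are hypothetical.

```python
from statistics import pvariance


def kr20(rows):
    """Kuder-Richardson 20 for rows of 0/1 item scores (one row per person)."""
    k = len(rows[0])                       # number of items
    n = len(rows)                          # number of respondents
    pq_sum = 0.0
    for j in range(k):
        p = sum(r[j] for r in rows) / n    # proportion answering item j correctly
        pq_sum += p * (1 - p)              # variance of a 0/1 item
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - pq_sum / total_var)


# Hypothetical answers of 4 students to 3 knowledge questions.
toy_answers = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(kr20(toy_answers))
```

For 0/1 items the term p(1 - p) is the item variance, so KR-20 gives the same value as Cronbach's α computed on the same data.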
Sharp, J L; Gough, K; Pascoe, M C; Drosdowsky, A; Chang, V T; Schofield, P
2018-07-01
The Memorial Symptom Assessment Scale Short Form (MSAS-SF) is a widely used symptom assessment instrument. Patients who self-complete the MSAS-SF have difficulty following the two-part response format, resulting in incorrectly completed responses. We describe modifications to the response format to improve useability, and rational scoring rules for incorrectly completed items. The modified MSAS-SF was completed by 311 women in our Peer and Nurse support Trial to Assist women in Gynaecological Oncology; the PeNTAGOn study. Descriptive statistics were used to summarise completion of the modified MSAS-SF, and provide symptom statistics before and after applying the rational scoring rules. Spearman's correlations with the Functional Assessment for Cancer Therapy-General (FACT-G) and Hospital Anxiety and Depression Scale (HADS) were assessed. Correct completion of the modified MSAS-SF items ranged from 91.5 to 98.7%. The rational scoring rules increased the percentage of useable responses on average 4% across all symptoms. MSAS-SF item statistics were similar with and without the scoring rules. The pattern of correlations with FACT-G and HADS was compatible with prior research. The modified MSAS-SF was useable for self-completion and responses demonstrated validity. The rational scoring rules can minimise loss of data from incorrectly completed responses. Further investigation is recommended.
ERIC Educational Resources Information Center
Attali, Yigal; Powers, Don; Hawthorn, John
2008-01-01
Registered examinees for the GRE® General Test answered open-ended sentence-completion items. For half of the items, participants received immediate feedback on the correctness of their answers and up to two opportunities to revise their answers. A significant feedback-and-revision effect was found. Participants were able to correct many of their…
Parietal cortex and episodic memory retrieval in schizophrenia.
Lepage, Martin; Pelletier, Marc; Achim, Amélie; Montoya, Alonso; Menear, Matthew; Lal, Sam
2010-06-30
People with schizophrenia consistently show memory impairment on varying tasks including item recognition memory. Relative to the correct rejection of distracter items, the correct recognition of studied items consistently produces an effect termed the old/new effect that is characterized by increased activity in parietal and frontal cortical regions. This effect has received only scant attention in schizophrenia. We examined the old/new effect in 15 people with schizophrenia and 18 controls during an item recognition test, and neural activity was examined with event-related functional magnetic resonance imaging. Both groups performed equally well during the recognition test and showed increased activity in a left dorsolateral prefrontal region and in the precuneus bilaterally during the successful recognition of old items relative to the correct rejection of new items. The control group also exhibited increased activity in the dorsal left parietal cortex. This region has been implicated in the top-down modulation of memory which involves control processes that support memory-retrieval search, monitoring and verification. Although these processes may not be of paramount importance in item recognition memory performance, the present findings suggest that people with schizophrenia may have difficulty with such top-down modulation, a finding consistent with many other studies in information processing.
Bazzo, Stefania; Battistella, Giuseppe; Riscica, Patrizia; Moino, Giuliana; Dal Pozzo, Giuseppe; Bottarel, Mery; Geromel, Mariasole; Czerwinsky, Loredana
2015-01-01
Alcohol consumption during pregnancy can result in a range of harmful effects on the developing foetus and newborn, called Fetal Alcohol Spectrum Disorders (FASD). Identifying pregnant women who use alcohol makes it possible to provide information, support and treatment for the women and surveillance of their children. The AUDIT-C (the shortened consumption version of the Alcohol Use Disorders Identification Test) is used for investigating risky drinking with different populations, and has been applied to estimate alcohol use and risky drinking also in antenatal clinics. The aim of the study was to investigate the reliability of a self-report Italian version of the AUDIT-C questionnaire to detect alcohol consumption during pregnancy, regardless of its use as a screening tool. The questionnaire was filled in by two independent consecutive series of pregnant women at the 38th gestation week visit in the two birth locations of the Local Health Authority of Treviso (Italy), during the years 2010 and 2011 (n=220 and n=239). Reliability analysis was performed using internal consistency, item-total score correlations, and inter-item correlations. The "discriminatory power" of the test was also evaluated. Results. Overall, about one third of women recalled alcohol consumption at least once during the current pregnancy. The questionnaire had an internal consistency of 0.565 for the group of the year 2010, of 0.516 for the year 2011, and of 0.542 for the overall group. The highest item-total correlation coefficient was 0.687 and the highest inter-item correlation coefficient was 0.675. As for the discriminatory power of the questionnaire, the highest Ferguson's delta coefficient was 0.623. These findings suggest that the Italian self-report version of the AUDIT-C possesses unsatisfactory reliability to estimate alcohol consumption during pregnancy when used as a self-report questionnaire in an obstetric setting.
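Ferguson's delta, the "discriminatory power" statistic mentioned above, measures how evenly respondents are spread across the possible total scores. The sketch below uses the general form written in terms of the number of distinct possible totals. The function name and the example scores are hypothetical, and the abstract does not state which variant of the formula the authors applied.

```python
from collections import Counter


def ferguson_delta(total_scores, n_possible_scores):
    """Ferguson's delta: 0 if everyone has the same total, 1 if totals are uniform.

    n_possible_scores is the count of distinct totals the scale can produce,
    e.g. 13 for a 3-item scale scored 0 to 4 per item (totals 0 to 12).
    """
    n = len(total_scores)
    sum_f2 = sum(f * f for f in Counter(total_scores).values())
    s = n_possible_scores
    return s * (n * n - sum_f2) / ((s - 1) * n * n)


# Invented totals for 10 respondents on a 0-12 scale (illustration only).
toy_totals = [0, 0, 0, 0, 1, 1, 2, 0, 3, 1]
print(ferguson_delta(toy_totals, 13))
```

When most respondents cluster at one score, as can happen when few people in a sample report any drinking, the statistic falls well below 1.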
Validity and reliability of the Persian version of mobile phone addiction scale
Mazaheri, Maryam Amidi; Karbasi, Mojtaba
2014-01-01
Background: Given the large number of mobile phone users, especially among college students in Iran, addiction to mobile phones is attracting increasing concern. There is an urgent need for a reliable and valid instrument to measure this phenomenon. This study examines validity and reliability of the Persian version of mobile phone addiction scale (MPAIS) in college students. Materials and Methods: This methodological study was done at Isfahan University of Medical Sciences. One thousand one hundred and eighty students were selected by convenience sampling. The English version of the MPAI questionnaire was translated into Persian with the approach of Jones et al. (Challenges in language, culture, and modality: Translating English measures into American Sign Language. Nurs Res 2006; 55: 75-81). Its reliability was tested by Cronbach's alpha and its dimensionality validity was evaluated using Pearson correlation coefficients with other measures of mobile phone use and IAT. Construct validity was evaluated using Exploratory subscale analysis. Results: Cronbach's alpha of 0.86 was obtained for total PMPAS, for subscale 1 (eight items) was 0.84, for subscale 2 (five items) was 0.81 and for subscale 3 (two items) was 0.77. There were significantly positive correlations between the score of PMPAS and IAT (r = 0.453, P < 0.001) and other measures of mobile phone use. Principal component subscale analysis yielded a three-subscale structure including: inability to control craving; feeling anxious and lost; mood improvement accounted for 60.57% of total variance. The results of discriminant validity showed that all items' correlations with their related subscale were greater than 0.5 and correlations with unrelated subscales were less than 0.5. Conclusion: Considering the lack of a valid and reliable questionnaire for measuring addiction to the mobile phone, PMPAS could be a suitable instrument for measuring mobile phone addiction in future research. PMID:24778668
Kirwan, John; Bode, Christina; Cramp, Fiona; Carmona, Loreto; Dures, Emma; Englbrecht, Matthias; Fransen, Jaap; Greenwood, Rosemary; Hagel, Sofia; van de Laar, Maart; Molto, Anna; Nicklin, Joanna; Petersson, Ingemar F; Redondo, Marta; Schett, Georg; Gossec, Laure
2018-01-01
Objective: To evaluate the Bristol Rheumatoid Arthritis Fatigue Multidimensional Questionnaire (BRAF-MDQ), the revised Bristol Rheumatoid Arthritis Numerical Rating Scales (BRAF-NRS V2) and the Rheumatoid Arthritis Impact of Disease (RAID) scale in six countries. Methods: We surveyed RA patients in France, Germany, The Netherlands, Spain, Sweden and the UK, including the HAQ, 36-item Short Form Health Survey (SF-36) and potential revisions of the BRAF-NRS coping and Spanish RAID coping items. Factor structure and internal consistency were examined by factor analysis and Cronbach’s α and construct validity by Spearman’s correlation. Results: A total of 1276 patients participated (76% female, 25% with a disease duration <5 years, median HAQ 1.0). The original BRAF-MDQ four-factor structure and RAID single-factor structure were confirmed in every country with ⩾66% of variation in items explained by each factor and all item factor loadings of 0.71–0.98. Internal consistency for the BRAF-MDQ total and subscales was a Cronbach’s α of 0.75–0.96 and for RAID, 0.93–0.96. Fatigue construct validity was shown for the BRAF-MDQ and BRAF-NRS severity and effect scales, correlated internally with SF-36 vitality and with RAID fatigue (r = 0.63–0.93). Broader construct validity for the BRAFs and RAID was shown by correlation with each other, HAQ and SF-36 domains (r = 0.46–0.82), with similar patterns in individual countries. The revised BRAF-NRS V2 Coping item had stronger validity than the original in all analyses. The revised Spanish RAID coping item performed as well as the original. Conclusion: Across six European countries, the BRAF-MDQ identifies the same four aspects of fatigue, and along with the RAID, shows strong factor structure and internal consistency and moderate–good construct validity. The revised BRAF-NRS V2 shows improved construct validity and replaces the original. PMID:29087507
Hewlett, Sarah; Kirwan, John; Bode, Christina; Cramp, Fiona; Carmona, Loreto; Dures, Emma; Englbrecht, Matthias; Fransen, Jaap; Greenwood, Rosemary; Hagel, Sofia; van de Laar, Maart; Molto, Anna; Nicklin, Joanna; Petersson, Ingemar F; Redondo, Marta; Schett, Georg; Gossec, Laure
2018-02-01
To evaluate the Bristol Rheumatoid Arthritis Fatigue Multidimensional Questionnaire (BRAF-MDQ), the revised Bristol Rheumatoid Arthritis Numerical Rating Scales (BRAF-NRS V2) and the Rheumatoid Arthritis Impact of Disease (RAID) scale in six countries. We surveyed RA patients in France, Germany, The Netherlands, Spain, Sweden and the UK, including the HAQ, 36-item Short Form Health Survey (SF-36) and potential revisions of the BRAF-NRS coping and Spanish RAID coping items. Factor structure and internal consistency were examined by factor analysis and Cronbach's α and construct validity by Spearman's correlation. A total of 1276 patients participated (76% female, 25% with a disease duration <5 years, median HAQ 1.0). The original BRAF-MDQ four-factor structure and RAID single-factor structure were confirmed in every country with ⩾66% of variation in items explained by each factor and all item factor loadings of 0.71-0.98. Internal consistency for the BRAF-MDQ total and subscales was a Cronbach's α of 0.75-0.96 and for RAID, 0.93-0.96. Fatigue construct validity was shown for the BRAF-MDQ and BRAF-NRS severity and effect scales, correlated internally with SF-36 vitality and with RAID fatigue (r = 0.63-0.93). Broader construct validity for the BRAFs and RAID was shown by correlation with each other, HAQ and SF-36 domains (r = 0.46-0.82), with similar patterns in individual countries. The revised BRAF-NRS V2 Coping item had stronger validity than the original in all analyses. The revised Spanish RAID coping item performed as well as the original. Across six European countries, the BRAF-MDQ identifies the same four aspects of fatigue, and along with the RAID, shows strong factor structure and internal consistency and moderate-good construct validity. The revised BRAF-NRS V2 shows improved construct validity and replaces the original. © The Author 2017. Published by Oxford University Press on behalf of the British Society for Rheumatology.
Hobart, J; Thompson, A
2001-01-01
OBJECTIVES—Routine data collection is now considered mandatory. Therefore, staff rated clinical scales that consist of multiple items should have the minimum number of items necessary for rigorous measurement. This study explores the possibility of developing a short form Barthel index, suitable for use in clinical trials, epidemiological studies, and audit, that satisfies criteria for rigorous measurement and is psychometrically equivalent to the 10 item instrument. METHODS—Data were analysed from 844 consecutive admissions to a neurological rehabilitation unit in London. Random half samples were generated. Short forms were developed in one sample (n=419), by selecting items with the best measurement properties, and tested in the other (n=418). For each of the 10 items of the BI, item total correlations and effect sizes were computed and rank ordered. The best items were defined as those with the lowest cross product of these rank orderings. The acceptability, reliability, validity, and responsiveness of three short form BIs (five, four, and three item) were determined and compared with the 10 item BI. Agreement between scores generated by short forms and 10 item BI was determined using intraclass correlation coefficients and the method of Bland and Altman. RESULTS—The five best items in this sample were transfers, bathing, toilet use, stairs, and mobility. Of the three short forms examined, the five item BI had the best measurement properties and was psychometrically equivalent to the 10 item BI. Agreement between scores generated by the two measures for individual patients was excellent (ICC=0.90) but not identical (limits of agreement=1.84±3.84). CONCLUSIONS—The five item short form BI may be a suitable outcome measure for group comparison studies in comparable samples. Further evaluations are needed. 
Results demonstrate a fundamental difference between assessment and measurement and the importance of incorporating psychometric methods in the development and evaluation of health measures. PMID:11459898
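The agreement statistic reported in this abstract (limits of agreement, after Bland and Altman) can be illustrated with a minimal sketch. It assumes the conventional definition, mean difference ± 1.96 × SD of the paired differences; the abstract does not state which multiplier was used. The function name `bland_altman_limits` and the scores are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman_limits(scores_a, scores_b, z=1.96):
    """Return (bias, half_width); the limits of agreement are bias ± half_width.

    Assumption: both score lists are on the same scale and paired by patient.
    """
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    bias = mean(diffs)              # mean paired difference
    half_width = z * stdev(diffs)   # z times the sample SD of the differences
    return bias, half_width


# Hypothetical scores for six patients (illustrative only).
full_form = [20, 15, 12, 18, 8, 10]
short_form = [18, 14, 9, 17, 7, 6]

bias, half_width = bland_altman_limits(full_form, short_form)
print(f"limits of agreement = {bias:.2f} ± {half_width:.2f}")
```

A high ICC with wide limits of agreement, as reported here, is the usual reason a short form is recommended for group comparisons rather than for individual patients.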
Alkemade, Nathan; Bowden, Stephen C; Salzman, Louis
2015-02-01
It has been suggested that MMPI-2 scoring requires removal of some items when assessing patients after a traumatic brain injury (TBI). Gass (1991. MMPI-2 interpretation and closed head injury: A correction factor. Psychological Assessment, 3, 27-31) proposed a correction procedure in line with the hypothesis that MMPI-2 endorsement may be affected by symptoms of TBI. This study assessed the validity of the Gass correction procedure. Participants were a sample of patients with a TBI (n = 242) and a random subset of the MMPI-2 normative sample (n = 1,786). The correction procedure implies a failure of measurement invariance across populations. This study examined measurement invariance of one of the MMPI-2 scales (Hs) that includes TBI correction items. A four-factor model of the MMPI-2 Hs items was defined. The factor model was found to meet the criteria for partial measurement invariance. Analysis of the change in sensitivity and specificity values implied by partial measurement invariance failed to indicate significant practical impact of partial invariance. Overall, the results support continued use of all Hs items to assess psychological well-being in patients with TBI. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Adequacy of Using a Three-Item Questionnaire to Determine Zygosity in Chinese Young Twins.
Ho, Connie Suk-Han; Zheng, Mo; Chow, Bonnie Wing-Yin; Wong, Simpson W L; Lim, Cadmon K P; Waye, Mary M Y
2017-03-01
The present study examined the adequacy of a three-item parent questionnaire in determining the zygosity of young Chinese twins and whether there was any association between parent response accuracy and some demographic variables. The sample consisted of 334 pairs of same-sex Chinese twins aged from 3 to 11 years. Three scoring methods, namely the summed score, logistic regression, and decision tree, were employed to evaluate parent response accuracy of twin zygosity based on single nucleotide polymorphism (SNP) information. The results showed that all three methods achieved a high level of accuracy, ranging from 91% to 93%, which was comparable to the accuracy rates in previous Chinese twin studies. Correlation results also showed that the higher the parents' education level or the family income, the more likely parents were to tell correctly whether their twins were identical or fraternal. The present findings confirmed the validity of using a three-item parent questionnaire to determine twin zygosity in a Chinese school-aged twin sample.
Validity and Reliability of the Turkish Chronic Pain Acceptance Questionnaire
Akmaz, Hazel Ekin; Uyar, Meltem; Kuzeyli Yıldırım, Yasemin; Akın Korhan, Esra
2018-05-29
Pain acceptance is the process of giving up the struggle with pain and learning to live a worthwhile life despite it. In assessing patients with chronic pain in Turkey, making a diagnosis and tracking the effectiveness of treatment are done with scales that have been translated into Turkish. However, there is as yet no valid and reliable scale in Turkish to assess the acceptance of pain. The aim was to validate a Turkish version of the Chronic Pain Acceptance Questionnaire developed by McCracken and colleagues. This was a methodological and cross-sectional study. A simple randomized sampling method was used in selecting the study sample. The sample was composed of 201 patients, more than 10 times the number of items examined for validity and reliability in the study, which totaled 20. A patient identification form, the Chronic Pain Acceptance Questionnaire, and the Brief Pain Inventory were used to collect data. Data were collected by face-to-face interviews. In the validity testing, the content validity index was used to evaluate linguistic equivalence, content validity, construct validity, and expert views. In reliability testing of the scale, Cronbach’s α coefficient was calculated, and item analysis and split-half reliability methods were used. Principal component analysis and varimax rotation were used in factor analysis to examine factor structure for construct validity. The item analysis established that the scale, all items, and item-total correlations were satisfactory. The mean total score of the scale was 21.78. The internal consistency coefficient was 0.94, and the correlation between the two halves of the scale was 0.89. The Chronic Pain Acceptance Questionnaire, which is intended to be used in Turkey upon confirmation of its validity and reliability, is an evaluation instrument with sufficient validity and reliability, and it can be reliably used to examine patients’ acceptance of chronic pain.
PMID:29843496
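The relation between a correlation of two test halves and the reliability of the full-length scale is usually expressed by the Spearman-Brown formula. The sketch below is illustrative only: the abstract does not say whether its reported half-correlation of 0.89 was already stepped up, so applying the formula to that figure is an assumption, and the function name `spearman_brown` is hypothetical.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Predicted reliability when a test is lengthened by `length_factor`.

    With length_factor=2 this is the classic split-half step-up:
    r_full = 2 * r_half / (1 + r_half).
    """
    return length_factor * r_half / (1.0 + (length_factor - 1.0) * r_half)


# 0.89 is the correlation between the two halves reported in the abstract;
# treating it as an uncorrected half-test correlation is an assumption.
full_length = spearman_brown(0.89)
print(round(full_length, 2))
```

Under that assumption the stepped-up value is about 0.94, which matches the internal consistency coefficient given in the abstract, although the two statistics are computed differently.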
Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario
2012-04-01
Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health was larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
Competitive control of cognition in rhesus monkeys.
Kowaguchi, Mayuka; Patel, Nirali P; Bunnell, Megan E; Kralik, Jerald D
2016-12-01
The brain has evolved different approaches to solve problems, but the mechanisms that determine which approach to take remain unclear. One possibility is that control progresses from simpler processes, such as associative learning, to more complex ones, such as relational reasoning, when the simpler ones prove inadequate. Alternatively, control could be based on competition between the processes. To test between these possibilities, we posed the support problem to rhesus monkeys using a tool-use paradigm, in which subjects could pull an object (the tool) toward themselves to obtain an otherwise out-of-reach goal item. We initially provided one problem exemplar as a choice: for the correct option, a food item placed on the support tool; for the incorrect option, the food item placed off the tool. Perceptual cues were also correlated with outcome: e.g., red, triangular tool correct, blue, rectangular tool incorrect. Although the monkeys simply needed to touch the tool to register a response, they immediately pulled it, reflecting a relational reasoning process between themselves and another object (R_{self-other}), rather than an associative one between the arbitrary touch response and reward (A_{resp-reward}). Probe testing then showed that all four monkeys used a conjunction of perceptual features to select the correct option, reflecting an associative process between stimuli and reward (A_{stim-reward}). We then added a second problem exemplar and subsequent testing revealed that the monkeys switched to using the on/off relationship, reflecting a relational reasoning process between two objects (R_{other-other}). Because behavior appeared to reflect R_{self-other} rather than A_{resp-reward}, and A_{stim-reward} prior to R_{other-other}, our results suggest that cognitive processes are selected via competitive control dynamics. Copyright © 2016 Elsevier B.V. All rights reserved.
Pulmonary function tests correlated with thoracic volumes in adolescent idiopathic scoliosis.
Ledonio, Charles Gerald T; Rosenstein, Benjamin E; Johnston, Charles E; Regelmann, Warren E; Nuckley, David J; Polly, David W
2017-01-01
Scoliosis deformity has been linked with deleterious changes in the thoracic cavity that affect pulmonary function. The causal relationship between spinal deformity and pulmonary function has yet to be fully defined. It has been hypothesized that deformity correction improves pulmonary function by both restoring respiratory muscle efficiency and increasing the space available to the lungs. This research aims to correlate pulmonary function and thoracic volume before and after scoliosis correction. Retrospective correlational analysis between thoracic volume modeling from plain x-rays and pulmonary function tests was conducted. Adolescent idiopathic scoliosis patients enrolled in a multicenter database were sorted by pre-operative Total Lung Capacity (TLC) % predicted values from their Pulmonary Function Tests (PFT). Ten patients with the best and ten patients with the worst TLC values were included. Modeled thoracic volume and TLC values were compared before and 2 years after surgery. Scoliosis correction resulted in an increase in the thoracic volume for patients with the worst initial TLCs (11.7%) and those with the best initial TLCs (12.5%). Among the adolescents with the most severe pulmonary restriction prior to surgery, post-operative change in total lung capacity correlated strongly with change in thoracic volume (r² = 0.839; p < 0.001). The mean increase in thoracic volume in this group was 373.1 cm³ (11.7%), which correlated with a 21.2% improvement in TLC. Scoliosis correction in adolescents was found to increase thoracic volume and is strongly correlated with improved TLC in cases with severe restrictive pulmonary function, but no correlation was found in cases with normal pulmonary function. © 2016 Orthopaedic Research Society. Published by Wiley Periodicals, Inc. J Orthop Res 35:175-182, 2017.
NASA Astrophysics Data System (ADS)
De Michelis, Paola; Tozzi, Roberta; Consolini, Giuseppe
2017-02-01
From the very first measurements made by the magnetometers onboard the Swarm satellites, launched by the European Space Agency (ESA) in late 2013, a discrepancy emerged between scalar and vector measurements. A careful analysis of this phenomenon led to an empirical model of the disturbance, which is highly correlated with the Sun incidence angle, and to a corresponding correction of the vector data. The empirical model adopted by ESA results in a significant decrease in the amplitude of the disturbance affecting VFM measurements, thus greatly improving the vector magnetic data quality. This study is focused on the characterization of the difference between the magnetic field intensity measured by the absolute scalar magnetometer (ASM) and that reconstructed using the vector field magnetometer (VFM) installed on the Swarm constellation. Applying the empirical mode decomposition method, we find the intrinsic mode functions (IMFs) associated with ASM-VFM total intensity differences obtained with data both uncorrected and corrected for the disturbance correlated with the Sun incidence angle. Surprisingly, no differences are found in the nature of the IMFs embedded in the analyzed signals, these IMFs being characterized by the same dominant periodicities before and after correction. The effect of correction manifests in the decrease in the energy associated with some IMFs contributing to corrected data. Some IMFs identified by analyzing the ASM-VFM intensity discrepancy are characterized by the same dominant periodicities as those obtained by analyzing the temperature fluctuations of the VFM electronic unit. Thus, the disturbance correlated with the Sun incidence angle could still be present in the corrected magnetic data. Furthermore, the ASM-VFM total intensity difference and the VFM electronic unit temperature display a maximal shared information with a time delay that depends on local time.
Taken together, these findings may help to relate the features of the observed VFM-ASM total intensity difference to the physical characteristics of the real disturbance thus contributing to improve the empirical model proposed for the correction of data.
Psychometric properties of the Brisbane Burn Scar Impact Profile in adults with burn scars
Kimble, Roy; McPhail, Steven; Plaza, Anita; Simons, Megan
2017-01-01
Objective: The aim of the study was to determine the longitudinal validity, reproducibility, responsiveness and interpretability of the adult version of the Brisbane Burn Scar Impact Profile, a patient-report measure of health-related quality of life. Methods: A prospective longitudinal cohort study of patients with or at risk of burn scarring was conducted at three assessment points (at baseline around the time of wound healing, one to two weeks post-baseline and 1-month post-baseline). Participants attending a major metropolitan adult burn centre at baseline were recruited. Participants completed the Brisbane Burn Scar Impact Profile and the 36-item Short Form Health Survey and Patient Observer Scar Assessment Scale. Intraclass Correlation Coefficients (ICCs), smallest detectable change, percentage of those who improved, stayed the same or worsened and Area under the Receiver Operating Characteristic Curve (AUC) were used to test the aim. Results: Data were included for 118 participants at baseline, 68 participants at one to two weeks and 57 participants at 1-month post-baseline. All groups of items had acceptable reproducibility, except for the overall impact of burn scars (ICC = 0.69), the impact of sensations, which was not expected to be stable (ICC = 0.63), and mobility and daily activities (ICC = 0.63, 0.67 respectively). The responsiveness of six out of seven groups of items able to be tested against external criterion was supported (AUC = 0.72–0.75). Hypothesised correlations of changes in the Brisbane Burn Scar Impact Profile items with changes in criterion measures generally supported longitudinal validity (e.g., nine out of thirteen hypotheses using the SF-36 as an external criterion were supported).
Internal consistency estimates, item-total and inter-item correlations indicated there was likely redundancy of some groups of items, particularly in the relationships and social interaction, appearance and emotional reactions items (Cronbach’s alpha range = 0.94–0.95). Conclusion: Support was found for the reproducibility, longitudinal validity, responsiveness and interpretability of most groups of Brisbane Burn Scar Impact Profile items and some individual items in the test population. Potential redundancy of items should be investigated further. PMID:28902874
The Psychometric Properties of PHQ-4 Depression and Anxiety Screening Scale Among College Students.
Khubchandani, Jagdish; Brey, Rebecca; Kotecki, Jerome; Kleinfelder, JoAnn; Anderson, Jason
2016-08-01
Depression and anxiety are some of the most common causes of morbidity, social dysfunction, and reduced academic performance in college students. The combination of improved surveillance and access to care would result in better outreach. Brief screening tools can help reach larger populations of college students efficiently. However, reliability and validity of brief screeners for anxiety and depression have not been assessed in college students. Thus, the purpose of this study was to assess in a sample of college students the psychometric properties of PHQ-4, a brief screening tool for depression and anxiety. Undergraduate students were recruited from general education classes at a Midwestern university. Students were given a questionnaire that asked them whether they had been diagnosed by a doctor or health professional with anxiety or depression. Next, they were asked to respond to the items on the PHQ-4 scale. A total of 934 students responded to the survey (response rate=72%). The majority of the participants were female (63%) and White (80%). The internal reliability of PHQ-4 was found to be high (α=0.81). Those who were diagnosed with depression or anxiety had significantly higher scores on PHQ-4 (p<0.01). Corrected item-total correlations for PHQ-4 were between r=0.66 and r=0.80. PHQ-4 operating characteristics were estimated and area under the curve (AUC) values were 0.835 and 0.787 for anxiety and depression, respectively. The PHQ-4 is a reliable and valid tool that can serve as a mass screener for depression and anxiety in young adults. Widespread implementation of this screening tool should be explored across college campuses. Copyright © 2016 Elsevier Inc. All rights reserved.
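Two of the statistics reported in this abstract, Cronbach's α and the corrected item-total correlation, can be computed as in the following minimal sketch. The response matrix is hypothetical (four items scored 0-3, six respondents) and is not data from the study; the function names are illustrative. "Corrected" here means each item is correlated with the sum of the other items, so the item does not inflate its own correlation.

```python
from statistics import mean, variance


def cronbach_alpha(items):
    """items: one list of respondent scores per item (all the same length)."""
    k = len(items)
    totals = [sum(row) for row in zip(*items)]          # total score per respondent
    sum_item_var = sum(variance(col) for col in items)  # sample variances
    return k / (k - 1) * (1 - sum_item_var / variance(totals))


def pearson(x, y):
    """Plain Pearson correlation, written out to avoid version-specific helpers."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def corrected_item_total(items):
    """Correlate each item with the sum of the remaining items."""
    rows = list(zip(*items))
    result = []
    for i, col in enumerate(items):
        rest = [sum(row) - row[i] for row in rows]
        result.append(pearson(col, rest))
    return result


# Hypothetical responses (illustrative only).
responses = [
    [0, 1, 2, 3, 1, 2],
    [0, 1, 3, 3, 0, 2],
    [1, 1, 2, 3, 1, 1],
    [0, 2, 2, 3, 1, 2],
]
alpha = cronbach_alpha(responses)
r_it = corrected_item_total(responses)
print(f"alpha = {alpha:.2f}")
print("corrected item-total r:", [round(r, 2) for r in r_it])
```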
von Wyl, Agnes; Toggweiler, Stephan; Zollinger, Ruedi
2017-01-01
The Health of the Nation Outcome Scales for Children and Adolescents (HoNOSCA), in use worldwide, is a 13-item measure assessing the biopsychosocial severity of mental health problems in children and adolescents. This article introduces the authorized German-language version of HoNOSCA, the HoNOSCA-D, and examines and discusses its psychometric properties based on a clinical sample of 1,533 children and adolescents aged 4;0 to 17;11 years. For the HoNOSCA-D total score (severity of mental health problems), internal consistency (Cronbach’s alpha) was 0.63. The discriminative power of the items ranged from 0.07 to 0.44; the average interitem correlation was 0.11. Due to this stochastic independence, calculation of a total severity index is acceptable. Using factor analysis, the principal axis factoring and varimax rotation resulted in a four-factor structure, which with a Kaiser–Meyer–Olkin measure of sampling adequacy of 0.684 explained 30.62% of total variance. The convergent correlations with the German-language parent report version of the Strengths and Difficulties Questionnaire were as expected and showed a medium effect size. Gender and age differences in the HoNOSCA-D total score were small. Regarding the 13 items gender and age differences were negligible to medium. The highest severity was found for schizophrenia and psychotic disorders, followed by affective disorders and social behavior disorders. Overall, validity of HoNOSCA-D was clearly supported. PMID:29033858
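The link between the number of items, the average inter-item correlation, and α reported in this abstract can be checked with the standardized-alpha formula. This is a rough sketch only: standardized α is computed from correlations, whereas the abstract's Cronbach's α was presumably computed from raw item variances, so the two need not agree exactly. The function name is hypothetical.

```python
def standardized_alpha(n_items, mean_inter_item_r):
    """Standardized alpha: k * r / (1 + (k - 1) * r)."""
    k, r = n_items, mean_inter_item_r
    return k * r / (1.0 + (k - 1) * r)


# 13 items and an average inter-item correlation of 0.11 are taken from the abstract.
alpha_std = standardized_alpha(13, 0.11)
print(round(alpha_std, 2))
```

The result is about 0.62, close to the reported 0.63. It shows how a 13-item scale can reach a moderate α even when its items are nearly uncorrelated.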
Garcia-Campayo, Javier; Navarro-Gil, Mayte; Andrés, Eva; Montero-Marin, Jesús; López-Artal, Lorena; Demarzo, Marcelo Marcos Piva
2014-01-10
Self-compassion is a key psychological construct for assessing clinical outcomes in mindfulness-based interventions. The aim of this study was to validate the Spanish versions of the long (26 item) and short (12 item) forms of the Self-Compassion Scale (SCS). The translated Spanish versions of both subscales were administered to two independent samples: Sample 1 was comprised of university students (n = 268) who were recruited to validate the long form, and Sample 2 was comprised of Aragon Health Service workers (n = 271) who were recruited to validate the short form. In addition to SCS, the Mindful Attention Awareness Scale (MAAS), the State-Trait Anxiety Inventory-Trait (STAI-T), the Beck Depression Inventory (BDI) and the Perceived Stress Questionnaire (PSQ) were administered. Construct validity, internal consistency, test-retest reliability and convergent validity were tested. The Confirmatory Factor Analysis (CFA) of the long and short forms of the SCS confirmed the original six-factor model in both scales, showing goodness of fit. Cronbach's α for the 26 item SCS was 0.87 (95% CI = 0.85-0.90) and ranged between 0.72 and 0.79 for the 6 subscales. Cronbach's α for the 12-item SCS was 0.85 (95% CI = 0.81-0.88) and ranged between 0.71 and 0.77 for the 6 subscales. The long (26-item) form of the SCS showed a test-retest coefficient of 0.92 (95% CI = 0.89-0.94). The Intraclass Correlation (ICC) for the 6 subscales ranged from 0.84 to 0.93. The short (12-item) form of the SCS showed a test-retest coefficient of 0.89 (95% CI: 0.87-0.93). The ICC for the 6 subscales ranged from 0.79 to 0.91. The long and short forms of the SCS exhibited a significant negative correlation with the BDI, the STAI and the PSQ, and a significant positive correlation with the MAAS. The correlation between the total score of the long and short SCS form was r = 0.92. 
The Spanish versions of the long (26-item) and short (12-item) forms of the SCS are valid and reliable instruments for the evaluation of self-compassion among the general population. These results substantiate the use of this scale in research and clinical practice.
Kim, Hae Won
2009-02-01
This study was done to develop a pregnancy nutrition knowledge scale and to examine the relationships between pregnancy nutrition knowledge and eating habits in pregnant women. Using convenience sampling, 189 pregnant women who used community health centers for their antenatal care were recruited. Data were collected using a self-administered questionnaire including items on pregnancy nutrition knowledge (18 items) developed by the researcher and items on eating habits (14 items). Cronbach's alpha and exploratory factor analysis were examined to test reliability and construct validity of the scale. Pearson's correlation coefficients were used to identify the relationship between pregnancy nutrition knowledge and eating habits. Cronbach's alpha of the 18 items was .80. In factor analysis using principal components, 6 factors explained 65% of the total variance. The level of pregnancy nutrition knowledge was not sufficient, but correlations between pregnancy nutrition knowledge and some eating habits were significant. Specifically, pregnancy nutrition knowledge was positively correlated with good eating habits and negatively with bad eating habits. The pregnancy nutrition knowledge scale developed in this study is acceptable for nutrition education led by nurses. Pregnancy nutrition knowledge and eating habits are considered major variables for antenatal nutrition education. In future studies, explorations are needed on dietary intake and physiological indices in pregnant women, comparison of women at risk with those not at risk, and development of nutritional education programs for pregnant women.
Reliability and validity of the Microsoft Kinect for evaluating static foot posture
2013-01-01
Background: The evaluation of foot posture in a clinical setting is useful to screen for potential injury; however, disagreement remains as to which method has the greatest clinical utility. An inexpensive and widely available imaging system, the Microsoft Kinect™, may possess the characteristics to objectively evaluate static foot posture in a clinical setting with high accuracy. The aim of this study was to assess the intra-rater reliability and validity of this system for assessing static foot posture. Methods: Three measures were used to assess static foot posture: traditional visual observation using the Foot Posture Index (FPI), a 3D motion analysis (3DMA) system, and software designed to collect and analyse image and depth data from the Kinect. Spearman’s rho was used to assess intra-rater reliability and concurrent validity of the Kinect to evaluate foot posture, and a linear regression was used to examine the ability of the Kinect to predict total visual FPI score. Results: The Kinect demonstrated moderate to good intra-rater reliability for four FPI items of foot posture (ρ = 0.62 to 0.78) and moderate to good correlations with the 3DMA system for four items of foot posture (ρ = 0.51 to 0.85). In contrast, intra-rater reliability of visual FPI items was poor to moderate (ρ = 0.17 to 0.63), and correlations with the Kinect and 3DMA systems were poor (absolute ρ = 0.01 to 0.44). Kinect FPI items with moderate to good reliability predicted 61% of the variance in total visual FPI score. Conclusions: The majority of the foot posture items derived using the Kinect were more reliable than the traditional visual assessment of FPI, and were valid when compared to a 3DMA system. Individual foot posture items recorded using the Kinect were also shown to predict a moderate degree of variance in the total visual FPI score. Combined, these results support the future potential of the Kinect to accurately evaluate static foot posture in a clinical setting. PMID:23566934
Evaluation of a Picture-Based Test for the Assessment of Gelotophobia.
Ruch, Willibald; Platt, Tracey; Bruntsch, Richard; Ďurka, Róbert
2017-01-01
This study examines whether coding open answers in a picture-based test, as to the extent they reflect the fear of being laughed at (i.e., gelotophobia), demonstrates sufficient validity to construct a semi-projective test for the assessment of gelotophobia. Previous findings indicate that cartoon stimuli depicting laughter situations (i.e., in the pilot version of the Picture-Geloph; Ruch et al., 2009) on average elicit fear-typical responses in gelotophobes stronger than in non-gelotophobes. The present study aims to (a) develop a standardized scoring procedure based on a coding scheme, and (b) examine the properties of the pilot version of the Picture-Geloph in order to select the most acceptable items for a standard form of the test. For Study 1, a sample of N = 126 adults, with scores evenly distributed across the gelotophobia spectrum, completed the pilot version of the Picture-Geloph by noting down what they assumed the protagonist in each of 20 cartoons would say or think. Furthermore, participants answered the GELOPH<15> (Ruch and Proyer, 2008), the established questionnaire for the subjective assessment of the fear of being laughed at. Agreement between two independent raters indicated that the developed coding scheme allows for objective and reliable scoring of the Picture-Geloph (mean of intraclass correlations = 0.66). Nine items met the criteria employed to identify the psychometrically most reliable and valid items. These items were unidimensional and internally consistent (Cronbach's alpha = 0.78). The total score of this selection (i.e., the Picture-Geloph<9>) discriminated significantly between non-fearful, slightly, markedly, and extremely fearful individuals; furthermore, it correlated sufficiently high (r = 0.66; r_c = 0.79 when corrected for reliability of both measures) with the GELOPH<15>.
Cronbach's alpha (0.73) was largely comparable whereas the estimate of convergent validity was found to be lower in one (r = 0.50; r_c = 0.61; N = 103) of the two samples in Study 2. Combining all three samples (N = 313) yielded a linear relationship between the self-report and the Picture-Geloph. With the Picture-Geloph<9> and the developed coding scheme, an unobtrusive and valid alternative instrument for the assessment of gelotophobia is provided. Possible applications are discussed.
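The correlations "corrected for reliability of both measures" in this abstract are most plausibly instances of the standard correction for attenuation, which divides the observed correlation by the square root of the product of the two reliabilities. The sketch below is an assumption-laden illustration: r = 0.66 and α = 0.78 come from the abstract, but the reliability of the GELOPH<15> is not reported there, so the value 0.90 is a hypothetical placeholder, and the function name is illustrative.

```python
from math import sqrt


def disattenuate(r_xy, rel_x, rel_y):
    """Correction for attenuation: r_c = r_xy / sqrt(rel_x * rel_y)."""
    return r_xy / sqrt(rel_x * rel_y)


observed_r = 0.66          # from the abstract (Study 1)
rel_picture_geloph = 0.78  # Cronbach's alpha from the abstract (Study 1)
rel_geloph15 = 0.90        # HYPOTHETICAL placeholder, not reported in the abstract

r_c = disattenuate(observed_r, rel_picture_geloph, rel_geloph15)
print(round(r_c, 2))
```

Because both reliabilities are below 1, the corrected coefficient is always larger in magnitude than the observed one; how much larger depends entirely on the reliabilities plugged in.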
Akiskal, Hagop S; Mendlowicz, Mauro V; Jean-Louis, Girardin; Rapaport, Mark H; Kelsoe, John R; Gillin, J Christian; Smith, Tom L
2005-03-01
To validate a short English-language version of the Temperament Evaluation of Memphis, Pisa, Paris and San Diego-autoquestionnaire version (TEMPS-A), a self-report questionnaire designed to measure temperamental variations in psychiatric patients and healthy volunteers. Its constituent subscales and items were formulated on the basis of the diagnostic criteria for affective temperaments (cyclothymic, dysthymic, irritable, hyperthymic, and anxious), originally developed by the first author and his former collaborators. Further item wording and selection were achieved at a later stage through an iterative process that incorporated feedback from clinicians, researchers, and research volunteers. A total of 510 volunteers (284 patients with mood disorders, 131 relatives of bipolar probands, and 95 normal controls) were recruited by advertisement in the newspapers, announcements on radio and television, flyers and newsletters, and word of mouth. All participants were interviewed using the Structured Clinical Interview for DSM-III-R, and completed the 110-item TEMPS-A and the Temperament and Character Inventory (TCI-125). The factorial structure, the alpha coefficients, and the item-total correlation coefficients of the TEMPS-A and the correlation coefficients between the dimensions of the TCI and the TEMPS-A subscales were then determined. A principal components analysis with a Varimax rotation found that 39 out of the 110 original items of the TEMPS-A loaded on five factors that were interpreted as representing the cyclothymic, depressive, irritable, hyperthymic, and anxious factors. Alpha coefficients for internal consistency were 0.91 (cyclothymic), 0.81 (depressive), 0.77 (irritable), 0.76 (hyperthymic), and 0.67 (anxious). We found statistically significant positive correlations between harm avoidance and all subscales except the hyperthymic.
Positive correlations between novelty seeking and the hyperthymic and cyclothymic subscales, and negative correlations between novelty seeking and the remaining subscales, were also recorded. Other major findings included positive correlations between the hyperthymic subscale and reward dependence, persistence, and self-directedness; positive correlations between self-transcendence and the cyclothymic, hyperthymic, and anxious subscales; and negative correlations between cooperativeness and the depressive, cyclothymic, irritable, and anxious subscales. As the full-scale anxious temperament was added after the four scales of the TEMPS-A were developed, it has only been evaluated in 345 subjects. These data indicate that the TEMPS-A in its shortened version is a psychometrically valid scale with good internal consistency. The proposed five-subscale structure is upheld. Concurrent validity against the TCI is shown. Most importantly, for each of the temperaments, we were able to show positive attributes which are meaningful in an evolutionary context, along with traits which make a person vulnerable to mood shifts. This hypothesized dual nature of temperament, which is upheld by our data, is a desirable characteristic for a putative behavioral endophenotype in an oligogenic model of inheritance for bipolar disorder.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-15
... DEPARTMENT OF THE INTERIOR National Park Service [2253-665] Notice of Intent To Repatriate Cultural Items: Peabody Museum of Archaeology and Ethnology, Harvard University, Cambridge, MA; Correction AGENCY: National Park Service, Interior. ACTION: Notice; correction. Notice is here given in accordance...
77 FR 59339 - Acquisition of Commercial Items
Federal Register 2010, 2011, 2012, 2013, 2014
2012-09-27
... DEPARTMENT OF DEFENSE Defense Acquisition Regulations System 48 CFR Part 212 Acquisition of Commercial Items CFR Correction 212.504 [Corrected] In Title 48 of the Code of Federal Regulations, Chapter 2 (Parts 201--299), revised as of October 1, 2011, on page 73, in section 212.504, paragraph (a) is...
Mao, Hui-Fen; Chen, Wan-Yin; Yao, Grace; Huang, Sheau-Ling; Lin, Chia-Chi; Huang, Wen-Ni Wennie
2010-05-01
To develop and validate a cross-cultural version of the Quebec User Evaluation of Satisfaction with Assistive Technology (QUEST 2.0) for users of assistive technology devices in Taiwan. A cross-sectional survey. The standard cultural adaptation procedure was used for questionnaire translation and cultural item design. A field test was then conducted for item selection and psychometric properties testing. One hundred and five volunteer assistive device users in the community. A questionnaire comprising 12 items of the QUEST 2.0 and 16 culture-specific items. One culture-specific item, 'Cost', was selected based on eight criteria and added to the QUEST 2.0 (12 items) to formulate the Taiwanese version of QUEST 2.0 (T-QUEST). The T-QUEST consisted of 13 items which were classified into two domains: device (8 items) and service (5 items). The internal consistencies of the device, service and total T-QUEST scores were 0.87, 0.84 and 0.90, respectively. The device, service and total T-QUEST scores achieved good test-retest stability (intraclass correlation coefficients (ICC) of 0.90, 0.97 and 0.95, respectively). Exploratory factor analysis revealed that T-QUEST had a two-factor structure for device and service in the construct of user satisfaction (53.42% of the variance explained). Users of assistive devices in different cultures may have different concerns regarding satisfaction. T-QUEST is the first published version of QUEST with culture-specific items added to the original translated items of QUEST 2.0. T-QUEST was a valid and reliable tool for measuring user satisfaction among Mandarin-speaking individuals using various kinds of assistive devices.
Reduced-Item Food Audits Based on the Nutrition Environment Measures Surveys.
Partington, Susan N; Menzies, Tim J; Colburn, Trina A; Saelens, Brian E; Glanz, Karen
2015-10-01
The community food environment may contribute to obesity by influencing food choice. Store and restaurant audits are increasingly common methods for assessing food environments, but are time consuming and costly. A valid, reliable brief measurement tool is needed. The purpose of this study was to develop and validate reduced-item food environment audit tools for stores and restaurants. Nutrition Environment Measures Surveys for stores (NEMS-S) and restaurants (NEMS-R) were completed in 820 stores and 1,795 restaurants in West Virginia, San Diego, and Seattle. Data mining techniques (correlation-based feature selection and linear regression) were used to identify survey items highly correlated with total survey scores and produce reduced-item audit tools that were subsequently validated against full NEMS surveys. Regression coefficients were used as weights that were applied to reduced-item tool items to generate comparable scores to full NEMS surveys. Data were collected and analyzed in 2008-2013. The reduced-item tools included eight items for grocery, ten for convenience, seven for variety, and five for other stores; and 16 items for sit-down, 14 for fast casual, 19 for fast food, and 13 for specialty restaurants (10% of the full NEMS-S and 25% of the full NEMS-R). There were no significant differences in median scores for varying types of retail food outlets when compared to the full survey scores. Median in-store audit time was reduced 25%-50%. Reduced-item audit tools can reduce the burden and complexity of large-scale or repeated assessments of the retail food environment without compromising measurement quality. Copyright © 2015 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.
Abdel-Khalek, Ahmed M
2004-06-01
The Arabic Scale of Death Anxiety (ASDA) was constructed and validated in a sample of undergraduates (17-33 yrs) in 3 Arab countries, Egypt (n = 418), Kuwait (n = 509), and Syria (n = 709). In its final form, the ASDA consists of 20 statements. Each item is answered on a 5-point intensity scale anchored by 1: No, and 5: Very much. Alpha reliabilities ranged from .88 to .93, and item-remainder correlations ranged between .27 and .74; the 1-week test-retest reliability was .90 (Egyptians only), denoting high internal consistency and stability. The correlations between the ASDA and Templer's DAS ranged from .60 to .74 denoting high convergent validity of the ASDA against the DAS in the 3 Arab countries. Four factors were extracted in the Egyptian sample and labeled "Fear of dead people and tombs", "Fear of postmortem events", "Fear of lethal disease", and "death preoccupation". The first two factors were almost completely identical in the three countries. The item, "I fear the torture of the grave", had a very high mean score. There were significant correlations between the ASDA and death depression, death obsession, reasons for death fear, and general anxiety, depression, obsession-compulsion, neuroticism, and being a female. All female groups attained significantly higher mean ASDA scores than their male counterparts. Kuwaitis had higher mean ASDA total scores, in comparison with their Egyptian and Syrian counterparts, whereas female Syrians attained the lowest mean ASDA total score in proportion to their female peers.
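Several abstracts in this list report Cronbach's alpha together with item-remainder (corrected item-total) correlations. As a rough sketch of how these two statistics are conventionally computed, the following uses only the Python standard library. The function names and the small `toy` response matrix are invented for illustration and are not data from any of the studies.

```python
from math import sqrt
from statistics import mean, variance


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cronbach_alpha(rows):
    """alpha = k/(k-1) * (1 - sum of item variances / variance of total score)."""
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def item_remainder_correlations(rows):
    """Correlate each item with the sum of all *other* items."""
    k = len(rows[0])
    result = []
    for j in range(k):
        item = [row[j] for row in rows]
        remainder = [sum(row) - row[j] for row in rows]
        result.append(pearson(item, remainder))
    return result


# Hypothetical responses: 6 respondents x 4 items on a 5-point scale.
toy = [
    [1, 2, 1, 2],
    [2, 2, 3, 2],
    [3, 4, 3, 3],
    [4, 4, 5, 4],
    [5, 5, 4, 5],
    [2, 1, 2, 1],
]
print(round(cronbach_alpha(toy), 3))
print([round(r, 2) for r in item_remainder_correlations(toy)])
```

Removing the item from the total before correlating avoids inflating the coefficient with the item's own variance, which is why the corrected value is usually lower than the uncorrected item-total correlation.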
Richter, Jörg
2015-04-01
Methods suitable for frequent use are needed to assess intervention progress and outcome. To provide preliminary information about psychometric properties for the Norwegian version of the Brief Problems Monitor (BPM). The following were calculated in large, representative data sets of Norwegian children and adolescents: Cronbach's alpha scores and intra-class correlation coefficients as indicators of internal consistency (reliability); Pearson correlation coefficients between corresponding subscales of the long and short ASEBA form versions; and multiple regression coefficients to explore the predictive power of the reduced item-set related to the corresponding scale-scores of the long version. Cronbach's alpha scores of the Norwegian version of the BPM subscales varied between 0.67 (attention BPM-youth) and 0.88 (attention BPM-teacher) and between 0.90 (BPM-youth) and 0.96 (BPM-teacher) for its total problem score. Corresponding subscales from the long versions and the BPM as well as the total problems scores were closely correlated with coefficients of high effect size (all r > 0.80). The variance of the items of the BPM explained about three-quarters or more of the variance in the corresponding subscales of the long version. The Norwegian BPM has good psychometric properties in terms of (1) acceptable to good internal consistency and (2) regression coefficients of high effect size from the BPM items to the problem-scale scores of the long versions as validity indicators. Its use in clinical practice and research can be recommended.
de Jong, Martijn G; Pieters, Rik; Stremersch, Stefan
2012-09-01
Answers to sensitive questions are prone to social desirability bias. If not properly addressed, the validity of the research can be suspect. This article presents multigroup item randomized response theory (MIRRT) to measure self-reported sensitive topics across cultures. The method was specifically developed to reduce social desirability bias by making an a priori change in the design of the survey. The change involves the use of a randomization device (e.g., a die) that preserves participants' privacy at the item level. In cases where multiple items measure a higher level theoretical construct, the researcher could still make inferences at the individual level. The method can correct for under- and overreporting, even if both occur in a sample of individuals or across nations. We present and illustrate MIRRT in a nontechnical manner, provide WinBugs software code so that researchers can directly implement it, and present 2 cross-national studies in which it was applied. The first study compared nonstudent samples from 2 countries (total n = 927) on permissive sexual attitudes and risky sexual behavior and related these to individual-level characteristics such as the Big Five personality traits. The second study compared nonstudent samples from 17 countries (total n = 6,195) on risky sexual behavior and related these to individual-level characteristics, such as gender and age, and to country-level characteristics, such as sex ratio.
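The idea of a randomization device that protects privacy at the item level can be illustrated with a classical forced-response design for a single yes/no item. This is a simplified stand-in, not the MIRRT model itself, which is a multigroup item response model estimated in WinBugs. The die rule assumed here is that a roll of 1 forces "yes", a roll of 6 forces "no", and rolls of 2 to 5 require a truthful answer. The function name and the counts are hypothetical.

```python
def forced_response_prevalence(
    observed_yes_share: float,
    p_truth: float = 4 / 6,        # ASSUMPTION: die shows 2-5, answer truthfully
    p_forced_yes: float = 1 / 6,   # ASSUMPTION: die shows 1, say "yes" regardless
) -> float:
    """Estimate the true share of 'yes' under a forced-response design.

    The observed share satisfies: observed = p_forced_yes + p_truth * true_share,
    so the estimate inverts that relation and clips the result to [0, 1].
    """
    if not 0 < p_truth <= 1:
        raise ValueError("p_truth must lie in (0, 1]")
    estimate = (observed_yes_share - p_forced_yes) / p_truth
    return min(1.0, max(0.0, estimate))


# Hypothetical survey: 220 "yes" answers among 600 respondents.
estimate = forced_response_prevalence(220 / 600)
print(round(estimate, 3))
```

No individual answer reveals the respondent's true status, because any "yes" may have been forced by the die, yet the aggregate share can still be recovered.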
The CAREQOL-MS was a useful instrument to measure caregiver quality of life in multiple sclerosis.
Benito-León, Julián; Rivera-Navarro, Jesús; Guerrero, Angel Luis; de Las Heras, Virginia; Balseiro, José; Rodríguez, Elena; Belló, Mireia; Martínez-Martín, Pablo
2011-06-01
To develop and test the first specific instrument for assessing caregiver health-related quality of life (HRQOL) in multiple sclerosis (MS) (CAREQOL-MS). Questionnaire items were derived from a literature review and the views of patients, caregivers, and experts. The instrument was reduced after the analyses of caregivers' interviews and experts' opinions. CAREQOL-MS psychometric properties were assessed in 276 MS caregivers. The final version consisted of 24 items (five subscales) and was free of floor or ceiling effects. For subscales, the Cronbach's alpha coefficient ranged from 0.75 to 0.90. The item-total correlation was 0.62-0.74 for subscale I (physical burden/global health); 0.56-0.74 for subscale II (social impact); 0.52-0.62 for subscale III (emotional impact), and 0.58-0.65 for subscale IV (need of help); subscale V (emotional reactions) had only two items. The intraclass correlation coefficient (0.96 for the total score; 0.75-0.95 for subscales) suggested satisfactory reproducibility. CAREQOL-MS subscales were closely associated with the Zarit Burden Interview and moderately associated with the Short Form 36 mental/physical components. CAREQOL-MS subscale scores significantly increased (worse HRQOL) with increasing caregivers' age and Expanded Disability Status Scale score. The standard error of the measurement ranged from 0.91 to 2.43 for subscales. Our results provided initial evidence of the usefulness and satisfactory psychometric properties of the CAREQOL-MS. Copyright © 2011 Elsevier Inc. All rights reserved.
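The standard error of measurement mentioned in this abstract is commonly obtained from a scale's standard deviation and a reliability coefficient as SEM = SD × sqrt(1 − reliability). The sketch below assumes that textbook formula. The abstract does not state which reliability estimate or standard deviations were used, so the input values here are placeholders.

```python
import math


def standard_error_of_measurement(sd: float, reliability: float) -> float:
    """SEM = SD * sqrt(1 - reliability), with reliability in [0, 1]."""
    if sd < 0 or not 0 <= reliability <= 1:
        raise ValueError("sd must be non-negative and reliability within [0, 1]")
    return sd * math.sqrt(1 - reliability)


# ASSUMPTION: a hypothetical score SD of 10 points combined with a reliability of 0.96.
sem = standard_error_of_measurement(10.0, 0.96)
print(round(sem, 2))
```

A smaller SEM means that an observed score is expected to lie closer to the respondent's true score.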
Gelaye, Bizu; Lohsoonthorn, Vitool; Lertmeharit, Somrat; Pensuksan, Wipawan C; Sanchez, Sixto E; Lemma, Seblewengel; Berhane, Yemane; Zhu, Xiaotong; Vélez, Juan Carlos; Barbosa, Clarita; Anderade, Asterio; Tadesse, Mahlet G; Williams, Michelle A
2014-01-01
The Pittsburgh Sleep Quality Index (PSQI) and the Epworth Sleepiness Scale (ESS) are questionnaires used to assess sleep quality and excessive daytime sleepiness in clinical and population-based studies. The present study aimed to evaluate the construct validity and factor structure of the PSQI and ESS questionnaires among young adults in four countries (Chile, Ethiopia, Peru and Thailand). A cross-sectional study was conducted among 8,481 undergraduate students. Students were invited to complete a self-administered questionnaire that collected information about lifestyle, demographic, and sleep characteristics. In each country, the construct validity and factorial structures of PSQI and ESS questionnaires were tested through exploratory and confirmatory factor analyses (EFA and CFA). The largest component-total correlation coefficient for sleep quality as assessed using PSQI was noted in Chile (r = 0.71) while the smallest component-total correlation coefficient was noted for sleep medication use in Peru (r = 0.28). The largest component-total correlation coefficient for excessive daytime sleepiness as assessed using ESS was found for item 1 (sitting/reading) in Chile (r = 0.65) while the lowest item-total correlation was observed for item 6 (sitting and talking to someone) in Thailand (r = 0.35). Using both EFA and CFA a two-factor model was found for PSQI questionnaire in Chile, Ethiopia and Thailand while a three-factor model was found for Peru. For the ESS questionnaire, we noted two factors for all four countries. Overall, we documented cross-cultural comparability of sleep quality and excessive daytime sleepiness measures using the PSQI and ESS questionnaires among Asian, South American and African young adults. 
Although both the PSQI and ESS were originally developed as single-factor questionnaires, the results of our EFA and CFA revealed the multidimensionality of the scales, suggesting limited usefulness of the global PSQI and ESS scores to assess sleep quality and excessive daytime sleepiness.
A brief marital satisfaction screening tool for use in primary care medicine.
Bailey, Justin; Kerley, Sara; Kibelstis, Thomas
2012-02-01
In the last 3 decades, research has shown a consistent association between marriage and mortality and morbidity benefits. Despite the known emotional and physiological benefits of marriage, and the high rate of marriage failure, there are no well-defined screening tools to identify at-risk marriages in primary care settings. Patients presenting to a family medicine clinic were asked to complete a one-item screening question about the level of satisfaction with their marriage. Participants were also asked to fill out the Dyadic Adjustment Scale (DAS), a validated 32-item marital adjustment scale. A total of 159 of 208 (76%) respondents completed the survey. The average DAS score was 111 (SD=21.5), similar to the national average of 114 (SD=17.8). Using the DAS as the gold standard for marital satisfaction, we assessed the level of agreement between the one-item screener and the longer DAS. Pearson's correlation coefficient was 0.67. The ROC curve showed a sensitivity of 86% and a specificity of 86% for the one-item screener. Area under the curve was 0.89 (95% CI=0.83-0.93). In addition, analysis of variance showed that predictors of marital satisfaction included more dinners shared a week (comparing 0-2, 3-6, and 7 nights a week) and more dates a month (0, 1-3, >3). A paired t test showed perceived health and living with spouse to be significant. The one-item screening question was shown to have good correlation to the gold standard, as well as acceptable sensitivity and specificity for identifying current dissatisfaction with marriage in a primary care setting. Further research is needed to determine if screening in a primary care setting, correlated with early intervention, can help improve satisfaction and avoid divorce.
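Sensitivity and specificity can be converted into positive and negative likelihood ratios, which some of the other abstracts in this list report directly. The following is a small sketch using the standard definitions LR+ = sensitivity / (1 − specificity) and LR− = (1 − sensitivity) / specificity. The function name is illustrative, and the likelihood ratios are derived here rather than reported in the abstract.

```python
def likelihood_ratios(sensitivity: float, specificity: float) -> tuple[float, float]:
    """Return (LR+, LR-) from sensitivity and specificity given as proportions."""
    if not (0 <= sensitivity <= 1 and 0 < specificity < 1):
        raise ValueError("sensitivity must be in [0, 1] and specificity in (0, 1)")
    lr_positive = sensitivity / (1 - specificity)
    lr_negative = (1 - sensitivity) / specificity
    return lr_positive, lr_negative


# Values reported for the one-item screener: sensitivity 86%, specificity 86%.
lr_pos, lr_neg = likelihood_ratios(0.86, 0.86)
print(round(lr_pos, 2), round(lr_neg, 2))
```

Under these definitions a positive screen is roughly six times as likely in a dissatisfied respondent as in a satisfied one.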
The Effects of Aging and IQ on Item and Associative Memory
Ratcliff, Roger; Thapar, Anjali; McKoon, Gail
2011-01-01
The effects of aging and IQ on performance were examined in four memory tasks: item recognition, associative recognition, cued recall, and free recall. For item and associative recognition, accuracy and the response time distributions for correct and error responses were explained by Ratcliff’s (1978) diffusion model, at the level of individual participants. The values of the components of processing identified by the model for the recognition tasks, as well as accuracy for cued and free recall, were compared across levels of IQ ranging from 85 to 140 and age (college-age, 60-74 year olds, and 75-90 year olds). IQ had large effects on the quality of the evidence from memory on which decisions were based in the recognition tasks and accuracy in the recall tasks, except for the oldest participants for whom some of the measures were near floor values. Drift rates in the recognition tasks, accuracy in the recall tasks, and IQ all correlated strongly with each other. However, there was a small decline in drift rates for item recognition and a large decline for associative recognition and accuracy in cued recall (about 70 percent). In contrast, there were large age effects on boundary separation and nondecision time (which correlated across tasks), but little effect of IQ. The implications of these results for single- and dual-process models of item recognition are discussed and it is concluded that models that deal with both RTs and accuracy are subject to many more constraints than models that deal with only one of these measures. Overall, the results of the study show a complicated but interpretable pattern of interactions that present important targets for response time and memory models. PMID:21707207
A virtual shopping test for realistic assessment of cognitive function
2013-01-01
Background: Cognitive dysfunction caused by brain injury often prevents a patient from achieving a healthy and high quality of life. By now, each cognitive function is assessed precisely by neuropsychological tests. However, it is also important to provide an overall assessment of the patients’ ability in their everyday life. We have developed a Virtual Shopping Test (VST) using virtual reality technology. The objective of this study was to clarify 1) the significance of VST by comparing VST with other conventional tests, 2) the applicability of VST to brain-damaged patients, and 3) the performance of VST in relation to age differences. Methods: The participants included 10 patients with brain damage, 10 age-matched healthy subjects for controls, 10 old healthy subjects, and 10 young healthy subjects. VST and neuropsychological tests/questionnaires about attention, memory and executive function were conducted on the patients, while VST and the Mini-Mental State Examination (MMSE) were conducted on the controls and healthy subjects. Within the VST, the participants were asked to buy four items in the virtual shopping mall quickly in a rational way. The score for evaluation included the number of items bought correctly, the number of times to refer to hints, the number of movements between shops, and the total time spent to complete the shopping. Results: Some variables on VST correlated with the scores of conventional assessment about attention and everyday memory. The mean number of times referring to hints and the mean number of movements were significantly larger for the patients with brain damage, and the mean total time was significantly longer for the patients than for the controls. In addition, the mean total time was significantly longer for the old than for the young. Conclusions: The results suggest that VST is able to evaluate the ability of attention and everyday memory in patients with brain damage. The time of VST is increased by age. PMID:23777412
A virtual shopping test for realistic assessment of cognitive function.
Okahashi, Sayaka; Seki, Keiko; Nagano, Akinori; Luo, Zhiwei; Kojima, Maki; Futaki, Toshiko
2013-06-18
Cognitive dysfunction caused by brain injury often prevents a patient from achieving a healthy and high quality of life. By now, each cognitive function is assessed precisely by neuropsychological tests. However, it is also important to provide an overall assessment of the patients' ability in their everyday life. We have developed a Virtual Shopping Test (VST) using virtual reality technology. The objective of this study was to clarify 1) the significance of VST by comparing VST with other conventional tests, 2) the applicability of VST to brain-damaged patients, and 3) the performance of VST in relation to age differences. The participants included 10 patients with brain damage, 10 age-matched healthy subjects for controls, 10 old healthy subjects, and 10 young healthy subjects. VST and neuropsychological tests/questionnaires about attention, memory and executive function were conducted on the patients, while VST and the Mini-Mental State Examination (MMSE) were conducted on the controls and healthy subjects. Within the VST, the participants were asked to buy four items in the virtual shopping mall quickly in a rational way. The score for evaluation included the number of items bought correctly, the number of times to refer to hints, the number of movements between shops, and the total time spent to complete the shopping. Some variables on VST correlated with the scores of conventional assessment about attention and everyday memory. The mean number of times referring to hints and the mean number of movements were significantly larger for the patients with brain damage, and the mean total time was significantly longer for the patients than for the controls. In addition, the mean total time was significantly longer for the old than for the young. The results suggest that VST is able to evaluate the ability of attention and everyday memory in patients with brain damage. The time of VST is increased by age.
2017-01-01
Objectives: Few attempts have been made to develop a generic health-related quality of life (HRQoL) instrument and to examine its validity and reliability in Korea. We aimed to do this in our present study. Methods: After a literature review of existing generic HRQoL instruments, a focus group discussion, in-depth interviews, and expert consultations, we selected 30 tentative items for a new HRQoL measure. These items were evaluated by assessing their ceiling effects, difficulty, and redundancy in the first survey. To validate the HRQoL instrument that was developed, known-groups validity and convergent/discriminant validity were evaluated and its test-retest reliability was examined in the second survey. Results: Of the 30 items originally assessed for the HRQoL instrument, four were excluded due to high ceiling effects and six were removed due to redundancy. We ultimately developed a HRQoL instrument with a reduced number of 20 items, known as the Health-related Quality of Life Instrument with 20 items (HINT-20), incorporating physical, mental, social, and positive health dimensions. The results of the HINT-20 for known-groups validity were poorer in women, the elderly, and those with a low income. For convergent/discriminant validity, the correlation coefficients of items (except vitality) in the physical health dimension with the physical component summary of the Short Form 36 version 2 (SF-36v2) were generally higher than the correlations of those items with the mental component summary of the SF-36v2, and vice versa. Regarding test-retest reliability, the intraclass correlation coefficient of the total HINT-20 score was 0.813 (p<0.001). Conclusions: A novel generic HRQoL instrument, the HINT-20, was developed for the Korean general population and showed acceptable validity and reliability. PMID:28173686
Chin, Weng Yee; Choi, Edmond P H; Chan, Kit T Y; Wong, Carlos K H
2015-01-01
The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald's omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for "somatic", "depressed affect", "positive affect" and "interpersonal problems" were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value <0.01 and all effect size statistics >0.2). The CES-D was externally responsive, with the AUC>0.7. 
The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four-factor and bi-factor structure, the CES-D is supported for cross-cultural comparisons of depression in multi-center studies.
Knowledge about tooth avulsion and its management among dental assistants in Riyadh, Saudi Arabia
2014-01-01
Background: Studies evaluating dental assistants’ knowledge about tooth avulsion and its management are rare. The purpose of this study was to evaluate the level of knowledge about tooth avulsion and its management among dental assistants in Riyadh, Saudi Arabia and to assess its relationship with their educational background. Methods: A convenience sampling methodology was employed for sample selection. Over a period of four months starting in February, 2013, 691 pretested 17-item questionnaires were distributed. A total of 498 questionnaires were returned for an overall response rate of 72.1%. Six questions were related to knowledge about permanent tooth avulsion and one question was related to knowledge about primary tooth avulsion. Correct answers to these questions were assigned one point each, and based on this scoring system, an overall knowledge score was calculated. An analysis of covariance was used to test the association between the level of knowledge (total score) and the educational qualifications of the respondents (dental degree and others). A P-value of 0.05 was considered the threshold for statistical significance. Results: The majority of the respondents (n = 387; 77.7%) were non-Saudis (377 were from the Philippines), and 79.1% (n = 306) of the Filipinos had a dental degree. The question about recommendations for an avulsed tooth that is dirty elicited the highest number of correct responses (n = 444; 89.2%), whereas the question about the best storage media elicited the lowest number of correct responses (n = 192; 38.6%). The overall mean score for knowledge about tooth avulsion was 6.27 ± 1.74. The mean knowledge score among the respondents with a dental degree was 6.63 ± 1.37, whereas that among the respondents with other qualifications was 5.71 ± 2.08. Conclusions: The educational qualifications of the surveyed dental assistants were strongly correlated with the level of knowledge about tooth avulsion and its management. PMID:24885584
Knowledge about tooth avulsion and its management among dental assistants in Riyadh, Saudi Arabia.
Halawany, Hassan Suliman; AlJazairy, Yousra Hussain; Alhussainan, Nawaf Sulaiman; AlMaflehi, Nassr; Jacob, Vimal; Abraham, Nimmi Biju
2014-05-06
Studies evaluating dental assistants' knowledge about tooth avulsion and its management are rare. The purpose of this study was to evaluate the level of knowledge about tooth avulsion and its management among dental assistants in Riyadh, Saudi Arabia and to assess its relationship with their educational background. A convenience sampling methodology was employed for sample selection. Over a period of four months starting in February, 2013, 691 pretested 17-item questionnaires were distributed. A total of 498 questionnaires were returned for an overall response rate of 72.1%. Six questions were related to knowledge about permanent tooth avulsion and one question was related to knowledge about primary tooth avulsion. Correct answers to these questions were assigned one point each, and based on this scoring system, an overall knowledge score was calculated. An analysis of covariance was used to test the association between the level of knowledge (total score) and the educational qualifications of the respondents (dental degree and others). A P-value of 0.05 was considered the threshold for statistical significance. The majority of the respondents (n = 387; 77.7%) were non-Saudis (377 were from the Philippines), and 79.1% (n = 306) of the Filipinos had a dental degree. The question about recommendations for an avulsed tooth that is dirty elicited the highest number of correct responses (n = 444; 89.2%), whereas the question about the best storage media elicited the lowest number of correct responses (n = 192; 38.6%). The overall mean score for knowledge about tooth avulsion was 6.27 ± 1.74. The mean knowledge score among the respondents with a dental degree was 6.63 ± 1.37, whereas that among the respondents with other qualifications was 5.71 ± 2.08. The educational qualifications of the surveyed dental assistants were strongly correlated with the level of knowledge about tooth avulsion and its management.
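The percentages in this abstract can be re-derived from the reported counts. The sketch below does only that arithmetic, with a hypothetical helper `pct`. It also shows that the 79.1% figure corresponds to 306 out of the 387 non-Saudi respondents, whereas 306 out of the 377 Filipino respondents would give 81.2%. The abstract does not say which denominator was intended.

```python
def pct(part: int, whole: int) -> float:
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# (numerator, denominator, percentage stated in the abstract)
reported = {
    "response rate": (498, 691, 72.1),
    "non-Saudi respondents": (387, 498, 77.7),
    "dirty-tooth question answered correctly": (444, 498, 89.2),
    "storage-media question answered correctly": (192, 498, 38.6),
}

for label, (part, whole, stated) in reported.items():
    print(f"{label}: computed {pct(part, whole)}%, abstract {stated}%")

# Two candidate denominators for the share of respondents holding a dental degree.
print("306 of 387 non-Saudis:", pct(306, 387))
print("306 of 377 Filipinos:", pct(306, 377))
```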
Assessing the role of memory in preschoolers' performance on episodic foresight tasks.
Atance, Cristina M; Sommerville, Jessica A
2014-01-01
A total of 48 preschoolers (ages 3, 4, and 5) received four tasks modelled after prior work designed to assess the development of "episodic foresight". For each task, children encountered a problem in one room and, after a brief delay, were given the opportunity in a second room to select an item to solve the problem. Importantly, after selecting an item, children were queried about their memory for the problem. Age-related changes were found both in children's ability to select the correct item and their ability to remember the problem. However, when we controlled for children's memory for the problem, there were no longer significant age-related changes on the item choice measure. These findings suggest that age-related changes in children's performance on these tasks are driven by improvements in children's memory versus improvements in children's future-oriented thinking or "foresight" per se. Our results have important implications for how best to structure tasks to measure children's episodic foresight, and also for the relative role of memory in this task and in episodic foresight more broadly.
Consistency of near-death experience accounts over two decades: are reports embellished over time?
Greyson, Bruce
2007-06-01
"Near-death experiences," commonly reported after clinical death and resuscitation, may require intervention and, if reliable, may elucidate altered brain functioning under extreme stress. It has been speculated that accounts of near-death experiences are exaggerated over the years. The objective of this study was to test the reliability over two decades of accounts of near-death experiences. Seventy-two patients with near-death experience who had completed the NDE scale in the 1980s (63% of the original cohort still alive) completed the scale a second time, without reference to the original scale administration. The primary outcome was differences in NDE scale scores on the two administrations. The secondary outcome was the statistical association between differences in scores and years elapsed between the two administrations. Mean scores did not change significantly on the total NDE scale, its 4 factors, or its 16 items. Correlation coefficients between scores on the two administrations were significant at P<0.001 for the total NDE scale, for its 4 factors, and for its 16 items. Correlation coefficients between score changes and time elapsed between the two administrations were not significant for the total NDE scale, for its 4 factors, or for its 16 items. Contrary to expectation, accounts of near-death experiences, and particularly reports of their positive affect, were not embellished over a period of almost two decades. These data support the reliability of near-death experience accounts.
Yang, Chengqing; Zhang, Tianhong; Li, Zezhi; Heeramun-Aubeeluck, Anisha; Liu, Na; Huang, Nan; Zhang, Jie; He, Leiying; Li, Hui; Tang, Yingying; Chen, Fazhan; Liu, Fei; Wang, Jijun; Lu, Zheng
2015-10-08
Although many studies have examined executive functions and facial emotion recognition in people with schizophrenia, few of them focused on the correlation between them. Furthermore, their relationship in the siblings of patients also remains unclear. The aim of the present study is to examine the correlation between executive functions and facial emotion recognition in patients with first-episode schizophrenia and their siblings. Thirty patients with first-episode schizophrenia, their twenty-six siblings, and thirty healthy controls were enrolled. They completed facial emotion recognition tasks using the Ekman Standard Faces Database, and executive functioning was measured by Wisconsin Card Sorting Test (WCST). Hierarchical regression analysis was applied to assess the correlation between executive functions and facial emotion recognition. Our study found that in siblings, the accuracy in recognizing low degree 'disgust' emotion was negatively correlated with the total correct rate in WCST (r = -0.614, p = 0.023), but was positively correlated with the total error in WCST (r = 0.623, p = 0.020); the accuracy in recognizing 'neutral' emotion was positively correlated with the total error rate in WCST (r = 0.683, p = 0.014) while negatively correlated with the total correct rate in WCST (r = -0.677, p = 0.017). People with schizophrenia showed an impairment in facial emotion recognition when identifying moderate 'happy' facial emotion, the accuracy of which was significantly correlated with the number of completed categories of WCST (R² = 0.432, P < .05). There were no correlations between executive functions and facial emotion recognition in the healthy control group. Our study demonstrated that facial emotion recognition impairment correlated with executive function impairment in people with schizophrenia and their unaffected siblings but not in healthy controls.
Development and validation of the Single Item Narcissism Scale (SINS).
Konrath, Sara; Meier, Brian P; Bushman, Brad J
2014-01-01
The narcissistic personality is characterized by grandiosity, entitlement, and low empathy. This paper describes the development and validation of the Single Item Narcissism Scale (SINS). Although the use of longer instruments is superior in most circumstances, we recommend the SINS in some circumstances (e.g. under serious time constraints, online studies). In 11 independent studies (total N = 2,250), we demonstrate the SINS' psychometric properties. The SINS is significantly correlated with longer narcissism scales, but uncorrelated with self-esteem. It also has high test-retest reliability. We validate the SINS in a variety of samples (e.g., undergraduates, nationally representative adults), intrapersonal correlates (e.g., positive affect, depression), and interpersonal correlates (e.g., aggression, relationship quality, prosocial behavior). The SINS taps into the more fragile and less desirable components of narcissism. The SINS can be a useful tool for researchers, especially when it is important to measure narcissism with constraints preventing the use of longer measures.
Neural correlates of differential retrieval orientation: Sustained and item-related components.
Woodruff, C Chad; Uncapher, Melina R; Rugg, Michael D
2006-01-01
Retrieval orientation refers to a cognitive state that biases processing of retrieval cues in service of a specific goal. The present study used a mixed fMRI design to investigate whether adoption of different retrieval orientations - as indexed by differences in the activity elicited by retrieval cues corresponding to unstudied items - is associated with differences in the state-related activity sustained across a block of test trials sharing a common retrieval goal. Subjects studied mixed lists comprising visually presented words and pictures. They then undertook a series of short test blocks in which all test items were visually presented words. The blocks varied according to whether the test items were used to cue retrieval of studied words or studied pictures. In several regions, neural activity elicited by correctly classified new items differed according to whether words or pictures were the targeted material. The loci of these effects suggest that one factor driving differential cue processing is modulation of the degree of overlap between cue and targeted memory representations. In addition to these item-related effects, neural activity sustained throughout the test blocks also differed according to the nature of the targeted material. These findings indicate that the adoption of different retrieval orientations is associated with distinct neural states. The loci of these sustained effects were distinct from those where new item activity varied, suggesting that the effects may play a role in biasing retrieval cue processing in favor of the current retrieval goal.
Adaptable Learning Assistant for Item Bank Management
ERIC Educational Resources Information Center
Nuntiyagul, Atorn; Naruedomkul, Kanlaya; Cercone, Nick; Wongsawang, Damras
2008-01-01
We present PKIP, an adaptable learning assistant tool for managing question items in item banks. PKIP is not only able to automatically assist educational users to categorize the question items into predefined categories by their contents but also to correctly retrieve the items by specifying the category and/or the difficulty level. PKIP adapts…
An analysis of high school students' perceptions and academic performance in laboratory experiences
NASA Astrophysics Data System (ADS)
Mirchin, Robert Douglas
This research study is an investigation of student-laboratory (i.e., lab) learning based on students' perceptions of experiences using questionnaire data and evidence of their science-laboratory performance based on paper-and-pencil assessments using Maryland-mandated criteria, Montgomery County Public Schools (MCPS) criteria, and published laboratory questions. A 20-item questionnaire consisting of 18 Likert-scale items and 2 open-ended items that addressed what students liked most and least about lab was administered to students before labs were observed. A pre-test and post-test assessing laboratory achievement were administered before and after the laboratory experiences. The three labs observed were: soda distillation, stoichiometry, and separation of a mixture. Five significant results or correlations were found. For soda distillation, there were two positive correlations. Student preference for analyzing data was positively correlated with achievement on the data analysis dimension of the lab rubric. A student preference for using numbers and graphs to analyze data was positively correlated with achievement on the analysis dimension of the lab rubric. For the separation of a mixture lab, the following correlations were significant. Student preference for doing chemistry labs where numbers and graphs were used to analyze data had a positive correlation with writing a correctly worded hypothesis. Student responses that lab experiences help them learn science positively correlated with achievement on the data dimension of the lab rubric. The only negative correlation found related to the first result where students' preference for computers was inversely correlated to their performance on analyzing data on their lab report. Other findings included the following: students like actual experimental work most and the write-up and analysis of a lab the least.
It is recommended that lab science instruction be inquiry-based, hands-on, and that students be tested for lab content acquisition. The final conclusion of the study is that students expressed a preference for working in groups and working with materials and equipment as opposed to individual, non-group work and analyzing data.
Recognition memory reveals just how CONTRASTIVE contrastive accenting really is
Fraundorf, Scott H.; Watson, Duane G.; Benjamin, Aaron S.
2010-01-01
The effects of pitch accenting on memory were investigated in three experiments. Participants listened to short recorded discourses that contained contrast sets with two items (e.g. British scientists and French scientists); a continuation specified one item from the set. Pitch accenting on the critical word in the continuation was manipulated between non-contrastive (H* in the ToBI system) and contrastive (L+H*). On subsequent recognition memory tests, the L+H* accent increased hits to correct statements and correct rejections of the contrast item (Experiments 1–3), but did not impair memory for other parts of the discourse (Experiment 2). L+H* also did not facilitate correct rejections of lures not in the contrast set (Experiment 3), indicating that contrastive accents do not simply strengthen the representation of the target item. These results suggest comprehenders use pitch accenting to encode and update information about multiple elements in a contrast set. PMID:20835405
Validation of the Intelligibility in Context Scale for Jamaican Creole-Speaking Preschoolers.
Washington, Karla N; McDonald, Megan M; McLeod, Sharynne; Crowe, Kathryn; Devonish, Hubert
2017-08-15
To describe validation of the Intelligibility in Context Scale (ICS; McLeod, Harrison, & McCormack, 2012a) and ICS-Jamaican Creole (ICS-JC; McLeod, Harrison, & McCormack, 2012b) in a sample of typically developing 3- to 6-year-old Jamaicans. One hundred and forty-five preschooler-parent dyads participated in the study. Parents completed the 7-item ICS (n = 145) and ICS-JC (n = 98) to rate children's speech intelligibility (5-point scale) across communication partners (parents, immediate family, extended family, friends, acquaintances, strangers). Preschoolers completed the Diagnostic Evaluation of Articulation and Phonology (DEAP; Dodd, Hua, Crosbie, Holm, & Ozanne, 2006) in English and Jamaican Creole to establish speech-sound competency. For this sample, we examined validity and reliability (interrater, test-retest, internal consistency) evidence using measures of speech-sound production: (a) percentage of consonants correct, (b) percentage of vowels correct, and (c) percentage of phonemes correct. ICS and ICS-JC ratings showed preschoolers were always (5) to usually (4) understood across communication partners (ICS, M = 4.43; ICS-JC, M = 4.50). Both tools demonstrated excellent internal consistency (α = .91), high interrater, and test-retest reliability. Significant correlations between the two tools and between each measure and language-specific percentage of consonants correct, percentage of vowels correct, and percentage of phonemes correct provided criterion-validity evidence. A positive correlation between the ICS and age further strengthened validity evidence for that measure. Both tools show promising evidence of reliability and validity in describing functional speech intelligibility for this group of typically developing Jamaican preschoolers.
Measurement properties of the Inventory of Cognitive Bias in Medicine (ICBM)
Sladek, Ruth M; Phillips, Paddy A; Bond, Malcolm J
2008-01-01
Background: Understanding how doctors think may inform both undergraduate and postgraduate medical education. Developing such an understanding requires valid and reliable measurement tools. We examined the measurement properties of the Inventory of Cognitive Bias in Medicine (ICBM), designed to tap this domain with specific reference to medicine, but with previously questionable measurement properties. Methods: First year postgraduate entry medical students at Flinders University, and trainees (postgraduate doctors in any specialty) and consultants (N = 348) based at two teaching hospitals in Adelaide, Australia, completed the ICBM and a questionnaire measuring thinking styles (Rational Experiential Inventory). Results: Questions with the lowest item-total correlation were deleted from the original 22 item ICBM, although the resultant 17 item scale only marginally improved internal consistency (Cronbach's α = 0.61 compared with 0.57). A factor analysis identified two scales, both achieving only α = 0.58. Construct validity was assessed by correlating Rational Experiential Inventory scores with the ICBM, with some positive correlations noted for students only, suggesting that those who are naïve to the knowledge base required to "successfully" respond to the ICBM may profit by a thinking style in tune with logical reasoning. Conclusion: The ICBM failed to demonstrate adequate content validity, internal consistency and construct validity. It is unlikely that improvements can be achieved without considered attention to both the audience for which it is designed and its item content. The latter may need to involve both removal of some items deemed to measure multiple biases and the addition of new items in the attempt to survey the range of biases that may compromise medical decision making. PMID:18507864
Reliability and Validity of the Farsi Version of the Somatosensory Amplification Scale
Aghayousefi, Alireza; Oraki, Mohammad; Mohammadi, Narges; Farzad, Valiyollah; Daghaghzadeh, Hammed
2015-01-01
Background: The somatosensory amplification scale (SSAS) is a 10-item self-report instrument designed to assess a tendency to experience normal somatic and visceral sensations as intense, noxious, and disturbing. Objectives: The present study investigated the reliability and validity of the SSAS, developed by Barsky et al. (1988), in the Iranian population. Materials and Methods: The study was carried out on 240 patients with functional gastrointestinal disorders and 30 healthy persons selected by convenience sampling from 2013 to 2014. The patients completed the SSAS, the somatization subscale of the symptom checklist-90-revised (SCL-90-R som), and the modified somatic perception questionnaire (MSPQ), whereas the healthy persons completed just the SSAS. Results: Exploratory factor analysis indicated that the one-factor solution, accounting for 29.42% of the variance, explained that the SSAS items were represented by one global dimension. The SSAS had acceptable internal consistency (α = 0.78) and good test-retest reliability (r = 0.80). The item-to-scale correlations varied from 0.17 to 0.55. Item 2 had the lowest item-total score correlation (r = 0.17), and the α coefficient for the SSAS increased when this item was deleted. The convergent validity of the SSAS with somatization was shown with a significant correlation between the SSAS, SCL-90-R som (r = 0.36), and MSPQ scores (r = 0.52). Discriminant validity analysis showed no significant difference in the SSAS between the patient and control groups (P > 0.05) and non-specificity of the SSAS for patients. Conclusions: In sum, the SSAS has acceptable reliability and validity for the Iranian population and the scale measures the same construct as the original scale, namely somatosensory amplification. PMID:26576173
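The item statistics named in this abstract (internal consistency α, item-total correlation, and α if an item is deleted) can be illustrated with a minimal Python sketch. The response matrix below is hypothetical and is not the SSAS data; the function names are illustrative, and the item-total correlation is computed in its "corrected" form (item versus the sum of the remaining items), which is an assumption about the variant used.

```python
from statistics import mean, pvariance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    item_vars = [pvariance([r[j] for r in rows]) for j in range(k)]
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def item_analysis(rows):
    """Per item: (corrected item-total correlation, alpha if item deleted)."""
    k = len(rows[0])
    results = []
    for j in range(k):
        item = [r[j] for r in rows]
        rest = [sum(r) - r[j] for r in rows]          # total score without item j
        reduced = [r[:j] + r[j + 1:] for r in rows]   # response matrix without item j
        results.append((pearson(item, rest), cronbach_alpha(reduced)))
    return results


# Hypothetical data: 5 respondents x 4 items; item 4 is deliberately inconsistent.
rows = [
    [1, 1, 1, 3],
    [2, 2, 2, 1],
    [3, 3, 3, 2],
    [4, 4, 4, 3],
    [5, 5, 5, 1],
]

alpha = cronbach_alpha(rows)
stats = item_analysis(rows)
print(f"alpha = {alpha:.3f}")
for j, (r_it, a_del) in enumerate(stats, start=1):
    print(f"item {j}: corrected r = {r_it:.3f}, alpha if deleted = {a_del:.3f}")
```

In this toy matrix the inconsistent item has a low (here negative) corrected item-total correlation, and removing it raises α, which is the pattern the abstract reports for Item 2.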
Positive mental health among health professionals working at a psychiatric hospital
Picco, Louisa; Yuan, Qi; Vaingankar, Janhavi Ajit; Chang, Sherilyn; Abdin, Edimansyah; Chua, Hong Choon; Chong, Siow Ann; Subramaniam, Mythily
2017-01-01
Background: Positive mental health (PMH) is a combination of emotional, psychological and social well-being that is necessary for an individual to be mentally healthy. The current study aims to examine the socio-demographic differences of PMH among mental health professionals and to explore the association between job satisfaction and total PMH. Methods: Doctors, nurses and allied health staff (n = 462) completed the online survey which included the multidimensional 47-item PMH instrument as well as a single item job satisfaction question. Associations of PMH with job satisfaction were investigated via linear regression models. Results: Significant differences in PMH total and domain specific scores were observed across socio-demographic characteristics. Age and ethnicity were significantly correlated with PMH total scores as well as various domain scores, while gender, marital and residency status and the staff’s position were only significantly correlated with domain specific scores. Job satisfaction was also found to be significantly associated with total PMH. Conclusion: The workplace is a key environment that affects the mental health and well-being of working adults. In order to promote and foster PMH, workplaces need to consider the importance of psychosocial well-being and the wellness of staff whilst providing an environment that supports and maintains overall health and work efficiency. PMID:28591203
Factors Influencing Early Detection of Oral Cancer by Primary Health-Care Professionals.
Hassona, Y; Scully, C; Shahin, A; Maayta, W; Sawair, F
2016-06-01
The purposes of this study are to determine early detection practices performed by primary healthcare professionals, to compare medical and dental sub-groups, and to identify factors that influence the ability of medical and dental practitioners to recognize precancerous changes and clinical signs of oral cancer. A 28-item survey instrument was used to interview a total of 330 Jordanian primary health-care professionals (165 dental and 165 medical). An oral cancer knowledge scale (0 to 31) was generated from correct responses on oral cancer general knowledge. An early detection practice scale (0 to 24) was generated from the reported usage and frequency of procedures in oral cancer examination. Also, a diagnostic ability scale (0 to 100) was generated from correct selections of suspicious oral lesions. Only 17.8 % of the participants reported that they routinely performed oral cancer screening in practices. Their oral cancer knowledge scores ranged from 3 to 31 with a mean of 15.6. The early detection practice scores ranged from 2 to 21 with a mean of 11.6. A significant positive correlation was found between knowledge scores and early detection practice scores (r = 0.22; p < 0.001). The diagnostic ability scores ranged from 11.5 to 96 with a mean of 43.6. The diagnostic ability score was significantly correlated with knowledge scores (r = 0.39; p < 0.001), but not with early detection practice scores (r = 0.01; p = 0.92). Few significant differences were found between medical and dental primary care professionals. Continuous education courses on early diagnosis of oral cancer and oral mucosal lesions are needed for primary health-care professionals.
van de Graaf, Elizabeth S; Borsboom, Gerard J J M; van der Sterre, Geertje W; Felius, Joost; Simonsz, Huibert J; Kelderman, Henk
2017-09-01
The Adult Strabismus Quality of Life Questionnaire (AS-20) and the Amblyopia & Strabismus Questionnaire (A&SQ) both measure health-related quality of life in strabismus patients. We evaluated to what extent these instruments cover similar domains by identifying the underlying quality-of-life factors of the combined questionnaires. Participants were adults from a historic cohort with available orthoptic childhood data documenting strabismus and/or amblyopia. They had previously completed the A&SQ and were now asked to complete the AS-20. Factor analysis was performed on the correlation matrix of the combined AS-20 and A&SQ data to identify common underlying factors. The identified factors were correlated with the clinical variables of angle of strabismus, degree of binocular vision, and visual acuity of the worse eye. One hundred ten patients completed both questionnaires (mean age, 44 years; range, 38-51 years). Six factors were found that together explained 78% of the total variance. The factor structure was dominated by the first four factors. One factor contained psychosocial and social-contact items, and another factor depth-perception items from both questionnaires. A third factor contained seven items, only from the AS-20, on eye strain, stress, and difficulties with reading and with concentrating. A fourth factor contained seven items, only from the A&SQ, on fear of losing the better eye and visual disorientation, specific for amblyopia. Current visual acuity of the worse eye correlated with depth-perception items and vision-related items, whereas current binocular vision correlated with psychosocial and social-contact items, in 93 patients. Factor analysis suggests that the AS-20 and A&SQ measure a similar psychosocial quality-of-life domain. However, functional problems like avoidance of reading, difficulty in concentrating, eye stress, reading problems, inability to enjoy hobbies, and need for frequent breaks when reading are represented only in the AS-20.
During the development of the A&SQ, asthenopia items were considered insufficiently specific for strabismus and were excluded a priori. The patients who generated the items for the AS-20 had, in majority, adulthood-onset strabismus and diplopia and were, hence, more likely to develop such complaints than our adult patients with childhood-onset strabismus and/or amblyopia.
Comparison of scales for evaluating premenstrual symptoms in women using oral contraceptives.
Coffee, Andrea L; Kuehl, Thomas J; Sulak, Patricia J
2008-05-01
To compare two scales used in research to evaluate daily premenstrual mood symptoms during use of a monophasic oral contraceptive. Subanalysis of data from a prospective study. University-affiliated medical center. Subjects: One hundred two reproductive-aged (18-48 yrs) women taking a monophasic oral contraceptive containing ethinyl estradiol and drospirenone in the standard 21-7 fashion (21 days of hormones followed by 7 days of placebo), and who had self-identified premenstrual symptoms of headache, mood changes, or pelvic pain. Subjects completed a single-item questionnaire, the Scott & White Daily Diary of Symptoms, and a multiple-item questionnaire, the Penn State Daily Symptom Report (DSR), to assess their premenstrual symptoms. The Scott & White diary used a visual analog scale of 0-10 to assess pelvic pain, headache, and mood (a composite of anxiety, depression, and irritability). The Penn State DSR contained 17 items: 10 behavioral and seven physical components, each rated on a scale of 0-4, with one item that specifically rated mood swings. Scores from the two scales were compared by using Spearman correlation coefficients, the Kendall W for concordance, and linear regression of ranked sums for study cycles. The Scott & White mood score significantly correlated with the total of the 17 items on the Penn State DSR, as well as the 10 behavioral items, the seven physical items, and the single mood-swing item (p<0.0001); specific coefficients of concordance were 0.44, 0.23, 0.10, and 0.28, respectively, and R² values were 0.39, 0.39, 0.30, and 0.34, respectively. The daily Scott & White mood score was positively correlated with all 17 elements of the Penn State DSR (0.25-0.57). The greatest correlation was seen with the mood-swing element. Both instruments demonstrated the same patterns during the 21-7 oral contraceptive cycle, with symptoms increasing immediately before and peaking during the 7-day hormone-free interval.
A single-item daily mood score using a rating scale of 0-10 was concordant with a relatively complex 17-element symptom index and demonstrated the same pattern of change during cycles of oral contraception. The simple scoring system offers an advantage, especially in clinical studies of long duration.
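The rank-based comparison of the two scales can be sketched as follows. This is a minimal illustration of a Spearman correlation (Pearson correlation of average ranks, so ties are handled), not the authors' analysis; the daily scores are hypothetical and the function names are illustrative.

```python
def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rho: Pearson correlation computed on the average ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    sxy = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sxx = sum((a - mx) ** 2 for a in rx)
    syy = sum((b - my) ** 2 for b in ry)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical week of daily scores for one subject:
# single-item mood score (0-10) and 17-item summed score (0-68).
mood_vas = [2, 3, 3, 5, 8, 9, 6]
dsr_total = [10, 14, 12, 20, 35, 33, 25]

rho = spearman(mood_vas, dsr_total)
print(f"Spearman rho = {rho:.3f}")
```

Because only ranks enter the calculation, the different score ranges of the two instruments (0-10 versus a 17-item sum) do not affect the coefficient.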
Artifact Correction in Temperature-Dependent Attenuated Total Reflection Infrared (ATR-IR) Spectra.
Sobieski, Brian; Chase, Bruce; Noda, Isao; Rabolt, John
2017-08-01
A spectral processing method was developed and tested for analyzing temperature-dependent attenuated total reflection infrared (ATR-IR) spectra of aliphatic polyesters. Spectra of a bio-based, biodegradable polymer, 3.9 mol% 3HHx poly[(R)-3-hydroxybutyrate- co-(R)-3-hydroxyhexanoate] (PHBHx), were analyzed and corrected prior to analysis using two-dimensional correlation spectroscopy (2D-COS). Removal of the temperature variation of diamond absorbance, correction of the baseline, ATR correction, and appropriate normalization were key to generating more reliable data. Both the processing steps and order were important. A comparison to differential scanning calorimetry (DSC) analysis indicated that the normalization method should be chosen with caution to avoid unintentional trends and distortions of the crystalline sensitive bands.
The development and validation of a test of science critical thinking for fifth graders.
Mapeala, Ruslan; Siew, Nyet Moi
2015-01-01
The paper described the development and validation of the Test of Science Critical Thinking (TSCT) to measure the three critical thinking skill constructs: comparing and contrasting, sequencing, and identifying cause and effect. The initial TSCT consisted of 55 multiple choice test items, each of which required participants to select a correct response and a correct choice of critical thinking used for their response. Data were obtained from a purposive sampling of 30 fifth graders in a pilot study carried out in a primary school in Sabah, Malaysia. Students underwent the sessions of teaching and learning activities for 9 weeks using the Thinking Maps-aided Problem-Based Learning Module before they answered the TSCT test. Analyses were conducted to check on difficulty index (p) and discrimination index (d), internal consistency reliability, content validity, and face validity. Analysis of the test-retest reliability data was conducted separately for a group of fifth graders with similar ability. Findings of the pilot study showed that out of the initial 55 administered items, only 30 items, with a relatively good difficulty index (p) ranging from 0.40 to 0.60 and a good discrimination index (d) ranging from 0.20 to 1.00, were selected. The Kuder-Richardson reliability value was found to be appropriate and relatively high with 0.70, 0.73 and 0.92 for identifying cause and effect, sequencing, and comparing and contrasting respectively. The content validity index obtained from three expert judgments equalled or exceeded 0.95. In addition, test-retest reliability showed good, statistically significant correlations ([Formula: see text]). From the above results, the selected 30-item TSCT was found to have sufficient reliability and validity and would therefore represent a useful tool for measuring critical thinking ability among fifth graders in primary science.
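The item-selection rule described above (difficulty index p, discrimination index d, and Kuder-Richardson reliability) can be sketched in Python. The abstract does not say how d was computed or which Kuder-Richardson formula was used; this sketch assumes the common upper-group-minus-lower-group definition of d and KR-20, and the response matrix and `group_size` are hypothetical.

```python
from statistics import pvariance


def difficulty(rows):
    """Difficulty index p per item: proportion of examinees answering correctly."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]


def discrimination(rows, group_size):
    """Discrimination index d per item: p(upper group) - p(lower group).

    Groups are the top and bottom `group_size` examinees by total score
    (an assumed definition; the abstract does not specify one).
    """
    ranked = sorted(rows, key=sum, reverse=True)
    upper, lower = ranked[:group_size], ranked[-group_size:]
    k = len(rows[0])
    return [
        (sum(r[j] for r in upper) - sum(r[j] for r in lower)) / group_size
        for j in range(k)
    ]


def kr20(rows):
    """Kuder-Richardson formula 20 for dichotomous (0/1) items."""
    k = len(rows[0])
    p = difficulty(rows)
    total_var = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(pi * (1 - pi) for pi in p) / total_var)


# Hypothetical 0/1 responses: 6 examinees x 4 items.
rows = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

p = difficulty(rows)
d = discrimination(rows, group_size=2)
# Keep items inside the ranges quoted in the abstract.
selected = [
    j for j in range(len(p))
    if 0.40 <= p[j] <= 0.60 and 0.20 <= d[j] <= 1.00
]
reliability = kr20(rows)
print("p =", p)
print("d =", d)
print("selected item indices =", selected)
print(f"KR-20 = {reliability:.3f}")
```

In the toy data the first item is too easy and the last too hard, so only the two middle items pass both thresholds.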
Paz, Sylvia H; Spritzer, Karen L; Morales, Leo S; Hays, Ron D
2013-09-01
To evaluate the equivalence of the PROMIS® physical functioning item bank by language of administration (English versus Spanish). The PROMIS® wave 1 English-language physical functioning bank consists of 124 items, and 114 of these were translated into Spanish. Item frequencies, means and standard deviations, item-scale correlations, and internal consistency reliability were calculated. The IRT assumption of unidimensionality was evaluated by fitting a single-factor confirmatory factor analytic model. IRT threshold and discrimination parameters were estimated using Samejima's Graded Response Model. DIF by language of administration was evaluated. Item means ranged from 2.53 (SD = 1.36) to 4.62 (SD = 0.82). Coefficient alpha was 0.99, and item-rest correlations ranged from 0.41 to 0.89. A one-factor model fit the data well (CFI = 0.971, TLI = 0.970, and RMSEA = 0.052). The slope parameters ranged from 0.45 ("Are you able to run 10 miles?") to 4.50 ("Are you able to put on a shirt or blouse?"). The threshold parameters ranged from -1.92 ("How much do physical health problems now limit your usual physical activities (such as walking or climbing stairs)?") to 6.06 ("Are you able to run 10 miles?"). Fifty of the 114 items were flagged for DIF based on an R² of 0.02 or above criterion. The expected total score was higher for Spanish- than English-language respondents. English- and Spanish-speaking subjects with the same level of underlying physical function responded differently to 50 of 114 items. This study has important implications in the study of physical functioning among diverse populations.
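To make the slope and threshold parameters concrete, here is a minimal Python sketch of category probabilities under Samejima's Graded Response Model in its usual logistic form. The two slopes (0.45 and 4.50) are the extremes quoted in the abstract; the threshold values and the five-category layout are hypothetical, chosen only for illustration, and the function names are not from any particular IRT package.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    theta: latent trait level; a: slope (discrimination);
    thresholds: increasing list b_1..b_m for an item with m + 1 categories.
    P(X >= k) = 1 / (1 + exp(-a * (theta - b_k))), and
    P(X = k) = P(X >= k) - P(X >= k + 1).
    """
    cumulative = (
        [1.0]
        + [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
        + [0.0]
    )
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


def expected_score(theta, a, thresholds):
    """Expected category score (categories scored 0..m)."""
    probs = grm_category_probs(theta, a, thresholds)
    return sum(k * pk for k, pk in enumerate(probs))


# Hypothetical thresholds for a five-category item.
b = [-1.5, -0.5, 0.5, 1.5]

# How much the expected item score changes between theta = -1 and theta = +1
# for the lowest and highest slopes reported in the abstract.
gain_low_slope = expected_score(1.0, 0.45, b) - expected_score(-1.0, 0.45, b)
gain_high_slope = expected_score(1.0, 4.50, b) - expected_score(-1.0, 4.50, b)
print(f"slope 0.45: expected-score change = {gain_low_slope:.3f}")
print(f"slope 4.50: expected-score change = {gain_high_slope:.3f}")
```

A larger slope makes the item respond more sharply to differences in the latent trait, which is why the high-slope item separates the two trait levels much more than the low-slope item does.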
Kim, Miyong; Han, Hae-Ra; Phillips, Linda
2003-01-01
Metric equivalence is a quantitative way to assess cross-cultural equivalences of translated instruments by examining the patterns of psychometric properties based on cross-cultural data derived from both versions of the instrument. Metric equivalence checks at item and instrument levels can be used as a valuable tool to refine cross-cultural instruments. Korean and English versions of the Center for Epidemiological Studies-Depression Scale (CES-D) were administered to 154 Korean Americans and 151 Anglo Americans to illustrate approaches to assessing their metric equivalence. Inter-item and item-total correlations, Cronbach's alpha coefficients, and factor analysis were used for metric equivalence checks. The alpha coefficient was 0.85 for the Korean American sample and 0.92 for the Anglo American sample. Although all items of the CES-D surpassed the desirable minimum of 0.30 in the Anglo American sample, four items did not meet the standard in the Korean American sample. Differences in average inter-item correlations were also noted between the two groups (0.25 for Korean Americans and 0.37 for Anglo Americans). Factor analysis identified two factors for both groups, and factor loadings showed similar patterns and congruence coefficients. Results of the item analysis procedures suggest the possibility of bias in certain items that may influence the sensitivity of the Korean version of the CES-D. These item biases also provide a possible explanation for the alpha differences. Although factor loadings showed similar patterns for the Korean and English versions of the CES-D, factorial similarity alone is not sufficient for testing the universality of the structure underlying an instrument.
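The link between average inter-item correlation and alpha noted in this abstract can be checked with the standardized-alpha formula, α = k·r̄ / (1 + (k − 1)·r̄). The sketch below is illustrative only: it assumes the standard 20-item CES-D (the abstract does not state the item count), and standardized alpha is an approximation to the raw alpha that a study would normally report.

```python
def standardized_alpha(k, mean_r):
    """Standardized Cronbach's alpha from item count k and mean inter-item correlation."""
    return k * mean_r / (1 + (k - 1) * mean_r)


K_ITEMS = 20  # assumed CES-D length; not stated in the abstract

alpha_korean = standardized_alpha(K_ITEMS, 0.25)
alpha_anglo = standardized_alpha(K_ITEMS, 0.37)
print(f"mean r = 0.25 -> standardized alpha = {alpha_korean:.3f}")
print(f"mean r = 0.37 -> standardized alpha = {alpha_anglo:.3f}")
```

Under these assumptions the two average inter-item correlations give values of roughly 0.87 and 0.92, close to the reported alphas of 0.85 and 0.92, so the gap in alpha is about what the gap in inter-item correlation would predict.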
Sachs, J; Gao, L
2000-09-01
The learning process questionnaire (LPQ) has been the source of intensive cross-cultural study. However, an item-level factor analysis of all the LPQ items simultaneously has never been reported. Rather, items within each subscale have been factor analysed to establish subscale unidimensionality and justify the use of composite subscale scores. It was of major interest to see if the six logically constructed item groups of the LPQ would be supported by empirical evidence. Additionally, it was of interest to compare the consistency of the reliability and correlational structure of the LPQ subscales in our study with those of previous cross-cultural studies. Confirmatory factor analysis was used to fit the six-factor item level model and to fit five representative subscale level factor models. A total of 1070 students between the ages of 15 and 18 years was drawn from a representative selection of 29 classes from within 15 secondary schools in Guangzhou, China. Males and females were almost equally represented. The six-factor item level model of the LPQ seemed to fit reasonably well, thus supporting the six dimensional structure of the LPQ and justifying the use of composite subscale scores for each LPQ dimension. However, the reliability of many of these subscales was low. Furthermore, only two subscale-level factor models showed marginally acceptable fit. Substantive considerations supported an oblique three-factor model. Because the LPQ subscales often show low internal consistency reliability, experimental and correlational studies that have used these subscales as dependent measures have been disappointing. It is suggested that some LPQ items should be revised and other items added to improve the inventory's overall psychometric properties.
Can health care providers recognise a fibromyalgia personality?
Da Silva, José A P; Jacobs, Johannes W G; Branco, Jaime C; Canaipa, Rita; Gaspar, M Filomena; Griep, Ed N; van Helmond, Toon; Oliveira, Paula J; Zijlstra, Theo J; Geenen, Rinie
2017-01-01
To determine if experienced health care providers (HCPs) can recognise patients with fibromyalgia (FM) based on a limited set of personality items, exploring the existence of a FM personality. From the 240-item NEO-PI-R personality questionnaire, 8 HCPs from two different countries each selected 20 items they considered most discriminative of FM personality. Then, evaluating the scores on these items of 129 female patients with FM and 127 female controls, each HCP rated the probability of FM for each individual on a 0-10 scale. Personality characteristics (domains and facets) of selected items were determined. Scores of patients with FM and controls on the eight 20-item sets, and HCPs' estimates of each individual's probability of FM were analysed for their discriminative value. The eight 20-item sets discriminated for FM, with areas under the receiver operating characteristic curve ranging from 0.71-0.81. The estimated probabilities for FM showed, in general, percentages of correct classifications above 50%, with rising correct percentages for higher estimated probabilities. The most often chosen and discriminatory items were predominantly of the domain neuroticism (all with higher scores in FM), followed by some items of the facet trust (lower scores in FM). HCPs can, based on a limited set of items from a personality questionnaire, distinguish patients with FM from controls with a statistically significant probability. The HCPs' expectation that personality in FM patients is associated with higher levels for aspects of neuroticism (proneness to psychological distress) and lower scores for aspects of trust, proved to be correct.
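For readers unfamiliar with the area under the receiver operating characteristic curve, it equals the probability that a randomly chosen patient scores higher than a randomly chosen control, with ties counted as half. The sketch below computes it that way; the function name and the example scores are hypothetical and unrelated to the study's data.

```python
def auc(case_scores, control_scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form)."""
    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0   # case ranked above control
            elif c == k:
                wins += 0.5   # ties count as half
    return wins / (len(case_scores) * len(control_scores))

# Hypothetical summed scores on a 20-item set
patients = [62, 58, 71, 66, 54]
controls = [49, 57, 60, 45, 52]
print(auc(patients, controls))
```

A value of 0.5 means the item set does no better than chance, so the reported range of 0.71-0.81 indicates moderate discrimination.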
Huang, Yueng-Hsiang; Lee, Jin; Chen, Zhuo; Perry, MacKenna; Cheung, Janelle H; Wang, Mo
2017-06-01
Zohar and Luria's (2005) safety climate (SC) scale, measuring organization- and group-level SC each with 16 items, is widely used in research and practice. To improve the utility of the SC scale, we shortened the original full-length SC scales. Item response theory (IRT) analysis was conducted using a sample of 29,179 frontline workers from various industries. Based on graded response models, we shortened the original scales in two ways: (1) selecting items with above-average discriminating ability (i.e. offering more than 6.25% of the original total scale information), resulting in 8-item organization-level and 11-item group-level SC scales; and (2) selecting the most informative items that together retain at least 30% of original scale information, resulting in 4-item organization-level and 4-item group-level SC scales. All four shortened scales had acceptable reliability (≥0.89) and high correlations (≥0.95) with the original scale scores. The shortened scales will be valuable for academic research and practical survey implementation in improving occupational safety. Copyright © 2017 The Author(s). Published by Elsevier Ltd. All rights reserved.
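The two shortening rules can be sketched as follows. Note that 6.25% is simply 1/16, the share each item would carry if all 16 items were equally informative. The sketch assumes each item's information has already been summarized as a single number; how the authors aggregated information across the trait range is not stated in the abstract, and the function names and values below are hypothetical.

```python
def above_average_items(info):
    """Rule 1: keep items whose share of total information exceeds 1/n."""
    total = sum(info.values())
    cutoff = 1.0 / len(info)
    return [name for name, value in info.items() if value / total > cutoff]

def top_items_retaining(info, share):
    """Rule 2: add the most informative items until `share` of the total is retained."""
    total = sum(info.values())
    kept, running = [], 0.0
    for name, value in sorted(info.items(), key=lambda kv: kv[1], reverse=True):
        if running / total >= share:
            break
        kept.append(name)
        running += value
    return kept

# Hypothetical per-item information summaries for a 4-item scale
item_info = {"item1": 4.0, "item2": 3.0, "item3": 2.0, "item4": 1.0}
print(above_average_items(item_info))
print(top_items_retaining(item_info, 0.30))
```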
Tanaka, Hisako; Imai, Shino; Nakade, Makiko; Imai, Eri; Takimoto, Hidemi
2016-12-01
Survey items of the Japan National Nutrition Survey (J-NNS) have changed over time. Several papers on dietary surveys have been published; however, to date, there are no in-depth papers regarding physical examinations. Therefore, we investigated changes in the survey items in the physical examinations performed in the J-NNS and the National Health and Nutrition Survey (NHNS), with the aim of incorporating useful data for future policy decisions. We summarized the description of physical examinations and marshalled the changes of survey items from the J-NNS and NHNS from 1946 to 2012. The physical examination is roughly classified into the following six components: some are relevant to anthropometric measurements, clinical measurements, physical symptoms, blood tests, lifestyle and medication by interview, and others. Items related to nutritional deficiency, such as anaemia and tendon reflex disappearance, and body weight measurements were collected during the early period, according to the instructions of the General Headquarters. From 1989, blood tests and measurement of physical activity were added, and serum total protein, total cholesterol, triglycerides, HDL-cholesterol, blood glucose, red blood corpuscles and haemoglobin measurements have been performed continuously for more than 20 years. This is the first report on the items of physical examination in the J-NNS and NHNS. Our research results provide basic information for the utilization of the J-NNS and NHNS, to researchers, clinicians or policy makers. Monitoring the current state correctly is essential for national health promotion, and also for improvement of the investigation methods to apply country-by-country comparisons.
Semon, Natalie L.; Lating, Jeffrey M.; Everly, George S.; Perry, Charlene J.; Moore, Suzanne Straub; Mosley, Adrian M.; Thompson, Carol B.; Links, Jonathan M.
2014-01-01
Objectives: Faculty and affiliates of the Johns Hopkins Preparedness and Emergency Response Research Center partnered with local health departments and faith-based organizations to develop a dual-intervention model of capacity-building for public mental health preparedness and community resilience. Project objectives included (1) determining the feasibility of the tri-partite collaborative concept; (2) designing, delivering, and evaluating psychological first aid (PFA) training and guided preparedness planning (GPP); and (3) documenting preliminary evidence of the sustainability and impact of the model. Methods: We evaluated intervention effectiveness by analyzing pre- and post-training changes in participant responses on knowledge-acquisition tests administered to three urban and four rural community cohorts. Changes in percent of correct items and mean total correct items were evaluated. Criteria for model sustainability and impact were, respectively, observations of nonacademic partners engaging in efforts to advance post-project preparedness alliances, and project-attributable changes in preparedness-related practices of local or state governments. Results: The majority (11 of 14) of test items addressing technical or practical PFA content showed significant improvement; we observed comparable testing results for GPP training. Government and faith partners developed ideas and tools for sustaining preparedness activities, and numerous project-driven changes in local and state government policies were documented. Conclusions: Results suggest that the model could be an effective approach to promoting public health preparedness and community resilience. PMID:25355980
McCabe, O Lee; Semon, Natalie L; Lating, Jeffrey M; Everly, George S; Perry, Charlene J; Moore, Suzanne Straub; Mosley, Adrian M; Thompson, Carol B; Links, Jonathan M
2014-01-01
Faculty and affiliates of the Johns Hopkins Preparedness and Emergency Response Research Center partnered with local health departments and faith-based organizations to develop a dual-intervention model of capacity-building for public mental health preparedness and community resilience. Project objectives included (1) determining the feasibility of the tri-partite collaborative concept; (2) designing, delivering, and evaluating psychological first aid (PFA) training and guided preparedness planning (GPP); and (3) documenting preliminary evidence of the sustainability and impact of the model. We evaluated intervention effectiveness by analyzing pre- and post-training changes in participant responses on knowledge-acquisition tests administered to three urban and four rural community cohorts. Changes in percent of correct items and mean total correct items were evaluated. Criteria for model sustainability and impact were, respectively, observations of nonacademic partners engaging in efforts to advance post-project preparedness alliances, and project-attributable changes in preparedness-related practices of local or state governments. The majority (11 of 14) of test items addressing technical or practical PFA content showed significant improvement; we observed comparable testing results for GPP training. Government and faith partners developed ideas and tools for sustaining preparedness activities, and numerous project-driven changes in local and state government policies were documented. Results suggest that the model could be an effective approach to promoting public health preparedness and community resilience.
Cho, Sun-Joo; Preacher, Kristopher J.; Bottge, Brian A.
2015-01-01
Multilevel modeling (MLM) is frequently used to detect group differences, such as an intervention effect in a pre-test–post-test cluster-randomized design. Group differences on the post-test scores are detected by controlling for pre-test scores as a proxy variable for unobserved factors that predict future attributes. The pre-test and post-test scores that are most often used in MLM are summed item responses (or total scores). In prior research, there have been concerns regarding measurement error in the use of total scores in using MLM. To correct for measurement error in the covariate and outcome, a theoretical justification for the use of multilevel structural equation modeling (MSEM) has been established. However, MSEM for binary responses has not been widely applied to detect intervention effects (group differences) in intervention studies. In this article, the use of MSEM for intervention studies is demonstrated and the performance of MSEM is evaluated via a simulation study. Furthermore, the consequences of using MLM instead of MSEM are shown in detecting group differences. Results of the simulation study showed that MSEM performed adequately as the number of clusters, cluster size, and intraclass correlation increased and outperformed MLM for the detection of group differences. PMID:29881032
Force required for correcting the deformity of pectus carinatum and related multivariate analysis.
Chen, Chenghao; Zeng, Qi; Li, Zhongzhi; Zhang, Na; Yu, Jie
2017-12-24
To measure the force required for correcting pectus carinatum to the desired position and investigate the correlations of the required force with patients' gender, age, deformity type, severity and body mass index (BMI). A total of 125 patients with pectus carinatum were enrolled in the study from August 2013 to August 2016. Their gender, age, deformity type, severity and BMI were recorded. A chest wall compressor was used to measure the force required for correcting the chest wall deformity. Multivariate linear regression was used for data analysis. Among the 125 patients, 112 were males and 13 were females. Their mean age was 13.7±1.5 years old, mean Haller index was 2.1±0.2, and mean BMI was 17.4±1.8 kg/m². Multivariate linear regression analysis showed that the desirable force for correcting chest wall deformity was not correlated with gender and deformity type, but positively correlated with age and BMI and negatively correlated with Haller index. The desirable force measured for correcting chest wall deformities of patients with pectus carinatum positively correlates with age and BMI and negatively correlates with Haller index. The study provides valuable information for future improvement of implanted bar, bar fixation technique, and personalized surgery. Retrospective study. Level 3-4. Copyright © 2018. Published by Elsevier Inc.
Wilson, Patrick B; Madrigal, Leilani A
2016-12-01
Omega-3 polyunsaturated fatty acids (PUFAs) have important physiological functions and may offer select benefits for athletic performance and recovery. The purpose of this investigation was to assess dietary and whole blood omega-3 PUFAs among collegiate athletes. In addition, a brief questionnaire was evaluated as a valid tool for quantifying omega-3 PUFA intake. Fifty-eight athletes (9 males, 49 females) completed a 21-item questionnaire developed to assess omega-3 PUFA intake and provided dried whole blood samples to quantify α-linolenic acid (ALA), eicosapentaenoic acid (EPA), docosahexaenoic acid (DHA), and the HS-Omega-3 Index. Geometric means (95% confidence intervals) for the HS-Omega-3 Index were 4.79% (4.37-5.25%) and 4.75% (4.50-5.01%) for males and females, respectively. Median dietary intakes of ALA, EPA, and DHA were all below 100 mg. Among females, several dietary omega-3 PUFA variables were positively associated with whole blood EPA, with total EPA (rho = 0.67, p < .001) and total DHA (rho = 0.69, p < .001) intakes showing the strongest correlations. Whole blood DHA among females showed positive associations with dietary intakes, with total EPA (rho = 0.62, p < .001) and total DHA (rho = 0.64, p < .001) intakes demonstrating the strongest correlations. The HS-Omega-3 Index in females was positively correlated with all dietary variables except ALA. Among males, the only significant correlation was between food and whole blood EPA (rho = 0.83, p < .01). Collegiate athletes had relatively low intakes of omega-3 PUFAs. A 21-item questionnaire may be useful for screening female athletes for poor omega-3 PUFA status.
Nakata, Akinori; Irie, Masahiro; Takahashi, Masaya
2015-01-01
Although a single-item job satisfaction measure has been shown to be as reliable and inclusive as multiple-item scales in relation to health, studies including immunological data are few. The purpose of this study was to evaluate the validity of single-item job and family life satisfaction based on its association with immune indices. A total of 189 white-collar employees (70% men) underwent a blood draw for the measurement of natural killer (NK), total T, and B cell counts as well as plasma immunoglobulin (Ig) G concentrations and completed single-item job and family life satisfaction measures, respectively. The response options for satisfaction measures were ‘dissatisfied’ (coded 1) to ‘satisfied’ (coded 4). Spearman’s partial correlations controlling for cofactors revealed that increased job satisfaction was positively associated with NK cells (rsp=0.201, p=0.007) and IgG (rsp=0.178, p=0.018), while family life satisfaction was unrelated to immune indices. Those who reported a combination of low job/low family life satisfaction had significantly lower NK and higher B cell counts than those with a high job/high family life satisfaction. Our study suggests that the single-item summary measure of job satisfaction, but not family life satisfaction, may be a valid tool to evaluate immune status in healthy white-collar employees. PMID:23196390
Alternative formulation of explicitly correlated third-order Møller-Plesset perturbation theory
NASA Astrophysics Data System (ADS)
Ohnishi, Yu-ya; Ten-no, Seiichiro
2013-09-01
The second-order wave operator in the explicitly correlated wave function theory has been newly defined as an extension of the conventional s- and p-wave (SP) ansatz (also referred to as the FIXED amplitude ansatz) based on the linked-diagram theorem. The newly defined second-order wave operator has been applied to the calculation of the F12 correction to the third-order many-body perturbation (MP3) energy. In addition to this new wave operator, the F12 correction with the conventional first-order wave operator has been derived and calculated. Among three components of the MP3 correlation energy, the particle ladder contribution, which has shown the slowest convergence with respect to the basis set size, is fairly ameliorated by employing these F12 corrections. Both the newly defined and conventional formalisms of the F12 corrections exhibit a similar recovery of over 90% of the complete basis set limit of the particle ladder contribution of the MP3 correlation energy with a triple-zeta quality basis set for the neon atom, while the amount is about 75% without the F12 correction. The corrections to the ring term are small but the corrected energy has shown similar recovery as the particle ladder term. The hole ladder term has shown a rapid convergence even without the F12 corrections. Owing to these balanced recoveries, the deviation of the total MP3 correlation energy from the complete basis set limit has been calculated to be about 1 kcal/mol with the triple-zeta quality basis set, which is more than five times smaller than the error without the F12 correction.
Trivedi, M H; Rush, A J; Ibrahim, H M; Carmody, T J; Biggs, M M; Suppes, T; Crismon, M L; Shores-Wilson, K; Toprac, M G; Dennehy, E B; Witte, B; Kashner, T M
2004-01-01
The present study provides additional data on the psychometric properties of the 30-item Inventory of Depressive Symptomatology (IDS) and of the recently developed Quick Inventory of Depressive Symptomatology (QIDS), a brief 16-item symptom severity rating scale that was derived from the longer form. Both the IDS and QIDS are available in matched clinician-rated (IDS-C30; QIDS-C16) and self-report (IDS-SR30; QIDS-SR16) formats. The patient samples included 544 out-patients with major depressive disorder (MDD) and 402 out-patients with bipolar disorder (BD) drawn from 19 regionally and ethnically diverse clinics as part of the Texas Medication Algorithm Project (TMAP). Psychometric analyses including sensitivity to change with treatment were conducted. Internal consistencies (Cronbach's alpha) ranged from 0.81 to 0.94 for all four scales (QIDS-C16, QIDS-SR16, IDS-C30 and IDS-SR30) in both MDD and BD patients. Sad mood, involvement, energy, concentration and self-outlook had the highest item-total correlations among patients with MDD and BD across all four scales. QIDS-SR16 and IDS-SR30 total scores were highly correlated among patients with MDD at exit (c = 0.83). QIDS-C16 and IDS-C30 total scores were also highly correlated among patients with MDD (c = 0.82) and patients with BD (c = 0.81). The IDS-SR30, IDS-C30, QIDS-SR16, and QIDS-C16 were equivalently sensitive to symptom change, indicating high concurrent validity for all four scales. High concurrent validity was also documented based on the SF-12 Mental Health Summary score for the population divided in quintiles based on their IDS or QIDS score. The QIDS-SR16 and QIDS-C16, as well as the longer 30-item versions, have highly acceptable psychometric properties and are treatment sensitive measures of symptom severity in depression.
Aksoy Derya, Yeşim; Timur Taşhan, Sermin; Duman, Mesude; Durgun Ozan, Yeter
2018-07-01
The purpose of this study was to create a Turkish version of the Pregnancy-Related Anxiety Questionnaire-Revised 2 (PRAQ-R2), which was revised for application to multiparous and primiparous pregnancy, and to explore its psychometric characteristics in multiparous and primiparous pregnancy. This study was methodologically designed to assess the reliability and validity of the PRAQ-R2. The study was carried out in the obstetrics clinic of a training and research hospital in Malatya. A total of 616 healthy pregnant women (399 multiparous and 217 primiparous) constituted the sample of the study. The cultural adaptation process of the questionnaire was conducted in three phases: language validity, content validity, and pilot application. Exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were used to test the construct validity of the questionnaire. The reliability of the PRAQ-R2 was evaluated with Cronbach's alpha internal consistency coefficient, item-total correlation, test-retest analysis, and parallel forms reliability. The EFA revealed that the PRAQ-R2 consists of 10 items for the multiparous group and 11 for the primiparous group after adding the item "I am anxious about the delivery because I have never experienced one before." The CFA for both groups supported the three-factor questionnaire yielded by the EFA. Good fit index values were obtained in both groups. Cronbach's alpha internal consistency coefficient ranged from 0.81 to 0.93 for the multiparous group and 0.87 to 0.94 for the primiparous group for the complete PRAQ-R2 and each of its subdimensions. In addition, the item-total correlation, test-retest analysis, and parallel forms reliability of the questionnaire were highly correlated. The PRAQ-R2 is a valid and reliable instrument that can be used to evaluate the level of anxiety in Turkish pregnant women irrespective of parity. The use of the PRAQ-R2 in prenatal healthcare services will contribute to the early diagnosis, treatment, and management of pregnancy-related anxiety. Copyright © 2018 Elsevier Ltd. All rights reserved.
Murray, Aileen; Hall, Amanda; Williams, Geoffrey C; McDonough, Suzanne M; Ntoumanis, Nikos; Taylor, Ian; Jackson, Ben; Copsey, Bethan; Hurley, Deirdre A; Matthews, James
2018-02-27
To assess the inter-rater reliability and concurrent validity of the Communication Evaluation in Rehabilitation Tool, which aims to externally assess physiotherapists' competency in using Self-Determination Theory-based communication strategies in practice. Audio recordings of initial consultations between 24 physiotherapists and 24 patients with chronic low back pain in four hospitals in Ireland were obtained as part of a larger randomised controlled trial. Three raters, all of whom had PhDs in psychology and expertise in motivation and physical activity, independently listened to the 24 audio recordings and completed the 18-item Communication Evaluation in Rehabilitation Tool. Inter-rater reliability between all three raters was assessed using intraclass correlation coefficients. Concurrent validity was assessed using Pearson's r correlations with a reference standard, the Health Care Climate Questionnaire. The total score for the Communication Evaluation in Rehabilitation Tool is an average of all 18 items. Total scores demonstrated good inter-rater reliability (Intraclass Correlation Coefficient (ICC) = 0.8) and concurrent validity with the Health Care Climate Questionnaire total score (range: r = 0.7-0.88). Item-level scores of the Communication Evaluation in Rehabilitation Tool identified five items that need improvement. Results provide preliminary evidence to support future use and testing of the Communication Evaluation in Rehabilitation Tool. Implications for Rehabilitation: Promoting patient autonomy is a learned skill and while interventions exist to train clinicians in these skills there are no tools to assess how well clinicians use these skills when interacting with a patient. The lack of robust assessment has severe implications regarding both the fidelity of clinician training packages and resulting outcomes for promoting patient autonomy. This study has developed a novel measurement tool, the Communication Evaluation in Rehabilitation Tool, and a comprehensive user manual to assess how well health care providers use autonomy-supportive communication strategies in real-world clinical settings. This tool has demonstrated good inter-rater reliability and concurrent validity in its initial testing phase. The Communication Evaluation in Rehabilitation Tool can be used in future studies to assess autonomy-supportive communication and undergo further measurement property testing as per our recommendations.
The Coopersmith Self-Esteem Inventory in an Adult Sample.
ERIC Educational Resources Information Center
Noller, Patricia; Shugm, David
1988-01-01
The reliability and validity of the Self-Esteem Inventory developed by S. C. Coopersmith (1975) were evaluated via item-total correlation, discriminant analysis, factor analysis, and analysis of variance of data for 352 Australian adults. The instrument had high internal consistency and discriminated well between subjects with high and low…
Smolen, Tomasz; Chuderski, Adam
2015-01-01
Fluid intelligence (Gf) is a crucial cognitive ability that involves abstract reasoning in order to solve novel problems. Recent research demonstrated that Gf strongly depends on the individual effectiveness of working memory (WM). We investigated a popular claim that if the storage capacity underlay the WM-Gf correlation, then such a correlation should increase with an increasing number of items or rules (load) in a Gf-test. As often no such link is observed, on that basis the storage-capacity account is rejected, and alternative accounts of Gf (e.g., related to executive control or processing speed) are proposed. Using both analytical inference and numerical simulations, we demonstrated that the load-dependent change in correlation is primarily a function of the amount of floor/ceiling effect for particular items. Thus, the item-wise WM correlation of a Gf-test depends on its overall difficulty, and the difficulty distribution across its items. When the early test items yield huge ceiling, but the late items do not approach floor, that correlation will increase throughout the test. If the early items locate themselves between ceiling and floor, but the late items approach floor, the respective correlation will decrease. For a hallmark Gf-test, the Raven-test, whose items span from ceiling to floor, the quadratic relationship is expected, and it was shown empirically using a large sample and two types of WMC tasks. In consequence, no changes in correlation due to varying WM/Gf load, or lack of them, can yield an argument for or against any theory of WM/Gf. Moreover, as the mathematical properties of the correlation formula make it relatively immune to ceiling/floor effects for overall moderate correlations, only minor changes (if any) in the WM-Gf correlation should be expected for many psychological tests.
Representation of Item Position in Immediate Serial Recall: Evidence from Intrusion Errors
ERIC Educational Resources Information Center
Fischer-Baum, Simon; McCloskey, Michael
2015-01-01
In immediate serial recall, participants are asked to recall novel sequences of items in the correct order. Theories of the representations and processes required for this task differ in how order information is maintained; some have argued that order is represented through item-to-item associations, while others have argued that each item is…
Rasch Measurement and Item Banking: Theory and Practice.
ERIC Educational Resources Information Center
Nakamura, Yuji
The Rasch Model is a one-parameter item response theory model which states that the probability of a correct response on a test is a function of the difficulty of the item and the ability of the candidate. Item banking is useful for language testing. The Rasch Model provides estimates of item difficulties that are meaningful,…
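The relationship described above has a standard closed form: the probability of a correct response is a logistic function of ability minus difficulty, both on the same logit scale. The sketch below is a minimal illustration; the function name and the example values are assumptions, not taken from the record.

```python
import math

def rasch_probability(ability, difficulty):
    """Probability of a correct response under the Rasch (one-parameter) model."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

# A candidate whose ability equals the item difficulty has an even chance
print(rasch_probability(ability=1.0, difficulty=1.0))
# The same candidate facing an easier and a harder item (hypothetical values)
print(rasch_probability(ability=1.0, difficulty=-0.5))
print(rasch_probability(ability=1.0, difficulty=2.5))
```

Because the probability depends only on the difference between the two parameters, item difficulties estimated on one group of candidates can in principle be stored and reused, which is what makes the model convenient for item banking.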
A simple but fully nonlocal correction to the random phase approximation
NASA Astrophysics Data System (ADS)
Ruzsinszky, Adrienn; Perdew, John P.; Csonka, Gábor I.
2011-03-01
The random phase approximation (RPA) stands on the top rung of the ladder of ground-state density functional approximations. The simple or direct RPA has been found to predict accurately many isoelectronic energy differences. A nonempirical local or semilocal correction to this direct RPA leaves isoelectronic energy differences almost unchanged, while improving total energies, ionization energies, etc., but fails to correct the RPA underestimation of molecular atomization energies. Direct RPA and its semilocal correction may miss part of the middle-range multicenter nonlocality of the correlation energy in a molecule. Here we propose a fully nonlocal, hybrid-functional-like addition to the semilocal correction. The added full nonlocality is important in molecules, but not in atoms. Under uniform-density scaling, this fully nonlocal correction scales like the second-order-exchange contribution to the correlation energy, an important part of the correction to direct RPA, and like the semilocal correction itself. For the atomization energies of ten molecules, and with the help of one fit parameter, it performs much better than the elaborate second-order screened exchange correction.
1993-09-28
APPENDIX B COMPILATION SYSTEM OPTIONS APPENDIX C APPENDIX F OF THE Ada STANDARD CHAPTER 1 The Ada implementation described above was tested according to the Ada...REPORT and CHECK FILE is checked by a set of executable tests. If these units are not operating correctly, validation testing is discontinued. Class B...sections 2.1 and 2.2 (counted in items b and f, below). 3-1 PROCESSING INFORMATION a) Total Number of Applicable Tests 3809 b) Total Number of Withdrawn Tests
Disruption of amygdala-entorhinal-hippocampal network in late-life depression.
Leal, Stephanie L; Noche, Jessica A; Murray, Elizabeth A; Yassa, Michael A
2017-04-01
Episodic memory deficits are evident in late-life depression (LLD) and are associated with subtle synaptic and neurochemical changes in the medial temporal lobes (MTL). However, the particular mechanisms by which memory impairment occurs in LLD are currently unknown. We tested older adults with (DS+) and without (DS-) depressive symptoms using high-resolution fMRI that is capable of discerning signals in hippocampal subfields and amygdala nuclei. Scanning was conducted during performance of an emotional discrimination task used previously to examine the relationship between depressive symptoms and amygdala-mediated emotional modulation of hippocampal pattern separation in young adults. We found that hippocampal dentate gyrus (DG)/CA3 activity was reduced during correct discrimination of negative stimuli and increased during correct discrimination of neutral items in DS+ compared to DS- adults. The extent of the latter increase was correlated with symptom severity. Furthermore, DG/CA3 and basolateral amygdala (BLA) activity predicted discrimination performance on negative trials, a relationship that depended on symptom severity. The impact of the BLA on depressive symptom severity was mediated by the DG/CA3 during discrimination of neutral items, and by the lateral entorhinal cortex (LEC) during false recognition of positive items. These results shed light on a novel mechanistic account for amygdala-hippocampal network changes and concurrent alterations in emotional episodic memory in LLD. The BLA-LEC-DG/CA3 network, which comprises a key pathway by which emotion modulates memory, is specifically implicated in LLD. © 2017 Wiley Periodicals, Inc.
Development and validation of the Myasthenia Gravis Impairment Index.
Barnett, Carolina; Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M
2016-08-30
We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test-retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated in the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test-retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79-0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79-0.94). The MGII correlated well with comparison measures, with higher correlations with the MG-activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. © 2016 American Academy of Neurology.
Development and validation of the Myasthenia Gravis Impairment Index
Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M.
2016-01-01
Objective: We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. Methods: The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test–retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated in the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. Results: The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test–retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79–0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79–0.94). The MGII correlated well with comparison measures, with higher correlations with the MG–activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. Conclusions: The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. PMID:27402891
Reliability and Validity of the Turkish Version of the Gastrointestinal Symptom Rating Scale.
Turan, Nuray; Aşt, Türkinaz Atabek; Kaya, Nurten
The purpose of this methodological study is to investigate the validity and reliability of the Turkish version of the Gastrointestinal Symptom Rating Scale (GSRS). The scale was adapted to the Turkish language via backward translation. Content validity was examined by referring to experts. Reliability was examined via test-retest reliability and internal consistency, and validity was examined with divergent and convergent validity. The Epworth Sleepiness Scale (ESS) and the Marlowe-Crowne Social Desirability Scale (MCSDS) were used for divergent validity. As for convergent validity, the Constipation Severity Instrument (CSI) and the Patient Assessment of Constipation Quality of Life Scale (PAC-QOLQ) were utilized. The relationship between the GSRS and the health-related quality of life (36-item short-form health survey [SF-36]) was also analyzed. The study population consisted of patients in an orthopedic clinic who volunteered to participate. Test-retest reliability was examined with the participation of 30 patients; internal consistency and validity were examined with 150 patients. Test-retest reliability correlation coefficients of the GSRS varied from 0.39 to 0.87 for all items. For internal consistency, the GSRS's item-total correlation was found to be 0.17-0.67, and Cronbach α was 0.82 for all items. There was a significant positive linear correlation between the GSRS, CSI, and PAC-QOLQ. There was no significant correlation between the GSRS, MCSDS, and ESS. Higher GSRS scores inversely correlated with general quality of life (SF-36). The Turkish version of the GSRS has been found to be a reliable and valid instrument for assessing patients' gastrointestinal symptoms. Therefore, this instrument can be confidently used with Turkish individuals.
Årestedt, Kristofer; Ågren, Susanna; Flemme, Inger; Moser, Debra K; Strömberg, Anna
2015-08-01
The four-item Control Attitudes Scale (CAS) was developed to measure control perceived by patients with cardiac disease and their family members, but extensive psychometric evaluation has not been performed. The aim was to translate, culturally adapt and psychometrically evaluate the CAS in a Swedish sample of implantable cardioverter defibrillator (ICD) recipients, heart failure (HF) patients and their partners. A sample (n=391) of ICD recipients, HF patients and partners was used. Descriptive statistics, item-total and inter-item correlations, exploratory factor analysis, ordinal regression modelling and Cronbach's alpha were used to validate the CAS. The findings from the factor analyses revealed that the CAS is a multidimensional scale including two factors, Control and Helplessness. The internal consistency was satisfactory for all scales (α=0.74-0.85), except the family version total scale (α=0.62). No differential item functioning was detected, which implies that the CAS can be used to make invariant comparisons between groups of different age and sex. The psychometric properties, together with the simple and short format of the CAS, make it a useful tool for measuring perceived control among patients with cardiac diseases and their family members. When using the CAS, subscale scores should be preferred. © The European Society of Cardiology 2014.
A Farahani, Mansoureh; Emamzadeh Ghasemi, Hormat Sadat; Nikpaima, Nasrin; Fereidooni, Zhila; Rasoli, Maryam
2014-10-29
Evaluation of nursing instructors' clinical teaching performance is a prerequisite to the quality assurance of nursing education. One of the most common procedures for this purpose is using student evaluations. This study aimed to develop and evaluate the psychometric properties of the Nursing Instructors' Clinical Teaching Performance Inventory (NICTPI). The primary items of the inventory were generated by reviewing the published literature and the existing questionnaires as well as consulting with the members of the Faculties Evaluation Committee of the study setting. Psychometric properties were assessed by calculating its content validity ratio and index, and test-retest correlation coefficient as well as conducting an exploratory factor analysis and an internal consistency assessment. The content validity ratios and indices of the items were respectively higher than 0.85 and 0.79. The final version of the inventory consisted of 25 items, and in the exploratory factor analysis, items loaded on three factors that jointly accounted for 72.85% of the total variance. The test-retest correlation coefficient and the Cronbach's alpha of the inventory were 0.93 and 0.973, respectively. The results revealed that the developed inventory is an appropriate, valid, and reliable instrument for evaluating nursing instructors' clinical teaching performance.
Nafees, Beenish; Rasmussen, Mikkel; LLoyd, Andrew
2017-01-01
Using an ostomy appliance can affect many aspects of a person's health-related quality of life (HRQL). A 2-part, descriptive study was designed to develop and validate an instrument to assess quality-of-life outcomes related to ostomy appliance use. Study inclusion/exclusion criteria stipulated participants should be 18 to 85 years of age, have an ileostomy or colostomy, have used an appliance for a minimum of 3 months without assistance, and be able to complete an online survey. All participants provided sociodemographic and clinical information. In phase 1, a literature search was conducted and existing instruments used to measure HRQL in persons with an ostomy were assessed. Subsequently, the Ostomy-Q, a 23-item, Likert-response type questionnaire, divided into 4 domains (Discreetness, Comfort, Confidence, and Social Life), was developed based on published evidence and existing ostomy-related HRQL tools. Seven (7) participants recruited from a manufacturer user panel took part in exploratory/cognitive qualitative interviews to refine the new quality-of-life questionnaire. In phase 2, the instrument was tested to assess item variability and conceptual structure, item-total correlation, internal consistency, test-retest reliability, sensitivity, and minimal important difference (MID) in an online validation study among 200 participants from the manufacturer's user panel (equally divided by gender, 125 [62.5%] >50 years old, 128 [64%] with an ileostomy). This exercise also included completion of the Stoma Quality of Life Questionnaire and 2 domains from the Ostomy Adjustment Inventory-23 to assess convergent validity. Eighty-two (82) participants recompleted these study instruments 2 weeks later to assess test-retest reliability.
Sociodemographic and clinical data were assessed using descriptive statistics; Cronbach's alpha was used for internal consistency (minimum 0.70), principal component analysis for item variability/conceptual structure, and item-total correlation; intraclass correlation coefficient was used for test-retest reliability; and standard error of measurement was applied to MID. All domains demonstrated good internal consistency (between 0.69 and 0.78). All scales showed stability, with a minimum intraclass correlation coefficient of 0.743 (P <.001). The Ostomy-Q showed good convergent validity with other instruments to which it was compared (P <.01). In this study, the Ostomy-Q was found to be a reliable and valid outcome measure that can enhance understanding of the impact of ostomy appliances on users. Some items for social relationships and discreetness may need more exploring in the future with other patient groups.
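The abstract states that the standard error of measurement (SEM) was applied to the MID but does not give the formula used. A minimal sketch of the conventional definition, SEM = SD × sqrt(1 − reliability), is shown below as an assumption about the approach; the score list is invented for illustration, and only the reliability value (0.743, the minimum ICC reported above) comes from the abstract.

```python
import math
import statistics


def standard_error_of_measurement(scores, reliability):
    """SEM = sample SD * sqrt(1 - reliability).

    `reliability` is a coefficient in [0, 1], e.g. a test-retest ICC.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return statistics.stdev(scores) * math.sqrt(1.0 - reliability)


# Hypothetical domain scores; not data from the study.
example_scores = [10, 12, 14, 16, 18]
sem = standard_error_of_measurement(example_scores, 0.743)
print(f"SEM = {sem:.2f} scale points")
```

A common convention treats one SEM as a candidate threshold for a minimally important change, though whether the authors used that multiplier is not stated.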
Sleep-related disorders in Latin-American children with atopic dermatitis: A case control study.
Urrutia-Pereira, M; Solé, D; Rosario, N A; Neto, H J C; Acosta, V; Almendarez, C F; Avalos, M M; Badellino, H; Berroa, F; Álvarez-Castelló, M; Castillo, A J; Castro-Almarales, R L; De la Cruz, M M; Cepeda, A M; Fernandez, C; González-León, M; Lozano-Saenz, J; Sanchez-Silot, C; Sisul-Alvariza, J C; Valentin-Rostan, M; Sarni, R O S
Atopic dermatitis (AD) has been associated with impairment of sleep. The aim of this study was to evaluate sleep disorders in Latin-American children with AD (4-10 years) from nine countries, and in normal controls (C). Parents of 454 C and 340 AD children from referral clinics answered the Children Sleep Habits Questionnaire (CSHQ), a one-week retrospective 33-question survey covering seven items (bedtime resistance, sleep duration, sleep anxiety, night awakening, parasomnias, sleep-disordered breathing and daytime sleepiness). Total CSHQ score and items were analysed in both C and AD groups. Spearman's correlation coefficient between SCORAD (Scoring atopic dermatitis), all subscales and total CSHQ were also obtained. C and AD groups were similar regarding age, however, significantly higher values for total CSHQ (62.2±16.1 vs 53.3±12.7, respectively) and items were observed among AD children in comparison to C, and they were higher among those with moderate (54.8%) or severe (4.3%) AD. Except for sleep duration (r=-0.02, p=0.698), there was a significant Spearman's correlation index for bedtime resistance (0.24, p<0.0001), sleep anxiety (0.29, p<0.0001), night awakening (0.36, p<0.0001), parasomnias (0.54, p<0.0001), sleep-disordered breathing (0.42, p<0.0001), daytime sleepiness (0.26, p<0.0001) and total CSHQ (0.46, p<0.0001). AD patients had a significantly higher body mass index. Latin-American children with AD have sleep disorders despite treatment, and those with moderate to severe forms had marked changes in CSHQ. Copyright © 2016 SEICAP. Published by Elsevier España, S.L.U. All rights reserved.
[Validation of the German version of Eating Assessment Tool for head and neck cancer patients].
Zaretsky, Eugen; Steinbach-Hundt, Silke; Pluschinski, Petra; Grethel, Isabel
2018-04-10
The assessment of subjective swallowing complaints constitutes an important element in a multidimensional, modern management of head and neck cancer patients suffering from dysphagia. For this purpose, an internationally recognized and validated 10-item questionnaire, the EAT-10, developed and validated by Belafsky et al. in 2008, is used. The purpose of the present study is the translation of EAT-10 into the German language and its validation for head and neck cancer patients. After the translation of EAT-10 into German according to the guidelines for the translation of foreign measuring instruments, a validation of gEAT-10 was carried out on the basis of a sample of 210 head and neck cancer patients. The reliability was determined by means of the internal consistency (Cronbach's Alpha) and item-total correlations (Spearman). The construct validity was verified by the uni- and multivariate analyses of the distribution of gEAT-10 total scores depending on gender, age, BMI, tumor stage and localization as well as type of the oncological therapy. The internal consistency amounted to α = .94, the item-total correlations varied between ρ = .59 and ρ = .85. No significant associations between gEAT-10 total scores and gender as well as age were identified in univariate calculations. Such associations were found for BMI, tumor stage and localization as well as type of the oncological therapy. However, only the tumor stage yielded a significant result in a regression. The gEAT-10 was shown to be a reliable and construct-valid questionnaire for the assessment of subjective swallowing complaints in patients with head and neck cancer. © Georg Thieme Verlag KG Stuttgart · New York.
Okada, Kayoko; Vilberg, Kaia L; Rugg, Michael D
2012-03-01
The neural correlates of successful retrieval on tests of word stem recall and recognition memory were compared. In the recall test, subjects viewed word stems, half of which were associated with studied items and half with unstudied items, and for each stem attempted to recall a corresponding study word. In the recognition test, old/new judgments were made on old and new words. The neural correlates of successful retrieval were identified by contrasting activity elicited by correctly endorsed test items. Old > new effects common to the two tasks were found in medial and lateral parietal and right entorhinal cortex. Common new > old effects were identified in medial and left frontal cortex, and left anterior intra-parietal sulcus. Greater old > new effects were evident for cued recall in inferior parietal regions abutting those demonstrating common effects, whereas larger new > old effects were found for recall in left frontal cortex and the anterior cingulate. New > old effects were also found for the recall task in right lateral anterior prefrontal cortex, where they were accompanied by old > new effects during recognition. It is concluded that successful recall and recognition are associated with enhanced activity in a common set of recollection-sensitive parietal regions, and that the greater activation in these regions during recall reflects the greater dependence of that task on recollection. Larger new > old effects during recall are interpreted as reflections of the greater opportunity for iterative retrieval attempts when retrieval cues are partial rather than copy cues. Copyright © 2011 Wiley Periodicals, Inc.
Okada, Kayoko; Vilberg, Kaia L.; Rugg, Michael D.
2011-01-01
The neural correlates of successful retrieval on tests of word stem recall and recognition memory were compared. In the recall test, subjects viewed word stems, half of which were associated with studied items and half with unstudied items, and for each stem attempted to recall a corresponding study word. In the recognition test, old/new judgments were made on old and new words. The neural correlates of successful retrieval were identified by contrasting activity elicited by correctly endorsed test items. Old > new effects common to the two tasks were found in medial and lateral parietal, and right entorhinal cortex. Common new > old effects were identified in medial and left frontal cortex, and left anterior intra-parietal sulcus. Greater old > new effects were evident for cued recall in inferior parietal regions abutting those demonstrating common effects, whereas larger new > old effects were found for recall in left frontal cortex and the anterior cingulate. New > old effects were also found for the recall task in right lateral anterior prefrontal cortex, where they were accompanied by old > new effects during recognition. It is concluded that successful recall and recognition are associated with enhanced activity in a common set of recollection-sensitive parietal regions, and that the greater activation in these regions during recall reflects the greater dependence of that task on recollection. Larger new > old effects during recall are interpreted as reflections of the greater opportunity for iterative retrieval attempts when retrieval cues are partial rather than copy cues. PMID:21455941
Zachariah, Marianne; Seidling, Hanna M; Neri, Pamela M; Cresswell, Kathrin M; Duke, Jon; Bloomrosen, Meryl; Volk, Lynn A; Bates, David W
2011-01-01
Background Medication-related decision support can reduce the frequency of preventable adverse drug events. However, the design of current medication alerts often results in alert fatigue and high over-ride rates, thus reducing any potential benefits. Methods The authors previously reviewed human-factors principles for relevance to medication-related decision support alerts. In this study, instrument items were developed for assessing the appropriate implementation of these human-factors principles in drug–drug interaction (DDI) alerts. User feedback regarding nine electronic medical records was considered during the development process. Content validity, construct validity through correlation analysis, and inter-rater reliability were assessed. Results The final version of the instrument included 26 items associated with nine human-factors principles. Content validation on three systems resulted in the addition of one principle (Corrective Actions) to the instrument and the elimination of eight items. Additionally, the wording of eight items was altered. Correlation analysis suggests a direct relationship between system age and performance of DDI alerts (p=0.0016). Inter-rater reliability indicated substantial agreement between raters (κ=0.764). Conclusion The authors developed and gathered preliminary evidence for the validity of an instrument that measures the appropriate use of human-factors principles in the design and display of DDI alerts. Designers of DDI alerts may use the instrument to improve usability and increase user acceptance of medication alerts, and organizations selecting an electronic medical record may find the instrument helpful in meeting their clinicians' usability needs. PMID:21946241
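The abstract reports inter-rater agreement of κ=0.764 without naming the kappa variant. As an illustration only, the sketch below assumes two raters and Cohen's kappa; the function name and the example ratings are hypothetical and are not the study's data.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical item ratings (1 = principle met, 0 = not met).
print(cohens_kappa([1, 1, 0, 0], [1, 1, 0, 1]))
```

Kappa discounts the agreement two raters would reach by chance, which is why it is usually lower than the raw percentage agreement.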
Higashide, Tomomi; Ohkubo, Shinji; Hangai, Masanori; Ito, Yasuki; Shimada, Noriaki; Ohno-Matsui, Kyoko; Terasaki, Hiroko; Sugiyama, Kazuhisa; Chew, Paul; Li, Kenneth K W; Yoshimura, Nagahisa
2016-01-01
To identify the factors which significantly contribute to the thickness variabilities in macular retinal layers measured by optical coherence tomography with or without magnification correction of analytical areas in normal subjects. The thickness of retinal layers {retinal nerve fiber layer (RNFL), ganglion cell layer plus inner plexiform layer (GCLIPL), RNFL plus GCLIPL (ganglion cell complex, GCC), total retina, total retina minus GCC (outer retina)} were measured by macular scans (RS-3000, NIDEK) in 202 eyes of 202 normal Asian subjects aged 20 to 60 years. The analytical areas were defined by three concentric circles (1-, 3- and 6-mm nominal diameters) with or without magnification correction. For each layer thickness, a semipartial correlation (sr) was calculated for explanatory variables including age, gender, axial length, corneal curvature, and signal strength index. Outer retinal thickness was significantly thinner in females than in males (sr², 0.07 to 0.13) regardless of analytical areas or magnification correction. Without magnification correction, axial length had a significant positive sr with RNFL (sr², 0.12 to 0.33) and a negative sr with GCLIPL (sr², 0.22 to 0.31), GCC (sr², 0.03 to 0.17), total retina (sr², 0.07 to 0.17) and outer retina (sr², 0.16 to 0.29) in multiple analytical areas. The significant sr in RNFL, GCLIPL and GCC became mostly insignificant following magnification correction. The strong correlation between the thickness of inner retinal layers and axial length appeared to result from magnification effects. Outer retinal thickness may differ by gender and axial length independently of magnification correction.
Higashide, Tomomi; Ohkubo, Shinji; Hangai, Masanori; Ito, Yasuki; Shimada, Noriaki; Ohno-Matsui, Kyoko; Terasaki, Hiroko; Sugiyama, Kazuhisa; Chew, Paul; Li, Kenneth K. W.; Yoshimura, Nagahisa
2016-01-01
Purpose To identify the factors which significantly contribute to the thickness variabilities in macular retinal layers measured by optical coherence tomography with or without magnification correction of analytical areas in normal subjects. Methods The thickness of retinal layers {retinal nerve fiber layer (RNFL), ganglion cell layer plus inner plexiform layer (GCLIPL), RNFL plus GCLIPL (ganglion cell complex, GCC), total retina, total retina minus GCC (outer retina)} were measured by macular scans (RS-3000, NIDEK) in 202 eyes of 202 normal Asian subjects aged 20 to 60 years. The analytical areas were defined by three concentric circles (1-, 3- and 6-mm nominal diameters) with or without magnification correction. For each layer thickness, a semipartial correlation (sr) was calculated for explanatory variables including age, gender, axial length, corneal curvature, and signal strength index. Results Outer retinal thickness was significantly thinner in females than in males (sr², 0.07 to 0.13) regardless of analytical areas or magnification correction. Without magnification correction, axial length had a significant positive sr with RNFL (sr², 0.12 to 0.33) and a negative sr with GCLIPL (sr², 0.22 to 0.31), GCC (sr², 0.03 to 0.17), total retina (sr², 0.07 to 0.17) and outer retina (sr², 0.16 to 0.29) in multiple analytical areas. The significant sr in RNFL, GCLIPL and GCC became mostly insignificant following magnification correction. Conclusions The strong correlation between the thickness of inner retinal layers and axial length appeared to result from magnification effects. Outer retinal thickness may differ by gender and axial length independently of magnification correction. PMID:26814541
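For readers unfamiliar with the statistic, a semipartial correlation measures how strongly an outcome relates to one predictor after the other predictors have been removed from that predictor only. The study adjusted for several covariates at once; the sketch below is a simplified illustration that assumes a single control variable and takes the three pairwise Pearson correlations as inputs. The numeric values are hypothetical, not taken from the paper.

```python
import math


def semipartial_correlation(r_yx: float, r_yz: float, r_xz: float) -> float:
    """Semipartial correlation of y with x, with z removed from x only.

    r_yx, r_yz and r_xz are the pairwise Pearson correlations.
    """
    if abs(r_xz) >= 1.0:
        raise ValueError("x and z must not be perfectly correlated")
    return (r_yx - r_yz * r_xz) / math.sqrt(1.0 - r_xz ** 2)


# Hypothetical correlations: y = layer thickness, x = axial length, z = age.
sr = semipartial_correlation(r_yx=0.5, r_yz=0.4, r_xz=0.6)
print(f"sr = {sr:.3f}, sr^2 = {sr ** 2:.3f}")
```

The squared value can be read as the share of outcome variance uniquely attributable to that predictor, which is how figures of this kind are usually interpreted.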
Estimating Between-Person and Within-Person Subscore Reliability with Profile Analysis.
Bulut, Okan; Davison, Mark L; Rodriguez, Michael C
2017-01-01
Subscores are of increasing interest in educational and psychological testing due to their diagnostic function for evaluating examinees' strengths and weaknesses within particular domains of knowledge. Previous studies about the utility of subscores have mostly focused on the overall reliability of individual subscores and ignored the fact that subscores should be distinct and have added value over the total score. This study introduces a profile reliability approach that partitions the overall subscore reliability into within-person and between-person subscore reliability. The estimation of between-person reliability and within-person reliability coefficients is demonstrated using subscores from number-correct scoring, unidimensional and multidimensional item response theory scoring, and augmented scoring approaches via a simulation study and a real data study. The effects of various testing conditions, such as subtest length, correlations among subscores, and the number of subtests, are examined. Results indicate that there is a substantial trade-off between within-person and between-person reliability of subscores. Profile reliability coefficients can be useful in determining the extent to which subscores provide distinct and reliable information under various testing conditions.
Blackmon, Jaime E; Liptak, Cori; Recklitis, Christopher J
2017-03-01
Three previously developed short forms of the Beck Depression Inventory-Youth (BDI-Y) were validated against the standard 20-item BDI-Y; 168 adolescent survivors completed the standard and short-form versions of the BDI-Y. The short forms were evaluated for internal consistency and compared with the standard BDI-Y using correlation coefficients and receiver operating characteristic curve analyses. The three short forms had good internal consistency (α > 0.85), high correlations with the total BDI-Y scale (r > 0.85), and good discrimination compared with the standard BDI-Y cutoff score (area under the ROC curve >0.95). Consistent with prior findings, strong psychometric properties of an eight-item short form support its use as a screening measure for adolescent cancer survivors.
Psychometric evaluation of the muscle appearance satisfaction scale in a Mexican male sample.
Escoto Ponce de León, María Del Consuelo; Bosques-Brugada, Lilián Elizabeth; Camacho Ruiz, Esteban Jaime; Alvarez-Rayón, Georgina; Franco Paredes, Karina; Rodríguez Hernández, Gabriela
2017-03-02
The purpose of this study was to determine whether the muscle appearance satisfaction scale (MASS) shows acceptable psychometric properties in Mexican bodybuilders. A total of 258 Mexican male bodybuilders were recruited. Two self-report questionnaires, including the MASS and drive for muscularity scale (DMS), were administered. Six models of the latent structure of the MASS were evaluated, using confirmatory factor analysis with maximum likelihood, considering robust Satorra-Bentler correction to estimate the fit of the models to the data. Similar to the original MASS, the series of CFA confirmed that the Mexican version was well represented with the 17-item five-factor structure, which showed a good model fit [Satorra-Bentler Chi-square (109, n = 258) = 189.18, p < 0.0001; NNFI = 0.91; CFI = 0.93; IFI = 0.93; RMSEA = 0.05 (0.04, 0.07)]. Internal consistency was estimated with McDonald's omega, which was acceptable for the MASS (0.88), and their subscales (0.80 to 0.89), except for muscle checking scale (0.77). Test-retest reliability analysis showed stability of the MASS total as well as of the subscale scores over a 2-week period (intraclass correlation coefficients = 0.75-0.91). Construct validity was demonstrated by a significant positive correlation between MASS and DMS results (r = 0.75; p = 0.0001). These results were similar to those of previous studies, which demonstrate the scale's usefulness. Our results support the suitability of the MASS and its subscales to measure muscle dysmorphia symptoms in Mexican male bodybuilders.
Tarescavage, Anthony M; Corey, David M; Ben-Porath, Yossef S
2016-04-01
The purpose of the current study was to identify Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) correlates of police officer integrity violations and other problem behaviors in an archival database with original MMPI item responses and collateral information regarding integrity violations obtained for 417 male officers. In Study 1, we estimated MMPI-2-RF scores from the MMPI item pool (which includes approximately 80% of the MMPI-2-RF items) in a normative sample, a psychiatric inpatient sample, and a police officer sample, and conducted analyses that demonstrated the comparability of estimated and full scale scores for 41 of the 51 MMPI-2-RF scales. In Study 2, we correlated estimated MMPI-2-RF scores with information about subsequent integrity violations and problem behaviors from the integrity violation data set. Several meaningful associations were obtained, predominately with scales from the emotional, thought, and behavioral dysfunction domains of the MMPI-2-RF. Application of a correction for range restriction yielded substantially improved validity estimates. Finally, we calculated relative risk ratios for the statistically significant findings using cutoffs lower than 65T, which is traditionally used to identify clinically significant elevations, and found several meaningful relative risk ratios. © The Author(s) 2015.
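The relative risk ratios mentioned in this abstract compare the rate of an outcome among officers scoring at or above a cutoff with the rate among those below it. The abstract does not give the underlying counts, so the sketch below uses invented numbers purely to show the arithmetic; the function and argument names are hypothetical.

```python
def relative_risk(exposed_events: int, exposed_total: int,
                  unexposed_events: int, unexposed_total: int) -> float:
    """Risk in the exposed group divided by risk in the unexposed group."""
    if exposed_total <= 0 or unexposed_total <= 0:
        raise ValueError("group sizes must be positive")
    if unexposed_events == 0:
        raise ValueError("relative risk is undefined with zero unexposed events")
    risk_exposed = exposed_events / exposed_total
    risk_unexposed = unexposed_events / unexposed_total
    return risk_exposed / risk_unexposed


# Hypothetical counts: 10 of 100 officers above the cutoff had a violation,
# versus 5 of 100 officers below it.
print(relative_risk(10, 100, 5, 100))
```

A ratio above 1 indicates that scoring above the cutoff is associated with a higher rate of the outcome; it says nothing on its own about statistical significance.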
Isaac, Barney Thomas Jesudason; Thangakunam, Balamugesh; Cherian, Rekha A; Christopher, Devasahayam Jesudas
2015-01-01
For the follow-up of patients with idiopathic interstitial pneumonias (IIP), it is unclear which parameters of pulmonary function tests (PFT) and exercise testing correlate best with high-resolution computed tomography (HRCT). The aim was to determine the correlation of symptom scores, PFTs and exercise testing with HRCT scoring in patients diagnosed with idiopathic interstitial pneumonia. This was a cross-sectional study done in the pulmonary medicine outpatient department of a tertiary care hospital in South India. Consecutive patients who were diagnosed with IIP by a standard algorithm were included in the study. Cough and dyspnea were graded for severity and duration. Pulmonary function tests and exercise testing parameters were noted. HRCT was scored based on an alveolar score, an interstitial score and a total score. The HRCT was correlated with each of the clinical and physiologic parameters. Pearson's/Spearman's correlation coefficient was used for the correlation of symptoms and parameters of ABG, PFT and 6MWT with the HRCT scores. A total of 94 patients were included in the study. Cough and dyspnea severity (r = 0.336 and 0.299), FVC (r = -0.48), TLC (r = -0.439) and DLCO and distance saturation product (DSP) (r = -0.368) and lowest saturation (r = -0.324) had significant correlation with total HRCT score. Among these, DLCO, particularly DLCO corrected % of predicted, correlated best with HRCT score (r = -0.721). Symptoms, PFT and exercise testing had good correlation with HRCT. DLCO corrected % of predicted correlated best with HRCT.
A novel scale for measuring mixed states in bipolar disorder.
Cavanagh, Jonathan; Schwannauer, Matthias; Power, Mick; Goodwin, Guy M
2009-01-01
Conventional descriptions of bipolar disorder tend to treat the mixed state as something of an afterthought. There is no scale that specifically measures the phenomena of the mixed state. This study aimed to test a novel scale for mixed state in a clinical and community population of bipolar patients. The scale included clinically relevant symptoms of both mania and depression in a bivariate scale. Recovered respondents were asked to recall their last manic episode. The scale allowed endorsement of one or more of the manic and depressive symptoms. Internal consistency analyses were carried out using Cronbach alpha. Factor analysis was carried out using a standard Principal Components Analysis followed by Varimax Rotation. A confirmatory factor analytic method was used to validate the scale structure in a representative clinical sample. The reliability analysis gave a Cronbach alpha value of 0.950, with a range of corrected-item-total-scale correlations from 0.546 (weight change) to 0.830 (mood). The factor analysis revealed a two-factor solution for the manic and depressed items which accounted for 61.2% of the variance in the data. Factor 1 represented physical activity, verbal activity, thought processes and mood. Factor 2 represented eating habits, weight change, passage of time and pain sensitivity. This novel scale appears to capture the key features of mixed states. The two-factor solution fits well with previous models of bipolar disorder and concurs with the view that mixed states may be more than the sum of their parts.
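The two reliability statistics reported above, Cronbach's alpha and corrected item-total correlations, can be sketched as follows. This is a minimal illustration using the textbook formulas with sample variances. The function names and the demo scores are invented for the example and have nothing to do with the study data or the authors' software.

```python
import math
from statistics import mean, variance


def pearson(x, y):
    """Pearson correlation between two equal-length score lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def cronbach_alpha(items):
    """items: one list of respondent scores per item (columns)."""
    k = len(items)
    totals = [sum(row) for row in zip(*items)]
    sum_item_var = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - sum_item_var / variance(totals))


def corrected_item_total(items, index):
    """Correlate one item with the sum of all *other* items."""
    rest = [sum(v for j, v in enumerate(row) if j != index)
            for row in zip(*items)]
    return pearson(items[index], rest)


# Hypothetical responses: 3 items, 5 respondents (illustration only)
demo = [[1, 2, 3, 4, 5], [2, 2, 3, 5, 4], [1, 3, 3, 4, 4]]
print(cronbach_alpha(demo))
print([corrected_item_total(demo, i) for i in range(len(demo))])
```

The "corrected" version removes the item from the total before correlating, so an item is not credited for correlating with itself.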
Maciel, João; Infante, Paulo; Ribeiro, Susana; Ferreira, André; Silva, Artur C; Caravana, Jorge; Carvalho, Manuel G
2014-11-01
The prevalence of obesity has increased worldwide. An assessment of the impact of obesity on health-related quality of life (HRQoL) requires specific instruments. The Moorehead-Ardelt Quality of Life Questionnaire II (MA-II) is a widely used instrument to assess HRQoL in morbidly obese patients. The objective of this study was to translate and validate a Portuguese version of the MA-II. The study included forward and backward translations of the original MA-II. The reliability of the Portuguese MA-II was estimated using the internal consistency and test-retest methods. For validation purposes, the Spearman's rank correlation coefficient was used to evaluate the correlation between the Portuguese MA-II and the Portuguese versions of two other questionnaires, the 36-item Short Form Health Survey (SF-36) and the Impact of Weight on Quality of Life-Lite (IWQOL-Lite). One hundred and fifty morbidly obese patients were randomly assigned to test the reliability and validity of the Portuguese MA-II. Good internal consistency was demonstrated by a Cronbach's alpha coefficient of 0.80, and a very good agreement in terms of test-retest reliability was recorded, with an overall intraclass correlation coefficient (ICC) of 0.88. The total sums of MA-II scores and each item of MA-II were significantly correlated with all domains of SF-36 and IWQOL-Lite. A statistically significant negative correlation was found between the MA-II total score and BMI. Moreover, age, gender and surgical status were independent predictors of MA-II total score. A reliable and valid Portuguese version of the MA-II was produced, thus enabling the routine use of MA-II in the morbidly obese Portuguese population.
Bektas, Murat; Akdeniz Kudubes, Aslı; Ugur, Ozlem; Vergin, Canan; Demirag, Bengü
2016-06-01
This study aimed to develop the Scale for Quality of Life in Pediatric Oncology Patients Aged 13-18: Adolescent Form and Parent Form. We used the child and parent information form, Visual Quality of Life Scale, and our own scale, the Scale for Quality of Life in Pediatric Oncology Patients Aged 13-18: Adolescent Form and Parent Form. We finalized the 35-item scale to determine the items, received opinions from 14 specialists on the scale, and pilot-tested the scale in 25 children and their parents. We used Pearson correlation analysis, Cronbach α coefficient, factor analysis and receiver operating characteristics analysis to analyze the data. The total Cronbach α of the parent form was .97, the total factor load was .60-.97 and the total variance explained was 80.4%. The cutoff point of the parent form was 85.50. The total Cronbach α of the adolescent form was .98, the total factor load was .62-.96, and the total variance explained was 83.4%. The cutoff point of the adolescent form was 75.50. As a result of the parent form factor analysis, we determined the Kaiser-Meyer-Olkin (KMO) coefficient as .83 and the Bartlett test χ² as 12,615.92; the factor coefficients of all items of the parent form ranged from .63 to .98. The factor coefficients of all items of the adolescent form ranged from .34 to .99. As a result of the adolescent form factor analysis, we determined the KMO as .79 and the Bartlett test χ² as 13,970.62. In conclusion, we found that the adolescent form and the parent form were valid and reliable in assessing the children's quality of life. Copyright © 2016. Published by Elsevier B.V.
Yun, Young Ho; Kang, Eun Kyo; Lee, Jihye; Choo, Jiyeon; Ryu, Hyewon; Yun, Hye-Min; Kang, Jung Hun; Kim, Tae You; Sim, Jin-Ah; Kim, Yaeji
2018-03-05
In this study, we aimed to develop and validate an instrument that could be used by patients with cancer to evaluate their quality of palliative care. Development of the questionnaire followed the four-phase process: item generation and reduction, construction, pilot testing, and field testing. Based on the literature, we constructed a list of items for the quality of palliative care from 104 quality care issues divided into 14 subscales. We constructed scales of 43 items that only the cancer patients were asked to answer. Using relevance and feasibility criteria and pilot testing, we developed a 44-item questionnaire. To assess the sensitivity and validity of the questionnaire, we recruited 220 patients over 18 years of age from three Korean hospitals. Factor analysis of the data and fit statistics process resulted in the 4-factor, 32-item Quality Care Questionnaire-Palliative Care (QCQ-PC), which covers appropriate communication with health care professionals (ten items), discussing value of life and goals of care (nine items), support and counseling for needs of holistic care (seven items), and accessibility and sustainability of care (six items). All subscales and total scores showed a high internal consistency (Cronbach alpha range, 0.89 to 0.97). Multi-trait scaling analysis showed good convergent (0.568-0.995) and discriminant (0.472-0.869) validity. The correlation between the total and subscale scores of QCQ-PC and those of EORTC QLQ-C15-PAL, MQOL, SAT-SF, and DCS was obtained. This study demonstrates that the QCQ-PC can be adopted to assess the quality of care in patients with cancer.
Evaluation of a short food frequency questionnaire used among Norwegian children.
Lillegaard, Inger Therese L; Overby, Nina Cecilie; Andersen, Lene Frost
2012-01-01
The aim of this study was to evaluate a short food frequency questionnaire (FFQ) against a four-day precoded food diary (PFD) with regard to frequency of food intake among Norwegian 9- and 13-year-olds. A total of 733 9-year-olds and 904 13-year-olds completed first a short FFQ and one to two weeks later a four-day PFD. The short FFQ included questions about 23 food items, including different drinks, fruits, vegetables, bread, fish, pizza, sweets, chocolate and savoury snacks. The PFD covered the whole diet. When comparing mean intake from the PFD with comparable food items in the FFQ, all food items showed that increasing intake measured with the PFD corresponded with increasing intake with the short FFQ. However, participants reported a significantly higher frequency of intake for most foods with the short FFQ compared with PFD, except for soft drinks with sugar and sweets. The median Spearman correlation coefficient between the two methods was 0.36 among the 9-year-olds and 0.32 among the 13-year-olds. Often eaten foods such as fruits and vegetables had higher correlations than seldom eaten foods such as pizza and potato chips. The median correlation coefficients for drinks alone were higher (r=0.47) for both age groups. Results indicate that the short FFQ was able to identify high and low consumers of food intake and had a moderate capability to rank individuals according to food intake. Drinks, fruits and vegetables had better correlations with the PFD than infrequently eaten food items.
Evaluation of a short food frequency questionnaire used among Norwegian children
Lillegaard, Inger Therese L.; Øverby, Nina Cecilie; Andersen, Lene Frost
2012-01-01
Objective: The aim of this study was to evaluate a short food frequency questionnaire (FFQ) against a four-day precoded food diary (PFD) with regard to frequency of food intake among Norwegian 9- and 13-year-olds. Subjects and design: A total of 733 9-year-olds and 904 13-year-olds completed first a short FFQ and one to two weeks later a four-day PFD. The short FFQ included questions about 23 food items, including different drinks, fruits, vegetables, bread, fish, pizza, sweets, chocolate and savoury snacks. The PFD covered the whole diet. Results: When comparing mean intake from the PFD with comparable food items in the FFQ, all food items showed that increasing intake measured with the PFD corresponded with increasing intake with the short FFQ. However, participants reported a significantly higher frequency of intake for most foods with the short FFQ compared with PFD, except for soft drinks with sugar and sweets. The median Spearman correlation coefficient between the two methods was 0.36 among the 9-year-olds and 0.32 among the 13-year-olds. Often eaten foods such as fruits and vegetables had higher correlations than seldom eaten foods such as pizza and potato chips. The median correlation coefficients for drinks alone were higher (r=0.47) for both age groups. Conclusions: Results indicate that the short FFQ was able to identify high and low consumers of food intake and had a moderate capability to rank individuals according to food intake. Drinks, fruits and vegetables had better correlations with the PFD than infrequently eaten food items. PMID:22259597
Journal Impact Factor: Do the Numerator and Denominator Need Correction?
Liu, Xue-Li; Gai, Shuang-Shuang; Zhou, Jing
2016-01-01
To correct the incongruence of document types between the numerator and denominator in the traditional impact factor (IF), we make a corresponding adjustment to its formula and present five corrective IFs: IFTotal/Total, IFTotal/AREL, IFAR/AR, IFAREL/AR, and IFAREL/AREL. Based on a survey of researchers in the fields of ophthalmology and mathematics, we obtained the real impact ranking of sample journals in the minds of peer experts. The correlations between various IFs and questionnaire score were analyzed to verify their journal evaluation effects. The results show that it is scientific and reasonable to use five corrective IFs for journal evaluation for both ophthalmology and mathematics. For ophthalmology, the journal evaluation effects of the five corrective IFs are superior to those of the traditional IF: the corrective effect of IFAR/AR is the best, IFAREL/AR is better than IFTotal/Total, followed by IFTotal/AREL, and IFAREL/AREL. For mathematics, the journal evaluation effect of the traditional IF is superior to those of the five corrective IFs: the corrective effect of IFTotal/Total is best, IFAREL/AR is better than IFTotal/AREL and IFAREL/AREL, and the corrective effect of IFAR/AR is the worst. In conclusion, not every discipline's journal IFs need correction. The results in the current paper show that correcting the IF of ophthalmology journals may be valuable, but it seems to be meaningless for mathematics journals. PMID:26977697
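To make the numerator/denominator mismatch concrete, here is a small sketch of how the traditional IF and the five corrective variants could be computed from per-document-type counts. It rests on assumptions not spelled out in the abstract: "AR" is read as articles plus reviews, "AREL" as articles, reviews, editorials and letters, and "Total" as every document type. The function name and all counts are invented for illustration.

```python
# Assumed document-type groupings (not defined in the abstract itself)
AR = ("article", "review")
AREL = ("article", "review", "editorial", "letter")
TOTAL = ("article", "review", "editorial", "letter", "other")


def impact_factor(citations, items, cited_types, counted_types):
    """Citations received by `cited_types` divided by items of `counted_types`."""
    numerator = sum(citations[t] for t in cited_types)
    denominator = sum(items[t] for t in counted_types)
    return numerator / denominator


# Hypothetical journal: citations this year to, and number of, items
# published in the previous two years, broken down by document type.
citations = {"article": 200, "review": 100, "editorial": 20, "letter": 10, "other": 10}
items = {"article": 80, "review": 20, "editorial": 10, "letter": 10, "other": 5}

variants = {
    "traditional (Total/AR)": (TOTAL, AR),
    "IF Total/Total": (TOTAL, TOTAL),
    "IF Total/AREL": (TOTAL, AREL),
    "IF AR/AR": (AR, AR),
    "IF AREL/AR": (AREL, AR),
    "IF AREL/AREL": (AREL, AREL),
}
for name, (num_types, den_types) in variants.items():
    print(name, round(impact_factor(citations, items, num_types, den_types), 3))
```

In this sketch the traditional IF counts citations to every document type but divides only by articles and reviews, which is the incongruence the corrective variants remove.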
Cross-cultural adaptation and validation of Persian Achilles tendon Total Rupture Score.
Ansari, Noureddin Nakhostin; Naghdi, Soofia; Hasanvand, Sahar; Fakhari, Zahra; Kordi, Ramin; Nilsson-Helander, Katarina
2016-04-01
To cross-culturally adapt the Achilles tendon Total Rupture Score (ATRS) to the Persian language and to preliminarily evaluate the reliability and validity of a Persian ATRS. A cross-sectional and prospective cohort study was conducted to translate and cross-culturally adapt the ATRS to the Persian language (ATRS-Persian) following steps described in guidelines. Thirty patients with total Achilles tendon rupture and 30 healthy subjects participated in this study. Psychometric properties of floor/ceiling effects (responsiveness), internal consistency reliability, test-retest reliability, standard error of measurement (SEM), smallest detectable change (SDC), construct validity, and discriminant validity were tested. Factor analysis was performed to determine the ATRS-Persian structure. There were no floor or ceiling effects, which indicates the content validity and responsiveness of the ATRS-Persian. Internal consistency was high (Cronbach's α 0.95). Item-total correlations exceeded the acceptable standard of 0.3 for all items (0.58-0.95). The test-retest reliability was excellent [(ICC)agreement 0.98]. SEM and SDC were 3.57 and 9.9, respectively. Construct validity was supported by a significant correlation between the ATRS-Persian total score and the Persian Foot and Ankle Outcome Score (PFAOS) total score and PFAOS subscales (r = 0.55-0.83). The ATRS-Persian significantly discriminated between patients and healthy subjects. Exploratory factor analysis revealed 1 component. The ATRS was cross-culturally adapted to Persian and demonstrated to be a reliable and valid instrument to measure functional outcomes in Persian patients with Achilles tendon rupture. II.
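The relationship between the reported SEM and SDC can be checked with a short calculation. The sketch below assumes the commonly used convention SDC = 1.96 × √2 × SEM (a 95% limit on the difference between two measurements). The abstract does not state which formula the authors used, so this is only a plausibility check, and the function name is made up for the example.

```python
import math


def smallest_detectable_change(sem, z=1.96):
    """SDC under the assumed convention z * sqrt(2) * SEM."""
    return z * math.sqrt(2) * sem


sdc = smallest_detectable_change(3.57)  # SEM value taken from the abstract
print(round(sdc, 1))  # → 9.9
```

Under that assumption the reported values are mutually consistent: an SEM of 3.57 gives an SDC of about 9.9.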
Family-centred service: differences in what parents of children with cerebral palsy rate important.
Terwiel, M; Alsem, M W; Siebes, R C; Bieleman, K; Verhoef, M; Ketelaar, M
2017-09-01
A family-centred approach to services of children with disabilities is widely accepted as the foundational approach to service delivery in paediatric health care. The 56 items of the Measure of Processes of Care questionnaire (MPOC-56) all reflect elements of family-centred service. In this study, we investigated which elements of family-centred service are rated important by parents of children with cerebral palsy by adding a question on importance to each item of the MPOC-56 (MPOC-56-I). In total, 175 parents of children with cerebral palsy completed the MPOC-56-I. For each MPOC item, parents were asked to rate the importance on a 5-point scale ranging from 0 (not important at all) up to and including 4 (very important). We used Spearman's rank correlation coefficient to further explore the variation in parents' importance ratings. Parents' importance ratings of the MPOC-56 items varied. The percentage of parents rating an item important (importance rating 3 or 4) varied between 43.8% and 96.8%. The percentage of parents rating an item unimportant (rating 0 or 1) varied between 0.0% and 20.3%, and the percentage of parents rating an item neutral (rating 2) varied between 3.0% and 36.0%. Most diverse importance ratings were found for five items concerning the provision of general information. Three correlations between these items and child and parent characteristics were found. Six items were rated important by almost all (≥95%) parents. These items concern elements of specific information about the child, co-ordinated and comprehensive care for child and family and enabling and partnership. Parents rate the importance of family-centred services for their situation in various ways. These findings endorse that family-centred services should recognize the uniqueness of families and should be tailored to what parents find important. © 2017 John Wiley & Sons Ltd.
Dawson, Deborah A; Saha, Tulshi D; Grant, Bridget F
2010-02-01
The relative severity of each of the 11 DSM-IV alcohol use disorder (AUD) criteria is represented by its severity threshold score, an item response theory (IRT) model parameter inversely proportional to the criterion's prevalence. These scores can be used to create a continuous severity measure comprising the total number of criteria endorsed, each weighted by its relative severity. This paper assesses the validity of the severity ranking of the 11 criteria and the overall severity score with respect to known AUD correlates, including alcohol consumption, psychological functioning, family history, antisociality, and early initiation of drinking, in a representative population sample of U.S. past-year drinkers (n=26,946). The unadjusted mean values for all validating measures increased steadily with the severity threshold score, except that legal problems, the criterion with the highest score, was associated with lower values than expected. After adjusting for the total number of criteria endorsed, this direct relationship was no longer evident. The overall severity score was no more highly correlated with the validating measures than a simple count of criteria endorsed, nor did the two measures yield different risk curves. This reflects both within-criterion variation in severity and the fact that the number of criteria endorsed and their severity are so highly correlated that severity is essentially redundant. Attempts to formulate a scalar measure of AUD will do as well by relying on simple counts of criteria or symptom items as by using scales weighted by IRT measures of severity. Published by Elsevier Ireland Ltd.
Evaluation of five guidelines for option development in multiple-choice item-writing.
Martínez, Rafael J; Moreno, Rafael; Martín, Irene; Trigo, M Eva
2009-05-01
This paper evaluates certain guidelines for writing multiple-choice test items. The analysis of the responses of 5013 subjects to 630 items from 21 university classroom achievement tests suggests that an option should not differ in terms of heterogeneous content because such error has a slight but harmful effect on item discrimination. This also occurs with the "None of the above" option when it is the correct one. In contrast, results do not show the supposedly negative effects of a different-length option, the use of specific determiners, or the use of the "All of the above" option, which not only decreases difficulty but also improves discrimination when it is the correct option.
Unsworth, Nash; Brewer, Gene A; Spillers, Gregory J
2011-09-01
In three experiments search termination decisions were examined as a function of response type (correct vs. incorrect) and confidence. It was found that the time between the last retrieved item and the decision to terminate search (exit latency) was related to the type of response and confidence in the last item retrieved. Participants were willing to search longer when the last retrieved item was a correct item vs. an incorrect item and when the confidence was high in the last retrieved item. It was also found that the number of errors retrieved during the recall period was related to search termination decisions such that the more errors retrieved, the more likely participants were to terminate the search. Finally, it was found that knowledge of overall search set size influenced the time needed to search for items, but did not influence search termination decisions. Copyright © 2011 Elsevier B.V. All rights reserved.
Tarrant, Marie; Knierim, Aimee; Hayes, Sasha K; Ware, James
2006-12-01
Multiple-choice questions are a common assessment method in nursing examinations. Few nurse educators, however, have formal preparation in constructing multiple-choice questions. Consequently, questions used in baccalaureate nursing assessments often contain item-writing flaws, or violations to accepted item-writing guidelines. In one nursing department, 2770 MCQs were collected from tests and examinations administered over a five-year period from 2001 to 2005. Questions were evaluated for 19 frequently occurring item-writing flaws, for cognitive level, for question source, and for the distribution of correct answers. Results show that almost half (46.2%) of the questions contained violations of item-writing guidelines and over 90% were written at low cognitive levels. Only a small proportion of questions were teacher generated (14.1%), while 36.2% were taken from testbanks and almost half (49.4%) had no source identified. MCQs written at a lower cognitive level were significantly more likely to contain item-writing flaws. While there was no relationship between the source of the question and item-writing flaws, teacher-generated questions were more likely to be written at higher cognitive levels (p<0.001). Correct answers were evenly distributed across all four options and no bias was noted in the placement of correct options. Further training in item-writing is recommended for all faculty members who are responsible for developing tests. Pre-test review and quality assessment is also recommended to reduce the occurrence of item-writing flaws and to improve the quality of test questions.
2014-01-01
Background: Self-compassion is a key psychological construct for assessing clinical outcomes in mindfulness-based interventions. The aim of this study was to validate the Spanish versions of the long (26-item) and short (12-item) forms of the Self-Compassion Scale (SCS). Methods: The translated Spanish versions of both forms were administered to two independent samples: Sample 1 comprised university students (n = 268) who were recruited to validate the long form, and Sample 2 comprised Aragon Health Service workers (n = 271) who were recruited to validate the short form. In addition to the SCS, the Mindful Attention Awareness Scale (MAAS), the State-Trait Anxiety Inventory–Trait (STAI-T), the Beck Depression Inventory (BDI) and the Perceived Stress Questionnaire (PSQ) were administered. Construct validity, internal consistency, test-retest reliability and convergent validity were tested. Results: The Confirmatory Factor Analysis (CFA) of the long and short forms of the SCS confirmed the original six-factor model in both scales, showing goodness of fit. Cronbach’s α for the 26-item SCS was 0.87 (95% CI = 0.85-0.90) and ranged between 0.72 and 0.79 for the 6 subscales. Cronbach’s α for the 12-item SCS was 0.85 (95% CI = 0.81-0.88) and ranged between 0.71 and 0.77 for the 6 subscales. The long (26-item) form of the SCS showed a test-retest coefficient of 0.92 (95% CI = 0.89–0.94). The Intraclass Correlation (ICC) for the 6 subscales ranged from 0.84 to 0.93. The short (12-item) form of the SCS showed a test-retest coefficient of 0.89 (95% CI: 0.87-0.93). The ICC for the 6 subscales ranged from 0.79 to 0.91. The long and short forms of the SCS exhibited a significant negative correlation with the BDI, the STAI and the PSQ, and a significant positive correlation with the MAAS. The correlation between the total score of the long and short SCS form was r = 0.92.
Conclusion: The Spanish versions of the long (26-item) and short (12-item) forms of the SCS are valid and reliable instruments for the evaluation of self-compassion among the general population. These results substantiate the use of this scale in research and clinical practice. PMID:24410742
Development of a stress scale for pregnant women in the South Asian context: the A-Z Stress Scale.
Kazi, A; Fatmi, Z; Hatcher, J; Niaz, U; Aziz, A
2009-01-01
Stress in pregnancy can lead to low-birth-weight and preterm babies and to psychological consequences such as anxiety and depression during pregnancy and the puerperium. Previous scales to measure stress contain items that overlap with the symptoms of pregnancy. A stress scale was developed based on in-depth interviews with pregnant women in Pakistan. Construct validity, test-retest reliability and inter-rater reliability were carried out. Cronbach alpha was 0.82 for the 30 short-listed items, with item-total correlations of 0.2-0.8. Multidimensional scaling determined 2 dimensions: socioenvironmental hassles and chronic illnesses. This was the first scale developed for pregnant women based on stressors in a developing country in South Asia.
Possin, Katherine L; Chester, Serana K; Laluz, Victor; Bostrom, Alan; Rosen, Howard J; Miller, Bruce L; Kramer, Joel H
2012-09-01
On tests of design fluency, an examinee draws as many different designs as possible in a specified time limit while avoiding repetition. The neuroanatomical substrates and diagnostic group differences of design fluency repetition errors and total correct scores were examined in 110 individuals diagnosed with dementia, 53 with mild cognitive impairment (MCI), and 37 neurologically healthy controls. The errors correlated significantly with volumes in the right and left orbitofrontal cortex (OFC), the right and left superior frontal gyrus, the right inferior frontal gyrus, and the right striatum, but did not correlate with volumes in any parietal or temporal lobe regions. Regression analyses indicated that the lateral OFC may be particularly crucial for preventing these errors, even after excluding patients with behavioral variant frontotemporal dementia (bvFTD) from the analysis. Total correct correlated more diffusely with volumes in the right and left frontal and parietal cortex, the right temporal cortex, and the right striatum and thalamus. Patients diagnosed with bvFTD made significantly more repetition errors than patients diagnosed with MCI, Alzheimer's disease, semantic dementia, progressive supranuclear palsy, or corticobasal syndrome. In contrast, total correct design scores did not differentiate the dementia patients. These results highlight the frontal-anatomic specificity of design fluency repetitions. In addition, the results indicate that the propensity to make these errors supports the diagnosis of bvFTD. (JINS, 2012, 18, 1-11).
Correlates of a Single-Item Indicator Versus a Multi-Item Scale of Outness About Same-Sex Attraction
Noor, Syed W.; Galos, Dylan L.; Simon Rosser, B. R.
2017-01-01
In this study, we investigated if a single-item indicator measured the degree to which people were open about their same-sex attraction (“out”) as accurately as a multi-item scale. For the multi-item scale, we used the Outness Inventory, which includes three subscales: family, world, and religion. We examined correlations between the single- and multi-item measures; between the single-item indicator and the subscales of the multi-item scale; and between the measures and internalized homonegativity, social attitudes towards homosexuality, and depressive symptoms. In addition, we calculated Tjur’s R2 as a measure of predictive power of the single-item indicator, multi-item scale, and subscales of the multi-item scale in predicting two health-related outcomes: depressive symptoms and condomless anal sex with multiple partners. There was a strong correlation between the single- and multi-item measures (r = 0.73). Furthermore, there were strong correlations between the single-item indicator and each subscale of the multi-item scale: family (r = 0.70), world (r = 0.77), and religion (r = 0.50). In addition, the correlations between the single-item indicator and internalized homonegativity (r = −0.63), social attitudes towards homosexuality (r = −0.38), and depression (r = −0.14) were higher than those between the multi-item scale and internalized homonegativity (r = −0.55), social attitudes towards homosexuality (r = −0.21), and depression (r = −0.13). Contrary to the premise that multi-item measures are superior to single-item measures, our collective findings indicate that the single-item indicator of outness performs better than the multi-item scale of outness. PMID:26292840
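Tjur's R², used above as the measure of predictive power for binary outcomes, is the difference between the mean predicted probability among cases with the outcome and the mean predicted probability among cases without it. The sketch below shows only that final step. The predicted probabilities are assumed to come from some already-fitted logistic model, and the function name and demo numbers are hypothetical.

```python
from statistics import mean


def tjur_r2(outcomes, predicted_probs):
    """Tjur's coefficient of discrimination for a binary outcome (0/1)."""
    with_outcome = [p for y, p in zip(outcomes, predicted_probs) if y == 1]
    without_outcome = [p for y, p in zip(outcomes, predicted_probs) if y == 0]
    return mean(with_outcome) - mean(without_outcome)


# Hypothetical outcomes and model-predicted probabilities (illustration only)
y = [1, 1, 0, 0]
p = [0.8, 0.6, 0.3, 0.1]
print(tjur_r2(y, p))
```

A value near 0 means the model barely separates the two groups, and a value near 1 means it separates them almost perfectly.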
[Checklist Development for Women-Doctor-Friendly Working Conditions in a Hospital Setting].
Horie, Saki; Takeuchi, Masumi; Yamaoka, Kazue; Nohara, Michiko; Hasunuma, Naoko; Okinaga, Hiroko; Nomura, Kyoko
2015-01-01
This study aims to develop a scale of "women-doctor-friendly working conditions in a hospital setting". A task team consisting of relevant people including a medical doctor and a hospital personnel identified 36 items related to women-doctor-friendly working conditions. From December in 2012 to January in 2013, we sent a self-administered questionnaire to 807 full-time employees including faculty members and medical doctors who worked for a university-affiliated hospital. We asked them to score the extent to which they think it is necessary for women doctors to balance between work and gender role responsibilities on the basis of the Likert scale. We carried out a factor analysis and computed Cronbach's alpha to develop a scale and investigated its construct validity and reliability. Of the 807 employees, 291 returned the questionnaires (response rate, 36.1%). The item-total correlation (between an individual item score and the total score) coefficient was in the range from 0.44 to 0.68. In factor analysis, we deleted six items, and five factors were extracted on the basis of the least likelihood method with the oblique Promax rotation. The factors were termed "gender equality action in an organization", "the compliance of care leave in both sexes and parental leave in men", "balance between life events and work", "childcare support at the workplace", and "flexible employment status". The Cronbach's alpha values of all the factors and the total items were 0.82-0.89 and 0.93, respectively, suggesting that the scale we developed has high reliability. The result indicated that the scale of women-doctor-friendly working conditions consisting of five factors with 30 items is highly validated and reliable.
Fieo, Robert; Ocepek-Welikson, Katja; Kleinman, Marjorie; Eimicke, Joseph P.; Crane, Paul K.; Cella, David; Teresi, Jeanne A.
2017-01-01
Aims: The goals of these analyses were to examine the psychometric properties and measurement equivalence of a self-reported cognition measure, the Patient Reported Outcome Measurement Information System® (PROMIS®) Applied Cognition – General Concerns short form. These items are also found in the PROMIS Cognitive Function (version 2) item bank. This scale consists of eight items related to subjective cognitive concerns. Differential item functioning (DIF) analyses of gender, education, race, age, and (Spanish) language were performed using an ethnically diverse sample (n = 5,477) of individuals with cancer. This is the first analysis examining DIF in this item set across ethnic and racial groups. Methods: DIF hypotheses were derived by asking content experts to indicate whether they posited DIF for each item and to specify the direction. The principal DIF analytic model was item response theory (IRT) using the graded response model for polytomous data, with accompanying Wald tests and measures of magnitude. Sensitivity analyses were conducted using ordinal logistic regression (OLR) with a latent conditioning variable. IRT-based reliability, precision and information indices were estimated. Results: DIF was identified consistently only for the item, brain not working as well as usual. After correction for multiple comparisons, this item showed significant DIF for both the primary and sensitivity analyses. Black respondents and Hispanics in comparison to White non-Hispanic respondents evidenced a lower conditional probability of endorsing the item, brain not working as well as usual. The same pattern was observed for the education grouping variable: as compared to those with a graduate degree, conditioning on overall level of subjective cognitive concerns, those with less than high school education also had a lower probability of endorsing this item.
DIF was also observed for age for two items after correction for multiple comparisons for both the IRT and OLR-based models: “I have had to work really hard to pay attention or I would make a mistake” and “I have had trouble shifting back and forth between different activities that require thinking”. For both items, conditional on cognitive complaints, older respondents had a higher likelihood than younger respondents of endorsing the item in the cognitive complaints direction. The magnitude and impact of DIF was minimal. The scale showed high precision along much of the subjective cognitive concerns continuum; the overall IRT-based reliability estimate for the total sample was 0.88 and the estimates for subgroups ranged from 0.87 to 0.92. Conclusion: Little DIF of high magnitude or impact was observed in the PROMIS Applied Cognition – General Concerns short form item set. One item, “It has seemed like my brain was not working as well as usual” might be singled out for further study. However, in general the short form item set was highly reliable, informative, and invariant across differing race/ethnic, educational, age, gender, and language groups. PMID:28523238
Fieo, Robert; Ocepek-Welikson, Katja; Kleinman, Marjorie; Eimicke, Joseph P; Crane, Paul K; Cella, David; Teresi, Jeanne A
2016-01-01
The goals of these analyses were to examine the psychometric properties and measurement equivalence of a self-reported cognition measure, the Patient Reported Outcome Measurement Information System ® (PROMIS ® ) Applied Cognition - General Concerns short form. These items are also found in the PROMIS Cognitive Function (version 2) item bank. This scale consists of eight items related to subjective cognitive concerns. Differential item functioning (DIF) analyses of gender, education, race, age, and (Spanish) language were performed using an ethnically diverse sample ( n = 5,477) of individuals with cancer. This is the first analysis examining DIF in this item set across ethnic and racial groups. DIF hypotheses were derived by asking content experts to indicate whether they posited DIF for each item and to specify the direction. The principal DIF analytic model was item response theory (IRT) using the graded response model for polytomous data, with accompanying Wald tests and measures of magnitude. Sensitivity analyses were conducted using ordinal logistic regression (OLR) with a latent conditioning variable. IRT-based reliability, precision and information indices were estimated. DIF was identified consistently only for the item, brain not working as well as usual. After correction for multiple comparisons, this item showed significant DIF for both the primary and sensitivity analyses. Black respondents and Hispanics in comparison to White non-Hispanic respondents evidenced a lower conditional probability of endorsing the item, brain not working as well as usual. The same pattern was observed for the education grouping variable: as compared to those with a graduate degree, conditioning on overall level of subjective cognitive concerns, those with less than high school education also had a lower probability of endorsing this item. 
DIF was also observed for age for two items after correction for multiple comparisons for both the IRT and OLR-based models: "I have had to work really hard to pay attention or I would make a mistake" and "I have had trouble shifting back and forth between different activities that require thinking". For both items, conditional on cognitive complaints, older respondents had a higher likelihood than younger respondents of endorsing the item in the cognitive complaints direction. The magnitude and impact of DIF was minimal. The scale showed high precision along much of the subjective cognitive concerns continuum; the overall IRT-based reliability estimate for the total sample was 0.88 and the estimates for subgroups ranged from 0.87 to 0.92. Little DIF of high magnitude or impact was observed in the PROMIS Applied Cognition - General Concerns short form item set. One item, "It has seemed like my brain was not working as well as usual" might be singled out for further study. However, in general the short form item set was highly reliable, informative, and invariant across differing race/ethnic, educational, age, gender, and language groups.
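The graded response model named in the abstract above can be illustrated with a minimal sketch. It assumes the standard logistic form of the model; the function name `grm_category_probs` and the parameter values are hypothetical and are not estimates from the study.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one polytomous item under the graded response model.

    theta      -- latent trait level (e.g. subjective cognitive concerns)
    a          -- item discrimination (hypothetical value, not from the study)
    thresholds -- increasing category thresholds b_1 < b_2 < ... (hypothetical)
    """
    # Cumulative probability of responding in category k or higher.
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]


# Five response categories need four thresholds.
probs = grm_category_probs(theta=0.5, a=1.8, thresholds=[-1.5, -0.5, 0.5, 1.5])
print([round(p, 3) for p in probs])
```

In a DIF analysis of this kind, the item parameters are estimated separately by group and compared; an item shows DIF when respondents at the same `theta` have different category probabilities depending on group membership.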
Kim, Kye-Ha; Foster, Roxie L; Park, Jeong-Hwan
2017-04-01
To demonstrate the psychometric properties of the Emotional Reactions Instrument-English (ERI-E) between hospitalized African American and Caucasian children aged 7-12 years. A methodological study was conducted to examine validity and reliability of the ERI-E with 230 hospitalized African American and Caucasian children. Data were collected with sociodemographic and clinical forms, the ERI-E, and the Facial Affective Scale (FAS). Different factor structures were found between hospitalized African American and Caucasian children. In psychometric testing of the ERI-E with African American children, four items (alone, lonely, shy, and bored) were removed from the original 16-item ERI-E after exploratory factor analysis. Three factors, Fear, Anxiety, and Distress, were identified, explaining 60.71% of the total variance. Cronbach's alpha coefficient for the revised 12-item scale was 0.85. Six items in the ERI-E (happy, sad, afraid, frightened, hurt, and uncomfortable) were significantly correlated with the FAS (r = 0.20-0.59) as evidence of concurrent validity. In the sample of hospitalized Caucasian children, two items (bored and uncomfortable) were eliminated from the original ERI-E after exploratory factor analysis. Four factors, Fear, Anxiety, Distress, and Loneliness, were extracted, explaining 62.61% of the total variance. Cronbach's alpha coefficient for the revised 14-item ERI-E was 0.84 for hospitalized Caucasian children. As evidence of concurrent validity, 10 items in the ERI-E (happy, sad, afraid, frightened, bad, lonely, scary, bored, hurt, and uncomfortable) were significantly correlated with the FAS (r = 0.20-0.69). Because children with different cultural backgrounds understand or use words differently, healthcare providers should assess the cultural norms of pediatric patients and take steps to ensure clear, effective communication with them.
In addition, healthcare providers should evaluate the meanings of faces in the FAS before using it in a clinical setting because faces have different cultural connotations. The explosive growth of ethnic minority children in the United States makes it paramount for healthcare providers and researchers to consider the measurement equivalence of any measure to better serve different racial and cultural groups. © 2017 Wiley Periodicals, Inc.
Measuring Advance Care Planning: Optimizing the Advance Care Planning Engagement Survey.
Sudore, Rebecca L; Heyland, Daren K; Barnes, Deborah E; Howard, Michelle; Fassbender, Konrad; Robinson, Carole A; Boscardin, John; You, John J
2017-04-01
A validated 82-item Advance Care Planning (ACP) Engagement Survey measures a broad range of behaviors. However, concise surveys are needed. The objective of this study was to validate shorter versions of the survey. The survey included 57 process (e.g., readiness) and 25 action items (e.g., discussions). For item reduction, we systematically eliminated questions based on face validity, item nonresponse, redundancy, ceiling effects, and factor analysis. We assessed internal consistency (Cronbach's alpha) and construct validity with cross-sectional correlations and the ability of the progressively shorter survey versions to detect change one week after exposure to an ACP intervention (Pearson correlation coefficients). Five hundred one participants (four Canadian and three US sites) were included in item reduction (mean age 69 years [±10], 41% nonwhite). Because of high correlations between readiness and action items, all action items were removed. Because of high correlations and ceiling effects, two process items were removed. Successive factor analysis then created 55-, 34-, 15-, nine-, and four-item versions; 664 participants (from three US ACP clinical trials) were included in validity analysis (age 65 years [±8], 72% nonwhite, 34% Spanish speaking). Cronbach's alphas were high for all versions (four items, 0.84; 55 items, 0.97). Compared with the original survey, cross-sectional correlations were high (four items, 0.85; 55 items, 0.97) as were delta correlations (four items, 0.68; 55 items, 0.93). Shorter versions of the ACP Engagement Survey are valid, internally consistent, and able to detect change across a broad range of ACP behaviors for English and Spanish speakers. Shorter ACP surveys can efficiently measure broad ACP behaviors in research and clinical settings. Published by Elsevier Inc.
Heyland, Daren K; Jiang, Xuran; Day, Andrew G; Cohen, S Robin
2013-08-01
The recently developed Canadian Health Care Evaluation Project (CANHELP) questionnaire, which can be used to assess both patient and family satisfaction with end-of-life care, takes 40-60 minutes to complete. The length of the interview may limit its uptake and clinical utility; a shorter version would make its use more feasible. The purpose of this study was to develop and validate a shorter version of the CANHELP questionnaire. Data were collected using a cross-sectional survey of patients with advanced medical diseases and their family members. Participants completed the long version of CANHELP, a global rating of satisfaction with care (GRS), the FAMCARE scale (family members only), and a quality-of-life (QOL) questionnaire. We reduced the items on the long version based on their relationship to the GRS, the frequency of missing data, the distribution of responses, the redundancy of the items, and focus groups with frontline users. With the remaining items, we assessed internal consistency using Cronbach's alpha, and evaluated construct validity by describing the correlation of the new CANHELP Lite with the full version of CANHELP, GRS, FAMCARE, and the QOL questionnaire scores. A total of 363 patients and 193 family members participated in this study. The patient version was reduced from 37 items to 20 items and the caregiver version was reduced from 38 items to 21 items. Cronbach's alphas ranged from 0.68 to 0.93 for all domains of both the patient and caregiver questionnaires. We observed a high degree of correlation between CANHELP Lite domains and overall scores and the same domains and overall scores for the full version of CANHELP. In addition, we observed moderate to strong correlation between the CANHELP Lite overall satisfaction scores and the GRS questions. 
There was moderate correlation between the overall family member CANHELP Lite score and overall FAMCARE score (r = 0.45) and this was similar to the correlation between the full version of CANHELP and FAMCARE scores (r = 0.41). CANHELP Lite correlated more strongly with the QOL subscale on health care than the other QOL subscales. The CANHELP Lite questionnaire is a valid and internally consistent instrument to measure satisfaction with end-of-life care. Copyright © 2013 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
Liu, Huayun; Yu, Juping; Chen, Yongyi; He, Pingping; Zhou, Lianqing; Tang, Xinhui; Liu, Xiangyu; Li, Xuying; Wu, Yanping; Wang, Yuhua
2016-02-01
This study aimed to examine the psychometric properties and performance of a Chinese version of the Female Sexual Function Index (FSFI) among a sample of Chinese women with cervical cancer. A cross-sectional survey design was used. The respondents included 215 women with cervical cancer in an oncology hospital in China. A translated Chinese version of the FSFI was used to investigate their sexual functioning. Psychometric testing included internal consistency reliability (Cronbach's alpha coefficient and item-total correlations), test-retest reliability, construct validity (principal component analysis via oblique rotation and confirmatory factor analysis), and variability (floor and ceiling effects). The mean score of the total scale was 20.65 ± 4.77. Cronbach's alpha values were .94 for the total scale and .72-.90 for the domains. Test-retest correlation coefficients over 2-4 weeks were .84 (p < .05) for the total scale and .68-.83 for the subscales. Item-total correlation coefficients ranged between .47 and .83 (p < .05). A five-factor model was identified via principal component analysis and established by confirmatory factor analysis, including desire/arousal, lubrication, orgasm, satisfaction, and pain. There was no evidence of floor or ceiling effects. With good psychometric properties similar to its original English version, this Chinese version of the FSFI is demonstrated to be a reliable and valid instrument that can be used to assess sexual functioning of women with cervical cancer in China. Future research is still needed to confirm its psychometric properties and performance in a larger sample. Copyright © 2015 Elsevier Ltd. All rights reserved.
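Cronbach's alpha, the internal consistency statistic reported in this and several neighbouring abstracts, can be computed from a respondents-by-items score table. The following is a minimal sketch using the usual variance-ratio formula; the function name `cronbach_alpha` and the toy scores are illustrative assumptions, not data from any of the studies.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a table of scores.

    rows -- list of respondents, each a list of item scores of equal length.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(rows[0])                       # number of items
    columns = list(zip(*rows))             # one tuple of scores per item
    item_variance_sum = sum(variance(col) for col in columns)
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Toy data (hypothetical): 5 respondents, 4 items on a 1-5 scale.
scores = [
    [4, 5, 4, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [3, 3, 3, 2],
    [1, 2, 1, 2],
]
print(round(cronbach_alpha(scores), 3))
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is read as evidence that the items measure a common construct.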
Tailoring Multimedia Instruction to Soldier Needs
2014-12-01
Pretest Score (Mean % Items Correct) 39% 34% 48% 51% 51% 45% Posttest (Mean % Items Correct) 47% 44% 66% 60% 63% 56...Stepwise regression was used to examine the relationship between Soldiers’ posttest scores (criterion) and their pretest scores, training time, type of...differences among IMI types had no effect.) Pretest scores predicted posttest scores for both Adjust Indirect Fire (βstandardized = .66, t = 6.36
ERIC Educational Resources Information Center
Nowbakht, Mohammad; Shahnazari, Mohammadtaghi
2015-01-01
In the present study, the comparative effects of comprehensible input, output and corrective feedback on the receptive acquisition of L2 vocabulary items were investigated. Two groups of beginning EFL learners participated in the study. The control group received comprehensible input only, while the experimental group received input and was…
Zhang, J H; Peng, R; Du, Y; Mou, Y; Li, N N; Cheng, L
2016-11-08
Objective: To evaluate the reliability and validity of the Parkinson's disease sleep scale-Chinese version (CPDSS) through a study of a large PD population in southwest China, and to explore the prevalence and characteristics of sleep disorders in Parkinson's disease (PD) patients from southwest China. Methods: A total of 544 PD patients and 220 control subjects were enrolled in our study. Demographic data, CPDSS, ESS, PDQ39, HAMD and H-Y stage were assessed in all subjects. Statistical description, Cronbach's alpha coefficient, intra-class correlation coefficient (ICC), Spearman rank correlation coefficient and Mann-Whitney U test were used for statistical analyses. Results: The Cronbach's alpha coefficient for the CPDSS was 0.79, the ICC of the total scale was 0.94 and the ICC of each item ranged from 0.73 to 0.97. The factor analysis yielded a five-factor solution, which explained 63.4% of the total variance. Total and individual item scores of the CPDSS in PD patients were lower than those in healthy controls. 69.3% of PD patients had a sleep disorder, while prevalence in the control group was only 29.6%. A negative correlation was found between CPDSS and ESS. Daytime sleepiness was the most common factor (35.9%) leading to sleep disorders. The sleep disorders of PD patients in southwest China were significantly related to the course of disease, the severity of disease, the quality of life, depression, cognitive level and motor symptoms. Conclusion: The CPDSS has good feasibility, reliability and validity in the PD population from southwest China. The CPDSS is considered an effective tool for the assessment of sleep disorder in PD patients.
Chen, Hui-fang; Wu, Ching-yi; Lin, Keh-chung; Li, Ming-wei; Yu, Hung-wen
2012-07-01
To examine the measurement properties of a short version of the Stroke-Specific Quality of Life Scale (SS-QoL-12). Self-report survey of patients with mild to moderate upper extremity dysfunction. A total of 126 patients provided 252 observations before and after treatment. The construct validity and reliability was examined using the Rasch model; the concurrent and predictive validity was estimated using Spearman's rank correlation coefficients. Paired t-test and the standardized response mean (SRM) were performed to estimate the responsiveness of the SS-QoL-12. The 2-factor model (psychosocial and physical domains) fit the data better with smaller deviances. All but 1 item showed acceptable fit, and no item biases were detected. The reliability of the subscales and the whole scale ranged from 0.67 to 0.99. The total score showed fair correlations with the criterion measures at pretreatment (ρ = 0.28-0.40) and fair to good correlations at post-treatment (ρ = 0.39-0.54). The subscales had low to fair correlations at pretreatment (ρ = 0.19-0.49) and fair to good correlations at post-treatment (ρ = 0.31-0.56). The total and the subscales had low to good predictions at baseline (ρ = 0.22-0.52). The whole scale and the psychosocial subscale were mildly responsive to change (SRM = 0.22), but the physical subscale was not responsive to change (SRM = 0.08). The SS-QoL-12 has acceptable to good measurement properties, with an advantage of requiring less time to administer than other scales. The use of the subscale and total scores depends on the purpose of research. Future studies should recruit stroke patients with a broad range of dysfunction and use a large sample size to validate the findings.
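The standardized response mean (SRM) used above as the responsiveness index is the mean of the paired change scores divided by their standard deviation. A minimal sketch follows; the function name and the pre/post scores are hypothetical and only illustrate the arithmetic.

```python
from statistics import mean, stdev


def standardized_response_mean(pre, post):
    """SRM = mean(change) / SD(change) for paired pre/post scores."""
    changes = [after - before for before, after in zip(pre, post)]
    return mean(changes) / stdev(changes)


# Hypothetical pre- and post-treatment scores for six patients.
pre_scores = [30, 42, 35, 28, 40, 33]
post_scores = [32, 41, 39, 28, 44, 34]
print(round(standardized_response_mean(pre_scores, post_scores), 2))
```

Because the denominator is the variability of change rather than of baseline scores, a small SRM such as the 0.08 reported for the physical subscale indicates that average change was small relative to how much individual changes varied.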
Superficial Priming in Episodic Recognition
ERIC Educational Resources Information Center
Dopkins, Stephen; Sargent, Jesse; Ngo, Catherine T.
2010-01-01
We explored the effect of superficial priming in episodic recognition and found it to be different from the effect of semantic priming in episodic recognition. Participants made recognition judgments to pairs of items, with each pair consisting of a prime item and a test item. Correct positive responses to the test item were impeded if the prime…
Espada, José Pedro; Guillén-Riquelme, Alejandro; Morales, Alexandra; Orgilés, Mireia; Sierra, Juan Carlos
2014-12-01
The objective of this research is to determine the validity and reliability of a questionnaire designed to specifically assess the knowledge of HIV and other sexually transmitted infections in a Spanish adolescent population. Cross-sectional study for the validation of a questionnaire. A total of 17 schools in five Spanish provinces. A total of 1,570 adolescent schoolchildren between 13 and 17 years old. A pool of 40 items relating to knowledge about HIV and other sexually transmitted infections was established. This pool was analyzed by an expert panel. It was then administered to a pilot group with the same demographic characteristics as the sample, to ensure comprehension. Item analysis, internal consistency, test/retest and exploratory factorial analysis. A factor analysis was performed, in which five factors that explained 46% of the total variance were retained: general knowledge about HIV, condom as a protective method, routes of HIV transmission, the prevention of HIV, and other sexually transmitted infections. Reliability measures ranged from 0.66 to 0.88. The test-retest correlation was 0.59. There were gender differences in the knowledge of infections. These factors have adequate internal consistency and acceptable test-retest correlation. Theoretically, these factors fit properly with the content of the items. The factors have a moderate relationship, indicating that a high degree of knowledge about one aspect is not a guarantee of general knowledge. The availability of a questionnaire to assess knowledge of sexually transmitted infections is helpful to evaluate prevention programs. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.
Bennett, Robert; Russell, I Jon; Choy, Ernest; Spaeth, Michael; Mease, Philip; Kajdasz, Daniel; Walker, Daniel; Wang, Fujun; Chappell, Amy
2012-04-01
Patients with fibromyalgia (FM) rate stiffness as one of the most troublesome symptoms of the disorder. However, there are few published studies that have focused on better understanding the nature of stiffness in FM. The primary objectives of these analyses were to characterize the distribution of stiffness severity in patients at baseline, evaluate changes in stiffness after 12 weeks of treatment with duloxetine, and determine which outcomes were correlated with stiffness. These were post-hoc analyses of 3-month data from 4 randomized, double-blind, placebo-controlled studies that assessed efficacy of duloxetine in adults with FM. Severity of stiffness was assessed by using the Fibromyalgia Impact Questionnaire (FIQ) on a scale from 0 (no stiffness) to 10 (most severe stiffness). The association between changes in stiffness and other measures was evaluated by using Pearson's correlation coefficient. The FIQ total score and items, the Brief Pain Inventory (BPI-modified short form), the Clinical Global Impression-Severity scale, the Multidimensional Fatigue Inventory, the 17-item Hamilton Depression Rating Scale, the Sheehan Disability Scale, the 36-item Short-Form Health Survey, and the EuroQoL Questionnaire-5 Dimensions were evaluated in the correlation analyses. Stepwise linear regression was used to identify the variables that were most highly predictive of the changes in FIQ stiffness. The analysis included 1332 patients (mean age, 50.2 years; 94.7% female; and 87.8% white). The mean (SD) baseline FIQ stiffness score was 7.7 (2.0), and this score correlated with baseline BPI pain score and FIQ function. Duloxetine significantly improved the FIQ stiffness score compared with placebo (P < 0.001) and provided a moderate effect size (0.23 for the 60-mg dose and 0.38 for the 120-mg dose). 
Changes in stiffness were best correlated (range, 0.52-0.75; all, P < 0.001) with changes in BPI/FIQ pain and interference scores, FIQ nonrefreshing sleep, FIQ anxiety, 36-item Short-Form Health Survey bodily pain, and Sheehan Disability Scale total score. Variables related to severity of pain, pain interfering with daily activities, and physical functioning were predictors of change in stiffness. Stiffness scores were high in this population with FM and best correlated at baseline with BPI pain score and FIQ function. Not unexpectedly, improvement in stiffness with duloxetine correlated with many of the other markers of FM severity, presumably a result of amelioration in FM comorbidities. Copyright © 2012. Published by EM Inc USA.
Tarrant, Marie; Ware, James; Mohammed, Ahmed M
2009-07-07
Four- or five-option multiple choice questions (MCQs) are the standard in health-science disciplines, both on certification-level examinations and on in-house developed tests. Previous research has shown, however, that few MCQs have three or four functioning distractors. The purpose of this study was to investigate non-functioning distractors in teacher-developed tests in one nursing program in an English-language university in Hong Kong. Using item-analysis data, we assessed the proportion of non-functioning distractors on a sample of seven test papers administered to undergraduate nursing students. A total of 514 items were reviewed, including 2056 options (1542 distractors and 514 correct responses). Non-functioning options were defined as ones that were chosen by fewer than 5% of examinees and those with a positive option discrimination statistic. The proportion of items containing 0, 1, 2, and 3 functioning distractors was 12.3%, 34.8%, 39.1%, and 13.8% respectively. Overall, items contained an average of 1.54 (SD = 0.88) functioning distractors. Only 52.2% (n = 805) of all distractors were functioning effectively and 10.2% (n = 158) had a choice frequency of 0. Items with more functioning distractors were more difficult and more discriminating. The low frequency of items with three functioning distractors in the four-option items in this study suggests that teachers have difficulty developing plausible distractors for most MCQs. Test items should consist of as many options as is feasible given the item content and the number of plausible distractors; in most cases this would be three. Item analysis results can be used to identify and remove non-functioning distractors from MCQs that have been used in previous tests.
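The distractor screening rule stated above (an option is non-functioning if fewer than 5% of examinees choose it or if its discrimination statistic is positive) can be sketched as a simple filter. The function name, the input layout, and the example numbers are assumptions for illustration; the abstract does not specify how the option discrimination statistic was computed, so it is taken here as a given input.

```python
def functioning_distractors(options, key, min_proportion=0.05):
    """Return the labels of distractors that function under the 5% rule.

    options -- {label: (proportion_choosing, discrimination)}; hypothetical layout
    key     -- label of the correct response, which is excluded
    A distractor functions if at least `min_proportion` of examinees chose it
    and its discrimination is not positive.
    """
    return [
        label
        for label, (proportion, discrimination) in options.items()
        if label != key
        and proportion >= min_proportion
        and discrimination <= 0
    ]


# Hypothetical item-analysis output for one four-option item, keyed "A".
item = {
    "A": (0.60, 0.35),   # correct answer
    "B": (0.25, -0.20),  # functioning distractor
    "C": (0.03, -0.05),  # chosen by fewer than 5% of examinees
    "D": (0.12, 0.10),   # positive discrimination: attracts stronger examinees
}
print(functioning_distractors(item, key="A"))
```

Counting the returned labels per item gives the 0-3 tally of functioning distractors that the abstract summarizes across the 514 items.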
NASA Astrophysics Data System (ADS)
Croft, Stephen; Favalli, Andrea
2017-10-01
Neutron multiplicity counting using shift-register calculus is an established technique in the science of international nuclear safeguards for the identification, verification, and assay of special nuclear materials. Typically passive counting is used for Pu and mixed Pu-U items and active methods are used for U materials. Three counting rates (singles, doubles and triples) are measured and, in combination with a simple analytical point-model, are used to calculate characteristics of the measurement item in terms of known detector and nuclear parameters. However, the measurement problem usually involves more than three quantities of interest, but even in cases where the next higher order count rate, quads, is statistically viable, it is not quantitatively applied because corrections for dead time losses are currently not available in the predominant analysis paradigm. In this work we overcome this limitation by extending the commonly used dead time correction method, developed by Dytlewski, to quads. We also give results for pents, which may be of interest for certain special investigations. Extension to still higher orders may be accomplished by inspection based on the sequence presented. We discuss the foundations of the Dytlewski method, give limiting cases, and highlight the opportunities and implications that these new results expose. In particular there exist a number of ways in which the new results may be combined with other approaches to extract the correlated rates, and this leads to various practical implementations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Croft, Stephen; Favalli, Andrea
Here, neutron multiplicity counting using shift-register calculus is an established technique in the science of international nuclear safeguards for the identification, verification, and assay of special nuclear materials. Typically passive counting is used for Pu and mixed Pu-U items and active methods are used for U materials. Three counting rates (singles, doubles and triples) are measured and, in combination with a simple analytical point-model, are used to calculate characteristics of the measurement item in terms of known detector and nuclear parameters. However, the measurement problem usually involves more than three quantities of interest, but even in cases where the next higher order count rate, quads, is statistically viable, it is not quantitatively applied because corrections for dead time losses are currently not available in the predominant analysis paradigm. In this work we overcome this limitation by extending the commonly used dead time correction method, developed by Dytlewski, to quads. We also give results for pents, which may be of interest for certain special investigations. Extension to still higher orders may be accomplished by inspection based on the sequence presented. We discuss the foundations of the Dytlewski method, give limiting cases, and highlight the opportunities and implications that these new results expose. In particular there exist a number of ways in which the new results may be combined with other approaches to extract the correlated rates, and this leads to various practical implementations.
Croft, Stephen; Favalli, Andrea
2017-07-16
Here, neutron multiplicity counting using shift-register calculus is an established technique in the science of international nuclear safeguards for the identification, verification, and assay of special nuclear materials. Typically passive counting is used for Pu and mixed Pu-U items and active methods are used for U materials. Three counting rates (singles, doubles and triples) are measured and, in combination with a simple analytical point-model, are used to calculate characteristics of the measurement item in terms of known detector and nuclear parameters. However, the measurement problem usually involves more than three quantities of interest, but even in cases where the next higher order count rate, quads, is statistically viable, it is not quantitatively applied because corrections for dead time losses are currently not available in the predominant analysis paradigm. In this work we overcome this limitation by extending the commonly used dead time correction method, developed by Dytlewski, to quads. We also give results for pents, which may be of interest for certain special investigations. Extension to still higher orders may be accomplished by inspection based on the sequence presented. We discuss the foundations of the Dytlewski method, give limiting cases, and highlight the opportunities and implications that these new results expose. In particular there exist a number of ways in which the new results may be combined with other approaches to extract the correlated rates, and this leads to various practical implementations.
The Intelligibility in Context Scale: validity and reliability of a subjective rating measure.
McLeod, Sharynne; Harrison, Linda J; McCormack, Jane
2012-04-01
To describe a new measure of functional intelligibility, the Intelligibility in Context Scale (ICS), and evaluate its validity, reliability, and sensitivity using 3 clinical measures of severity of speech sound disorder: (a) percentage of phonemes correct (PPC), (b) percentage of consonants correct (PCC), and (c) percentage of vowels correct (PVC). Speech skills of 120 preschool children (109 with parent-/teacher-identified concern about how they talked and made speech sounds and 11 with no identified concern) were assessed with the Diagnostic Evaluation of Articulation and Phonology (Dodd, Hua, Crosbie, Holm, & Ozanne, 2002). Parents completed the 7-item ICS, which rates the degree to which children's speech is understood by different communication partners (parents, immediate family, extended family, friends, acquaintances, teachers, and strangers) on a 5-point scale. Parents' ratings showed that most children were always (5) or usually (4) understood by parents, immediate family, and teachers, but only sometimes (3) by strangers. Factor analysis confirmed the internal consistency of the ICS items; therefore, ratings were averaged to form an overall intelligibility score. The ICS had high internal reliability (α = .93), sensitivity, and construct validity. Criterion validity was established through significant correlations between the ICS and PPC (r = .54), PCC (r = .54), and PVC (r = .36). The ICS is a promising new measure of functional intelligibility. These data provide initial support for the ICS as an easily administered, valid, and reliable estimate of preschool children's intelligibility when speaking with people of varying levels of familiarity and authority.
Sideridis, Georgios D.; Tsaousis, Ioannis; Al Harbi, Khaleel
2016-01-01
The purpose of the present study was to relate response strategy with person ability estimates. Two behavioral strategies were examined: (a) the strategy to skip items in order to save time on timed tests, and (b) the strategy to select two responses on an item, with the hope that one of them may be considered correct. Participants were 4,422 individuals who were administered a standardized achievement measure related to math, biology, chemistry, and physics. In the present evaluation, only the physics subscale was employed. Two analyses were conducted: (a) a person-based one to identify differences between groups and potential correlates of those differences, and (b) a measure-based analysis in order to identify the parts of the measure that were responsible for potential group differentiation. For (a) person abilities the 2-PL model was employed and later the 3-PL and 4-PL models in order to estimate upper and lower asymptotes of person abilities. For (b) differential item functioning, differential test functioning, and differential distractor functioning were investigated. Results indicated that there were significant differences between groups with completers having the highest ability compared to both non-attempters and dual responders. There were no significant differences between non-attempters and dual responders. The present findings have implications for response strategy efficacy and measure evaluation, revision, and construction. PMID:27790174
Crespo-Eguilaz, N; Magallon, S; Sanchez-Carpintero, R; Narbona, J
2016-01-01
The Children's Communication Checklist (CCC) by Bishop is a useful scale for evaluation of pragmatic verbal abilities in school children. The aim of the study is to ascertain the validity and reliability of the CCC in Spanish. Answers to the CCC items by parents of 360 children with normal intelligence were analyzed. There were five groups: 160 control children, 68 children with attention deficit hyperactivity disorder, 77 with procedural non-verbal disorder, 25 with social communication disorder, and 30 with autism spectrum disorder. Investigations included factorial analysis in order to cluster checklist items, reliability analyses of the proposed scales, and discriminant analysis to check whether the scale correctly classifies children with pragmatic verbal abilities. Seven factors were obtained (Kaiser-Meyer-Olkin: 0.852) with moderate similarity to those of the original scale: social relationships, interests, and five more that can be grouped into pragmatic verbal ability (conversational abilities, coherence-comprehension, empathy, nonverbal communication, and appropriateness). All factors are significantly correlated with each other in the control group, and the five that compose pragmatic verbal ability correlate with each other in the clinical groups (Pearson r). The scales have good reliability (Cronbach's alpha: 0.914). The questionnaire correctly classifies 98.9% of grouped cases with and without pragmatic disorder and 78% of subjects in their appropriate clinical group. In addition, the questionnaire allows the pathologies to be differentiated according to the presence and intensity of the symptoms. This Spanish version of the CCC is highly valid and reliable. The proposed statistics can be used as normative-reference values.
Tomietto, Marco; Saiani, Luisa; Palese, Alvisa; Cunico, Laura; Cicolini, Giancarlo; Watson, Paul; Saarikoski, Mikko
2012-01-01
A clinical learning environment is an "interactive network of forces within the clinical setting that influence the students' learning outcomes". International research identifies the Clinical Learning Environment and Supervision plus Nurse Teacher scale (CLES+T) as the gold standard for assessing a good clinical learning environment. This study aims to evaluate the psychometric properties of the Italian version of the CLES+T. A total of 875 students attending the Bachelor in Nursing in 3 universities in Italy participated in the study. Cronbach's alpha, item-to-total correlations, skewness, and kurtosis were calculated; factor analysis was performed using Principal Axis Factoring and an oblique rotation method. Results showed a Cronbach's alpha of 0.95 for the scale, ranging from 0.80 to 0.96 among factors; all items met the item-to-total correlation and answer-variability criteria. Factor analysis showed a 7-factor model explaining more than 67% of the variance; the largest share of variance was explained by the "pedagogical atmosphere" factor (37.61%). The nurse teacher factor in the Italian model is split into 3 sub-factors: theory-practice integration, cooperation with ward staff, and relationship with mentor and student. These results enable an international debate concerning the theoretical structure of the CLES+T and provide a reliable and valid tool for the comparison of supervisory models in guiding nursing students' clinical learning.
Improving Photometry and Stellar Signal Preservation with Pixel-Level Systematic Error Correction
NASA Technical Reports Server (NTRS)
Kolodzijczak, Jeffrey J.; Smith, Jeffrey C.; Jenkins, Jon M.
2013-01-01
The Kepler Mission has demonstrated that excellent stellar photometric performance can be achieved using apertures constructed from optimally selected CCD pixels. The clever methods used to correct for systematic errors, while very successful, still have some limitations in their ability to extract long-term trends in stellar flux. They also leave poorly correlated bias sources, such as a drifting moiré pattern, uncorrected. We will illustrate several approaches where applying systematic error correction algorithms to the pixel time series, rather than the co-added raw flux time series, provides significant advantages. Examples include spatially localized determination of time-varying moiré pattern biases, greater sensitivity to radiation-induced pixel sensitivity drops (SPSDs), improved precision of co-trending basis vectors (CBV), and a means of distinguishing the stellar variability from co-trending terms even when they are correlated. For the last item, the approach enables physical interpretation of appropriately scaled coefficients derived in the fit of pixel time series to the CBV as linear combinations of various spatial derivatives of the pixel response function (PRF). We demonstrate that the residuals of a fit of so-derived pixel coefficients to various PRF-related components can be deterministically interpreted in terms of physically meaningful quantities, such as the component of the stellar flux time series which is correlated with the CBV, as well as relative pixel gain, proper motion, and parallax. The approach also enables us to parameterize and assess the limiting factors in the uncertainties in these quantities.
A Tribute to Bunky at 125: A Comprehensive Bibliography of E. M. Jellinek's Publications.
Ward, Judit H; Bejarano, William
2016-05-01
E. M. Jellinek is considered one of the founders of alcohol science. On the 125th anniversary of his birth, the authors wish to contribute to existing, incomplete bibliographies of his work by offering a more comprehensive bibliography that includes his non-alcohol-studies publications as well as newly discovered alcohol-related items. After we reviewed the two existing Jellinek bibliographies, records were checked against the full-text items to correct errors and discrepancies. This led to the consolidation of the two bibliographies as well as the discovery of various reprints and republished titles. Based on the authors' parallel biographical investigations into Jellinek's lesser researched past, it was established that he had started his scientific career much earlier than previously documented. Additional publications attributed to E. M. Jellinek under various names were sought, located, and collected from geographically diverse sources in several languages, with the help of an international network of academic librarians. References were organized and separated by publication type, with reprinted and republished texts arranged underneath the original entries. Jellinek's comprehensive bibliography covers 70 years, from 1912 to 1982, with 165 original publications, as compared with the 90 and 96 publications, respectively, of the previous bibliographies. When reprints and republished items were included, the number of publications totals 308, as compared with the previous respective totals of 117 and 116. The new Jellinek bibliography highlights his multidisciplinary approach to several scientific disciplines and provides the potential to reevaluate his contributions and total scholarly impact.
Solari, A; Mattarozzi, K; Vignatelli, L; Giordano, A; Russo, P M; Uccelli, M Messmer; D'Alessandro, R
2010-10-01
We describe the development and clinical validation of a patient self-administered tool assessing the quality of multiple sclerosis diagnosis disclosure. A multiple sclerosis expert panel generated questionnaire items from the Doctor's Interpersonal Skills Questionnaire, literature review, and interviews with neurology inpatients. The resulting 19-item Comunicazione medico-paziente nella Sclerosi Multipla (COSM) was pilot tested/debriefed on seven patients with multiple sclerosis and administered to 80 patients newly diagnosed with multiple sclerosis. The resulting revised 20-item version (COSM-R) was debriefed on five patients with multiple sclerosis, field tested/debriefed on multiple sclerosis patients, and field tested on 105 patients newly diagnosed with multiple sclerosis participating in a clinical trial on an information aid. The hypothesized monofactorial structure of COSM-R section 2 was tested on the latter two groups. The questionnaire was well accepted. Scaling assumptions were satisfactory in terms of score distributions, item-total correlations and internal consistency. Factor analysis confirmed section 2's monofactorial structure, which was also test-retest reliable (intraclass correlation coefficient [ICC] 0.73; 95% CI 0.54-0.85). Section 1 had only fair test-retest reliability (ICC 0.45; 95% CI 0.12-0.69), and three items had 8-21% missed responses. COSM-R is a brief, easy-to-interpret MS-specific questionnaire for use as a health care indicator.
Measurement properties of the Spinal Cord Injury-Functional Index (SCI-FI) short forms.
Heinemann, Allen W; Dijkers, Marcel P; Ni, Pengsheng; Tulsky, David S; Jette, Alan
2014-07-01
To evaluate the psychometric properties of the Spinal Cord Injury-Functional Index (SCI-FI) short forms (basic mobility, self-care, fine motor, ambulation, manual wheelchair, and power wheelchair) based on internal consistency; correlations between short forms banks, full item bank forms, and a 10-item computer adaptive test version; magnitude of ceiling and floor effects; and test information functions. Cross-sectional cohort study. Six rehabilitation hospitals in the United States. Individuals with traumatic spinal cord injury (N=855) recruited from 6 national Spinal Cord Injury Model Systems facilities. Not applicable. SCI-FI full item bank, 10-item computer adaptive test, and parallel short form scores. The SCI-FI short forms (with separate versions for individuals with paraplegia and tetraplegia) demonstrate very good internal consistency, group-level reliability, excellent correlations between short forms and scores based on the total item bank, and minimal ceiling and floor effects (except ceiling effects for persons with paraplegia on self-care, fine motor, and power wheelchair ability and floor effects for persons with tetraplegia on self-care, fine motor, and manual wheelchair ability). The test information functions are acceptable across the range of scores where most persons in the sample performed. Clinicians and researchers should consider the SCI-FI short forms when computer adaptive testing is not feasible. Copyright © 2014 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Tsirogiannis, Panagiotis; Neophytou, Sophia; Reul, Anika; Heydecke, Guido; Reissmann, Daniel R
2017-01-01
To develop a reliable and valid instrument for the comprehensive assessment of patients' burdens during dental impression making, the Burdens in Dental Impression Making Questionnaire, BiDIM-Q. The item pool was generated in a convenience sample of 20 prosthodontic patients using semi-structured face-to-face interviews. The final instrument was tested in 145 consecutively recruited patients, and psychometric properties of the BiDIM-Q were determined. Four different impression materials were used according to the manufacturers' instructions and indications: alginate, c-silicone, polyvinylsiloxane, and polyether. The final BiDIM-Q consisting of 12 items showed sufficient reliability, indicated by Cronbach's alpha of .82 and an average inter-item correlation of .29. Validity was supported by Pearson correlation coefficients for the correlation between the instrument's total score with the patients' overall satisfaction rating (r=.63), and by the correlation matrix for the correlations of the patients' perceptions with the practitioners' satisfaction ratings. Overall, patient perceived burdens were low with highest burdens observed when using polyether in partially dentate patients for pick-up impressions, while lowest burdens were reported when using c-silicone for impressions of edentulous jaws. The BiDIM-Q is a reliable and valid tool for assessing patient-based process-related quality of care in dentistry allowing a deeper insight into patients' perspective during dental impression making. Copyright © 2016 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
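The two reliability figures reported above (12 items, average inter-item correlation of .29, Cronbach's alpha of .82) can be cross-checked with the standardized-alpha (Spearman-Brown) formula, alpha = k·r / (1 + (k − 1)·r). The sketch below is only a plausibility check under the assumption that item variances are roughly equal; the abstract does not say which alpha variant the authors computed, and the function name is illustrative.

```python
def standardized_alpha(k: int, mean_r: float) -> float:
    """Standardized Cronbach's alpha from the number of items k and the
    mean inter-item correlation mean_r (Spearman-Brown form)."""
    return k * mean_r / (1.0 + (k - 1) * mean_r)


# Values taken from the abstract: 12 items, mean inter-item r = .29.
alpha = standardized_alpha(k=12, mean_r=0.29)
print(round(alpha, 2))
```

The result is close to the reported .82; a small gap is expected because raw alpha is computed from covariances and the published correlation is rounded.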
Yang, Scott; Jones-Quaidoo, Sean M; Eager, Matthew; Griffin, Justin W; Reddi, Vasantha; Novicoff, Wendy; Shilt, Jeffrey; Bersusky, Ernesto; Defino, Helton; Ouellet, Jean; Arlet, Vincent
2011-07-01
In adolescent idiopathic scoliosis (AIS) there has been a shift towards increasing the number of implants and pedicle screws, which has not been proven to improve cosmetic correction. To evaluate if increasing cost of instrumentation correlates with cosmetic correction using clinical photographs. 58 Lenke 1A and B cases from a multicenter AIS database with at least 3 months follow-up of clinical photographs were used for analysis. Cosmetic parameters on PA and forward bending photographs included angular measurements of trunk shift, shoulder balance, rib hump, and ratio measurements of waist line asymmetry. Pre-op and follow-up X-rays were measured for coronal and sagittal deformity parameters. Cost density was calculated by dividing the total cost of instrumentation by the number of vertebrae being fused. Linear regression and Spearman's correlation were used to correlate cost density to X-ray and photo outcomes. Three independent observers verified radiographic and cosmetic parameters for inter/intraobserver variability analysis. Average pre-op Cobb angle and instrumented correction were 54° (SD 12.5) and 59% (SD 25) respectively. The average number of vertebrae fused was 10 (SD 1.9). The total cost of spinal instrumentation ranged from $6,769 to $21,274 (Mean $12,662, SD $3,858). There was a weak positive and statistically significant correlation between Cobb angle correction and cost density (r = 0.33, p = 0.01), and no correlation between Cobb angle correction of the uninstrumented lumbar spine and cost density (r = 0.15, p = 0.26). There was no significant correlation between any of the sagittal X-ray measurements or any of the photo parameters and cost density. There was good to excellent inter/intraobserver variability of all photographic parameters based on the intraclass correlation coefficient (ICC 0.74-0.98).
Our method used to measure cosmesis had good to excellent inter/intraobserver variability, and may be an effective tool to objectively assess cosmesis from photographs. Since increasing cost density only mildly improves the Cobb angle correction of the main thoracic curve and not the correction of the uninstrumented spine or any of the cosmetic parameters, one should consider the cost of increasing implant density in Lenke 1A and B curves. In the era of rationalization of health care expenses, this study demonstrates that increasing the number of implants does not improve any relevant cosmetic or radiographic outcomes.
ERIC Educational Resources Information Center
Çikirikçi Demirtasli, Nükhet; Ulutas, Seher
2015-01-01
Problem Statement: Item bias occurs when individuals from different groups (different gender, cultural background, etc.) have different probabilities of responding correctly to a test item despite having the same skill levels. It is important that tests or items do not have bias in order to ensure the accuracy of decisions taken according to test…
Estimating the Number of Examinees Who Did Not Reach the Last Item of a Section.
ERIC Educational Resources Information Center
Wainer, Howard
It is important to estimate the number of examinees who reached a test item, because item difficulty is defined by the number who answered correctly divided by the number who reached the item. A new method is presented and compared to the previously used definition of three categories of response to an item: (1) answered; (2) omitted--a…
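The definition of item difficulty given above (number answering correctly divided by number who reached the item) can be sketched as follows. The sketch assumes one common convention, not the new method the abstract refers to: an examinee is treated as having reached every item up to and including the last one they responded to, so blanks before that point count as omitted and trailing blanks count as not reached. The data encoding (`True`/`False`/`None`) and the function names are hypothetical.

```python
from typing import List, Optional, Sequence


def last_reached_index(responses: Sequence[Optional[bool]]) -> int:
    """Index of the last item with any response, or -1 if none.

    True = correct, False = incorrect, None = blank (hypothetical encoding).
    """
    for i in range(len(responses) - 1, -1, -1):
        if responses[i] is not None:
            return i
    return -1


def item_difficulty(all_responses: List[Sequence[Optional[bool]]], item: int) -> Optional[float]:
    """Proportion correct among examinees assumed to have reached `item`."""
    reached = 0
    correct = 0
    for responses in all_responses:
        if item <= last_reached_index(responses):
            reached += 1
            if responses[item] is True:
                correct += 1
    return correct / reached if reached else None


# Three hypothetical examinees on a four-item section.
examinees = [
    [True, False, True, None],   # trailing blank: item 3 not reached
    [True, None, None, None],    # reached item 0 only
    [False, None, True, True],   # item 1 omitted, but reached
]
difficulties = [item_difficulty(examinees, i) for i in range(4)]
```

Under this convention the denominator shrinks for later items, which is why estimating who actually reached an item matters for its difficulty value.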
Development of a gambling addictive behavior scale for adolescents in Korea.
Park, Hyun Sook; Jung, Sun Young
2012-12-01
This study was conducted to develop a gambling addictive behavior scale for adolescents. The process involved construction of a conceptual framework, initial item search, verification of content validity, selection of secondary items, and extraction of final items. The participants were 299 adolescents from two middle schools and four high schools. Item analysis, factor analysis, criterion validity, internal consistency, and ROC curve were used to analyze the data. For the final scale, 25 items were selected, and categorized into 4 factors which accounted for 54.9% of the total variance. The factors were labeled as loss of control, life dysfunction from gambling addiction, gambling experience, and social dysfunction from problem gambling. The scores for the scale were significantly correlated with addictive personality, irrational gambling belief, and adolescent's gambling addictive behavior. Cronbach's alpha coefficient for the 25 items was .94. Scale scores identified adolescents as being in a problem gambling group, a non-problem gambling group, and a non-gambling group by the ROC curve. The above findings indicate that the gambling addictive behavior scale has good validity and reliability and can be used with adolescents in Korea.
Development of the Holistic Nursing Competence Scale.
Takase, Miyuki; Teraoka, Sachiko
2011-12-01
This study developed a scale to measure the nursing competence of Japanese registered nurses and tested its psychometric properties. Following the derivation of scale items and pilot testing, the final version of the scale was administered to 331 nurses to establish its internal consistency, as well as its construct and criterion-related validity. Using an exploratory and a confirmatory factor analysis, 36 items with a five-factor structure were retained to form the Holistic Nursing Competence Scale. These factors illustrate nurses' general aptitude and their competencies in staff education and management, ethical practice, the provision of nursing care, and professional development. The Scale has a positive correlation with the length of clinical experience. The Cronbach's alpha coefficient was 0.967. The Scale is a reliable and valid measure, helping both nurses and organizations to correctly evaluate nurses' competence and identify their needs for professional development. © 2011 Blackwell Publishing Asia Pty Ltd.
The development of the "Cantonese receptive vocabulary test" for children aged 2-6 in Hong Kong.
Cheung, P S; Lee, K Y; Lee, L W
1997-01-01
The study aims to develop a Cantonese receptive vocabulary test to assess 2-6-year-old children in Hong Kong. The test consists of 100 test items. Each target item is accompanied by a phonological distractor, a semantic distractor and an unrelated distractor. A sample of 609 normal children from four Maternal and Child Health Centres and nine kindergartens was selected. The results show that there is a significant effect of age on the correct score. ANOVA was performed to look at the age effect on each distractor individually. It was found that the scores of the three distractors decrease in their own patterns as age increases. With strong content validity, strong construct validity and high correlation coefficients in the split-half reliability, this test could be used as a reliable measurement for the Cantonese-speaking population in Hong Kong.
2014-01-01
Background: Unintentional injuries are the major cause of morbidity and mortality in infants. Prevention of unintentional injuries has been shown to be effective with education. Understanding the level of knowledge and practices of caregivers in infant safety would be useful to identify gaps for improvement. Methods: A cross-sectional study was conducted in an urban government health clinic in Malaysia among main caregivers of infants aged 11 to 15 months. Face-to-face interviews were conducted using a semi-structured self-designed questionnaire. Responses to the items were categorised by the percentage of correct answers: poor (<50%), moderate (50% – 70%) and good (>70%). Results: A total of 403 caregivers participated in the study. Of the 21 items in the questionnaire on knowledge, 19 had good-to-moderate responses and two had poor responses. The two items on knowledge with poor responses were on the use of infant walkers (26.8%) and allowing infants on motorcycles as pillion riders (27.3%). Self-reported practice of infant safety was poor. None of the participants followed all 19 safety practices measured. Eight (42.1%) items on self-reported practices had poor responses. The worst three of these were on the use of baby cots (16.4%), avoiding the use of infant walkers (23.8%) and putting infants to sleep in the supine position (25.6%). Better knowledge was associated with self-reported safety practices in infants (p < 0.05). However, knowledge did not correspond to correct practice, particularly on the use of baby cots, infant walkers and sarong cradles. Conclusion: Main caregivers' knowledge on infant safety was good but self-reported practice was poor. Further research in the future is required to identify interventions that target these potentially harmful practices. PMID:24885332
Ramdzan, Siti Nurkamilla; Liew, Su May; Khoo, Ee Ming
2014-05-29
Unintentional injuries are the major cause of morbidity and mortality in infants. Prevention of unintentional injuries has been shown to be effective with education. Understanding the level of knowledge and practices of caregivers in infant safety would be useful to identify gaps for improvement. A cross-sectional study was conducted in an urban government health clinic in Malaysia among main caregivers of infants aged 11 to 15 months. Face-to-face interviews were conducted using a semi-structured self-designed questionnaire. Responses to the items were categorised by the percentage of correct answers: poor (<50%), moderate (50% - 70%) and good (>70%). A total of 403 caregivers participated in the study. Of the 21 items in the questionnaire on knowledge, 19 had good-to-moderate responses and two had poor responses. The two items on knowledge with poor responses were on the use of infant walkers (26.8%) and allowing infants on motorcycles as pillion riders (27.3%). Self-reported practice of infant safety was poor. None of the participants followed all 19 safety practices measured. Eight (42.1%) items on self-reported practices had poor responses. The worst three of these were on the use of baby cots (16.4%), avoiding the use of infant walkers (23.8%) and putting infants to sleep in the supine position (25.6%). Better knowledge was associated with self-reported safety practices in infants (p < 0.05). However, knowledge did not correspond to correct practice, particularly on the use of baby cots, infant walkers and sarong cradles. Main caregivers' knowledge on infant safety was good but self-reported practice was poor. Further research in the future is required to identify interventions that target these potentially harmful practices.
Roaldsen, Kirsti Skavberg; Måøy, Åsa Blad; Jørgensen, Vivien; Stanghelle, Johan Kvalvik
2016-05-01
Translation of the Spinal Cord Injury Falls Concern Scale (SCI-FCS), and investigation of test-retest reliability at item level and total-score level. Translation, adaptation and test-retest study. A specialized rehabilitation setting in Norway. Fifty-four wheelchair users with a spinal cord injury. The median age of the cohort was 49 years, and the median number of years after injury was 13. Interventions/measurements: The SCI-FCS was translated and back-translated according to guidelines. Individuals answered the SCI-FCS twice over the course of one week. We investigated item-level test-retest reliability using Svensson's rank-based statistical method for disagreement analysis of paired ordinal data. At the total-score level, we assessed relative reliability with intraclass correlation coefficients (ICC2.1), absolute reliability (measurement error) with the standard error of measurement (SEM) and the smallest detectable change (SDC), and internal consistency with Cronbach's alpha. All items showed satisfactory percentage agreement (≥69%) between test and retest. There were small but non-negligible systematic disagreements among three items; we found an 11-13% higher chance of a lower second score. There was no disagreement due to random variance. The test-retest agreement (ICC2.1) was excellent (0.83). The SEM was 2.6 (12%), and the SDC was 7.1 (32%). The Cronbach's alpha was high (0.88). The Norwegian SCI-FCS is highly reliable for wheelchair users with chronic spinal cord injuries.
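The relation between the SEM and SDC values reported above can be sketched with the formulas commonly used in reliability studies: SEM = SD·√(1 − ICC) and SDC = 1.96·√2·SEM. Whether the authors used exactly these formulas is an assumption, since the abstract does not state them; the function names and the standard deviation in the first example are hypothetical.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from a score SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)


def sdc_from_sem(sem: float, z: float = 1.96) -> float:
    """Smallest detectable change at the 95% level for a test-retest design."""
    return z * math.sqrt(2.0) * sem


# Hypothetical SD of 10 points with an ICC of 0.75 gives an SEM of 5 points.
example_sem = sem_from_icc(sd=10.0, icc=0.75)

# Using the rounded SEM of 2.6 reported in the abstract.
sdc = sdc_from_sem(2.6)
print(round(sdc, 1))
```

With the rounded SEM the formula gives roughly 7.2, close to the reported 7.1; the small difference would be consistent with the authors working from an unrounded SEM.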
The revised Generalized Expectancy for Success Scale: a validity and reliability study.
Hale, W D; Fiedler, L R; Cochran, C D
1992-07-01
The Generalized Expectancy for Success Scale (GESS; Fibel & Hale, 1978) was revised and assessed for reliability and validity. The revised version was administered to 199 college students along with other conceptually related measures, including the Rosenberg Self-Esteem Scale, the Life Orientation Test, and Rotter's Internal-External Locus of Control Scale. One subsample of students also completed the Eysenck Personality Inventory, while another subsample performed a criterion-related task that involved risk taking. Item analysis yielded 25 items with correlations of .45 or higher with the total score. Results indicated high internal consistency and test-retest reliability.
Studying Irony Detection Beyond Ironic Criticism: Let's Include Ironic Praise
Bruntsch, Richard; Ruch, Willibald
2017-01-01
Studies of irony detection have commonly used ironic criticisms (i.e., mock positive evaluation of negative circumstances) as stimulus materials. Another basic type of verbal irony, ironic praise (i.e., mock negative evaluation of positive circumstances), is largely absent from studies on individuals' aptitude to detect verbal irony. However, it can be argued that ironic praise needs to be considered in order to investigate the detection of irony in the variety of its facets. To explore whether the detection of ironic praise has a benefit beyond ironic criticism, three studies were conducted. In Study 1, an instrument (Test of Verbal Irony Detection Aptitude; TOVIDA) was constructed and its factorial structure was tested using N = 311 subjects. The TOVIDA contains 26 scenario-based items and two scales for the detection of ironic criticism vs. ironic praise. To validate the measurement method, the two scales of the TOVIDA were experimentally evaluated with N = 154 subjects in Study 2. In Study 3, N = 183 subjects were tested to explore personality and ability correlates of the two TOVIDA scales. Results indicate that the co-variance between the ironic TOVIDA items was organized by two inter-correlated but distinct factors: one representing ironic praise detection aptitude and one representing ironic criticism detection aptitude. Experimental validation showed that the TOVIDA items truly contain irony and that item scores reflect irony detection. Trait bad mood and benevolent humor (as a facet of the sense of humor) were found as joint correlates for both ironic criticism and ironic praise detection scores. In contrast, intelligence, trait cheerfulness, and corrective humor were found as unique correlates of ironic praise detection scores, even when statistically controlling for the aptitude to detect ironic criticism. Our results indicate that the aptitude to detect ironic praise can be seen as distinct from the aptitude to detect ironic criticism. 
Generating unique variance in irony detection, ironic praise can be postulated as worthwhile to include in future studies—especially when studying the role of mental ability, personality, and humor in irony detection. PMID:28484409
Connor, Linda; Paul, Fiona; McCabe, Margaret; Ziniel, Sonja
2017-02-01
The Quick-EBP-VIK is a new instrument for measuring nurses' value, implementation, and knowledge of EBP. Psychometric testing was conducted in two parts. Part 1 describes the tool development and validity testing, which resulted in a 25-item survey after items achieved an Item-Level Content Validity Index of ≥0.80 for both clarity and relevance. Part 2 describes the psychometric testing that was necessary to assess additional types of validity and reliability. The purpose of this paper is to further describe the psychometric testing of the Quick-EBP-VIK survey instrument. This descriptive study was designed to assess test-retest reliability, internal consistency and construct validity via a web-based survey. The survey instrument was e-mailed to all nurses at the study hospital. Nurses who responded to the first survey (Wave 1) received another e-mail invitation to complete the survey instrument again (Wave 2) for the purpose of assessing the test-retest reliability of the instrument. A total of 1,177 deliverable e-mails were sent to all nursing staff at one freestanding pediatric hospital with Magnet® designation in the northeast. A total of 382 nurses returned completed surveys, indicating a 32.5% response rate for Wave 1. A total of 131 nurses responded to Wave 2, indicating a response rate of 34.3%. The intraclass correlation coefficients for the items included in the final instrument ranged from 0.43 to 0.80 and were deemed sufficient. The Cronbach's alpha values for each of the three domains are all higher than 0.7, indicating that the items of each measurement dimension are internally consistent. However, the composite reliability of the third domain was slightly lower than 0.7 when using Raykov's rho. The Quick-EBP-VIK instrument has gone through rigorous comprehensive testing and has demonstrated good psychometric properties. © 2016 Sigma Theta Tau International.
Ghezeljeh, Tahereh Najafi; Ardebili, Fatimah Mohades; Rafii, Forough; Hagani, Hamid
2013-09-01
Burn injury, as a traumatic life event, causes severe pain and psychological problems. Specific instruments are needed to evaluate burn patients' psychological issues related to the injury. The aim of this study was to translate and evaluate the reliability and validity of the Persian versions of the Burn Specific Pain Anxiety Scale (BSPAS) and the Impact of Event Scale (IES). In this cross-sectional study, a convenience sampling method was used to select 55 Iranian hospitalized burn patients. Combined translation was used for translating the scales. Cronbach's alpha, item-total correlation, and convergent and discriminative validity were evaluated. The Cronbach's α for both the BSPAS- and IES-Persian versions was 0.96. Item-total correlation coefficients ranged from 0.70 to 0.90. Convergent construct validity was confirmed by a high correlation between the scales designed to measure the same concepts. The mean scores of the BSPAS- and IES-Persian versions were lower for individuals with a lower TBSA burn percentage, which supported the discriminative construct validity of the scales. The BSPAS- and IES-Persian versions showed high internal consistency and good validity for the assessment of burn psychological outcome in hospitalized burn patients. Future studies are needed to determine repeatability, factor structure, sensitivity and specificity of the scales. Copyright © 2013 Elsevier Ltd and ISBI. All rights reserved.