MAYESKE, GEORGE W.; AND OTHERS
THIS REPORT PRESENTS THE TABULATIONS AND ANALYSES OF RESPONSES TO EACH ITEM OF THE PRINCIPAL QUESTIONNAIRE THAT WAS ADMINISTERED AS PART OF THE EDUCATIONAL OPPORTUNITIES SURVEY. THE ITEM ANALYSES OF THESE DATA WERE CONDUCTED (1) TO PRESENT THE NUMBER AND PERCENTAGE OF ELEMENTARY AND SECONDARY SCHOOL PRINCIPALS RESPONDING TO EACH ITEM ALTERNATIVE,…
Klainin-Yobas, Piyanee; He, Hong-Gu
This study aimed to evaluate the psychometric properties of the General Health Questionnaire (GHQ-30) given conflicting findings in the literature. A cross-sectional, nonexperimental design was used with a convenience sample of 271 American female health care professionals. Data were collected by using self-reported questionnaires. A series of exploratory factor analyses (EFAs), confirmatory factor analyses (CFAs), and structural equation modeling (SEM) were performed to examine underlying dimensions of the GHQ-30. Results from EFAs and CFAs revealed a three-factor composition (positive affect, anxiety, and depressed mood). All factor loadings were statistically significant, and one pair of error variances was allowed to correlate. All factors contained questionnaire items with acceptable face validity and demonstrated good internal consistency reliability. Results from SEM further confirmed underlying constructs of the scale. To our knowledge, this is the first study that extensively tested the psychometric properties of the GHQ-30, taking both statistical and substantive issues into consideration.
... 15 Commerce and Foreign Trade 2 2012-01-01 2012-01-01 false Technical Questionnaire for Encryption Items No. Supplement No. 6 to Part 742 Commerce and Foreign Trade Regulations Relating to Commerce and... Questionnaire for Encryption Items (a) For all encryption items: (1) State the name(s) of each product...
Gottschall, Amanda C.; West, Stephen G.; Enders, Craig K.
Behavioral science researchers routinely use scale scores that sum or average a set of questionnaire items to address their substantive questions. A researcher applying multiple imputation to incomplete questionnaire data can either impute the incomplete items prior to computing scale scores or impute the scale scores directly from other scale…
Gao, Yong; Zhu, Weimo
Using differential item functioning (DIF) analyses, this study examined whether there were any DIF items in the National Health and Nutrition Examination Survey (NHANES) physical activity (PA) questionnaire. A subset of adult data from the 2003-04 NHANES study (n = 3,083) was used. PA items related to respondents' occupational, transportation,…
Muraki, Eiji; Engelhard, George, Jr.
Recent developments in dichotomous factor analysis based on multidimensional item response models (Bock and Aitkin, 1981; Muthen, 1978) provide an effective method for exploring the dimensionality of questionnaire items. Implemented in the TESTFACT program, this "full information" item factor analysis accounts not only for the pairwise joint…
[Purpose] The purpose of this study was to design a physical activity questionnaire reflecting the basic principles and recommendations of exercise and to examine its reliability. [Subjects and Methods] 342 males and 374 females from community centers (senior center, residential culture center, sport center, and YWCA center) participated in this study. [Results] The test-retest reliability of the physical activity questionnaire, measured at an interval of three months, ranged from 0.61 to 0.91, indicating that the questionnaire was a useful instrument for assessing physical activity levels. [Conclusion] This study found that the simple physical activity questionnaire, covering the frequency, duration, intensity, overall length, and type of activities that a person performed during leisure time, was reliable. PMID:28174459
Brown, Anna; Maydeu-Olivares, Alberto
Multidimensional forced-choice formats can significantly reduce the impact of numerous response biases typically associated with rating scales. However, if scored with classical methodology, these questionnaires produce ipsative data, which lead to distorted scale relationships and make comparisons between individuals problematic. This research…
Watson, Kathy; Baranowski, Tom; Thompson, Debbe
Perceived self-efficacy (SE) for eating fruit and vegetables (FV) is a key variable mediating FV change in interventions. This study applies item response modeling (IRM) to a fruit, juice and vegetable self-efficacy questionnaire (FVSEQ) previously validated with classical test theory (CTT) procedures. The 24-item (five-point Likert scale) FVSEQ…
Michigan State Dept. of Education, Lansing. Research, Evaluation, and Assessment Services.
Background and attitude questionnaire items used in the Michigan Educational Assessment battery to measure socioeconomic status and attitudes toward self, school, and the importance of school achievement are presented. A priori weights for item responses are provided. (For related document, see TM 002 329.) (KM)
WEINFELD, FREDERIC D.; AND OTHERS
THIS REPORT PRESENTS THE ANALYSIS OF QUESTIONNAIRE ITEM RESPONSES FROM THE NINTH-GRADE STUDENT QUESTIONNAIRE ADMINISTERED AS PART OF THE EDUCATIONAL OPPORTUNITIES SURVEY. THE ANALYSES WERE PERFORMED TO DOCUMENT SOME OF THE BASIC DATA FROM THE SURVEY, TO MAKE THEM AVAILABLE TO INTERESTED EDUCATIONAL RESEARCHERS, AND TO REWORK THE BASIC DATA FOR…
Kelly, Laura; Jenkinson, Crispin; Dummett, Sarah; Dawson, Jill; Fitzpatrick, Ray; Morley, David
Purpose: The Oxford Participation and Activities Questionnaire is a patient-reported outcome measure in development that is grounded on the World Health Organization International Classification of Functioning, Disability, and Health (ICF). The study reported here aimed to inform and generate an item pool for the new measure, which is specifically designed for the assessment of participation and activity in patients experiencing a range of health conditions. Methods: Items were informed through in-depth interviews conducted with 37 participants spanning a range of conditions. Interviews aimed to identify how their condition impacted their ability to participate in meaningful activities. Conditions included arthritis, cancer, chronic back pain, diabetes, motor neuron disease, multiple sclerosis, Parkinson’s disease, and spinal cord injury. Transcripts were analyzed using the framework method. Statements relating to ICF themes were recast as questionnaire items and shown for review to an expert panel. Cognitive debrief interviews (n=13) were used to assess items for face and content validity. Results: ICF themes relevant to activities and participation in everyday life were explored, and a total of 222 items formed the initial item pool. This item pool was refined by the research team and 28 generic items were mapped onto all nine chapters of the ICF construct, detailing activity and participation. Cognitive interviewing confirmed the questionnaire instructions, items, and response options were acceptable to participants. Conclusion: Using a clear conceptual basis to inform item generation, 28 items have been identified as suitable to undergo further psychometric testing. A large-scale postal survey will follow in order to refine the instrument further and to assess its psychometric properties. The final instrument is intended for use in clinical trials and interventions targeted at maintaining or improving activity and participation. PMID:26056503
Bozeman, D P; Perrewé, P L
This study examined the effect of overlapping scale content when certain items in the Organizational Commitment Questionnaire (OCQ) are used to predict turnover cognition measures. Analyses of judgmental data collected from 25 subject matter experts suggested that 6 OCQ items reflected a desire or an intent to retain membership in one's organization. Confirmatory factor analyses of survey data from 172 master of business administration alumni showed that the 6 OCQ retention items shared overlapping content with turnover cognitions items. Hierarchical multiple regression analyses of survey data from 330 hotel managers showed that (a) removing the 6 OCQ retention items caused a significant decrease in the variance explained in a measure of turnover cognitions and (b) the size of this effect is larger than that suggested by previous work.
Doi, Yuriko; Minowa, Masumi
The 12-item General Health Questionnaire (GHQ-12) has been extensively used in a variety of settings across countries. The main aim of the present study was to assess the factor structure of the GHQ-12 for the Japanese general adult population. Data came from a sample of 1808 Japanese aged 20 years or older who were randomly selected based on the 1995 census (897 men and 911 women). Cronbach's alpha coefficients were 0.83 for men and 0.85 for women. Overall, the corrected item-total correlation coefficients were >0.20 for both genders. The GHQ-12 yielded a two-factor solution of psychological distress (items 2, 5, 6, 9, 10 and 11) and social dysfunction (items 1, 3, 4, 7 and 8), which jointly accounted for 49.1% of the total variance, for women. Item 12 on happiness was not discernable. For men, item 12 was separated from a social dysfunction factor and yielded the third factor with item 3 on social role, and the three factors jointly accounted for 57.6%. The results of the present study suggest that the GHQ-12 can be used as an internally reliable and homogeneous scale that produces mainly the factors of psychological distress and social dysfunction. Item 12 may be structurally different in the case of Japanese adults.
Gelin, Michaela N.; Carleton, Bruce C.; Smith, M. Anne; Zumbo, Bruno D.
The present study investigated the factor structure and item analysis of the Mini Asthma Quality of Life Questionnaire (MiniAQLQ) in a sample of 258 community-dwelling asthmatic adults between the ages of 16 and 87 years. The mean age was 56 years for males (N = 99) and 50 years for females (N = 159). This study compared the fit of three factor…
Van Dam, Nicholas T.; Hobkirk, Andrea L.; Danoff-Burg, Sharon; Earleywine, Mitch
Mindfulness, a construct that entails moment-to-moment effort to be aware of present experiences and positive attitudinal features, has become integrated into the sciences. The Five Facet Mindfulness Questionnaire (FFMQ), one popular measure of mindfulness, exhibits different responses to positively and negatively worded items in nonmeditating…
Eys, Mark A; Carron, Albert V; Bray, Steven R; Brawley, Lawrence R
A common practice for counteracting response acquiescence in psychological measures has been to employ both negatively and positively worded items. However, previous research has highlighted that the reliability of measures can be affected by this practice (Spector, 1992). The purpose of the present study was to examine the effect that the presence of negatively worded items has on the internal reliability of the Group Environment Questionnaire (GEQ). Two samples (N = 276) were utilized, and participants were asked to complete the GEQ (original and revised) on separate occasions. Results demonstrated that the revised questionnaire (containing all positively worded items) had significantly higher Cronbach alpha values for three of the four dimensions of the GEQ. Implications, alternatives, and future directions are discussed.
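Cronbach's alpha, the internal-consistency statistic compared in the abstract above, is commonly computed as k/(k-1) * (1 - sum of item variances / variance of total scores), where k is the number of items. The sketch below is only an illustration under stated assumptions: the 1-to-5 response range, the function names, and the toy responses are hypothetical and do not come from the GEQ or the study. It shows why negatively worded items are reverse-scored before alpha is computed.

```python
from statistics import variance


def reverse_score(value, low=1, high=5):
    """Flip a response on an assumed low..high scale (e.g. 2 -> 4 on 1..5)."""
    return low + high - value


def reverse_items(rows, items, low=1, high=5):
    """Return a copy of rows with the listed item columns reverse-scored."""
    return [
        [reverse_score(v, low, high) if i in items else v for i, v in enumerate(row)]
        for row in rows
    ]


def cronbach_alpha(rows):
    """Cronbach's alpha from a respondents x items table (sample variances)."""
    k = len(rows[0])
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical data: the third item is negatively worded (not study data).
raw = [[1, 1, 5], [2, 2, 4], [3, 3, 3]]
alpha_raw = cronbach_alpha(raw)
alpha_recoded = cronbach_alpha(reverse_items(raw, {2}))
```

In this toy example, alpha is negative before the third item is recoded and reaches its maximum afterward, which is why recoding precedes any reliability comparison.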
Peters, Michele; Potter, Caroline M; Kelly, Laura; Hunter, Cheryl; Gibbons, Elizabeth; Jenkinson, Crispin; Coulter, Angela; Forder, Julien; Towers, Ann-Marie; A’Court, Christine; Fitzpatrick, Ray
Purpose: To identify the main issues of importance when living with long-term conditions to refine a conceptual framework for informing the item development of a patient-reported outcome measure for long-term conditions. Materials and methods: Semi-structured qualitative interviews (n=48) were conducted with people living with at least one long-term condition. Participants were recruited through primary care. The interviews were transcribed verbatim and analyzed by thematic analysis. The analysis served to refine the conceptual framework, based on reviews of the literature and stakeholder consultations, for developing candidate items for a new measure for long-term conditions. Results: Three main organizing concepts were identified: impact of long-term conditions, experience of services and support, and self-care. The findings helped to refine a conceptual framework, leading to the development of 23 items that represent issues of importance in long-term conditions. The 23 candidate items formed the first draft of the measure, currently named the Long-Term Conditions Questionnaire. Conclusion: The aim of this study was to refine the conceptual framework and develop items for a patient-reported outcome measure for long-term conditions, including single and multiple morbidities and physical and mental health conditions. Qualitative interviews identified the key themes for assessing outcomes in long-term conditions, and these underpinned the development of the initial draft of the measure. These initial items will undergo cognitive testing to refine the items prior to further validation in a survey. PMID:27621678
Bodenburg, Sebastian; Dopslaff, Nina
The Dysexecutive Questionnaire (DEX; Behavioral assessment of the dysexecutive syndrome, 1996) is a standardized instrument to measure possible behavioral changes as a result of the dysexecutive syndrome. Although initially intended only as a qualitative instrument, the DEX has also been used increasingly to address quantitative problems. Until now, no fundamental statistical analyses of the questionnaire's test quality have been available. The present study is based on an unselected sample of 191 patients with acquired brain injury and reports data on the quality of the items, the reliability, and the factorial structure of the DEX. Item 3 displayed too great an item difficulty, whereas item 11 was not sufficiently discriminating. The DEX's reliability in self-rating is r = 0.85. In addition to presenting the statistical values of the tests, a clinical severity classification of the overall scores of the 4 factors found and of the questionnaire as a whole is carried out on the basis of quartile standards.
Murray, Aja Louise; Booth, Tom; McKenzie, Karen
The Learning Disability Screening Questionnaire (LDSQ; McKenzie & Paxton, 2006) was developed as a brief screen for intellectual disability. Although several previous studies have evaluated the LDSQ with respect to its utility as a clinical and research tool, no studies have considered the fairness of the test across males and females. In the current study we, therefore, used a multi-group item response theory approach to assess differential item functioning across gender in a sample of 211 males and 132 females assessed in clinical and forensic settings. Although the test did not show evidence of differential item functioning by gender, it was necessary to exclude one item due to estimation problems and to combine two very highly related items (concerning reading and writing ability) into a single literacy item. Thus, in addition to being generally supportive of the utility of the LDSQ, our results also highlight possible areas of weakness in the tool and suggest possible amendments that could be made to test content to improve the test in future revisions.
Delgado-Gomez, David; Lopez-Castroman, Jorge; de Leon-Martinez, Victoria; Baca-Garcia, Enrique; Cabanas-Arrate, Maria Luisa; Sanchez-Gonzalez, Antonio; Aguado, David
There is a need to assess the psychiatric morbidity that appears as a consequence of terrorist attacks. The General Health Questionnaire (GHQ) has been used to this end, but its psychometric properties have never been evaluated in a population affected by terrorism. A sample of 891 participants included 162 direct victims of terrorist attacks and 729 relatives of the victims. All participants were evaluated using the 28-item version of the GHQ (GHQ-28). We examined the reliability and external validity of scores on the scale using Cronbach's alpha and Pearson correlation with the State-Trait Anxiety Inventory (STAI), respectively. The factor structure of the scale was analyzed with varimax rotation. Samejima's (1969) graded response model was used to explore the item properties. The GHQ-28 scores showed good reliability and item-scale correlations. The factor analysis identified 3 factors: anxious-somatic symptoms, social dysfunction, and depression symptoms. All factors showed good correlation with the STAI. Before rotation, the first, second, and third factor explained 44.0%, 6.4%, and 5.0% of the variance, respectively. Varimax rotation redistributed the percentages of variance accounted for to 28.4%, 13.8%, and 13.2%, respectively. Items with the highest loadings in the first factor measured anxiety symptoms, whereas items with the highest loadings in the third factor measured suicide ideation. Samejima's model found that high scores in suicide-related items were associated with severe depression. The factor structure of the GHQ-28 found in this study underscores the preeminence of anxiety symptoms among victims of terrorism and their relatives. Item response analysis identified the most difficult and significant items for each factor.
Petersen, Morten Aa; Groenvold, Mogens; Bjorner, Jakob B; Aaronson, Neil; Conroy, Thierry; Cull, Ann; Fayers, Peter; Hjermstad, Marianne; Sprangers, Mirjam; Sullivan, Marianne
In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents representing 10 European languages we tested for DIF in the nine translations of the EORTC QLQ-C30 emotional function scale when compared to the original English version. We tested for DIF using two different methods in parallel, a contingency table method and logistic regression. The DIF results obtained with the two methods were similar. We found indications of DIF in seven of the nine translations. At least two of the DIF findings seem to reflect linguistic problems in the translation. 'Imperfect' translations can affect conclusions drawn from cross-national comparisons. Given that translations can never be identical to the original we discuss how findings of DIF can be interpreted and discuss the difference between linguistic DIF and DIF caused by confounding, cross-cultural differences, or DIF in other items in the scale. We conclude that testing for DIF is a useful way to validate questionnaire translations.
Chen, Wei; Shu, Liang; Wang, Qian; Pan, Hui; Wu, Jing; Fang, Jie; Sun, Xu-Hong; Zhai, Yu; Dong, You-Rong; Liu, Jian-Ren
The Dizziness Handicap Inventory (DHI) sub-scales (5-item and 2-item) and total score are possible candidate screening instruments for benign paroxysmal positional vertigo (BPPV), but studies validating them are rare in China. From May 2014 to December 2014, 108 patients (55 with and 53 without BPPV) complaining of episodic vertigo in the past week were enrolled from a vertigo outpatient clinic for DHI evaluation, and demographic and other clinical data were collected. Objective BPPV was subsequently determined by positional evoking maneuvers recorded with optical Frenzel glasses. Cronbach's coefficient α was used to evaluate the reliability of psychometric scales. The validity of the DHI total, 5-item and 2-item questionnaires to screen for BPPV was assessed by receiver operating characteristic (ROC) curves. The DHI 5-item questionnaire had good internal consistency (Cronbach's coefficient α = 0.72). The area under the curve of the total DHI, 5-item and 2-item scores for discriminating patients with BPPV from those without was 0.678 (95% CI 0.578-0.778), 0.873 (95% CI 0.807-0.940) and 0.895 (95% CI 0.836-0.953), respectively. A cutoff value of 12 on the 5-item questionnaire gave 74.5% sensitivity and 88.7% specificity for separating patients with BPPV from those without. The corresponding sensitivity and specificity were 78.2% and 88.7%, respectively, with a cutoff value of 6 on the 2-item questionnaire. The present study indicated that both the 5-item and 2-item questionnaires in the Chinese version of the DHI may be more valid than the DHI total score for screening objective BPPV and merit further application in clinical practice in China.
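The screening statistics in the abstract above (sensitivity and specificity at a cutoff, and the area under the ROC curve) can be sketched as follows. This is a minimal illustration, not the authors' analysis: the scores are invented, the function names are hypothetical, and treating a score at or above the cutoff as a positive screen is an assumption, since the abstract does not state the direction of the rule. The AUC is computed with the rank-based (Mann-Whitney) formulation, counting ties as one half.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Sensitivity and specificity when score >= cutoff counts as positive (assumed rule)."""
    pairs = list(zip(scores, has_condition))
    tp = sum(1 for s, y in pairs if y and s >= cutoff)
    fn = sum(1 for s, y in pairs if y and s < cutoff)
    tn = sum(1 for s, y in pairs if not y and s < cutoff)
    fp = sum(1 for s, y in pairs if not y and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)


def auc(scores, has_condition):
    """Probability that a random positive case outscores a random negative case."""
    pos = [s for s, y in zip(scores, has_condition) if y]
    neg = [s for s, y in zip(scores, has_condition) if not y]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Invented sub-scale scores for 8 hypothetical patients (not study data).
scores = [14, 12, 8, 16, 4, 10, 12, 2]
has_condition = [True, True, True, True, False, False, False, False]

sens, spec = sensitivity_specificity(scores, has_condition, cutoff=12)
area = auc(scores, has_condition)
```

Sweeping the cutoff over all observed scores and plotting sensitivity against 1 - specificity would trace the ROC curve whose area `auc` summarizes.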
Fukuhara, Shunichi; Wakita, Takafumi; Yamada, Masakazu; Hiratsuka, Yoshimune; Green, Joseph; Oki, Kotaro
Purpose: In clinical ophthalmology as in other fields, measuring patient-reported outcomes imposes a burden on patients. To decrease that burden, we used item-response theory (IRT) to develop and test a short version of the National Eye Institute's Visual Function Questionnaire (VFQ). Methods: We analyzed VFQ data from 276 adults in Japan. Most of them had glaucoma, cataract, or macular degeneration. Their visual acuity (Snellen fraction) averaged 20/120 (range: 20/13 to 20/2000) for the better eye, and 20/200 (range: 20/13 to 20/2000) for the worse eye. We used a polytomous IRT model, the Generalized Partial Credit Model as implemented in software for parameter scaling of rating data (PARSCALE). To select items for inclusion in the short version we examined each item's location on the latent-trait continuum, its slope, and its frequency of missing data. We also ensured representation of all 7 domains that are important in Japan. To examine the characteristics of the resulting scale, we computed its test information (an index of precision that can vary with the value of the latent trait), and carried out validation testing. Results: From 32 of the original VFQ items, we selected 11. The scale comprising those 11 items (the VFQ-J11) had test information greater than 9 for values of the latent trait between −2.0 and +0.8. The item thresholds were well-targeted for patients with vision problems. Scores on the VFQ-J11 correlated strongly and in the expected direction with measures of visual field and corrected visual acuity. As expected for a valid measure, those scores also improved by a large amount (almost one standard deviation) after cataract surgery. Conclusion: This 11-item instrument can provide reliable and valid data on visual functioning in patients with ophthalmic problems. It is expected to be less of a burden on respondents, while it maintains good psychometric properties. PMID:24069172
Petkovska, Miodraga Stefanovska; Bojadziev, Marjan I.; Stefanovska, Vesna Velikj
AIM: The aim of the study is to analyze the internal consistency, validity and factor structure of the twelve-item General Health Questionnaire for the Macedonian general population. MATERIAL AND METHODS: Data came from a nationally representative sample of 1603 randomly selected Macedonians, all aged 18 years or older. RESULTS: The mean GHQ score in the general sample was found to be 7.9 (SD = 4.3). The results revealed a higher GHQ score among women (M = 8.91, SD = 4.5) compared to men (M = 6.89, SD = 4.2). The participants from rural areas obtained a lower GHQ score (M = 7.55, SD = 3.8) compared to participants from urban areas (M = 9.37, SD = 4.1). A principal component analysis with oblique rotation (direct oblimin) and a maximum likelihood procedure was performed, and the results yielded a three-factor solution which jointly accounted for 57.17% of the total variance: Factor I, named social management (items 1, 3, 4, 6, 7 and 8); Factor II, named stress (items 2, 5 and 9); and Factor III, named self-confidence (items 10, 11 and 12). Its factor structure is in line with representative research from other population groups. CONCLUSION: The GHQ-12 can be used effectively for assessment of the overall psychological well-being and detection of non-psychotic psychiatric problems among the Macedonian population. PMID:27275274
Falkenström, Fredrik; Hatcher, Robert L; Skjulsvik, Tommy; Larsson, Mattias Holmqvist; Holmqvist, Rolf
Recently, researchers have started to measure the working alliance repeatedly across sessions of psychotherapy, relating the working alliance to symptom change session by session. Responding to questionnaires after each session can become tedious, leading to careless responses and/or increasing levels of missing data. Therefore, assessment with the briefest possible instrument is desirable. Because previous research on the Working Alliance Inventory has found the separation of the Goal and Task factors problematic, the present study examined the psychometric properties of a 2-factor, 6-item working alliance measure, adapted from the Working Alliance Inventory, in 3 patient samples (ns = 1,095, 235, and 234). Results showed that a bifactor model fit the data well across the 3 samples, and the factor structure was stable across 10 sessions of primary care counseling/psychotherapy. Although the bifactor model with 1 general and 2 specific factors outperformed the 1-factor model in terms of model fit, dimensionality analyses based on the bifactor model results indicated that in practice the instrument is best treated as unidimensional. Results support the use of composite scores of all 6 items. The instrument was validated by replicating previous findings of session-by-session prediction of symptom reduction using the Autoregressive Latent Trajectory model. The 6-item working alliance scale, called the Session Alliance Inventory, is a promising alternative for researchers in search of a brief alliance measure to administer after every session.
Little research has been conducted on the psychometrics of the very short scale (36 items) of the Children’s Behavior Questionnaire, and no one-item temperament scale has been tested for use in applied work. In this study, 237 United States caregivers completed a survey to define their child’s behav...
Background: Patient experience is a key feature of quality improvement in modern health-care delivery. Measuring patient experience is one of several tools used to assess and monitor the quality of health services. This study aims to develop a tool for assessing patient experience with inpatient care in public hospitals in Hong Kong. Methods: Based on the General Inpatient Questionnaire (GIQ) framework of the Care Quality Commission as a discussion guide, a qualitative study involving focus group discussions and in-depth individual interviews with patients was employed to develop a tool for measuring inpatient experience in Hong Kong. Results: All participants agreed that a patient satisfaction survey is an important platform for collecting patients’ views on improving the quality of health-care services. Findings of the focus group discussions and in-depth individual interviews identified nine key themes as important hospital quality indicators: prompt access, information provision, care and involvement in decision making, physical and emotional needs, coordination of care, respect and privacy, environment and facilities, handling of patient feedback, and overall care from health-care professionals and quality of care. Privacy, complaint mechanisms, patient involvement, and information provision were further highlighted as particularly important areas for item revision by the in-depth individual interviews. Thus, the initial version of the Hong Kong Inpatient Experience Questionnaire (HKIEQ), comprising 58 core items under nine themes, was developed. Conclusions: A set of dimensions and core items of the HKIEQ was developed and the instrument will undergo validity and reliability tests through a validation survey. A valid and reliable tool is important in accurately assessing patient experience with care delivery in hospitals to improve the quality of health-care services. PMID:23835186
Ebesutani, Chad; Drescher, Christopher F; Reise, Steven P; Heiden, Laurie; Hight, Terry L; Damon, John D; Young, John
Although reverse-worded items have often been incorporated in scale construction to minimize the effects of acquiescent reporting biases, some researchers have more recently begun questioning this approach and wondering whether the advantages associated with incorporating reverse-worded items are worth the complexities that they bring to measures (e.g., Brown, 2003; Marsh, 1996). In this study, we used item response theory (IRT) to determine whether there is statistical justification to eliminate the reverse-worded items (e.g., "I have lots of friends") from the Loneliness Questionnaire (LQ; Asher, Hymel, & Renshaw, 1984) and retain only the non-reverse-worded items (e.g., "I'm lonely") to inform the provision of a shortened LQ version. Using a large sample of children (Grades 2-7; n = 6,784) and adolescents (Grades 8-12; n = 4,941), we examined the psychometric properties of the 24-item LQ and found support for retaining the 9 non-reverse-worded LQ items to make up a shortened measure of loneliness in youth. We found that the non-reverse-worded items were associated with superior psychometric properties relative to the reverse-worded items with respect to reliability and IRT parameters (e.g., discrimination and item information). A 3-point Likert-type scale was also found to be more suitable for measuring loneliness across both children and adolescents compared to the original 5-point scale. The relative contributions of reverse-worded and non-reverse-worded items in scale development for youth instruments are also discussed.
Wu, Chia-Huei; Chen, Lung Hung
In 2001, Elliot and McGregor proposed a 2 x 2 (mastery-performance x approach-avoidance) achievement goal framework and developed a questionnaire to measure four goals (mastery-approach, mastery-avoidance, performance-approach, and performance-avoidance goals). This study examines the dual meanings of items in 2 x 2 achievement goal…
Escorial, Sergio; Navas, Maria J.
Studies in the field of personality have systematically found gender differences in two of the three dimensions of the Eysenck model: neuroticism and psychoticism. This study aims to analyze these differences in the Eysenck Personality Questionnaire--Revised (EPQ-R) scales using differential item functioning (DIF) techniques to determine whether…
Aguado, Jaume; Campbell, Alistair; Ascaso, Carlos; Navarro, Purificacion; Garcia-Esteve, Lluisa; Luciano, Juan V.
In this study, the authors tested alternative factor models of the 12-item General Health Questionnaire (GHQ-12) in a sample of Spanish postpartum women, using confirmatory factor analysis. The authors report the results of modeling three different methods for scoring the GHQ-12 using estimation methods recommended for categorical and binary data.…
Cooper, Andrew; Petrides, K V
Trait emotional intelligence refers to a constellation of emotional self-perceptions located at the lower levels of personality hierarchies. In 2 studies, we sought to examine the psychometric properties of the Trait Emotional Intelligence Questionnaire-Short Form (TEIQue-SF; Petrides, 2009) using item response theory (IRT). Study 1 (N = 1,119, 455 men) showed that most items had good discrimination and threshold parameters and high item information values. At the global level, the TEIQue-SF showed very good precision across most of the latent trait range. Study 2 (N = 866, 432 men) used similar IRT techniques in a new sample based on the latest version of the TEIQue-SF (version 1.50). Results replicated Study 1, with the instrument showing good psychometric properties at the item and global level. Overall, the 2 studies suggest the TEIQue-SF can be recommended when a rapid assessment of trait emotional intelligence is required.
... submitted for classification or other consideration (as a result of a request by BIS) and provide a brief... describes the item(s). (2) Indicate whether there have been any prior classifications or registrations of... (Commodity Classification Automated Tracking System (CCATS) number, Encryption Registration Number...
Misery, Laurent; Jean-Decoster, Catherine; Mery, Sophie; Georgescu, Victor; Sibaud, Vincent
Sensitive skin is common but until now there has been no scale for measuring its severity. The Sensitive Scale is a new scale with a 14-item and a 10-item version that was tested in 11 countries in different languages on 2,966 participants. The aim of this study was to validate the pertinence of using the Sensitive Scale to measure the severity of sensitive skin. The internal consistency was high. Correlations with the dry skin type, higher age, female gender, fair phototypes and Dermatology Life Quality Index were found. Using the 10-item version appeared to be preferable because it was quicker and easier to complete, with the same internal consistency and the 4 items that were excluded were very rarely observed in patients. The mean initial scores were around 44/140 and 37/100. The use of a cream for sensitive skin showed the pertinence of the scale before and after treatment.
Teachers often ask whether lecture questionnaires are necessary. In this paper, we first present a recent statistical analysis of the official unsigned questionnaire evaluation results collected in our faculty. We have found that: (1) students' evaluation scores for lectures have been rising year by year, which…
Otter, Martha E.; And Others
The ability of 2 components, interpretation of a question and memory, to forecast the test-retest association coefficients of reading test items was studied with initial samples of 916 elementary and 949 secondary school students. For both populations, both components forecast the relative sizes of test-retest correlation coefficients. (SLD)
Sijtsma, Klaas; van der Ark, L. Andries
This article first discusses a statistical test for investigating whether or not the pattern of missing scores in a respondent-by-item data matrix is random. Since this is an asymptotic test, we investigate whether it is useful in small but realistic sample sizes. Then, we discuss two known simple imputation methods, person mean (PM) and two-way…
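The two simple imputation methods named in this abstract can be illustrated with a minimal sketch. It assumes the commonly cited definitions: person mean (PM) replaces a missing score with the respondent's mean over observed items, and two-way imputation uses person mean plus item mean minus overall mean. The truncated abstract does not spell out either formula, so both function names and the toy matrix are hypothetical.

```python
from statistics import mean

def person_mean_impute(matrix):
    """Replace each missing score (None) with that respondent's mean over observed items."""
    out = []
    for row in matrix:
        observed = [x for x in row if x is not None]
        pm = mean(observed)  # assumes every respondent has at least one observed score
        out.append([pm if x is None else x for x in row])
    return out

def two_way_impute(matrix):
    """Replace each missing score with person mean + item mean - overall mean (assumed form)."""
    overall = mean([x for row in matrix for x in row if x is not None])
    n_items = len(matrix[0])
    item_means = [
        mean([row[j] for row in matrix if row[j] is not None])
        for j in range(n_items)
    ]
    out = []
    for row in matrix:
        pm = mean([x for x in row if x is not None])
        out.append([
            pm + item_means[j] - overall if x is None else x
            for j, x in enumerate(row)
        ])
    return out

# Hypothetical respondent-by-item matrix; None marks a missing score.
toy = [[1, 2, None], [3, None, 5], [4, 4, 4]]
pm_filled = person_mean_impute(toy)
tw_filled = two_way_impute(toy)
```

PM ignores item difficulty, whereas the two-way form adjusts the person mean by how the missing item compares with the overall mean.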
Wardenaar, Klaas J; van Veen, Tineke; Giltay, Erik J; de Beurs, Edwin; Penninx, Brenda W J H; Zitman, Frans G
The original Mood and Anxiety Symptoms Questionnaire (MASQ) is a 90-item self-report, designed to measure the dimensions of Clark and Watson's tripartite model. We developed and validated a 30-item short adaptation of the MASQ: the MASQ-D30, which is more suitable for large-scale psychopathology research and has a clearer factor structure. The MASQ-D30 was developed through a process of item reduction and grouping of the appropriate subscales in a sample of 489 psychiatric outpatients, using a validated Dutch translation, based on the original English MASQ, as a starting point. Validation was done in two other large samples of 1461 and 2471 subjects, respectively, with an anxiety, somatoform and/or depression diagnosis or no psychiatric diagnosis. Psychometric properties were investigated and compared between the MASQ-D30 and the full (adapted) MASQ. A three-dimensional model (negative affect, positive affect and somatic arousal) was found to represent the data well, indicating good construct validity. The scales of the MASQ-D30 showed good internal consistency (all alphas>0.87) in patient samples. Correlations of the subscales with other instruments indicated acceptable convergent validity. Psychometric properties were similar for the MASQ-D30 and the full questionnaire. In conclusion, the MASQ-D30 is a valid instrument to assess dimensional aspects of depression and anxiety and can easily be implemented in psychopathology studies.
Delgado-Gomez, David; Lopez-Castroman, Jorge; de Leon-Martinez, Victoria; Baca-Garcia, Enrique; Cabanas-Arrate, Maria Luisa; Sanchez-Gonzalez, Antonio; Aguado, David
There is a need to assess the psychiatric morbidity that appears as a consequence of terrorist attacks. The General Health Questionnaire (GHQ) has been used to this end, but its psychometric properties have never been evaluated in a population affected by terrorism. A sample of 891 participants included 162 direct victims of terrorist attacks and…
Perlman, Baron; And Others
Type A behavior is an aggregate of behaviors associated with increased risk of coronary heart disease. Two self-administered questionnaires used to determine the presence of Type A behavior, the Jenkins Activity Survey and Framingham Type A Behavior Pattern Scale, were administered to 150 undergraduate students at a midwestern university, along…
Roszkowski, Michael J.
The Student Adaptation to College Questionnaire (SACQ) was administered to first year students during the fifth week of their first semester and then again at the end of the semester. The primary purpose was to examine item-remainder correlations to determine if the items are placed correctly into their respective domains. A secondary aim was to…
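An item-remainder correlation is the correlation between one item and the sum of the other items in the same domain. The sketch below shows the calculation on made-up scores. It is not the SACQ scoring procedure, and the function names and data are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation; assumes both lists have nonzero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def item_remainder_correlations(scores):
    """scores: one list of item scores per respondent, all from a single domain."""
    n_items = len(scores[0])
    result = []
    for j in range(n_items):
        item = [row[j] for row in scores]
        # Remainder score: domain total with the item itself left out.
        remainder = [sum(row) - row[j] for row in scores]
        result.append(pearson(item, remainder))
    return result

# Hypothetical data: the third item runs against the other two.
consistent = [[1, 1, 1], [2, 2, 2], [3, 3, 3], [4, 4, 4]]
misplaced = [[1, 2, 5], [2, 2, 3], [3, 4, 2], [4, 5, 1]]
r_consistent = item_remainder_correlations(consistent)
r_misplaced = item_remainder_correlations(misplaced)
```

A low or negative value for an item suggests it may not belong with the other items in that domain.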
Justicia, Fernando; Pichardo, M. Carmen; Cano, Francisco; Berben, A. B. G.; De la Fuente, Jesus
The underlying structure of the Revised Two Factor version of the Study Process Questionnaire (R-SPQ-2F), a 20-item instrument for the evaluation of students' approaches to learning (SAL), was examined at item level using two independent groups of undergraduate students enrolled in the first (n = 314) and last (n = 522) years of their studies. The…
Kliem, Sören; Schmidt, Ricarda; Vogel, Mandy; Hiemisch, Andreas; Kiess, Wieland; Hilbert, Anja
Eating disturbances are common in children placing a vulnerable group of them at risk for full-syndrome eating disorders and adverse health outcomes. To provide a valid self-report assessment of eating disorder psychopathology in children, a short form of the child version of the Eating Disorder Examination (ChEDE-Q) was psychometrically evaluated. Similar to the EDE-Q, the ChEDE-Q provides assessment of eating disorder psychopathology related to anorexia nervosa, bulimia nervosa, and binge-eating disorder; however, the ChEDE-Q does not assess symptoms of avoidant/restrictive food intake disorder, pica, or rumination disorder. In 1,836 participants ages 7 to 18 years, recruited from two independent population-based samples, the factor structure of the recently established 8-item short form EDE-Q8 for adults was examined, including measurement invariance analyses on age, gender, and weight status derived from objectively measured weight and height. For convergent validity, the ChEDE-Q global score, body esteem scale, strengths and difficulties questionnaire, and sociodemographic characteristics were used. Item characteristics and age- and gender-specific norms were calculated. Confirmatory factor analysis revealed good model fit for the 8-item ChEDE-Q. Measurement invariance analyses indicated strict invariance for all analyzed subgroups. Convergent validity was provided through associations with well-established questionnaires and age, gender, and weight status, in expected directions. The newly developed ChEDE-Q8 proved to be a psychometrically sound and economical self-report assessment tool of eating disorder psychopathology in children. Further validation studies are needed, particularly concerning discriminant and predictive validity.
Cook, Karon F.; Choi, Seung W.; Crane, Paul K.; Deyo, Richard A.; Johnson, Kurt L.; Amtmann, Dagmar
Study Design A post-hoc simulation of a computer adaptive administration of the items of a modified version of the Roland Morris Disability Questionnaire. Objective To evaluate the effectiveness of adaptive administration of back pain-related disability items compared to a fixed 11-item short form. Summary of Background Data Short form versions of the Roland Morris Disability Questionnaire have been developed. An alternative to paper-and-pencil short forms is to administer items adaptively so that items are presented based on a person’s responses to previous items. Theoretically, this allows precise estimation of back pain disability with administration of only a few items. Materials and Methods Data were gathered from two previously conducted studies of persons with back pain. An item response theory model was used to calibrate scores based on all items, items of a paper-and-pencil short form, and several computer adaptive tests (CATs). Results Correlations between each CAT condition and scores based on a 23-item version of the Roland Morris Disability Questionnaire ranged from 0.93 to 0.98. Compared to an 11-item short form, an 11-item CAT produced scores that were significantly more highly correlated with scores based on the 23-item scale. CATs with even fewer items also produced scores that were highly correlated with scores based on all items. For example, scores from a five-item CAT had a correlation of 0.93 with full scale scores. Seven- and nine-item CATs correlated at 0.95 and 0.97, respectively. A CAT with a standard-error-based stopping rule produced scores that correlated at 0.95 with full scale scores. Conclusions A CAT-based back pain-related disability measure may be a valuable tool for use in clinical and research contexts. Use of CAT for other common measures in back pain research, such as other functional scales or measures of psychological distress, may offer similar advantages. PMID:18496352
Hamilton, Elena; Carr, Alan; Cahill, Paul; Cassells, Ciara; Hartnett, Dan
The SCORE (Systemic Clinical Outcome and Routine Evaluation) is a 40-item questionnaire for completion by family members 12 years and older to assess outcome in systemic therapy. This study aimed to investigate psychometric properties of two short versions of the SCORE and their responsiveness to therapeutic change. Data were collected at 19 centers from 701 families at baseline and from 433 of these 3-5 months later. Results confirmed the three-factor structure (strengths, difficulties, and communication) of the 15- and 28-item versions of the SCORE. Both instruments had good internal consistency and test-retest reliability. They also showed construct and criterion validity, correlating with measures of parent, child, and family adjustment, and discriminating between clinical and nonclinical cases. Total and factor scales of the SCORE-15 and -28 were responsive to change over 3-5 months of therapy. The SCORE-15 and SCORE-28 are brief psychometrically robust family assessment instruments which may be used to evaluate systemic therapy.
Background Missing items are common in quality of life (QoL) questionnaires and present a challenge for research in this field. The development of sound strategies of replacement and prevention requires accurate knowledge of their type and determinants. Methods We used the 2003 French Decennial Health Survey of a representative sample of the general population -- including 22,620 adult subjects who completed the SF-36 questionnaire-- to test various socio-demographic, health status and QoL variables as potential predictors of missingness. We constructed logistic regression models for each SF-36 item to identify independent predictors and classify them according to Little and Rubin ("missing completely at random", "missing at random" and "missing not at random"). Results The type of missingness was missing at random for half of the items of the SF-36 and missing not at random for the others. None of the items were missing completely at random. Independent predictors of missingness were age, female sex, low scores on the SF-36 subscales and in some cases low educational level, occupation, nationality and poor health status. Conclusion This study of the SF-36 shows that imputation of missing items is necessary and emphasizes several factors for missingness that should be considered in prevention strategies of missing data. Similar methodologies could be applied to item missingness in other QoL questionnaires. PMID:20128899
Baksheev, Gennady Nickolaevich; Robinson, Jo; Cosgrave, Elizabeth Mary; Baker, Kathryn; Yung, Alison Ruth
Despite the common use of the 12-item General Health Questionnaire (GHQ-12) with adolescents, there are limited data supporting its validity with this population. The aims of the study were to investigate the psychometric properties of the GHQ-12 among high school students, to validate the GHQ-12 against the gold standard of a diagnostic interview, and to suggest a threshold score for detecting depressive and anxiety disorders. Six hundred and fifty-four high school students from years 10 to 12 (ages 15-18) completed the GHQ-12 (Likert scored) and the Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders-IV-Text Revision (DSM-IV-TR). Receiver operating characteristic (ROC) curves were plotted. The mean GHQ-12 score for the total sample was 9.9 (S.D.=5.4). Results from the ROC curve indicated that the GHQ-12 performed better than chance at identifying depressive and anxiety disorders (area under the curve (AUC)=0.781). A GHQ-12 threshold score of 9/10 for males and 10/11 for females was found to be optimal. Given the significant proportion of mental illness among high school students, there may be a need to introduce screening for mental illnesses as part of the school curriculum. This can assist with the early identification and enable low stigma preventive intervention within the school environment.
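The ROC quantities reported here can be illustrated with a small sketch. It computes the AUC as the probability that a randomly chosen case scores higher than a randomly chosen non-case, with ties counted as one half, and selects a cutoff by maximizing Youden's J (sensitivity + specificity - 1). The abstract does not state which optimality criterion the authors used, so Youden's J is an assumption, and the scores below are invented rather than study data.

```python
def auc(case_scores, noncase_scores):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    wins = 0.0
    for c in case_scores:
        for n in noncase_scores:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5
    return wins / (len(case_scores) * len(noncase_scores))

def best_cutoff(case_scores, noncase_scores):
    """Return (cutoff, J, sensitivity, specificity); a score >= cutoff screens positive."""
    best = None
    for t in sorted(set(case_scores) | set(noncase_scores)):
        sens = sum(s >= t for s in case_scores) / len(case_scores)
        spec = sum(s < t for s in noncase_scores) / len(noncase_scores)
        j = sens + spec - 1  # Youden's J, an assumed criterion
        if best is None or j > best[1]:
            best = (t, j, sens, spec)
    return best

# Hypothetical questionnaire totals for diagnosed cases and non-cases.
cases = [12, 15, 9, 18, 11]
noncases = [5, 8, 10, 7, 12, 6]
area = auc(cases, noncases)
cutoff, youden_j, sensitivity, specificity = best_cutoff(cases, noncases)
```

In this convention a returned cutoff of t corresponds to the "(t-1)/t" notation used in the abstract, where scores of t or more screen positive.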
Rogers, Katherine D.; Young, Alys; Lovell, Karina; Campbell, Malcolm; Scott, Paul R.; Kendal, Sarah
The present study aimed to translate 3 widely used clinical assessment measures into British Sign Language (BSL), to pilot the BSL versions, and to establish their validity and reliability. These were the Patient Health Questionnaire (PHQ-9), the Generalized Anxiety Disorder 7-item (GAD-7) scale, and the Work and Social Adjustment Scale (WSAS).…
Theoretically, increased levels of physical activity self-efficacy (PASE) should lead to increased physical activity, but few studies have reported this effect among youth. This failure may be at least partially attributable to measurement limitations. In this study, Item Response Modeling (IRM) was...
Maij-de Meij, Annette M.; Kelderman, Henk; van der Flier, Henk
Mixture item response theory (IRT) models aid the interpretation of response behavior on personality tests and may provide possibilities for improving prediction. Heterogeneity in the population is modeled by identifying homogeneous subgroups that conform to different measurement models. In this study, mixture IRT models were applied to the…
Fischer, H Felix; Tritt, Karin; Klapp, Burghard F; Fliege, Herbert
A wide range of questionnaires for measuring depression are available. Item Response Theory models can help to evaluate the questionnaires exceeding the boundaries of Classical Test Theory and provide an opportunity to equate the questionnaires. In this study after checking for unidimensionality, a General Partial Credit Model was applied to data from two different depression scales [Patient Health Questionnaire (PHQ-9) and ICD-10-Symptom Rating (ISR)] obtained in clinical settings from a consecutive sample, including 4517 observations from a total of 2999 inpatients and outpatients of a psychosomatic clinic. The precision of each questionnaire was compared and the model was used to transform scores based on the assumed underlying latent trait. Both instruments were constructed to measure the same construct and their estimates of depression severity are highly correlated. Our analysis showed that the predicted scores provided by the conversion tables are similar to the observed scores in a validation sample. The PHQ-9 and ISR depression scales measure depression severity across a broad range with similar precision. While the PHQ-9 shows advantages in measuring low or high depression severity, the ISR is more parsimonious and also suitable for clinical purposes. Furthermore, the equation tables derived in this study enhance the comparability of studies using either one of the instruments, but due to substantial statistical spread the comparison of individual scores is imprecise.
Grondin, Julie; Blais, Jean-Guy
When respondents fail to use response scales of survey questionnaires as intended, latent variable modeling of data can produce disordered category thresholds. The objective of this paper is to show the usefulness of the Rasch modeling features to explore different ways of collapsing categories so that they are properly ordered and fit for further…
Yao, Kai-Ping Grace; Lee, Hsin-Yi; Tsauo, Jau-Yih
Researchers measure the significance of hip fracture by the patient's impairment. The patient's quality of life (QOL) is usually also substantially affected. However, there is no specific quality of life (QOL) questionnaire for patients with hip fractures. This study was designed to determine whether adding a new set of specific questions about…
Duan, Wenjie; Li, Jinxia
The widely used Five-Facet Mindfulness Questionnaire (FFMQ) mixes the dispositional and cultivated forms of mindfulness, thereby resulting in factor issues in previous studies. The present study distinguished the two forms of mindfulness and developed a Short Inventory of Mindfulness Capability at the item level of FFMQ. Three facets of mindfulness, namely, Describing, Acting with Awareness, and Non-judging of Experience, were assessed using community (n = 433) and student (n = 347) samples. Both meditators and non-meditators participated. Exploratory and confirmatory factor analysis (CFA) revealed a three-factor model of mindfulness with 12 items (four items per subscale). Psychometric evaluation demonstrated the solid factor structure of the measurement with high factor loadings, good internal consistency, and convergent validities. Longitudinal analysis indicated that the Acting with Awareness facet was a significant predictor of depression and anxiety symptoms 6 months later. Discussions focused on the roles of mindfulness capability on mental health as well as the relationship between them. A higher-order factor of mindfulness should be used to examine the efficacy of intervention or monitor the changes. Researchers who need to study the specific role or efficacy of each facet should calculate the scores of different facets. PMID:27667978
Wiwe Lipsker, Camilla; Kanstrup, Marie; Holmström, Linda; Kemani, Mike; Wicksell, Rikard K.
In pediatric chronic pain, research indicates a positive relation between parental psychological flexibility (i.e., the parent’s willingness to experience distress related to the child’s pain in the service of valued behavior) and level of functioning in the child. This points to the utility of targeting parental psychological flexibility in pediatric chronic pain. The Parent Psychological Flexibility Questionnaire (PPFQ) is currently the only instrument developed for this purpose, and two previous studies have indicated its reliability and validity. The current study sought to validate the Swedish version of the 17-item PPFQ (PPFQ-17) in a sample of parents (n = 263) of children with chronic pain. Factor structure and internal reliability were evaluated by means of principal component analysis (PCA) and Cronbach’s alpha. Concurrent criterion validity was examined by hierarchical multiple regression analyses with parental anxiety and depression as outcomes. The PCA supported a three-factor solution with 10 items explaining 69.5% of the total variance. Cronbach’s alpha (0.86) indicated good internal consistency. The 10-item PPFQ (PPFQ-10) further explained a significant amount of variance in anxiety (29%), and depression (35.6%), confirming concurrent validity. In conclusion, results support the reliability and validity of the PPFQ-10, and suggest its usefulness in assessing psychological flexibility in parents of children with chronic pain. PMID:27869780
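Cronbach's alpha, used here and in several other abstracts in this listing as the index of internal consistency, is computed from the item variances and the variance of the total score: alpha = k/(k-1) * (1 - sum of item variances / variance of totals), where k is the number of items. The sketch below applies that standard formula to made-up data. The function name and the scores are hypothetical.

```python
from statistics import variance

def cronbach_alpha(scores):
    """scores: one list of item scores per respondent (complete data, k >= 2 items)."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical data: perfectly consistent items, then two partly consistent items.
alpha_perfect = cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3], [4, 4, 4]])
alpha_partial = cronbach_alpha([[1, 2], [2, 1], [3, 4], [4, 3]])
```

Alpha rises both with stronger inter-item correlations and with more items, which is one reason short forms are usually compared with their parent scales on this index.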
Gideon, Nicole; Hawkes, Nick; Mond, Jonathan; Saunders, Rob; Tchanturia, Kate; Serpell, Lucy
Objective The aim of this study was to develop and validate a short form of the Eating Disorder Examination Questionnaire (EDE-Q) for routine, including session by session, outcome assessment. Method The current, 28-item version (6.0) of the EDE-Q was completed by 489 individuals aged 18–72 with various eating disorders recruited from three UK specialist eating disorder services. Rasch analysis was carried out on factors identified by means of principal component analysis, which in combination with expert ratings informed the development of an EDE-Q short form. The shortened questionnaire’s reliability, validity and sensitivity was assessed based on online data collected from students of a UK university and volunteers with a history of eating disorders recruited from a national eating disorders charity aged 18–74 (N = 559). Results A 12-item short form, the Eating Disorder Examination Questionnaire Short (EDE-QS) was derived. The new measure showed high internal consistency (Cronbach’s α = .913) and temporal stability (ICC = .93; p < .001). It was highly correlated with the original EDE-Q (r = .91 for people without ED; r = .82 for people with ED) and other measures of eating disorder and comorbid psychopathology. It was sufficiently sensitive to distinguish between people with and without eating disorders. Discussion The EDE-QS is a brief, reliable and valid measure of eating disorder symptom severity that performs similarly to the EDE-Q and that lends itself for the use of sessional outcome monitoring in treatment and research. PMID:27138364
Salaffi, Fausto; Di Carlo, Marco; Carotti, Marina; Farah, Sonia; Gutierrez, Marwin
Background Over the last few years, there has been a shift toward a more patient-centered perspective of the disease by adopting patient-reported outcomes. Touch-screen formats are increasingly being used for data collection in routine care and research. Objectives The aim of this study is to examine the equivalence, reliability, validity and respondent preference for a computerized touch-screen version of the Psoriatic Arthritis Impact of Disease 12-item (PsAID-12) questionnaire in comparison with the original paper-and-pencil version, in a cohort of patients with psoriatic arthritis (PsA). Methods One hundred and fifty-nine patients with PsA completed both the touch screen- and the conventional paper-and-pencil administered PsAID-12 questionnaire. Agreement between formats was assessed by intraclass correlation coefficients. Spearman’s rho correlation coefficient was used to test convergent validity of the touch screen format of PsAID-12, while receiver operating characteristic curve analysis was performed to test discriminant validity. In order to assess the patient’s preference, the participants filled in an additional questionnaire. The time taken to complete both formats was measured. Results A high concordance between the responses to the two modes of the PsAID-12 tested was found, with no significant mean differences. Intraclass correlation coefficients between data obtained for touch-screen and paper versions ranged from 0.801 to 0.962. There was a very high degree of correlation between the touch-screen format of PsAID-12 and composite disease activity indices (all at a P level <0.0001), Health Assessment Questionnaire, and Physician Assessment of disease activity. The discriminatory power of the touch-screen format of PsAID-12, assessed using the minimal disease activity – Outcome Measurements in Rheumatology Clinical Trials criteria, was very good, with an area under the receiver operating characteristic curve of 0.937 and a resulting cutoff value
Examining the Factor Structure of the 39-Item and 15-Item Versions of the Five Facet Mindfulness Questionnaire Before and After Mindfulness-Based Cognitive Therapy for People With Recurrent Depression
Research into the effectiveness and mechanisms of mindfulness-based interventions (MBIs) requires reliable and valid measures of mindfulness. The 39-item Five Facet Mindfulness Questionnaire (FFMQ-39) is a measure of mindfulness commonly used to assess change before and after MBIs. However, the stability and invariance of the FFMQ factor structure have not yet been tested before and after an MBI; pre to post comparisons may not be valid if the structure changes over this period. Our primary aim was to examine the factor structure of the FFMQ-39 before and after mindfulness-based cognitive therapy (MBCT) in adults with recurrent depression in remission using confirmatory factor analysis (CFA). Additionally, we examined whether the factor structure of the 15-item version (FFMQ-15) was consistent with that of the FFMQ-39, and whether it was stable over MBCT. Our secondary aim was to assess the general psychometric properties of both versions. CFAs showed that pre-MBCT, a 4-factor hierarchical model (excluding the “observing” facet) best fit the FFMQ-39 and FFMQ-15 data, whereas post-MBCT, a 5-factor hierarchical model best fit the data for both versions. Configural invariance across the time points was not supported for both versions. Internal consistency and sensitivity to change were adequate for both versions. Both FFMQ versions did not differ significantly from each other in terms of convergent validity. Researchers should consider excluding the Observing subscale from comparisons of total scale/subscale scores before and after mindfulness interventions. Current findings support the use of the FFMQ-15 as an alternative measure in research where briefer forms are needed. PMID:27078186
Davidson, Charlie A; Hoffman, Lesa; Spaulding, William D
This study updates and provides evidence for the dimensionality, reliability, and validity of a standard instrument for detection and measurement of schizotypy in non-clinical young adults. Schizotypy represents a set of traits on which both nonclinical and schizophrenia-spectrum populations vary meaningfully. These traits are linked to biological, cognitive, and social dimensions of serious mental illness (SMI), to clinical and subclinical variation in personal and social functioning, and to risk for SMI. Reliable and valid identification of schizotypal traits has important implications for clinical practice and research. Four consecutive independent samples of undergraduates were administered the SPQ-BR (N=2552). Confirmatory factor analyses suggested a minor item wording change improved reliability, and this Updated questionnaire was implemented for three-quarters of the sample (SPQ-BRU). A single-order, nine-factor structure had acceptable psychometric properties. The best fitting second-order structure included four higher-order factors that distinguished Social Anxiety and Interpersonal factors. This differentiation was supported by differential relationships with treatment history. The Disorganized factor had the greatest unique relationship with personal and family treatment history. With few exceptions, factor loadings showed stability across samples. Overall, the higher-order and lower-order factors of schizotypy demonstrated reliability and convergent and discriminant validity; detailed psychometric data are presented in a supplement.
Rogers, Katherine D; Young, Alys; Lovell, Karina; Campbell, Malcolm; Scott, Paul R; Kendal, Sarah
The present study aimed to translate 3 widely used clinical assessment measures into British Sign Language (BSL), to pilot the BSL versions, and to establish their validity and reliability. These were the Patient Health Questionnaire (PHQ-9), the Generalized Anxiety Disorder 7-item (GAD-7) scale, and the Work and Social Adjustment Scale (WSAS). The 3 assessment measures were translated into BSL and piloted with the Deaf signing population in the United Kingdom (n = 113). Participants completed the PHQ-9, GAD-7, WSAS, and Clinical Outcomes in Routine Evaluation-Outcome Measure (CORE-OM) online. The reliability and validity of the BSL versions of the PHQ-9, GAD-7, and WSAS were examined and found to be good. The construct validity analysis of the PHQ-9 BSL version did not yield the single-factor solution found in the hearing population. BSL versions of the PHQ-9, GAD-7, and WSAS have been produced and can be used with the signing Deaf population in the United Kingdom. This means that accessible mental health assessments are now available for Deaf people who are BSL users, which could assist in the early identification of mental health difficulties.
Reis, Ana Luiza; Reis, Leonardo Oliveira; Saade, Ricardo Destro; Santos, Carlos Alberto; de Lima, Marcelo Lopes; Fregonesi, Adriano
Purpose To validate the Quality of Erection Questionnaire (QEQ) considering Brazilian social-cultural aspects. Materials and Methods To determine equivalence between the Portuguese and the English QEQ versions, the Portuguese version was back-translated by two professors who are native English speakers. After language equivalence had been determined, urologists considered the QEQ Portuguese version suitable. Men with self-reported erectile dysfunction (ED) and infertile men who had a stable sexual relationship for at least 6 months were invited to answer the QEQ, the International Index of Erectile Function (IIEF) and the RAND 36-Item Health Survey (RAND-36). The questionnaires were presented together and answered without help in a private room. Internal consistency (Cronbach’s α), test-retest reliability (Spearman), convergent validity (Spearman correlation) coefficients and known-groups validity (the ability of the QEQ Portuguese version to differentiate erectile dysfunction severity groups) were assessed. Results We recruited 197 men (167 ED patients and 30 non-ED patients), mean age of 53.3 and median of 55.5 years (23-82 years). The Portuguese version of the QEQ had high internal consistency (Cronbach α=0.93), high stability between test and retest (ICC 0.83, 95% CI: 0.76-0.88, p<0.001) and Spearman correlation coefficient r=0.82 (p<0.001), which demonstrated the high correlation between the QEQ and IIEF results. The correlations between the QEQ and RAND-36 were significantly low in ED (r=0.20, p=0.01) and non-ED patients (r=0.37, p=0.04). Conclusion The QEQ Portuguese version presented good psychometric properties and high convergent validity in relation to IIEF. The low correlations between the QEQ and the RAND-36, as well as between the IIEF and the RAND-36 indicated IIEF and QEQ specificity, which may have resulted from the patients’ psychological adaptations that minimized the impact of ED on Quality of Life (QoL) and reestablished the well
Trani, Jean-François; Babulal, Ganesh Muneshwar; Bakhshi, Parul
Background Although 80% of persons with disabilities live in low and middle-income countries, there is still a lack of comprehensive, cross-culturally validated tools to identify persons facing activity limitations and functioning difficulties in these settings. In the absence of such a tool, disability estimates vary considerably according to the methodology used, and policies are based on unreliable estimates. Methods and Findings The Disability Screening Questionnaire composed of 27 items (DSQ-27) was initially designed by a group of international experts in survey development and disability in Afghanistan for a national survey. Items were selected based on major domains of activity limitations and functioning difficulties linked to an impairment as defined by the International Classification of Functioning, Disability and Health. Face, content and construct validity, as well as sensitivity and specificity were examined. Based on the results obtained, the tool was subsequently refined and expanded to 34 items, tested and validated in Darfur, Sudan. Internal consistency for the total DSQ-34 using a raw and standardized Cronbach’s Alpha and within each domain using a standardized Cronbach’s Alpha was examined in the Asian context (India and Nepal). Exploratory factor analysis (EFA) using principal axis factoring (PAF) evaluated the lowest number of factors to account for the common variance among the questions in the screen. Test-retest reliability was determined by calculating intraclass correlation (ICC) and inter-rater reliability by calculating the kappa statistic; results were checked using Bland-Altman plots. The DSQ-34 was further tested for standard error of measurement (SEM) and for the minimum detectable change (MDC). Good internal consistency was indicated by Cronbach’s Alpha of 0.83/0.82 for India and 0.76/0.78 for Nepal. We confirmed our assumption for EFA using the Kaiser-Meyer-Olkin measure of sampling adequacy, well above the accepted cutoff of 0.40 for
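The standard error of measurement (SEM) and minimum detectable change (MDC) mentioned in this abstract are often derived from a reliability coefficient. A minimal sketch, assuming the widely used forms SEM = SD * sqrt(1 - reliability) and MDC95 = 1.96 * sqrt(2) * SEM, follows. The abstract does not say which formulas the authors applied, and the input values are invented for illustration.

```python
from math import sqrt

def standard_error_of_measurement(sd, reliability):
    """SEM from the score standard deviation and a reliability estimate such as an ICC."""
    return sd * sqrt(1 - reliability)

def minimal_detectable_change(sem, z=1.96):
    """MDC at the confidence level implied by z; sqrt(2) reflects two measurements."""
    return z * sqrt(2) * sem

# Hypothetical inputs: SD of 10 score points and a test-retest ICC of 0.84.
sem_value = standard_error_of_measurement(10.0, 0.84)
mdc_value = minimal_detectable_change(sem_value)
```

Under these assumptions, a change smaller than the MDC cannot be distinguished from measurement error at the chosen confidence level.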
Griffee, Dale T.
The purpose of this paper is to give evidence for the thesis that if teachers using a questionnaire as a data collection instrument have the questionnaire items translated from one language into another, they cannot assume that the translated items are valid simply because they were translated. Even if the original questionnaire items were…
Nielsen, Anne Molgaard; Vach, Werner; Kent, Peter; Hestbaek, Lise; Kongsted, Alice
Background Latent class analysis (LCA) is increasingly being used in health research, but optimal approaches to handling complex clinical data are unclear. One issue is that commonly used questionnaires are multidimensional, but expressed as summary scores. Using the example of low back pain (LBP), the aim of this study was to explore and descriptively compare the application of LCA when using questionnaire summary scores and when using single items to subgrouping of patients based on multidimensional data. Materials and methods Baseline data from 928 LBP patients in an observational study were classified into four health domains (psychology, pain, activity, and participation) using the World Health Organization’s International Classification of Functioning, Disability, and Health framework. LCA was performed within each health domain using the strategies of summary-score and single-item analyses. The resulting subgroups were descriptively compared using statistical measures and clinical interpretability. Results For each health domain, the preferred model solution ranged from five to seven subgroups for the summary-score strategy and seven to eight subgroups for the single-item strategy. There was considerable overlap between the results of the two strategies, indicating that they were reflecting the same underlying data structure. However, in three of the four health domains, the single-item strategy resulted in a more nuanced description, in terms of more subgroups and more distinct clinical characteristics. Conclusion In these data, application of both the summary-score strategy and the single-item strategy in the LCA subgrouping resulted in clinically interpretable subgroups, but the single-item strategy generally revealed more distinguishing characteristics. These results 1) warrant further analyses in other data sets to determine the consistency of this finding, and 2) warrant investigation in longitudinal data to test whether the finer detail provided by
This database contains questionnaire items and a list of validation studies for standardized items related to walking and biking. The items come from multiple national and international physical activity questionnaires.
Doherr, Hanna; Christalle, Eva; Kriston, Levente; Härter, Martin; Scholl, Isabelle
Background The Shared Decision Making Questionnaire (SDM-Q-9 and SDM-Q-Doc) is a 9-item measure of the decisional process in medical encounters from both patients’ and physicians’ perspectives. It has good acceptance, feasibility, and reliability. This systematic review aimed to 1) evaluate the use of the SDM-Q-9 and SDM-Q-Doc in intervention studies on shared decision making (SDM) in clinical settings, 2) describe how the SDM-Q-9 and SDM-Q-Doc performed regarding sensitivity to change, and 3) assess the methodological quality of studies and study protocols that use the measure. Methods We conducted a systematic review of studies published between 2010 and October 2015 that evaluated interventions to facilitate SDM. The search strategy comprised three databases (EMBASE, PsycINFO, and Medline), reference tracking, citation tracking, and personal knowledge. Two independent reviewers screened titles and abstracts as well as full texts of potentially relevant records. We extracted the data using a pilot-tested sheet, and we assessed the methodological quality of included studies using the Quality Assessment Tools from the U.S. National Institutes of Health (NIH). Results Five completed studies and six study protocols fulfilled the inclusion criteria. The measure was used in a variety of health care settings, mainly in Europe, to evaluate several types of interventions. The reported mean sum scores ranged from 42 to 75 on a scale from 0 to 100. In four studies no significant change was detected in the mean differences between main groups. In the fifth study the difference was small. Quality assessment revealed a high risk of bias in four of the five completed studies, while the study protocols received moderate quality ratings. Conclusions We found a wide range of areas in which the SDM-Q-9 and SDM-Q-Doc were applied. In the future this review may help researchers decide whether the measure fits their purposes. Furthermore, the review revealed risk of bias in
Guillemin, I; Marrel, A; Arnould, B; Capuron, L; Dupuy, A; Ginon, E; Layé, S; Lecerf, J-M; Prost, M; Rogeaux, M; Urdapilleta, I; Allaert, F-A
Providing well-being and maintaining good health are the main objectives subjects seek from diet. This manuscript describes the development and preliminary validation of an instrument assessing well-being associated with food and eating habits in a general healthy population. Qualitative data from 12 discussion groups (102 subjects) conducted with healthy subjects were used to develop the core of the Well-being related to Food Questionnaire (Well-BFQ). Twelve other discussion groups with subjects with joint (n = 34), digestive (n = 32) or repetitive infection complaints (n = 30) were performed to develop items specific to these complaints. Five main themes emerged from the discussions and formed the modular backbone of the questionnaire: "Grocery shopping", "Cooking", "Dining places", "Commensality", "Eating and drinking". Each module has a common structure: items about the subject's food behavior and items about immediate and short-term benefits. An additional theme - "Eating habits and health" - assesses subjects' beliefs about expected benefits of food and eating habits on health, disease prevention and protection, and quality of ageing. A preliminary validation was conducted with 444 subjects with a balanced, non-balanced, or standard diet. The structure of the questionnaire was further determined using principal component analyses and exploratory factor analyses, with confirmation of the sub-sections food behaviors, immediate benefits (pleasure, security, relaxation), direct short-term benefits (digestion and satiety, energy and psychology), and deferred long-term benefits (eating habits and health). Thirty-three subscales and 14 single items were further defined. Confirmatory analyses confirmed the structure, with overall moderate to excellent convergent and divergent validity and internal consistency reliability. The Well-BFQ is a unique, modular tool that comprehensively assesses the full picture of well-being related to food and eating habits in
Questionnaire Results A. Overview B. Social Desirability and Acquiescence Response Sets C. Other Response Sets or Errors D. Effects of General Pretest...Attitudes of Respondents E. Effects of Demographic Characteristics of Responses XIII. Evaluating Questionnaire Results A. Overview B. Scoring..."questionnaire" refers to an ordered arrangement of items (questions, in effect) intended to elicit the evaluations, judgments, comparisons, attitudes
Monticone, Marco; Ferrante, Simona; Giorgi, Ines; Galandra, Caterina; Rocca, Barbara; Foti, Calogero
BACKGROUND: Increasing attention is being devoted to cognitive-behavioural measures to improve interventions for chronic pain. OBJECTIVE: To develop an Italian version of the Coping Strategies Questionnaire – Revised (CSQ-R), and to validate it in a study involving 345 Italian subjects with chronic pain. METHODS: The questionnaire was developed following international recommendations. The psychometric analyses included confirmatory factor analysis; reliability, assessed by internal consistency (Cronbach’s alpha) and test-retest reliability (intraclass correlation coefficients); and construct validity, assessed by calculating the correlations between the subscales of the CSQ-R and measures of pain (numerical rating scale), disability (Sickness Impact Profile – Roland Scale), depression (Center for Epidemiological Studies – Depression Scale) and coping (Chronic Pain Coping Inventory) (Pearson’s correlation). RESULTS: Confirmatory factor analysis revealed that the CSQ-R model had an acceptable data-model fit (comparative fit index and normed fit index ≤0.90, root mean square error of approximation ≥0.08). Cronbach’s alpha was satisfactory (CSQ-R 0.914 to 0.961), and the intraclass correlation coefficients were good/excellent (CSQ-R 0.850 to 0.918). As expected, the correlations with the numerical rating scale, Sickness Impact Profile – Roland Scale, Center for Epidemiological Studies – Depression Scale and Chronic Pain Coping Inventory highlighted the adaptive and maladaptive properties of most of the CSQ-R subscales. CONCLUSION: The CSQ-R was successfully translated into Italian. The translation proved to have good factorial structure, and its psychometric properties are similar to those of the original and other adapted versions. Its use is recommended for clinical and research purposes in Italy and abroad. PMID:24761430
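The internal-consistency statistic reported above (Cronbach's alpha) can be illustrated with a minimal sketch. This is not the authors' analysis code: the function name cronbach_alpha and the small data set are hypothetical, and the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of the sum score) is assumed.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's sum score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data (not from the study): 5 respondents x 3 items.
demo = [[1, 2, 2], [2, 2, 3], [3, 4, 4], [4, 4, 5], [5, 5, 5]]
alpha = cronbach_alpha(demo)
print(round(alpha, 3))
```

Values close to 1 indicate that the items vary together; the thresholds used to call a value "satisfactory" are a convention and are not derived by this sketch.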
DeWalt, Darren A.; Rothrock, Nan; Yount, Susan; Stone, Arthur A.
One of the PROMIS (Patient-Reported Outcome Measurement Information System) network's primary goals is the development of a comprehensive item bank for patient-reported outcomes of chronic diseases. For its first set of item banks, PROMIS chose to focus on pain, fatigue, emotional distress, physical function, and social function. An essential step for the development of an item pool is the identification, evaluation, and revision of extant questionnaire items for the core item pool. In this work, we also describe the systematic process wherein items are classified for subsequent statistical processing by the PROMIS investigators. Six phases of item development are documented: identification of extant items, item classification and selection, item review and revision, focus group input on domain coverage, cognitive interviews with individual items, and final revision before field testing. Identification of items refers to the systematic search for existing items in currently available scales. Expert item review and revision was conducted by trained professionals who reviewed the wording of each item and revised as appropriate for conventions adopted by the PROMIS network. Focus groups were used to confirm domain definitions and to identify new areas of item development for future PROMIS item banks. Cognitive interviews were used to examine individual items. Items successfully screened through this process were sent to field testing and will be subjected to innovative scale construction procedures. PMID:17443114
Governor's Citizen Advisory Committee on Drugs, Salt Lake City, UT.
This questionnaire assesses drug use practices in junior and senior high school students. The 21 multiple choice items pertain to drug use practices, use history, availability of drugs, main reason for drug use, and demographic data. The questionnaire is untimed, group administered, and may be given by the classroom teacher in about 10 minutes. Item…
Haagen, C. Hess
This questionnaire assesses marijuana use practices in college students. The 30 items (multiple choice or free response) are concerned with personal and demographic data, marijuana smoking practices, use history, effects from smoking marijuana, present attitude toward the substance, and use of other drugs. The questionnaire is untimed and…
The heterogeneous structure of schizotypal personality disorder: item-level factors of the Schizotypal Personality Questionnaire and their associations with obsessive-compulsive disorder symptoms, dissociative tendencies, and normal personality.
Chmielewski, Michael; Watson, David
A. Raine et al.'s (1994) 3-factor scheme is currently the most widely accepted model of schizotypal personality disorder (SPD). Factor analytic studies of the Schizotypal Personality Questionnaire (SPQ; A. Raine, 1991) subscales, which represent the 9 Diagnostic and Statistical Manual of Mental Disorders (DSM) criteria for SPD, have provided the model's primary support. The use of only 9 modeled variables, however, limits the number of factors that can be extracted. To explicate this structure more fully, the authors conducted item-level factor analyses of the SPQ in a large student sample that completed the instrument twice within a 2-week interval. The authors' analyses failed to support either the 3-factor model of SPD or the 9 existing DSM-based subscales of the SPQ. Instead, 5 replicable dimensions emerged that capture recurrent symptom pairings found in the broader SPD literature: Social Anhedonia, Unusual Beliefs and Experiences, Social Anxiety, Mistrust, and Eccentricity/Oddity. These factors are only weakly correlated with each other and show differential correlational patterns with the Big Five personality traits, dissociative tendencies, and symptoms of obsessive-compulsive disorder. Moreover, they are congruent with dimensional models of personality psychopathology. Implications for SPD in DSM-V are discussed.
Markov blanket-based approach for learning multi-dimensional Bayesian network classifiers: an application to predict the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson's Disease Questionnaire (PDQ-39).
Borchani, Hanen; Bielza, Concha; Martínez-Martín, Pablo; Larrañaga, Pedro
Multi-dimensional Bayesian network classifiers (MBCs) are probabilistic graphical models recently proposed to deal with multi-dimensional classification problems, where each instance in the data set has to be assigned to more than one class variable. In this paper, we propose a Markov blanket-based approach for learning MBCs from data. Basically, it consists of determining the Markov blanket around each class variable using the HITON algorithm, then specifying the directionality over the MBC subgraphs. Our approach is applied to the prediction problem of the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson's Disease Questionnaire (PDQ-39) in order to estimate the health-related quality of life of Parkinson's patients. Fivefold cross-validation experiments were carried out on randomly generated synthetic data sets, Yeast data set, as well as on a real-world Parkinson's disease data set containing 488 patients. The experimental study, including comparison with additional Bayesian network-based approaches, back propagation for multi-label learning, multi-label k-nearest neighbor, multinomial logistic regression, ordinary least squares, and censored least absolute deviations, shows encouraging results in terms of predictive accuracy as well as the identification of dependence relationships among class and feature variables.
McMorris, Robert F.; And Others
Two 50-item multiple-choice forms of a grammar test were developed differing only in humor being included in 20 items of one form. One hundred twenty-six (126) eighth graders received the test plus alternate forms of a questionnaire. Humor inclusion did not affect grammar scores on matched humorous/nonhumorous items nor on common post-treatment…
Woods, Carol M.
Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…
STARRY, ALLAN R.
THE OBJECTIVES OF THIS STUDY WERE (1) TO DEVELOP A GENERAL CLASSIFICATION SYSTEM FOR LIFE HISTORY ITEMS, (2) TO DETERMINE TEST-RETEST RELIABILITY ESTIMATES, AND (3) TO ESTIMATE RESISTANCE TO EXAMINEE FAKING, FOR REPRESENTATIVE BIOGRAPHICAL QUESTIONNAIRES. TWO 100-ITEM QUESTIONNAIRES WERE CONSTRUCTED THROUGH RANDOM ASSIGNMENT BY CONTENT AREA OF 200…
Baker, Mark; Keane, Brian
Maximizing school resources and managing a shrinking budget--these are two important items affected when a building's roofing system does not perform properly. Rather than acting in haste, school and university administrators should do what every teacher tells a student prior to answering any question: think through the research and studies to…
Hagel, Lilian Day; Mainieri, Alberto Scolfano; Zeni, Cristian Patrick; Wagner, Mario Bernardes
Objective: To compare a questionnaire based on the HEADSS approach (QBH-16) and the Child Behavior Checklist (CBCL) in the screening of mental disorder in adolescents with behavioral problems. Methods: Adolescents of both genders, 12-17 years old, presenting behavioral problems without a previous diagnosis of mental disorder were referred from…
Boser, Judith A.; Clark, Sheldon B.
This study of survey research experts was conducted to determine desirable characteristics of mail questionnaires. The 82-item Likert-scale instrument used in the study covered general appearance, instructions, choice of items, choice of response options, wording, order of items, and item format. The instrument was administered to: 8 subjects who…
Gorin, Joanna S.; Embretson, Susan E.
Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…
Weijters, Bert; Baumgartner, Hans; Schillewaert, Niels
In the recent methodological literature, various models have been proposed to account for the phenomenon that reversed items (defined as items for which respondents' scores have to be recoded in order to make the direction of keying consistent across all items) tend to lead to problematic responses. In this article we propose an integrative conceptualization of three important sources of reversed item method bias (acquiescence, careless responding, and confirmation bias) and specify a multisample confirmatory factor analysis model with 2 method factors to empirically test the hypothesized mechanisms, using explicit measures of acquiescence and carelessness and experimentally manipulated versions of a questionnaire that varies 3 item arrangements and the keying direction of the first item measuring the focal construct. We explain the mechanisms, review prior attempts to model reversed item bias, present our new model, and apply it to responses to a 4-item self-esteem scale (N = 306) and the 6-item Revised Life Orientation Test (N = 595). Based on the literature review and the empirical results, we formulate recommendations on how to use reversed items in questionnaires.
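The definition of reversed items given above (items whose scores have to be recoded so that keying is consistent) can be sketched as follows. The function names, item identifiers, and the 1-to-5 response range are illustrative assumptions, not taken from the article.

```python
def reverse_key(score, low=1, high=5):
    """Recode a reversed item so a higher score always means more of the construct."""
    if not low <= score <= high:
        raise ValueError(f"score {score} outside the range {low}..{high}")
    return low + high - score


def scale_score(responses, reversed_items, low=1, high=5):
    """Sum item scores after recoding the reversed items.

    responses: dict mapping item id -> raw score
    reversed_items: set of item ids keyed in the opposite direction
    """
    return sum(
        reverse_key(score, low, high) if item in reversed_items else score
        for item, score in responses.items()
    )


# Hypothetical 4-item scale in which q2 and q4 are reversed.
raw = {"q1": 4, "q2": 2, "q3": 5, "q4": 1}
print(scale_score(raw, reversed_items={"q2", "q4"}))
```

Recoding only aligns the direction of keying; it does not remove the method effects (acquiescence, careless responding, confirmation bias) that the article models.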
Messinger, H B; Messinger, M I
Recently in this journal Peters and Murphy challenged the validity of factor analyses done on bimodal handedness data, suggesting instead that right- and left-handers be studied separately. But bimodality may be avoidable if attention is paid to Oldfield's questionnaire format and instructions for the subjects. Two characteristics appear crucial: a two-column LEFT-RIGHT format for the body of the instrument and what we call Oldfield's Admonition: not to indicate strong preference for a handedness item, such as writing, unless "... the preference is so strong that you would never try to use the other hand unless absolutely forced to...". Attaining unimodality of an item distribution would seem to overcome the objections of Peters and Murphy. In a 1984 survey in Boston we used Oldfield's ten-item questionnaire exactly as published. This produced unimodal item distributions. With reflection of the five-point item scale and a logarithmic transformation, we achieved a degree of normalization for the items. Two surveys elsewhere, based on Oldfield's 20-item list but with changes in the questionnaire format and the instructions, yielded markedly different item distributions with peaks at each extreme and sometimes in the middle as well.
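The reflection and logarithmic transformation mentioned above can be sketched as below. The abstract does not give the exact form used, so this assumes a common convention: reflect a 1-to-5 score as (high + 1) - score, then take the natural logarithm. The function name is hypothetical.

```python
import math


def reflect_and_log(score, high=5):
    """Reflect a 1..high item score, then log-transform it (assumed convention)."""
    if not 1 <= score <= high:
        raise ValueError(f"score {score} outside the range 1..{high}")
    reflected = (high + 1) - score  # on a five-point scale: 5 -> 1, 1 -> 5
    return math.log(reflected)      # compresses the long tail after reflection


# Hypothetical item responses on a five-point scale.
print([round(reflect_and_log(s), 3) for s in [5, 5, 4, 3, 1]])
```

Reflection turns a negatively skewed item into a positively skewed one, which is the shape a log transformation can pull toward symmetry.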
Sparfeldt, Jorn R.; Schilling, Susanne R.; Rost, Detlef H.; Thiel, Alexandra
The notion of item context effects implies that psychometric properties of an item or scale are altered by the presentation format, for example, blocked versus randomized. In an experimental study with high school students, the experimental group (n = 407) answered a four-dimensional academic self-concept questionnaire, in which the items were…
Purpose: The purpose of this study was to examine the concurrent validity of the Late Life Function and Disability Instrument (LLFDI) in patients with coronary heart disease (CHD) and to evaluate the accuracy of information obtained through self-report questionnaire versus interview formats. Methods: The study included 29 patients older than 60 years attending an outpatient cardiac rehabilitation program. Participants completed the LLFDI, three additional self-report criterion measures, and six performance-based tests; they completed the LLFDI a second time via interview. We used descriptive statistics, correlations, and t-tests to analyze the data. Results: All LLFDI components were correlated (rs=0.36–0.83) with the self-report criterion measures. The Function Component of the LLFDI was moderately correlated with the 6-Minute Walk Test (r=0.62), timed up-and-go (r=−0.58), walking speed (r=−0.57), and timed sit-to-stand (r=−0.56) scores. The LLFDI demonstrated a ceiling effect (10%) only in the Disability Limitation component. All LLFDI component scores obtained via self-report questionnaire were correlated with scores obtained via interview; except in a single subcategory, there was no difference between LLFDI scores obtained through self-report questionnaire and those obtained through interview. Conclusions: Results indicate that the LLFDI has appropriate validity for older patients (>60 years) with CHD and can be completed independently by patients rather than administered by clinicians. PMID:23277685
Longford, Nicholas T.
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Kraus, Shane; Rosenberg, Harold
Despite the prevalence of pornography use, and recent conceptualization of problematic use as an addiction, we could find no published scale to measure craving for pornography. Therefore, we conducted three studies employing young male pornography users to develop and evaluate such a questionnaire. In Study 1, we had participants rate their agreement with 20 potential craving items after reading a control script or a script designed to induce craving to watch pornography. We dropped eight items because of low endorsement. In Study 2, we revised both the questionnaire and cue exposure stimuli and then evaluated several psychometric properties of the modified questionnaire. Item loadings from a principal components analysis, a high internal consistency reliability coefficient, and a moderate mean inter-item correlation supported interpreting the 12 revised items as a single scale. Correlations of craving scores with preoccupation with pornography, sexual history, compulsive internet use, and sensation seeking provided support for convergent validity, criterion validity, and discriminant validity, respectively. The enhanced imagery script did not impact reported craving; however, more frequent users of pornography reported higher craving than less frequent users regardless of script condition. In Study 3, craving scores demonstrated good one-week test-retest reliability and predicted the number of times participants used pornography during the following week. This questionnaire could be applied in clinical settings to plan and evaluate therapy for problematic users of pornography and as a research tool to assess the prevalence and contextual triggers of craving among different types of pornography users.
Loas, Gwenolé; Yon, Valérie; Brien, Denis
The Frankfurt Complaint Questionnaire (FCQ) was designed to evaluate the subjective symptoms of schizophrenics. Several validation studies of the FCQ using principal components analyses (PCA) have shown one-, two-, or four-factor solutions. The present study was conducted using FCQ data on 310 schizophrenics who met the ICD-10 criteria for F20 (schizophrenia) disorder. Using several guidelines to select the number of factors, the PCA yielded one factor. This result suggests a unidimensionality underlying FCQ items. A new scale comprising 24 items was derived from those items with higher weights in the first factor.
Blau, Gary; DiMino, John; Sheridan, Natalie; Pred, Robert S.; Beverly, Clyde; Chessler, Marcy
The Young Schema Questionnaire (YSQ) in either long-form (205-item) or short-form (75-item or 90-item) versions has demonstrated its clinical usefulness for assessing early maladaptive schemas. However, even a 75- or 90-item "short form", particularly when combined with other measures, can represent a lengthy…
Bernstein, David P.; Stein, Judith A.; Newcomb, Michael D.; Walker, Edward; Pogge, David; Ahluvalia, Taruna; Stokes, John; Handelsman, Leonard; Medrano, Martha; Desmond, David; Zule, William
Exploratory and confirmatory factor analyses of 70 Childhood Trauma Questionnaire (CTQ) items were used to create a shorter 28-item version and test the measurement invariance of the 25 clinical items across adult substance abusing patients, adolescent psychiatric inpatients, and control populations (n=1,978). Items performed equivalently across…
Zijlstra, Wobbe P.; Van Der Ark, L. Andries; Sijtsma, Klaas
Classical methods for detecting outliers deal with continuous variables. These methods are not readily applicable to categorical data, such as incorrect/correct scores (0/1) and ordered rating scale scores (e.g., 0,..., 4) typical of multi-item tests and questionnaires. This study proposes two definitions of outlier scores suited for categorical…
The Work Environment Questionnaire (WEQ) was designed as a measure of organizational climate that relies on description of observable aspects of the... work environment rather than attitudes about work or job satisfaction. WEQ items were selected based on a critical incident analysis of work issues
Governor's Citizen Advisory Committee on Drugs, Salt Lake City, UT.
This questionnaire assesses drug use practices in high school drop-outs. The 79 items (multiple choice or apply/not apply) are concerned with demographic data and use, use history, reasons for use/nonuse, attitudes toward drugs, availability of drugs, and drug information with respect to narcotics, amphetamines, LSD, Marijuana, and barbiturates.…
Overseas Education Association, New York, NY.
AS A BASIS FOR IMPROVING THE EDUCATION OF THE 160,000 CHILDREN OF OVERSEAS AMERICAN MILITARY AND CIVILIAN PERSONNEL, 1,639 TEACHERS IN 285 OF THE 327 DEPARTMENT OF DEFENSE OVERSEAS DEPENDENTS SCHOOLS IN 28 COUNTRIES RESPONDED TO A 19-ITEM QUESTIONNAIRE COVERING TEACHING EXPERIENCE, EDUCATIONAL BACKGROUND, PERSONNEL PRACTICES, CLASSROOM MATERIALS,…
Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A.; Ono, Yutaka
Background Several studies have shown that total depressive symptom scores in the general population approximate an exponential pattern, except for the lower end of the distribution. The Center for Epidemiologic Studies Depression Scale (CES-D) consists of 20 items, each of which may take on four scores: “rarely,” “some,” “occasionally,” and “most of the time.” Recently, we reported that the item responses for 16 negative affect items commonly exhibit exponential patterns, except for the level of “rarely,” leading us to hypothesize that the item responses at the level of “rarely” may be related to the non-exponential pattern typical of the lower end of the distribution. To verify this hypothesis, we investigated how the item responses contribute to the distribution of the sum of the item scores. Methods Data collected from 21,040 subjects who had completed the CES-D questionnaire as part of a Japanese national survey were analyzed. To assess the item responses of negative affect items, we used a parameter r, which denotes the ratio of “rarely” to “some” in each item response. The distributions of the sum of negative affect items in various combinations were analyzed using log-normal scales and curve fitting. Results The sum of the item scores approximated an exponential pattern regardless of the combination of items, whereas, at the lower end of the distributions, there was a clear divergence between the actual data and the predicted exponential pattern. At the lower end of the distributions, the sum of the item scores with high values of r exhibited higher scores compared to those predicted from the exponential pattern, whereas the sum of the item scores with low values of r exhibited lower scores compared to those predicted. Conclusions The distributional pattern of the sum of the item scores could be predicted from the item responses of such items. PMID:27806132
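The parameter r described above (the ratio of "rarely" to "some" in each item response) can be sketched as a ratio of response frequencies. Reading r as a count ratio is an assumption based on the wording of the abstract; the function name and the sample responses are hypothetical.

```python
from collections import Counter


def rarely_to_some_ratio(item_responses):
    """Ratio of the number of 'rarely' responses to 'some' responses for one item."""
    counts = Counter(item_responses)
    if counts["some"] == 0:
        raise ValueError("ratio undefined: no 'some' responses for this item")
    return counts["rarely"] / counts["some"]


# Hypothetical responses to a single negative affect item.
responses = ["rarely"] * 6 + ["some"] * 3 + ["occasionally"] * 2 + ["most of the time"]
print(rarely_to_some_ratio(responses))
```

Under this reading, items with a high r have comparatively many respondents at the lowest level, which is the property the study relates to the lower end of the sum-score distribution.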
Fiske, Donald W.; Barack, Leonard I.
The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…
Woods, Carol M.
Differential item functioning (DIF) occurs when an item on a test or questionnaire has different measurement properties for one group of people versus another, irrespective of mean differences on the construct. There are many methods available for DIF assessment. The present article is focused on indices of partial association. A family of average…
Ritter, Lois A., Ed.; Sue, Valerie M., Ed.
Internet-based surveys are still relatively new, and researchers are just beginning to articulate best practices for questionnaire design. Online questionnaire design has generally been guided by the principles applying to other self-administered instruments, such as paper-based questionnaires. Web-based questionnaires, however, have the potential…
Gámez, Wakiza; Chmielewski, Michael; Kotov, Roman; Ruggero, Camilo; Suzuki, Nadia; Watson, David
The 62-item Multidimensional Experiential Avoidance Questionnaire (MEAQ) was recently developed to assess a broad range of experiential avoidance (EA) content. However, practical clinical and research considerations made a briefer measure of EA desirable. Using items from the original 62-item MEAQ, a 15-item scale was created that tapped content from each of the MEAQ's six dimensions. Items were selected on the basis of their performance in 3 samples: undergraduates (n = 363), psychiatric outpatients (n = 265), and community adults (n = 215). These items were then evaluated using 2 additional samples (314 undergraduates and 201 psychiatric outpatients) and cross-validated in 2 new, independent samples (283 undergraduates and 295 community adults). The resulting measure (Brief Experiential Avoidance Questionnaire; BEAQ) demonstrated good internal consistency. It also exhibited strong convergence with respect to each of the MEAQ's 6 dimensions. The BEAQ demonstrated expected associations with measures of avoidance, psychopathology, and quality of life and was distinguishable from negative affectivity and neuroticism.
Eys, Mark; Loughead, Todd; Bray, Steven R; Carron, Albert V
The purpose of the current study was to initiate the development of a psychometrically sound measure of cohesion for youth sport groups. A series of projects were undertaken in a four-phase research program. The initial phase was designed to garner an understanding of how youth sport group members perceived the concept of cohesion through focus groups (n = 56), open-ended questionnaires (n = 280), and a literature review. In Phase 2, information from the initial projects was used in the development of 142 potential items and content validity was assessed. In Phase 3, 227 participants completed a revised 87-item questionnaire. Principal components analyses further reduced the number of items to 17 and suggested a two-factor structure (i.e., task and social cohesion dimensions). Finally, support for the factorial validity of the resultant questionnaire was provided through confirmatory factor analyses with an independent sample (n = 352) in Phase 4. The final version of the questionnaire contains 16 items that assess task and social cohesion in addition to 2 negatively worded spurious items. Specific issues related to assessing youth perceptions of cohesion are discussed and future research directions are suggested.
The construction and psychometric analysis of patient satisfaction questionnaires are discussed. The discussion is based upon the classification of multi-item questionnaires into scales or indices. Scales consist of items that describe the effects of the latent psychological variable to be measured, and indices consist of items that describe the causes of this variable. Whether patient satisfaction questionnaires should be constructed and analyzed as scales or as indices depends upon the purpose for which these questionnaires are required. If the final aim is improving care with regard to patients’ preferences, then these questionnaires should be constructed and analyzed as indices. This implies two requirements: 1) items for patient satisfaction questionnaires should be selected in such a way that the universe of possible causes of patient satisfaction is covered optimally and 2) Cronbach’s alpha, principal component analysis, exploratory factor analysis, confirmatory factor analysis, and analyses with models from item response theory, such as the Rasch Model, should not be applied for psychometric analyses. Instead, multivariate regression analyses with a direct rating of patient satisfaction as the dependent variable and the individual questionnaire items as independent variables should be performed. The coefficients produced by such an analysis can be applied for selecting the best items and for weighting the selected items when a sum score is determined. The lower boundaries of the validity of the unweighted and the weighted sum scores can be estimated by their correlations with the direct satisfaction rating. While the first requirement is fulfilled in the majority of the previous patient satisfaction questionnaires, the second one deviates from previous practice. Hence, if patient satisfaction is actually measured with the final aim of improving care with regard to patients’ preferences, then future practice should be changed so that the second
Aucoin, Julia W
Professional development specialists have had little opportunity to learn how to write test items to meet the expectations of today's graduate nurse. Schools of nursing have moved away from knowledge-level test items and have had to develop more application and analysis items to prepare graduates for the National Council Licensure Examination (NCLEX). This same type of question can be used effectively to support a competence assessment system and document critical thinking skills.
Introduction Disability and Physical Function (PF) outcome assessment has had limited ability to measure functional status at the floor (very poor functional abilities) or the ceiling (very high functional abilities). We sought to identify, develop and evaluate new floor and ceiling items to enable broader and more precise assessment of PF outcomes for the NIH Patient-Reported-Outcomes Measurement Information System (PROMIS). Methods We conducted two cross-sectional studies using NIH PROMIS item improvement protocols with expert review, participant survey and focus group methods. In Study 1, respondents with low PF abilities evaluated new floor items, and those with high PF abilities evaluated new ceiling items for clarity, importance and relevance. In Study 2, we compared difficulty ratings of new floor items by low functioning respondents and ceiling items by high functioning respondents to reference PROMIS PF-10 items. We used frequencies, percentages, means and standard deviations to analyze the data. Results In Study 1, low (n = 84) and high (n = 90) functioning respondents were mostly White, women, 70 years old, with some college, and disability scores of 0.62 and 0.30. More than 90% of the 31 new floor and 31 new ceiling items were rated as clear, important and relevant, leaving 26 ceiling and 30 floor items for Study 2. Low (n = 246) and high (n = 637) functioning Study 2 respondents were mostly White, women, 70 years old, with some college, and Health Assessment Questionnaire (HAQ) scores of 1.62 and 0.003. Compared to difficulty ratings of reference items, ceiling items were rated to be 10% more to greater than 40% more difficult to do, and floor items were rated to be about 12% to nearly 90% less difficult to do. Conclusions These new floor and ceiling items considerably extend the measurable range of physical function at either extreme. They will help improve instrument performance in populations with broad functional ranges and those concentrated at
Wilson, C. Chrisman
This is a general discussion of the validity, reliability, function, and format of questionnaires designed to measure problem behavior, noncompliance, anxiety, social interaction, hyperactivity, drug use, and sexual behavior. Commonly used questionnaires are cited. (CP)
Rosenberg, Limor; Ratzon, Nava Z.; Jarus, Tal; Bart, Orit
The purpose of this manuscript was to develop and test the psychometric properties of the Environmental Restriction Questionnaire (ERQ), a parent-reported questionnaire for measuring perceived environmental restrictions on young children's participation. Reliability and homogeneity were tested by Cronbach's alpha and inter-item correlations.…
Biggs, John B.
This manual describes the theory behind the Study Process Questionnaire (SPQ) and explains what the subscale and scale scores mean. The SPQ is a 42-item self-report questionnaire used in Australia to assess the extent to which a tertiary student at a college or university endorses different approaches to learning and the motives and strategies…
Biggs, John B.
This manual describes the theory behind the Learning Process Questionnaire (LPQ) used in Australia and defines what the subscale and scale scores mean. The LPQ is a 36-item self-report questionnaire that yields scores on three basic motives for learning and three learning strategies, and on the approaches to learning that are formed by these…
Schmalz, Jonathan E.; Murrell, Amy R.
To date, general levels of experiential avoidance are primarily measured by the Acceptance and Action Questionnaire-II (AAQ-II), but it includes items of questionable comprehensibility. The Avoidance and Fusion Questionnaire for Youth (AFQ-Y), previously validated as a measure of experiential avoidance with children and adolescents, was…
Van Ginkel, Joost R.
The performance of multiple imputation in questionnaire data has been studied in various simulation studies. However, in practice, questionnaire data are usually more complex than simulated data. For example, items may be counterindicative or may have unacceptably low factor loadings on every subscale, or completely missing subscales may…
This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…
This pamphlet describes the exciting potential of item banking--a new approach to testing which combines comparability of scores with flexibility of test format. Item banks are collections of items where the characteristics of each item are known and these characteristics can be summated to describe a test made from such items. The principle…
Small novelty or promotional products, primarily used for outreach and educational purposes, must effectively convey a message, and their purchase will only be allowed if the item will contribute to the accomplishment of the Agency's mission.
Simone, Anna; Rota, Viviana; Tesio, Luigi; Perucca, Laura
ABILHAND is, in its original version, a 46-item, 4-level questionnaire. It measures the difficulty perceived by patients with rheumatoid arthritis as they do various daily manual tasks. ABILHAND was originally built through Rasch analysis. In a later study, it was simplified to a generic 23-item, three-level questionnaire, showing both…
García, Alexandra A
Survey data are compromised when respondents do not interpret questions in the way researchers expect. Cognitive interviews are used to detect problems respondents have in understanding survey instructions and items, and in formulating answers. This paper describes methods for conducting cognitive interviews and describes the processes and lessons learned with an illustrative case study. The case study used cognitive interviews to elicit respondents' understanding and perceptions of the format, instructions, items, and responses that make up the Diabetes Symptom Self-Care Inventory (DSSCI), a questionnaire designed to measure Mexican Americans' symptoms of type 2 diabetes and their symptom management strategies. Responses to cognitive interviews formed the basis for revisions in the format, instructions, items, and translation of the DSSCI. All those who develop and revise surveys are urged to incorporate cognitive interviews into their instrumentation methods so that they may produce more reliable and valid measurements.
Baranowski, Tom; Allen, Diane D.; Masse, Louise C.; Wilson, Mark
There has been some concern that participation in an intervention and exposure to a measurement instrument can change participants' interpretation of the items on a self-report questionnaire thereby distorting subsequent responses and biasing results. Differential item functioning (DIF) analysis using item response modeling can ascertain possible…
This study examines the factorial validity, factorial invariance across gender, and construct validity of a Swedish version of the Self-Presentation in Exercise Questionnaire (SPEQ; Conroy, Motl, & Hall, 2000). The a priori two-factor 14-item, 11-item, and 9-item models fail to reach acceptable levels of fit in a calibration sample. A modified…
Ligtvoet, Rudy; van der Ark, L. Andries; te Marvelde, Janneke M.; Sijtsma, Klaas
This article discusses the concept of an invariant item ordering (IIO) for polytomously scored items and proposes methods for investigating an IIO in real test data. Method manifest IIO is proposed for assessing whether item response functions intersect. Coefficient H[superscript T] is defined for polytomously scored items. Given that an IIO…
Gierl, Mark J.; Lai, Hollis
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
Cook, David I.
Contends that student evaluative questionnaires should be designed by instructors themselves to help improve their classroom performance and therefore should contain only questions that students are capable of answering objectively and not, for instance, questions about the relevancy of the course. Contains a sample questionnaire. (GH)
Leske, David A.; Holmes, Jonathan M.; Melia, B. Michele
IMPORTANCE The Intermittent Exotropia Questionnaire (IXTQ) is a patient, proxy, and parental report of quality of life specific to children with intermittent exotropia. We refine the IXTQ using Rasch analysis to improve reliability and validity. OBSERVATION Rasch analysis was performed on responses of 575 patients with intermittent exotropia enrolled from May 15, 2008, through July 24, 2013, and their parents from each of the 4 IXTQ health-related quality-of-life questionnaires (child 5 through 7 years of age and child 8 through 17 years of age, proxy, and parent questionnaires). Questionnaire performance and structure were confirmed in a separate cohort of 379 patients with intermittent exotropia. One item was removed from the 12-item child and proxy questionnaires, and response options in the 8- to 17-year-old child IXTQ and proxy IXTQ were combined into 3 response options for both questionnaires. Targeting was relatively poor for the child and proxy questionnaires. For the parent questionnaire, 3 subscales (psychosocial, function, and surgery) were evident. One item was removed from the psychosocial subscale. Resulting subscales had appropriate targeting. CONCLUSIONS AND RELEVANCE The Rasch-revised IXTQ may be a useful instrument for determining how intermittent exotropia affects health-related quality of life of children with intermittent exotropia and their parents, particularly for cohort studies. PMID:25634146
Zhang, Yuhai; Wang, Baoxi; Sun, Lijun; Shang, Lei
Background The objective of this study was to develop a questionnaire for caregivers to assess the eating behavior of Chinese preschoolers. Methods To assess children’s eating behaviors, 152 items were derived from a broad review of the literature related to epidemiology surveys and the assessment of children’s eating behaviors. All of these items were reviewed by 50 caregivers of preschoolers and 10 experienced pediatricians. Seventy-seven items were selected for use in a primary questionnaire. After conducting an exploratory factor analysis and a variability analysis on the data from 313 preschoolers used to evaluate this primary questionnaire, we deleted 39 of these 77 items. A Chinese Preschoolers’ Eating Behavior Questionnaire (CPEBQ) was finally established from the remaining 38 items. The structure of this questionnaire was explored by factor analysis, and its reliability, validity and discriminative ability were evaluated with data collected from caregivers of 603 preschoolers. Results The CPEBQ consisted of 7 dimensions and 38 items. The 7 dimensions were food fussiness, food responsiveness, eating habit, satiety responsiveness, exogenous eating, emotional eating and initiative eating. The Cronbach’s α coefficient for the questionnaire was 0.92, and the test-retest reliability was 0.72. The scores of normal-weight, overweight and obese preschoolers differed significantly with respect to food fussiness, food responsiveness, eating habits, satiety responsiveness and emotional eating (p<0.05). Differences in caregivers’ education levels also had significant effects on scores for food fussiness, eating habits and exogenous eating (p<0.05). Conclusions The CPEBQ satisfies the conditions of reliability and validity, in accordance with psychometric demands. The questionnaire can be employed to evaluate the characteristics of Chinese preschoolers’ eating behaviors; therefore, it can be used in child health care practice and
Background Back pain in children is common and early onset of back pain has been shown to increase the risk of back pain significantly in adulthood. Consequently, preventive efforts must be targeted at the young population, but research relating to spinal problems in this age group is scarce. Focus has primarily been on the working age population, and therefore specific questionnaires to measure spinal pain and its consequences, specifically aimed at children and adolescents, are absent. The purpose of this study was to develop a questionnaire for schoolchildren filling this gap. Methods The Young Spine Questionnaire (YSQ) was developed in three phases – a conceptualisation, development and testing phase. The conceptualisation phase followed the Wilson and Cleary model and included questions regarding spinal prevalence estimates, pain frequency and intensity, activity restrictions, care seeking behaviour and influence of parental back trouble. Items from existing questionnaires and the “Revised Faces Pain Scale” (rFPS) were included during the development phase. The testing phase consisted of a mixed quantitative and qualitative iterative method carried out in two pilot tests using 4th grade children and focusing on assessment of spinal area location and item validity. Results The testing phase resulted in omission of the pain drawings and simplification of the wording and answer categories of several questions. Agreement between the questionnaire prevalence estimates and the interviews ranged between 83.7% (cervical pain today) and 97.9% (thoracic pain today). To improve the understanding of the spinal boundaries we added bony landmarks to the spinal drawings after pilot test I. This resulted in an improved sense of spinal boundary location in pilot test II. Correlations between the rFPS and the interview pain score ranged between 0.67 (cervical spine) and 0.79 (lumbar spine). Conclusions The Young Spine Questionnaire contains questions that assess spinal pain
Myrseth, Helga; Notelaers, Guy
The aim of the present study was to address the weaknesses of the three-dimensional Gambling Motives Questionnaire and to examine the psychometric properties and factor structure of the Gambling Motives Questionnaire-Revised. The Gambling Motives Questionnaire was administered to a sample of 418 gamblers (92% men, mean age 19.5 years). Participants completed the Gambling Motives Questionnaire and an additional item tapping boredom, as well as a variety of measures of gambling behavior and gambling problems as criterion measures. Results showed that the Gambling Motives Questionnaire-Revised is better represented as a four-factor structure tapping the following gambling motives: enhancement, coping, social, and self-gratification, Δχ²(Δdf = 3) = 24.76, p<0.001. Removing two problematic items from the Gambling Motives Questionnaire and adding an extra item tapping boredom also improved the fit of the Gambling Motives Questionnaire-Revised. The subscales enhancement, social, and coping were all significant predictors of variety of gambling behaviors (p<0.05), whereas enhancement, coping, and self-gratification predicted frequency of gambling behaviors (p<0.01). Coping and self-gratification predicted loss of control (p<0.01), whereas self-gratification predicted gambling problems (p<0.001). The Gambling Motives Questionnaire-Revised, consisting of the four dimensions enhancement motives, social motives, coping motives and self-gratification motives, is a reliable and valid instrument for measuring gambling motives.
This study addresses several important issues in assessment of differential item functioning (DIF). It starts with the definition of DIF, effectiveness of using item fit statistics to detect DIF, and linear modeling of DIF in dichotomous items, polytomous items, facets, and testlet-based items. Because a common metric over groups of test-takers is a prerequisite in DIF assessment, this study reviews three such methods of establishing a common metric: the equal-mean-difficulty method, the all-other-item method, and the constant-item (CI) method. A small simulation demonstrates the superiority of the CI method over the others. As the CI method relies on a correct specification of DIF-free items to serve as anchors, a method of identifying such items is recommended and its effectiveness is illustrated through a simulation. Finally, this study discusses how to assess practical significance of DIF at both item and test levels.
Van Loey, N E; Hofland, H W; Hendrickx, H; Van de Steenoven, J; Boekelaar, A; Nieuwenhuis, M K
Itch (pruritus) is a common multidimensional complaint after burn that can persist for months to years. A questionnaire able to investigate itch and its consequences is imperative for clinical and research purposes. The current study investigated the factor structure, internal consistency and construct validity of the Burns Itch Questionnaire (BIQ), a questionnaire particularly focusing on itch in the burns population. The BIQ was completed by 195 respondents at 3 months after burn. An exploratory factor analysis (EFA) was performed to investigate the factor structure. EFA showed the BIQ comprised three latent factors: itch severity, sleep interference and daily life interference. This was re-evaluated in a confirmatory factor analysis that yielded good fit indices after removing two items. The three subscales showed high internal consistency (.89) and distinguished between patients with severe and less severe complaints. In conclusion, the BIQ proved useful in persons suffering from itch following burns.
Eaglen, R. L.
Plans are available for age-sensitive hardware management. Control plan identifies shelf life or age control requirements for materials considered age sensitive, use sensitive, or time service or shelf life controlled items, and describes methods of arriving at age controls through adherence to detailed specifications.
People who live in a democracy should be well informed of local, state, national, and international happenings. Students should become curious about news items and relate current happenings to the personal self. They must possess skills in word recognition and in diverse kinds of comprehension since reading is an important way to glean current…
Potocka, Adrianna; Najder, Anna
This article describes the development of the Eating Maturity Questionnaire, a self-reported measurement of eating maturity that initiates and gives direction to human eating behaviors. The Eating Maturity Questionnaire was designed to study individuals' biological and psychosocial motives for eating. The Eating Maturity Questionnaire is a 21-item tool with satisfactory psychometric values (Cronbach's α coefficients between 0.83 and 0.88) consisting of two subscales: Rational Eating and Psychosocial Maturity. Eating Maturity Questionnaire results may be used to design programs that target eating behaviors and body mass modification.
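Several abstracts in this listing report Cronbach's α as their measure of internal consistency. As a reader's aid, the following minimal sketch shows how the coefficient is conventionally computed from an item-score matrix, α = k/(k−1) · (1 − Σ item variances / variance of total scores). The function name `cronbach_alpha` and the demonstration data are hypothetical and do not come from any of the studies listed here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Transpose rows (respondents) into columns (items).
    items = list(zip(*scores))
    # Sum of the sample variances of the individual items.
    item_var_sum = sum(variance(col) for col in items)
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical data: 5 respondents answering 3 items on a 1-5 scale.
demo = [
    [1, 2, 1],
    [2, 2, 3],
    [3, 3, 3],
    [4, 5, 4],
    [5, 4, 5],
]
alpha = cronbach_alpha(demo)
print(round(alpha, 3))
```

The sketch uses sample variances (n − 1 in the denominator) throughout; using population variances for both terms gives the same ratio and therefore the same α.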
Çokluk, Ömay; Gül, Emrah; Dogan-Gül, Çilem
The study aims to examine whether differential item functioning is displayed in three different test forms that have item orders of random and sequential versions (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and bearing item difficulty levels in mind. In the correlational research, the…
Boccara, Olivia; Méni, Cecile; Léauté-Labreze, Christine; Bodemer, Christine; Voisard, Jean-Jacques; Dufresne, Hélène; Brauchoux, Sébastien; Taieb, Charles
To develop and validate a specific questionnaire to assess burden on families of children with infantile haemangioma (IH): the Haemangioma Family Burden questionnaire (HFB). Items were generated from a literature review and a verbatim report from parents. Subsequently, a study was implemented at the Necker Hospital and the Pellegrin Children's Hospital for psychometric analysis. The HFB was refined via item reduction according to inter-question correlations, consensus among experts and exploratory factor analysis. A 20-item questionnaire, grouped into 5 dimensions, was obtained. Construct validity was demonstrated and HFB showed good internal coherence (Cronbach's α: 0.93). The HFB was significantly correlated with the mental dimension of the Short-Form-12 (r = -0.75), and the Psychological General Well-Being Index (r = -0.61). HFB scores differed significantly according to the size and localization of the IH. A validated tool for assessing the burden on families of children with IH is now available.
Brennan, Laura; Siderowf, Andrew; Rubright, Jonathan D.; Rick, Jacqueline; Dahodwala, Nabila; Duda, John E.; Hurtig, Howard; Stern, Matthew; Xie, Sharon X.; Rennert, Lior; Karlawish, Jason; Shea, Judy A.; Trojanowski, John Q.; Weintraub, Daniel
Objective The aim of this work was to describe the development and psychometric analysis of the Penn Parkinson's Daily Activities Questionnaire. The questionnaire is an item response theory-based tool for rating cognitive instrumental activities of daily living in PD. Methods Candidate items for the Penn Parkinson's Daily Activities Questionnaire were developed through literature review and focus groups of patients and knowledgeable informants. Item selection and calibration of item-response theory parameters were performed using responses from a cohort of PD patients and knowledgeable informants (n = 388). In independent cohorts of PD patients and knowledgeable informants, assessments of test-retest reliability (n = 50), and construct validity (n = 68) of the questionnaire were subsequently performed. Construct validity was assessed by correlating questionnaire scores with measures of motor function, cognition, an existing activities of daily living measure, and directly observed daily function. Results Fifty items were retained in the final questionnaire item bank. Items were excluded owing to redundancy, difficult reading level, and when item-response theory parameters could not be calculated. Test-retest reliability was high (intraclass correlation coefficient = 0.97; P < 0.001). The questionnaire correlated strongly with cognition (r = 0.68; P < 0.001) and directly observed daily function (r = 0.87; P < 0.001), but not with motor impairment (r = 0.08; P = 0.53). The questionnaire score accurately discriminated between PD patients with and without dementia (area under the receiver operating characteristic curve = 0.91; 95% confidence interval: 0.85–0.97). Conclusions The Penn Parkinson's Daily Activities Questionnaire shows strong evidence of reliability and validity. Item response theory-based psychometric analysis suggests that this questionnaire can discriminate across a range of daily functions. PMID:26249849
the small development requirement (SOR) or qualitative materiel requirement (QMR) which led to the development of the item being tested. Analysis of...might be caused by the use of too much alcohol, marijuana, or hard drugs by upper-ranking officers, senior NCOs, or supervisors. h. Questions that
Halgunseth, Linda C.; Ispa, Jean M.
The present study was conducted in four phases and constructed a self-report parenting instrument for use with Mexican immigrant mothers of children aged 6 to 10. The 14-item measure was based on semistructured qualitative interviews with Mexican immigrant mothers (N = 10), was refined by a focus group of Mexican immigrant mothers (N = 5), and was…
Anoka-Hennepin Technical Coll., Minneapolis, MN.
This document contains test items to measure the job skills of electromechanical technicians. Questions are organized in four sections that cover the following topics: (1) shop math; (2) electricity and electronics; (3) mechanics and machining; and (4) plumbing, heating, ventilation and air conditioning, and welding skills. Questions call for…
Badia, Xavier; Webb, Susan M; Prieto, Luis; Lara, Nuria
Acromegaly is a chronic disease with an important impact on patients' Health-Related Quality of Life (HRQoL). The ability to effectively measure HRQoL is central to describing the impact of disease or treatment on the patient, hence the importance of having a disease-specific questionnaire for acromegaly. For the development of the AcroQoL questionnaire, different sources of information were used: first, a literature search was performed to identify relevant papers describing the impact of acromegaly on HRQoL; second, the main domains of impact on HRQoL were identified by 10 expert endocrinologists; and third, ten in-depth semi-structured interviews were conducted with acromegalic patients to identify domains and items related to the self-perceived impact of acromegaly on patients' lives. After a proper qualitative analysis, a preliminary 38-item questionnaire was obtained. Rasch analysis concluded with a final 22-item questionnaire. The measurement properties (validity and reliability) of the resulting final questionnaire were tested and compared using standard procedures (Cronbach's alpha and item-total correlation). The evaluation of the item parameters confirmed the construct validity of the new instrument. Responsiveness to change was assessed in a small sample of 32 acromegalic patients with active disease in Spain who were administered the AcroQoL and the generic questionnaire EuroQoL 5-D. The results showed a statistically significant relationship between all the dimensions of AcroQoL and the VAS (visual analogue scale) of EQ-5D. An improvement in the global score of AcroQoL was related to a global improvement in the VAS of the EQ-5D. Following the current recommended standard methodology, the Spanish questionnaire was translated into eleven other languages. PMID:14987332
Feldman, David L
Reduction in retained surgical items is an important part of any operating room patient-safety effort. Any item used in an operation can result in a retained surgical item, but sponges are the most frequent and the abdomen is the most common location. Retained sponges can cause significant morbidity, and the costs associated with both prevention and treatment of retained surgical items, including legal costs, can be considerable. This review will examine counting, teamwork, radiography, and new technology as methods used to prevent retained surgical items. Even though none of these techniques individually is likely to completely prevent retained surgical items, when used together the numbers can be reduced.
Bukenya, Richard; Ahmed, Abhiya; Andrade, Jeanette M.; Grigsby-Toussaint, Diana S.; Muyonga, John; Andrade, Juan E.
This study sought to develop and validate a general nutrition knowledge questionnaire (GNKQ) for Ugandan adults. The initial draft consisted of 133 items on five constructs associated with nutrition knowledge: expert recommendations (16 items), food groups (70 items), selecting food (10 items), nutrition and disease relationship (23 items), and food fortification in Uganda (14 items). The questionnaire validity was evaluated in three studies. For the content validity (study 1), a panel of five nutrition content experts reviewed the GNKQ draft before and after face validity. For the face validity (study 2), head teachers and health workers (n = 27) completed the questionnaire before attending one of three focus groups to review the clarity of the items. For the construct validity and test-retest reliability (study 3), head teachers (n = 40) from private and public primary schools and nutrition (n = 52) and engineering (n = 49) students from Makerere University took the questionnaire twice (two weeks apart). Experts agreed (content validity index, CVI > 0.9; reliability, Gwet’s AC1 > 0.85) that all constructs were relevant to evaluate nutrition knowledge. After the focus groups, 29 items were identified as unclear, requiring major (n = 5) and minor (n = 24) reviews. The final questionnaire had acceptable internal consistency (Cronbach α > 0.95), test-retest reliability (r = 0.89), and differentiated (p < 0.001) nutrition knowledge scores between nutrition (67 ± 5) and engineering (39 ± 11) students. Only the construct on nutrition recommendations was unreliable (Cronbach α = 0.51, test-retest r = 0.55), which requires further optimization. The final questionnaire included topics on food groups (41 items), selecting food (2 items), nutrition and disease relationship (14 items), and food fortification in Uganda (22 items) and had good content validity, construct validity, and test-retest reliability to evaluate nutrition knowledge among Ugandan adults. PMID:28230779
Peer, Eyal; Gamliel, Eyal
When respondents answer paper-and-pencil (PP) questionnaires, they sometimes modify their responses to correspond to previously answered items. As a result, this response bias might artificially inflate the reliability of PP questionnaires. We compared the internal consistency of PP questionnaires to computerized questionnaires that presented a…
Johnson, John A.
This study describes the relation between personality items' validities, defined as the items' correlations with acquaintance ratings on the Big 5 personality factors, and other itemmetric properties including ambiguity, syntactic complexity, social desirability, content, and trait indicativity. Five external validity coefficients for each item on…
Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua
Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. Three scaling procedures are considered: (a) concurrent…
Green, Kathy E.; Kluever, Raymond C.
Item components that might contribute to the difficulty of items on the Raven Colored Progressive Matrices (CPM) and the Standard Progressive Matrices (SPM) were studied. Subjects providing responses to CPM items were 269 children aged 2 years 9 months to 11 years 8 months, most of whom were referred for testing as potentially gifted. A second…
Salgado, Felipe Almuna; Stacey, Kaye
This paper reports how the context in which a mathematics item is embedded impacts on students' performance. The performance of Year 10 students on four PISA items was compared with performance on variants with more familiar contexts. Performance was not better when they solved items with more familiar contexts but there was some evidence that…
Andrich, David; Hagquist, Curt
Differential item functioning (DIF) for an item between two groups is present if, for the same person location on a variable, persons from different groups have different expected values for their responses. Applying only to dichotomously scored items in the popular Mantel-Haenszel (MH) method for detecting DIF in which persons are classified by…
Sanders, Emma; Hill, Catherine Mary; Evans, Hazel Jean; Tuffrey, Catherine
Obstructive sleep apnea is a condition which affects an estimated 50% of children with Down syndrome, particularly in their early years. It can cause serious sequelae in affected children but may not be recognized by parents or health professionals. Routine screening has been recommended in some countries, but is not standard practice. There are no validated questionnaire-based tools available to screen this population of children for this particular sleep-related disorder. Using existing validated sleep questionnaire items, we have developed a questionnaire to screen children with Down syndrome up to 6 years of age for obstructive sleep apnea, which corresponds with the recommendations made in UK national guidelines. This paper describes these first steps in demonstrating content validity for a new questionnaire, which will be subject to further in-depth psychometric analysis. Relevance, clarity, and age appropriateness were rated for 33 items using a content review questionnaire by a group of 18 health professionals with expertise in respiratory pediatrics, neurodevelopmental pediatrics, and sleep physiology. The content validity index was calculated for individual items and contributed to decisions about item inclusion. Scale level content validity index for the modified questionnaire of 14 items was at an accepted level of 0.78. Two parents of children with Down syndrome took part in cognitive interviews after completing the modified questionnaire. We describe the development of this 14 item questionnaire to screen for OSA in children with DS from infancy to 6 years. PMID:26539127
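The content validity index mentioned in this abstract is usually computed per item (I-CVI) as the share of experts who rate the item relevant, and at scale level as the average of the item values (S-CVI/Ave). The sketch below assumes a 4-point relevance scale on which ratings of 3 or 4 count as "relevant"; the abstract does not state the scale or cut-off used, so both are assumptions, as are the function names.

```python
from typing import List


def item_cvi(ratings: List[int], relevant_min: int = 3) -> float:
    """I-CVI: proportion of experts whose rating is >= relevant_min.

    The 4-point scale and the cut-off of 3 are assumed conventions,
    not details taken from the abstract.
    """
    if not ratings:
        raise ValueError("no ratings supplied")
    return sum(r >= relevant_min for r in ratings) / len(ratings)


def scale_cvi_average(rating_matrix: List[List[int]], relevant_min: int = 3) -> float:
    """S-CVI/Ave: mean of the item-level CVIs; one row of ratings per item."""
    i_cvis = [item_cvi(row, relevant_min) for row in rating_matrix]
    return sum(i_cvis) / len(i_cvis)


if __name__ == "__main__":
    # Hypothetical ratings from four experts on two items.
    ratings = [[4, 4, 4, 4], [4, 3, 2, 1]]
    print([item_cvi(row) for row in ratings], scale_cvi_average(ratings))
```

Item-level values of this kind are what would inform decisions about dropping or rewording individual items, while the averaged value summarises the retained set.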
Woods, Carol M.
Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another. One way to test items with ordinal response scales for DIF is likelihood ratio (LR) testing using item response theory (IRT), or IRT-LR-DIF. Despite the various advantages of…
This article describes the typographic principles and practice which provide the basis of good design and print, the relevant printing processes which can be used, and the graphic designer's function in questionnaire production. As they impose constraints on design decisions to be discussed later in the text, the various methods of printing and production are discussed first.
Grinberg, Ilyse; Dawkins, Marva; Dawkins, Marvin P.; Fullilove, Constance
Initial validation was sought for the Life-Challenges Questionnaire-Teen Form, a 120-item youth-risk assessment tool. The questionnaire was administered to 99 students enrolled in an adolescent detention facility and a comparison group of 305 students attending high school. The survey items included correlates of youth violence and categorized…
Choy, S. Chee; Goh, Pauline Swee Choo; Sedhu, Daljeet Singh
The development of the 21-item Learner Awareness Levels Questionnaire (LALQ) was carried out using data from three separate studies. The LALQ is a self-reporting questionnaire assessing how and why students learn. Study 1 refined the initial pool of items to 21 using exploratory factor analysis. In Study 2, the analysis showed evidence for a…
Malandrakis, George N.
This study focuses on children's understanding of hazardous household items (HHI) and waste (HHW). Children from grades 4, 5 and 6 (n=173) participated in a questionnaire and interview research design. The results indicate that: (a) on a daily basis the children used HHI and disposed of HHW, (b) the children did not realize the danger of these…
Ranger, Jochen; Ortner, Tuulia M.
Recent studies have revealed a relation between the given response and the response latency for personality questionnaire items in the form of an inverted-U effect, which has been interpreted in light of schema-driven behavior. In general, more probable responses are given faster. In the present study, the relationship between the probability of…
Chen, Cheng-Te; Wang, Wen-Chung
This study explores the effects of ignoring item interaction on item parameter estimation and the efficiency of using the local dependence index Q3 and the SAS NLMIXED procedure to detect item interaction under the three-parameter logistic model and the generalized partial credit model. Through simulations, it was found that ignoring…
Grudell, A B M; Alexander, J A; Enders, F B; Pacifico, R; Fredericksen, M; Wise, J L; Locke, G R; Arora, A; Zais, T; Talley, N J; Romero, Y
While multiple instruments characterize upper gastrointestinal symptoms, a validated instrument devoted to the measurement of a spectrum of esophageal dysphagia attributes is not available. Therefore, we constructed and validated the Mayo Dysphagia Questionnaire (MDQ). The 27 items of the MDQ underwent content validity, feasibility, concurrent validity, reproducibility, internal consistency, and construct validity testing. To assess content validity, five esophageal subspecialty gastroenterologists reviewed the items to ensure inclusion of pertinent domains. Feasibility testing was done with eight outpatients who refined problematic items. To assess concurrent validity, 70 patient responses on the MDQ were compared to responses gathered in a structured patient-physician interview. A separate group of 70 outpatients completed the MDQ twice to assess the reproducibility of each item. A total of 148 patients participated in the validation process (78 [53%] men; mean age 62). On average, the MDQ took 6 minutes to complete. A single item (odynophagia) tested poorly with a kappa value of <0.4. Otherwise, the majority of concurrent validity kappa values were in the good to excellent range with a mean of 0.63 (95% CI 0.22-0.89). The majority of reproducibility kappa values were also in the good to excellent range with a median kappa value of 0.76 (interquartile range: 0.67-0.81). Cronbach's alpha values were excellent in the range of 0.86-0.88. Spearman rank correlation coefficients to assess construct validity were also excellent in the range of 0.87-0.98. Thus, the MDQ is a concise instrument that demonstrates overall excellent concurrent validity, reproducibility, internal consistency, and construct validity for the features of esophageal dysphagia.
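The concurrent validity and reproducibility results above are expressed as kappa values, which correct raw agreement for the agreement expected by chance. A minimal sketch follows, assuming the unweighted Cohen's kappa for two paired sets of categorical answers (for example questionnaire versus interview, or first versus second administration); the abstract does not say whether a weighted variant was used, and the function name is illustrative.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohen_kappa(first: Sequence[Hashable], second: Sequence[Hashable]) -> float:
    """Unweighted Cohen's kappa for two paired sequences of category labels."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(first)
    # Observed proportion of exact agreement.
    p_observed = sum(a == b for a, b in zip(first, second)) / n
    # Agreement expected by chance from the marginal category frequencies.
    counts_first, counts_second = Counter(first), Counter(second)
    categories = set(first) | set(second)
    p_expected = sum(counts_first[c] * counts_second[c] for c in categories) / n**2
    if p_expected == 1:
        raise ValueError("kappa is undefined when only one category is used")
    return (p_observed - p_expected) / (1 - p_expected)


if __name__ == "__main__":
    # Hypothetical yes/no answers from four patients on two occasions.
    print(cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0]))
```

In the hypothetical example, three of four answers agree (0.75), chance agreement is 0.5, and kappa is therefore 0.5, which shows how a seemingly high raw agreement can correspond to only moderate kappa.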
Kang, Eunjeong; Brannan, Ana Maria; Heflinger, Craig Anne
The aim of this study was to examine differences in responses to the Caregiver Strain Questionnaire (CGSQ) between African American and White caregivers of children with emotional and behavioral challenges. Significant item- and scale-level differences were detected across groups with African Americans consistently reporting less strain. We…
Heller, Eric S.; Rife, Frank N.
The goal of this study was to assess the relative merit of various ranges and types of response scales in terms of respondent satisfaction and comfort and the nature of the elicited information in a population of seventh grade students. Three versions of an attitudinal questionnaire, each containing the same items but employing a different…
Toll, Benjamin A.; McKee, Sherry A.; Krishnan-Sarin, Suchitra; O'Malley, Stephanie S.
This study assessed the factor structure of the Questionnaire on Smoking Urges (QSU), a commonly used assessment of cravings for cigarettes, with a sample of smokers presenting for treatment in a smoking cessation trial. On the basis of previous research, three confirmatory factor analytic models were tested. Model 1 hypothesized a 26-item,…
Joyce, Tan Bei Yu; Yates, Shirley M.
This study used the Rasch model to assess the unidimensionality and item-person fit of an Academic Self-Concept Questionnaire (ASCQ) that is based on the Confucian Heritage Culture (CHC) perspective. Knowledge of the relationship between academic achievement and academic self-concept is particularly useful because academic achievement is…
Tokar, David M.; Buchanan, Taneisha S.; Subich, Linda M.; Hall, Rosalie J.; Williams, Christine M.
The underlying factor structure of the Learning Experiences Questionnaire (LEQ; Schaub, 2004) was examined using data from 742 male and female college-age respondents. The LEQ items reflect a variety of learning experiences (generated based on Bandura's (1986, 1997) four sources of self-efficacy perceptions) that might occur in each of Holland's…
Gordon-Hollingsworth, Arlene T.; Thompson, Julia E.; Geary, Meghan A.; Schexnaildre, Mark A.; Lai, Betty S.; Kelley, Mary Lou
The Social Support Questionnaire for Children (SSQC) is a 50-item scale that assesses children's social support from parents, relatives, nonrelative adults, siblings, and peers. The SSQC demonstrates good psychometric properties (e.g., internal consistency, factorial validity). Furthermore, the SSQC appears to be an ethnically sensitive measure of…
Stacey, Susan E.
The basic steps to be followed in the development of a questionnaire are as follows: (1) specify the goals of the study by listing all the questions the investigation is to answer; (2) review the literature related to topics being studied; (3) define all abstract concepts included in the list of objectives; (4) decide what type of item format will…
Wilson, Brenda N.; Crawford, Susan G.; Green, Dido; Roberts, Gwen; Aylott, Alice; Kaplan, Bonnie J.
The Developmental Coordination Disorder Questionnaire (DCDQ) is a parent-completed measure designed to identify subtle motor problems in children of 8 to 14.6 years of age. The purpose of this study was to extend the lower age range to children aged 5 to 7 years, revise items to ensure clarity, develop new scoring, and evaluate validity of the…
Bogg, Richard A.; And Others
This questionnaire assesses drug use practices and attitudes toward drugs in high school students. The instrument has 59 items (multiple choice or completion), some with several parts. The questions pertain to aspirations for the future, general attitudes and opinions, biographic and demographic data, family background and relationships, alcohol…
de Araujo Toloi, Diego; Uema, Deise; Matsushita, Felipe; da Silva Andrade, Paulo Antonio; Branco, Tiago Pugliese; de Carvalho Chino, Fabiana Tomie Becker; Guerra, Raquel Bezerra; Pfiffer, Túlio Eduardo Flesch; Chiba, Toshio; Guindalini, Rodrigo Santa Cruz; Sulmasy, Daniel P; Riechelmann, Rachel P
Objectives: Spirituality is related to the care and the quality of life of cancer patients. Thus, it is very important to assess their needs. The objective of this study was the translation and cultural adjustment of the Spiritual Needs Assessment for Patients (SNAP) questionnaire to the Brazilian Portuguese language. Methodology: The translation and cultural adjustment of the SNAP questionnaire involved six stages: backtranslation, revision of backtranslation, translation to the original language and adjustments, pre-test on ten patients, and test and retest with 30 patients after three weeks. Literate adult patients with a solid tumour and a minimum of four years of schooling were included. To analyse consistency, we calculated the Cronbach alpha coefficient and the Pearson linear correlation. Results: The final questionnaire had some language and content adjustments compared to the original version in English. The correlation analysis of each item with the total score of the questionnaire showed coefficients above 0.99. The Cronbach alpha coefficient was 0.9. The Pearson linear correlation between the test and retest of the questionnaire was 0.95. Conclusion: The SNAP questionnaire translated into Brazilian Portuguese is adequately reliable and consistent. This instrument allows adequate access to spiritual needs and can help patient care. PMID:28101137
Loas, Gwenolé; Yon, Valerie; Monestès, Jean Louis; Cuesta, Manuel J
Long-term reliability of the Frankfurt Complaint Questionnaire (FCQ) was investigated in two follow-up studies of participants with psychosis using a test-retest method. In the first study (N = 56), the duration of the follow-up ranged from 6 months to 2 years; Spearman rho was .62 for the abridged (18 items) Spanish version of the questionnaire. In Study 2 (N = 21), in participants with stable schizophrenia, the follow-up ranged from 8 to 11 years; test-retest Spearman rho was .83 for the French version of the questionnaire. Subjective experiences could constitute, in psychosis-prone people, traits or markers of psychotic vulnerability.
van der Zee, Karen; van Oudenhoven, Jan Pieter; Ponterotto, Joseph G; Fietzer, Alexander W
This study reports on the development of the Multicultural Personality Questionnaire-Short Form among 511 participants. Using a split-sample scale validation design, Study 1 (N = 260) employed a principal component analysis and rigorous item selection criteria to extract a 40-item short form (MPQ-SF) from the original 91-item Multicultural Personality Questionnaire (MPQ; van der Zee & van Oudenhoven, 2000, 2001). In Study 2 (N = 251), the MPQ-SF was subjected to confirmatory factor analysis and resulted in a reasonably good fit to the data (comparative fit index = .94; root mean squared error of approximation = .066). Satisfactory coefficient alphas and high correlations with the original scales were found. Moreover, relationships with related scales were largely in the predicted direction. Specific directions for follow-up research are posited.
Pérez Rodrigo, Carmen; Aranceta, Javier; Salvador, Gemma; Varela-Moreiras, Gregorio
Food Frequency Questionnaires (FFQs) are dietary assessment tools that have been widely used since the early '90s in epidemiological studies investigating the relationship between dietary intake and disease or risk factors. The three main components of these questionnaires are the list of foods, the frequency of consumption, and the portion size consumed. The food list should reflect the food habits of the study population at the time the data are collected. The frequency of consumption may be asked by open-ended questions or by presenting frequency categories. Qualitative FFQs do not ask about the consumed portions; semi-quantitative questionnaires include standard portions; and quantitative questionnaires ask respondents to estimate the portion size consumed either in household measures or grams. The latter implies a greater participant burden. Some versions include only closed-ended questions in a standardized format, while others add an open section with questions about some specific food habits and practices and allow additions to the food list for foods and beverages consumed that are not included. The method can be self-administered, on paper or web-based, or interview-administered either face-to-face or by telephone. Owing to the standard format, especially of closed-ended versions, and the method of administration, FFQs are highly cost-effective, which encourages their widespread use in large-scale epidemiological cohort studies and also in other study designs. Coding and processing the data collected is also less costly and requires less nutrition expertise compared to other dietary intake assessment methods. However, the main limitations are systematic errors and biases in estimates. Considerable efforts are under way to improve the quality of the information. Using FFQs together with other methods has been recommended, as this enables the required adjustments.
Diwan, Jasmin; Patel, Pankaj; Bansal, Ankita B.
Background: ABILOCO-Kids is a measure of locomotion ability for children with cerebral palsy (CP) aged 6 to 15 years and is available in English and French. Aim: To validate the Gujarati version of the ABILOCO-Kids questionnaire for use in clinical research on the Gujarati population. Materials and Methods: The ABILOCO-Kids questionnaire was translated into Gujarati from English using the forward-backward-forward method. To ensure face and content validity of the Gujarati version using the group consensus method, each item was examined by a group of experts with a mean experience of 24.62 years in the field of paediatrics and paediatric physiotherapy. Each item was analysed for content, meaning, wording, format, ease of administration, and scoring. Each item was scored by the expert group as accepted, rejected, or accepted with modification. The procedure was continued until 80% consensus was reached for all items. Concurrent validity was examined in 55 children with CP (6-15 years) of all Gross Motor Function Classification System (GMFCS) levels and all clinical types by correlating the ABILOCO-Kids score with the Gross Motor Function Measure (GMFM) and GMFCS. Results: In phase 1 of validation, 16 items were accepted as is, 22 items were accepted with modification, and 3 items went on to phase 2 validation. For concurrent validity, a highly significant positive correlation was found between the ABILOCO-Kids score and total GMFM (r=0.713, p<0.005), and a highly significant negative correlation with GMFCS (r= -0.778, p<0.005). Conclusion: The Gujarati translated version of the ABILOCO-Kids questionnaire has good face and content validity as well as concurrent validity and can be used to measure caregiver-reported locomotion ability in children with CP. PMID:26557603
Potter, Lori P; Mathias, Susan D; Raut, Monika; Kianifard, Farid; Tavakkol, Amir
Background: This research was conducted to confirm the validity and reliability and to assess the responsiveness and clinical meaningfulness of the OnyCOE-t™, a questionnaire specifically designed to measure patient-reported outcomes (PRO) associated with toenail onychomycosis. Methods: 504 patients with toenail onychomycosis randomized to receive 12 weeks of terbinafine 250 mg/day with or without target toenail debridement in the IRON-CLAD® trial completed the OnyCOE-t™ at baseline, weeks 6, 12, 24, and 48. The OnyCOE-t™ is composed of 6 multi-item scales and 1 single-item scale. These include a 7-item Toenail Symptom assessment, which comprises both Symptom Frequency and Symptom Bothersomeness scales; an 8-item Appearance Problems scale; a 7-item Physical Activities Problems scale; a 1-item Overall Problem scale; a 7-item Stigma scale; and a 3-item Treatment Satisfaction scale. In total, 33 toenail onychomycosis-specific items are included in the OnyCOE-t™. Clinical data, in particular the percent clearing of mycotic involvement in the target toenail, and OnyCOE-t™ responses were used to evaluate the questionnaire's reliability, validity, responsiveness, and the minimal clinically important difference (MCID). Results: The OnyCOE-t™ was shown to be reliable and valid. Construct validity and known groups validity were acceptable. Internal consistency reliability of multi-item scales was demonstrated by Cronbach's alpha > .84. Responsiveness was good, with the Treatment Satisfaction, Symptom Frequency, Overall Problem, and Appearance Problem scales demonstrating the most responsiveness (Guyatt's statistic of 1.72, 1.31, 1.13, and 1.11, respectively). MCID was evaluated for three different clinical measures, and indicated that approximately an 8.5-point change (on a 0 to 100 scale) was clinically meaningful based on a 25% improvement in target nail clearing. Conclusion: The OnyCOE-t™ questionnaire is a unique, toenail-specific PRO questionnaire that can be
Giaglis, G; Angelidis, G
In the context of the psychiatric reform, as well as of the lifelong education, a Questionnaire for the evaluation of "Satisfaction from Psychiatric Training" has been constructed. It consists of 4 subscales (Satisfaction from Materials, Trainers, Program Organization, and General Satisfaction) and a total of 19 closed-ended items, evaluated in 5-point Likert scales, and an open-ended question for general remarks. One hundred and seventy six subjects, who participated in 8 consecutive training programs in psychiatry, organized by the Vocational Training Center of the Psychiatric Hospital of Petra Olympus, Greece, completed the questionnaire anonymously. The sample was divided into two groups: group A (N=112, from the first 5 programs), for the evaluation of the questionnaire's properties, and group B (N=65, from the next 3 programs) for the validation of the results. Principal component analysis in group A showed the existence of 4 factors corresponding to the 4 subscales and accounted for 67.4% of the questionnaire's variability, which were also confirmed in group B. Internal consistency was high in both groups for the overall questionnaire (Cronbach α>0.92) and for each subscale. Test-retest reliability of every subscale was also high (Pearson's r>0.90). The answers in the open-ended remark question were graded by two independent judges in a 5-point Likert scale, in relation to the satisfaction they revealed, which was highly correlated with all questionnaire subscales but one. For the total sample, the questionnaire subscales showed moderately high correlation with one another (r from 0.629 to 0.706) and even higher with the overall score (from 0.820 to 0.892). The questionnaire's sensitivity was demonstrated by the statistically significant differences observed in the satisfaction experienced from the various programs. None of the subscales was significantly correlated with age (r<0.134), with years in work (r<0.059) or was differentiated by gender. In general
Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J.
A total of 520 high school students were randomly assigned to a paper-and-pencil test (PPT), a computerized standard test (CST), or a computerized adaptive test (CAT) version of the Dutch School Attitude Questionnaire (SAQ), consisting of ordinal polytomous items. The CST administered items in the same order as the PPT. The CAT administered all…
... days for Category IA items and 60 calendar days for Category IB items contained in a vault or in a... days for Category IA items and seven calendar days for Category BI items located elsewhere in the...
Gilioli, R; Cassitto, M G; Campanini, P; Punzi, S; Consonni, D; Rengo, C; Fattorini, E; Foá, V
The aim of the study is to develop and validate a questionnaire able to evaluate the risk of mobbing at the workplace. A multiple-choice questionnaire has been developed which contains, among the different items, only one revealing a mobbing situation. The questionnaire has been administered to two groups (group A: 243 subjects in a mobbing situation; group B: 63 subjects without exposure to mobbing) and the differences in the scores obtained have been analysed. The questionnaire has proved to be valid and reliable. The results show that the presence of five mobbing actions is sufficient to define the workplace situation as potentially at risk for mobbing. The study reveals some limitations in the selection of the two samples, which calls for some adjustment. However, the questionnaire, even in its present form, can be considered a tool able to detect mobbing situations.
Gray, Joshua C; Amlung, Michael T; Palmer, Abraham A; MacKillop, James
The 27-item Monetary Choice Questionnaire (MCQ; Kirby, Petry, & Bickel, 1999) and 30-item Probability Discounting Questionnaire (PDQ; Madden, Petry, & Johnson, 2009) are widely used, validated measures of preferences for immediate versus delayed rewards and guaranteed versus risky rewards, respectively. The MCQ measures delayed discounting by asking individuals to choose between rewards available immediately and larger rewards available after a delay. The PDQ measures probability discounting by asking individuals to choose between guaranteed rewards and a chance at winning larger rewards. Numerous studies have implicated these measures in addiction and other health behaviors. Unlike typical self-report measures, the MCQ and PDQ generate inferred hyperbolic temporal and probability discounting functions by comparing choice preferences to arrays of functions to which the individual items are preconfigured. This article provides R and SPSS syntax for processing the MCQ and PDQ. Specifically, for the MCQ, the syntax generates k values, consistency of the inferred k, and immediate choice ratios; for the PDQ, the syntax generates h indices, consistency of the inferred h, and risky choice ratios. The syntax is intended to increase the accessibility of these measures, expedite the data processing, and reduce risk for error.
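The hyperbolic functions referred to above have the form V = A / (1 + kD) for delay discounting and V = A / (1 + hθ) for probability discounting, where θ = (1 − p) / p is the odds against winning. The sketch below is not the R or SPSS syntax the article provides; it is a simplified Python illustration, with hypothetical function names and made-up item values, of how each item implies an indifference value of k (or h) and how the consistency of a candidate k with a respondent's choices can be scored.

```python
from typing import List, Tuple


def indifference_k(immediate: float, delayed: float, delay_days: float) -> float:
    """k at which immediate == delayed / (1 + k * delay_days)."""
    return (delayed / immediate - 1.0) / delay_days


def indifference_h(certain: float, risky: float, p_win: float) -> float:
    """h at which certain == risky / (1 + h * odds_against)."""
    odds_against = (1.0 - p_win) / p_win
    return (risky / certain - 1.0) / odds_against


def consistency(candidate_k: float,
                items: List[Tuple[float, float, float]],
                chose_delayed: List[bool]) -> float:
    """Share of observed choices that match the choices predicted for candidate_k.

    A respondent whose discount rate is below an item's indifference k
    is predicted to pick the larger, delayed reward on that item.
    """
    hits = 0
    for (immediate, delayed, days), chose in zip(items, chose_delayed):
        predicted_delayed = candidate_k < indifference_k(immediate, delayed, days)
        hits += predicted_delayed == chose
    return hits / len(items)


if __name__ == "__main__":
    # Hypothetical items: (immediate amount, delayed amount, delay in days).
    items = [(50, 100, 10), (50, 100, 100)]
    print([indifference_k(*item) for item in items])
    print(consistency(0.03, items, [True, False]))
```

A full scoring routine would evaluate a grid of candidate values (for example, points between adjacent item indifference values) and report the candidate with the highest consistency; that search is omitted here to keep the sketch short.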
Knoll, Ross W; Valentiner, David P; Holzman, Jacob B
The purpose of the current studies is to identify safety behavior dimensions relevant to test anxiety, to develop a questionnaire to assess those dimensions, and to examine the validity of that questionnaire. Items were generated from interviews with college students (N = 24). Another sample (N = 301) completed an initial 33-item measure. Another sample (N = 151) completed the final 19-item version of the Safety Behaviors in Test Anxiety Questionnaire and provided access to their academic records. Interviews and expert evaluations were used to select items for the initial pool. An examination of item distributions and exploratory factor analysis were used to identify dimensions and reduce the item pool. Confirmatory factor analyses were used to validate the factorial structure. Correlational analyses were used to examine criterion validity of the final measure. The Safety Behaviors in Test Anxiety Questionnaire consists of a 9-item "Superstitious Behaviors" scale and a 10-item "Reassurance Seeking" scale. The measure shows good content validity, factorial validity, internal consistency, and convergent and discriminant validity. Only the Reassurance Seeking scale showed good incremental criterion validity. Overall, these findings suggest that reassurance seeking may be a neglected target for interventions that might increase performance on high stakes tests.
Cumming, Jennifer; Woodcock, Charlotte; Cooley, Sam J.; Holland, Mark J. G.; Burns, Victoria E.
The aim of the present study was to develop and provide psychometric evidence in support of the groupwork skills questionnaire (GSQ) for measuring task and interpersonal groupwork skills. A 46-item version of the GSQ was initially completed by 672 university students. The number of items was reduced to 15 following exploratory factor analyses, and…
Ferrando, Pere J.; Lorenzo-Seva, Urbano; Chico, Eliseo
This article proposes procedures for simultaneously assessing and controlling acquiescence and social desirability in questionnaire items. The procedures are based on a semi-restricted factor-analytic tridimensional model, and can be used with binary, graded-response, or more continuous items. We discuss procedures for fitting the model (item…
Motevalian, Seyed Abbas; Asadi-Lari, Mohsen; Rahimi, Habibollah; Eftekhar, Mehrdad
In Iran, road traffic injuries are the leading cause of burden of disease and motorcyclists are the most vulnerable road users. Elliot and colleagues developed the "Motorcycle Rider Behavior Questionnaire" (MRBQ) in 2007 on the basis of Reason's "Driver Behavior Questionnaire" (DBQ). The purpose of this study was to assess the validity and reliability of a Persian version of the MRBQ. The 43-item MRBQ was adapted to Persian according to the translation-back translation method. The questionnaire was significantly revised after assessment of content validity. In the revised version, 10 items of the original MRBQ were deleted and 15 new items were added. The revised MRBQ was used in a survey of 518 motorcyclists. To assess the construct validity of the MRBQ, we administered the Buss-Perry Aggression questionnaire concurrently to all of the subjects. After three weeks, we carried out the retest study on 119 of the 518 subjects. The mean age of the subjects was 32.5 years (SD=8.8). All of the participants were male, with a mean of 9.3 years of motorcycle riding experience (SD=7.3). Principal Components Analysis (PCA) showed six subscales: "Speed Violations", "Traffic Errors", "Safety Violations", "Traffic Violations", "Stunts" and "Control Errors", which together accounted for 36.44% of total variance. For each of these subscales, Cronbach's Alpha was between 0.79 and 0.91. Intraclass Correlation Coefficients for the six subscales and the total questionnaire ranged from 0.73 to 0.91. There were significant correlations between MRBQ subscales and subscales of the Buss-Perry aggression questionnaire. The results indicated that the 48-item Persian version of the MRBQ is a suitable measure for studying motorcyclists' behavior.
Powell, Danny H; Elwood Jr, Robert H
During the survey, respondents are asked to provide qualitative answers (well, adequate, needs improvement) on how well material control and accountability (MC&A) functions are being performed. These responses can be used to develop failure probabilities for basic events performed during routine operation of the MC&A systems. The failure frequencies for individual events may be used to estimate total system effectiveness using a fault tree in a probabilistic risk analysis (PRA). Numeric risk values are required for the PRA fault tree calculations that are performed to evaluate system effectiveness. So, the performance ratings in the questionnaire must be converted to relative risk values for all of the basic MC&A tasks performed in the facility. If a specific material protection, control, and accountability (MPC&A) task is being performed at the 'perfect' level, the task is considered to have a near zero risk of failure. If the task is performed at a less than perfect level, the deficiency in performance represents some risk of failure for the event. As the degree of deficiency in performance increases, the risk of failure increases. If a task that should be performed is not being performed, that task is in a state of failure. The failure probabilities of all basic events contribute to the total system risk. Conversion of questionnaire MPC&A system performance data to numeric values is a separate function from the process of completing the questionnaire. When specific questions in the questionnaire are answered, the focus is on correctly assessing and reporting, in an adjectival manner, the actual performance of the related MC&A function. Prior to conversion, consideration should not be given to the numeric value that will be assigned during the conversion process. In the conversion process, adjectival responses to questions on system performance are quantified based on a log normal scale typically used in human error analysis (see A.D. Swain and H.E. Guttmann
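The conversion step described in this abstract, from adjectival ratings to failure probabilities that feed a fault tree, can be pictured with a small sketch. The probability values in the mapping below are purely hypothetical placeholders (the abstract does not give the scale it uses), the gate functions assume statistically independent basic events, and the tree layout is invented for illustration.

```python
import math
from typing import Iterable

# Hypothetical placeholder values only; NOT the scale used in the report.
# A task that is not performed is treated as being in a state of failure.
HYPOTHETICAL_FAILURE_PROB = {
    "well": 0.001,
    "adequate": 0.01,
    "needs improvement": 0.1,
    "not performed": 1.0,
}


def or_gate(probs: Iterable[float]) -> float:
    """Output fails if ANY input event fails (independent events assumed)."""
    return 1.0 - math.prod(1.0 - p for p in probs)


def and_gate(probs: Iterable[float]) -> float:
    """Output fails only if ALL input events fail (independent events assumed)."""
    return math.prod(probs)


if __name__ == "__main__":
    # Invented two-level tree: the system fails if either of two redundant
    # checks-pairs fails completely, i.e. OR over two AND gates.
    ratings_a = ["adequate", "needs improvement"]
    ratings_b = ["well", "not performed"]
    branch_a = and_gate(HYPOTHETICAL_FAILURE_PROB[r] for r in ratings_a)
    branch_b = and_gate(HYPOTHETICAL_FAILURE_PROB[r] for r in ratings_b)
    print(or_gate([branch_a, branch_b]))
```

Keeping the rating-to-probability table separate from the gate logic mirrors the point made in the abstract that answering the questionnaire and converting the answers to numbers are distinct steps.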
Spreng, R. Nathan; McKinnon, Margaret C.; Mar, Raymond A.; Levine, Brian
In order to formulate a parsimonious tool to assess empathy, we used factor analysis on a combination of self-report measures to examine consensus and developed a brief self-report measure of this common factor. The Toronto Empathy Questionnaire (TEQ) represents empathy as a primarily emotional process. In three studies, the TEQ demonstrated strong convergent validity, correlating positively with behavioral measures of social decoding, self-report measures of empathy, and negatively with a measure of Autism symptomatology. Moreover, it exhibited good internal consistency and high test-retest reliability. The TEQ is a brief, reliable, and valid instrument for the assessment of empathy. PMID:19085285
Wang, Wen-Chung; Shih, Ching-Lin
Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…
... 48 Federal Acquisition Regulations System 5 2014-10-01 2014-10-01 false Alternate item(s). 852.214-72 Section 852.214-72 Federal Acquisition Regulations System DEPARTMENT OF VETERANS AFFAIRS CLAUSES AND FORMS SOLICITATION PROVISIONS AND CONTRACT CLAUSES Texts of Provisions and Clauses...
Wyse, Adam E.; Mapuranga, Raymond
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Zhang, Bo; Stone, Clement A.
This research examines the utility of the S-X² statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…
Bonar, Erin E.; Hoffmann, Erica; Rosenberg, Harold; Kryszak, Elizabeth; Young, Kathleen M.; Ashrafioun, Lisham; Kraus, Shane W.; Bannon, Erin E.
Objective To evaluate the psychometric properties of a new self-report questionnaire designed to assess college students’ intentions to employ 31 specific alcohol-reduction strategies. Method Students attending a large public university were recruited to complete alcohol-reduction, drinking history, and personality questionnaires online. Results Based on item-total correlations and principal components analysis, we eliminated three items and calculated average intention ratings across the remaining 28 items. The resulting scale had appropriate unidimensionality and excellent internal consistency. Correlations of intention questionnaire scores with measures of drinking history, alcohol outcome expectancies, sensation seeking, and impression management provided some support for criterion and discriminant validity of the questionnaire. Conclusion This questionnaire could be employed as an outcome measure to evaluate prevention programs and as a clinical tool to identify clients who have little intention to employ drinking reduction strategies in heavy drinking situations. PMID:22686362
Fox, J.-P.; Wyrick, Cheryl
The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…
Education Commission of the States, Denver, CO. National Assessment of Educational Progress.
The career and occupational development items contained in this document are part of a kit consisting of four documents which bring together different types of items that measure a number of career and occupational development (COD) objectives developed by the National Assessment of Educational Progress (NAEP). (NAEP--which completed a national…
Briggs, Derek C.; Wilson, Mark
An approach called generalizability in item response modeling (GIRM) is introduced in this article. The GIRM approach essentially incorporates the sampling model of generalizability theory (GT) into the scaling model of item response theory (IRT) by making distributional assumptions about the relevant measurement facets. By specifying a random…
Four studies are reported on the derivation and assessment of a hypermasculinity scale. In Study 1, a questionnaire measure of hypermasculine values was derived from an initial 122 items, rated on a seven-point scale by 600 men from eight categories, based on occupation or sport interest. Factor analysis and item reduction produced 26- and 16-item scales (Hypermasculine Values Questionnaire, HVQ, and Short Hypermasculine Values Questionnaire) with high internal consistencies. There were substantial differences between categories, consistent with predictions based on their gender-stereotypic connotations. In Study 2, the scales were administered to another similarly composed sample: again, high internal consistency and unidimensionality (in a confirmatory factor analysis) were found, along with a similar association with category membership. Test-retest reliability was high. In Study 3, the concurrent and discriminative validity of the HVQ was studied by comparing it with an existing measure of hypermasculinity, male role norms, attitudes to women's rights, gender-related traits, and trait aggression. Associations were found with other gender scales, and there was a moderate association with trait physical aggression. The range of associations reflected the items on the scale, which involve toughness, the need to avoid femininity, and control of women's sexuality, themes familiar from ethnographic accounts of masculinity. Study 4 showed that the HVQ was associated with hostile but not benevolent sexism, and replicated its association with trait aggression.
Takahashi, Megumi; Tanaka, Katsutoshi; Miyaoka, Hitoshi
A new and easy evaluation method of communication skills has been developed using the Communication Skills Questionnaire (CSQ), which can be self-administered or administered by family members and medical staff. The reliability and validity of this CSQ were evaluated. Eighty-seven patients with mental disorders and 100 normal controls participated in a self-rating evaluation of the CSQ, and 55 family members and four medical personnel also participated in objective rating. The CSQ contained 29 items and these items were divided into three categories: cooperative skills (17 items), assertive skills (six items) and general communication skills (six items, mainly non-verbal skills). Internal consistencies of all groups were between 0.91 and 0.97. Test-retest reliability values for patients, family members and medical staff were between 0.90 and 0.95. Interrater reliability of medical staff was 0.73. The total scores had a moderate positive correlation with Global Assessment of Functioning (GAF) score and doctor's impression of communication skill evaluated on a 10-point scale. The patient group had a lower CSQ score than that of controls and the score differences between controls and patients with schizophrenia, mood disorders or eating disorders were statistically significant. This questionnaire is a good psychometric method of evaluating the communication skills of patients.
Bartolucci, F.; Montanari, G. E.; Pandolfi, S.
With reference to a questionnaire aimed at assessing the performance of Italian nursing homes on the basis of the health conditions of their patients, we investigate two relevant issues: dimensionality of the latent structure and discriminating power of the items composing the questionnaire. The approach is based on a multidimensional item…
Wagner, Julie; Lacey, Kimberly; Chyun, Deborah; Abbott, Gina
This paper describes a paper-and-pencil questionnaire that measures heart disease risk knowledge in people with diabetes. The Heart Disease Fact Questionnaire (HDFQ) is a 25-item questionnaire that was developed to tap into respondents' knowledge of major risk factors for the development of CHD. Approximately half of these items specifically address diabetes-related CHD risk factors. Based on extensive pilot data, the current study analyzed responses from 524 people with diabetes to assess the psychometric properties. The HDFQ is readable to an average 13-year-old and imposes little burden. It shows good content and face validity. It demonstrates adequate internal consistency, with a Kuder-Richardson Formula 20 (KR-20) coefficient of 0.77 and good item-total correlations. Item analysis showed a desirable range in P-values. In discriminant function analyses, HDFQ scores differentiated respondents by knowledge of their own cardiovascular health, use of lipid-lowering medications, health insurance status, and educational attainment, thus indicating good criterion-related validity. This measure of heart disease risk knowledge is brief, understandable to respondents, and easy to administer and score. Its potential for use in research and practice is discussed. Future research should establish norms as well as investigate its test-retest reliability and predictive validity.
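For readers unfamiliar with the Kuder-Richardson Formula 20 cited in this abstract, the following Python sketch computes it for dichotomously scored (0/1) items as KR-20 = k/(k-1) × (1 − Σ p_j(1−p_j) / σ²_total). The function name and the toy data are hypothetical, and the use of the population variance for total scores (to match the p(1−p) item variances) is a convention chosen here, not something the abstract specifies.

```python
from statistics import pvariance


def kr20(responses):
    """Kuder-Richardson Formula 20 for 0/1 item scores.

    responses: list of respondents, each a list of k item scores (0 or 1).
    Raises ZeroDivisionError if every respondent has the same total score.
    """
    n = len(responses)
    k = len(responses[0])

    # Variance of respondents' total scores (population form).
    totals = [sum(row) for row in responses]
    var_total = pvariance(totals)

    # Sum of item variances p_j * (1 - p_j), where p_j is the proportion
    # of respondents answering item j correctly.
    pq_sum = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n
        pq_sum += p * (1.0 - p)

    return (k / (k - 1)) * (1.0 - pq_sum / var_total)


if __name__ == "__main__":
    # Toy data: 4 respondents, 3 items (hypothetical).
    toy = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
    print(kr20(toy))
```

KR-20 is the special case of Cronbach's alpha for binary items, so the same structure applies to the alpha coefficients reported in other abstracts in this list once the item variances are computed from graded scores.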
This paper reviews and discusses some critical issues related to the use of questionnaire surveys in educational planning. Ten brief sections discuss survey objectives, coverage, questionnaire design, administration, validity, nonresponse, cost considerations, coding, statistical analysis, and interpretation. Five illustrative questionnaire…
Use of the Diet History Questionnaire and Diet*Calc Analysis Software for publication purposes should contain a citation which includes version information for the software, questionnaire, and nutrient database.
ARP staff adapted the Diet History Questionnaire (DHQ) for use by Canadian populations in collaboration with the Alberta Cancer Board. This questionnaire takes into account the different food fortification policies of the U.S. and Canada.
Dang, Jeff; Cole, Jason C.; Burgess, Somali M.; Yang, Min; Daniels, Selena R.; Walt, John G.
Background Patient-reported outcome (PRO) measures have been used to assess treatment benefit in a variety of therapeutic areas and are now becoming increasingly important in aesthetic research. Objectives The objective of the current study was to develop and validate a new PRO measure (Eyelash Satisfaction Questionnaire [ESQ]) to assess satisfaction with eyelash prominence. Methods The content of the questionnaire (including conceptual framework and questionnaire items) was generated by review of literature, participant interviews, and expert opinion. Cognitive interviews were conducted to pilot test the questionnaire. Psychometric properties of the questionnaire were examined in a combined sample of participants (n = 970) completing Internet- (n = 909) and paper-based (n = 61) versions. Item- and domain-level properties were examined using modern and classical psychometrics. Results Content-based analysis of qualitative data demonstrated the presence of 3 distinct domains (Length, Fullness, Overall Satisfaction; Confidence, Attractiveness, and Professionalism; and Daily Routine). Initial confirmatory factor analysis (CFA) results of 23 items revealed insufficient model-data fit (comparative fit index [CFI] of 0.86 and a non-normed fit index [NNFI] of 0.82). A revised model using 9 items (3 per domain) achieved appropriate fit (CFI of 0.99 and NNFI of 0.97). Analyses revealed measurement equivalence across the Internet- and paper-based versions. The 3 ESQ domains had strong internal consistency reliability (Cronbach's α [range] = 0.919-0.976) and adequate convergent and discriminant validity. Conclusions The ESQ was found to be a reliable and valid PRO measure for assessing satisfaction with eyelash prominence. Level of Evidence: 3 Therapeutic PMID:26691738
Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee
Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.
The purpose of this study was to develop a questionnaire that could measure preservice mathematics teachers' mathematics educational values. Development and validation of the questionnaire involved a sequential inquiry in which design principles were established from the existing literature and a pool of items was constructed then submitted to…
Martinez-Fernandez, J. Reinaldo; Corcelles, Mariona; Cerrato-Lara, Maria
In this study, we present the conceptions about teamwork questionnaire designed to evaluate the conceptions that secondary students have about teamwork. Participants were 309 students aged 15-16 from eight secondary schools, seven from Barcelona and one from Girona (Spain). The original 27-item questionnaire was reduced according to expert…
Coelho, Vítor Alexandre; Sousa, Vanda; Marchante, Marta; Brás, Patrícia; Romão, Ana Maria
This study aims to validate the Bullying and Cyberbullying Behaviors Questionnaire, to examine the prevalence of bullying and victimization behaviors in Portuguese middle school students, and to analyse the differences in victimization and bullying between genders and across school grades. The questionnaire is composed of 36 items, allowing for…
Kuntsche, Emmanuel; Kuntsche, Sandra
A short form of the Drinking Motive Questionnaire Revised (DMQ-R; Cooper, 1994) was developed, using different item selection strategies based on a national representative sample of 5,617 12- to 18-year-old students in Switzerland. To confirm the concurrent validity of the short-form questionnaire, or DMQ-R SF, data from a second national sample…
Reynolds, Jesse S.; Treu, Judith A.; Njike, Valentine; Walker, Jennifer; Smith, Erica; Katz, Catherine S.; Katz, David L.
Objective: To determine the reliability and validity of a 10-item questionnaire, the Food Label Literacy for Applied Nutrition Knowledge questionnaire. Methods: Participants were elementary school children exposed to a 90-minute school-based nutrition program. Reliability was assessed via Cronbach alpha and intraclass correlation coefficient…
Moore, J. L.
The development, testing, and characteristics of an instrument--Computers and Robots Attitude Questionnaire--that can be used to measure the attitudes of secondary students towards computers and robots are described. Individual questionnaire items are largely content-free and may be answered by students with no specialist knowledge of…
Kukaswadia, Atif; Janssen, Ian; Pickett, William; Bajwa, Jasmine; Georgiades, Katholiki; Lalonde, Richard N.; Quon, Elizabeth C.; Safdar, Saba; Pike, Ian
Objectives Acculturation is a multidimensional process involving changes in behaviour and beliefs. Questionnaires developed to measure acculturation are typically designed for specific ethnic populations and adult experiences. This study developed a questionnaire that measures acculturation among ethnically diverse populations of youth that can be included as a module in population surveys. Methods Questionnaires measuring acculturation in youth were identified in the literature. The importance of items from the existing questionnaires was determined using a Delphi process and this informed the development of our questionnaire. The questionnaire was then pilot tested using a sample of 248 Canadians aged 18–25 via an online system. Participants identified as East and South East Asian (27.8%), South Asian (17.7%) and Black (13.7%). The majority were 1st (33.5%) or 2nd generation immigrants (52.0%). After redundant items were eliminated, exploratory factor analysis grouped items into domains, and, for each domain, internal consistency, and convergent validity with immigrant generation then age at immigration estimated. A subset of participants re-completed the questionnaire for reliability estimation. Results The literature review yielded 117 articles that used 13 questionnaires with a total of 440 questions. The Delphi process reduced these to 32 questions. Pilot testing occurred in 248 Canadians aged 18–25. Following item reduction, 16 questions in three domains remained: dominant culture, heritage language, and heritage culture. All had good internal consistency (Cronbach’s alphas > .75). The mean dominant domain score increased with immigrant generation (1st generation: 3.69 (95% CI: 3.49–3.89), 2nd: 4.13 (4.00–4.26), 3rd: 4.40 (4.19–4.61)), and mean heritage language score was higher among those who immigrated after age 12 than before (p = .0001), indicative of convergent validity. Conclusions This Bicultural Youth Acculturation Questionnaire has
Ferneau, E.; Mueller, S.
The drug-abuse questionnaire used to survey college student attitudes on the subject is provided. It is identical to the alcoholism questionnaire except for word changes appropriate to the subject matter. The questionnaire consists of 40 statements about drug abuse and drug abusers, with 7 possible responses: (1) completely disagree; (2) mostly…
Lambe, Laura; Mackinnon, Sean P; Stewart, Sherry H
People engage in gambling behaviour for a variety of different reasons, some of which are riskier than others in terms of associations with heavy and problem gambling. Stewart and Zack (Addiction 103:1110-1117, 2008) developed a measure called the Gambling Motives Questionnaire (GMQ) that assesses levels of three distinct gambling motives: enhancement (to increase positive emotions), coping (to decrease negative emotions), and social (to increase affiliation). While this measure has been validated in a community-recruited sample of middle-aged gamblers, the GMQ has yet to be validated in emerging adulthood (ages 18-25 years)—a developmental period associated with increased risk for heavy and problematic gambling. The current project tested the psychometric properties of the GMQ in a community sample of emerging adult gamblers using archival data from the Manitoba Longitudinal Study of Young Adults. Participants (N = 487; 73.9% Caucasian; 52.6% female; mean age 22.23 years) completed the GMQ and questionnaire measures of gambling behaviour and problems. Exploratory factor analysis revealed that a three-factor model adequately fit the data; however, problematic items were identified. A modified 9-item version of the GMQ with the problem items removed fit the data well. Both the original 15-item and the 9-item versions had acceptable subscale alpha reliabilities (αs >.78). While all three subscales (from both the 9-item and 15-item versions) were positively correlated with problem gambling, only enhancement motives emerged as a significant independent predictor when the other motives and gambling behaviours were entered as simultaneous predictors. These results suggest the GMQ is a valid measure for tapping motives in emerging adults, and that high enhancement motives are particularly predictive of gambling problems in this developmental period. Future intervention efforts might specifically target enhancement motives in emerging adults.
Serra, Francesca; Spoto, Andrea; Ghisi, Marta; Vidotto, Giulio
Psychological Assessment can be defined as a complex procedure of information collection, analysis and processing. Formal Psychological Assessment (FPA) tries to improve this procedure by providing a formal framework to build assessment tools. In this paper, FPA is applied to depression. Seven questionnaires widely used for the self-evaluation of depression were selected. Diagnostic criteria for major depressive disorder were derived from the DSM-5, literature and Seligman's and Beck's theories. A Boolean matrix was built, including 266 items from the questionnaires in the rows and 20 selected attributes, obtained through diagnostic criteria decomposition, in the columns. In the matrix, a 1 in a cell meant that the corresponding item investigated the specific attribute. It was thus possible to analyze the relationships between items and attributes and among items. While none of the considered questionnaires could alone cover all the criteria for the evaluation of depressive symptoms, we observed that a set of 30 items contained the same information that was obtained redundantly with 266 items. Another result highlighted by the matrix regards the relations among items. FPA allows in-depth analysis of currently used questionnaires based on the presence/absence of clinical elements. FPA allows for going beyond the mere score by differentiating the patients according to symptomatology. Furthermore, it allows for computerized-adaptive assessment.
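The Boolean item-by-attribute matrix described in this abstract can be explored with a few lines of Python. The sketch below is only one simple reading of "redundancy", namely that items sharing an identical attribute row carry the same information; the actual FPA procedure may differ. The function names and the toy matrix are hypothetical.

```python
def group_by_attribute_pattern(matrix):
    """Group item indices that share an identical row of 0/1 attribute flags."""
    groups = {}
    for item_index, row in enumerate(matrix):
        groups.setdefault(tuple(row), []).append(item_index)
    return groups


def representative_items(matrix):
    """Pick the first item of each distinct attribute pattern."""
    groups = group_by_attribute_pattern(matrix)
    return sorted(items[0] for items in groups.values())


def uncovered_attributes(matrix, n_attributes):
    """Return attribute (column) indices that no item investigates."""
    covered = set()
    for row in matrix:
        for j, flag in enumerate(row):
            if flag:
                covered.add(j)
    return sorted(set(range(n_attributes)) - covered)


if __name__ == "__main__":
    # Toy matrix: rows are items, columns are attributes (hypothetical).
    toy = [
        [1, 0, 0],
        [1, 0, 0],  # same pattern as item 0, so redundant under this reading
        [0, 1, 0],
    ]
    print(representative_items(toy))
    print(uncovered_attributes(toy, 3))
```

The second helper corresponds to the observation that no single questionnaire covered all criteria: any attribute index it returns is a criterion the item set does not address.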
Albano, Anthony D.
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
This digest discusses the advantages and disadvantages of using item banks, and it provides useful information for those who are considering implementing an item banking project in their school districts. The primary advantage of item banking is in test development. Using an item response theory method, such as the Rasch model, items from multiple…
McMorris, Robert F.; And Others
Two matched forms of a 50-item grammar test were developed. Twenty items designed to be humorous were included in one form. Inclusion of humorous items did not affect grammar scores on matched humorous/nonhumorous items, nor on common post-treatment items. Inclusion did not affect results of anxiety measures. (Author/DWH)
Glas, Cees A. W.; van der Linden, Wim J.
Developed a multilevel item response (IRT) model that allows for differences between the distributions of item parameters of families of item clones. Results from simulation studies based on an item pool from the Law School Admission Test illustrate the accuracy of the item pool calibration and adaptive testing procedures based on the model. (SLD)
... availability of such items and the economic and technological feasibility of using such items, including life cycle costs. USDA will gather information on individual products within an item and extrapolate that product information to the item level for consideration in designating items. In considering these...
Reid, J. Christopher
The author suggests that a computer program be designed which not only computes item statistics, but also provides a running commentary in English to assist teachers in their analysis of test results. (PR)
Hudon, Catherine; Lambert, Mireille; Almirall, José
Objective To evaluate the reliability and validity of the newly developed Physician Enabling Skills Questionnaire (PESQ) by assessing its internal consistency, test-retest reliability, concurrent validity with patient-centred care, and predictive validity with patient activation and patient enablement. Design Validation study. Setting Saguenay, Que. Participants One hundred patients with at least 1 chronic disease who presented in a waiting room of a regional health centre family medicine unit. Main outcome measures Family physicians’ enabling skills, measured with the PESQ at 2 points in time (ie, while in the waiting room at the family medicine unit and 2 weeks later through a mail survey); patient-centred care, assessed with the Patient Perception of Patient-Centredness instrument; patient activation, assessed with the Patient Activation Measure; and patient enablement, assessed with the Patient Enablement Instrument. Results The internal consistency of the 6 subscales of the PESQ was adequate (Cronbach α = .69 to .92). The test-retest reliability was very good (r = 0.90; 95% CI 0.84 to 0.93). Concurrent validity with the Patient Perception of Patient-Centredness instrument was good (r = −0.67; 95% CI −0.78 to −0.53; P < .001). The PESQ accounts for 11% of the total variance with the Patient Activation Measure (r² = 0.11; P = .002) and 19% of the variance with the Patient Enablement Instrument (r² = 0.19; P < .001). Conclusion The newly developed PESQ presents good psychometric properties, allowing for its use in practice and research. PMID:26889507
Beshentsev, B. D.; Vityuk, N. P.; Volkov, A. V.; Yevdokimov, A. I.; Novikov, M. N.; Piskunov, Y. G.; Pobortsev, E. P.; Sadovnichaya, L. M.
The invention refers to the fabrication of ceramic items by the molding method. It can be used to produce items of complicated configuration, in particular composition of binding agent for electroceramic items.
Foulkes, Lucy; Viding, Essi; McCrory, Eamon; Neumann, Craig S.
Human beings seek out social interactions as a source of reward. To date, there have been limited attempts to identify different forms of social reward, and little is known about how the value of social rewards might vary between individuals. This study aimed to address both these issues by developing the Social Reward Questionnaire (SRQ), a measure of individual differences in the value of different social rewards. Exploratory factor analysis (EFA) was run on an initial set of 75 items (N = 305). Based on this analysis, confirmatory factor analysis (CFA) was then conducted on a second sample (N = 505) with a refined 23-item scale. This analysis was used to test a six-factor structure, which resulted in good model fit (CFI = 0.96, RMSEA = 0.07). The factors represent six subscales of social reward defined as follows: Admiration; Negative Social Potency; Passivity; Prosocial Interactions; Sexual Reward; and Sociability. All subscales demonstrated good test-retest reliability and internal consistency. Each subscale also showed a distinct pattern of associations with external correlates measuring personality traits, attitudes, and goals, thus demonstrating construct validity. Taken together, the findings suggest that the SRQ is a reliable, valid measure that can be used to assess individual differences in the value experienced from different social rewards. PMID:24653711
Van Dam, Nicholas T.; Brown, Anna; Mole, Tom B.; Davis, Jake H.; Britton, Willoughby B.; Brewer, Judson A.
At a fundamental level, taxonomy of behavior and behavioral tendencies can be described in terms of approach, avoid, or equivocate (i.e., neither approach nor avoid). While there are numerous theories of personality, temperament, and character, few seem to take advantage of parsimonious taxonomy. The present study sought to implement this taxonomy by creating a questionnaire based on a categorization of behavioral temperaments/tendencies first identified in Buddhist accounts over fifteen hundred years ago. Items were developed using historical and contemporary texts of the behavioral temperaments, described as “Greedy/Faithful”, “Aversive/Discerning”, and “Deluded/Speculative”. To both maintain this categorical typology and benefit from the advantageous properties of forced-choice response format (e.g., reduction of response biases), binary pairwise preferences for items were modeled using Latent Class Analysis (LCA). One sample (n1 = 394) was used to estimate the item parameters, and the second sample (n2 = 504) was used to classify the participants using the established parameters and cross-validate the classification against multiple other measures. The cross-validated measure exhibited good nomothetic span (construct-consistent relationships with related measures) that seemed to corroborate the ideas present in the original Buddhist source documents. The final 13-block questionnaire created from the best performing items (the Behavioral Tendencies Questionnaire or BTQ) is a psychometrically valid questionnaire that is historically consistent, based in behavioral tendencies, and promises practical and clinical utility particularly in settings that teach and study meditation practices such as Mindfulness Based Stress Reduction (MBSR). PMID:26535904
Motevalian, Seyed Abbas; Asadi-Lari, Mohsen; Rahimi, Habibollah; Eftekhar, Mehrdad
In Iran, road traffic injuries are the leading cause of disease burden, and motorcyclists are the most vulnerable road users. Elliot and colleagues developed the “Motorcycle Rider Behavior Questionnaire” (MRBQ) in 2007 on the basis of Reason’s “Driver Behavior Questionnaire” (DBQ). The purpose of this study was to assess the validity and reliability of a Persian version of the MRBQ. The 43-item MRBQ was adapted to Persian using the translation and back-translation method. The questionnaire was significantly revised after assessment of content validity. In the revised version, 10 items of the original MRBQ were deleted and 15 new items were added. The revised MRBQ was used in a survey of 518 motorcyclists. To assess the construct validity of the MRBQ, we administered the Buss-Perry Aggression Questionnaire concurrently to all of the subjects. After three weeks, we carried out the retest study on 119 of the 518 subjects. The mean age of the subjects was 32.5 years (SD=8.8). All of the participants were male, with a mean of 9.3 years of motorcycle riding experience (SD=7.3). Principal Components Analysis (PCA) showed six subscales: “Speed Violations”, “Traffic Errors”, “Safety Violations”, “Traffic Violations”, “Stunts” and “Control Errors”, which together accounted for 36.44% of the total variance. For each of these subscales, Cronbach’s alpha ranged from 0.79 to 0.91. Intraclass correlation coefficients for the six subscales and the total questionnaire ranged from 0.73 to 0.91. There were significant correlations between the MRBQ subscales and the subscales of the Buss-Perry Aggression Questionnaire. The results indicated that the 48-item Persian version of the MRBQ is a suitable measure for studying motorcyclists’ behavior. PMID:22105387
Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa
Interventions to counter the influence of television food advertising amongst children are important. Thus, a reliable and valid instrument to assess their effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice to 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. The test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by the kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). The kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42), with the scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin measure was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid for assessing the effect of media literacy education on television food advertising on schoolchildren.
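The kappa coefficient mentioned in this abstract corrects raw agreement for agreement expected by chance. The following Python sketch computes Cohen's kappa for one item answered on two occasions; treating the study's "kappa coefficient" as Cohen's unweighted kappa is an assumption, and the function name and toy answers are hypothetical.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Cohen's unweighted kappa for two equal-length lists of categorical answers.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of
    agreement and p_e is the agreement expected from the marginal frequencies.
    Raises ZeroDivisionError when p_e equals 1 (everyone gives the same single
    answer on both occasions).
    """
    if len(first) != len(second):
        raise ValueError("both administrations must have the same length")
    n = len(first)

    # Observed agreement: share of respondents giving the same answer twice.
    p_o = sum(a == b for a, b in zip(first, second)) / n

    # Chance agreement from the marginal distribution of each administration.
    counts_first = Counter(first)
    counts_second = Counter(second)
    categories = set(counts_first) | set(counts_second)
    p_e = sum((counts_first[c] / n) * (counts_second[c] / n) for c in categories)

    return (p_o - p_e) / (1.0 - p_e)


if __name__ == "__main__":
    # Toy answers from four children at time 1 and time 2 (hypothetical).
    print(cohens_kappa([1, 1, 0, 0], [1, 0, 0, 0]))
```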
Bariki, Hamda; Hashmi, Mariam; Baggili, Ibrahim
Due to the lack of standards in reporting digital evidence items, investigators are facing difficulties in efficiently presenting their findings. This paper proposes a standard for digital evidence to be used in reports that are generated using computer forensic software tools. The authors focused on developing a standard for digital evidence items by surveying various digital forensic tools while keeping in mind the legal integrity of digital evidence items. Additionally, an online questionnaire was used to gain the opinion of knowledgeable and experienced stakeholders in the digital forensics domain. Based on the findings, the authors propose a standard for digital evidence items that includes data about the case, the evidence source, the evidence item, and the chain of custody. The research results enabled the authors to create a defined XML schema for digital evidence items.
Gonzalez-Ramirez, Leivy Patricia; De la Roca-Chiapas, Jose Maria; Colunga-Rodriguez, Cecilia; Preciado-Serrano, Maria de Lourdes; Daneri-Navarro, Adrian; Pedroza-Cabrera, Francisco Javier; Martinez-Arriaga, Reyna Jazmin
Background The transtheoretical model (TTM) has been widely used to promote healthy behaviors in different groups. However, a questionnaire has not yet been developed to evaluate the health behaviors that medical practitioners often consider in individuals with cancer or at a high risk of developing cancer. Purpose The aim of this study was to construct and validate the Health Behavior and Stages of Change Questionnaire (HBSCQ), which is based on the TTM and health recommendations related to risk and factors that protect against cancer. Methods Content validity was assessed in two phases (qualitative and quantitative). Item difficulty index, item discrimination index, and discrimination coefficient were obtained based on classical test theory. Finally, Cronbach’s alpha was used. Results The measure of concordance showed scores considered adequate and excellent. The item discrimination index obtained a rating of “excellent” and suggested the preservation of all items. The discrimination coefficient scores were >0.74. The global internal consistency of the HBSCQ was 0.384. By group, the internal consistency of the HBSCQ was 0.712 for the sample of men and 0.378 for the sample of women. Conclusion/implications for practice The HBSCQ represents a proposal for a fast, simple, and innovative screening test, which aims to identify persons who may benefit from interventions to promote health behaviors delimited to the stage of change. PMID:28356769
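The classical test theory indices named in the Methods above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: items are assumed to be scored 0/1, and the discrimination index is computed with the common convention of comparing the top and bottom 27% of respondents ranked by total score. The function names and data are hypothetical.

```python
def item_difficulty(responses):
    """Proportion of respondents scoring 1 on each item."""
    n = len(responses)
    k = len(responses[0])
    return [sum(row[j] for row in responses) / n for j in range(k)]


def item_discrimination(responses, fraction=0.27):
    """Upper-group minus lower-group proportion scoring 1 on each item."""
    n = len(responses)
    g = max(1, round(fraction * n))  # size of each extreme group
    # Rank respondents by total score, highest first.
    ranked = sorted(responses, key=sum, reverse=True)
    upper, lower = ranked[:g], ranked[-g:]
    k = len(responses[0])
    return [
        sum(r[j] for r in upper) / g - sum(r[j] for r in lower) / g
        for j in range(k)
    ]


# Hypothetical 0/1 scores: 4 respondents x 3 items.
data = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
difficulty = item_difficulty(data)
discrimination = item_discrimination(data)
```

With only four respondents each extreme group contains one person, so the toy discrimination values are all 1.0; real samples give values spread between -1 and 1.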
Weber, Margaret B.; Argo, Jana K.
This study determined whether item forms (rules for constructing items related to a domain or set of tasks) would enable naive item writers to generate multiple-choice items at three taxonomic levels--knowledge, comprehension, and application. Students wrote 120 multiple-choice items from 20 item forms, corresponding to educational objectives…
The short scale of the Eysenck Personality Questionnaire-Revised (EPQR-S; H. J. Eysenck & S. B. G. Eysenck, 1992) is a 48-item personality questionnaire primarily designed to measure an individual's level of extraversion (vs. introversion) and neuroticism. Although L. J. Francis, L. B. Brown, and R. Philipchalk (1992) created the Eysenck Personality Questionnaire Revised-Abbreviated (EPQR-A), an even briefer version of the EPQR-S, the reliability coefficients of some of the measures have been less than satisfactory (S. Forrest, C. A. Lewis, & M. Shevlin, 2000). Because brevity and reliability are both extremely important, the author of the present study created a briefer version of the EPQR-S, more reliable than the EPQR-A, by making slight alterations in the item content as well as the response format of the EPQR-S. Two hundred and sixty eight participants completed the original EPQR-S and the 24-item newly revised briefer version of the EPQR-S (EPQ-BV) twice. The findings revealed that the EPQ-BV has good internal consistency, test-retest reliability, and concurrent validity. A principal component analysis revealed a solution with factor loadings that accurately reflected the primary measures of the EPQR-S. These findings are discussed in relation to the psychometric properties of the EPQR-A and the original version of the EPQR-S.
Rodrigues, George; Bauman, Glenn; Lock, Michael; D'Souza, David; Mahon, Jeff
Background To construct a short prostate cancer radiation late toxicity (PCRT) questionnaire with health-related quality-of-life (HRQoL) domains. Methods The PCRT was developed by item generation, questionnaire construction (n = 7 experts, n = 8 focus group patients), pilot testing (n = 37), item reduction (n = 100), reliability testing (n = 237), and validity testing (n = 274). Results Reliability of the three item-reduced subscales demonstrated intraclass correlation coefficients (CC) of 0.811 (GU), 0.842 (GI), and 0.740 (sexual). Discriminant validity demonstrated Pearson CC of 0.449 (GU-GI), 0.200 (sexual-GU), and 0.09 (sexual-GI). Content validity correlations between PCRT-PCQoL were 0.35–0.78, PCRT-FACT-G© were 0.19–0.39, and PCRT-SF-36® were 0.03–0.34. Conclusion We successfully generated a PCRT HRQoL questionnaire including subscales with very good psychometric properties. PMID:17540022
Trujillo, Anna C.
With the use of computers, paper questionnaires are being replaced by electronic questionnaires. The formats of traditional paper questionnaires have been found to affect a subject's rating. Consequently, the transition from paper to electronic format can subtly change results. The research presented begins to determine how electronic questionnaire formats change subjective ratings. For formats where subjects used a flow chart to arrive at their rating, starting at the worst and middle ratings of the flow charts was the most accurate, but subjects took slightly more time to arrive at their answers. Except for the electronic paper format, starting at the worst rating was the most preferred. The paper and electronic paper versions had the worst accuracy. Therefore, for flowchart-type questionnaires, flowcharts should start at the worst rating and work their way up to better ratings.
Background Proximity of food stores is associated with dietary intake and obesity; however, individuals frequently shop at stores that are not the most proximal. Little is known about other factors that influence food store choice. The current research describes the development of the Food Store Selection Questionnaire (FSSQ) and describes preliminary results of field testing the questionnaire. Methods Development of the FSSQ involved a multidisciplinary literature review, qualitative analysis of focus group transcripts, and expert and community reviews. Field testing consisted of 100 primary household food shoppers (93% female, 64% African American), in rural and urban Arkansas communities, rating FSSQ items as to their importance in store choice and indicating their top two reasons. After eliminating 14 items due to low mean importance scores and high correlations with other items, the final FSSQ questionnaire consists of 49 items. Results Items rated highest in importance were: meat freshness; store maintenance; store cleanliness; meat varieties; and store safety. Items most commonly rated as top reasons were: low prices; proximity to home; fruit/vegetable freshness; fruit/vegetable variety; and store cleanliness. Conclusions The FSSQ is a comprehensive questionnaire for detailing key reasons in food store choice. Although proximity to home was a consideration for participants, there were clearly other key factors in their choice of a food store. Understanding the relative importance of these different dimensions driving food store choice in specific communities may be beneficial in informing policies and programs designed to support healthy dietary intake and obesity prevention. PMID:23773428
Eliot, John; Czarnolewski, Mark Y
The authors developed a 12-category, 116-item critical incident questionnaire of spatial behavior. The authors administered the Everyday Spatial Behavioral Questionnaire (ESBQ) to volunteer undergraduates (114 women, and 31 men) and tests of spatial ability to establish both the reliability and construct validity of the instrument. The authors found that Cronbach's alpha across the subscale scores was .92, and that 8 of the 12 subscales had alphas of .70 or greater. The authors found validity of the ESBQ through canonical correlation analysis. Specifically, spatial tests, gender, and age variables, jointly with the ESBQ subscales, identified 2 apparent continua of spatial skills. The authors labeled the first continuum movement through space (from moving a vehicle at one end of the continuum, to moving one's own body through space at the other end of the continuum). The authors labeled the second identified continuum drawing/perceiving perspective/path finding, and it appeared to represent a continuum of 3-dimensional visualization or redirection. Another suggested label was dimensional discernment. Thus, the ESBQ is a first step toward identifying new ways to think about and quantify people's spatial experience.
Grogan, S; Conner, M; Willits, D; Norman, P
BACKGROUND. It is now a requirement that patients' satisfaction with the services obtained from their general practitioner should be surveyed. AIM. The aim of the study was to produce a reliable and valid multidimensional patient satisfaction questionnaire that could be used in general practice. METHOD. Items were originally derived from patients' responses to open-ended questions. The resulting 148-item Likert-scale questionnaire was completed by 1193 patients. General satisfaction items were removed from the set, and responses to remaining items underwent factor analysis. Subscales were produced from items representing each factor. Reliability and validity of each subscale were examined. RESULTS. Five subscales with a total of 40 items resulted from the factor analysis: doctors, access, nurses, appointments and facilities. Each subscale was internally reliable (Cronbach's alpha coefficient between 0.73 and 0.95), and initial tests of validity suggested that all subscales were valid. CONCLUSION. The study has resulted in a 40-item scale that has been found to be reliable and valid after initial tests. Further work to test the reliability and validity of the final version of the patient satisfaction questionnaire is described. PMID:7492421
Rafiei, Morteza; Rastegari, Hosein Ali; Ghiasi, Mojdeh; Shahsanaie, Vahid
Background: Food security is a state in which all people at all times have physical and economic access to adequate food to meet nutritional needs and live a healthy and active life. Therefore, this study was performed to quantitatively evaluate household food security in Esfahan using the localized version of the US Household Food Security Survey Module (US HFSSM). Methods: This descriptive cross-sectional study was performed in 2006 on 3000 households in Esfahan. The study instrument was the 18-item US food security module, which was developed into a localized 15-item questionnaire. The study was performed in two stages of families with no children (under 18 years old) and families with children over 18 years old. Results: The results showed the item severity coefficient, the ratio of responses given by households, and the item infit and outfit coefficients in the adults' and children's questionnaires, respectively. According to the obtained data, a scale score of +3 in the adults group is described as the determination limit of slight food insecurity and +6 is stated as the limit for severe food insecurity. For the children's group, a scale score of +2 is defined as the limit of slight food insecurity and +5 is the determination limit of severe food insecurity. Conclusions: The main hypothesis of this survey analysis is based on the raw scale score of the USFSSM. The item of “lack of enough money for buying food” (item 2) and the item of “lack of balanced meal” (3rd item) have the lowest severity coefficients. Then, the ascending rate of item severity continues in the first item and the 4th item, and keeps increasing up to the 10th item. PMID:24498498
Smith, M. K.; Hypes, P. A.; Bracken, D. S.
One of the most difficult problems in NDA of nuclear materials is identifying the chemical form of the nuclear material and the surrounding matrix. Recent work analyzing the calorimeter response of sources embedded in a variety of matrices has led to a possible solution to this problem. The wide range of thermal time constants exhibited by typical matrix materials lends itself to permitting the differentiation between materials, based on time constants extracted from the measured response. Potential applications include simple item identification, item fingerprinting as part of shipper-receiver measurements, and distinguishing between Pu metal and Pu oxide as required under certain proposed attribute measurements. The results of applying this technique to a variety of items will be presented and discussed.
Holman, Rebecca; Glas, Cees AW; Lindeboom, Robert; Zwinderman, Aeilko H; de Haan, Rob J
Background Whenever questionnaires are used to collect data on constructs, such as functional status or health related quality of life, it is unlikely that all respondents will respond to all items. This paper examines ways of dealing with responses in a 'not applicable' category to items included in the AMC Linear Disability Score (ALDS) project item bank. Methods The data examined in this paper come from the responses of 392 respondents to 32 items and form part of the calibration sample for the ALDS item bank. The data are analysed using the one-parameter logistic item response theory model. The four practical strategies for dealing with this type of response are: cold deck imputation; hot deck imputation; treating the missing responses as if these items had never been offered to those individual patients; and using a model which takes account of the 'tendency to respond to items'. Results The item and respondent population parameter estimates were very similar for the strategies involving hot deck imputation; treating the missing responses as if these items had never been offered to those individual patients; and using a model which takes account of the 'tendency to respond to items'. The estimates obtained using the cold deck imputation method were substantially different. Conclusions The cold deck imputation method was not considered suitable for use in the ALDS item bank. The other three methods described can be usefully implemented in the ALDS item bank, depending on the purpose of the data analysis to be carried out. These three methods may be useful for other data sets examining similar constructs, when item response theory based methods are used. PMID:15200681
Rossi Ferrario, Silvia; Giorgi, Ines; Baiardi, Paola; Giuntoli, Laura; Balestroni, Gianluigi; Cerutti, Paola; Manera, Marina; Gabanelli, Paola; Solara, Valentina; Fornara, Roberta; Luisetti, Michela; Omarini, Pierangela; Omarini, Giovanna; Vidotto, Giulio
Purpose Interest in assessing denial is still present, despite the criticisms concerning its definition and measurement. We tried to develop a questionnaire (Illness Denial Questionnaire, IDQ) assessing patients’ and caregivers’ denial in relation to their illness/disturbance. Patients and methods After a preliminary study, a final version of 24 dichotomous items (true/false) was selected. We hypothesized a theoretical model with three dimensions: denial of negative emotions, resistance to change, and conscious avoidance, the first two composing the actual Denial and the last representing an independent component of the illness denial behavior. The IDQ was administered to 400 subjects (219 patients and 181 caregivers) together with the Anxiety–Depression Questionnaire – Reduced form (AD-R), in order to assess concurrent validity. Confirmatory factor analysis (CFA), internal consistency indices (Cronbach’s α and McDonald’s ω), and test–retest analysis were performed. Results CFA and internal consistency indices (Cronbach’s α: 0.87–0.96) indicated a clear and meaningful three-factor structure of IDQ, for both patients and caregivers. Further analyses showed good concurrent validity, with Denial and its subscale negatively associated with anxiety and depression and avoidance positively associated with anxiety and depression. The IDQ also showed a good stability (r from 0.71 to 0.87). Conclusion The IDQ demonstrated good psychometric properties. Denial of negative emotions and resistance to change seem to contribute to a real expression of denial, and conscious avoidance seems to constitute a further step in the process of cognitive–affective elaboration of the illness. PMID:28356745
Australian Council for Educational Research, Hawthorn.
The Australian Science Item Bank consists of three volumes of multiple-choice questions. Book 3 contains questions on the biological sciences. The questions are designed to be suitable for high school students (year 8 to year 12 in Australian schools). The questions are classified by the subject content of the question, the cognitive skills…
Zwick, Rebecca; Thayer, Dorothy T.
Two possible standard error formulas for the polytomous differential item functioning index proposed by N. J. Dorans and A. P. Schmitt (1991) were derived. These standard errors, and associated hypothesis-testing procedures, were evaluated through simulated data. The standard error that performed better is based on N. Mantel's (1963)…
Maß, R; Haasen, C; Krausz, M
The Frankfurt Complaint Questionnaire (FCQ) is a widely used method to investigate non-psychotic subjective experiences of schizophrenics. Less is known about its dimensional structure. Therefore, principal components analyses (PCA) were conducted with the FCQ data of 505 schizophrenics and 187 alcoholics. Furthermore, results of a former analysis using item-to-item comparisons between schizophrenics and alcoholics were examined. PCA yielded two factors called 'dysphoric concomitants of severe illness particularly impairing concentration' and 'subjective experiences of perceptual uncertainties'. Neither of the factors was specific to schizophrenia. The item comparisons suggest that only a group of eight FCQ items (subscale 'FCQ-S') is specific to schizophrenia while ten items ('FCQ-A') are related more to alcoholism. The validity of FCQ-S and FCQ-A was confirmed: schizophrenics reached high scores in FCQ-S and low scores in FCQ-A; alcoholics scored high in FCQ-A and low in FCQ-S; schizophrenics with an additional alcohol disorder scored high in both of the subscales. It is concluded that direct group comparisons seem to be promising for the identification of non-psychotic subjective phenomena which are characteristic for schizophrenia.
Fox, Rina S.; Malcarne, Vanessa L.; Roesch, Scott C.; Sadler, Georgia Robins
This study describes the reliability and validity of scores on the Cultural Health Attributions Questionnaire (CHAQ), and proposes a refined short form. Murguía, Zea, Reisen and Peterson (2000) developed the 24-item CHAQ to assess health beliefs among Latinos/Hispanics. The CHAQ incorporates two 12-item subscales: Equity Attributions (EA) and Behavioral-Environmental Attributions (BEA). Although the CHAQ has been published in Spanish and English, psychometric properties have only been evaluated for scores on the Spanish-language version. Participants in the present study were 436 Latinos/Hispanics, half of whom completed the CHAQ in Spanish and half in English. Multigroup confirmatory factor analysis indicated that the proposed two-factor structure did not fit the data for either language. Subsequent exploratory factor analyses revealed different best-fitting models for the two languages. A common two-factor (EA/BEA) structure was derived from items that loaded univocally in both languages. Additional items were removed to produce a ten-item revised version (CHAQ-R). The two factors were negatively correlated and had good internal consistency reliability. Expected relationships of CHAQ-R scores to acculturation and health locus of control strongly supported convergent validity. The relationship of EA to ethnomedical services usage marginally supported criterion validity. Overall, the results support the reliability and validity of CHAQ-R scores to measure cultural health attributions in Latinos/Hispanics, but further psychometric evaluation is needed. PMID:24773009
Martindale, Russell J J; Collins, Dave; Wang, John C K; McNeill, Michael; Lee, Kok Sonk; Sproule, John; Westbury, Tony
As sporting challenge at the elite level becomes ever harder, maximizing effectiveness of the talent development pathway is crucial. Reflecting this need, this paper describes the development of the Talent Development Environment Questionnaire, which has been designed to facilitate the development of sporting potential to world-class standard. The questionnaire measures the experiences of developing athletes in relation to empirically identified "key features" of effective talent development environments. The first phase involved the generation of questionnaire items with clear content and face validity. The second phase explored the factor structure and reliability. This was carried out with 590 developing athletes through application of exploratory factor analysis with oblique rotation, principal axis factoring extraction and Cronbach's alpha tests. This yielded a 59-item, seven-factor structure with good internal consistency (0.616-0.978). The Talent Development Environment Questionnaire appears to be a promising psychometric instrument that can potentially be useful for education and formative review in applied settings, and as a measurement tool in talent development research.
Dragoş, D; Ojog, DG; Tănăsescu, MD
Objective. To further evaluate the adequacy of the items in our questionnaire aimed at unraveling the possible correlations between psychological features and internal disorders. This paper is dedicated to the items exploring the individual’s interaction with other people. Method. The items are divided into several subdomains. For each subdomain, we have calculated the correlations between the items of the respective subdomain (inner associations) and with the items in other subdomains (outer associations) by means of chi square test or Fisher exact test as dictated by statistical reasons. We examined the answers from our first 10192 respondents. Results and conclusions. Many inter-item correlations are the consequence of higher or lesser degrees of synonymy. Those within a given subdomain confirm the adequate allocation of items. Those bridging different subdomains may point either to incorrect assignments, or to semantic inclusion relations. Other results are not explicable by semantic similarity, and probably reveal psychological subtleties, such as: most individuals have a sense of undeservedness when badly treated by other people; those easily hurt by insults and humiliations have a propensity to timidity and/or emotivity; the subjects who shun conflicts are more prone to persistent thoughts, brooding people are more sensitive and more prone to conflicts, injustice-indignant people frequently get into conflict although they declare to be bothered by dissent etc. But at the heart of all the PFs in the Interaction-with-other-people domain there seems to be the sense of being undervalued, which should probably be the key issue to be addressed by any therapeutic interventions for diseases psychoemotionally determined by disturbed interpersonal relationships. Abbreviations: PF = psychological feature; Chisq = chi-square; OdRa = odds ratio; OdRaCL = odds ratio confidence limits; ErrProb = probability of error PMID:22514567
Huggins-Manley, Anne Corinne
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
A Monte Carlo Study Investigating the Influence of Item Discrimination, Category Intersection Parameters, and Differential Item Functioning Patterns on the Detection of Differential Item Functioning in Polytomous Items
The increased use of polytomous item formats has led assessment developers to pay greater attention to the detection of differential item functioning (DIF) in these items. DIF occurs when an item performs differently for two contrasting groups of respondents (e.g., males versus females) after controlling for differences in the abilities of the…
Nuntiyagul, Atorn; Naruedomkul, Kanlaya; Cercone, Nick; Wongsawang, Damras
We present PKIP, an adaptable learning assistant tool for managing question items in item banks. PKIP is not only able to automatically assist educational users to categorize the question items into predefined categories by their contents but also to correctly retrieve the items by specifying the category and/or the difficulty level. PKIP adapts…
A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines--lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly…
Ottmar, Erin R.; Konold, Timothy R.; Berry, Robert Q.; Grissmer, David W.; Cameron, Claire E.
Psychometric properties of 24 items from the fifth grade Early Childhood Longitudinal Study-Kindergarten Cohort Mathematics Teacher Questionnaire were investigated in a sample of 5,181 participants. These items asked teachers to report how often they had their classroom students engage in different mathematics content, skills and instructional…
Rushall, B S; Wiznuk, K
The purpose of this study was to provide an assessment tool to judge coaching performance that was appropriate for completion by athletes. The questionnaire underwent a variety of developmental stages. In its final form, it contained 36 items. The tool was shown to be a valid, reliable, and standardized questionnaire. It demonstrated discriminability and provoked honest, accurate responding in subjects. The test was capable of providing immediate feedback to coaches seeking information about athletes' perceptions of their coaching performance. Responses on the developed scale were weighted to reflect the desirability of the coaching characteristics of a good coach. The questionnaire provides a total score which can be interpreted by the coach as a measure of how much of an "ideal" coach exists in him/her.
In computerized adaptive testing (CAT), examinees are presented with various sets of items chosen from a precalibrated item pool. Consequently, the attrition speed of the items is extremely fast, and replenishing the item pool is essential. Therefore, item calibration has become a crucial concern in maintaining item banks. In this study, a two-parameter logistic model is used. We applied optimal designs and adaptive sequential analysis to solve this item calibration problem. The results indicated that the proposed optimal designs are cost effective and time efficient. PMID:25188318
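For readers unfamiliar with the two-parameter logistic (2PL) model mentioned above, a minimal sketch of its standard response function follows, together with the usual item information function for ability. It does not reproduce the authors' optimal designs or sequential procedure, whose details are not given in the abstract; the function names and parameter values are illustrative assumptions.

```python
import math


def p_correct(theta, a, b):
    """2PL probability of a correct response.

    theta: examinee ability, a: item discrimination, b: item difficulty.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information about theta carried by one 2PL item."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)


# Hypothetical item with discrimination 2.0 and difficulty 0.0.
p_at_b = p_correct(0.0, 2.0, 0.0)
info_at_b = item_information(0.0, 2.0, 0.0)
info_off_b = item_information(1.0, 2.0, 0.0)
```

The information is largest when ability equals the item difficulty, which is why adaptive tests tend to select items near the current ability estimate.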
Hooker, Giles; Finkelman, Matthew
Hooker, Finkelman, and Schwartzman ("Psychometrika," 2009, in press) defined a paradoxical result as the attainment of a higher test score by changing answers from correct to incorrect and demonstrated that such results are unavoidable for maximum likelihood estimates in multidimensional item response theory. The potential for these results to…
Crescioni, Mabel; Messer, Dawn H.; Warholak, Terri L.; Miller, Joseph M.; Twelker, J. Daniel; Harvey, Erin M.
Purpose To evaluate and refine a newly developed instrument, the Student Refractive Error and Eyeglasses Questionnaire (SREEQ), designed to measure the impact of uncorrected and corrected refractive error on vision-related quality of life (VRQoL) in school-aged children. Methods. A 38-statement instrument consisting of two parts was developed: Part A relates to perceptions regarding uncorrected vision and Part B relates to perceptions regarding corrected vision and includes other statements regarding VRQoL with spectacle correction. The SREEQ was administered to 200 Native American 6th through 12th grade students known to have previously worn and who currently require eyeglasses. Rasch analysis was conducted to evaluate the functioning of the SREEQ. Statements on Part A and Part B were analyzed to examine the dimensionality and constructs of the questionnaire, how well the items functioned, and the appropriateness of the response scale used. Results Rasch analysis suggested two items be eliminated and the measurement scale for matching items be reduced from a 4-point response scale to a 3-point response scale. With these modifications, categorical data were converted to interval-level data to conduct an item and person analysis. A shortened version of the SREEQ was constructed with these modifications, the SREEQ-R, which included the statements that were able to capture changes in VRQoL associated with spectacle wear for those with significant refractive error in our study population. Conclusions While the SREEQ Part B appears to have less than optimal reliability to assess the impact of spectacle correction on VRQoL in our student population, it is also able to detect statistically significant differences from pretest to posttest on both the group and individual levels to show that the instrument can assess the impact that glasses have on VRQoL. Further modifications to the questionnaire, such as those included in the SREEQ-R, could enhance its functionality.
Jap, Tjibeng; Tiatri, Sri; Jaya, Edo Sebastian; Suteja, Mekar Sari
Online gaming is an increasingly popular source of entertainment for all ages, with relatively prevalent negative consequences. Addiction is a problem that has received much attention. This research aims to develop a measure of online game addiction for Indonesian children and adolescents. The Indonesian Online Game Addiction Questionnaire draws from earlier theories and research on internet and game addiction. Its construction is further enriched by including findings from qualitative interviews and field observation to ensure appropriate expression of the items. The measure consists of 7 items with a 5-point Likert scale. It was validated by testing 1,477 Indonesian junior and senior high school students from several schools in Manado, Medan, Pontianak, and Yogyakarta. The validation evidence is shown by item-total correlation and criterion validity. The Indonesian Online Game Addiction Questionnaire has good item-total correlation (ranging from 0.29 to 0.55) and acceptable reliability (α = 0.73). It is also moderately correlated with the participant's longest time record playing online games (r = 0.39; p<0.01), average days per week playing online games (ρ = 0.43; p<0.01), average hours per day playing online games (ρ = 0.41; p<0.01), and monthly expenditure on online games (ρ = 0.30; p<0.01). Furthermore, we created a clinical cut-off estimate by combining criteria and population norms. The clinical cut-off estimate showed that a score of 14 to 21 may indicate mild online game addiction, and a score of 22 and above may indicate online game addiction. Overall, the results show that the Indonesian Online Game Addiction Questionnaire has sufficient psychometric properties for research use, as well as limited clinical application.
Pau, Allan; Croucher, Ray; Marcenes, Wagner; Leung, Theresa
Dental pain, estimated to affect 12-40% of community-dwelling adults, is a symptom of a wide range of clinical conditions. A population screening instrument is needed to study their prevalence. This project aimed to develop a questionnaire for classifying a sample of dental pain patients into three groups of common dental pain conditions, i.e. Group 1 (Acute periapical periodontitis and Irreversible pulpitis), Group 2 (Reversible pulpitis and Dentine hypersensitivity) and Group 3 (Pericoronitis). Initial items were generated through a literature review, individual unstructured patient interviews and consultation with experts. The items generated were administered to a sample of dental pain patients for self-completion. Responses were subjected to a series of factor and discriminant analyses to identify questions capable of differentiating the sample into three groups, originally categorized by clinical diagnosis, with high classification rates. The selected items were administered to a further sample of dental pain patients to test their sensitivity and specificity in classifying the sample into three groups against the gold standard of clinical diagnosis. The final 16-item Dental Pain Questionnaire (DePaQ) was capable of correctly classifying 89.7% of dental pain cases initially categorized by clinical diagnoses. The sensitivity of the questionnaire was 0.80 for Group 1, 0.85 for Group 2 and 0.59 for Group 3. Specificity was 0.83 for Group 1, 0.89 for Group 2 and 0.90 for Group 3. The DePaQ, which can easily be administered by non-clinical personnel, may be used to collect epidemiological data on common dental pain conditions, assess dental needs for a specified population, and triage patients seeking treatment for dental pain.
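The per-group sensitivity and specificity reported above can be illustrated with a one-versus-rest calculation against the clinical diagnosis. The sketch below is a generic example, not the study's analysis; the function name and the toy classifications are hypothetical.

```python
def sensitivity_specificity(actual, predicted, group):
    """One-vs-rest sensitivity and specificity for a single group label."""
    pairs = list(zip(actual, predicted))
    tp = sum(a == group and p == group for a, p in pairs)  # correctly flagged
    fn = sum(a == group and p != group for a, p in pairs)  # missed cases
    tn = sum(a != group and p != group for a, p in pairs)  # correctly excluded
    fp = sum(a != group and p == group for a, p in pairs)  # false alarms
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("group must be present and absent in the diagnoses")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical data: clinical diagnosis vs questionnaire classification.
clinical = [1, 1, 2, 2, 3, 3]
questionnaire = [1, 2, 2, 2, 3, 1]
sens_1, spec_1 = sensitivity_specificity(clinical, questionnaire, group=1)
```

For Group 1 in this toy data, one of the two true cases is detected (sensitivity 0.5) and three of the four non-cases are correctly excluded (specificity 0.75).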
Kelly, Diane; Kantor, Paul B.; Morse, Emile; Scholtz, Jean; Sun, Y.
Evaluating interactive question answering (QA) systems with real users can be challenging because traditional evaluation measures based on the relevance of items returned are difficult to employ since relevance judgments can be unstable in multi-user evaluations. The work reported in this paper evaluates, in distinguishing among a set of interactive QA systems, the effectiveness of three questionnaires: a Cognitive Workload Questionnaire (NASA TLX), and Task and System Questionnaires customized to a specific interactive QA application. These Questionnaires were evaluated with four systems, seven analysts, and eight scenarios during a 2-week workshop. Overall, results demonstrate that all three Questionnaires are effective at distinguishing among systems, with the Task Questionnaire being the most sensitive. Results also provide initial support for the validity and reliability of the Questionnaires.
Müller, Karolina; Edvall, Niklas K.; Idrizbegovic, Esma; Huhn, Robert; Cima, Rilana; Persson, Viktor; Leineweber, Constanze; Westerlund, Hugo; Langguth, Berthold; Schlee, Winfried; Canlon, Barbara; Cederroth, Christopher R.
Background: Due to the lack of objective measures for assessing tinnitus, its clinical evaluation largely relies on the use of questionnaires and psychoacoustic tests. A global assessment of tinnitus burden would largely benefit from holistic approaches that not only incorporate measures of tinnitus but also take into account associated fears, emotional aspects (stress, anxiety, and depression), and quality of life. In Sweden, only a few instruments are available for assessing tinnitus, and the existing tools lack validation. Therefore, we translated a set of questionnaires into Swedish and evaluated their reliability and validity in a group of tinnitus subjects. Methods: We translated the English versions of the Tinnitus Functional Index (TFI), the Fear of Tinnitus Questionnaire (FTQ), the Tinnitus Catastrophizing Scale (TCS), the Perceived Stress Questionnaire (PSQ-30), and the Tinnitus Sample Case History Questionnaire (TSCHQ) into Swedish. These translations were delivered via the internet with the already existing Swedish versions of the Tinnitus Handicap Inventory (THI), the Hospital Anxiety and Depression Scale (HADS), the Hyperacusis Questionnaire (HQ), and the World Health Organization Quality of Life questionnaire (WHOQoL-BREF). Psychometric properties were evaluated by means of internal consistency [Cronbach's alpha (α)] and test–retest reliability across a 9-week interval [Intraclass Correlation Coefficient (ICC), Cohen's kappa] in order to establish construct as well as clinical validity using a sample of 260 subjects from a population-based cohort. Results: Internal consistency was acceptable for all questionnaires (α > 0.7) with the exception of the “social relationships” subscale of the WHOQoL-BREF. Test–retest reliability was generally acceptable (ICC > 0.70, Cohen's kappa > 0.60) for the tinnitus-related questionnaires, except for the TFI “sense of control” subscale and 15 items of the TSCHQ. Spearman rank correlations showed that
Toland, Michael D; Sulis, Isabella; Giambona, Francesca; Porcu, Mariano; Campbell, Jonathan M
A bifactor item response theory model can be used to aid in the interpretation of the dimensionality of a multifaceted questionnaire that assumes continuous latent variables underlying the propensity to respond to items. This model can be used to describe the locations of people on a general continuous latent variable as well as on continuous orthogonal specific traits that characterize responses to groups of items. The bifactor graded response (bifac-GR) model is presented in contrast to a correlated traits (or multidimensional GR model) and unidimensional GR model. Bifac-GR model specification, assumptions, estimation, and interpretation are demonstrated with a reanalysis of data (Campbell, 2008) on the Shared Activities Questionnaire. We also show the importance of marginalizing the slopes for interpretation purposes and we extend the concept to the interpretation of the information function. To go along with the illustrative example analyses, we have made available supplementary files that include command file (syntax) examples and outputs from flexMIRT, IRTPRO, R, Mplus, and STATA. Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.jsp.2016.11.001. Data needed to reproduce analyses in this article are available as supplemental materials (online only) in the Appendix of this article.
Lau, C. Allen; Wang, Tianyou
This paper proposes a new Information-Time index as the basis for item selection in computerized classification testing (CCT) and investigates how this new item selection algorithm can help improve test efficiency for item pools with mixed item types. It also investigates how practical constraints such as item exposure rate control, test…
Powell, Danny H; Elwood Jr, Robert H
The questionnaire is the instrument used for recording performance data on the nuclear material protection, control, and accountability (MPC&A) system at a nuclear facility. The performance information provides a basis for evaluating the effectiveness of the MPC&A system. The goal for the questionnaire is to provide an accurate representation of the performance of the MPC&A system as it currently exists in the facility. Performance grades for all basic MPC&A functions should realistically reflect the actual level of performance at the time the survey is conducted. The questionnaire was developed after testing and benchmarking the material control and accountability (MC&A) system effectiveness tool (MSET) in the United States. The benchmarking exercise at the Idaho National Laboratory (INL) proved extremely valuable for improving the content and quality of the early versions of the questionnaire. Members of the INL benchmark team identified many areas of the questionnaire where questions should be clarified and areas where additional questions should be incorporated. The questionnaire addresses all elements of the MC&A system. Specific parts pertain to the foundation for the facility's overall MPC&A system, and other parts pertain to the specific functions of the operational MPC&A system. The questionnaire includes performance metrics for each of the basic functions or tasks performed in the operational MPC&A system. All of those basic functions or tasks are represented as basic events in the MPC&A fault tree. Performance metrics are to be used during completion of the questionnaire to report what is actually being done in relation to what should be done in the performance of MPC&A functions.
The data needs questionnaire is an element in the project design study for the Michigan Resource Inventory Act and is aimed at gathering information on what inventory information is required by land use planners throughout the state. Analysis of questionnaire responses is discussed. Some information on current use categories was tabulated. The respondents selected a broad range of categories at all levels of detail. Those most frequently indicated were urban categories.
... Programming Interfaces (APIs) that are implemented and/or supported. Explain which interfaces are for internal... wireless products). (12) For products which incorporate an “open cryptographic interface” as defined in part 772 of the EAR, describe the cryptographic interface. (c) For classification requests for...
Bennett, Roger; Kane, Suzanne
In many countries the outputs from university student satisfaction surveys are used for a variety of educational management purposes. Within the United Kingdom, the main instrument employed by state authorities to measure student satisfaction is the National Student Survey (NSS). The issue investigated by the current research related to whether…
... software, provide the following information: (1) Description of all the symmetric and asymmetric encryption... third-party hardware or software encryption components (if any). Identify the manufacturers of the hardware or software components, including specific part numbers and version information as needed...
Cramer, Angelique O. J.
What is validity? A simple question but apparently one with many answers, as Paul Newton highlights in his review of the history of validity. The current definition of validity, as entertained in the 1999 "Standards for Educational and Psychological Testing" is indeed a consensus, one between the classical notion of attributes, and measures…
INOUE, Akiomi; KAWAKAMI, Norito; SHIMOMITSU, Teruichi; TSUTSUMI, Akizumi; HARATANI, Takashi; YOSHIKAWA, Toru; SHIMAZU, Akihito; ODAGIRI, Yuko
This study aimed to investigate the reliability and construct validity of a new version of the Brief Job Stress Questionnaire (New BJSQ), which measures an extended set of psychosocial factors at work by adding new scales/items to the current version of the BJSQ. Additional scales/items were extensively collected from theoretical job stress models and similar questionnaires in several countries. Scales/items were field-tested and refined through a pilot internet survey. Finally, an 84-item questionnaire (141 items in total when combined with the current BJSQ) was developed. A nationally representative survey was administered to employees in Japan (n=1,633) to examine the reliability and construct validity. Most scales showed acceptable levels of internal consistency and test-retest reliability. Principal component analyses showed that the first factor explained 50% or greater proportion of the variance in most scales. A scale factor analysis and a correlation analysis showed that these scales fit the theoretical expectations. These findings provided a piece of evidence that the New BJSQ scales are reliable and valid. Although more detailed content and construct validity should be examined in future study, the New BJSQ is a useful instrument to evaluate psychosocial work environment and positive mental health outcomes in the current workplace. PMID:24492763
Four questionnaires, designed to measure attitudes toward a proposed homework hotline, are included in this document. There are versions for parents of students in grades 4 to 6, for junior high school students, for high school students, and for educators. The items concern student characteristics, desirable parental role in helping with homework,…
Massidda, Davide; Giorgi, Ines; Vidotto, Giulio; Tringali, Salvatore; Imbriani, Marcello; Baiardi, Paola; Bertolotti, Giorgio
Introduction and objectives A multidimensional self-report questionnaire to evaluate job-related stress factors is presented. The questionnaire, called Maugeri Stress Index – reduced form (MASI-R), aims to assess the impact of job strain on a team or on a single worker by considering four domains: wellness, resilience, perception of social support, and reactions to stressful situations. Material and methods The reliability of a first longer version (47 items) of the questionnaire was evaluated by an internal consistency analysis and a confirmatory factor analysis. An item reduction procedure was implemented to obtain a short form of the instrument, and the psychometric properties of the resulting instrument were evaluated using the Rasch measurement model. Results A total of 14 items from the initial pool were deleted because they were not productive for measurement. The analysis of internal consistency led to the exclusion of eight items, while the analysis performed using structural equation models led to the exclusion of another six items. According to the Rasch model, item properties and the reliability of the instrument appear good, especially for the scales for wellness and resilience. In contrast, the scales for perception of social support and negative coping styles show a lower internal consistency. Conclusions The Maugeri Stress Index – reduced form provides a reliable and valid measure, useful for early identification of stress levels in workers or in a team along the eustress–distress continuum. PMID:28392695
Letamendi, Andrea M.; Chavira, Denise A.; Hitchcock, Carla A.; Roesch, Scott C.; Shipon-Blum, Elisa; Stein, Murray B.
Objective To evaluate the factor structure, reliability, and validity of the 17-item Selective Mutism Questionnaire. Method Diagnostic interviews were administered via telephone to 102 parents of children identified with selective mutism (SM) and 43 parents of children without SM from varying U.S. geographic regions. Children were between the ages of 3 and 11 inclusive and comprised 58% girls and 42% boys. SM diagnoses were determined using the Anxiety Disorders Interview Schedule for Children - Parent Version (ADIS-C/P); SM severity was assessed using the 17-item Selective Mutism Questionnaire (SMQ); and behavioral and affective symptoms were assessed using the Child Behavior Checklist (CBCL). An exploratory factor analysis (EFA) was conducted to investigate the dimensionality of the SMQ and a modified parallel analysis procedure was used to confirm EFA results. Internal consistency, construct validity, and incremental validity were also examined. Results The EFA yielded a 13-item solution consisting of three factors: a) Social Situations Outside of School, b) School Situations, and c) Home and Family Situations. Internal consistency of SMQ factors and total scale ranged from moderate to high. Convergent and incremental validity were also well supported. Conclusions Measure structure findings are consistent with the 3-factor solution found in a previous psychometric evaluation of the SMQ. Results also suggest that the SMQ provides useful and unique information in the prediction of SM phenomenon beyond other child anxiety measures. PMID:18698268
González-Espada, Wilson J.
Many physical science and physics instructors might not be trained in pedagogically appropriate test construction methods. This could lead to test items that do not measure what they are intended to measure. A subgroup of these items might show bias against some groups of students. This paper describes how the author became aware of potentially biased items against females in his examinations, which led to the exploration of fundamental issues related to item validity, gender bias, and differential item functioning, or DIF. A brief discussion of DIF in the context of university courses, as well as practical suggestions to detect possible gender-biased items, follows.
Arunsurat, Itthiphat; Luengyosluechakul, Swita; Prateephoungrat, Krittin; Siripaupradist, Pittayapoom; Khemtong, Sukanya; Jamcharoensup, Kunranan; Thanapatkaiporn, Narin; Limpawattana, Panita; Laohasiriwong, Supawan; Pinitsoontorn, Somdej; Boonjaraspinyo, Sirintip; Sawanyawisuth, Kittisak
Obstructive Sleep Apnea (OSA) is a common disease associated with major cardiovascular diseases. Male subjects are at higher risk for OSA than female subjects. The Berlin questionnaire is a beneficial screening tool for OSA and has 14 items. The Berlin questionnaire may need some adjustment for Thai or Asian populations. We aimed to find items that should be asked in the Berlin questionnaire to identify high risk for obstructive sleep apnea among Thai male healthcare workers. This study was performed in Thai male healthcare workers over the age of 35 and currently working at the Faculty of Medicine, Khon Kaen University. The Thai version of the Berlin questionnaire was randomly distributed. A study population of 273 subjects was required to provide a confidence value of 95%. Items of the Berlin questionnaire were evaluated as independent factors for being at high risk of OSA by using a multivariate logistic regression analysis. Of the 273 distributed questionnaires, 135 subjects returned them (49.5% response rate). Of those, 41 (30.4%) were identified as being at high risk of OSA. Only three items of the Berlin questionnaire, namely frequent snoring, high body mass index and hypertension, were independently associated with being at high risk for OSA. In conclusion, the Berlin questionnaire can be shortened for identifying high risk for OSA as determined by the questionnaire itself, not by polysomnography.
Shore, Bruce M.; Chichekian, Tanya; Syer, Cassidy A.; Aulls, Mark W.; Frederiksen, Carl H.
Tools are needed to track the elements of students' successful engagement in inquiry. The "McGill Strategic Demands of Inquiry Questionnaire" (MSDIQ) is a 79-item, criterion-referenced, learner-focused questionnaire anchored in Schon's model and related models of self-regulated learning. The MSDIQ addresses three phases of inquiry…
Berg, Kelly C.; Peterson, Carol B.; Frazier, Patricia; Crow, Scott J.
Significant discrepancies have been found between interview- and questionnaire-based assessments of psychopathology; however, these studies have typically compared instruments with unmatched item content. The Eating Disorder Examination (EDE), a structured interview, and the questionnaire version of the EDE (EDE-Q) are considered the preeminent…
Van Ravesteyn, Nicolien T.; Dallmeijer, Annet J.; Scholtes, Vanessa A.; Roorda, Leo D.; Becher, Jules G.
Aim: The objective of this study was to assess the reliability of a mobility questionnaire (MobQues) that was developed to measure the mobility limitations of children with cerebral palsy (CP) as rated by their parents. A clinical version of the questionnaire, consisting of 47 items (MobQues47), is available, as well as a research version with 28…
Soutome, Sakiko; Kajiwara, Kazumi; Oho, Takahiko
Objective: To examine whether the combined use of a task-specific self-efficacy scale for oral health behaviour (SEOH) and an oral health questionnaire (OHQ) would be useful for evaluating subjects' behaviours and cognitions. Design: Questionnaires. Methods: One hundred and eighty-five students completed the SEOH and OHQ. The 30-item OHQ uses a…
Ferreira, Joaquim A; Martins, Jorge S; Coelho, Mariana S; Kahler, Christopher W
Extant literature suggests that Portuguese college students frequently drink alcohol and experience a variety of alcohol-related negative consequences. However, to our knowledge, there is no validated measure to assess negative consequences of drinking alcohol for college students in Portugal. This article describes a validation of the Portuguese version of the Brief Young Adult Alcohol Consequences Questionnaire. Originally developed by Kahler, Strong, and Read (2005), this 24-item questionnaire is a widely used self-report measure with strong psychometric properties and validity for the evaluation of the negative consequences of drinking in college students. We collected data from 620 students at the University of Coimbra (Portugal). Participants completed (a) a background questionnaire, (b) the Alcohol Use Disorders Identification Test (AUDIT), (c) the Daily Drinking Questionnaire - Revised (DDQ-R), and (d) the Brief Young Adult Alcohol Consequences Questionnaire (B-YAACQ) translated into Portuguese as part of this study. Analyses showed that items fit a unidimensional Rasch model well, with item infit statistics ranging from .82 to 1.27, supporting the use of all items to create a total sum score of the Portuguese version of the B-YAACQ. The Portuguese version of the B-YAACQ showed adequate internal reliability (α = .87) and concurrent validity. Results support its use and integration in research on interventions targeted to reduce adverse effects associated with excessive drinking among Portuguese college students.
Baranowski, Tom; Allen, Diane D; Mâsse, Louise C; Wilson, Mark
There has been some concern that participation in an intervention and exposure to a measurement instrument can change participants' interpretation of the items on a self-report questionnaire thereby distorting subsequent responses and biasing results. Differential item functioning (DIF) analysis using item response modeling can ascertain possible differences in item interpretation by testing for differences in item location between groups. The DIF for treatment versus control group differences at post-intervention assessment and the Time 1 and Time 2 differences in a control group were analyzed using data from a dietary change intervention trial for Boy Scouts. The measures included fruit and vegetable (FV) frequency of consumption, preferences and self-efficacy. Treatment-control group DIF at post-intervention assessment was detected in a higher percentage of items for FV frequency than for preference or self-efficacy. Time 1 to Time 2 differences in items for the control group were detected in one item for each of the three scales. Further research will need to clarify whether the obtained DIFs reflected true changes in frequency, preference or self-efficacy or some reinterpretation of items by participants following an intervention or merely after previous exposure to the measure.
Kelly, Laura; Potter, Caroline M; Hunter, Cheryl; Gibbons, Elizabeth; Fitzpatrick, Ray; Jenkinson, Crispin; Peters, Michele
Purpose It is a key UK government priority to assess and improve outcomes in people with long-term conditions (LTCs). We are developing a new patient-reported outcome measure, the Long-Term Conditions Questionnaire (LTCQ), for use among people with single or multiple LTCs. This study aimed to refine candidate LTCQ items that had previously been informed through literature reviews, interviews with professional stakeholders, and interviews with people with LTCs. Materials and methods Cognitive interviews (n=32) with people living with LTCs and consultations with professional stakeholders (n=13) and public representatives (n=5) were conducted to assess the suitability of 23 candidate items. Items were tested for content and comprehensibility and underwent a translatability assessment. Results Four rounds of revisions took place, due to amendments to item structure, improvements to item clarity, item duplication, and recommendations for future translations. Twenty items were confirmed as relevant to living with LTCs and understandable to patients and professionals. Conclusion This study supports the content validity of the LTCQ items among people with LTCs and professional stakeholders. The final items are suitable to enter the next stage of psychometric refinement. PMID:27895523
Abreu, Ana Maria de; Faria, Christina Danielli Coelho de Morais; Cardoso, Sônia Maria Vicente; Teixeira-Salmela, Luci Fuscaldi
The aim of the present study was to investigate the psychometric properties and validate the Portuguese version of the Fear Avoidance Beliefs Questionnaire (FABQ-Brazil). This instrument assesses how beliefs and fear of individuals with lower back pain affect two subscales related to their physical activities (FABQ-Phys) and work (FABQ-Work). The questionnaire was translated into Brazilian Portuguese, following the recommended methodology, and applied to 53 individuals with non-specific chronic lower back pain. The test-retest intra-class correlation coefficients (ICC = 0.84 and 0.91) and the internal consistency (Cronbach's α = 0.80 and 0.90) for FABQ-Phys and FABQ-Work, respectively, were acceptable. The stepwise multiple regression analyses revealed statistically significant correlations between all isolated items and their respective subscales, and the set of items explained 99% of the changes in scores for each subscale. No significant correlations were found between the subscales; however, both the FABQ-Phys and FABQ-Work subscales were positively associated with pain intensity (visual numerical scale) and degree of disability (Roland Morris Questionnaire). These findings supported the evidence that the FABQ-Brazil showed adequate psychometric properties for individuals with chronic lower back pain.
Fukuhara, Hirotaka; Kamata, Akihito
A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…
Prins, Martin H; Marrel, Alexia; Carita, Paulo; Anderson, David; Bousser, Marie-Germaine; Crijns, Harry; Consoli, Silla; Arnould, Benoit
Background The side effects and burden of anticoagulant treatments may contribute to poor compliance and consequently to treatment failure. A specific questionnaire is necessary to assess patients' needs and their perceptions of anticoagulant treatment. Methods A conceptual model of expectation and satisfaction with anticoagulant treatment was designed by an advisory board and used to guide patient (n = 31) and clinician (n = 17) interviews in French, US English and Dutch. Patients had either atrial fibrillation (AF), deep venous thrombosis (DVT), or pulmonary embolism (PE). Following interviews, three PACT-Q language versions were developed simultaneously and further pilot-tested by 19 patients. Linguistic validations were performed for additional language versions. Results Initial concepts were developed to cover three areas of interest: 'Treatment', 'Disease and Complications' and 'Information about disease and anticoagulant treatment'. After clinician and patient interviews, concepts were further refined into four domains and 17 concepts; test versions of the PACT-Q were then created simultaneously in three languages, each containing 27 items grouped into four domains: "Treatment Expectations" (7 items), "Convenience" (11 items), "Burden of Disease and Treatment" (2 items) and "Anticoagulant Treatment Satisfaction" (7 items). No item was deleted or added after pilot testing as patients found the PACT-Q easy to understand and appropriate in length in all languages. The PACT-Q was divided into two parts: the first part to measure the expectations and the second to measure the convenience, burden and treatment satisfaction, for evaluation prior to and after anticoagulant treatment, respectively. Eleven additional language versions were linguistically validated. Conclusion The PACT-Q has been rigorously developed and linguistically validated. It is available in 14 languages for use with thromboembolic patients, including AF, PE and DVT patients. Its validation and
Response Alternative Revision. The questionnaire item had been selected for pertinence to basic training based on recruit interviews (cf. Vickers 4...response format which asked recruits to select from alternatives ranging from "Disagree Strongly" (1) to "Agree Strongly" (7) that an event had...resulting scales used in the revised questionnaire are given in Appendix A. Method. Four hundred and thirty-three recruits were randomly selected from
Li, Ling; Xue, Jing; Li, Zhan-Zhan
Introduction Breast cancer patients are demanding more active roles in their care, especially in the initial diagnosis and treatment stages. At present, there is no suitable patient questionnaire that appropriately incorporates Chinese language, habits, and cultural differences. Aim To develop and validate a patient-needs questionnaire for female breast cancer inpatients in China. Materials and Methods The questionnaire structure was based on Maslow’s model and a modern medical model. In the first step, a focus group was used to design 125 questions, of which 64 constituted the initial questionnaire for item screening with a group of 115 hospitalized patients with breast cancer. Items were included or excluded based on the evaluation of eight statistical analyses. Ultimately, 38 items were selected and validated. The reliability and validity of the 38-item questionnaire were determined in a cohort of 323 patients. Results The scale was set up with the 38 selected items. The four primary areas were disease knowledge, medical environment, psychosocial parameters and sexual attitudes. Cronbach’s α coefficient was 0.959. The split-half reliability value was 0.935. Principal component factor analysis extracted four common factors. Conclusion Our new questionnaire, designed to assess the care needs of Chinese inpatients with breast cancer, is reliable, sensitive, effective, independent and representative. It can be used in medical practice as a tool for a more complete assessment of patients’ needs. PMID:27891441
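A split-half reliability value such as the one reported above can be sketched as follows. The abstract does not say how the halves were formed or whether a correction was applied; the odd/even item split and the Spearman-Brown step-up used here are assumptions, and the function names and toy data are hypothetical.

```python
from statistics import mean

def pearson(x, y):
    """Pearson correlation between two equal-length score lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def split_half_reliability(rows):
    """Odd/even split-half correlation, stepped up with Spearman-Brown."""
    first_half = [sum(r[0::2]) for r in rows]   # items 1, 3, 5, ...
    second_half = [sum(r[1::2]) for r in rows]  # items 2, 4, 6, ...
    r_half = pearson(first_half, second_half)
    # Spearman-Brown: estimated reliability of the full-length scale.
    return 2 * r_half / (1 + r_half)

# Hypothetical toy data: three respondents, two items.
toy = [[1, 2], [2, 1], [3, 3]]
reliability = split_half_reliability(toy)
```

The step-up is needed because each half is only half as long as the full scale, so the raw half-to-half correlation understates full-scale reliability.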
Pettersen, Cathrine; Nunes, Kevin L; Cortoni, Franca
The Buss-Perry Aggression Questionnaire (AQ) is a self-report measure of aggressiveness commonly employed in nonforensic and forensic settings and is included in violent offender pre- and posttreatment assessment batteries. The aim of the current study was to assess the fit of the four-factor model of the AQ with violent offenders (N = 271), a population for which the factor structure of the English version of the AQ has not previously been examined. Confirmatory factor analyses did not yield support for the four-factor model of the original 29-item AQ. Acceptable fit was obtained with the 12-item short form, but careful examination of the relationships between the latent factors revealed that the four subscales of the AQ may not represent distinct aspects of aggressiveness. Our findings call into question whether the AQ optimally measures trait aggressiveness among violent offenders.
..., USDA will use life cycle cost information only from tests using the BEES analytical method. (c... availability of such items and the economic and technological feasibility of using such items, including life cycle costs. USDA will gather information on individual products within an item and extrapolate...
French, Ann; Godwin, Janet
The development of innovative test item types that use multimedia technology to improve item authenticity and interaction and allow for objective scoring through partial-credit scoring methodologies was studied. Science test items were developed for community college developmental students using "Authorware 3.0," an instructional compact disc. The…
Santelices, Maria Veronica; Wilson, Mark
The relationship between differential item functioning (DIF) and item difficulty on the SAT is such that more difficult items tended to exhibit DIF in favor of the focal group (usually minority groups). These results were reported by Kulick and Hu, and Freedle and have been enthusiastically discussed by more recent literature. Examining the…
Sykes, Robert C.; Ito, Kyoko; Wang, Zhen
Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…
Mislevy, Robert J.; Rieser, Mark R.
Multiple matrix sampling (MMS) theory indicates how data may be gathered to most efficiently convey information about levels of attainment in a population, but standard analyses of these data require random sampling of items from a fixed pool of items. This assumption proscribes the retirement of flawed or obsolete items from the pool as well as…
Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina
Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…
van der Linden, Wim J.
The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)
Item response functions of the parametric logistic IRT models follow the logistic form which is monotonically increasing. However, item response functions of some real items are nonmonotonic which might lead to examinees with lower proficiency levels receiving higher scores. This study compared three nonparametric IRF estimation methods--the…
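The monotonicity of a parametric logistic item response function can be illustrated with a short sketch. The two-parameter logistic (2PL) form is used here as an assumption, since the abstract does not name a specific model; the parameter values and the function names are hypothetical.

```python
import math

def irf_2pl(theta, a, b):
    """2PL item response function: P(correct | theta) for discrimination a, difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def is_monotone_increasing(values):
    """True if every value is strictly larger than the one before it."""
    return all(lo < hi for lo, hi in zip(values, values[1:]))

# Evaluate the curve on a proficiency grid from -3 to 3 for a hypothetical item.
grid = [x / 10 for x in range(-30, 31)]
curve = [irf_2pl(theta, a=1.2, b=0.5) for theta in grid]
monotone = is_monotone_increasing(curve)
```

For any positive discrimination a, the logistic form rises with proficiency, which is why a real item whose empirical curve dips cannot be captured by it and calls for a nonparametric estimate.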
Reneau, Fred; And Others
This guide contains 321 test items for use in teaching a course in repairing computer equipment. All test items were reviewed, revised, and validated by incumbent workers and subject matter instructors. Items are provided for assessing student achievement in the following skill areas (with selected skills mentioned in brackets): performing…
... Defense Acquisition Regulations System Commercial Item Handbook AGENCY: Defense Acquisition Regulations... Commercial Item Handbook. The purpose of the Handbook is to help acquisition personnel develop sound business strategies for procuring commercial items. DoD is seeking industry input on the contents before...
Hiscox, Michael D.
Educational item banking presents observers with a considerable paradox. The development of test items from scratch is viewed as wasteful, a luxury in times of declining resources. On the other hand, item banking has failed to become a mature technology despite large amounts of money and the efforts of talented professionals. The question of which…
Saddy, Douglas; Drenhaus, Heiner; Frisch, Stefan
We describe an experiment that investigated the failure to license polarity items in German using event-related brain potentials (ERPs). The results reveal distinct processing reflexes associated with failure to license positive polarity items in comparison to failure to license negative polarity items. Failure to license both negative and…
Davis, Diane, Ed.
This test item bank on welding contains test questions based upon competencies found in the Missouri Welding Competency Profile. Some test items are keyed for multiple competencies. These criterion-referenced test items are designed to work with the Vocational Instructional Management System. Questions have been statistically sampled and validated…
van den Bergh, Huub; And Others
The term differential item functioning (DIF) refers to whether or not the same psychological constructs are measured across different groups. If an item does not measure the same skills or subskills in different populations, it is said to function differentially or to display item bias. A multilevel approach to DIF is proposed. In such a model,…
We consider the identification of a semiparametric multidimensional fixed effects item response model. Item response models are typically estimated under parametric assumptions about the shape of the item characteristic curves (ICCs), and existing results suggest difficulties in recovering the distribution of individual characteristics under…
Ullstadius, Eva; Carlstedt, Berit; Gustafsson, Jan-Eric
The influence of general and verbal ability on each of 72 verbal analogy test items was investigated with new factor analytical techniques. The analogy items together with the Computerized Swedish Enlistment Battery (CAT-SEB) were given randomly to two samples of 18-year-old male conscripts (n = 8566 and n = 5289). Thirty-two of the 72 items had…
Andrich, David; Hagquist, Curt
The literature in modern test theory on procedures for identifying items with differential item functioning (DIF) among two groups of persons includes the Mantel-Haenszel (MH) procedure. Generally, it is not recognized explicitly that if there is real DIF in some items which favor one group, then as an artifact of this procedure, artificial DIF…
... encryption software are distinguished from controls on other software regulated under the EAR. (a) Licensing... items (“EI”) classified under 5A002.a.1, .a.2, .a.5, .a.6, .a.9, and .b; 5D002.a, .c.1 or .d for... items and terms. Most encryption items may be exported under the provisions of License Exception ENC...
Cai, Li; Yang, Ji Seung; Hansen, Mark
Full-information item bifactor analysis is an important statistical method in psychological and educational measurement. Current methods are limited to single-group analysis and inflexible in the types of item response models supported. We propose a flexible multiple-group item bifactor analysis framework that supports a variety of…
Reneau, Fred; And Others
This guide contains 285 test items for use in teaching a course in computerized numerical control. All test items were reviewed, revised, and validated by incumbent workers and subject matter instructors. Items are provided for assessing student achievement in such aspects of programming and planning, setting up, and operating machines with…
Thompson, Nathan A.
Several alternatives for item selection algorithms based on item response theory in computerized classification testing (CCT) have been suggested, with no conclusive evidence on the substantial superiority of a single method. It is argued that the lack of sizable effect is because some of the methods actually assess items very similarly through…
Kang, Taehoon; Petersen, Nancy S.
This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord in "Appl Psychol Measure"…
Cassel, Russell N.
This questionnaire is intended for use as one aspect in accrediting the "Student Personnel Services" which an institution of higher learning provides for students. Areas in question include personal development, health fostering, vocational preparation, effective personalized learning, economic viability, transpersonal offerings, and satisfactory…
The Diet History Questionnaire (DHQ) and the DHQ nutrient database were modified for use in Canada through the collaborative efforts of Dr. Amy Subar and staff at the Risk Factor Monitoring and Methods Branch, and Dr. Ilona Csizmadi and colleagues in the Division of Population Health and Information at the Alberta Cancer Board in Canada.
Norberg, Melissa M; Wetterneck, Chad T; Sass, Daniel A; Kanter, Jonathan W
The Milwaukee Psychotherapy Expectations Questionnaire (MPEQ) was developed to measure clients' expectations about the components and effects of therapy. Items were generated rationally based upon the theoretical literature and existing expectancy measures. An exploratory factor analysis revealed a 2-factor solution, comprised of Process Expectations and Outcome Expectations, which was supported by confirmatory factor analyses in three additional samples. The measure demonstrated good internal consistency and test-retest reliability, along with support for convergent, discriminant, and predictive validity. These results present initial evidence for the utility of the MPEQ in assessing both process and outcome expectations in therapy.
Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei
When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. fewer) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost d_i and each customer e_j ∈ E has the valuation v_j on the bundle e_j ⊆ V of items. When the store sells an item i ∈ V at the price r_i, the profit for the item i is p_i = r_i - d_i. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that p_i ≥ 0 for each i ∈ V; however, Balcan et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader,” and showed that the seller can get more total profit in the case that p_i < 0 is allowed than in the case that p_i < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratio.
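The profit definition in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' algorithm: it assumes each customer buys its whole bundle exactly when the bundle's total price does not exceed the customer's valuation (a purchase rule the abstract does not state), and it finds prices by exhaustive search over a small grid rather than by an approximation algorithm. The names `total_profit`, `best_prices`, and the example data are hypothetical.

```python
from itertools import product

def total_profit(prices, costs, customers):
    """Sum of (price - cost) over every item in every purchased bundle.

    Assumed purchase rule: a customer buys its bundle iff the bundle's
    total price is at most the customer's valuation.
    """
    profit = 0.0
    for bundle, valuation in customers:
        bundle_price = sum(prices[i] for i in bundle)
        if bundle_price <= valuation:
            profit += sum(prices[i] - costs[i] for i in bundle)
    return profit

def best_prices(costs, customers, candidate_prices):
    """Exhaustive search over a grid of candidate prices (tiny instances only)."""
    items = sorted(costs)
    best_value, best_assignment = float("-inf"), None
    for combo in product(candidate_prices, repeat=len(items)):
        prices = dict(zip(items, combo))
        value = total_profit(prices, costs, customers)
        if value > best_value:
            best_value, best_assignment = value, prices
    return best_value, best_assignment

# Hypothetical instance: two items, two customers given as (bundle, valuation).
costs = {"a": 1, "b": 1}
customers = [({"a"}, 3), ({"a", "b"}, 5)]

value, prices = best_prices(costs, customers, range(6))
print(value, prices)
```

Allowing candidate prices below an item's cost in the grid corresponds to the loss-leader setting mentioned in the abstract; restricting the grid to prices at or above cost corresponds to the nonnegative-profit assumption.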
Oner, Pinar; Oner, Ozgur; Munir, Kerim
We compared ratings on the Three-Item Direct Observation Screen test for autism spectrum disorders completed by pediatric residents with the Social Communication Questionnaire parent reports as an augmentative tool for improving autism spectrum disorder screening performance. We examined three groups of children (18-60 months) comparable in age…
Marsh, Herbert W.
The purpose of the present investigation was to develop a construct validity approach for testing whether the separation of positive and negative item subscales is substantively meaningful in self-concept research. Results from three published studies using the Self Description Questionnaire (SDQ) III were reanalyzed. The SDQ III measures 13…
Young, William R.
Natural disasters, such as hurricanes, floods, tornadoes, and tsunamis, are becoming a greater problem as climate change impacts our environment. Disasters, whether natural or man-made, destroy lives, homes, businesses and the natural environment. Such disasters can happen with little or no warning, leaving hundreds or even thousands of people without medical services, potable water, sanitation, communications and electrical services for up to several weeks. In our modern world, electricity has become a necessity. Modern building codes and new disaster-resistant building practices are reducing the damage to homes and businesses. Emergency gasoline and diesel generators are becoming commonplace for power outages. Generators need fuel, which may not be available after a disaster, but photovoltaic (solar-electric) systems supply electricity without petroleum fuel as they are powered by the sun. Photovoltaic (PV) systems can provide electrical power for a home or business. PV systems can operate as utility interactive or stand-alone with battery backup. Determining your critical load items and sizing the photovoltaic system for those critical items guarantees their operation in a disaster.
Glas, Cees A. W.; van der Linden, Wim J.
To reduce the cost of item writing and to enhance the flexibility of item presentation, items can be generated by item-cloning techniques. An important consequence of cloning is that it may cause variability on the item parameters. Therefore, a multilevel item response model is presented in which it is assumed that the item parameters of a…
Studts, Christina R; van Zyl, Michiel A
Screening preschool-aged children for disruptive behavior disorders is a key step in early intervention. The study goal was to identify screening items with excellent measurement properties at sub-clinical to clinical levels of disruptive behavior problems within the developmental context of preschool-aged children. Parents/caregivers of preschool-aged children (N = 900) were recruited from four pediatric primary care settings. Participants (mean age = 31, SD = 8) were predominantly female (87 %), either white (55 %) or African-American (42 %), and biological parents (88 %) of the target children. In this cross-sectional survey, participants completed a sociodemographic questionnaire and two parent-report behavioral rating scales: the PSC-17 and the BPI. Item response theory analyses provided item parameter estimates and information functions for 18 externalizing subscale items, revealing their quality of measurement along the continuum of disruptive behaviors in preschool-aged children. Of 18 investigated items, 5 items measured only low levels of disruptive behaviors among preschool-aged children. The remaining 13 items measured sub-clinical to clinical levels of disruptive behavior problems (i.e., >1.5 SD); however, 5 of these items offered less information, suggesting unreliable measurement. The remaining 8 items had high discrimination and difficulty parameters, offering considerable measurement information at sub-clinical to clinical levels of disruptive behavior problems. Behaviors measured by the 8 selected parent-report items were consistent with those identified in recent efforts to distinguish developmentally typical misbehaviors from clinically concerning behaviors among preschool-aged children. These items may have clinical utility in screening young children for disruptive behavior disorders.
Tay, Louis; Vermunt, Jeroen K.; Wang, Chun
We evaluate the item response theory with covariates (IRT-C) procedure for assessing differential item functioning (DIF) without preknowledge of anchor items (Tay, Newman, & Vermunt, 2011). This procedure begins with a fully constrained baseline model, and candidate items are tested for uniform and/or nonuniform DIF using the Wald statistic.…
Holman, Rebecca; Berger, Martijn P. F.
Studied calibration designs that maximize the determinants of Fisher's information matrix on the item parameters for sets of polytomously scored items. Analyzed these items using a number of item response theory models. Results show that for the data and models used, a D-optimal calibration design for an answer or set of answers can reduce the…
Gierl, Mark J.; Lai, Hollis
Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…
Use of the Mantel-Haenszel procedure as a test for differential item functioning under the Rasch model of item-response theory is examined. Results of the procedure cannot be generalized to the class of items for which item-response functions are monotonic and local independence holds. (TJH)
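For readers unfamiliar with the Mantel-Haenszel statistic discussed above, the following Python sketch computes the common odds ratio across score strata and the ETS delta-scale transformation (MH D-DIF = -2.35 ln α). It is an illustration only, not the analysis in the abstract; the function names and the counts in `example_strata` are hypothetical, and each stratum is assumed to be a 2x2 table of reference/focal group by right/wrong for examinees matched on total score.

```python
import math

def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio over 2x2 tables.

    Each stratum is (ref_right, ref_wrong, focal_right, focal_wrong).
    """
    num = 0.0
    den = 0.0
    for ref_right, ref_wrong, focal_right, focal_wrong in strata:
        n = ref_right + ref_wrong + focal_right + focal_wrong
        if n == 0:
            continue  # skip empty score levels
        num += ref_right * focal_wrong / n
        den += ref_wrong * focal_right / n
    return num / den

def mh_d_dif(alpha_mh):
    """ETS delta-scale index; negative values indicate the item favors the reference group."""
    return -2.35 * math.log(alpha_mh)

# Hypothetical counts for two matched score levels.
example_strata = [(30, 10, 20, 20), (45, 15, 30, 30)]

alpha = mantel_haenszel_odds_ratio(example_strata)
print(alpha, mh_d_dif(alpha))
```

An odds ratio of 1 (D-DIF of 0) corresponds to no DIF on the studied item after matching.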
Ferrando, Pere J.; Lorenzo-Seva, Urbano
This article describes a general item response theory model for personality items that allows the information provided by the item response times to be used to estimate the individual trait levels. The submodel describing the item response times is a modification of Thissen's log-linear model and is based on the distance-difficulty hypothesis in…
Posserud, Britt; Lundervold, Astri J; Steijnen, Maaike C; Verhoeven, Sophie; Stormark, Kjell Morten; Gillberg, Christopher
The present study investigated the factor structure of parent and teacher Autism Spectrum Screening Questionnaire (ASSQ) in a population of 7- to 9-year-old children. For validation purposes, factors derived were correlated with results on the Strengths and Difficulties Questionnaire (SDQ). A three-factor solution was identified on both parent and teacher ASSQ. Most of the variance was explained by one factor including measures of social function, validated by a high correlation with the SDQ peer problems scale. The second factor included measures of autism-associated problems. The items allocated to the third factor were more specific for a cognitive style typically found in high-functioning individuals with autism/Asperger syndrome. This factor did not correlate highly with any of the SDQ subscales. The results indicated that the screening efficiency of ASSQ could be increased by closer examination of the individual profile of factor scores.
Stocco, Corey S; Thompson, Rachel H; Rodriguez, Nicole M
Restricted and repetitive behavior (RRB) is more pervasive, prevalent, frequent, and severe in individuals with autism spectrum disorders (ASDs) than in their typical peers. One subtype of RRB is restricted interests in items or activities, which is evident in the manner in which individuals engage with items (e.g., repetitious wheel spinning), the types of items or activities they select (e.g., preoccupation with a phone book), or the range of items or activities they select (i.e., narrow range of items). We sought to describe the relation between restricted interests and teacher presentation of items. Overall, we observed 5 teachers interacting with 2 pairs of students diagnosed with an ASD. Each pair included 1 student with restricted interests. During these observations, teachers were free to present any items from an array of 4 stimuli selected by experimenters. We recorded student responses to teacher presentation of items and analyzed the data to determine the relation between teacher presentation of items and the consequences for presentation provided by the students. Teacher presentation of items corresponded with differential responses provided by students with ASD, and those with restricted preferences experienced a narrower array of items. PMID:21941381
Ziemssen, Tjalf; Phillips, Glenn; Shah, Ruchit; Mathias, Adam; Foley, Catherine; Coon, Cheryl; Sen, Rohini; Lee, Andrew; Agarwal, Sonalee
The Early Mobility Impairment Questionnaire (EMIQ) was developed to facilitate early identification of mobility impairments in multiple sclerosis (MS) patients. We describe the initial development of the EMIQ with a focus on the psychometric evaluation of the questionnaire using classical and item response theory methods. The initial 20-item EMIQ was constructed by clinical specialists and qualitatively tested among people with MS and physicians via cognitive interviews. Data from an observational study were used to make additional updates to the instrument based on exploratory factor analysis (EFA) and item response theory (IRT) analysis, and psychometric analyses were performed to evaluate the reliability and validity of the final instrument's scores and screening properties (i.e., sensitivity and specificity). Based on qualitative interview analyses, a revised 15-item EMIQ was included in the observational study. EFA, IRT and item-to-item correlation analyses revealed redundant items, which were removed, leading to the final nine-item EMIQ. The nine-item EMIQ performed well with respect to: test-retest reliability (ICC = 0.858); internal consistency (α = 0.893); convergent validity; and known-groups methods for construct validity. A cut-point of 41 on the 0-to-100 scale resulted in sufficient sensitivity and specificity statistics for viably identifying patients with mobility impairment. The EMIQ is a content-valid and psychometrically sound instrument for capturing MS patients' experience with mobility impairments in a clinical practice setting. Additional research is suggested to further confirm the EMIQ's screening properties over time.
Eaves, Linda C; Wingert, Heather D; Ho, Helena H; Mickelson, Elizabeth C R
The Social Communication Questionnaire (SCQ) is a parent report screening measure for autism spectrum disorders (ASDs) based on the Autism Diagnostic Interview-Revised (ADI-R). To examine its validity in a young sample, the SCQ was given to parents of 151 children at a mean age of 5 years, before assessment in tertiary autism or preschool clinics. Overall sensitivity was .71, the same for both clinics, but specificity was better for the preschool clinic (.62) than for the autism clinic (.53) reflecting fewer false-positives in the former. The "hit rate" was 65% with 28% of the children with autism missed by the SCQ at a cutoff score of 15 (false-negatives) and 38% of the nonautistic misidentified as having an ASD (false-positives). Item validity analysis, contrary to what was previously published, indicated that only 15 or 46% of the items distinguished between children with and without ASD in this much younger sample. False-negatives were somewhat higher functioning. The SCQ would seem to be a useful tool for identifying young children in need of further assessment and assisting in routing them to the appropriate clinic, especially if used in conjunction with a screening by a community professional. There remain questions about the "best" cutoff score to use and whether a shorter version, based on the items that distinguished autistic from nonautistic, would be more reliable and valid with younger children. Furthermore, it may be that an adjusted score is required when parents omit items or with nonverbal children who cannot be scored on some of the items.
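The screening statistics reported above (sensitivity, specificity, and "hit rate") can be computed from a 2x2 classification table, as in the following Python sketch. The counts are hypothetical and are not the study's data, and treating the "hit rate" as overall classification accuracy is an assumption about how the term is used in the abstract.

```python
def screening_stats(true_pos, false_neg, true_neg, false_pos):
    """Return (sensitivity, specificity, hit_rate) from screening counts.

    hit_rate is taken here to mean overall accuracy (an assumption).
    """
    sensitivity = true_pos / (true_pos + false_neg)   # cases correctly flagged
    specificity = true_neg / (true_neg + false_pos)   # non-cases correctly cleared
    total = true_pos + false_neg + true_neg + false_pos
    hit_rate = (true_pos + true_neg) / total
    return sensitivity, specificity, hit_rate

# Hypothetical counts at a single cutoff score.
sens, spec, hits = screening_stats(true_pos=40, false_neg=10, true_neg=30, false_pos=20)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} hit rate={hits:.2f}")
```

The false-negative rate is 1 minus sensitivity and the false-positive rate is 1 minus specificity, so raising or lowering the cutoff score trades one against the other.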
Atkinson, Nancy L.
Objectives: To design a valid and reliable questionnaire to assess perceived attributes of technology-based health education innovations. Methods: College students in 12 personal health courses reviewed a prototype eHealth intervention using a 30-item instrument based upon diffusion theory's perceived attributes of an innovation. Results:…
Kersten, Paula; Czuba, Karol; McPherson, Kathryn; Dudley, Margaret; Elder, Hinemoa; Tauroa, Robyn; Vandal, Alain
This article synthesized evidence for the validity and reliability of the Strengths and Difficulties Questionnaire in children aged 3-5 years. A systematic review using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses statement guidelines was carried out. Study quality was rated using the Consensus-based Standards for the…
Pestle, Sarah L.; Chorpita, Bruce F.; Schiffman, Jason
The Penn State Worry Questionnaire for Children (PSWQ-C; Chorpita, Tracey, Brown, Collica, & Barlow, 1997) is a 14-item self-report measure of worry in children and adolescents. Although the PSWQ-C has demonstrated favorable psychometric properties in small clinical and large community samples, this study represents the first psychometric…
Aitken, Madison; Martinussen, Rhonda; Wolfe, Richard G.; Tannock, Rosemary
The Strengths and Difficulties Questionnaire (SDQ) is a 25-item screening measure for emotional and behavioral problems in children and adolescents aged 4 to 16. Structural equation modeling was used to test the five-factor structure of teacher and parent ratings on the British version of the SDQ in a community sample of 501 Canadian children aged…
Hertzsprung, Emerenciana A.; Konnert, Candace; Brinker, Jaylene
This paper describes a new measure, the Worry Questionnaire for Nursing Home Residents (WQNHR), designed to assess the frequency of specific worries. It was administered to 67 residents. Psychometric evaluation showed an average item-to-total correlation of 0.20 (range = 0.15 to 0.58), an internal consistency estimate of 0.79, and a test-retest…
An estimated 20% of all adolescents will experience a depressive disorder by the age of 18, with schools being at the forefront of initiatives to promote resilience and well-being. This study reports on the development of the 24-item Student Perception of Wellbeing Questionnaire (SPWQ), created as a measure of well-being in three areas: exercise,…
Fastame, Maria Chiara; Cherchi, Rossella; Penna, Maria Pietronilla
The current research was aimed mainly at exploring the reliability of a short-screening tool developed to self-evaluate visuospatial abilities in children. We presented 290 Italian third, fourth, and fifth graders with the 16-item Shortened Visuospatial questionnaire and several objective measures of intellectual efficiency, such as Raven's…
Willcutt, Erik G.; Boada, Richard; Riddle, Margaret W.; Chhabildas, Nomita; DeFries, John C.; Pennington, Bruce F.
This study evaluated the internal structure and convergent and discriminant evidence for the Colorado Learning Difficulties Questionnaire (CLDQ), a 20-item parent-report rating scale that was developed to provide a brief screening measure for learning difficulties. CLDQ ratings were obtained from parents of children in 2 large community samples…
LeBold, William K.; And Others
The Purdue Interest Questionnaire (PIQ), a 264-item Likert-type scale, was developed to assist engineering students in their career planning. The six engineering scales identify specialized fields: aeronautical, chemical, civil, electrical, industrial, or mechanical. For students planning to transfer out of engineering, four scales identify…
Copeland, Valire Carr; Koeske, Gary; Greeno, Catherine G.
This study used the Client Satisfaction Questionnaire (CSQ-8) to examine the level of consumer satisfaction with children's (ages 8 to 17 years) outpatient mental health services. Analyses were completed using both individual satisfaction items and a summed scale score. The CSQ scale had satisfactory internal consistency reliability for both…
Balsam, Kimberly F.; Beadnell, Blair; Molina, Yamile
The authors conducted a three-phase, mixed-methods study to develop a self-report measure assessing the unique aspects of minority stress for lesbian, gay, bisexual, and transgender adults. The Daily Heterosexist Experiences Questionnaire has 50 items and nine subscales with acceptable internal reliability, and construct and concurrent validity. Mean sexual orientation and gender differences were found. PMID:24058262
Dickey, Wayne C.; Blumberg, Stephen J.
Objective: The Strengths and Difficulties Questionnaire is a 25-item instrument developed to assess emotional and behavioral problems. The current study attempted to replicate previous European structural analyses and to describe the latent dimensions that underlie responses to the parent-reported version of the Strengths and Difficulties…
Elgar, Frank J.; Waschbusch, Daniel A.; Dadds, Mark R.; Sigvaldason, Nadine
Brief assessments of parenting practices can provide important information about the development of disruptive behavior disorders in children. We examined the factor structure of a widely used assessment of parenting practices, the Alabama Parenting Questionnaire, and produced a 9-item short scale around its three supported factors: Positive…
George, Darren; Dixon, Sinikka; Stansal, Emory; Gelb, Shannon Lund; Pheri, Tabitha
Objective and Participants: A sample of 231 students attending a private liberal arts university in central Alberta, Canada, completed a 5-day time diary and a 71-item questionnaire assessing the influence of personal, cognitive, and attitudinal factors on success. Methods: The authors used 3 success measures: cumulative grade point average (GPA),…
Collier, Margo L.; Griffin, Megan M.; Wei, Yonghua
This article describes the pilot study of an informal assessment, the "Student Transition Questionnaire" (STQ). The STQ is a 38-item assessment designed to elicit student perspectives on transition-related topics. In this mixed-methods study, we piloted the STQ with 186 participants, and then conducted focus groups with various…
Marmaroti, Panagiota; Galanopoulou, Dia
In this study, a closed-ended questionnaire examining all aspects of photosynthesis simultaneously has been developed and administered to 290 Greek pupils aged 13. It contains complementary or logically related items that permitted us to assess the understanding of each aspect by carrying out cross-analysis. The main findings are: that pupils are…
Clark, Sheldon B.; Boser, Judith A.
A context in which existing items may provide a convenient source of questions for questionnaires was explored through a case study making use of existing comparison groups. Two programs at Oak Ridge Associated Universities (ORAU), the Science and Engineering Research Semester (SERS) and the Laboratory Graduate Research Participation (Lab Grad)…
Quirk, Matthew; Unrau, Norman; Ragusa, Gisele; Rueda, Robert; Lim, Hyo; Velasco, Alejandra; Fujii, Kayoko; Bowers, Erica; Nemerouf, Ann; Loera, Gustavo
This study examined teachers' beliefs about motivating students to read through the development of a new survey questionnaire. The current investigation reports on initial tests of the scale's reliability and validity. The items for this measure were developed from an engagement perspective to reflect the motivational constructs represented in an…
Hornsveld, Ruud H. J.; Muris, Peter; Kraaimaat, Floris W.; Meesters, Cor
The psychometric properties of a Dutch version of Buss and Perry's Aggression Questionnaire (AQ) were examined in a sample of violent forensic psychiatric inpatients and outpatients and a sample of secondary vocational students. The internal consistency, interitem correlations, and item--scale correlations of the subscales Physical Aggression,…
The purpose of the current study was to evaluate the psychometric validity of a Spanish translated version of a family involvement questionnaire (the FELP) using a mixed-methods design. Thus, statistical analyses (i.e., factor analysis, reliability analysis, and item analysis) and qualitative analyses (i.e., focus group data) were assessed.…
Zijlstra, Wobbe P.; van der Ark, L. Andries; Sijtsma, Klaas
Outliers in questionnaire data are unusual observations, which may bias statistical results, and outlier statistics may be used to detect such outliers. The authors investigated the effect outliers have on the specificity and the sensitivity of each of six different outlier statistics. The Mahalanobis distance and the item-pair based outlier…
Renshaw, Tyler L.; Long, Anna C. J.; Cook, Clayton R.
This study reports on the initial development and validation of the Student Subjective Wellbeing Questionnaire (SSWQ) with a sample of 1,002 students in Grades 6-8. The SSWQ is a 16-item self-report instrument for assessing youths' subjective wellbeing at school, which is operationalized via 4 subscales measuring school connectedness, academic…
Krotseng, Marsha V.
Using discriminant analysis, a study examined (1) the extent to which the new Student Adjustment to College Questionnaire (SACQ) accurately predicts student departure for a private, comprehensive university (n=1,978 students); (2) SACQ items distinguishing nonpersisters; (3) use with an incoming class; and (4) evidence linking the SACQ with…
Emmanouilidou, Kyriaki; Derri, Vassiliki; Aggelousis, Nicolaos; Vassiliadou, Olga
The purpose of this pilot study was to develop and evaluate an instrument for measuring Greek elementary physical educators' knowledge of student assessment. A multiple-choice questionnaire comprised of items about concepts, methods, tools, and types of student assessment in physical education was designed and tested. The initial 35-item…
van Ginkel, Joost R.; van der Ark, L. Andries
A well-known problem in the analysis of test and questionnaire data is that some item scores may be missing. Advanced methods for the imputation of missing data are available, such as multiple imputation under the multivariate normal model and imputation under the saturated logistic model (Schafer, 1997). Accompanying software was made available…
Gable, Robert K.; And Others
The development of the Parent Attitudes toward School Effectiveness (PATSE) questionnaire was conducted in two phases. The pilot test form contained 47 items reflecting parents' attitudes toward 6 categories: (1) school and community relationships; (2) clear school mission; (3) high expectations; (4) safe and orderly environment; (5) instructional…
Simonds, John F.; Simonds, M. Patricia
Mothers of 182 nursery school children completed the Behavior Style Questionnaire (BSQ) and the Child Personality Scale (CPS). Intercorrelational analyses showed many significantly correlated items. Scores of the five CPS factors clearly distinguished between subjects in easy and difficult BSQ clusters. Found boys significantly more introverted…
Johnson, Laurel L; Bradley, Susan J; Birkenfeld-Adams, Andrea S; Kuksis, Myra A Radzins; Maing, Dianne M; Mitchell, Janet N; Zucker, Kenneth J
This paper reports on the psychometric properties of a 16-item parent-report Gender Identity Questionnaire, originally developed by P. H. Elizabeth and R. Green (1984), to aid in the assessment of children with potential problems in their gender identity development. The questionnaire, which covered aspects of the core phenomenology of gender identity disorder (GID), was completed by parents of gender-referred children (N = 325) and controls (siblings, clinic-referred, and nonreferred; N = 504), who ranged in age from 2.5-12 years (mean age, 7.6 years). Factor analysis indicated that a one-factor solution, containing 14 of the 16 items with factor loadings ≥ .30, best fit the data, accounting for 43.7% of the variance. The gender-referred children had a significantly more deviant total score than did the controls, with a large effect size of 3.70. The GIQ total score had negligible age effects, indicating that the questionnaire has utility for assessing change over time. The gender-referred children who met the complete DSM criteria for GID had a significantly more deviant total score than did the children who were subthreshold for GID, although the latter group had a mean score that was closer to the threshold cases than to the controls. With a specificity rate set at 95% for the controls, the sensitivity rate for the probands was 86.8%. It is concluded that this parent-report gender identity questionnaire has excellent psychometric properties and can serve as a useful screening device for front-line clinicians, for whom more extensive, expensive, and time-consuming assessment procedures may be precluded.
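The abstract above reports sensitivity at a specificity fixed at 95% for the controls. A minimal sketch of that calculation follows; the function name, the toy scores, and the convention that a higher score is more deviant are illustrative assumptions, not details taken from the study.

```python
def sensitivity_at_specificity(controls, cases, target=0.95):
    """Pick the lowest cutoff whose specificity among controls meets `target`,
    then report the share of cases scoring above it.

    Assumption for illustration only: higher score = more deviant.
    """
    for cutoff in sorted(set(controls)):
        # Specificity: fraction of controls at or below the cutoff.
        spec = sum(c <= cutoff for c in controls) / len(controls)
        if spec >= target:
            break
    # Sensitivity: fraction of cases flagged (strictly above the cutoff).
    sens = sum(s > cutoff for s in cases) / len(cases)
    return cutoff, spec, sens


# Hypothetical toy data, not from the study.
controls = list(range(1, 21))
cases = [10, 20, 25, 30]
cutoff, spec, sens = sensitivity_at_specificity(controls, cases)
```

Fixing specificity first and reading off sensitivity second mirrors a screening use case in which false positives among controls are capped in advance.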
de Beurs, Derek P; Terluin, Berend; Verhaak, Peter F
Background Efficient screening questionnaires are useful in general practice. Computerized adaptive testing (CAT) is a method to improve the efficiency of questionnaires, as only the items that are particularly informative for a certain responder are dynamically selected. Objective The objective of this study was to test whether CAT could improve the efficiency of the Four-Dimensional Symptom Questionnaire (4DSQ), a frequently used self-report questionnaire designed to assess common psychosocial problems in general practice. Methods A simulation study was conducted using a sample of Dutch patients visiting a general practitioner (GP) with psychological problems (n=379). Responders completed a paper-and-pencil version of the 50-item 4DSQ and a psychometric evaluation was performed to check if the data agreed with item response theory (IRT) assumptions. Next, a CAT simulation was performed for each of the four 4DSQ scales (distress, depression, anxiety, and somatization), based on the given responses as if they had been collected through CAT. The following two stopping rules were applied for the administration of items: (1) stop if measurement precision is below a predefined level, or (2) stop if more than half of the items of the subscale are administered. Results In general, the items of each of the four scales agreed with IRT assumptions. Application of the first stopping rule reduced the length of the questionnaire by 38% (from 50 to 31 items on average). When the second stopping rule was also applied, the total number of items could be reduced by 56% (from 50 to 22 items on average). Conclusions CAT seems useful for improving the efficiency of the 4DSQ by 56% without losing a considerable amount of measurement precision. The CAT version of the 4DSQ may be useful as part of an online assessment to investigate the severity of mental health problems of patients visiting a GP. This simulation study is the first step needed for the development of a CAT version of the 4
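The simulation described above replays stored responses as if they had been collected adaptively and applies two stopping rules. The following is a rough sketch of that idea, not the authors' implementation. It assumes a two-parameter logistic model for dichotomous items, an expected a posteriori (EAP) estimate on a fixed grid, maximum-information item selection, and a reading of rule 1 as "stop once the posterior standard error falls below a target". All names and parameter values are hypothetical.

```python
import math

GRID = [g / 10.0 for g in range(-40, 41)]  # theta grid from -4 to 4


def prob(theta, a, b):
    """2PL probability of endorsing an item (assumed model)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def eap(administered):
    """Posterior mean and SD of theta under a standard normal prior."""
    weights = []
    for t in GRID:
        w = math.exp(-0.5 * t * t)
        for a, b, x in administered:
            p = prob(t, a, b)
            w *= p if x == 1 else 1.0 - p
        weights.append(w)
    total = sum(weights)
    mean = sum(t * w for t, w in zip(GRID, weights)) / total
    var = sum((t - mean) ** 2 * w for t, w in zip(GRID, weights)) / total
    return mean, math.sqrt(var)


def simulate_cat(items, responses, se_target=0.3, max_fraction=0.5):
    """Replay stored 0/1 responses adaptively.

    items: list of (a, b) pairs; responses: 0/1 answers in the same order.
    Returns (theta estimate, standard error, number of items used).
    """
    remaining = set(range(len(items)))
    administered = []
    theta, se = 0.0, 1.0
    while remaining:
        # Select the unused item with maximum Fisher information at theta.
        def info(j):
            a, b = items[j]
            p = prob(theta, a, b)
            return a * a * p * (1.0 - p)

        j = max(sorted(remaining), key=info)
        remaining.remove(j)
        administered.append((items[j][0], items[j][1], responses[j]))
        theta, se = eap(administered)
        if se < se_target:  # rule 1: precision target reached
            break
        if len(administered) > max_fraction * len(items):  # rule 2: over half used
            break
    return theta, se, len(administered)
```

With ten items and a precision target that can never be met, the second rule stops the replay after the sixth item, the first point at which more than half of the items have been administered.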
Chevat, Catherine; Viala-Danten, Muriel; Dias-Barbosa, Carla; Nguyen, Van Hung
Background Influenza is among the most common infectious diseases. The main protection against influenza is vaccination. A self-administered questionnaire was developed and validated for use in clinical trials to assess subjects' perception and acceptance of influenza vaccination and its subsequent injection site reactions (ISR). Methods The VAPI questionnaire was developed based on interviews with vaccinees. The initial version was administered to subjects in international clinical trials comparing intradermal with intramuscular influenza vaccination. Item reduction and scale construction were carried out using principal component and multitrait analyses (n = 549). Psychometric validation of the final version was conducted per country (n = 5,543) and included construct and clinical validity and internal consistency reliability. All subjects gave their written informed consent before being interviewed or included in the clinical studies. Results The final questionnaire comprised 4 dimensions ("bother from ISR"; "arm movement"; "sleep"; "acceptability") grouping 16 items, and 5 individual items (anxiety before vaccination; bother from pain during vaccination; satisfaction with injection system; willingness to be vaccinated next year; anxiety about vaccination next year). Construct validity was confirmed for all scales in most of the countries. Internal consistency reliability was good for all versions (Cronbach's alpha ranging from 0.68 to 0.94), as was clinical validity: scores were positively correlated with the severity of ISR and pain. Conclusion The VAPI questionnaire is a valid and reliable tool, assessing the acceptance of vaccine injection and reactions following vaccination. Trial registration NCT00258934, NCT00383526, NCT00383539. PMID:19261173
Toderi, Stefano; Sarchielli, Guido
The development of supervisors’ behaviours has been proposed as an innovative approach for the reduction of employees’ work stress. The UK Health and Safety Executive (HSE) developed the “Stress Management Competency Indicator Tool” (SMCIT), designed to be used within a learning and development intervention. However, its psychometric properties have never been evaluated, and the length of the questionnaire (66 items) limits its practical applicability. We developed a brief 36-item version of the questionnaire, assessed its psychometric properties and studied the relationship with the employees’ psychosocial work environment. 353 employees filled in the brief SMCIT and the “Stress Management Indicator Tool”. The latter is a self-report questionnaire developed by the UK HSE, measuring workers’ perceptions of seven dimensions of the psychosocial work environment that if not properly managed can lead to harm. Data were analysed with structural equation modelling and multiple regressions. The results confirmed the factorial structure of the brief SMCIT questionnaire and mainly supported the convergent validity and internal consistency of the scales. Furthermore, with few exceptions, the relations hypothesized between supervisors’ competencies and the psychosocial work environment were confirmed, supporting the criterion validity of the revised questionnaire and the UK HSE framework. We conclude that the brief 36-item version of the SMCIT represents an important step toward the development of interventions directed at supervisors and we discuss the practical implications for work stress prevention. PMID:27827940
Abbott, J.A.; Waddoups, I.G.
This report responds to the Department of Energy's request that Sandia National Laboratories compare existing technologies against several advanced technologies as they apply to DOE needs to monitor the movement of material, weapons, or personnel for safety and security programs. The authors describe several material control systems, discuss their technologies, suggest possible applications, discuss assets and limitations, and project costs for each system. The following systems are described: WATCH system (Wireless Alarm Transmission of Container Handling); Tag system (an electrostatic proximity sensor); PANTRAK system (Personnel And Material Tracking); VRIS (Vault Remote Inventory System); VSIS (Vault Safety and Inventory System); AIMS (Authenticated Item Monitoring System); EIVS (Experimental Inventory Verification System); Metrox system (canister monitoring system); TCATS (Target Cueing And Tracking System); LGVSS (Light Grid Vault Surveillance System); CSS (Container Safeguards System); SAMMS (Security Alarm and Material Monitoring System); FOIDS (Fiber Optic Intelligence & Detection System); GRADS (Graded Radiation Detection System); and PINPAL (Physical Inventory Pallet).
Kröz, M; Feder, G; von Laue, HB; Zerm, R; Reif, M; Girke, M; Matthes, H; Gutenbrunner, C; Heckmann, C
Background To broaden the range of outcomes that we can measure for patients undergoing treatment for oncological and other chronic conditions, we aimed to validate a questionnaire measuring self-reported autonomic regulation (aR), i.e. to characterise a subject's autonomic functioning by questions on sleeping and waking, vertigo, morningness-eveningness, thermoregulation, perspiration, bowel movements and digestion. Methods We administered the questionnaire to 440 participants (♀: N = 316, ♂: N = 124): 95 patients with breast cancer, 49 with colorectal cancer, 60 with diabetes mellitus, 39 with coronary heart disease, 28 with rheumatological conditions, 32 with Hashimoto's disease, 22 with multiple morbidities and 115 healthy people. We administered the questionnaire a second time to 50.2% of the participants. External convergence criteria included the German version of the Hospital Anxiety and Depression Scale (HADS-D), a short questionnaire on morningness-eveningness, the Herdecke Quality of Life Questionnaire (HLQ) and a short version questionnaire on self-regulation. Results A principal component analysis yielded a three-dimensional 18-item inventory of aR. The subscales orthostatic-circulatory, rest/activity and digestive regulation had internal consistency (Cronbach-α: rα = 0.65 – 0.75) and test-retest reliability (rrt = 0.70 – 0.85). AR was negatively associated with anxiety, depression, and dysmenorrhoea but positively correlated to HLQ, self-regulation and in part to morningness (except digestive aR) (0.49 – 0.13, all p < 0.05). Conclusion An internal validation of the long-version scale of aR yielded consistent relationships with health versus illness, quality of life and personality. Further studies are required to clarify the issues of external validity, clinical and physiological relevance. PMID:18533043
Cho, Seonghee; Drasgow, Fritz; Cao, Mengyang
This study investigated the psychometric properties of 3 frequently administered emotional intelligence (EI) scales (Wong and Law Emotional Intelligence Scale [WLEIS], Schutte Self-Report Emotional Intelligence Test [SEIT], and Trait Emotional Intelligence Questionnaire [TEIQue]), which were developed on the basis of different theoretical frameworks (i.e., ability EI and mixed EI). By conducting item response theory (IRT) analyses, the authors examined the item parameters and compared the fits of 2 response process models (i.e., dominance model and ideal point model) for these scales with data from a sample of 355 undergraduates recruited from the subject pool. Several important findings were obtained. First, the EI scales seem better able to differentiate individuals at low trait levels than at high trait levels. Second, a dominance model showed better model fit to the self-report ability EI scale (WLEIS) and also fit better with most subfactors of the SEIT, except for the mood regulation/optimism factor. Both dominance and ideal point models fit a self-report mixed EI scale (TEIQue). Our findings suggest (a) the EI scales should be revised to include more items at moderate and higher trait levels; and (b) the nature of the EI construct should be considered during the process of scale development.
Rosecrance, John C; Ketchen, Kelly J; Merlino, Linda A; Anton, Dan C; Cook, Tom M
The purpose of this study was to investigate the test-retest reliability of questionnaire items related to musculoskeletal symptoms and the reliability of specific job factors. The types of questionnaire items described in the present study have been used by several investigators to assess symptoms of musculoskeletal disorders and problematic job factors among workers from a variety of occupations. Employees at a plastics molding facility were asked to complete an initial symptom and job factors questionnaire and then complete an identical questionnaire either two or four weeks later. Of the 216 employees participating in the initial round, 99 (45.8%) agreed to participate in the retest portion of the study. The kappa coefficient was used to determine repeatability for categorical outcomes. The majority of the kappa coefficients for the 58 questionnaire items were above 0.50 but ranged between 0.13 and 1.00. The section of the questionnaire having the highest kappa coefficients was the section related to hand symptoms. Interval lengths of two and four weeks between the initial test and retest were found to be equally sufficient in terms of reliability. The results indicated that the symptom and job factors questionnaire is reliable for use in epidemiologic studies. Like all measurement instruments, the reliability of musculoskeletal questionnaires must be established before drawing conclusions from studies that employ the instrument.
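The kappa coefficient used above corrects raw test-retest agreement for the agreement expected by chance. A minimal sketch for one categorical item follows, assuming the unweighted (Cohen's) form; the abstract does not say which variant was used, and the function name and toy answers are hypothetical.

```python
from collections import Counter


def cohen_kappa(test, retest):
    """Unweighted Cohen's kappa for paired categorical answers."""
    n = len(test)
    # Observed proportion of respondents giving the same answer twice.
    observed = sum(a == b for a, b in zip(test, retest)) / n
    # Chance agreement from the marginal category frequencies.
    first, second = Counter(test), Counter(retest)
    expected = sum(first[c] * second[c] for c in set(first) | set(second)) / (n * n)
    # Undefined (division by zero) if everyone gives one identical answer both times.
    return (observed - expected) / (1.0 - expected)


# Hypothetical answers from four workers to one symptom item.
kappa = cohen_kappa(["yes", "yes", "no", "no"], ["yes", "no", "no", "no"])
```

In this toy case three of four answers agree (0.75 observed) while chance agreement is 0.50, so kappa is 0.50, well below the raw agreement rate.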
The aim of this study was to design a concise, focused questionnaire to measure individuals' perceptions of the impact of their renal condition on their quality of life, taking account of the importance of life domains relevant for the individual. The design of the renal-dependent quality of life (RDQoL) questionnaire was based on that of the Audit of Diabetes Dependent Quality of Life (ADDQoL) diabetes-specific individualized quality of life questionnaire, which was influenced by patient-centered principles underlying the interview method of McGee et al. The questionnaires specify life domains, and the respondents rate personally applicable domains for the importance and impact of the renal condition. Observation in eight U.K. renal clinics, together with 40 in-depth interviews with peritoneal dialysis, hemodialysis, and transplant patients, provided the basis for item selection for the RDQoL. The results of the study were as follows: each of the 13 ADDQoL items was relevant and important for renal patients. Additional suggestions for items included physical appearance, dependency, freedom, restrictions of fluid intake, and societal prejudice. In conclusion, unlike other quality of life measures, the RDQoL is an individualized questionnaire measure of the impact of renal disease and its treatment on quality of life. Face and content validity is established for adult renal patients, and the RDQoL is being further evaluated for research and clinical use.
Hendriks, Erik JM; Bernards, Arnold TM; Staal, J Bart; de Vet, Henrica CW; de Bie, Rob A
Background To investigate the factor structure, dimensionality and construct validity of the (5-item) PRAFAB questionnaire score in women with stress urinary incontinence (stress UI). Methods A cross validation study design was used in a cohort of 279 patients who were randomly divided into Sample A or B. Sample A was used for preliminary exploratory factor analyses with promax rotation. Sample B provided an independent sample for confirming the premeditated and proposed factor structure and item retention. Internal consistency, item-total and subscale correlations were determined to assess the dimensionality. Construct validity was assessed by comparing factor-based scale means by clinical characteristics based on known relationships. Results Factor analyses resulted in a two-factor structure or subscales: items related to 'leakage severity' (protection, amount and frequency) and items related to its 'perceived symptom impact' or consequences of stress UI on the patient's life (adjustment and body (or self) image). The patterns of the factor loadings were nearly identical for both study samples. The two constructed subscales demonstrated adequate internal consistency, with Cronbach's alphas of 0.78 and 0.84, respectively. Scale scores differed by clinical characteristics according to the expectations and supported the construct validity of the scales. Conclusion The findings suggest a two-factorial structure of the PRAFAB questionnaire. Furthermore, the results confirmed the internal consistency and construct validity as demonstrated in our previous study. The best description of the factorial structure of the PRAFAB questionnaire was given by a two-factor solution, measuring the stress UI leakage severity items and the perceived symptom impact items. Future research will be necessary to replicate these findings in different settings, in other types of UI, and in non-white women and men. PMID:18218110
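Several abstracts in this listing, including the one above, summarise internal consistency with Cronbach's alpha. A minimal sketch of the standard formula follows, using population variances throughout; the function name and the toy ratings are illustrative assumptions.

```python
from statistics import pvariance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondent rows of item scores."""
    k = len(rows[0])  # number of items in the subscale
    item_vars = [pvariance([row[j] for row in rows]) for j in range(k)]
    total_var = pvariance([sum(row) for row in rows])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Hypothetical two-item subscale answered by three respondents.
alpha = cronbach_alpha([[1, 2], [2, 3], [3, 4]])
```

Here the two items move in lockstep across respondents, so alpha reaches its maximum of 1.0; real subscales, such as those reported at 0.78 and 0.84 above, fall below that.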
van Velzen, Joke H
The development of a questionnaire to assess students' use of self-reflective thinking in the classroom is described. On the basis of a literature search, items were selected. The items are students' self-report measures and open-ended questions. The participants were 96 fourth grade secondary vocational students from six classes in The Netherlands, all of whom were used to learning in cooperative groups. Complementary data were selected to validate this questionnaire. Visual inspection of the frequencies indicated a difference between levels of students' self-reflective thinking. Between-subjects t tests showed that students' motivational engagement and marks could be used to validate the measure of self-reflective thinking. The implications of using the questionnaire to assess students' self-reflective thinking within the classroom are discussed.
Valdivia, Ivan; Stewart, Sherry H
Although the expectancies component of the Comprehensive Effects of Alcohol Questionnaire has previously been shown to be factorially valid, the factor structure of its valuations component has not previously been examined. The aims of this paper were: (i) to replicate the factor structure of the expectancies items; (ii) to explore the factor structure of the valuations items; and (iii) to investigate the utility of using the Comprehensive Effects of Alcohol Questionnaire to predict drinking behavior. The questionnaire was administered to 1004 university students along with measures of quantity and frequency of alcohol consumption. Fromme, Stroot, and Kaplan's (1993) factor structure of the expectancies scales was replicated. The factor structures of the negative valuations scales were characterized by 2 rather than 3 factors. Negative expectancies improved upon the prediction of drinking quantity and frequency over-and-above positive expectancies, and valuations further improved prediction over-and-above expectancies. Theoretical and clinical implications are discussed.
Ding, Kele; Olds, R. Scott; Thombs, Dennis L.
This retrospective case study assessed the influence of item non-response error on subsequent response to questionnaire items assessing adolescent alcohol and marijuana use. Post-hoc analyses were conducted on survey results obtained from 4,371 7th to 12th grade students in Ohio in 2005. A skip pattern design in a conventional questionnaire…
Gomes, Raquel Regina de Freitas Magalhães; Batista, José Rodrigues; Ceccato, Maria das Graças Braga; Kerr, Lígia Regina Franco Sansigolo; Guimarães, Mark Drew Crosland
OBJECTIVE To evaluate the level of HIV/AIDS knowledge among men who have sex with men in Brazil using the latent trait model estimated by Item Response Theory. METHODS Multicenter, cross-sectional study, carried out in ten Brazilian cities between 2008 and 2009. Adult men who have sex with men were recruited (n = 3,746) through Respondent Driven Sampling. HIV/AIDS knowledge was ascertained through ten statements by face-to-face interview and latent scores were obtained through two-parameter logistic modeling (difficulty and discrimination) using Item Response Theory. Differential item functioning was used to examine each item characteristic curve by age and schooling. RESULTS Overall, the HIV/AIDS knowledge scores using Item Response Theory did not exceed 6.0 (scale 0-10), with mean and median values of 5.0 (SD = 0.9) and 5.3, respectively, with 40.7% of the sample with knowledge levels below the average. Some beliefs still exist in this population regarding the transmission of the virus by insect bites, by using public restrooms, and by sharing utensils during meals. With regard to the difficulty and discrimination parameters, eight items were located below the mean of the scale and were considered very easy, and four items presented a very low discrimination parameter (< 0.34). The absence of difficult items contributed to the inaccuracy of the measurement of knowledge among those with median level and above. CONCLUSIONS Item Response Theory analysis, which focuses on the individual properties of each item, allows measures to be obtained that do not vary or depend on the questionnaire, which provides better ascertainment and accuracy of knowledge scores. Valid and reliable scales are essential for monitoring HIV/AIDS knowledge among the men who have sex with men population over time and in different geographic regions, and this psychometric model brings this advantage. PMID:24897041
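The two-parameter logistic model named above gives each item a difficulty and a discrimination parameter. The sketch below shows the item characteristic curve and the corresponding Fisher information under the usual logistic parameterisation without a scaling constant; the function names and parameter values are illustrative, not estimates from the study.

```python
import math


def icc_2pl(theta, a, b):
    """Probability of a correct answer at trait level theta.

    a: discrimination, b: difficulty (same scale as theta).
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information the item contributes at trait level theta."""
    p = icc_2pl(theta, a, b)
    return a * a * p * (1.0 - p)


# A very easy item (b = -2) is most informative near theta = -2 and tells
# little about respondents above the mean.
info_low = item_information(-2.0, 1.0, -2.0)
info_high = item_information(1.0, 1.0, -2.0)
```

This illustrates the abstract's point that a scale made up mostly of easy items measures respondents at median knowledge and above imprecisely, and that a low discrimination parameter flattens the information everywhere.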
Luchetti, Martina; Sutin, Angelina R
The Memory Experiences Questionnaire (MEQ) is a theoretically driven and empirically validated 63-item self-report scale designed to measure 10 phenomenological qualities of autobiographical memories: Vividness, Coherence, Accessibility, Time Perspective, Sensory Details, Visual Perspective, Emotional Intensity, Sharing, Distancing and Valence. To develop a short form of the MEQ to use when time is limited, participants from two samples (N = 719; N = 352) retrieved autobiographical memories, rated the phenomenological experience of each memory and completed several scales measuring psychological distress. For each MEQ dimension, the number of items was reduced by one-half based on item content and item-total correlations. Each short-form scale had acceptable internal consistency (median alpha = .79), and, similar to the long-form version of the scales, the new short scales correlated with psychological distress in theoretically meaningful ways. The new short form of the MEQ has psychometric properties similar to those of the original long form and can be used when time is limited.
Kolstad, R; Goaz, P; Kolstad, R
Multiple-choice items are frequently used in objective examinations. The format chosen should conform to the nature of the instruction. Knowledge about cumulative information, such as lists of attributes, can be tested efficiently by means of multiple-choice items that include a variable number of correct answers. In contrast to conventional, single-answer questions, nonrestricted multiple-choice items are capable of including more facts and fewer incorrect responses. In addition, the nonrestricted format is not burdened with the repetitious pattern of one correct answer coupled with several incorrect responses, a cue that may promote successful guessing. Item analyses can be performed on examinations that include both conventional and nonrestricted items. The reliability of one examination constructed totally with nonrestricted items was analyzed by means of the Kuder-Richardson Formula No. 20. The value 0.72 proved this examination to be both discriminating and consistent.
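The reliability figure above comes from Kuder-Richardson Formula 20. A minimal sketch follows, assuming every item is scored dichotomously (1 = credited, 0 = not); how the nonrestricted items were scored is not stated in the abstract, and the function name and toy data are hypothetical.

```python
def kr20(scores):
    """Kuder-Richardson Formula 20 for examinee rows of 0/1 item scores."""
    n = len(scores)  # number of examinees
    k = len(scores[0])  # number of items
    totals = [sum(row) for row in scores]
    mean_total = sum(totals) / n
    # Population variance of the total scores.
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    # Sum over items of p * q, where p is the proportion scoring 1.
    pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in scores) / n
        pq += p * (1.0 - p)
    return (k / (k - 1)) * (1.0 - pq / var_total)


# Hypothetical three-item examination taken by four examinees.
reliability = kr20([[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]])
```

For this toy matrix the total-score variance is 1.25 and the summed item variances are 0.625, giving a KR-20 of 0.75.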
Richter-Appelt, Hertha; Schimmelmann, Benno Graf; Tiefensee, Jutta
A positive parent-child relationship is one of the most important determinants of healthy cognitive, emotional and social development. The relationship from parent to child is determined by parenting styles. Parenting styles are characterised by the two dimensions of parental attitudes and rearing practices. The development and the psychometric properties of a questionnaire on parental attitudes and rearing practices (FEPS), which contains an extended version of the Parental Bonding Instrument by Parker et al. (PBI, 1979) and two scales on parental reinforcement and punishment behaviour, are presented. In a sample of 457 women and 159 men, factorial and item analyses revealed four scales (care, autonomy, low punishment and low material reinforcement). The care dimension contained items of immaterial reinforcement on the positive pole and items of coldness and ignorance as means of punishment on the negative pole. Based on findings from its first application in a clinical study it can be assumed that the FEPS differentiates between clinical and non-clinical populations. Additionally, varying patterns of the four scales may emerge as risk factors for the development of certain psychiatric/psychological problems.
Chauvin, Bruno; Leonova, Tamara
Key concerns about the psychometric properties of the 25-item version of the Strengths and Difficulties Questionnaire (SDQ) have consistently been raised in the literature. The present study aimed at examining the meaningfulness of an alternative model to the SDQ in which 7 problematic items are excluded. French-speaking parents of 262 boys and…
Peng, Samuel S.; And Others
Tabular summaries of the 153 numerical responses to the Second Followup Questionnaire items of the National Longitudinal Study of the High School Class of 1972 are presented--20,872 individuals responded. These items summarize participants' educational experiences and occupational attainments from October 1973 to October 1974; continuing or…
Chowdhury, Monali; Aman, Michael G; Lecavalier, Luc; Smith, Tristram; Johnson, Cynthia; Swiezy, Naomi; McCracken, James T; King, Bryan; McDougle, Christopher J; Bearss, Karen; Deng, Yanhong; Scahill, Lawrence
Previously, we adapted the Home Situations Questionnaire to measure behavioral non-compliance in everyday settings in children with pervasive developmental disorders. In this study, we further revised this instrument for use in autism spectrum disorder and examined its psychometric properties (referred to as the Home Situations Questionnaire-Autism Spectrum Disorder). To cover a broader range of situations and improve reliability, we prepared seven new items describing situations in which children with autism spectrum disorder might display non-compliance. Parents completed ratings of 242 children with autism spectrum disorder with accompanying disruptive behaviors (ages 4-14 years) participating in one of two randomized clinical trials. Results from an exploratory factor analysis indicated that the Home Situations Questionnaire-Autism Spectrum Disorder consists of two 12-item factors: Socially Inflexible (α = 0.84) and Demand Specific (α = 0.89). One-to-two-week test-retest reliability was statistically significant for all scored items and also for subscale totals. The pattern of correspondence between the Home Situations Questionnaire-Autism Spectrum Disorder and parent-rated problem behavior, clinician-rated repetitive behavior, adaptive behavior, and IQ provided evidence for concurrent and divergent validity of the Home Situations Questionnaire-Autism Spectrum Disorder. Overall, the results suggest that the Home Situations Questionnaire-Autism Spectrum Disorder is an adequate measure for assessing non-compliance in a variety of situations in this population, and use of its two subscales will likely provide a more refined interpretation of ratings.
Ko, Su-Hwan; Lee, Mi-Soon; Koo, Bon-Sung; Lee, Joon-Ho; Kim, Sang-Hyun; Chae, Won Seok; Jin, Hee Cheol; Lee, Jeong Seok; Kim, Yong-Ik
Background To assess the multidisciplinary aspects of pain, various self-rating questionnaires have been developed, but there have not been sufficient relevant studies on this topic in South Korea. The aim of this study was to develop a new pain sensitivity-related questionnaire in the Korean language that would be simple and would well reflect Koreans' senses. Methods A new pain assessment questionnaire was developed through a pre-survey on "geop", which is the Korean word expressing fear, anxiety, or catastrophizing. We named the new assessment questionnaire the Geop-Pain Questionnaire (GPQ). The GPQ was composed of 15 items divided into three categories and rated on a 5-point scale. As a preliminary study, internal consistency and test-retest reliability analyses were conducted. Subsequently, 109 individuals completed the GPQ along with three pain-related questionnaires translated into Korean (Pain Sensitivity Questionnaire [PSQ], Pain Anxiety Symptoms Scale [PASS], and Pain Catastrophizing Scale [PCS]), and the correlations were analyzed. Results All items in the GPQ showed appropriate internal consistency, and the test-retest reliability analysis showed no statistically significant differences. The correlations between the GPQ and the existing questionnaires revealed that the GPQ scores had mid-positive correlations with the PSQ scores and strong positive correlations with the PASS and PCS scores. Conclusions This study attempted to develop a questionnaire assessing pain sensitivity multidimensionally using the Korean word geop for the first time. The self-rating GPQ showed high correlations with the existing questionnaires and demonstrated potential to be utilized as a pain prediction index in clinical practice. PMID:27703631
Morley, David; Dummett, Sarah; Kelly, Laura; Dawson, Jill; Fitzpatrick, Ray; Jenkinson, Crispin
Purpose There is growing interest in the management of long-term conditions and in keeping people active and participating in the community. Testing the effectiveness of interventions that aim to affect activities and participation can be challenging without a well-developed, valid, and reliable instrument. This study therefore aims to develop a patient-reported outcome measure, the Oxford Participation and Activities Questionnaire (Ox-PAQ), which is theoretically grounded in the World Health Organization’s International Classification of Functioning, Disability, and Health (ICF) and fully compliant with current best practice guidelines. Methods Questionnaire items generated from patient interviews and based on the nine chapters of the ICF were administered by postal survey to 386 people with three neurological conditions: motor neuron disease, multiple sclerosis, and Parkinson’s disease. Participants also completed the Medical Outcomes Study (MOS) 36-Item Short Form Health Survey (SF-36) and EQ-5D-5L. Results A total of 334 participants completed the survey, a response rate of 86.5%. Factor analysis techniques identified three Ox-PAQ domains, consisting of 23 items, accounting for 72.8% of variance. Internal reliability for the three domains was high (Cronbach’s α: 0.81–0.96), as was test–retest reliability (intraclass correlation: 0.83–0.92). Concurrent validity was demonstrated through highly significant relationships with relevant domains of the MOS SF-36 and the EQ-5D-5L. Assessment of known-groups validity identified significant differences in Ox-PAQ scores among the three conditions included in the survey. Conclusion Results suggest that the Ox-PAQ is a valid and reliable measure of participation and activity. The measure will now be validated in a range of further conditions, and additional properties, such as responsiveness, will also be assessed in the next phase of the instrument’s development. PMID:27366108
... 41 Public Contracts and Property Management 2 2012-07-01 2012-07-01 false Item reduction study....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies...
... 41 Public Contracts and Property Management 2 2014-07-01 2012-07-01 true Item reduction study. 101....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies...
... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Item reduction study. 101....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies...
... 41 Public Contracts and Property Management 2 2013-07-01 2012-07-01 true Item reduction study. 101....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies...
... 41 Public Contracts and Property Management 2 2011-07-01 2007-07-01 true Item reduction study. 101....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies...
... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Item standardization code....7-Item Reduction Program § 101-30.701-2 Item standardization code. Item standardization code (ISC) means a code assigned an item in the supply system which identifies the item as authorized...
DeMars, Christine E.; Jurich, Daniel P.
The nonequivalent groups anchor test (NEAT) design is often used to scale item parameters from two different test forms. A subset of items, called the anchor items or common items, are administered as part of both test forms. These items are used to adjust the item calibrations for any differences in the ability distributions of the groups taking…
Magis, David; Tuerlinckx, Francis; De Boeck, Paul
This article proposes a novel approach to detect differential item functioning (DIF) among dichotomously scored items. Unlike standard DIF methods that perform an item-by-item analysis, we propose the "LR lasso DIF method": logistic regression (LR) model is formulated for all item responses. The model contains item-specific intercepts,…
The present study seeks to investigate the extent to which the Acceptance and Action Questionnaire (AAQ-II) is successful in discriminating between experiential avoidance/psychological flexibility on the one hand and the supposed outcomes in terms of psychological well-being of having this trait on the other. This was done using exploratory factor analysis on an item pool containing the AAQ-II items, and items designed for the present study to measure distress and acceptance/non-acceptance, to see what factors are identified and on which factor(s) the AAQ-II items had the highest factor loadings. Interestingly, the analysis found the items of the AAQ-II to be more strongly related to items designed to measure distress than items designed to measure acceptance/nonacceptance with minimal references to functional outcomes. The results of the study are interpreted and discussed in relation to the widespread use of the AAQ in both clinical and scientific contexts and given the centrality of the measure in empirically validating the ACT model of psychopathology and treatment.
Choi, Seung W.; Gibbons, Laura E.; Crane, Paul K.
Logistic regression provides a flexible framework for detecting various types of differential item functioning (DIF). Previous efforts extended the framework by using item response theory (IRT) based trait scores, and by employing an iterative process using group–specific item parameters to account for DIF in the trait scores, analogous to purification approaches used in other DIF detection frameworks. The current investigation advances the technique by developing a computational platform integrating both statistical and IRT procedures into a single program. Furthermore, a Monte Carlo simulation approach was incorporated to derive empirical criteria for various DIF statistics and effect size measures. For purposes of illustration, the procedure was applied to data from a questionnaire of anxiety symptoms for detecting DIF associated with age from the Patient–Reported Outcomes Measurement Information System. PMID:21572908
Blebil, Ali Qais; Sulaiman, Syed Azhar Syed; Hassali, Mohamed Azmi; Dujaili, Juman Abdulelah; Zin, Alfian Mohamed
This study aimed to evaluate the psychometric properties of the Malay-translated version of the brief questionnaire of smoking urges (QSU-Brief). The translation procedure followed the standard guidelines. The reliability and validity of the Malaysian version of the scale were evaluated based on data collected from 133 Malaysian smokers. Internal consistency was calculated to assess reliability. Factor analysis and construct validity testing were performed to validate the psychometric properties of the scale. Total Cronbach's alpha of the scale was 0.806. The exploratory factor analysis revealed two factors that accounted for 66.15% of the explained total variance. The first component consisted of items 1, 3, 6, 7, and 10, while the second component included the rest. The QSU-Brief total score had a significant positive relationship with exhaled CO level (r=0.24; P=0.005), number of cigarettes smoked per day (r=0.30; P<0.001) and other clinical factors. Items 2 and 5 loaded strongly on factor 2, whereas both items loaded ambivalently on two factors in previous studies. This discrepancy might be explained by language differences. The Malaysian QSU-Brief is a good candidate for evaluating urge to smoke in both clinical practice and clinical trials.
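The internal-consistency figure reported in abstracts like the one above is Cronbach's alpha. As a minimal sketch of how that statistic is usually computed from a respondents-by-items score matrix (the function name `cronbach_alpha` and the toy data are illustrative assumptions, not taken from the study):

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Variance of each item (column) across respondents.
    item_variance_sum = sum(pvariance(col) for col in zip(*scores))
    # Variance of the total (summed) score across respondents.
    total_variance = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical data: three respondents, two perfectly consistent items.
toy_scores = [[1, 1], [2, 2], [3, 3]]
alpha = cronbach_alpha(toy_scores)
```

The sketch assumes complete data and a total score that actually varies across respondents; missing responses or a constant total would need separate handling.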
Levasseur, Oona; McDermott, Mark R; Lafreniere, Kathryn D
For each of eight literature-identified conceptual dimensions of mortality awareness, questionnaire items were generated, producing 89 in all. A total of 359 participants responded to these items and to questionnaires measuring health attitudes, risk taking, rebelliousness, and demographic variables. Multivariate correlational analyses investigated the underlying structure of the item pool and the construct validity as well as the reliability of the emergent empirically derived subscales. Five components, rather than eight, were identified. Given the item content of each, the associated mortality awareness subscales were labeled as legacy, fearfulness, acceptance, disempowerment, and disengagement. Each attained an acceptable level of internal reliability. Relationships with other variables supported the construct validity of these empirically derived subscales and more generally of this five-factor model. In conclusion, this new multidimensional measure and model of mortality awareness extends our understanding of this important aspect of human existence and supports a more integrative and optimistic approach to mortality awareness than previously available.
Ito, Naomi; Wada, Hideo; Matsumoto, Masanori; Fujimura, Yoshihiro; Murata, Mitsuru; Izuno, Takashi; Sugita, Minoru; Ikeda, Yasuo
A questionnaire survey of Japanese patients with thrombotic microangiopathy (TMA) was carried out to investigate the frequency, laboratory abnormalities, and outcome in 2004. Out of 185 patients, there were 13 with familial TMA and 172 with acquired TMA. In acquired TMA, there were 66 with Escherichia coli O-157 infection (O-157)-related TMA, 35 with ADAMTS13-related TMA, and 22 with other types of TMA. O-157-related TMA was frequent in patients aged 0 to 15 years, and acquired TMA without O-157 was frequently observed in patients ranging from 31 to 65 years of age. In the treatment of acquired TMA, including plasma exchange (PE), steroid, antiplatelet agent, and anticoagulant, PE was carried out in 94.3% of ADAMTS13-related TMA, 77.3% of other TMA, and 7.6% of O-157-related TMA. The efficacy of PE and steroid therapy tended to be higher in ADAMTS13 TMA than in other types of TMA. The complete remission rate was highest in O-157 TMA. The mortality rate was lowest for O-157 TMA, and this rate also tended to be lower in ADAMTS13-related TMA than in other types of TMA. However, the determination of ADAMTS13 was not universal in Japan at the time of this questionnaire.
Al-Rubaish, Abdullah M.; Rahim, Sheikh Idris A.; Abumadini, Mahdi S.; Wosornu, Lade
Background: Colleges and universities are becoming increasingly accountable for teaching outcomes in order to meet rigorous accreditation standards. Job satisfaction (JS) seems more difficult to measure in the academic field in view of the complexity of roles, duties and responsibilities. Objectives: To compile and determine the psychometric properties of a proposed Academic Job Satisfaction Questionnaire (AJSQ) suitable for university faculty, and amenable to future upgrading. Materials and Methods: A 46-item five-option Likert-type draft questionnaire on JS was distributed for anonymous self-reporting by all the academic staff of five colleges in University of Dammam (n=340). The outcome measures were (1) factor analysis of the questionnaire items, (2) intra-factor α-Coefficient of Internal Consistency Reliability, (3) inter-factor correlations, (4) comparison of psychometric properties in separately analyzed main faculty subgroups. Results: The response rate was 72.9 percent. Factor analysis extracted eight factors which conjointly explained 60.3 percent of the variance in JS. These factors, in descending order of eigenvalue, were labeled “Authority”, “Supervision”, “Policies and Facilities”, “My Work Itself”, “Interpersonal Relationships”, “Commitment”, “Salary” and “Workload”. Cronbach's-α ranged from 0.90 in Supervision to 0.63 in Salary and Workload. All inter-factor correlations were positive and significant, ranging from 0.65 to 0.23. The psychometric properties of the instrument in separately analyzed subgroups divided by sex, nationality, college and clinical duties produced fairly comparable findings. Conclusion: The AJSQ demonstrated good overall psychometric properties in terms of construct validity and internal consistency reliability in both the overall sample and its separately analyzed subgroups. Recommendation: To replicate these findings in larger multicenter samples of academic staff. PMID:21694952
Nabak, Andrea C.; Johnson, Rachael Erin; Keuler, Nicholas S.; Hansen, Karen E.
Objective Our objective was to determine whether a questionnaire can identify subjects with vitamin D insufficiency (VDI). Design Subjects completed the vitamin D and sun (VIDSUN) questionnaire and we measured their serum 25(OH)D levels. We assessed the sensitivity and specificity of the questionnaire to identify VDI (25(OH)D level <50 nmol/L). Setting Clinical Research Unit, University of Wisconsin-Madison Subjects Postmenopausal women Results We recruited 609 postmenopausal women with a mean ± SD age of 61 ± 6 years, of whom 113 (19%) had VDI. Subjects with VDI were more likely to be Black (17% vs. 2%, p<0.001), heavier (BMI 33±7 kg/m2 vs. 29±7 kg/m2, p<0.001) and less likely to tan in the past year (49% vs. 72%, p<0.001), use sunscreen (57% vs. 72%, p<0.001) or report sun exposure in the last three months. They consumed less vitamin D from supplements (86±210 vs. 188±344 IU/day, p=0.003). In logistic regression models, Black race, BMI, suntan within one year, sun exposure in the past three months, sunscreen use and supplemental vitamin D intake were the most useful questions to identify VDI. From these six items, a composite score ≤2.25 demonstrated ≥89% sensitivity but ≤35% specificity for VDI. Conclusion The VIDSUN questionnaire provides an initial tool to identify postmenopausal women at high or low risk of VDI. Existing studies suggest that inclusion of physical activity and triglyceride levels might improve the performance of the VIDSUN questionnaire. PMID:23870503
Crins, Martine H P; Roorda, Leo D; Smits, Niels; de Vet, Henrica C W; Westhovens, Rene; Cella, David; Cook, Karon F; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach's alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach's alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed.
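The graded response model named in the abstract above assigns each response category a probability through differences of cumulative logistic curves. A minimal sketch, assuming a single discrimination parameter `a` and ordered thresholds for one item (the function name and the parameter values are hypothetical, not estimates from the study):

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    theta: latent trait value; a: item discrimination;
    thresholds: increasing threshold parameters b_1 < ... < b_m.
    Returns m + 1 probabilities, one per response category.
    """
    # Cumulative probabilities P(X >= k), padded with 1 (k = 0) and 0 (k = m + 1).
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]


# Hypothetical item with four response options, evaluated at theta = 0.
probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-1.0, 0.0, 1.0])
```

The thresholds must be in increasing order; otherwise some category probabilities would come out negative.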
Lambert, Nadine M.
This investigation attempted to demonstrate the utility of standard item analysis procedures for selecting the most reliable and valid items for scoring Bender Visual Motor Gestalt Test records. (Author)
Fox, Claire L; Gadd, David; Sim, Julius
To provide a more robust assessment of the effectiveness of a domestic abuse prevention education program, a questionnaire was developed to measure children's attitudes to domestic violence. The aim was to develop a short questionnaire that would be easy to use for practitioners but, at the same time, sensitive enough to pick up on subtle changes in young people's attitudes. We therefore chose to ask children about different situations in which they might be willing to condone domestic violence. In Study 1, we tested a set of 20 items, which we reduced by half to a set of 10 items. The factor structure of the scale was explored and its internal consistency was calculated. In Study 2, we tested the factor structure of the 10-item Attitudes to Domestic Violence (ADV) Scale in a separate calibration sample. Finally, in Study 3, we then assessed the test-retest reliability of the 10-item scale. The ADV Questionnaire is a promising tool to evaluate the effectiveness of domestic abuse education prevention programs. However, further development work is necessary.
Bian, Xiaoyan; Xie, Huichao; Squires, Jane; Chen, Chieh-Yu
The Ages & Stages Questionnaire: Social-Emotional (ASQ:SE; Squires, Bricker, & Twombly, 2002a), developed in the United States, was translated and adapted for use in China. Lack of valid and reliable instruments for identifying social and emotional delays in young children is a worldwide issue. Professionals in China have recently focused efforts on developing methods for early identification of social, emotional, and behavioral issues in the birth-to-5 population. Following the guidelines of the International Test Commission, the ASQ:SE was translated into Simplified Chinese (ASQ:SE-C) to collect a normative sample of 2,528 children across China. Data were analyzed to evaluate the psychometric properties of the ASQ:SE-C, using both classical test theory and item response theory, including generating cutoff points appropriate for the Chinese sample. A panel of Chinese experts was surveyed to assess face validity and estimated utility of the newly adapted tool. Discussions of research findings and implications for future studies are provided.
Stammel, Nadine; Neuner, Frank; Böttche, Maria; Knaevelsrud, Christine
Background Post-conflict reconciliation is supposed to have a positive impact on survivors of war and conflict. However, knowledge is limited as validated questionnaires to assess individual readiness to reconcile in the context of human rights violations are still missing. Objectives This study aimed to develop and pilot-test a questionnaire to assess individual readiness to reconcile in victims of human rights violations. Methods The questionnaire was developed and pilot-tested in a sample of 60 adult Kurdish refugees from Turkey. In addition to the questionnaire, trauma exposure, Posttraumatic Stress Disorder (PTSD), depression, anxiety, perceived emotional closeness to the Kurdish people as well as the participants’ ability to differentiate between perpetrators and the people in general were assessed in structured interviews, and their associations with readiness to reconcile were analyzed. Results Factor and item analysis resulted in an 18-item questionnaire with three subscales (openness to interactions; absence of feelings of revenge; openness to conflict resolution). Cronbach's α for the subscales ranged from 0.74 to 0.90, explaining 61% of the total variance. The ability to differentiate between perpetrators and people in general and perceived emotional closeness were the best predictors for readiness to reconcile. The level of trauma exposure was not linked to readiness to reconcile. Although readiness to reconcile was negatively related to PTSD, depression and anxiety, none of these associations reached statistical significance. Conclusions The questionnaire appears to be a reliable measure with good psychometric properties. Further validations in different samples are needed. PMID:22893837
Background Many questionnaires have been developed to measure how psychosocial characteristics are perceived in a work environment. But the content validity of these questionnaires has rarely been questioned due to the absence of a reference taxonomy for characteristics of work environments. Objectives To propose an exhaustive taxonomy of work environment characteristics involved in psychosocial risks and to apply this taxonomy to questionnaires on workplace psychosocial factors. Methods The taxonomy was developed by categorizing factors present in the main theoretical models of the field. Questionnaire items most frequently cited in scientific literature were retained for classification. Results The taxonomy was structured into four hierarchical levels and comprises 53 categories. The 17 questionnaires analyzed included 927 items: 59 from the “physical environment” category, 116 from the “social environment” category, 236 from the “work activity” category, 255 from the “activity management” category, and 174 from the “organizational context” category. Conclusions There are major content differences among analyzed questionnaires. This study offers a means for selecting a scale on the basis of content. PMID:27367232
Dyett, Patricia; Rajaram, Sujatha; Haddad, Ella H.; Sabate, Joan
This study aimed to develop and validate a de novo food frequency questionnaire for self-defined vegans in the United States. Diet histories from pilot samples of vegans and a modified ‘Block Method’ using seven selected nutrients of concern in vegan diet patterns, were employed to generate the questionnaire food list. Food frequency responses of 100 vegans from 19 different U.S. states were obtained via completed mailed questionnaires and compared to multiple telephone-conducted diet recall interviews. Computerized diet analyses were performed. Correlation coefficients, t-tests, rank, cross-tabulations, and probability tests were used to validate and compare intake estimates and dietary reference intake (DRI) assessment trends between the two methods. A 369-item vegan-specific questionnaire was developed with 252 listed food frequency items. Calorie-adjusted correlation coefficients ranged from r = 0.374 to 0.600 (p < 0.001) for all analyzed nutrients except calcium. Estimates, ranks, trends and higher-level participant percentile placements for Vitamin B12 were similar with both methods. Questionnaire intakes were higher than recalls for most other nutrients. Both methods demonstrated similar trends in DRI adequacy assessment (e.g., significantly inadequate vitamin D intake among vegans). This vegan-specific questionnaire can be a useful assessment tool for health screening initiatives in U.S. vegan communities. PMID:25006856
Elías, María Jesús Pérez; Gómez-Ayerbe, Cristina; Elías, Pilar Pérez; Muriel, Alfonso; de Santiago, Alberto Diaz; Martinez-Colubi, María; Moreno, Ana; Santos, Cristina; Polo, Lidia; Barea, Rafa; Robledillo, Gema; Uranga, Almudena; Espín, Agustina Cano; Quereda, Carmen; Dronda, Fernando; Casado, Jose Luis; Moreno, Santiago
The aim of our study was to develop a Spanish-structured HIV risk of exposure and indicator conditions (RE&IC) questionnaire. People attending an emergency room or a primary clinical care center were invited to participate in a prospective, single-arm, open-label study, in which all enrolled patients filled out our developed questionnaire and were HIV tested. Questionnaire accuracy, feasibility, and reliability were evaluated. A total of 5329 valid paired HIV RE&IC questionnaires and rapid HIV tests were performed, 69.3% in the primary clinical care center, 49.6% women, median age 37 years old, 74.9% Spaniards, 20.1% Latin-Americans. Confirmed hidden HIV infection was detected in 4.1%, while the HIV RE&IC questionnaire was positive in 51.2%. HIV RE&IC questionnaire sensitivity was 100% to predict HIV infection, with a 100% negative predictive value. When considered separately, the sensitivity of the RE or IC items decreased to 86.4% or 91%, and similarly their negative predictive value to 99.9% for both of them. The majority of people studied (90.8%) self-completed the HIV RE&IC questionnaire. Median time to complete was 3 minutes. Overall HIV RE&IC questionnaire test-retest Kappa agreement was 0.82 (almost perfect), likewise for IC items 0.89, while for RE items it was lower, 0.78 (substantial). A feasible and reliable Spanish HIV RE&IC self questionnaire accurately discriminated all non–HIV-infected people without missing any HIV diagnoses, in a low prevalence HIV infection area. The best accuracy and reliability were obtained when combining HIV RE&IC items. PMID:26844471
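The accuracy measures quoted in the abstract above (sensitivity and negative predictive value) follow directly from a 2x2 table of questionnaire result against confirmed test result. A minimal sketch under that assumption; the function name `screening_metrics` and the counts are hypothetical and are not the study's data:

```python
def screening_metrics(tp, fp, fn, tn):
    """Return (sensitivity, specificity, negative predictive value).

    tp/fn: infected people with a positive/negative questionnaire;
    fp/tn: uninfected people with a positive/negative questionnaire.
    """
    sensitivity = tp / (tp + fn)   # share of infected people flagged
    specificity = tn / (tn + fp)   # share of uninfected people cleared
    npv = tn / (tn + fn)           # share of negative questionnaires that are truly uninfected
    return sensitivity, specificity, npv


# Hypothetical counts, chosen only to illustrate a screen that misses no cases.
sens, spec, npv = screening_metrics(tp=20, fp=480, fn=0, tn=500)
```

With no false negatives, both sensitivity and negative predictive value equal 1, which is the pattern the abstract describes, while specificity can remain modest when many uninfected people screen positive.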
Carver, Rebecca Bruu; Castéra, Jérémy; Gericke, Niklas; Evangelista, Neima Alice Menezes
In this paper we present the development and validation of a comprehensive questionnaire to assess college students’ knowledge about modern genetics and genomics, their belief in genetic determinism, and their attitudes towards applications of modern genetics and genomic-based technologies. Written in everyday language with minimal jargon, the Public Understanding and Attitudes towards Genetics and Genomics (PUGGS) questionnaire is intended for use in research on science education and public understanding of science, as a means to investigate relationships between knowledge, determinism and attitudes about modern genetics, which are to date little understood. We developed a set of core ideas and initial items from reviewing the scientific literature on genetics and previous studies on public and student knowledge and attitudes about genetics. Seventeen international experts from different fields (e.g., genetics, education, philosophy of science) reviewed the initial items and their feedback was used to revise the questionnaire. We validated the questionnaire in two pilot tests with samples of university freshmen students. The final questionnaire contains 45 items, including both multiple choice and Likert scale response formats. Cronbach alpha showed good reliability for each section of the questionnaire. In conclusion, the PUGGS questionnaire is a reliable tool for investigating public understanding and attitudes towards modern genetics and genomic-based technologies. PMID:28114357
Carver, Rebecca Bruu; Castéra, Jérémy; Gericke, Niklas; Evangelista, Neima Alice Menezes; El-Hani, Charbel N
In this paper we present the development and validation of a comprehensive questionnaire to assess college students' knowledge about modern genetics and genomics, their belief in genetic determinism, and their attitudes towards applications of modern genetics and genomic-based technologies. Written in everyday language with minimal jargon, the Public Understanding and Attitudes towards Genetics and Genomics (PUGGS) questionnaire is intended for use in research on science education and public understanding of science, as a means to investigate relationships between knowledge, determinism and attitudes about modern genetics, which are to date little understood. We developed a set of core ideas and initial items from reviewing the scientific literature on genetics and previous studies on public and student knowledge and attitudes about genetics. Seventeen international experts from different fields (e.g., genetics, education, philosophy of science) reviewed the initial items and their feedback was used to revise the questionnaire. We validated the questionnaire in two pilot tests with samples of university freshmen students. The final questionnaire contains 45 items, including both multiple choice and Likert scale response formats. Cronbach alpha showed good reliability for each section of the questionnaire. In conclusion, the PUGGS questionnaire is a reliable tool for investigating public understanding and attitudes towards modern genetics and genomic-based technologies.
... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... case. In any such instance any excess amount would be an acceptable item to be included in the burial...) is used as a shipping case and also for burial, an allowance of $30 may be made thereon in lieu of...
Geerlings, Hanneke; Glas, Cees A. W.; van der Linden, Wim J.
An application of a hierarchical IRT model for items in families generated through the application of different combinations of design rules is discussed. Within the families, the items are assumed to differ only in surface features. The parameters of the model are estimated in a Bayesian framework, using a data-augmented Gibbs sampler. An obvious…
Tijmstra, Jesper; Hessen, David J.; van der Heijden, Peter G. M.; Sijtsma, Klaas
A new observable consequence of the property of invariant item ordering is presented, which holds under Mokken's double monotonicity model for dichotomous data. The observable consequence is an invariant ordering of the item-total regressions. Kendall's measure of concordance "W" and a weighted version of this measure are proposed as measures for…
Kouimanos, John, Ed.
As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items of value from past tests are made available to teachers for the construction of unit tests, term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The…
... 47 Telecommunication 3 2010-10-01 2010-10-01 false Deducted items. 65.830 Section 65.830 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES (CONTINUED) INTERSTATE RATE OF RETURN PRESCRIPTION PROCEDURES AND METHODOLOGIES Rate Base § 65.830 Deducted items. (a)...
Stocco, Corey S.; Thompson, Rachel H.; Rodriguez, Nicole M.
Restricted and repetitive behavior (RRB) is more pervasive, prevalent, frequent, and severe in individuals with autism spectrum disorders (ASDs) than in their typical peers. One subtype of RRB is restricted interests in items or activities, which is evident in the manner in which individuals engage with items (e.g., repetitious wheel spinning),…
Hale, Gordon A.; And Others
To ascertain how the Test of English as a Foreign Language (TOEFL) would be affected if candidates had access to some of the items before administration of a test containing those items, a number of specially constructed TOEFL forms were made available to 945 foreign students in intensive English language programs. The students were later…
Oosterhof, Albert C.
The purpose of this study was to investigate the degree to which various selected test item discrimination indices reflect a common factor. The indices used include the point-biserial, biserial, phi and tetrachoric coefficients, Flanagan's approximation of the product-moment correlation, Gulliksen's item reliability index, and Findley's difference…
Eggen, Theo J. H. M.; Verhelst, Norman D.
This study discusses the justifiability of item parameter estimation in incomplete testing designs in item response theory. Marginal maximum likelihood (MML) as well as conditional maximum likelihood (CML) procedures are considered in three commonly used incomplete designs: random incomplete, multistage testing and targeted testing designs.…
Rosenfeld, Barry; Pessin, Hayley; Lewis, Charles; Abbey, Jennifer; Olden, Megan; Sachs, Emily; Amakawa, Lia; Kolva, Elissa; Brescia, Robert; Breitbart, William
Hopelessness has become an increasingly important construct in palliative care research, yet concerns exist regarding the utility of existing measures when applied to patients with a terminal illness. This article describes a series of studies focused on the exploration, development, and analysis of a measure of hopelessness specifically intended for use with terminally ill cancer patients. The 1st stage of measure development involved interviews with 13 palliative care experts and 30 terminally ill patients. Qualitative analysis of the patient interviews culminated in the development of a set of potential questionnaire items. In the 2nd study phase, we evaluated these preliminary items with a sample of 314 participants, using item response theory and classical test theory to identify optimal items and response format. These analyses generated an 8-item measure that we tested in a final study phase, using a 3rd sample (n = 228) to assess reliability and concurrent validity. These analyses demonstrated strong support for the Hopelessness Assessment in Illness Questionnaire providing greater explanatory power than existing measures of hopelessness and found little evidence that this assessment was confounded by illness-related variables (e.g., prognosis). In summary, these 3 studies suggest that this brief measure of hopelessness is particularly useful for palliative care settings. Further research is needed to assess the applicability of the measure to other populations and contexts. PMID:21443366
Jones, Andrew T.
Practitioners often depend on item analysis to select items for exam forms and have a variety of options available to them. These include the point-biserial correlation, the agreement statistic, the B index, and the phi coefficient. Although research has demonstrated that these statistics can be useful for item selection, no research as of yet has…
Woods, Carol M.
In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…
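As a brief illustration of the three-parameter logistic (3PL) model named in the abstract above, the standard item response function can be sketched as follows. The function name `p_3pl` and the parameter values are hypothetical and are not taken from the study; the optional scaling constant (often written D = 1.7) is omitted here.

```python
import math


def p_3pl(theta: float, a: float, b: float, c: float) -> float:
    """Probability of a correct response under the 3PL model.

    theta: latent trait level of the examinee
    a: item discrimination, b: item difficulty, c: lower asymptote (guessing)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


if __name__ == "__main__":
    # Hypothetical item: when theta equals b, the probability is c + (1 - c) / 2.
    print(p_3pl(theta=0.0, a=1.2, b=0.0, c=0.2))
```

When the trait level equals the item difficulty, the probability sits halfway between the guessing parameter and 1, which is a convenient sanity check for any implementation.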
Arce-Ferrer, Alvaro J.; Bulut, Okan
This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…
For this dissertation, four item purification procedures were implemented onto the generalized linear mixed model for differential item functioning (DIF) analysis, and the performance of these item purification procedures was investigated through a series of simulations. Among the four procedures, forward and generalized linear mixed model (GLMM)…
Papanastasiou, Elena C.
If good measurement depends in part on the estimation of accurate item characteristics, it is essential that test developers become aware of discrepancies that may exist on the item parameters before and after item review. The purpose of this study was to examine the answer changing patterns of students while taking paper-and-pencil multiple…
Current CAT applications consist of predominantly dichotomous items, and CATs with polytomously scored items are limited. To ascertain the best approach to polytomous CAT, a significant amount of research has been conducted on item selection, ability estimation, and impact of termination rules based on polytomous IRT models. Few studies…
Cher Wong, Cheow
Building on previous works by Lord and Ogasawara for dichotomous items, this article proposes an approach to derive the asymptotic standard errors of item response theory true score equating involving polytomous items, for equivalent and nonequivalent groups of examinees. This analytical approach could be used in place of empirical methods like…
Maij-de Meij, Annette M.; Kelderman, Henk; van der Flier, Henk
Usually, methods for detection of differential item functioning (DIF) compare the functioning of items across manifest groups. However, the manifest groups with respect to which the items function differentially may not necessarily coincide with the true source of the bias. It is expected that DIF detection under a model that includes a latent DIF…
Wendt, Anne; Harmes, J Christine
This article is a continuation of the research on the development and evaluation of innovative item formats for the NCLEX examinations that was published in the March/April 2009 edition of Nurse Educator. The authors discuss the innovative item templates and evaluate the statistical characteristics and level of cognitive processing required to answer the examination items.
Zwick, Rebecca; Thayer, Dorothy T.; Mazzeo, John
Differential item functioning (DIF) assessment procedures for items with more than two ordered score categories, referred to as polytomous items, were evaluated. Three descriptive statistics (standardized mean difference and two procedures based on the SIBTEST computer program) and five inferential procedures were used. Conditions under which the…
Sun, Jianan; Chen, Yunxiao; Liu, Jingchen; Ying, Zhiliang; Xin, Tao
We develop a latent variable selection method for multidimensional item response theory models. The proposed method identifies latent traits probed by items of a multidimensional test. Its basic strategy is to impose an $L_1$ penalty term to the log-likelihood. The computation is carried out by the expectation-maximization algorithm combined with the coordinate descent algorithm. Simulation studies show that the resulting estimator provides an effective way in correctly identifying the latent structures. The method is applied to a real dataset involving the Eysenck Personality Questionnaire.
This paper describes the background and development of a Mental Distress Explanatory Model Questionnaire designed to explore how people from different cultures explain mental distress. A 45-item questionnaire was developed with items derived from the Murdock et al. categories, with additional items covering western notions of physiological causation and stress. The questionnaire was administered to 261 people, mostly college students. Multi-dimensional scaling analysis shows four clusters of mental distress: a) stress; b) western physiological; c) nonwestern physiological; and d) supernatural. These clusters form two dimensions: western physiological vs. supernatural and impersonal vs. personalistic explanations. Natural and stress items are separated from supernatural and nonwestern physiological items along the first dimension. Brain damage, physical illness, and genetic defects have the greatest separation along the first dimension. Being hot, the body being out of balance, and wind currents passing through the body most strongly represent the non-western physiological category. The questionnaire has the potential to be used for community health screening and for monitoring patient care, as well as with students in the health sciences and with health practitioners.
FRANCO-MICHELONI, Ana Lucia; FERNANDES, Giovana; GONÇALVES, Daniela Aparecida de Godoi; CAMPARIS, Cinara Maria
Temporomandibular disorders (TMD) screeners assume significant item overlap with the screening questionnaire proposed by the American Academy of Orofacial Pain (AAOP). Objective: To test the reliability and validity of the Portuguese version of AAOP questions for TMD screening among adolescents. Material and Methods: Diagnoses from Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD) Axis I were used as reference standard. Reliability was evaluated by internal consistency (KR-20) and inter-item correlation. Validity was tested by sensitivity, specificity, predictive values, accuracy and receiver operating characteristic (ROC) curves, which plot the true-positive rate (sensitivity) against the false-positive rate (1 − specificity). Test-retest reliability of AAOP questions and intra-examiner reproducibility of RDC/TMD Axis I were tested with kappa statistics. Results: The sample consisted of 1307 Brazilian adolescents (56.8% girls; n=742), with mean age of 12.72 years (12.69 F/12.75 M). According to RDC/TMD, 397 [30.4% (32.7% F/27.3% M)] of adolescents presented TMD, of which 330 [25.2% (27.6% F/22.2% M)] were painful TMD. Because of low consistency, items #8 and #10 of the AAOP questionnaire were excluded. Remaining items (of the long questionnaire version) showed good consistency and validity for three positive responses or more. After logistic regression, items #4, #6, #7 and #9 also showed satisfactory consistency and validity for two or more positive responses (short questionnaire version). Both versions demonstrated excellent specificity (about 90%), but higher sensitivity for detecting painful TMD (78.2%). Better reproducibility was obtained for the short version (k=0.840). Conclusions: The Portuguese version of AAOP questions showed both good reliability and validity for the screening of TMD among adolescents, especially painful TMD, according to RDC/TMD. PMID:25141204
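To make the internal-consistency statistic named in the abstract above concrete, a minimal sketch of the Kuder-Richardson 20 (KR-20) coefficient for dichotomous (0/1) items follows. The function name `kr20` and the toy response matrix are hypothetical; nothing here reproduces the study's data or software.

```python
from typing import Sequence


def kr20(responses: Sequence[Sequence[int]]) -> float:
    """KR-20 for a persons x items matrix of 0/1 scores."""
    n_persons = len(responses)
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n_persons
    # Population variance of total scores (divide by N), consistent with p*q below.
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_persons
    pq_sum = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_persons  # proportion answering "yes"
        pq_sum += p * (1.0 - p)
    return (n_items / (n_items - 1)) * (1.0 - pq_sum / var_total)


if __name__ == "__main__":
    # Hypothetical answers of four respondents to three yes/no screening items.
    toy = [[1, 1, 1], [0, 0, 0], [1, 1, 1], [0, 0, 0]]
    print(kr20(toy))
```

The sketch assumes complete data and a non-zero variance of total scores; a real analysis would need to handle missing responses and degenerate cases.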
Mulero-Portela, Ana L.; Colón Santaella, Carmen L.; Cruz Gómez, Cynthia
The Theory of Planned Behavior (TPB) serves to understand determinants that predict the intention to exercise. According to this theory, attitudes, subjective norms, and perceptions of behavioral control determine intention. This is the first theory-based tool designed to measure the determinants of exercise among women in Puerto Rico who are breast cancer survivors. Understanding the determinants will assist in planning theory based interventions. The purpose of this study was to develop a TPB-based questionnaire to assess the determinants of exercise of breast cancer survivors in Puerto Rico and to evaluate its psychometric properties. Quantitative and qualitative methods were used for questionnaire development and psychometric testing. Three independent samples were recruited for the phases of item generation, pilot testing, and evaluation of psychometric properties. An initial 97-item questionnaire was constructed. Test–retest reliability was assessed for the indirect subscales; six items were found unreliable and removed. For the direct subscales, seven items with item-to-total correlations <0.30 were removed. The final version consisted of 84 items, with Cronbach’s α ranging from 0.65 to 0.89. Construct validity was demonstrated by significant, fair-to-moderate correlations of all but one of the direct subscales and the multiplied scores of the indirect subscales of similar constructs. PMID:23244037
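The two reliability checks mentioned in the abstract above, Cronbach's α and removal of items with item-to-total correlations below 0.30, can be sketched as follows. This is an illustration only: the function names and toy scores are hypothetical, and the use of the corrected item-total correlation (item versus the sum of the remaining items) is an assumption, since the abstract does not say which variant was used.

```python
from statistics import mean, pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a persons x items matrix of numeric scores."""
    k = len(scores[0])
    item_vars = [pvariance([row[j] for row in scores]) for j in range(k)]
    total_var = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def corrected_item_total(scores, j):
    """Correlation of item j with the sum of all other items (assumed variant)."""
    item = [row[j] for row in scores]
    rest = [sum(row) - row[j] for row in scores]
    return pearson(item, rest)


def flag_weak_items(scores, threshold=0.30):
    """Indices of items whose corrected item-total correlation is below threshold."""
    return [j for j in range(len(scores[0]))
            if corrected_item_total(scores, j) < threshold]


if __name__ == "__main__":
    # Hypothetical 5 respondents x 3 items; the third item is deliberately noisy.
    toy = [[1, 1, 3], [2, 2, 1], [3, 3, 2], [4, 4, 3], [5, 5, 1]]
    print(cronbach_alpha(toy), flag_weak_items(toy))
```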
DeWalt, Darren A.; Thissen, David; Stucky, Brian D.; Langer, Michelle M.; DeWitt, Esi Morgan; Irwin, Debra E.; Lai, Jin-Shei; Yeatts, Karin B.; Gross, Heather E.; Taylor, Olivia; Varni, James W.
Objective: This study’s objective was to develop a measure of social health using item response theory as part of the Patient Reported Outcomes Measurement Information System (PROMIS). Methods: After candidate items were generated from review of prior literature, focus groups, expert input, and cognitive interviews, items were administered to youth aged 8–17 as part of the PROMIS pediatric large scale testing. Exploratory and confirmatory factor analyses were used to assess dimensionality and to identify instances of local dependence. Items that met the unidimensionality criteria were subsequently calibrated using Samejima’s Graded Response Model. Differential item functioning was examined by gender and age. Results: The sample included 3,048 youth who completed the questionnaire (51.8% female, 60% white, and 22.7% with chronic illness). The initial conceptualization of social function and sociability did not yield unidimensional item banks. Rather, factor analysis revealed dimensions contrasting peer relationships and adult relationships. The analysis also identified dimensions formed by responses to positively versus negatively worded items. The resulting 15-item bank measures quality of peer relationships and has strong psychometric characteristics as a full bank or an 8-item short form. Conclusions: The PROMIS pediatric peer relationships scale demonstrates good psychometric characteristics and addresses an important aspect of child health. PMID:23772887
Krägeloh, Christian U; Billington, D Rex; Hsu, Patricia Hsien-Chuan; Feng, Xuan Joanna; Medvedev, Oleg N; Kersten, Paula; Landon, Jason; Siegert, Richard J
The World Health Organisation Quality of Life (WHOQOL) questionnaires are widely used around the world and can claim strong cross-cultural validity due to their development in collaboration with international field centres. To enhance conceptual equivalence of quality of life across cultures, optional national items are often developed for use alongside the core instrument. The present study outlines the development of national items for the New Zealand WHOQOL-BREF. Focus groups with members of the community as well as health experts discussed what constitutes quality of life in their opinion. Based on themes extracted of aspects not contained in the existing WHOQOL instrument, 46 candidate items were generated and subsequently rated for their importance by a random sample of 585 individuals from the general population. Applying importance criteria reduced these items to 24, which were then sent to another large random sample (n = 808) to be rated alongside the existing WHOQOL-BREF. A final set of five items met the criteria for national items. Confirmatory factor analysis identified four national items as belonging to the psychological domain of quality of life, and one item to the social domain. Rasch analysis validated these results and generated ordinal-to-interval conversion algorithms to allow use of parametric statistics for domain scores with and without national items. PMID:27812203
Bowers, John J.
Procedures devised for computing factor matching statistics in questionnaire construction are described. In a study of the impact of vocational education on social, affective and nontechnical development of students, an attitude questionnaire to measure outcomes was developed. Each item in the final section had a six-point Likert type response…
International Labour Office, Geneva (Switzerland).
The report is a compilation of replies by 61 member States of the International Labour Office to a questionnaire concerning paid educational leave prepared in anticipation of the 58th Session of the International Labour Conference. A brief section on general observations precedes the responses to specific items on the questionnaire, the relevant…
Oberholzer, Michael; Poryazova, Rositsa; Bassetti, Claudio L
Sleepwalking (SW) corresponds to a complex sleep-associated behavior that includes locomotion, mental confusion, and amnesia. SW is present in about 10% of children and 2-3% of adults. In a retrospective series of 165 patients with Parkinson's disease (PD), we found adult-onset ("de novo") SW in six (4%) of them. The aim of this study was to assess prospectively and systematically the frequency and characteristics of SW in PD patients. A questionnaire including items on sleep quality, sleep disorders, and specifically also SW and REM sleep behavior disorder (RBD), PD characteristics and severity, was sent to the members of the national PD patients organization in Switzerland. In the study, 36/417 patients (9%) reported SW, of whom 22 (5%) had adult-onset SW. Patients with SW had significantly longer disease duration (p = 0.035), they more often reported hallucinations (p = 0.004) and nightmares (p = 0.003), and they had higher scores suggestive of RBD on a validated questionnaire (p = 0.001). Patients with SW were also sleepier (trend to a higher Epworth Sleepiness Scale score, p = 0.055). Our data suggest that SW in PD patients (1) is more common than in the general population, and (2) is associated with RBD, nightmares, and hallucinations. Further studies including polysomnographic recordings are needed to confirm the results of this questionnaire-based analysis, to understand the relationship between SW and other nighttime wandering behaviors in PD, and to clarify the underlying mechanisms.
Karekla, Maria; Pilipenko, Nataliya; Feldman, Jonathan
This study aimed to assess the reliability, validity, and factor structure of the Greek translation of the Patient Health Questionnaire (PHQ) in a sample of Cypriot, Greek-speaking university students. This is the first study to examine PHQ psychometric properties in Greek and to investigate the factor structure of the PHQ subscales. A total of 520 participants (73.9% women; M(Age) = 21.57; SD, 4.94) completed the PHQ and assessment tools used for convergent validity analysis. The Patient Health Questionnaire was translated and culturally adapted according to international standards. Overall, PHQ subscales in the Greek language demonstrated good internal consistency (mean Cronbach α = .75, P < .001) and convergent validity with the following: Alcohol Use Disorders Identification Test, Beck Depression Inventory, Psychiatric Diagnostic Screening Questionnaire (panic disorder, somatization, bulimia, and binge eating), and Anxiety Sensitivity Index (overall mean, r = 0.52; P < .001). The relation between the PHQ subscale diagnoses and functional impairment, as assessed by the 12-item Health Survey 12, was comparable with the original validation results for all subscales except alcohol. The depression, alcohol, and anxiety subscales exhibited single-factor structures. Subscales assessing eating disorders, panic disorder, and somatization difficulties exhibited 2-, 3-, and 4-factor structures, respectively. Overall, PHQ subscales demonstrated good psychometric properties, with the exception of the subscale examining problematic alcohol use. Overall, the PHQ demonstrates good reliability, validity, and appropriate factor structure in a Greek-speaking college population. Psychometric research is needed on the Greek PHQ in primary care settings.
Oner, Pinar; Oner, Ozgur; Munir, Kerim
We compared ratings on the Three-Item Direct Observation Screen test for autism spectrum disorders completed by pediatric residents with the Social Communication Questionnaire parent reports as an augmentative tool for improving autism spectrum disorder screening performance. We examined three groups of children (18-60 months) comparable in age (18-24 month, 24-36 month, 36-60 preschool subgroups) and gender distribution: n = 86 with Diagnostic and Statistical Manual of Mental Disorders (4th ed., text rev.) autism spectrum disorders; n = 76 with developmental delay without autism spectrum disorders; and n = 97 with typical development. The Three-Item Direct Observation Screen test included the following: (a) Joint Attention, (b) Eye Contact, and (c) Responsiveness to Name. The parent Social Communication Questionnaire ratings had a sensitivity of .73 and specificity of .70 for diagnosis of autism spectrum disorders. The Three-Item Direct Observation Screen test item Joint Attention had a sensitivity of .82 and specificity of .90, Eye Contact had a sensitivity of .89 and specificity of .91, and Responsiveness to Name had a sensitivity of .67 and specificity of .87. In the Three-Item Direct Observation Screen test, having at least one of the three items positive had a sensitivity of .95 and specificity of .85. Age, diagnosis of autism spectrum disorder, and developmental level were important factors affecting sensitivity and specificity. The results indicate that augmentation of autism spectrum disorder screening by observational items completed by trained pediatric-oriented professionals can be a highly effective tool in improving screening performance. If supported by future population studies, the results suggest that primary care practitioners will be able to be trained to use this direct procedure to augment screening for autism spectrum disorders in the community.
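The "at least one of the three items positive" rule described in the abstract above, together with sensitivity and specificity, can be sketched in a few lines. The function names and the toy observations are hypothetical and do not reproduce the study's data.

```python
def any_positive(item_flags):
    """Screen-positive if at least one observed item is flagged for a child."""
    return [any(row) for row in item_flags]


def sensitivity_specificity(predicted, actual):
    """Return (sensitivity, specificity) for paired screen results and diagnoses."""
    pairs = [(bool(p), bool(a)) for p, a in zip(predicted, actual)]
    tp = sum(1 for p, a in pairs if p and a)          # true positives
    fn = sum(1 for p, a in pairs if not p and a)      # false negatives
    tn = sum(1 for p, a in pairs if not p and not a)  # true negatives
    fp = sum(1 for p, a in pairs if p and not a)      # false positives
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Hypothetical flags per child: (joint attention, eye contact, response to name).
    items = [(1, 0, 0), (0, 0, 0), (0, 1, 1), (0, 0, 0), (1, 0, 0)]
    diagnosis = [1, 1, 1, 0, 0]  # hypothetical reference diagnoses
    print(sensitivity_specificity(any_positive(items), diagnosis))
```

Combining items with an "any positive" rule tends to raise sensitivity at some cost in specificity relative to a single item, which matches the pattern reported in the abstract.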
Surveys and questionnaires are often used in nursing research to elicit the views of large groups of people to develop the nursing knowledge base. This article provides an overview of survey and questionnaire use in nursing research, clarifies the place of the questionnaire as a data collection tool in quantitative research design and provides information and advice about best practice in the development of quantitative surveys and questionnaires.
Taylor, C. L.; Summerhill, W. R.
The effects of (1) format and color, and (2) severity of issue (freeze damage to citrus industry) on the response rate of mail questionnaires are presented. Questionnaires were formatted in two different ways: a one-page, legal-size sheet printed on both sides, and one 11- by 17-inch sheet center-folded with items on three pages. Two colors were used:…
Cook, Gillian; And Others
To investigate conditions for and current practices among supervisors of English language arts, a 20-item questionnaire was mailed to 350 language arts supervisors. By the April 1, 1983, deadline date, 96 had returned completed questionnaires. Of these, 62 respondents held positions at the school district level, 29 held positions at the state or…
Hartman, Catharina A.; Luteijn, Ellen; Serra, Marike; Minderaa, Ruud
The objective of this study was to refine the Children's Social Behavior Questionnaire (CSBQ), to reduce its length, and to verify its psychometric properties. The CSBQ is a questionnaire for parents or caregivers of children with PDD. The items describe a broad range of features that are typical of PDD, particularly in its milder forms. Based on…
... 19 Customs Duties 3 2011-04-01 2011-04-01 false Questionnaires. 357.105 Section 357.105 Customs... Questionnaires. For reviews conducted under section 106(b)(2), the Secretary normally will send questionnaires to potential producers/suppliers of the product to determine whether it is in short supply....
... 19 Customs Duties 3 2012-04-01 2012-04-01 false Questionnaires. 357.105 Section 357.105 Customs... Questionnaires. For reviews conducted under section 106(b)(2), the Secretary normally will send questionnaires to potential producers/suppliers of the product to determine whether it is in short supply....
... 19 Customs Duties 3 2010-04-01 2010-04-01 false Questionnaires. 357.105 Section 357.105 Customs... Questionnaires. For reviews conducted under section 106(b)(2), the Secretary normally will send questionnaires to potential producers/suppliers of the product to determine whether it is in short supply....
Kim, Namjoo; Lee, Seonjoo; Lee, Sujung; Seo, Sang-Soo; Chung, Seung Hyun
Objective: The Gynecologic Cancer Lymphedema Questionnaire (GCLQ) was designed to identify gynecologic cancer patients with lower limb lymphedema (LLL). The questionnaire consists of 20 items distributed over 7 symptom clusters. The present study aimed to develop an abridged form of the GCLQ for simpler screening and more effective follow-up of LLL. Methods: Data that had been collected for the development and validation of the Korean version of the GCLQ (GCLQ-K) were used in this study. Receiver-operating characteristic (ROC) curves were drawn according to the individual items of the GCLQ-K. Based on discrimination ability, the candidate items were selected in each symptom cluster. After combining the items, the best model was identified and named GCLQ-7. The area under the ROC curve (AUC) was compared between the GCLQ-7 and the original GCLQ-K. Results: In total, 11 candidate items were selected from the original GCLQ-K. Among the models made with the candidate items, GCLQ-7, the best model, was constructed with 7 items as follows: 1) limited knee movement, 2) general swelling, 3) redness, 4) firmness/tightness, 5) groin swelling, 6) heaviness, and 7) aching. This model exhibited an AUC of 0.945 (95% confidence interval [CI], 0.900–0.991), which is comparable with that of the original GCLQ-K (AUC, 0.867; 95% CI, 0.779–0.956). The best cutoff value was 2 points, at which the sensitivity and specificity were 97.0% and 76.5%, respectively. Conclusion: The newly developed short version model, GCLQ-7, showed acceptable discrimination ability as compared with the original GCLQ-K. PMID:27819411
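As a rough illustration of the ROC analysis described in the abstract above, the sketch below computes the AUC as the probability that a randomly chosen case scores higher than a randomly chosen non-case, and picks a cutoff by Youden's J. The function names and toy scores are hypothetical, and the use of Youden's J is an assumption: the abstract does not state how the best cutoff was chosen.

```python
def auc(scores_pos, scores_neg):
    """AUC via pairwise comparison of case and non-case scores (ties count 0.5)."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def best_cutoff_youden(scores_pos, scores_neg):
    """Return (cutoff, J, sensitivity, specificity); 'score >= cutoff' is screen-positive.

    Assumption: the cutoff maximizing J = sensitivity + specificity - 1 is 'best'.
    On ties the lowest such cutoff is kept.
    """
    best = None
    for c in sorted(set(scores_pos) | set(scores_neg)):
        sens = sum(1 for s in scores_pos if s >= c) / len(scores_pos)
        spec = sum(1 for s in scores_neg if s < c) / len(scores_neg)
        j = sens + spec - 1.0
        if best is None or j > best[1]:
            best = (c, j, sens, spec)
    return best


if __name__ == "__main__":
    # Hypothetical symptom counts for patients with and without lymphedema.
    with_lll = [2, 3, 4, 5]
    without_lll = [0, 1, 1, 2]
    print(auc(with_lll, without_lll), best_cutoff_youden(with_lll, without_lll))
```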
Gamper, Eva-Maria; Groenvold, Mogens; Petersen, Morten Aa; Young, Teresa; Costantini, Anna; Aaronson, Neil; Giesinger, Johannes M; Meraner, Verena; Kemmler, Georg; Holzner, Bernhard
Background: The European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group is currently developing computerized adaptive testing measures for the Quality of Life Questionnaire Core-30 (QLQ-C30) scales. The work presented here describes the development of an EORTC item bank for emotional functioning (EF), which is one of the core domains of the QLQ-C30. Methods: According to the EORTC guidelines on module development, the development of the EF item bank comprised four phases, of which the phases I–III are reported in the present paper. Phase I involved defining the theoretical framework for the EF item bank and a literature search. Phase II included pre-defined item selection steps and a multi-stage expert review process. In phase III, feedback from cancer patients from different countries was obtained. Results: On the basis of literature search in phase I, a list of 1750 items was generated. These were reviewed and further developed in phase II with a focus on relevance, redundancy, clarity, and difficulty. The development and selection steps led to a preliminary list of 41 items. In phase III, patient interviews (N = 41; Austria, Denmark, Italy, and the UK) were conducted with the preliminary item list, resulting in some minor changes to item wording. The final list comprised 38 items. Discussion: The phases I–III of the developmental process have resulted in an EF item list that was well accepted by patients in several countries. The items will be subjected to larger-scale field testing in order to establish their psychometric characteristics and their fit to an item response theory model. PMID:24217943
Aleksandrowicz, J W
The "S-II" Symptom Check-list allows for a fast diagnosis of neurotic disorders. A result of 165 points suggests the presence of such disorders with a probability of 90%. The check-list was constructed to use the questions most commonly endorsed by patients with neurotic disorders (owing to the change in frequency) and, as far as possible, an equal number of questions on symptoms common to women and to men. As a result, the norm for women and men is identical. The S-II Symptom Check-list is a shortened and updated version of the "O" Symptom Check-list, developed in 1975. It is similar to the SCL-90 and highly correlated with it, but it does not contain the variables concerning psychotic symptoms. As a result, its accuracy (specificity) in the diagnosis of neurotic disorders is high. Four pairs of questions allow answer reliability to be judged. Ten scales were identified in the questionnaire. They are of auxiliary value only and do not by themselves permit a diagnosis of the type of disorder listed in the ICD-10. The scale results can, however, make a correct diagnosis easier.
Chen, Liuxi; Xu, Kai; Fu, Lingyun; Xu, Shaofang; Gao, Qianqian; Wang, Wei
Consistent results have shown a relationship between the psychological world of children and their perceived parental bonding or family attachment style, but to date there is no single measure covering both styles. The authors designed a statement matrix with 116 items for this purpose and compared it with the Parental Bonding Instrument (PBI) in a study with 718 university students. After exploratory and confirmatory factor analyses, five factors (scales)--namely, Paternal/Maternal Encouragement (5 items each), Paternal/Maternal Abuse (5 items each), Paternal/Maternal Freedom Release (5 items each), General Attachment (5 items), and Paternal/Maternal Dominance (4 items each)--were defined to form a Family Relationship Questionnaire (FRQ). The internal alphas of the factors ranged from .64 to .83, and their congruency coefficients were .93 to .98 in samples regarding father and mother. Women scored significantly higher on FRQ General Attachment and Maternal Encouragement and lower on Paternal Abuse than men did; only children scored significantly higher on Paternal and Maternal Encouragements than children with siblings did. Women also scored significantly higher on PBI Paternal Autonomy Denial; only children scored significantly higher on Paternal and Maternal Cares and Maternal Autonomy Denial. All intercorrelations between FRQ scales were low to medium, and some correlations between FRQ and PBI scales were medium to high. This study demonstrates that the FRQ has a structure of five factors with satisfactory discriminant and convergent validities, which might help to characterize family relationships in healthy and clinical populations.
Choi, Bernard C K; Pak, Anita W P
Bias in questionnaires is an important issue in public health research. To collect the most accurate data from respondents, investigators must understand and be able to prevent or at least minimize bias in the design of their questionnaires. This paper identifies and categorizes 48 types of bias in questionnaires based on a review of the literature and offers an example of each type. The types are categorized according to three main sources of bias: the way a question is designed, the way the questionnaire as a whole is designed, and how the questionnaire is administered. This paper is intended to help investigators in public health understand the mechanism and dynamics of problems in questionnaire design and to provide a checklist for identifying potential bias in a questionnaire before it is administered.
Ozturk, Nagihan Boztunc; Dogan, Nuri
This study aims to investigate the effects of item exposure control methods on measurement precision and on test security under various item selection methods and item pool characteristics. In this study, the Randomesque (with item group sizes of 5 and 10), Sympson-Hetter, and Fade-Away methods were used as item exposure control methods. Moreover,…
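As an illustration of the Randomesque idea named above, the following is a minimal sketch, not the authors' simulation code. It assumes a two-parameter logistic (2PL) item pool and maximum-information ranking; the pool values, the function names, and the `group_size` parameter are hypothetical. At each step the method picks one item at random from the few most informative unused items rather than always taking the single best one.

```python
import math
import random

def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def randomesque_select(theta, pool, administered, group_size=5, rng=random):
    """Pick one item at random from the `group_size` most informative unused items.

    pool maps item id -> (a, b); the values used below are invented.
    """
    candidates = [i for i in pool if i not in administered]
    # Rank unused items from most to least informative at the current ability.
    candidates.sort(key=lambda i: item_information(theta, *pool[i]), reverse=True)
    return rng.choice(candidates[:group_size])

# Hypothetical pool: item id -> (discrimination a, difficulty b)
pool = {i: (1.0 + 0.1 * (i % 5), -2.0 + 0.2 * i) for i in range(20)}
chosen = randomesque_select(0.0, pool, administered={10}, group_size=5,
                            rng=random.Random(0))
```

With `group_size=1` the sketch reduces to plain maximum-information selection; larger groups spread exposure across more items at some cost in precision.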
Ariel, Adelaide; van der Linden, Wim J.; Veldkamp, Bernard P.
Item-pool management requires a balancing act between the input of new items into the pool and the output of tests assembled from it. A strategy for optimizing item-pool management is presented that is based on the idea of a periodic update of an optimal blueprint for the item pool to tune item production to test assembly. A simulation study with…
Penfield, Randall David
A polytomous item is one for which the responses are scored according to three or more categories. Given the increasing use of polytomous items in assessment practices, item response theory (IRT) models specialized for polytomous items are becoming increasingly common. The purpose of this ITEMS module is to provide an accessible overview of…
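To make the idea of a polytomous IRT model concrete, here is a small sketch of category probabilities under Samejima's graded response model, one common model for ordered categories. The abstract is truncated, so whether the module covers this particular model is an assumption; the parameter values are invented.

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Category probabilities under the graded response model.

    thresholds: increasing b_1 < ... < b_{K-1} for an item with K categories.
    """
    # Cumulative probabilities P(X >= k), bracketed by 1 (k = 0) and 0 (k = K).
    cum = [1.0]
    cum += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cum += [0.0]
    # P(X = k) is the difference between adjacent cumulative probabilities.
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]

# Hypothetical four-category item evaluated at ability 0.5
probs = grm_category_probs(theta=0.5, a=1.2, thresholds=[-1.0, 0.0, 1.5])
```

Three thresholds yield four category probabilities that sum to one.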
Chassany, O; Marquis, P; Scherrer, B; Read, N; Finger, T; Bergmann, J; Fraitag, B; Geneve, J; Caulin, C
BACKGROUND: Dyspepsia and irritable bowel syndrome are suitable conditions for assessment of quality of life. Their similarities justify the elaboration of a single specific questionnaire for the two conditions. AIMS: To examine the process leading to the validation of the psychometric properties of the functional digestive disorders quality of life questionnaire (FDDQL). METHODS: Initially, the questionnaire was given to 154 patients, to assess its acceptability and reproducibility, analyse its content, and reduce the number of items. Its responsiveness was tested during two therapeutic trials which included 428 patients. The questionnaire has been translated into French, English, and German. The psychometric validation study was conducted in France, United Kingdom, and Germany by 187 practitioners. A total of 401 patients with dyspepsia or irritable bowel syndrome, defined by the Rome criteria, filled in the FDDQL and generic SF-36 questionnaires. RESULTS: The structure of the FDDQL scales was checked by factorial analysis. Its reliability was expressed by a Cronbach's α coefficient of 0.94. Assessment of its discriminant validity showed that the more severe the functional digestive disorders, the more impaired the quality of life (p<0.05). Concurrent validity was supported by the correlation found between the FDDQL and SF-36 questionnaire scales. The final version of the questionnaire contains 43 items belonging to eight domains. CONCLUSIONS: The properties of the FDDQL questionnaire, available in French, English, and German, make it appropriate for use in clinical trials designed to evaluate its responsiveness to treatment among patients with dyspepsia and irritable bowel syndrome. Keywords: digestive disorders; irritable bowel syndrome; dyspepsia; quality of life; clinical trial; validation PMID:10075960
Afolabi, Muhammed O; Bojang, Kalifa; D'Alessandro, Umberto; Ota, Martin O C; Imoukhuede, Egeruan B; Ravinetto, Raffaella; Larson, Heidi J; McGrath, Nuala; Chandramohan, Daniel
Objective To develop and psychometrically evaluate an audio digitised tool for assessment of comprehension of informed consent among low-literacy Gambian research participants. Setting We conducted this study in the Gambia where a high illiteracy rate and absence of standardised writing formats of local languages pose major challenges for research participants to comprehend consent information. We developed a 34-item questionnaire to assess participants’ comprehension of key elements of informed consent. The questionnaire was face validated and content validated by experienced researchers. To bypass the challenge of a lack of standardised writing formats, we audiorecorded the questionnaire in three major Gambian languages: Mandinka, Wolof and Fula. The questionnaire was further developed into an audio computer-assisted interview format. Participants The digitised questionnaire was administered to 250 participants enrolled in two clinical trials in the urban and rural areas of the Gambia. One week after first administration, the questionnaire was readministered to half of the participants who were randomly selected. Participants were eligible if enrolled in the parent trials and could speak any of the three major Gambian languages. Outcome measure The primary outcome measure was reliability and validity of the questionnaire. Results Item reduction by factor analysis showed that 21 of the question items have strong factor loadings. These were retained along with five other items which were fundamental components of informed consent. The 26-item questionnaire has high internal consistency with a Cronbach's α of 0.73–0.79 and an intraclass correlation coefficient of 0.94 (95% CI 0.923 to 0.954). Hypotheses testing also showed that the questionnaire has a positive correlation with a similar questionnaire and discriminates between participants with and without education. Conclusions We have developed a reliable and valid measure of comprehension of informed consent
Solari, A.; Filippini, G.; Mendozzi, L.; Ghezzi, A.; Cifani, S.; Barbieri, E.; Baldini, S.; Salmaggi, A.; Mantia, L. L.; Farinotti, M.; Caputo, D.; Mosconi, P.
OBJECTIVES: Health related quality of life (HRQOL) inventories are multi-dimensional measures of patient-centred health status developed for clinical research. The MS quality of life 54 (MSQOL-54) is an MS-specific HRQOL inventory originally devised for English speaking patients. It consists of a core measure, the 36-item short form health survey (SF-36) previously adapted into Italian, and 18 additional items exploring domains relevant to patients with MS (MS-18 module). The authors translated and culturally adapted into Italian the MS-18 module of the MSQOL-54 questionnaire, and clinically validated the whole questionnaire. METHODS: The MS-18 module was translated following the methodology of the International Quality of Life Assessment (IQOLA) project. The MSQOL-54 was validated in 204 consecutive patients with MS seen between April and September 1997 at three participating centres. The questionnaire was explained by the physician who also administered the expanded disability status scale (EDSS) and mini mental status scale examination, and the patient filled in the MSQOL-54 and Beck depression inventory questionnaires (BDI), with assistance if required. The contribution of impairments and disabilities to MSQOL-54 scores was assessed, and mean scores were compared with normative data for the general Italian population, and with the original sample of United States MS patients. RESULTS: The mean age of the 204 patients was 42 years; mean EDSS score was 4.5 (range 0-8.5). Patients' participation in the assessment was satisfactory, and all scales satisfied the usual psychometric standards. The characteristics of the United States sample matched those of our patients in all but gender (72% United States patients v 52% Italian patients were women), and education (90% United States patients and 44% Italian patients completed high school); MSQOL-54 profiles were also similar. The EDSS was significantly associated with the physical health composite but not with
Item response theory (IRT) becomes an increasingly important tool when analyzing "big data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions are infringed upon when deployed in the online realm. For a large-enrollment physics course for scientists and engineers, the study compares outcomes from IRT analyses of exam and homework data, and then proceeds to investigate the effects of each confounding factor introduced in the online realm. It is found that IRT yields the correct trends for learner ability and meaningful item parameters, yet overall agreement with exam data is moderate. It is also found that learner ability and item discrimination are robust over a wide range with respect to model assumptions and introduced noise. Item difficulty is also robust, but over a narrower range.
Spiegel, F. Xavier
Everyday household items are used to demonstrate some unique properties of materials. A coat hanger, rubber band, balloon, and corn starch have typical properties which we often take for granted but can be truly amazing.
Park, Yoon Soo; Lee, Young-Sun; Xing, Kuan
This study investigates the impact of item parameter drift (IPD) on parameter and ability estimation when the underlying measurement model fits a mixture distribution, thereby violating the item invariance property of unidimensional item response theory (IRT) models. An empirical study was conducted to demonstrate the occurrence of both IPD and an underlying mixture distribution using real-world data. Twenty-one trended anchor items from the 1999, 2003, and 2007 administrations of Trends in International Mathematics and Science Study (TIMSS) were analyzed using unidimensional and mixture IRT models. TIMSS treats trended anchor items as invariant over testing administrations and uses pre-calibrated item parameters based on unidimensional IRT. However, empirical results showed evidence of two latent subgroups with IPD. Results also showed changes in the distribution of examinee ability between latent classes over the three administrations. A simulation study was conducted to examine the impact of IPD on the estimation of ability and item parameters, when data have underlying mixture distributions. Simulations used data generated from a mixture IRT model and estimated using unidimensional IRT. Results showed that data reflecting IPD using mixture IRT model led to IPD in the unidimensional IRT model. Changes in the distribution of examinee ability also affected item parameters. Moreover, drift with respect to item discrimination and distribution of examinee ability affected estimates of examinee ability. These findings demonstrate the need to caution and evaluate IPD using a mixture IRT framework to understand its effects on item parameters and examinee ability. PMID:26941699
Pan, Raquel; Marques, Amanda Rossi; dos Santos, Bruna Domingos; Jacob, Eufemia; dos Santos, Claudia Benedita; Nascimento, Lucila Castanheira
OBJECTIVE: to present the cultural adaptation of the questionnaire Costs of caring for children with cancer, offering a valid and reliable tool to assess the economic repercussions of childhood cancer for Brazilian families. METHOD: this is methodological research with a cross-sectional design. The methodological framework to validate the questionnaire was a combined process that included seven steps: translation to Portuguese; first translated consensus version; evaluation by Expert Committee; consensus on the Expert Committee version; back-translation; consensus of back-translated versions; semantic validation. The study was conducted in two phases: phase one was the translation and back-translation process, with five expert committee members. Phase two was the semantic validation, with 24 participants, who answered an instrument about their impressions of the questionnaire and suggested modifications. RESULTS: in phase one, items were included, excluded, and replaced to make the content equivalent and valid for use in the Brazilian context. In phase two, the majority of the participants were mothers, who made suggestions about the relevance and clarity of the items in the questionnaire. CONCLUSIONS: the authors discussed these recommendations and made adaptations, turning the questionnaire into a valid and reliable tool for application. PMID:25296142
Youngs, Donna; Canter, David V
The study of narrative processes as part of the immediate factors that shape criminal action is limited by the lack of a methodology for differentiating the narrative themes that characterise specific crime events. The current study explores how the roles offenders see themselves as playing during an offence encapsulate their underlying crime narratives and thus provide the basis for a quantitative methodology. To test this possibility, a 33-item Narrative Roles Questionnaire (NRQ) was developed from intensive interviews with offenders about their experience of committing a recent offence. A multidimensional analysis of the NRQ completed by 71 convicted offenders revealed life narrative themes similar to those identified in fiction by Frye and with noncriminals by McAdams, labelled The Professional, Victim, Hero, and Revenger offence roles. The NRQ thus is a first step in opening up the possibility of empirical studies of the narrative aetiological perspective in criminology.
Fitkov-Norris, E. D.; Yeghiazarian, A.
This article discusses the application of Rasch analysis to assess the internal validity of a four sub-scale VARK (Visual, Auditory, Read/Write and Kinaesthetic) learning styles instrument. The results from the analysis show that the Rasch model fits the majority of the VARK questionnaire data and the sample data support the internal validity of the four sub-constructs at 1% level of significance for all but one item. While this suggests that the instrument could potentially be used as a predictor for a person's learning preference orientation, further analysis is necessary to confirm the invariability of the instrument across different user groups across factors such as gender, age, educational and cultural background.
Brooks, Robert; Bryant, Richard A; Silove, Derrick; Creamer, Mark; O'Donnell, Meaghan; McFarlane, Alexander C; Marmar, Charles R
This paper has been retracted due to a publisher's error: the order of the authors was incorrect. The Editor and Publisher of the Journal of Traumatic Stress apologize to the authors and our readership. The Peritraumatic Dissociative Experiences Questionnaire (PDEQ) is a widely used measure of peritraumatic dissociation, and is presumably a unidimensional construct. Two hundred forty-seven individuals admitted to five hospitals after traumatic injury were administered the Clinician Administered PTSD Scale, the Hospital Anxiety and Depression Scale, and the PDEQ. Factor analysis indicated that the PDEQ involved two factors containing four items each: one factor (altered awareness) indexes alterations in awareness and the other (derealization) reflects distortions in perceptions of the self and the world. Only the derealization factor was associated with acute stress, anxiety, and depression symptoms. Cross-validation with independent data provided only partial support for the 2-factor structure model. These data indicate that peritraumatic dissociation may involve two distinct constructs.
Torrealday, O.; Stein, L. A. R.; Barnett, N.; Golembeske, C.; Lebeau, R.; Colby, S. M.; Monti, P. M.
The purpose of this study was to evaluate a brief version of the Marijuana Effect Expectancy Questionnaire (MEEQ; Schafer & Brown, 1991). The original MEEQ was reduced to 6 items (MEEQ-B). Principal component analysis (PCA) was performed and two factors were identified (positive effects and negative effects) accounting for 52.3% of the variance. Internal consistencies (0.42 to 0.60) were slightly lower than those of the original MEEQ. The negative effect expectancy scale correlated with criterion variables that assess marijuana use (p ≤ .05). This measure is a helpful tool for clinicians to use when assessing youth expectancies. Replication across different samples of adjudicated youth is recommended. PMID:22058648
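Internal consistency figures such as those reported above are usually Cronbach's alpha. The sketch below shows the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). It uses invented responses, not data from the study, and the function name is hypothetical.

```python
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha; rows is a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Made-up responses: 5 respondents x 3 items, purely illustrative.
scores = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 4, 4], [3, 3, 4]]
alpha = cronbach_alpha(scores)
```

Because alpha depends on the number of items k, a shortened scale tends to show lower values than its longer parent even when the items themselves are unchanged.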
... profitable) commercial customer base. This means that the commercial vendors have several customers and their products are manufactured to meet more ... Naval Postgraduate School, Monterey, California. Thesis. Approved for public release; distribution is unlimited. Identification of Commercial Items ... of Commercial Items Risk Factors ...
Cremers, Teresa L; Sampson, Thomas E
Calorimetric assay has the reputation of providing the highest precision and accuracy of all nondestructive assay measurements. Unfortunately, non-destructive assay practitioners and measurement consumers often extend, inappropriately, the high precision and accuracy of calorimetric assay to very low mass items. One purpose of this document is to present more realistic expectations for the random uncertainties associated with calorimetric assay for weapons grade plutonium items with masses of 200 grams or less.
NAC Aftermarket Brake Components Project (Secondary Items). SAE Paper #2006-01-3192, 25 September 2006, Grapevine. Version R4 (Final) Report ... By: Leo Miller, USA
... encryption software are distinguished from controls on other software regulated under the EAR. (a) Licensing... items (“EI”) classified under ECCN 5A002.a.1, a.2, a.5, a.6 and a.9; 5D002.a or c.1 for equipment... items may be exported under the provisions of License Exception ENC set forth in § 740.17 of the...
... From the Federal Register Online via the Government Publishing Office DEPARTMENT OF HOUSING AND URBAN DEVELOPMENT Federal Labor Standards Questionnaire(s); Complaint Intake Form AGENCY: Office of the...: Federal Labor Standards Questionnaire(s); Complaint Intake Form. OMB Approval Number: 2501-0018....
... Questionnaire(s); Complaint Intake Form AGENCY: Office of Labor Relations, HUD. ACTION: Notice. SUMMARY: HUD is... Standards Questionnaire; Complaint Intake Form. OMB Approval Number: 2501-0018. Type of Request: Extension... 4730SP, Federal Labor Standards Questionnaires, will be used by HUD and agencies administering...
Núñez, Juan L; León, Jaime; Grijalvo, Fernando; Martín Albo, José
The goals of this research were to translate and analyze the psychometric properties of the Learning Climate Questionnaire (LCQ) and to develop a short form. The LCQ is a 15-item self-report measure that assesses autonomy support in educational settings. A total of 422 students (60 men and 362 women) took part in this study. Results showed evidence of construct validity and adequate reliability for the LCQ. The short form consists of five items that showed sound psychometric properties. Results of Pearson correlation and Gower index showed high agreement between the long and short forms. In conclusion, both forms can be considered as preliminary versions of the original questionnaire to assess autonomy support in educational settings.
Background Despite the large number of parenting questionnaires, considerable disagreement exists about how to best assess parenting. Most of the instruments only assess limited aspects of parenting. To overcome this shortcoming, the “Comprehensive General Parenting Questionnaire” (CGPQ) was systematically developed. Such a measure is frequently requested in the area of childhood overweight. Methods First, an item bank of existing parenting measures was created assessing five key parenting constructs that have been identified across multiple theoretical approaches to parenting (Nurturance, Overprotection, Coercive control, Behavioral control, and Structure). Caregivers of 5- to 13-year-olds were asked to complete the online survey in the Netherlands (N = 821), Belgium (N = 435) and the United States (N = 241). In addition, a questionnaire regarding personality characteristics (“Big Five”) of the caregiver was administered and parents were asked to report about their child’s height and weight. Factor analyses and Item-Response Modeling (IRM) techniques were used to assess the underlying parenting constructs and for item reduction. Correlation analyses were performed to assess the relations between general parenting and personality of the caregivers, adjusting for socio-economic status (SES) indicators, to establish criterion validity. Multivariate linear regressions were performed to examine the associations of SES indicators and parenting with child BMI z-scores. Additionally, we assessed whether scores on the parenting constructs and child BMI z-scores differed depending on SES indicators. Results The reduced questionnaire (62 items) revealed acceptable fit of our parenting model and acceptable IRM item fit statistics. Caregiver personality was related as hypothesized with the CGPQ parenting constructs. While correcting for SES, overprotection was positively related to child BMI. The negative relationship between structure and BMI was
Gottipati, Gopichand; Karlsson, Mats O; Plan, Elodie L
In the current work, we present the methodology for development of an Item Response Theory model within a non-linear mixed effects framework to characterize the longitudinal changes of the Movement Disorder Society (sponsored revision) of Unified Parkinson's Disease Rating Scale (MDS-UPDRS) endpoint in Parkinson's disease (PD). The data were obtained from Parkinson's Progression Markers Initiative database and included 163,070 observations up to 48 months from 430 subjects belonging to De Novo PD cohort. The probability of obtaining a score, reported for each of the items in the questionnaire, was modeled as a function of the subject's disability. Initially, a single latent variable model was explored to characterize the disease progression over time. However, based on the understanding of the questionnaire set-up and the results of a residuals-based diagnostic tool, a three latent variable model with a mixture implementation was able to adequately describe longitudinal changes not only at the total score level but also at each individual item level. The linear progression rates obtained for the patient-reported items and the non-sided items were similar: for each, a typical subject takes roughly 50 months to progress linearly from the baseline by one standard deviation. However, for the sided items, it was found that the better side deteriorates quicker than the disabled side. This study presents a framework for analyzing MDS-UPDRS data, which can be adapted to more traditional UPDRS data collected in PD clinical trials and result in more efficient designs and analyses of such studies.
Preti, Antonio; Siddi, Sara; Vellante, Marcello; Scanu, Rosanna; Muratore, Tamara; Gabrielli, Mersia; Tronci, Debora; Masala, Carmelo; Petretto, Donatella Rita
The schizotypal personality questionnaire (SPQ) is used to characterize schizotypy, a complex construct helpful for the investigation of schizophrenia-related psychopathology and putative endophenotypes. The SPQ factor structure at item level has been rarely replicated and no study had tested a bifactor model of the SPQ so far. The unidimensional, the correlated, the second-order and the bifactor models of the SPQ were tested to evaluate whether the items converge into a major single factor defining the schizotypy-proneness of the participants, to be used for grouping purpose. Parallel principal component analysis (PCA) and confirmatory factor analysis (CFA) were used to determine the optimal number of factors and components in a cross-sectional, survey design involving 649 college students (males: 47%). The first-order, nine-subscale model was confirmed by CFA in the whole sample. The best evidence from parallel PCA in the training set was in favor of a two-factor model; the bifactor implementation of this model showed good fit in the subsequent CFA. Two main dimensions of positive and negative symptoms underlie schizotypy in non-clinical samples, entailing specific risk of psychosis. On a measurement level, the study provided support for the use of the total scores of the SPQ to characterize schizotypy.
Li, Chunxiao; Wang, Chee Keng John; Pyun, Do Young; Martindale, Russell
Given the significance of monitoring the critical environmental factors that facilitate athlete performance, this two-phase research aimed to validate and refine the revised talent development environment questionnaire (TDEQ). The TDEQ is a multidimensional self-report scale that assesses talented athletes' environmental experiences. Study 1 (the first phase) involved the examination of the revised TDEQ through an exploratory factor analysis (n = 363). This exploratory investigation identified a 28-item five-factor structure (i.e., TDEQ-5) with adequate internal consistency. Study 2 (the second phase) examined the factorial structure of the TDEQ-5, including convergent validity, discriminant validity, and group invariance (i.e., gender and sports type). The second phase was carried out with 496 talented athletes through the application of confirmatory factor analyses and multigroup invariance tests. The results supported the convergent validity, discriminant validity, and group invariance of the TDEQ-5. In conclusion, the TDEQ-5 with 25 items appears to be a reliable and valid scale for use in talent development environments.
Nower, Lia; Blaszczynski, Alex
The Pathways Model (Blaszczynski & Nower, 2002) is a theoretical framework that proposes three pathways for identifying etiological subtypes of problem gamblers. The model has been used to assist clinicians in developing individualized treatments that target not only the gambling behavior but also associated risk factors that may undermine recovery and precipitate relapse. The current study sought to develop and validate a new screening instrument, based on the Pathways Model for treatment-seeking gamblers. Participants were gamblers age 18 and over who scored 1+ symptoms on the Problem Gambling Severity Index of the Canadian Problem Gambling Index and presented to one of 22 participating treatment centers in Canada, the United States, and Australia (N = 1,176). Data were collected on 127 items, consisting of 62 core items that reflected variables in the Pathways Model and 65 experimental items derived from recent scholarly literature in gambling etiology. Exploratory and confirmatory factor analyses identified the following six factors: Antisocial Impulsive Risk-Taking, Stress-Coping, Mood Pre-Problem-Gambling Onset, Mood Post-Problem-Gambling Onset, Child Maltreatment, and Meaning Motivation. The Gambling Pathways Questionnaire showed excellent internal consistency (α = .937), with good to high reliability found for each of the six factors, ranging from .851 to .945. Cluster analysis results demonstrated that the three-factor model produced good model fit to the data: Cluster 1 (Behaviorally Conditioned Subtype), Cluster 2 (Emotionally Vulnerable Subtype) and Cluster 3 (Antisocial, Impulsive Risk-Taking Subtype). The present study is the first to present an empirical measure for assigning problem gamblers to etiological subtypes for use as a screening tool in treatment settings.
Alonso-Matías, Lizeth; Páez-Martínez, Nayeli; Reyes-Zamorano, Ernesto; González-Olvera, Jorge J
Inhalants are substances widely used as recreational drugs: their addictive potential has been demonstrated by many studies. There is no reported measurable evidence of craving in inhalant users. The main goal of this study was to design and obtain evidence of validity of the score of a questionnaire for the evaluation of inhalant craving (ICQ) in a Mexican population sample. The ICQ is a type of visual analog scale with ten items. Face validity was evaluated by a group of experts in the addiction field. Reviewers considered the completeness, semantics, and sentence structure to guarantee a conceptual representation of the items. The final ICQ was applied to a sample of 520 Mexican high school students, 46% women and 54% men, between 12-19 years of age (M=15.18; SD=1.48), from 7th to 12th grades. The internal consistency of the ICQ showed a Cronbach's Alpha of 0.947. The 10 items were grouped into one single factor, with a factor loading above 0.74 for each of them. ROC analysis breakpoint was located at 18.5 mm with a sensitivity of 0.855 and specificity of 0.753. Thirty-three per cent (n= 172) of the student population evaluated reported the use of inhalants at some point in their lifetimes, with an average of misuse beginning at 13.6 years of age. The ICQ showed adequate psychometric properties, suggesting that the instrument may be considered a useful tool for screening for craving in young inhalant users.
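The breakpoint, sensitivity, and specificity reported above come from a ROC analysis. The abstract does not say how the breakpoint was chosen, so the sketch below assumes one common rule, maximizing Youden's J (sensitivity + specificity - 1), and uses invented scores and labels. The function names are hypothetical.

```python
def sens_spec(scores, labels, cutoff):
    """Sensitivity and specificity when a score >= cutoff is called positive."""
    tp = sum(1 for s, y in zip(scores, labels) if y and s >= cutoff)
    fn = sum(1 for s, y in zip(scores, labels) if y and s < cutoff)
    tn = sum(1 for s, y in zip(scores, labels) if not y and s < cutoff)
    fp = sum(1 for s, y in zip(scores, labels) if not y and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def best_cutoff(scores, labels):
    """Observed score that maximizes Youden's J = sensitivity + specificity - 1."""
    candidates = sorted(set(scores))
    return max(candidates, key=lambda c: sum(sens_spec(scores, labels, c)) - 1)

# Invented visual analog scores (mm) and user status (1 = reported inhalant use).
scores = [5, 12, 17, 20, 25, 30, 8, 19, 40, 22]
labels = [0, 0, 0, 1, 1, 1, 0, 1, 1, 0]
cutoff = best_cutoff(scores, labels)
sens, spec = sens_spec(scores, labels, cutoff)
```

Other rules, such as fixing a minimum sensitivity for screening, would generally select a different cutoff from the same data.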
Engelberg, Ruth; Downey, Lois; Curtis, J Randall
The importance of good clinician-patient communication to quality end-of-life care has been well documented yet there are no validated measures that allow patients to assess the quality of this communication. Using a sample of hospice patients (n = 83) and patients with chronic obstructive pulmonary disease (COPD) (n = 113), we evaluated the psychometric characteristics of a 13-item patient-centered, patient-report questionnaire about the quality of end-of-life communication (QOC). Our purpose was to explore the measurement structure of the QOC items to ascertain if the items represent unitary or multidimensional constructs and to describe the construct validity of the QOC score(s). Analyses included: principal component analyses to identify scales, internal consistency analyses to demonstrate reliability, and correlational and group comparisons to support construct validity. Findings support the construction of two scales: a six-item "general communication skills" scale and a seven-item "communication about end-of-life care" scale. The two scales meet standards of scale measurement, including good factor convergence (values ≥ 0.63) and discrimination (values different ≥ 0.25), percent of variance explained (69.3%), and good internal consistency (alpha ≥ 0.79). The scales' construct validity is supported by significant associations (p
Arnetz, Bengt B; Broadbridge, Carissa L; Jamil, Hikmet; Lumley, Mark A; Pole, Nnamdi; Barkho, Evone; Fakhouri, Monty; Talia, Yousif Rofa; Arnetz, Judith E
Trauma exposure contributes to poor mental health among refugees, and exposure often is measured using a cumulative index of items from the Harvard Trauma Questionnaire (HTQ). Few studies, however, have asked whether trauma subtypes derived from the HTQ could be superior to this cumulative index in predicting mental health outcomes. A community sample of recently arrived Iraqi refugees (N = 298) completed the HTQ and measures of posttraumatic stress disorder (PTSD) and depression symptoms. Principal components analysis of HTQ items revealed a 5-component subtype model of trauma that accounted for more item variance than a 1-component solution. These trauma subtypes also accounted for more variance in PTSD and depression symptoms (12 and 10%, respectively) than did the cumulative trauma index (7 and 3%, respectively). Trauma subtypes provided more information than cumulative trauma in the prediction of negative mental health outcomes. Therefore, use of these subtypes may enhance the utility of the HTQ when assessing at-risk populations.
Controversy over the internal structure of personality inventories has centered on appropriate methodology and has often been based on differing criteria among researchers. Much of this controversy has revolved in particular around the Eysenck Personality Questionnaire (EPQ) or other Eysenck tests. An approach based on targeted rotations and the test's scoring key is proposed as a means of providing common criteria. These are based on the number of items having their highest loading on their keyed scale, the mean loading of keyed items and the number of items having their highest loading on non-keyed scales. Several data sets from earlier studies are analyzed, together with a new set based on the responses to the EPQ of 195 undergraduates, using the proposed criteria. Results were very similar across samples and suggested specific weaknesses with two EPQ scales. This provided support for the utility of the three criteria.
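The three criteria named in this abstract can be illustrated with a minimal sketch. It assumes a rotated loading matrix given as a list of rows (one per item) and a scoring key giving each item's keyed scale index. The function name, the use of absolute loadings, and the example numbers are all hypothetical and are not taken from the study.

```python
def keyed_loading_criteria(loadings, key):
    """Summarise how well a loading matrix agrees with a scoring key.

    loadings: list of rows, one per item, one column per scale/factor.
    key:      list giving the column index of each item's keyed scale.
    Absolute loadings are compared (an assumption made for this sketch).
    """
    keyed_hits = 0      # items whose highest loading is on their keyed scale
    keyed_total = 0.0   # running sum of loadings on the keyed scale
    for row, keyed in zip(loadings, key):
        magnitudes = [abs(value) for value in row]
        strongest = magnitudes.index(max(magnitudes))
        if strongest == keyed:
            keyed_hits += 1
        keyed_total += magnitudes[keyed]
    n_items = len(key)
    return {
        "keyed_hits": keyed_hits,
        "mean_keyed_loading": keyed_total / n_items,
        "non_keyed_hits": n_items - keyed_hits,
    }


# Hypothetical three-item, two-scale example.
example_loadings = [[0.7, 0.1], [0.2, 0.6], [0.5, 0.3]]
example_key = [0, 1, 1]
summary = keyed_loading_criteria(example_loadings, example_key)
print(summary)
```

In this toy input the third item loads most strongly on a scale other than its keyed one, so it counts toward the third criterion.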
Pestle, Sarah L; Chorpita, Bruce F; Schiffman, Jason
The Penn State Worry Questionnaire for Children (PSWQ-C; Chorpita, Tracey, Brown, Collica, & Barlow, 1997) is a 14-item self-report measure of worry in children and adolescents. Although the PSWQ-C has demonstrated favorable psychometric properties in small clinical and large community samples, this study represents the first psychometric evaluation of the PSWQ-C in a large clinical sample (N = 491). Factor analysis indicated a two-factor structure, in contrast to all previously published findings on the measure. The PSWQ-C demonstrated favorable psychometric properties in this sample, including high internal consistency, high convergent validity with related constructs, and acceptable discriminative validity between diagnostic categories. The performance of the 3 reverse-scored items was closely examined, and results indicated retaining all 14 items.
Roe, C; Myhre, K; Marchand, G H; Lau, B; Leivseth, G; Bautz-Holter, E
The main aim of this study was to evaluate the measurement properties of the Nordic Questionnaire for Psychological and Social Factors at Work (QPS Nordic) and the domains of demand, control and support. The Rasch analysis (RUMM 2030) was based on responses from 226 subjects with back pain who completed the QPS Nordic dimensions of demand, control, and social support (30 items) at one-year follow-up. The Rasch analysis revealed disordered thresholds in a total of 25 of the 30 items. The domains of demand, control and support fit the Rasch model when analyzed separately. The demand domain was well targeted, whereas patients with current neck and back pain had lower control and higher support than reflected by the questions. Two items revealed DIF by gender; otherwise, invariance to age, gender, occupation and sick-leave was documented. The demand, control, and support domains of QPS Nordic comprised unidimensional constructs with adequate measurement properties.
Markland, David; Oliver, Emily J
The Sociocultural Attitudes Towards Appearance Questionnaire-3 measures awareness and endorsement of societal appearance standards. The instrument has been subjected to exploratory factor analyses but to date no studies have reported a priori tests of its hypothesized factor structure using confirmatory factor analysis (CFA). The aim of the present study was to subject the SATAQ-3 to a CFA. Results from a non-clinical convenience sample of 369 women revealed an adequate fit of the model according to conventional criteria. However, detailed residual analysis indicated a significant lack of fit which was explainable by one mis-specified item and shared method variance due to similarities in item content. It was concluded that, with the removal of the mis-specified item, the degree of misfit was tolerable and the intended four-factor solution provides a satisfactory and parsimonious representation of the data.
Ryman, David H.; And Others
Describes study conducted with U.S. Marine Corps enlisted personnel to measure response time to computer-administered questionnaire items, and to evaluate how measurement of response time might be useful in various research areas. Topics addressed include mood states; the occurrence of straight lining; and experimental effects of sleep loss and…
... AB) § 229.1115 (Item 1115) Certain derivatives instruments. This item relates to derivative.... Instructions 2, 3 and 5 to Item 1114 of this Regulation AB apply to the information contemplated by...
Wendt, Anne; Kenny, Lorraine E
Many test developers suggest that multiple-choice items can be used to evaluate critical thinking if the items are focused on measuring higher order thinking ability. The literature supports the use of alternate item types to assess additional competencies, such as higher level cognitive processing and critical thinking, as well as ways to allow examinees to demonstrate their competencies differently. This research study surveyed nurses after they had taken a test composed of alternate item types paired with multiple-choice items. The participants were asked to provide opinions regarding the items and the item formats. Demographic information was also requested. In addition, information was collected as the participants responded to the items. The results of this study reveal that the participants thought that, in general, the items were more authentic and allowed them to demonstrate their competence better than multiple-choice items did. Further investigation into the optimal blend of alternate items and multiple-choice items is needed.
Cypher, B.L.; Spencer, K.A.; Scrivner, J.H.
Food item use by coyotes was compared between sexes and among age classes at the Naval Petroleum Reserves, California. Item use did not differ significantly between males and females. Although leporid was the item most frequently used by all age classes, item use differed significantly between pups (< 1 year), yearlings (1 year), and adults (> 1 year), probably due to differential use of secondary items. Variation in item use among age classes could potentially bias results of coyote food habit studies.
Cairnduff, Victoria; Dean, Moira; Koidis, Anastasios
Food preparation and storage behaviors in the home deviating from the "best practice" food safety recommendations may result in foodborne illnesses. Currently, there are limited tools available to fully evaluate the consumer knowledge, perceptions, and behavior in the area of refrigerator safety. The current study aimed to develop a valid and reliable tool in the form of a questionnaire, the Consumer Refrigerator Safety Questionnaire (CRSQ), for assessing systematically all these aspects. Items relating to refrigerator safety knowledge (n = 17), perceptions (n = 46), and reported behavior (n = 30) were developed and pilot tested by an expert reference group and various consumer groups to assess face and content validity (n = 20), item difficulty and consistency (n = 55), and construct validity (n = 23). The findings showed that the CRSQ has acceptable face and content validity with acceptable levels of item difficulty. Item consistency was observed for 12 of 15 in refrigerator safety knowledge. Further, all 5 of the subscales of consumer perceptions of refrigerator safety practices relating to risk of developing foodborne disease showed acceptable internal consistency (Cronbach's α value > 0.8). Construct validity of the CRSQ was shown to be very good (P = 0.022). The CRSQ exhibited acceptable test-retest reliability at 14 days with the majority of knowledge items (93.3%) and reported behavior items (96.4%) having correlation coefficients of greater than 0.70. Overall, the CRSQ was deemed valid and reliable in assessing refrigerator safety knowledge and behavior; therefore, it has the potential for future use in identifying groups of individuals at increased risk of deviating from recommended refrigerator safety practices, as well as the assessment of refrigerator safety knowledge and behavior for use before and after an intervention.
Krekmanova, Larisa; Hakeberg, Magnus; Robertson, Agneta; Klingberg, Gunilla
The aim of the study was to reduce everyday and dental treatment pain items included in the extended Children's Pain Inventory (CPI), used in a prior study on Swedish children and adolescents. Another aim was to use exploratory factor analysis (EFA) to expose hitherto undiscovered dimensions of the CPI pain variables and thus to improve the psychometric properties of CPI. As some pain items are relevant merely to some individuals, a new and more useful questionnaire construction would enhance the internal validity of the instrument in observational surveys. EFA was applied to the extended CPI instrument. 368 children, 8-19 years old, had answered a questionnaire comprising 10 dental and 28 everyday pain variables. These pain items were analysed using a series of sequentially implemented EFAs. Interpretations and decisions on the final number of the extracted factors were based on accepted principles: Kaiser's eigenvalue > 1 criterion, inspection of the scree plot, and the interpretability of the item loadings. The factors were orthogonally rotated using the Varimax method to maximize the amount of variance. Of all tested EFA models in the analysis, two-, three-, four-, and five-factor models surfaced. The interpretability of the factors and their item loadings was examined stepwise; the items were modulated and the factors re-evaluated. A four-factor pain model emerged as the most interpretable, explaining 79% of the total variance, with eigenvalues > 1.014. The factors were named to indicate the profile of their content: Factor I, cutting trauma to skin/mucosal pain; Factor II, head/neck pain; Factor III, tenderness/blunt trauma pain; Factor IV, oral/dental treatment pain.
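As a rough illustration of the Kaiser eigenvalue > 1 rule mentioned in this abstract, the sketch below counts the factors that would be retained from a list of eigenvalues and reports the share of total variance they account for. The eigenvalues are made up for a hypothetical four-variable correlation matrix (so they sum to 4) and the function name is an assumption; none of this reproduces the study's data or software.

```python
def kaiser_retained(eigenvalues, threshold=1.0):
    """Apply the Kaiser criterion to a list of eigenvalues.

    Returns the number of factors whose eigenvalue exceeds the threshold
    and the proportion of total variance those factors explain.
    """
    retained = [value for value in eigenvalues if value > threshold]
    proportion = sum(retained) / sum(eigenvalues)
    return len(retained), proportion


# Hypothetical eigenvalues of a 4 x 4 correlation matrix.
hypothetical_eigenvalues = [1.9, 1.1, 0.6, 0.4]
n_factors, explained = kaiser_retained(hypothetical_eigenvalues)
print(n_factors, round(explained, 2))
```

The abstract also relies on the scree plot and on interpretability, which a numeric cut-off like this cannot capture; the sketch covers only the eigenvalue rule.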
Prieto, Luis; Thorsen, Hanne; Juul, Kristian
Background: Quality of life of stoma patients is increasingly being addressed in clinical trials. However, the instruments used in the majority of these studies have not been validated specifically for stoma patients. The aim of this paper is to describe the development and validation of a quality-of-life instrument, "Stoma-QOL", specifically for patients with colostomy or ileostomy. Methods: Potential items were formulated in English on the basis of the results of a series of semi-structured interviews with 169 adult stoma patients. The process resulted in a preliminary 37-item version, which was translated into French, German, Spanish and Danish, and administered repeatedly to 182 patients with colostomy or ileostomy. A psychometric selection of items was performed through Rasch analysis. The measurement properties of the final questionnaire version were subsequently tested. Results: The 20 items in the final questionnaire covered four domains: sleep, sexual activity, relations to family and close friends, and social relations to other than family and close friends. These items were found to define a unidimensional variable according to Rasch specifications (Infit MNSQ < 1.3). Internal consistency reliability calculated as Cronbach's alpha was 0.92, i.e., highly reliable. Spearman's correlation coefficients of scores across times of administration were >0.88 (p < 0.01), indicating high test-retest reliability. Item calibrations by country calculated as ICC were 0.81 (95% CI 0.67–0.91), confirming cross-cultural comparability across the European countries included in the study. Conclusion: Given the adequacy of the metric properties of the Stoma-QOL suggested by the psychometric analyses, this study confirms the suitability of the instrument in clinical practice and in clinical research. PMID:16219109
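Cronbach's alpha, reported here and in several other abstracts in this list, can be sketched using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The response data and function name below are hypothetical and only illustrate the calculation; they are unrelated to the Stoma-QOL data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Compute Cronbach's alpha for a list of respondents' item scores.

    responses: list of rows, one per respondent, each a list of k item scores.
    Uses sample variances (n - 1 denominator) for items and total scores.
    """
    n_items = len(responses[0])
    item_variances = [
        variance([row[item] for row in responses]) for item in range(n_items)
    ]
    total_variance = variance([sum(row) for row in responses])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses from five people to a three-item scale.
hypothetical_responses = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 2],
    [3, 3, 3],
]
print(round(cronbach_alpha(hypothetical_responses), 3))
```

When every item ranks respondents identically and with equal spread, alpha reaches 1; weaker agreement among items lowers it.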
Barrett, Frederick S; Bradstreet, Matthew P; Leoutsakos, Jeannie-Marie S; Johnson, Matthew W; Griffiths, Roland R
Acute adverse psychological reactions to classic hallucinogens ("bad trips" or "challenging experiences"), while usually benign with proper screening, preparation, and support in controlled settings, remain a safety concern in uncontrolled settings (such as illicit use contexts). Anecdotal and case reports suggest potential adverse acute symptoms including affective (panic, depressed mood), cognitive (confusion, feelings of losing sanity), and somatic (nausea, heart palpitation) symptoms. Responses to items from several hallucinogen-sensitive questionnaires (Hallucinogen Rating Scale, the States of Consciousness Questionnaire, and the Five-Dimensional Altered States of Consciousness questionnaire) in an Internet survey of challenging experiences with the classic hallucinogen psilocybin were used to construct and validate a Challenging Experience Questionnaire. The stand-alone Challenging Experience Questionnaire was then validated in a separate sample. Seven Challenging Experience Questionnaire factors (grief, fear, death, insanity, isolation, physical distress, and paranoia) provide a phenomenological profile of challenging aspects of experiences with psilocybin. Factor scores were associated with difficulty, meaningfulness, spiritual significance, and change in well-being attributed to the challenging experiences. The factor structure did not differ based on gender or prior struggle with anxiety or depression. The Challenging Experience Questionnaire provides a basis for future investigation of predictors and outcomes of challenging experiences with classic hallucinogens.
Greco, Laurie A.; Lambert, Warren; Baer, Ruth A.
The authors describe the development and validation of the Avoidance and Fusion Questionnaire for Youth (AFQ-Y), a child-report measure of psychological inflexibility engendered by high levels of cognitive fusion and experiential avoidance. Consistent with the theory underlying acceptance and commitment therapy (ACT), items converged into a…