rating scale items: Topics by Science.gov

Sample records for rating scale items

Identifying Differential Item Functioning of Rating Scale Items with the Rasch Model: An Introduction and an Application

ERIC Educational Resources Information Center

Myers, Nicholas D.; Wolfe, Edward W.; Feltz, Deborah L.; Penfield, Randall D.

2006-01-01

This study (a) provided a conceptual introduction to differential item functioning (DIF), (b) introduced the multifaceted Rasch rating scale model (MRSM) and an associated statistical procedure for identifying DIF in rating scale items, and (c) applied this procedure to previously collected data from American coaches who responded to the coaching…
Capturing the true burden of dystonia on patients: the Cervical Dystonia Impact Profile (CDIP-58).

PubMed

Cano, S J; Warner, T T; Linacre, J M; Bhatia, K P; Thompson, A J; Fitzpatrick, R; Hobart, J C

2004-11-09

To develop a new rating scale for measuring the health impact of cervical dystonia (CD) that includes patients' perceptions and complements existing observer dependent clinician rating scales. Scale development was in three stages. In Stage 1, a large pool of items was generated from patient interviews (n = 25), expert opinion, and literature review. In Stage 2, these items were administered by postal survey to people with CD. The resulting data were analyzed using Rasch item analysis to construct, from the item pool, a rating scale that satisfied criteria for rigorous measurement. In Stage 3, the measurement properties of this rating scale were examined in an independent sample of people with CD. In Stage 1, 150 items concerning the health impact of CD were generated. In Stage 2, 556 people completed questionnaires (87% response rate) and a 58-item rating scale measuring the health impact of CD in eight areas was constructed (CD Impact Profile, CDIP-58). In Stage 3, CDIP-58 data from 391 people (87% response rate) were received. Analyses supported the measurement of eight unidimensional constructs (infit mean square range 0.62 to 1.50), item calibration (33.37 to 67.56), and patient separation statistics (2.59 to 3.38). Items demonstrated stable calibrations in subgroups of people with CD supporting the stability of the CDIP-58. The CDIP-58 is a reliable and valid patient-based rating scale measuring the health impact of CD in eight health dimensions.
Generalizability and Dependability of a Multi-Item Direct Behavior Rating Scale in a Kindergarten Classroom Setting

ERIC Educational Resources Information Center

Wickerd, Garry; Hulac, David

2017-01-01

Accurate and rapid identification of students displaying behavioral problems requires instrumentation that is user friendly and reliable. The purpose of the study was to evaluate a multi-item direct behavior rating scale called the Direct Behavior Rating-Multiple Item Scale (DBR-MIS) for disruptive behavior to determine the number of…
Psychometric properties of the Multidimensional Assessment of Fatigue scale in traumatic brain injury: an NIDRR Traumatic Brain Injury Model Systems study.

PubMed

Lequerica, Anthony; Bushnik, Tamara; Wright, Jerry; Kolakowsky-Hayner, Stephanie A; Hammond, Flora M; Dijkers, Marcel P; Cantor, Joshua

2012-01-01

To investigate the psychometric properties of the Multidimensional Assessment of Fatigue (MAF) scale in a traumatic brain injury (TBI) sample. Prospective survey study. Community. One hundred sixty-seven individuals with TBI admitted for inpatient rehabilitation, enrolled into the TBI Model Systems national database, and followed up at either the first or second year postinjury. Not applicable. Multidimensional Assessment of Fatigue. The initial analysis, using items 1 to 14, which are based on a 10-point rating scale, found that only 1 item ("walking") misfit the overall construct of fatigue in this TBI population. However, this 10-point rating scale was found to have disordered thresholds. When ratings were collapsed into 4 response categories, all MAF items used to calculate the Global Fatigue Index formed a unidimensional scale. Findings generally support the unidimensionality of the MAF when used in a TBI population but call into question the use of a 10-point rating scale for items 1 to 14. Further study is needed to investigate the use of a 4-category rating scale across all items and the fit of the "walking" item for a measure of fatigue among individuals with TBI.
Assessment of Competence in EVAR Procedures: A Novel Rating Scale Developed by the Delphi Technique.

PubMed

Strøm, M; Lönn, L; Bech, B; Schroeder, T V; Konge, L

2017-07-01

To develop a procedure specific global rating scale for assessment of operator competence in endovascular aortic repair (EVAR). A Delphi approach was used to achieve expert consensus. A panel of 32 international experts (median 300 EVAR procedures, range 200-3000) from vascular surgery (n = 21) and radiology (n = 11) was established. The first Delphi round was based on a review of endovascular skills assessment papers, stent graft instructions for use, and structured interviews. It led to a primary pool of 83 items that were formulated as global rating scale items with tentative anchors. Iterative Delphi rounds were executed. The panellists rated the importance of each item on a 5 point Likert scale. Consensus was defined as 80% of the panel rating an item 4 or 5 in the primary round and 90% in subsequent rounds. Consensus on the final assessment tool was defined as Cronbach's alpha > .8 after a minimum of three rounds. Thirty-two of 35 invited experts participated. Three rounds of surveys were completed with a completion rate of 100% in the first two rounds and 91% in round three. The 83 primary assessment items were supplemented with five items suggested by the panel and reduced to seven pivotal assessment items that reached consensus, Cronbach's alpha = 0.82. The seven item rating scale covers key elements of competence in EVAR stent placement and deployment. Each item has well defined grades with explicit anchors at unacceptable, acceptable, and superior performance on a 5 point Likert scale. The Delphi methodology allowed for international consensus on a new procedure specific global rating scale for assessment of competence in EVAR. The resulting scale, EndoVascular Aortic Repair Assessment of Technical Expertise (EVARATE), represents key elements in the procedure. EVARATE constitutes an assessment tool for providing structured feedback to endovascular operators in training. Copyright © 2017 European Society for Vascular Surgery. Published by Elsevier Ltd. All rights reserved.
An alternative to Rasch analysis using triadic comparisons and multi-dimensional scaling

NASA Astrophysics Data System (ADS)

Bradley, C.; Massof, R. W.

2016-11-01

Rasch analysis is a principled approach for estimating the magnitude of some shared property of a set of items when a group of people assign ordinal ratings to them. In the general case, Rasch analysis not only estimates person and item measures on the same invariant scale, but also estimates the average thresholds used by the population to define rating categories. However, Rasch analysis fails when there is insufficient variance in the observed responses because it assumes a probabilistic relationship between person measures, item measures and the rating assigned by a person to an item. When only a single person is rating all items, there may be cases where the person assigns the same rating to many items no matter how many times he rates them. We introduce an alternative to Rasch analysis for precisely these situations. Our approach leverages multi-dimensional scaling (MDS) and requires only rank orderings of items and rank orderings of pairs of distances between items to work. Simulations show one variant of this approach - triadic comparisons with non-metric MDS - provides highly accurate estimates of item measures in realistic situations.
Using the Cumulative Common Log-Odds Ratio to Identify Differential Item Functioning of Rating Scale Items in the Exercise and Sport Sciences

ERIC Educational Resources Information Center

Penfield, Randall D.; Giacobbi, Peter R., Jr.; Myers, Nicholas D.

2007-01-01

One aspect of construct validity is the extent to which the measurement properties of a rating scale are invariant across the groups being compared. An increasingly used method for assessing between-group differences in the measurement properties of items of a scale is the framework of differential item functioning (DIF). In this paper we…
Development process of an assessment tool for disruptive behavior problems in cross-cultural settings: the Disruptive Behavior International Scale – Nepal version (DBIS-N)

PubMed Central

Burkey, Matthew D.; Ghimire, Lajina; Adhikari, Ramesh P.; Kohrt, Brandon A.; Jordans, Mark J. D.; Haroz, Emily; Wissow, Lawrence

2017-01-01

Systematic processes are needed to develop valid measurement instruments for disruptive behavior disorders (DBDs) in cross-cultural settings. We employed a four-step process in Nepal to identify and select items for a culturally valid assessment instrument: 1) We extracted items from validated scales and local free-list interviews. 2) Parents, teachers, and peers (n=30) rated the perceived relevance and importance of behavior problems. 3) Highly rated items were piloted with children (n=60) in Nepal. 4) We evaluated internal consistency of the final scale. We identified 49 symptoms from 11 scales, and 39 behavior problems from free-list interviews (n=72). After dropping items for low ratings of relevance and severity and for poor item-test correlation, low frequency, and/or poor acceptability in pilot testing, 16 items remained for the Disruptive Behavior International Scale—Nepali version (DBIS-N). The final scale had good internal consistency (α=0.86). A 4-step systematic approach to scale development including local participation yielded an internally consistent scale that included culturally relevant behavior problems. PMID:28093575
Psychometric properties of the communication Confidence Rating Scale for Aphasia (CCRSA): phase 1.

PubMed

Cherney, Leora R; Babbitt, Edna M; Semik, Patrick; Heinemann, Allen W

2011-01-01

Confidence is a construct that has not been explored previously in aphasia research. We developed the Communication Confidence Rating Scale for Aphasia (CCRSA) to assess confidence in communicating in a variety of activities and evaluated its psychometric properties using rating scale (Rasch) analysis. The CCRSA was administered to 21 individuals with aphasia before and after participation in a computer-based language therapy study. Person reliability of the 8-item CCRSA was .77. The 5-category rating scale demonstrated monotonic increases in average measures from low to high ratings. However, one item ("I follow news, sports, stories on TV/movies") misfit the construct defined by the other items (mean square infit = 1.69, item-measure correlation = .41). Deleting this item improved reliability to .79; the 7 remaining items demonstrated excellent fit to the underlying construct, although there was a modest ceiling effect in this sample. Pre- to posttreatment changes on the 7-item CCRSA measure were statistically significant using a paired samples t test. Findings support the reliability and sensitivity of the CCRSA in assessing participants' self-report of communication confidence. Further evaluation of communication confidence is required with larger and more diverse samples.
[Development of competency to stand trial rating scale in offenders with mental disorders].

PubMed

Chen, Xiao-Bing; Cai, Wei-Xiong

2013-04-01

According with Chinese legal system, to develop a competency to stand trial rating scale in offenders with mental disorders. Proceeding from the juristical elements, 15 items were extracted and formulated a preliminary instrument named the competency to stand trial rating scale in offenders with mental disorders. The item analysis included six aspects, which were critical ratio, item-total correlation, corrected item-total correlation, alpha value if item deleted, communalities of items, and factor loading. The Logistic regression equation and cut-off score of ROC curve were used to explore the diagnostic efficiency. The data of critical ratio of extreme group were 18.390-46.763; item-total correlation, 0.639-0.952; corrected item-total correlation, 0.582-0.944; communalities of items, 0.377-0.916; and factor loadings, 0.614-0.957. Seven items were included in the regression equation and the accuracy of back substitution test was 96.0%. The score of 33 was ascertained as the cut-off score by ROC fitting curve, the overlapping ratio compared with the expertise was 95.8%. The sensibility and the specificity were 0.938 and 0.966, respectively, while the positive and negative likelihood ratios were 27.67 and 0.06, respectively. With all items satisfied the requirement of homogeneity test, the rating scale has a reasonable construct and excellent diagnostic efficiency.
Should Global Items on Student Rating Scales Be Used for Summative Decisions?

ERIC Educational Resources Information Center

Berk, Ronald A.

2013-01-01

One of the simplest indicators of teaching or course effectiveness is student ratings on one or more global items from the entire rating scale. That approach seems intuitively sound and easy to use. Global items have even been recommended by a few researchers to get a quick-read, at-a-glance summary for summative decisions about faculty. The…
Item Response Theory Analyses of the Parent and Teacher Ratings of the DSM-IV ADHD Rating Scale

ERIC Educational Resources Information Center

Gomez, Rapson

2008-01-01

The graded response model (GRM), which is based on item response theory (IRT), was used to evaluate the psychometric properties of the inattention and hyperactivity/impulsivity symptoms in an ADHD rating scale. To accomplish this, parents and teachers completed the DSM-IV ADHD Rating Scale (DARS; Gomez et al., "Journal of Child Psychology and…
The Structure of the Narcissistic Personality Inventory With Binary and Rating Scale Items.

PubMed

Boldero, Jennifer M; Bell, Richard C; Davies, Richard C

2015-01-01

Narcissistic Personality Inventory (NPI) items typically have a forced-choice format, comprising a narcissistic and a nonnarcissistic statement. Recently, some have presented the narcissistic statements and asked individuals to either indicate whether they agree or disagree that the statements are self-descriptive (i.e., a binary response format) or to rate the extent to which they agree or disagree that these statements are self-descriptive on a Likert scale (i.e., a rating response format). The current research demonstrates that when NPI items have a binary or a rating response format, the scale has a bifactor structure (i.e., the items load on a general factor and on 6 specific group factors). Indexes of factor strength suggest that the data are unidimensional enough for the NPI's general factor to be considered a measure of a narcissism latent trait. However, the rating item general factor assessed more narcissism components than the binary item one. The positive correlations of the NPI's general factor, assessed when items have a rating response format, were moderate with self-esteem, strong with a measure of narcissistic grandiosity, and weak with 2 measures of narcissistic vulnerability. Together, the results suggest that using a rating format for items enhances the information provided by the NPI.
THE HUMAN BEHAVIOR RATING SCALE-BRIEF: A TOOL TO MEASURE 21ST CENTURY SKILLS OF K-12 LEARNERS.

PubMed

Woods-Groves, Suzanne

2015-06-01

Currently there is a call for brief concise measurements to appraise relevant 21st century college readiness skills in K-12 learners. This study employed K-12 teachers' ratings for over 3,000 students for an existing 91-item rating scale, the Human Behavior Rating Scale, that measured the 21st century skills of persistence, curiosity, externalizing affect, internalizing affect, and cognition. Teachers' ratings for K-12 learners were used to develop a brief, concise, and manageable 30-item tool, the Human Behavior Rating Scale-Brief. Results yielded high internal consistency coefficients and inter-item correlations. The items were not biased with regard to student sex or race, and were supported through confirmatory factor analyses. In addition, when teachers' ratings were compared with students' academic and behavioral performance data, moderate to strong relationships were revealed. This study provided an essential first step in the development of a psychometrically sound, manageable, and brief tool to appraise 21st century skills in K-12 learners.
Social Desirability Scale Values of Locus of Control Items

ERIC Educational Resources Information Center

Kestenbaum, Joel M.

1976-01-01

Subjects rated each item in Rotter's I-E Scale for its social desirability value. Social desirability scale values (SDSV) of paired items were compared with one another. Results indicate that paired items are not similar in their SDSV, thus enabling subjects to respond on the basis of social desirability. (Author/DEP)
A new, female-specific irritability rating scale

PubMed Central

Born, Leslie; Koren, Gideon; Lin, Elizabeth; Steiner, Meir

2008-01-01

Objective Irritability is a prominent symptom in the spectrum of female-specific mood disorders, and in some women, irritability is serious enough to disrupt their lives and warrant treatment. The objective of this research was to develop a new, female-specific state measure of irritability. Methods We constructed self-rating and observer rating scales using items derived from spontaneous descriptions of irritability by women with mood disturbances related to the menstrual cycle, childbearing or menopause. Following a pretest, the scales were shortened to the core items of irritability (annoyance, anger, tension, hostility, sensitivity to noise and touch) and tested on a new cohort of patients. Results The 14-item Self-Rating Scale and the 5-item Observer Rating Scale showed evidence for internal consistency (Self-Rating: n = 36 patients, Cronbach's α = 0.9257, mean interitem correlation = 0.4690; Observer Rating: Cronbach's α = 0.7418, mean interitem correlation = 0.3616), Self-Rating test–retest reliability (n = 29 patients, rs = 0.704, p = 0.01) and interrater reliability (n = 20 patients; τb = 1.000, p = 0.001). Conclusion This new, female-specific scale for rating irritability has the potential to further the evaluation of this prominent symptom cluster and increase specificity in clinical assessments of emotional disturbances related to reproductive cyclicity in women. PMID:18592028
Validity and Reliability of Trichotomous Achievement Goal Scale

ERIC Educational Resources Information Center

Ilker, Gokce Erturan; Arslan, Yunus; Demirhan, Giyasettin

2011-01-01

The Trichotomous Achievement Goal Scale was developed by Agbuga and Xiang (2008) by including selected items from the scales of Duda and Nicholls (1992), Elliot (1999), and Elliot and Church (1997) and adapting them into Turkish. The scale consists of 18 items, and students rated each item on a 7-point Likert scale. To ascertain the validity and…
Multi-Item Direct Behavior Ratings: Dependability of Two Levels of Assessment Specificity

ERIC Educational Resources Information Center

Volpe, Robert J.; Briesch, Amy M.

2015-01-01

Direct Behavior Rating-Multi-Item Scales (DBR-MIS) have been developed as formative measures of behavioral assessment for use in school-based problem-solving models. Initial research has examined the dependability of composite scores generated by summing all items comprising the scales. However, it has been argued that DBR-MIS may offer assessment…
Combining agreement and frequency rating scales to optimize psychometrics in measuring behavioral health functioning.

PubMed

Marfeo, Elizabeth E; Ni, Pengsheng; Chan, Leighton; Rasch, Elizabeth K; Jette, Alan M

2014-07-01

The goal of this article was to investigate optimal functioning of using frequency vs. agreement rating scales in two subdomains of the newly developed Work Disability Functional Assessment Battery: the Mood & Emotions and Behavioral Control scales. A psychometric study comparing rating scale performance embedded in a cross-sectional survey used for developing a new instrument to measure behavioral health functioning among adults applying for disability benefits in the United States was performed. Within the sample of 1,017 respondents, the range of response category endorsement was similar for both frequency and agreement item types for both scales. There were fewer missing values in the frequency items than the agreement items. Both frequency and agreement items showed acceptable reliability. The frequency items demonstrated optimal effectiveness around the mean ± 1-2 standard deviation score range; the agreement items performed better at the extreme score ranges. Findings suggest an optimal response format requires a mix of both agreement-based and frequency-based items. Frequency items perform better in the normal range of responses, capturing specific behaviors, reactions, or situations that may elicit a specific response. Agreement items do better for those whose scores are more extreme and capture subjective content related to general attitudes, behaviors, or feelings of work-related behavioral health functioning. Copyright © 2014 Elsevier Inc. All rights reserved.
An item response theory evaluation of the young mania rating scale and the montgomery-asberg depression rating scale in the systematic treatment enhancement program for bipolar disorder (STEP-BD).

PubMed

Prisciandaro, James J; Tolliver, Bryan K

2016-11-15

The Young Mania Rating Scale (YMRS) and Montgomery-Asberg Depression Rating Scale (MADRS) are among the most widely used outcome measures for clinical trials of medications for Bipolar Disorder (BD). Nonetheless, very few studies have examined the measurement characteristics of the YMRS and MADRS in individuals with BD using modern psychometric methods. The present study evaluated the YMRS and MADRS in the Systematic Treatment Enhancement Program for BD (STEP-BD) study using Item Response Theory (IRT). Baseline data from 3716 STEP-BD participants were available for the present analysis. The Graded Response Model (GRM) was fit separately to YMRS and MADRS item responses. Differential item functioning (DIF) was examined by regressing a variety of clinically relevant covariates (e.g., sex, substance dependence) on all test items and on the latent symptom severity dimension, within each scale. Both scales: 1) contained several items that provided little or no psychometric information, 2) were inefficient, in that the majority of item response categories did not provide incremental psychometric information, 3) poorly measured participants outside of a narrow band of severity, 4) evidenced DIF for nearly all items, suggesting that item responses were, in part, determined by factors other than symptom severity. Limited to outpatients; DIF analysis only sensitive to certain forms of DIF. The present study provides evidence for significant measurement problems involving the YMRS and MADRS. More work is needed to refine these measures and/or develop suitable alternative measures of BD symptomatology for clinical trials research. Copyright © 2016 Elsevier B.V. All rights reserved.

[Preliminary study on civil capacity rating scale for mental disabled patients].

PubMed

Zhang, Qin-Ting; Pang, Yan-Xia; Cai, Wei-Xiong; Tang, Tao; Huang, Fu-Yin

2010-10-01

To create civil capacity rating scale for mentally disabled patients, and explore its feasibility during the forensic psychiatric expertise. The civil capacity-related items were determined after discussion and consultation. The civil capacity rating scale for mentally disabled patients was established and the manual was created according to the logistic sequence of the assessment. The rating scale was used during the civil assessment in four institutes. There were 14 items in civil capacity rating scale for mentally disabled patients. Two hundred and two subjects were recruited and divided into three groups according to the experts' opinion on their civil capacities: full civil capacity, partial civil capacity and no civil capacity. The mean score of the three groups were 2.32 +/- 2.45, 11.62 +/- 4.01 and 25.02 +/- 3.90, respectively, and there was statistical differences among the groups. The Cronbach alpha of the rating scale was 0.9724, and during the split-reliability test, the two-splited part of the rating scale were highly correlated (r = 0.9729, P = 0.000). The Spearman correlative coefficient between each item and the score of the rating scale was from 0.643 to 0.882 (P = 0.000). There was good correlation between the conclusion according to the rating scale and the experts' opinion (kappa = 0.841, P = 0.000). When the discriminate analysis was used, 7 items were included into the discrimination equation, and 92.6% subjects were identified as the correct groups using the equation. There is satisfied reliability and validity on civil capacity rating scale for mentally disabled patients. The rating scale can be used as effective tools to grade their civil capacity during the forensic expertise.
When less is more: validating a brief scale to rate interprofessional team competencies.

PubMed

Lie, Désirée A; Richter-Lagha, Regina; Forest, Christopher P; Walsh, Anne; Lohenry, Kevin

2017-01-01

There is a need for validated and easy-to-apply behavior-based tools for assessing interprofessional team competencies in clinical settings. The seven-item observer-based Modified McMaster-Ottawa scale was developed for the Team Objective Structured Clinical Encounter (TOSCE) to assess individual and team performance in interprofessional patient encounters. We aimed to improve scale usability for clinical settings by reducing item numbers while maintaining generalizability; and to explore the minimum number of observed cases required to achieve modest generalizability for giving feedback. We administered a two-station TOSCE in April 2016 to 63 students split into 16 newly-formed teams, each consisting of four professions. The stations were of similar difficulty. We trained sixteen faculty to rate two teams each. We examined individual and team performance scores using generalizability (G) theory and principal component analysis (PCA). The seven-item scale shows modest generalizability (.75) with individual scores. PCA revealed multicollinearity and singularity among scale items and we identified three potential items for removal. Reducing items for individual scores from seven to four (measuring Collaboration, Roles, Patient/Family-centeredness, and Conflict Management) changed scale generalizability from .75 to .73. Performance assessment with two cases is associated with reasonable generalizability (.73). Students in newly-formed interprofessional teams show a learning curve after one patient encounter. Team scores from a two-station TOSCE demonstrate low generalizability whether the scale consisted of four (.53) or seven items (.55). The four-item Modified McMaster-Ottawa scale for assessing individual performance in interprofessional teams retains the generalizability and validity of the seven-item scale. Observation of students in teams interacting with two different patients provides reasonably reliable ratings for giving feedback. The four-item scale has potential for assessing individual student skills and the impact of IPE curricula in clinical practice settings. IPE: Interprofessional education; SP: Standardized patient; TOSCE: Team objective structured clinical encounter.
The Single-Item Math Anxiety Scale: An Alternative Way of Measuring Mathematical Anxiety

ERIC Educational Resources Information Center

Núñez-Peña, M. Isabel; Guilera, Georgina; Suárez-Pellicioni, Macarena

2014-01-01

This study examined whether the Single-Item Math Anxiety Scale (SIMA), based on the item suggested by Ashcraft, provided valid and reliable scores of mathematical anxiety. A large sample of university students (n = 279) was administered the SIMA and the 25-item Shortened Math Anxiety Rating Scale (sMARS) to evaluate the relation between the scores…
Clinical validation of a non-heteronormative version of the Social Interaction Anxiety Scale (SIAS).

PubMed

Lindner, Philip; Martell, Christopher; Bergström, Jan; Andersson, Gerhard; Carlbring, Per

2013-12-19

Despite welcomed changes in societal attitudes and practices towards sexual minorities, instances of heteronormativity can still be found within healthcare and research. The Social Interaction Anxiety Scale (SIAS) is a valid and reliable self-rating scale of social anxiety, which includes one item (number 14) with an explicit heteronormative assumption about the respondent's sexual orientation. This heteronormative phrasing may confuse, insult or alienate sexual minority respondents. A clinically validated version of the SIAS featuring a non-heteronormative phrasing of item 14 is thus needed. 129 participants with diagnosed social anxiety disorder, enrolled in an Internet-based intervention trial, were randomly assigned to responding to the SIAS featuring either the original or a novel non-heteronormative phrasing of item 14, and then answered the other item version. Within-subject, correlation between item versions was calculated and the two scores were statistically compared. The two items' correlations with the other SIAS items and other psychiatric rating scales were also statistically compared. Item versions were highly correlated and scores did not differ statistically. The two items' correlations with other measures did not differ statistically either. The SIAS can be revised with a non-heteronormative formulation of item 14 with psychometric equivalence on item and scale level. Implications for other psychiatric instruments with heteronormative phrasings are discussed.
Evaluating item endorsement rates for the MMPI-2-RF F-r and Fp-r scales across ethnic, gender, and diagnostic groups with a forensic inpatient sample.

PubMed

Glassmire, David M; Jhawar, Amandeep; Burchett, Danielle; Tarescavage, Anthony M

2017-05-01

The Minnesota Multiphasic Personality Inventory-2 (MMPI-2) F(p) (Infrequency-Psychopathology) scale was developed to measure overreporting in a manner that was minimally confounded by genuine psychopathology, which was a problem with using the MMPI-2 F (Infrequency) scale among patients with severe mental illness. Although revised versions of both of these scales are included on the MMPI-2-Restructured Form and used in a forensic context, no item-level research has been conducted on their sensitivity to genuine psychopathology among forensic psychiatric inpatients. Therefore, we examined the psychometric properties of the scales in a sample of 438 criminally committed forensic psychiatric inpatients who were adjudicated as not guilty by reason of insanity and had no known incentive to overreport. We found that 20 of the 21 Fp-r items (95.2%) demonstrated endorsement rates ≤ 20%, with 14 of the items (66.7%) endorsed by less than 10% of the sample. Similar findings were observed across genders and across patients with mood and psychotic disorders. The one item endorsed by more than 20% of the sample had a 23.7% overall endorsement rate and significantly different endorsement rates across ethnic groups, with the highest endorsements occurring among Hispanic/Latino (43.3% endorsement rate) patients. Endorsement rates of F-r items were generally higher than for Fp-r items. At the scale level, we also examined correlations with the Restructured Clinical Scales and found that Fp-r demonstrated lower correlations than F-r, indicating that Fp-r is less associated with a broad range of psychopathology. Finally, we found that Fp-r demonstrated slightly higher specificity values than F-r at all T score cutoffs. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Rasch analysis of the Italian Lower Extremity Functional Scale: insights on dimensionality and suggestions for an improved 15-item version.

PubMed

Bravini, Elisabetta; Giordano, Andrea; Sartorio, Francesco; Ferriero, Giorgio; Vercelli, Stefano

2017-04-01

To investigate dimensionality and the measurement properties of the Italian Lower Extremity Functional Scale using both classical test theory and Rasch analysis methods, and to provide insights for an improved version of the questionnaire. Rasch analysis of individual patient data. Rehabilitation centre. A total of 135 patients with musculoskeletal diseases of the lower limb. Patients were assessed with the Lower Extremity Functional Scale before and after the rehabilitation. Rasch analysis showed some problems related to rating scale category functioning, items fit, and items redundancy. After an iterative process, which resulted in the reduction of rating scale categories from 5 to 4, and in the deletion of 5 items, the psychometric properties of the Italian Lower Extremity Functional Scale improved. The retained 15 items with a 4-level response format fitted the Rasch model (internal construct validity), and demonstrated unidimensionality and good reliability indices (person-separation reliability 0.92; Cronbach's alpha 0.94). Then, the analysis showed differential item functioning for six of the retained items. The sensitivity to change of the Italian 15-item Lower Extremity Functional Scale was nearly equal to the one of the original version (effect size: 0.93 and 0.98; standardized response mean: 1.20 and 1.28, respectively for the 15-item and 20-item versions). The Italian Lower Extremity Functional Scale had unsatisfactory measurement properties. However, removing five items and simplifying the scoring from 5 to 4 levels resulted in a more valid measure with good reliability and sensitivity to change.
Dependability and Treatment Sensitivity of Multi-Item Direct Behavior Rating Scales for Interpersonal Peer Conflict

ERIC Educational Resources Information Center

Daniels, Brian; Volpe, Robert J.; Briesch, Amy M.; Gadow, Kenneth D.

2017-01-01

Direct behavior rating (DBR) represents a feasible method for monitoring student behavior in the classroom; however, limited work to date has focused on the use of multi-item scales. The purposes of the study were to examine the (a) dependability of data obtained from a multi-item DBR designed to assess peer conflict and (b) treatment sensitivity…
Conners' Teacher Rating Scale for Preschool Children: A Revised, Brief, Age-Specific Measure

ERIC Educational Resources Information Center

Purpura, David J.; Lonigan, Christopher J.

2009-01-01

The Conners' Teacher Rating Scale-Revised (CTRS-R) is one of the most commonly used measures of child behavior problems. However, the scale length and the appropriateness of some of the items on the scale may reduce the usefulness of the CTRS-R for use with preschoolers. In this study, a Graded Response Model analysis based on Item Response Theory…
Development of a clinician-administered National Institutes of Health-Brief Fatigue Inventory: A measure of fatigue in the context of depressive disorders.

PubMed

Saligan, Leorey N; Luckenbaugh, David A; Slonena, Elizabeth E; Machado-Vieira, Rodrigo; Zarate, Carlos A

2015-09-01

Fatigue is a complex, multidimensional condition. Although it is often associated with depression, it is not known whether it has a distinct network from depression or whether it can be clinically evaluated, separately. This study describes preliminary findings in the development of a brief, clinician-administered instrument to measure fatigue in the context of depressive disorders using items from existing clinician-administered depression and mania scales. Based on items from prior fatigue measurements, items were selected from the Hamilton Depression Rating Scale (HDRS), Montgomery-Asberg Depression Rating Scale (MADRS), Young Mania Rating Scale, and Structured Interview Guide for HDRS with Atypical Depression. The final items composed the NIH-Brief Fatigue Inventory (NIH-BFI). Responses from 89 depressed adults collected pre- and post-antidepressant therapy (ADT) determined the reliability and consistency of the NIH-BFI using Cronbach's alpha and principal components analysis (PCA). Correlations of the NIH-BFI and fatigue items from other scales before and after ADT explored validity. The 7-item NIH-BFI had Cronbach alphas ranging from 0.81 to 0.88 and PCA indicating a single dimension. The NIH-BFI score was strongly correlated (r = 0.73, p < 0.001) with fatigue items from Beck Depression Index, with MADRS without fatigue items (r = 0.77, p < 0.001), and HDRS without fatigue items (pre: r = 0.69, p < 0.001). Preliminary findings show support for internal consistency reliability and validity of the NIH-BFI, a clinician-administered measure of fatigue. Further testing in other clinical populations is recommended to obtain additional information on reliability and validity. The NIH-BFI provides a method for clinician-rated fatigue that may be a separate from depression. Published by Elsevier Ltd.
Psychometric properties of a revised version of the Assisting Hand Assessment (Kids-AHA 5.0).

PubMed

Holmefur, Marie M; Krumlinde-Sundholm, Lena

2016-06-01

The aim of this study was to scrutinize the Assisting Hand Assessment (AHA) version 4.4 for possible improvements and to evaluate the psychometric properties regarding internal scale validity and aspects of reliability of a revised version of the AHA. In collaboration with experts, scoring criteria were changed for four items, and one fully new item was constructed. Twenty-two original, one new, and four revised items were scored for 164 assessments of children with unilateral cerebral palsy aged 18 months to 12 years. Rasch measurement analysis was used to evaluate internal scale validity by exploring rating-scale functioning, item and person goodness-of-fit, and principal component analysis. Targeting and scale reliability were also evaluated. After removal of misfitting items, a 20-item scale showed satisfactory goodness-of-fit. Unidimensionality was confirmed by principal component analysis. The rating scale functioned well for the 20 items, and the item difficulty was well suited to the ability level of the sample. The person reliability coefficient was 0.98, indicating high separation ability of the scale. A conversion table of AHA scores between the previous version (4.4) and the new version (5.0) was constructed. The new, 20-item version of the Kids-AHA (version 5.0), demonstrated excellent internal scale validity, suggesting improved responsiveness to changes and shortened scoring time. For comparison of scores from version 4.4 to 5.0, a transformation table is presented. © 2015 Mac Keith Press.
Rasch analysis for psychometric improvement of science attitude rating scales

NASA Astrophysics Data System (ADS)

Oon, Pey-Tee; Fan, Xitao

2017-04-01

Students' attitude towards science (SAS) is often a subject of investigation in science education research. Survey of rating scale is commonly used in the study of SAS. The present study illustrates how Rasch analysis can be used to provide psychometric information of SAS rating scales. The analyses were conducted on a 20-item SAS scale used in an existing dataset of The Trends in International Mathematics and Science Study (TIMSS) (2011). Data of all the eight-grade participants from Hong Kong and Singapore (N = 9942) were retrieved for analyses. Additional insights from Rasch analysis that are not commonly available from conventional test and item analyses were discussed, such as invariance measurement of SAS, unidimensionality of SAS construct, optimum utilization of SAS rating categories, and item difficulty hierarchy in the SAS scale. Recommendations on how TIMSS items on the measurement of SAS can be better designed were discussed. The study also highlights the importance of using Rasch estimates for statistical parametric tests (e.g. ANOVA, t-test) that are common in science education research for group comparisons.
Gender-, age-, and race/ethnicity-based differential item functioning analysis of the movement disorder society-sponsored revision of the Unified Parkinson's disease rating scale.

PubMed

Goetz, Christopher G; Liu, Yuanyuan; Stebbins, Glenn T; Wang, Lu; Tilley, Barbara C; Teresi, Jeanne A; Merkitch, Douglas; Luo, Sheng

2016-12-01

Assess MDS-UPDRS items for gender-, age-, and race/ethnicity-based differential item functioning. Assessing differential item functioning is a core rating scale validation step. For the MDS-UPDRS, differential item functioning occurs if item-score probability among people with similar levels of parkinsonism differ according to selected covariates (gender, age, race/ethnicity). If the magnitude of differential item functioning is clinically relevant, item-score interpretation must consider influences by these covariates. Differential item functioning can be nonuniform (covariate variably influences an item-score across different levels of parkinsonism) or uniform (covariate influences an item-score consistently over all levels of parkinsonism). Using the MDS-UPDRS translation database of more than 5,000 PD patients from 14 languages, we tested gender-, age-, and race/ethnicity-based differential item functioning. To designate an item as having clinically relevant differential item functioning, we required statistical confirmation by 2 independent methods, along with a McFadden pseudo-R 2 magnitude statistic greater than "negligible." Most items showed no gender-, age- or race/ethnicity-based differential item functioning. When differential item functioning was identified, the magnitude statistic was always in the "negligible" range, and the scale-level impact was minimal. The absence of clinically relevant differential item functioning across all items and all parts of the MDS-UPDRS is strong evidence that the scale can be used confidently. As studies of Parkinson's disease increasingly involve multinational efforts and the MDS-UPDRS has several validated non-English translations, the findings support the scale's broad applicability in populations with varying gender, age, and race/ethnicity distributions. © 2016 International Parkinson and Movement Disorder Society. © 2016 International Parkinson and Movement Disorder Society.
The Behavior Problems Inventory-Short Form for individuals with intellectual disabilities: part I: development and provisional clinical reference data.

PubMed

Rojahn, J; Rowe, E W; Sharber, A C; Hastings, R; Matson, J L; Didden, R; Kroes, D B H; Dumont, E L M

2012-05-01

The Behavior Problems Inventory-01 (BPI-01) is an informant-based behaviour rating instrument that was designed to assess maladaptive behaviours in individuals with intellectual disabilities (ID). Its items fall into one of three sub-scales: Self-injurious Behavior (14 items), Stereotyped Behavior (24 items), and Aggressive/Destructive Behavior (11 items). Each item is rated on a frequency scale (0 = never to 4 = hourly), and a severity scale (0 = no problem to 3 = severe problem). The BPI-01 has been successfully used in several studies and has shown acceptable to very good psychometric properties. One concern raised by some investigators was the large number of items on the BPI-01, which has reduced its user friendliness for certain applications. Furthermore, researchers and clinicians were often uncertain how to interpret their BPI-01 data without norms or a frame of reference. The Behavior Problems Inventory-Short Form (BPI-S) was empirically developed, based on an aggregated archival data set of BPI-01 data from individuals with ID from nine locations in the USA, Wales, England, the Netherlands, and Romania (n = 1122). The BPI-S uses the same rating system and the same three sub-scales as the BPI-01, but has fewer items: Self-injurious Behavior (8 items), Stereotyped Behavior (12 items), and Aggressive/Destructive Behavior (10 items). Rating anchors for the severity scales of the Self-injurious Behavior and the Aggressive/Destructive Behavior sub-scales were added in an effort to enhance the objectivity of the ratings. The sensitivity of the BPI-S compared with the BPI-01 was high (0.92 to 0.99), and so were the correlations between the analogous BPI-01 and the BPI-S sub-scales (0.96 to 0.99). Means and standard deviations were generated for both BPI versions in a Sex-by-age matrix, and in a Sex-by-ID Level matrix. Combined sex ranges are also provided by age and level of ID. In summary, the BPI-S is a very useful alternative to the BPI-01, especially for research and evaluation purposes involving groups of individuals. © 2011 The Authors. Journal of Intellectual Disability Research © 2011 Blackwell Publishing Ltd.
Developing Multidimensional Likert Scales Using Item Factor Analysis: The Case of Four-Point Items

ERIC Educational Resources Information Center

Asún, Rodrigo A.; Rdz-Navarro, Karina; Alvarado, Jesús M.

2016-01-01

This study compares the performance of two approaches in analysing four-point Likert rating scales with a factorial model: the classical factor analysis (FA) and the item factor analysis (IFA). For FA, maximum likelihood and weighted least squares estimations using Pearson correlation matrices among items are compared. For IFA, diagonally weighted…
Rating the methodological quality of single-subject designs and n-of-1 trials: introducing the Single-Case Experimental Design (SCED) Scale.

PubMed

Tate, Robyn L; McDonald, Skye; Perdices, Michael; Togher, Leanne; Schultz, Regina; Savage, Sharon

2008-08-01

Rating scales that assess methodological quality of clinical trials provide a means to critically appraise the literature. Scales are currently available to rate randomised and non-randomised controlled trials, but there are none that assess single-subject designs. The Single-Case Experimental Design (SCED) Scale was developed for this purpose and evaluated for reliability. Six clinical researchers who were trained and experienced in rating methodological quality of clinical trials developed the scale and participated in reliability studies. The SCED Scale is an 11-item rating scale for single-subject designs, of which 10 items are used to assess methodological quality and use of statistical analysis. The scale was developed and refined over a 3-year period. Content validity was addressed by identifying items to reduce the main sources of bias in single-case methodology as stipulated by authorities in the field, which were empirically tested against 85 published reports. Inter-rater reliability was assessed using a random sample of 20/312 single-subject reports archived in the Psychological Database of Brain Impairment Treatment Efficacy (PsycBITE). Inter-rater reliability for the total score was excellent, both for individual raters (overall ICC = 0.84; 95% confidence interval 0.73-0.92) and for consensus ratings between pairs of raters (overall ICC = 0.88; 95% confidence interval 0.78-0.95). Item reliability was fair to excellent for consensus ratings between pairs of raters (range k = 0.48 to 1.00). The results were replicated with two independent novice raters who were trained in the use of the scale (ICC = 0.88, 95% confidence interval 0.73-0.95). The SCED Scale thus provides a brief and valid evaluation of methodological quality of single-subject designs, with the total score demonstrating excellent inter-rater reliability using both individual and consensus ratings. Items from the scale can also be used as a checklist in the design, reporting and critical appraisal of single-subject designs, thereby assisting to improve standards of single-case methodology.
Adult Attachment Ratings (AAR): an item response theory analysis.

PubMed

Pilkonis, Paul A; Kim, Yookyung; Yu, Lan; Morse, Jennifer Q

2014-01-01

The Adult Attachment Ratings (AAR) include 3 scales for anxious, ambivalent attachment (excessive dependency, interpersonal ambivalence, and compulsive care-giving), 3 for avoidant attachment (rigid self-control, defensive separation, and emotional detachment), and 1 for secure attachment. The scales include items (ranging from 6-16 in their original form) scored by raters using a 3-point format (0 = absent, 1 = present, and 2 = strongly present) and summed to produce a total score. Item response theory (IRT) analyses were conducted with data from 414 participants recruited from psychiatric outpatient, medical, and community settings to identify the most informative items from each scale. The IRT results allowed us to shorten the scales to 5-item versions that are more precise and easier to rate because of their brevity. In general, the effective range of measurement for the scales was 0 to +2 SDs for each of the attachment constructs; that is, from average to high levels of attachment problems. Evidence for convergent and discriminant validity of the scales was investigated by comparing them with the Experiences of Close Relationships-Revised (ECR-R) scale and the Kobak Attachment Q-sort. The best consensus among self-reports on the ECR-R, informant ratings on the ECR-R, and expert judgments on the Q-sort and the AAR emerged for anxious, ambivalent attachment. Given the good psychometric characteristics of the scale for secure attachment, however, this measure alone might provide a simple alternative to more elaborate procedures for some measurement purposes. Conversion tables are provided for the 7 scales to facilitate transformation from raw scores to IRT-calibrated (theta) scores.
Item-Based Psychometrics of the Preschool Behavioral and Emotional Rating Scale

ERIC Educational Resources Information Center

Cress, Cynthia J.; Lambert, Matthew C.; Epstein, Michael H.

2014-01-01

The Preschool Behavioral and Emotional Rating Scale (PreBERS) is an assessment of emotional and behavioral strengths in preschoolers with well-established reliability and validity for educational and clinical application in children with and without disabilities. The present study provides further evidence of psychometric rigor for items and…
Rating catatonia in patients with chronic schizophrenia: Rasch analysis of the Bush-Francis Catatonia Rating Scale.

PubMed

Wong, Eric; Ungvari, Gabor S; Leung, Siu-Kau; Tang, Wai-Kwong

2007-01-01

Catatonic signs and symptoms are frequently observed in patients with chronic schizophrenia. Clinical surveys have suggested that the composition of catatonic syndrome occurring in chronic schizophrenia may be different from what is found in acute psychiatric disorders or medical conditions. Consequently, this patient population may need tailor-made rating instruments for catatonia. The aim of the present study was to examine the suitability and accuracy of using the Bush-Francis Catatonia Rating Scale (BFCRS) in chronic schizophrenia inpatients. The unidimensionality (optimal number of items; item fit), and the scoring scheme (the optimal number of scoring categories) of the BFCRS were determined in a random sample of 225 patients with chronic schizophrenia applying Rasch analysis. In addition, differential item functioning (DIF) analysis was also performed. The BFCRS proved to be unidimensional apart from three misfit and one marginally misfit items. The three misfit items were removed from the scale thereby constructing a revised version called BFCRS-R. Since the original BFCRS (BFCRS-O) showed no increase across items across steep gradients (poor endorsability of step calibrations), in BFCRS-R a binary scale ('absent' versus 'present' choices only) was constructed instead of the scoring scheme of 0-3. The 20-item BFCRS-R showed improved psychometric properties in that it had a higher item separation index than BFCRS-O. BFCRS-R mean logit was closer to zero indicating that the items on the scale and the subjects were better matched than in BFCRS-O. DIF analysis showed that certain items of both versions of BFCRS were influenced by the presence of negative symptoms. BFCRS-R is shorter and simpler than the original version and having better psychometric properties seems to be better suited for identifying and quantifying catatonia in chronic psychotic patients. Copyright (c) 2007 John Wiley & Sons, Ltd.
Examining the validity and reliability of the Taita symptom checklist using Rasch analysis.

PubMed

Chen, Yun-Ling; Pan, Ay-Woan; Chung, LyInn; Chen, Tsyr-Jang

2015-03-01

The Taita symptom checklist (TSCL) is a standardized self-rating psychiatric symptom scale for outpatients with mental illness in Taiwan. This study aimed to examine the validity and reliability of the TSCL using Rasch analysis. The TSCL was given to 583 healthy people and 479 people with mental illness. Rasch analysis was used to examine the appropriateness of the rating scale, the unidimensionality of the scale, the differential item functioning across sex and diagnosis, and the Rasch cut-off score of the scale. Rasch analysis confirmed that the revised 37 items with a three-point rating scale of the TSCL demonstrated good internal consistency and met criteria for unidimensionality. The person and item reliability indices were high. The TSCL could reliably measure healthy participants and patients with mental illness. Differential item functioning due to sex or psychiatric diagnosis was evident for three items. A Rasch cut-off score for TSCL was produced for detecting participants' psychiatric symptoms based on an eight-level classification. The TSCL is a reliable and valid assessment to evaluate the participants' perceived disturbance of psychiatric symptoms based on Rasch analysis. Copyright © 2013. Published by Elsevier B.V.
Movement Issues Identified in Movement ABC2 Checklist Parent Ratings for Students with Persisting Dysgraphia, Dyslexia, and OWL LD and Typical Literacy Learners.

PubMed

Nielsen, Kathleen; Henderson, Sheila; Barnett, Anna L; Abbott, Robert D; Berninger, Virginia

2018-01-01

Movement, which draws on motor skills and executive functions for managing them, plays an important role in literacy learning (e.g., movement of mouth during oral reading and movement of hand and fingers during writing); but relatively little research has focused on movement skills in students with specific learning disabilities as the current study did. Parents completed normed Movement Assessment Battery for Children Checklist, 2nd edition (ABC-2), ratings and their children in grades 4 to 9 ( M = 11 years, 11 months; 94 boys, 61 girls) completed diagnostic assessment used to assign them to diagnostic groups: control typical language learning ( N = 42), dysgraphia (impaired handwriting) ( N = 29), dyslexia (impaired word decoding/reading and spelling) ( N = 65), or oral and written language learning disability (OWL LD) (impaired syntax in oral and written language) ( N = 19). The research aims were to (a) correlate the Movement ABC-2 parent ratings for Scale A Static/Predictable Environment (15 items) and Scale B Dynamic/Unpredictable Environment (15 items) with reading and writing achievement in total sample varying within and across different skills; and (b) compare each specific learning disability group with the control group on Movement ABC-2 parent ratings for Scale A, Scale B, and Scale C Movement-Related (Non-Motor Executive Functions, or Self-Efficacy, or Affect) (13 items). At least one Movement ABC-2 parent rating was correlated with each assessed literacy achievement skill. Each of three specific learning disability groups differed from the control group on two Scale A (static/predictable environment) items (fastens buttons and forms letters with pencil or pen) and on three Scale C items (distractibility, overactive, and underestimates own ability); but only OWL LD differed from control on Scale B (dynamic/unpredictable environment) items. Applications of findings to assessment and instruction for students ascertained for and diagnosed with persisting specific learning disabilities in literacy learning, and future research directions are discussed.

ECT Has Greater Efficacy Than Fluoxetine in Alleviating the Burden of Illness for Patients with Major Depressive Disorder: A Taiwanese Pooled Analysis

PubMed Central

Huang, Chun-Jen; Chen, Cheng-Chung

2018-01-01

Abstract Background The burden of major depressive disorder includes suffering due to symptom severity, functional impairment, and quality of life deficits. The aim of this study was to compare the differences between electroconvulsive therapy and pharmacotherapy in reducing such burdens. Methods This was a pooled analysis study including 2 open-label trials for major depressive disorder inpatients receiving either standard bitemporal and modified electroconvulsive therapy with a maximum of 12 sessions or 20 mg/d of fluoxetine for 6 weeks. Symptom severity, functioning, and quality of life were assessed using the 17-item Hamilton Rating Scale for Depression, the Modified Work and Social Adjustment Scale, and SF-36. Side effects following treatment, including subjective memory impairment, nausea/vomiting, and headache, were recorded. The differences between these 2 groups in 17-item Hamilton Rating Scale for Depression, Modified Work and Social Adjustment Scale, quality of life, side effects, and time to response (at least a 50% reduction of 17-item Hamilton Rating Scale for Depression) and remission (17-item Hamilton Rating Scale for Depression ≤7) following treatment were analyzed. Results Electroconvulsive therapy (n=116) showed a significantly greater reduction in 17-item Hamilton Rating Scale for Depression, Modified Work and Social Adjustment Scale, and quality of life deficits and had significantly shorter time to response/remission than fluoxetine (n=126). However, the electroconvulsive therapy group was more likely to experience subjective memory impairment and headache. Conclusions Compared with fluoxetine, electroconvulsive therapy was more effective in alleviating the burden of major depressive disorder and had a substantially increased speed of response/remission in the acute phase. Increased education and information about electroconvulsive therapy for clinicians, patients, and their families and the general public is warranted. PMID:29228200
Clinimetric Testing of the Comprehensive Cervical Dystonia Rating Scale

PubMed Central

Comella, C. L.; Perlmutter, J.S.; Jinnah, H. A.; Waliczek, T. A.; Rosen, A. R.; Galpern, W. R.; Adler, C. H.; Barbano, R. L.; Factor, S. A.; Goetz, C.G.; Jankovic, J.; Reich, S. G.; Rodriguez, R. L.; Severt, W. L.; Zurowski, M.; Fox, S. H.; Stebbins, G.T.

2016-01-01

Objective To test the clinimetric properties of the Comprehensive Cervical Dystonia Rating Scale. Background This is a modular scale with modifications of the Toronto Western Spasmodic Torticollis Rating Scale (composed of three subscales assessing motor severity, disability and pain) now referred to as the revised Toronto Western Spasmodic Torticollis Scale-2.; a newly developed psychiatric screening instrument; and the Cervical Dystonia Impact Profile-58 as a quality of life measure. Methods Ten dystonia experts rated subjects with cervical dystonia using the comprehensive scale. Clinimetric techniques assessed each module of the scale for reliability, item correlation and factor structure. Results There were 208 cervical dystonia patients (73% women, age 59±10 years, duration 15±12 years). The internal consistency of the motor severity subscale was acceptable (Cronbach’s alpha = 0.57). Item to total correlations showed that elimination of items with low correlations (<0.20) increased alpha to 0.71. Internal consistency estimates for the subscales for disability and pain were 0.88 and 0.95 respectively. The psychiatric screening scale had a Cronbach’s alpha of 0.84 and satisfactory item to total correlations. When the subscales of the Toronto Western Spasmodic Torticollis scale -2 were combined with the psychiatric screening scale, Cronbach's alpha was 0.88, and construct validity assessment demonstrated four rational factors: motor, disability, pain and psychiatric disorders. The Cervical Dystonia Impact Profile-58 had an alpha of 0.98 and its construction was validated through a confirmatory factor analysis. Conclusions The modules of the Comprehensive Cervical Dystonia Rating Scale are internally consistent with a logical factor structure. PMID:26971359
Anxiety and fear. Discriminant validity in the child and adolescent practitioner's perspective.

PubMed

Pavuluri, Mani N; Henry, David; Allen, Kathleen

2002-12-01

We assessed the ability of child and adolescent practitioners to discriminate between anxiety items from the Revised Children's Manifest Anxiety Scale (RCMAS) and fear items from the Fear Survey Schedule for Children-Revised (FSSC-R). In addition, we examined the effects age, gender, nationality, and therapeutic orientation on discrimination ability. Child and adolescent psychiatrists and psychologists from two university hospitals in Australia and the USA completed a questionnaire comprised of items randomly chosen from the RCMAS and the FSSC-R. Clinicians rated each item on the extent to which the item represented the construct of anxiety or fear, using a 7-point Likert-type scale. Clinicians were more accurate in their perceptions of anxiety than in their perceptions of fear. Clinicians with a psychodynamic orientation were more likely to perceive an item as describing anxiety, and were less likely to identify fear. There was a significant interaction between age, scale and perception, with the youngest clinicians showing the greatest perceptual differentiation between the fear and anxiety items. The results suggest a need to develop common terminology among researchers and clinicians, develop scales with items specific to the pathology they intend to measure, and consider the variables influencing the clinicians rating them.
Psychometrics of the Fitness-to-Drive Screening Measure.

PubMed

Classen, Sherrilene; Velozo, Craig A; Winter, Sandra M; Bédard, Michel; Wang, Yanning

2015-01-01

We employed item response theory (IRT), specifically using Rasch modeling, to determine the measurement precision of the Fitness-to-Drive Screening Measure (FTDS), a tool that can be used by caregivers and occupational therapists to help detect at-risk drivers. We examined unidimensionality through the factor structure (how items contribute to the central construct of fitness to drive), rating scale (use of the categories of the rating scale), item/person-level separation (distinguishing between items with different difficulty levels or persons with different ability levels) and reliability, item hierarchy (easier driving items advancing to more difficult driving items), rater reliability, rater effects (severity vs. leniency of a rater), and criterion validity of the FTDS to an on-road assessment, via three rater groups (n = 200 older drivers; n = 200 caregivers; n = 2 evaluators). The FTDS is unidimensional, the rating scale performed well, has good person (> 3.07) and item (> 5.43) separation, good person (> 0.90) and item reliability (> 0.97), with < 10% misfitting items for two rater groups (caregivers and drivers). The intraclass correlation (ICC) coefficient among the three rater groups was significant (.253, p < .001) and the evaluators were the most severe raters. When comparing the caregivers' FTDS rating with the drivers' on-road assessment, the areas under the curve (index of discriminability; caregivers .726, p < .001) suggested concurrent validity between the FTDS and the on-road assessment. Despite limitations, the FTDS is a reliable and accurate screening measure for caregivers to help identify at-risk older drivers and for occupational therapy practitioners to start conversations about driving.
The Relationship between Symptom Relief and Psychosocial Functional Improvement during Acute Electroconvulsive Therapy for Patients with Major Depressive Disorder.

PubMed

Lin, Ching-Hua; Yang, Wei-Cheng

2017-07-01

We aimed to compare the degree of symptom relief to psychosocial functional (abbreviated as "functional") improvement and explore the relationships between symptom relief and functional improvement during acute electroconvulsive therapy for patients with major depressive disorder. Major depressive disorder inpatients (n=130) requiring electroconvulsive therapy were recruited. Electroconvulsive therapy was generally performed for a maximum of 12 treatments. Symptom severity, using the 17-item Hamilton Depression Rating Scale, and psychosocial functioning (abbreviated as "functioning"), using the Modified Work and Social Adjustment Scale, were assessed before electroconvulsive therapy, after every 3 electroconvulsive therapy treatments, and after the final electroconvulsive therapy. Both 17-item Hamilton Depression Rating Scale and Modified Work and Social Adjustment Scale scores were converted to T-score units to compare the degrees of changes between depressive symptoms and functioning after electroconvulsive therapy. Structural equation modeling was used to test the relationships between 17-item Hamilton Depression Rating Scale and Modified Work and Social Adjustment Scale during acute electroconvulsive therapy. One hundred sixteen patients who completed at least the first 3 electroconvulsive therapy treatments entered the analysis. Reduction of 17-item Hamilton Depression Rating Scale T-scores was significantly greater than that of Modified Work and Social Adjustment Scale T-scores at assessments 2, 3, 4, and 5. The model analyzed by structural equation modeling satisfied all indices of goodness-of-fit (chi-square = 32.882, P =.107, TLI = 0.92, CFI = 0.984, RMSEA = 0.057). The 17-item Hamilton Depression Rating Scale change did not predict subsequent Modified Work and Social Adjustment Scale change. Functioning improved less than depressive symptoms during acute electroconvulsive therapy. Symptom reduction did not predict subsequent functional improvement. Depressive symptoms and functional impairment are distinct domains and should be assessed independently to accurately reflect the effectiveness of electroconvulsive therapy. © The Author 2017. Published by Oxford University Press on behalf of CINP.
The Montgomery Äsberg and the Hamilton Ratings of Depression

PubMed Central

Carmody, Thomas; Rush, A. John; Bernstein, Ira; Warden, Diane; Brannan, Stephen; Burnham, Daniel; Woo, Ada; Trivedi, Madhukar

2007-01-01

The 17-item Hamilton Rating Scale for Depression (HRSD17) and the Montgomery Äsberg Depression Rating Scale (MADRS) are two widely used clinicianrated symptom scales. A 6-item version of the HRSD (HRSD6) was created by Bech to address the psychometric limitations of the HRSD17. The psychometric properties of these measures were compared using classical test theory (CTT) and item response theory (IRT) methods. IRT methods were used to equate total scores on any two scales. Data from two distinctly different outpatient studies of nonpsychotic major depression: a 12-month study of highly treatment-resistant patients (n=233) and an 8-week acute phase drug treatment trial (n=985) were used for robustness of results. MADRS and HRSD6 items generally contributed more to the measurement of depression than HRSD17 items as shown by higher item-total correlations and higher IRT slope parameters. The MADRS and HRSD6 were unifactorial while the HRSD17 contained 2 factors. The MADRS showed about twice the precision in estimating depression as either the HRSD17 or HRSD6 for average severity of depression. An HRSD17 of 7 corresponded to an 8 or 9 on the MADRS and 4 on the HRSD6. The MADRS would be superior to the HRSD17 in the conduct of clinical trials. PMID:16769204
The Development and Validation of the Intercultural Sensitivity Scale.

ERIC Educational Resources Information Center

Chen, Guo-Ming; Starosta, William J.

The present study developed and assessed reliability and validity of a new instrument, the Intercultural Sensitivity Scale (ISS). Based on a review of the literature, 44 items thought to be important for intercultural sensitivity were generated. A sample of 414 college students rated these items and generated a 24-item final version of the…
Quality of life in Essential Tremor Questionnaire (QUEST): development and initial validation.

PubMed

Tröster, Alexander I; Pahwa, Rajesh; Fields, Julie A; Tanner, Caroline M; Lyons, Kelly E

2005-09-01

Essential tremor (ET) can diminish functioning and quality of life (QOL) but generic QOL measures may be relatively insensitive to ET and its therapies. We sought to develop an ET-specific measure that might be more sensitive, acceptable to patients, relatively brief, and easily used. A sample of 200 patients (average age 70 years, range 30-91; average disease duration 15 years) rated the extent to which tremor impacts a function or state, tremor severity in various body parts, perceived health, and overall QOL. Responses to this initial questionnaire were subjected to principal components analysis (PCA). Inspection of factor coordinates, Eigenvalues, variance accounted for, and correlation matrices were used to select items for confirmatory PCA. Final scale reliability was assessed using Cronbach's alpha. Validity was evaluated by correlations between QOL scales and self-rated tremor severity. PCA of 65 initial items yielded 11 factors accounting for 71% of variance. Six factors were discarded. Two items were eliminated for not loading on a factor and 33 for perceived redundancy. Confirmatory PCA of the retained 30 items yielded an almost identical factor structure (six factors, 70% of variance accounted for, and similar item loadings). Because two factors had very few items loading on them, these two factors were combined into one scale. The final measure has five scales: Physical, Psychosocial, Communication, Hobbies/Leisure, and Work/Finance. Reliability was excellent for the whole instrument and four scales (> or =0.89), and good for the Work/Finance scale (0.79). Severity of voice and head tremor were the best correlates of Communication (0.70 and 0.35), while the Physical scale was related to right and left upper extremity tremor (0.59 and 0.56). Scales correlated more highly with patients' rating of their overall QOL than their health perception. A brief, 30-item, ET-specific QOL scale with excellent reliability was developed. Preliminary validity data are encouraging. The Quality of Life in Essential Tremor Questionnaire (QUEST) promises to facilitate QOL measurement in ET.
Validation of the Expanded Versions of the Adult ADHD Self-Report Scale v1.1 Symptom Checklist and the Adult ADHD Investigator Symptom Rating Scale.

PubMed

Silverstein, Michael J; Faraone, Stephen V; Alperin, Samuel; Leon, Terry L; Biederman, Joseph; Spencer, Thomas J; Adler, Lenard A

2018-02-01

The aim of this study is to validate the Adult ADHD Self-Report Scale (ASRS) and Adult ADHD Investigator Symptom Rating Scale (AISRS) expanded versions, including executive function deficits (EFDs) and emotional dyscontrol (EC) items, and to present ASRS and AISRS pilot normative data. Two patient samples (referred and primary care physician [PCP] controls) were pooled together for these analyses. Final analysis included 297 respondents, 171 with adult ADHD. Cronbach's alphas were high for all sections of the scales. Examining histograms of ASRS 31-item and AISRS 18-item total scores for ADHD controls, 95% cutoff scores were 70 and 23, respectively; histograms for pilot normative sample suggest cutoffs of 82 and 26, respectively. (a) ASRS- and AISRS-expanded versions have high validity in assessment of core 18 adult ADHD Diagnostic and Statistical Manual of Mental Disorders ( DSM) symptoms and EFD and EC symptoms. (b) ASRS (31-item) scores 70 to 82 and AISRS (18-item) scores from 23 to 26 suggest a high likelihood of adult ADHD.
Measuring Response Styles Across the Big Five: A Multiscale Extension of an Approach Using Multinomial Processing Trees.

PubMed

Khorramdel, Lale; von Davier, Matthias

2014-01-01

This study shows how to address the problem of trait-unrelated response styles (RS) in rating scales using multidimensional item response theory. The aim is to test and correct data for RS in order to provide fair assessments of personality. Expanding on an approach presented by Böckenholt (2012), observed rating data are decomposed into multiple response processes based on a multinomial processing tree. The data come from a questionnaire consisting of 50 items of the International Personality Item Pool measuring the Big Five dimensions administered to 2,026 U.S. students with a 5-point rating scale. It is shown that this approach can be used to test if RS exist in the data and that RS can be differentiated from trait-related responses. Although the extreme RS appear to be unidimensional after exclusion of only 1 item, a unidimensional measure for the midpoint RS is obtained only after exclusion of 10 items. Both RS measurements show high cross-scale correlations and item response theory-based (marginal) reliabilities. Cultural differences could be found in giving extreme responses. Moreover, it is shown how to score rating data to correct for RS after being proved to exist in the data.
Bridging the Measurement Gap Between Research and Clinical Care in Schizophrenia: Positive and Negative Syndrome Scale-6 (PANSS-6) and Other Assessments Based on the Simplified Negative and Positive Symptoms Interview (SNAPSI).

PubMed

Østergaard, Søren D; Opler, Mark G A; Correll, Christoph U

2017-12-01

There is currently a "measurement gap" between research and clinical care in schizophrenia. The main reason behind this gap is that the most widely used rating scale in schizophrenia research, the 30-item Positive and Negative Syndrome Scale (PANSS), takes so long to administer that it is rarely used in clinical practice. This compromises the translation of research findings into clinical care and vice versa. The aim of this paper is to discuss how this measurement gap can be closed. Specifically, the main points of discussion are 1) the practical problems associated with using the full 30-item PANSS in clinical practice; 2) how the brief, six-item version of the Positive and Negative Syndrome Scale (PANSS-6) was derived empirically from the full 30-item PANSS and what the initial results obtained with PANSS-6 entail; and 3) how PANSS-6 ratings, guided by the newly developed, 15-25-minute, stand-alone Simplified Negative and Positive Symptoms Interview (SNAPSI), might help bridge the measurement gap between research and clinical care in schizophrenia. The full 30-item PANSS is often used in research studies, but is too time consuming to allow for routine clinical use. Recent studies suggest that the much briefer PANSS-6 is a psychometrically valid measure of core positive and negative symptoms of schizophrenia and that the scale is sensitive to symptom improvement following pharmacological treatment. SNAPSI is a brief interview that yields the information needed to rate PANSS-6 (and other brief rating scales). We believe that PANSS-6 ratings guided by SNAPSI will help bridge the measurement gap between research and clinical care in schizophrenia.
The 4-Item Negative Symptom Assessment (NSA-4) Instrument: A Simple Tool for Evaluating Negative Symptoms in Schizophrenia Following Brief Training.

PubMed

Alphs, Larry; Morlock, Robert; Coon, Cheryl; van Willigenburg, Arjen; Panagides, John

2010-07-01

Objective. To assess the ability of mental health professionals to use the 4-item Negative Symptom Assessment instrument, derived from the Negative Symptom Assessment-16, to rapidly determine the severity of negative symptoms of schizophrenia.Design. Open participation.Setting. Medical education conferences.Participants. Attendees at two international psychiatry conferences.Measurements. Participants read a brief set of the 4-item Negative Symptom Assessment instructions and viewed a videotape of a patient with schizophrenia. Using the 1 to 6 4-item Negative Symptom Assessment severity rating scale, they rated four negative symptom items and the overall global negative symptoms. These ratings were compared with a consensus rating determination using frequency distributions and Chi-square tests for the proportion of participant ratings that were within one point of the expert rating.Results. More than 400 medical professionals (293 physicians, 50% with a European practice, and 55% who reported past utilization of schizophrenia ratings scales) participated. Between 82.1 and 91.1 percent of the 4-items and the global rating determinations by the participants were within one rating point of the consensus expert ratings. The differences between the percentage of participant rating scores that were within one point versus the percentage that were greater than one point different from those by the consensus experts was significant (p<0.0001). Participants rating of negative symptoms using the 4-item Negative Symptom Assessment did not generally differ among the geographic regions of practice, the professional credentialing, or their familiarity with the use of schizophrenia symptom rating instruments.Conclusion. These findings suggest that clinicians from a variety of geographic practices can, after brief training, use the 4-item Negative Symptom Assessment effectively to rapidly assess negative symptoms in patients with schizophrenia.
Excellent reliability of the Hamilton Depression Rating Scale (HDRS-21) in Indonesia after training.

PubMed

Istriana, Erita; Kurnia, Ade; Weijers, Annelies; Hidayat, Teddy; Pinxten, Lucas; de Jong, Cor; Schellekens, Arnt

2013-09-01

The Hamilton Depression Rating Scale (HDRS) is the most widely used depression rating scale worldwide. Reliability of HDRS has been reported mainly from Western countries. The current study tested the reliability of HDRS ratings among psychiatric residents in Indonesia, before and after HDRS training. The hypotheses were that: (i) prior to the training reliability of HDRS ratings is poor; and (ii) HDRS training can improve reliability of HDRS ratings to excellent levels. Furthermore, we explored cultural validity at item level. Videotaped HDRS interviews were rated by 30 psychiatric residents before and after 1 day of HDRS training. Based on a gold standard rating, percentage correct ratings and deviation from the standard were calculated. Correct ratings increased from 83% to 99% at item level and from 70% to 100% for the total rating. The average deviation from the gold standard rating improved from 0.07 to 0.02 at item level and from 2.97 to 0.46 for the total rating. HDRS assessment by psychiatric trainees in Indonesia without prior training is unreliable. A short, evidence-based HDRS training improves reliability to near perfect levels. The outlined training program could serve as a template for HDRS trainings. HDRS items that may be less valid for assessment of depression severity in Indonesia are discussed. Copyright © 2013 Wiley Publishing Asia Pty Ltd.
An Item Response Unfolding Model for Graphic Rating Scales

ERIC Educational Resources Information Center

Liu, Ying

2009-01-01

The graphic rating scale, a measurement tool used in many areas of psychology, usually takes a form of a fixed-length line segment, with both ends bounded and labeled as extreme responses. The raters mark somewhere on the line, and the length of the line segment from one endpoint to the mark is taken as the measure. An item response unfolding…
Concurrent Validity and Sensitivity to Change of Direct Behavior Rating Single-Item Scales (DBR-SIS) within an Elementary Sample

ERIC Educational Resources Information Center

Smith, Rhonda L.; Eklund, Katie; Kilgus, Stephen P.

2018-01-01

The purpose of this study was to evaluate the concurrent validity, sensitivity to change, and teacher acceptability of Direct Behavior Rating single-item scales (DBR-SIS), a brief progress monitoring measure designed to assess student behavioral change in response to intervention. Twenty-four elementary teacher-student dyads implemented a daily…
ECT Has Greater Efficacy Than Fluoxetine in Alleviating the Burden of Illness for Patients with Major Depressive Disorder: A Taiwanese Pooled Analysis.

PubMed

Lin, Ching-Hua; Huang, Chun-Jen; Chen, Cheng-Chung

2018-01-01

The burden of major depressive disorder includes suffering due to symptom severity, functional impairment, and quality of life deficits. The aim of this study was to compare the differences between electroconvulsive therapy and pharmacotherapy in reducing such burdens. This was a pooled analysis study including 2 open-label trials for major depressive disorder inpatients receiving either standard bitemporal and modified electroconvulsive therapy with a maximum of 12 sessions or 20 mg/d of fluoxetine for 6 weeks. Symptom severity, functioning, and quality of life were assessed using the 17-item Hamilton Rating Scale for Depression, the Modified Work and Social Adjustment Scale, and SF-36. Side effects following treatment, including subjective memory impairment, nausea/vomiting, and headache, were recorded. The differences between these 2 groups in 17-item Hamilton Rating Scale for Depression, Modified Work and Social Adjustment Scale, quality of life, side effects, and time to response (at least a 50% reduction of 17-item Hamilton Rating Scale for Depression) and remission (17-item Hamilton Rating Scale for Depression ≤7) following treatment were analyzed. Electroconvulsive therapy (n=116) showed a significantly greater reduction in 17-item Hamilton Rating Scale for Depression, Modified Work and Social Adjustment Scale, and quality of life deficits and had significantly shorter time to response/remission than fluoxetine (n=126). However, the electroconvulsive therapy group was more likely to experience subjective memory impairment and headache. Compared with fluoxetine, electroconvulsive therapy was more effective in alleviating the burden of major depressive disorder and had a substantially increased speed of response/remission in the acute phase. Increased education and information about electroconvulsive therapy for clinicians, patients, and their families and the general public is warranted. © The Author(s) 2017. Published by Oxford University Press on behalf of CINP.
French Norms for the Harvard Group Scale of Hypnotic Susceptibility, Form A.

PubMed

Anlló, Hernán; Becchio, Jean; Sackur, Jérôme

2017-01-01

The authors present French norms for the Harvard Group Scale of Hypnotic Susceptibility, Form A (HGSHS:A). They administered an adapted translation of Shor and Orne's original text (1962) to a group of 126 paid volunteers. Participants also rated their own responses following our translation of Kihlstrom's Scale of Involuntariness (2006). Item pass rates, score distributions, and reliability were calculated and compared with several other reference samples. Analyses show that the present French norms are congruous with the reference samples. Interestingly, the passing rate for some items drops significantly if "entirely voluntary" responses (as identified by Kihlstrom's scale) are scored as "fail." Copies of the translated scales and response booklet are available online.
A Model-Free Diagnostic for Single-Peakedness of Item Responses Using Ordered Conditional Means.

PubMed

Polak, Marike; de Rooij, Mark; Heiser, Willem J

2012-09-01

In this article we propose a model-free diagnostic for single-peakedness (unimodality) of item responses. Presuming a unidimensional unfolding scale and a given item ordering, we approximate item response functions of all items based on ordered conditional means (OCM). The proposed OCM methodology is based on Thurstone & Chave's (1929) criterion of irrelevance, which is a graphical, exploratory method for evaluating the "relevance" of dichotomous attitude items. We generalized this criterion to graded response items and quantified the relevance by fitting a unimodal smoother. The resulting goodness-of-fit was used to determine item fit and aggregated scale fit. Based on a simulation procedure, cutoff values were proposed for the measures of item fit. These cutoff values showed high power rates and acceptable Type I error rates. We present 2 applications of the OCM method. First, we apply the OCM method to personality data from the Developmental Profile; second, we analyze attitude data collected by Roberts and Laughlin (1996) concerning opinions of capital punishment.
Liverpool University Neuroleptic Side-Effect Rating Scale (LUNSERS) as a subjective measure of drug-induced parkinsonism and akathisia.

PubMed

Jung, Hee-Yeon; Kim, Jong-Hoon; Ahn, Yong-Min; Kim, Seong-Chan; Hwang, Samuel S; Kim, Yong-Sik

2005-01-01

The Liverpool University Neuroleptic Side-Effect Rating Scale (LUNSERS) was examined for its usefulness as a subjective measure of drug-induced parkinsonism and akathisia. Eighty-three subjects were assessed using the LUNSERS, the Simpson-Angus Scale (SAS) and the Barnes Akathisia Rating Scale (BARS), before and after a 6-week treatment with olanzapine. Significant correlations were found between the changes in scores of parkinsonism items of LUNSERS and SAS. The changes in scores of akathisia item (restlessness), extrapyramidal side effects (EPS) subscale and psychic side-effects subscale of LUNSERS were significantly correlated with those of the BARS. 'Shakiness', one item of the EPS subscale of LUNSERS, correctly classified between parkinsonism and non-parkinsonism groups with 81.0% accuracy. A combination of four items included in EPS and psychic side-effect subscales of LUNSERS identified akathisia and non-akathisia groups with 76.2% accuracy. These results suggest that the EPS and psychic side-effect subscales of LUNSERS may be useful in screening for drug-induced parkinsonism and akathisia. Copyright (c) 2004 John Wiley & Sons, Ltd.
Is the Parkinson Anxiety Scale comparable across raters?

PubMed

Forjaz, Maria João; Ayala, Alba; Martinez-Martin, Pablo; Dujardin, Kathy; Pontone, Gregory M; Starkstein, Sergio E; Weintraub, Daniel; Leentjens, Albert F G

2015-04-01

The Parkinson Anxiety Scale is a new scale developed to measure anxiety severity in Parkinson's disease specifically. It consists of three dimensions: persistent anxiety, episodic anxiety, and avoidance behavior. This study aimed to assess the measurement properties of the scale while controlling for the rater (self- vs. clinician-rated) effect. The Parkinson Anxiety Scale was administered to a cross-sectional multicenter international sample of 362 Parkinson's disease patients. Both patients and clinicians rated the patient's anxiety independently. A many-facet Rasch model design was applied to estimate and remove the rater effect. The following measurement properties were assessed: fit to the Rasch model, unidimensionality, reliability, differential item functioning, item local independency, interrater reliability (self or clinician), and scale targeting. In addition, test-retest stability, construct validity, precision, and diagnostic properties of the Parkinson Anxiety Scale were also analyzed. A good fit to the Rasch model was obtained for Parkinson Anxiety Scale dimensions A and B, after the removal of one item and rescoring of the response scale for certain items, whereas dimension C showed marginal fit. Self versus clinician rating differences were of small magnitude, with patients reporting higher anxiety levels than clinicians. The linear measure for Parkinson Anxiety Scale dimensions A and B showed good convergent construct with other anxiety measures and good diagnostic properties. Parkinson Anxiety Scale modified dimensions A and B provide valid and reliable measures of anxiety in Parkinson's disease that are comparable across raters. Further studies are needed with dimension C. © 2014 International Parkinson and Movement Disorder Society.

Item Development and Validity Testing for a Self- and Proxy Report: The Safe Driving Behavior Measure

PubMed Central

Classen, Sherrilene; Winter, Sandra M.; Velozo, Craig A.; Bédard, Michel; Lanford, Desiree N.; Brumback, Babette; Lutz, Barbara J.

2010-01-01

OBJECTIVE We report on item development and validity testing of a self-report older adult safe driving behaviors measure (SDBM). METHOD On the basis of theoretical frameworks (Precede–Proceed Model of Health Promotion, Haddon’s matrix, and Michon’s model), existing driving measures, and previous research and guided by measurement theory, we developed items capturing safe driving behavior. Item development was further informed by focus groups. We established face validity using peer reviewers and content validity using expert raters. RESULTS Peer review indicated acceptable face validity. Initial expert rater review yielded a scale content validity index (CVI) rating of 0.78, with 44 of 60 items rated ≥0.75. Sixteen unacceptable items (≤0.5) required major revision or deletion. The next CVI scale average was 0.84, indicating acceptable content validity. CONCLUSION The SDBM has relevance as a self-report to rate older drivers. Future pilot testing of the SDBM comparing results with on-road testing will define criterion validity. PMID:20437917
International comparisons of behavioral and emotional problems in preschool children: parents' reports from 24 societies.

PubMed

Rescorla, Leslie A; Achenbach, Thomas M; Ivanova, Masha Y; Harder, Valerie S; Otten, Laura; Bilenberg, Niels; Bjarnadottir, Gudrun; Capron, Christiane; De Pauw, Sarah S W; Dias, Pedro; Dobrean, Anca; Döpfner, Manfred; Duyme, Michel; Eapen, Valsamma; Erol, Nese; Esmaeili, Elaheh Mohammad; Ezpeleta, Lourdes; Frigerio, Alessandra; Fung, Daniel S S; Gonçalves, Miguel; Guðmundsson, Halldór; Jeng, Suh-Fang; Jusiené, Roma; Ah Kim, Young; Kristensen, Solvejg; Liu, Jianghong; Lecannelier, Felipe; Leung, Patrick W L; Machado, Bárbara César; Montirosso, Rosario; Ja Oh, Kyung; Ooi, Yoon Phaik; Plück, Julia; Pomalima, Rolando; Pranvera, Jetishi; Schmeck, Klaus; Shahini, Mimoza; Silva, Jaime R; Simsek, Zeynep; Sourander, Andre; Valverde, José; van der Ende, Jan; Van Leeuwen, Karla G; Wu, Yen-Tzu; Yurdusen, Sema; Zubrick, Stephen R; Verhulst, Frank C

2011-01-01

International comparisons were conducted of preschool children's behavioral and emotional problems as reported on the Child Behavior Checklist for Ages 1½-5 by parents in 24 societies (N = 19,850). Item ratings were aggregated into scores on syndromes; Diagnostic and Statistical Manual of Mental Disorders-oriented scales; a Stress Problems scale; and Internalizing, Externalizing, and Total Problems scales. Effect sizes for scale score differences among the 24 societies ranged from small to medium (3-12%). Although societies differed greatly in language, culture, and other characteristics, Total Problems scores for 18 of the 24 societies were within 7.1 points of the omnicultural mean of 33.3 (on a scale of 0-198). Gender and age differences, as well as gender and age interactions with society, were all very small (effect sizes < 1%). Across all pairs of societies, correlations between mean item ratings averaged .78, and correlations between internal consistency alphas for the scales averaged .92, indicating that the rank orders of mean item ratings and internal consistencies of scales were very similar across diverse societies.
International Comparisons of Behavioral and Emotional Problems in Preschool Children: Parents’ Reports From 24 Societies

PubMed Central

Rescorla, Leslie A.; Achenbach, Thomas M.; Ivanova, Masha Y.; Harder, Valerie S.; Otten, Laura; Bilenberg, Niels; Bjarnadottir, Gudrun; Capron, Christiane; De Pauw, Sarah S. W.; Dias, Pedro; Dobrean, Anca; Döpfner, Manfred; Duyme, Michel; Eapen, Valsamma; Erol, Nese; Esmaeili, Elaheh Mohammad; Ezpeleta, Lourdes; Frigerio, Alessandra; Fung, Daniel S. S.; Gonçalves, Miguel; Guđmundsson, Halldór; Jeng, Suh-Fang; Jusiené, Roma; Kim, Young Ah; Kristensen, Solvejg; Liu, Jianghong; Lecannelier, Felipe; Leung, Patrick W. L.; Machado, Bárbara César; Montirosso, Rosario; Oh, Kyung Ja; Ooi, Yoon Phaik; Plück, Julia; Pomalima, Rolando; Pranvera, Jetishi; Schmeck, Klaus; Shahini, Mimoza; Silva, Jaime R.; Simsek, Zeynep; Sourander, Andre; Valverde, José; van der Ende, Jan; Van Leeuwen, Karla G.; Wu, Yen-Tzu; Yurdusen, Sema; Zubrick, Stephen R.; Verhulst, Frank C.

2014-01-01

International comparisons were conducted of preschool children’s behavioral and emotional problems as reported on the Child Behavior Checklist for Ages 1½–5 by parents in 24 societies (N =19,850). Item ratings were aggregated into scores on syndromes; Diagnostic and Statistical Manual of Mental Disorders–oriented scales; a Stress Problems scale; and Internalizing, Externalizing, and Total Problems scales. Effect sizes for scale score differences among the 24 societies ranged from small to medium (3–12%). Although societies differed greatly in language, culture, and other characteristics, Total Problems scores for 18 of the 24 societies were within 7.1 points of the omnicultural mean of 33.3 (on a scale of 0–198). Gender and age differences, as well as gender and age interactions with society, were all very small (effect sizes <1%). Across all pairs of societies, correlations between mean item ratings averaged .78, and correlations between internal consistency alphas for the scales averaged .92, indicating that the rank orders of mean item ratings and internal consistencies of scales were very similar across diverse societies. PMID:21534056
A symptom self-rating scale for schizophrenia (4S): psychometric properties, reliability and validity.

PubMed

Lindström, Eva; Jedenius, Erik; Levander, Sten

2009-01-01

The objective of the study was to validate a self-administrated symptom rating scale for use in patients with schizophrenia spectrum disorders by item analysis, exploration of factor structure, and analyses of reliability and validity. Data on 151 patients, initially treated by risperidone, obtained within the framework of a naturalistic Phase IV longitudinal study, were analysed by comparing patient and clinician ratings of symptoms, side-effects and global indices of illness. The Symptom Self-rating Scale for Schizophrenia (4S) is psychometrically adequate (item analysis, internal consistency, factor structure). Side-effect ratings were reliable. Symptom ratings displayed consistent associations with clinicians' ratings of corresponding symptom dimensions, suggesting construct validity. Patients had most difficulties assessing negative symptom items. Patients were well able to assess their own symptoms and drug side-effects. The factor structure of symptom ratings differs between patients and clinicians as well as how they construe global indices of illness. Clinicians focus on psychotic, patients on affective symptoms. Use of symptom self-ratings is one way to improve communication and thereby strengthen the therapeutic alliance and increase treatment adherence.
Using Rasch rating scale model to reassess the psychometric properties of the Persian version of the PedsQL™ 4.0 Generic Core Scales in school children.

PubMed

Jafari, Peyman; Bagheri, Zahra; Ayatollahi, Seyyed Mohamad Taghi; Soltani, Zahra

2012-03-13

Item response theory (IRT) is extensively used to develop adaptive instruments of health-related quality of life (HRQoL). However, each IRT model has its own function to estimate item and category parameters, and hence different results may be found using the same response categories with different IRT models. The present study used the Rasch rating scale model (RSM) to examine and reassess the psychometric properties of the Persian version of the PedsQL™ 4.0 Generic Core Scales. The PedsQL™ 4.0 Generic Core Scales was completed by 938 Iranian school children and their parents. Convergent, discriminant and construct validity of the instrument were assessed by classical test theory (CTT). The RSM was applied to investigate person and item reliability, item statistics and ordering of response categories. The CTT method showed that the scaling success rate for convergent and discriminant validity were 100% in all domains with the exception of physical health in the child self-report. Moreover, confirmatory factor analysis supported a four-factor model similar to its original version. The RSM showed that 22 out of 23 items had acceptable infit and outfit statistics (<1.4, >0.6), person reliabilities were low, item reliabilities were high, and item difficulty ranged from -1.01 to 0.71 and -0.68 to 0.43 for child self-report and parent proxy-report, respectively. Also the RSM showed that successive response categories for all items were not located in the expected order. This study revealed that, in all domains, the five response categories did not perform adequately. It is not known whether this problem is a function of the meaning of the response choices in the Persian language or an artifact of a mostly healthy population that did not use the full range of the response categories. The response categories should be evaluated in further validation studies, especially in large samples of chronically ill patients.
An in-depth psychometric analysis of the Connor-Davidson Resilience Scale: calibration with Rasch-Andrich model.

PubMed

Arias González, Víctor B; Crespo Sierra, María Teresa; Arias Martínez, Benito; Martínez-Molina, Agustín; Ponce, Fernando P

2015-09-23

The Connor-Davidson Resilience Scale (CD-RISC) is inarguably one of the best-known instruments in the field of resilience assessment. However, the criteria for the psychometric quality of the instrument were based only on classical test theory. The aim of this paper has focused on the calibration of the CD-RISC with a nonclinical sample of 444 adults using the Rasch-Andrich Rating Scale Model, in order to clarify its structure and analyze its psychometric properties at the level of item. Two items showed misfit to the model and were eliminated. The remaining 22 items form basically a unidimensional scale. The CD-RISC has good psychometric properties. The fit of both the items and the persons to the Rasch model was good, and the response categories were functioning properly. Two of the items showed differential item functioning. The CD-RISC has an obvious ceiling effect, which suggests to include more difficult items in future versions of the scale.
Bifactor and Item Response Theory Analyses of Interviewer Report Scales of Cognitive Impairment in Schizophrenia

ERIC Educational Resources Information Center

Reise, Steven P.; Ventura, Joseph; Keefe, Richard S. E.; Baade, Lyle E.; Gold, James M.; Green, Michael F.; Kern, Robert S.; Mesholam-Gately, Raquelle; Nuechterlein, Keith H.; Seidman, Larry J.; Bilder, Robert

2011-01-01

A psychometric analysis of 2 interview-based measures of cognitive deficits was conducted: the 21-item Clinical Global Impression of Cognition in Schizophrenia (CGI-CogS; Ventura et al., 2008), and the 20-item Schizophrenia Cognition Rating Scale (SCoRS; Keefe et al., 2006), which were administered on 2 occasions to a sample of people with…
A natural language screening measure for motivation to change.

PubMed

Miller, William R; Johnson, Wendy R

2008-09-01

Client motivation for change, a topic of high interest to addiction clinicians, is multidimensional and complex, and many different approaches to measurement have been tried. The current effort drew on psycholinguistic research on natural language that is used by clients to describe their own motivation. Seven addiction treatment sites participated in the development of a simple scale to measure client motivation. Twelve items were drafted to represent six potential dimensions of motivation for change that occur in natural discourse. The maximum self-rating of motivation (10 on a 0-10 scale) was the median score on all items, and 43% of respondents rated 10 on all 12 items - a substantial ceiling effect. From 1035 responses, three factors emerged representing importance, ability, and commitment - constructs that are also reflected in several theoretical models of motivation. A 3-item version of the scale, with one marker item for each of these constructs, accounted for 81% of variance in the full scale. The three items are: 1. It is important for me to . . . 2. I could . . . and 3. I am trying to . . . This offers a quick (1-minute) assessment of clients' self-reported motivation for change.
Meta-analysis of the Brief Psychiatric Rating Scale Factor Structure

ERIC Educational Resources Information Center

Shafer, Alan

2005-01-01

A meta-analysis (N=17,620; k=26) of factor analyses of the Brief Psychiatric Rating Scale (BPRS) was conducted. Analysis of the 12 items from Overall et al.'s (J. E. Overall, L. E. Hollister, & P. Pichot, 1974) 4 subscales found support for his 4 subscales. Analysis of all 18 BPRS items found 4 components similar to those of Overall et al. In a…
Progress Monitoring the Effects of Daily Report Cards across Elementary and Secondary Settings Using Direct Behavior Rating: Single Item Scales

ERIC Educational Resources Information Center

Miller, Faith G.; Crovello, Nicholas J.; Chafouleas, Sandra M.

2017-01-01

Direct Behavior Rating-Single Item Scales (DBR-SIS) have been advanced as a promising, systematic, behavioral, progress-monitoring method that is flexible, efficient, and defensible. This study aimed to extend existing literature on the use of DBR-SIS in elementary and secondary settings, and to examine methods of monitoring student progress in…
Screening for depression in advanced disease: psychometric properties, sensitivity, and specificity of two items of the Palliative Care Outcome Scale (POS).

PubMed

Antunes, Bárbara; Murtagh, Fliss; Bausewein, Claudia; Harding, Richard; Higginson, Irene J

2015-02-01

Depression is common among patients with advanced disease but often difficult to detect. To assess the Palliative care Outcome Scale (POS) (10 items) against the Geriatric Depression Scale (GDS)-10 total score and the Hospital Anxiety and Depression Scale (HADS)-Depression subscale total score and determine if the POS has appropriate items to screen for depression among people with advanced disease. This was a secondary analysis performed on five studies. Four psychometric properties were assessed: data quality, scaling assumptions, acceptability, and internal consistency (reliability). Receiver operating characteristic (ROC) curves were used to determine the area under the curve. Sensitivity, specificity, positive and negative predictive values, false positive and negative rates, and positive and negative likelihood ratios were computed. The overall sample had 416 patients from Germany and England: 144 had cancer and 267 had nonmalignant conditions. Prevalence of depression across the sample was 17.5%. Floor and ceiling effects were rare. Cronbach's alpha coefficients for POS items 7 and 8 summed, GDS-10 and HADS-Depression items varied: 0.61 (heart failure) and 0.80 (cancer). Two items combined (Item 7-feeling depressed and Item 8-feeling good about yourself) consistently presented the highest area under the ROC curve, ranging from 0.76 (95% CI 0.60, 0.93) (Germany, lung cancer) to 0.97 (95% CI 0.91, 1.0) (heart failure), highest negative predictive value, and lowest false negative rate. For the overall sample, the cutoff 2/3 presented a negative predictive value of 89.4% (95% CI 84.7, 92.8) and false negative rate of 10.6 (95% CI 7.2, 15.3). POS items 7 and 8 summed are potentially useful to screen for depression in advanced disease populations. Copyright © 2015 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Inter-rater reliability of Hamilton depression rating scale using video-recorded interviews — Focus on rater-blinding

PubMed Central

Prasad, M. Krishna; Udupa, K.; Kishore, K. R.; Thirthalli, J.; Sathyaprabha, T. N.; Gangadhar, B. N.

2009-01-01

Background: Hamilton depression rating scale (Ham-D) is the most widely used clinician rating scale for depression. There has been no Indian study that has examined the inter-rater reliability (IRR) of video-recorded interviews of the 21-item Ham-D. Aim: To study the IRR of scoring video-recorded interviews for 21-item Ham-D. Materials and Methods: Eighteen subjects with major depressive disorder involved in a larger study were interviewed using the semi-structured clinical interview of the 21-item Ham-D by a primary rater after informed consent. These interviews were video-recorded and portions edited to ensure rater blinding. Subsequently, the video-recorded interviews were rated by a “blind” rater. Both rated the different sub-domains of Ham-D according to Rhoades and Overall (1983). IRR was evaluated using intra-class correlation coefficient. Results: Excellent IRR was observed (0.9891) between the two raters. This was true for each of the primary factors and super-factors. Conclusion: Video recorded 21-item Ham-D has excellentIRR. Video-recorded interviews of Ham-D can be reliably used to blind raters in research. PMID:19881046
Reliability and Validity of Scores on the IFSP Rating Scale

ERIC Educational Resources Information Center

Jung, Lee Ann; McWilliam, R. A.

2005-01-01

Evidence is presented regarding the construct validity and internal consistency reliability of scores for an investigator-developed individualized family service plan (IFSP) rating scale. One hundred and twenty IFSPs were rated using a 12-item instrument, the IFSP Rating Scale (McWilliam & Jung, 2001). Using principal components factor…
The Hierarchical Rater Model for Rated Test Items and Its Application to Large-Scale Educational Assessment Data.

ERIC Educational Resources Information Center

Patz, Richard J.; Junker, Brian W.; Johnson, Matthew S.; Mariano, Louis T.

2002-01-01

Discusses the hierarchical rater model (HRM) of R. Patz (1996) and shows how it can be used to scale examinees and items, model aspects of consensus among raters, and model individual rater severity and consistency effects. Also shows how the HRM fits into the generalizability theory framework. Compares the HRM to the conventional item response…
Content Validation of the Scale of Teachers' Attitudes towards Inclusive Classrooms (STATIC)

ERIC Educational Resources Information Center

Nishimura, Trisha Sugita; Busse, R. T.

2016-01-01

The purpose of this study was to examine the content validity of the Scale of Teachers' Attitudes towards Inclusive Classrooms (STATIC). An expert panel of 20 special education teachers and five university faculty members provided individual item ratings on a five-point scale regarding wording and content, along with comments. Item and comment…
An Investigation of the Generalizability and Dependability of Direct Behavior Rating Single Item Scales (DBR-SIS) to Measure Academic Engagement and Disruptive Behavior of Middle School Students

ERIC Educational Resources Information Center

Chafouleas, Sandra M.; Briesch, Amy M.; Riley-Tillman, T. Chris; Christ, Theodore J.; Black, Anne C.; Kilgus, Stephen P.

2010-01-01

A total of 4 raters, including 2 teachers and 2 research assistants, used Direct Behavior Rating Single Item Scales (DBR-SIS) to measure the academic engagement and disruptive behavior of 7 middle school students across multiple occasions. Generalizability study results for the full model revealed modest to large magnitudes of variance associated…
Using Rasch Rating Scale Methodology to Examine a Behavioral Screener for Preschoolers at Risk

ERIC Educational Resources Information Center

DiStefano, Christine; Greer, Fred W.; Kamphaus, R. W.; Brown, William H.

2014-01-01

A screening instrument used to identify young children at risk for behavioral and emotional difficulties, the Behavioral and Emotional Screening System Teacher Rating Scale-Preschool was examined. The Rasch Rating Scale Method was used to provide additional information about psychometric properties of items, respondents, and the response scale.…
Generalizability of Scaling Gradients on Direct Behavior Ratings

ERIC Educational Resources Information Center

Chafouleas, Sandra M.; Christ, Theodore J.; Riley-Tillman, T. Chris

2009-01-01

Generalizability theory is used to examine the impact of scaling gradients on a single-item Direct Behavior Rating (DBR). A DBR refers to a type of rating scale used to efficiently record target behavior(s) following an observation occasion. Variance components associated with scale gradients are estimated using a random effects design for persons…
Test Review: Autism Spectrum Rating Scales

ERIC Educational Resources Information Center

Simek, Amber N.; Wahlberg, Andrea C.

2011-01-01

This article reviews Autism Spectrum Rating Scales (ASRS) which are designed to measure behaviors in children between the ages of 2 and 18 that are associated with disorders on the autism spectrum as rated by parents/caregivers and/or teachers. The rating scales include items related to behaviors associated with Autism, Asperger's Disorder, and…
The VCOP Scale: a measure of overprotection in parents of physically vulnerable children.

PubMed

Wright, L; Mullen, T; West, K; Wyatt, P

1993-11-01

A scale is developed for measuring the overprotecting vs. optimal developmental stimulation tendencies for parents of physically "vulnerable" children. A series of items were administered to parents whose parenting techniques had been rated as either highly overprotective or as optimal by a group of MDs and other professionals. Correlations were estimated between each of the items and parental tendencies as rated by professionals. Twenty-eight items were selected that provided maximum prediction of over-protection. The resulting R2 was extraordinarily high (.94). Coefficient alpha and test-retest coefficients were acceptable. It is hoped that release of the new instrument (VCOPS) at this time will allow others to join in determining the clinical and experimental validity of this scale.

Karolinska Psychodynamic Profile for Sexual Disorders: KAPP-SD. A proposal for a psychodynamic rating scale for sexual disorders.

PubMed

Soldati, Lorenzo; Köhl, John; Abraham, Georges; Bianchi Demicheli, Francesco; Wilczek, Alexander

2015-01-01

Our first objective in this paper was to review the literature on psychodynamic rating scales of sexual disorders. Our second objective, based on the findings from our review, was to develop a psychodynamic rating scale for people with sexual disorders: the KAPP-SD. We developed the KAPP-SD by modifying an existing psychodynamic rating scale, which assesses stable modes of mental functioning and character traits, the Karolinska Psychodynamic Profile (KAPP). We removed items 13 and 14 of the KAPP and replaced them with three other items-sexual fantasies, conceptions and role of gender identity, and conceptions and role of sexual orientation. These items are part of the assessment of an individual's sexuality and are used to evaluate a person with a sexual disorder psychodynamically. The KAPP-SD, a modified version of the KAPP, can be found in the Appendix. We developed the KAPP-SD in order to help sex therapists make a rigorous psychodynamic evaluation of persons with sexual disorders, which would give information on the prognosis and on the type of treatment to offer.
Measuring pretest-posttest change with a Rasch Rating Scale Model.

PubMed

Wolfe, E W; Chiu, C W

1999-01-01

When measures are taken on the same individual over time, it is difficult to determine whether observed differences are the result of changes in the person or changes in other facets of the measurement situation (e.g., interpretation of items or use of rating scale). This paper describes a method for disentangling changes in persons from changes in the interpretation of Likert-type questionnaire items and the use of rating scales (Wright, 1996a). The procedure relies on anchoring strategies to create a common frame of reference for interpreting measures that are taken at different times and provides a detailed illustration of how to implement these procedures using FACETS.
Examining Measurement Properties of an English Self-Efficacy Scale for English Language Learners in Korea

ERIC Educational Resources Information Center

Wang, Chuang; Kim, Do-Hong; Bong, Mimi; Ahn, Hyun Seon

2013-01-01

This study provides evidence for the validity of the Questionnaire of English Self-Efficacy in a sample of 167 college students in Korea. Results show that the scale measures largely satisfy the Rasch model for unidimensionality. The rating scale appeared to function effectively. The item hierarchy was consistent with the expected item order. The…
Therapist Competence in Global Mental Health: Development of the Enhancing Assessment of Common Therapeutic Factors (ENACT) Rating Scale

PubMed Central

Kohrt, Brandon A.; Jordans, Mark J.D.; Rai, Sauharda; Shrestha, Pragya; Luitel, Nagendra P.; Ramaiya, Megan; Singla, Daisy; Patel, Vikram

2015-01-01

Lack of reliable and valid measures of therapist competence is a barrier to dissemination and implementation of psychological treatments in global mental health. We developed the ENhancing Assessment of Common Therapeutic factors (ENACT) rating scale for training and supervision across settings varied by culture and access to mental health resources. We employed a four-step process in Nepal: (1) Item generation: We extracted 1,081 items (grouped into 104 domains) from 56 existing tools; role-plays with Nepali therapists generated 11 additional domains. (2) Item relevance: From the 115 domains, Nepali therapists selected 49 domains of therapeutic importance and high comprehensibility. (3) Item utility: We piloted the ENACT scale through rating role-play videotapes, patient session transcripts, and live observations of primary care workers in trainings for psychological treatments and the Mental Health Gap Action Programme (mhGAP). (4) Inter-rater reliability was acceptable for experts (intraclass correlation coefficient, ICC(2,7)=0.88 (95% confidence interval (CI) 0.81—0.93), N=7) and non-specialists (ICC(1,3)=0.67 (95% CI 0.60—0.73), N=34). In sum, the ENACT scale is an 18-item assessment for common factors in psychological treatments, including task-sharing initiatives with non-specialists across cultural settings. Further research is needed to evaluate applications for therapy quality and association with patient outcomes. PMID:25847276
Recommendations to improve the positive and negative syndrome scale (PANSS) based on item response theory.

PubMed

Levine, Stephen Z; Rabinowitz, Jonathan; Rizopoulos, Dimitris

2011-08-15

The adequacy of the Positive and Negative Syndrome Scale (PANSS) items in measuring symptom severity in schizophrenia was examined using Item Response Theory (IRT). Baseline PANSS assessments were analyzed from two multi-center clinical trials of antipsychotic medication in chronic schizophrenia (n=1872). Generally, the results showed that the PANSS (a) item ratings discriminated symptom severity best for the negative symptoms; (b) has an excess of "Severe" and "Extremely severe" rating options; and (c) assessments are more reliable at medium than very low or high levels of symptom severity. Analysis also showed that the detection of statistically and non-statistically significant differences in treatment were highly similar for the original and IRT-modified PANSS. In clinical trials of chronic schizophrenia, the PANSS appears to require the following modifications: fewer rating options, adjustment of 'Lack of judgment and insight', and improved severe symptom assessment. 2011 Elsevier Ltd. All rights reserved.
Pre-Kindergarten Scale.

ERIC Educational Resources Information Center

Flynn, Tim

This 25-item scale for rating prekindergarten children concerns personal and cognitive skills. Directions for using the scale are provided. Personal skills include personal hygiene, communication skills, eating habits, relationships with the teacher, peer relations, and personal behavior. Cognitive skills rated are verbal skills, object…
Using Rasch rating scale model to reassess the psychometric properties of the Persian version of the PedsQLTM 4.0 Generic Core Scales in school children

PubMed Central

2012-01-01

Background Item response theory (IRT) is extensively used to develop adaptive instruments of health-related quality of life (HRQoL). However, each IRT model has its own function to estimate item and category parameters, and hence different results may be found using the same response categories with different IRT models. The present study used the Rasch rating scale model (RSM) to examine and reassess the psychometric properties of the Persian version of the PedsQLTM 4.0 Generic Core Scales. Methods The PedsQLTM 4.0 Generic Core Scales was completed by 938 Iranian school children and their parents. Convergent, discriminant and construct validity of the instrument were assessed by classical test theory (CTT). The RSM was applied to investigate person and item reliability, item statistics and ordering of response categories. Results The CTT method showed that the scaling success rate for convergent and discriminant validity were 100% in all domains with the exception of physical health in the child self-report. Moreover, confirmatory factor analysis supported a four-factor model similar to its original version. The RSM showed that 22 out of 23 items had acceptable infit and outfit statistics (<1.4, >0.6), person reliabilities were low, item reliabilities were high, and item difficulty ranged from -1.01 to 0.71 and -0.68 to 0.43 for child self-report and parent proxy-report, respectively. Also the RSM showed that successive response categories for all items were not located in the expected order. Conclusions This study revealed that, in all domains, the five response categories did not perform adequately. It is not known whether this problem is a function of the meaning of the response choices in the Persian language or an artifact of a mostly healthy population that did not use the full range of the response categories. The response categories should be evaluated in further validation studies, especially in large samples of chronically ill patients. PMID:22414135
Psychometric properties of the Japanese version of short forms of the Pain Catastrophizing Scale in participants with musculoskeletal pain: A cross-sectional study.

PubMed

Nishigami, Tomohiko; Mibu, Akira; Tanaka, Katsuyoshi; Yamashita, Yuh; Watanabe, Akihisa; Tanabe, Akihito

2017-03-01

The Pain Catastrophizing Scale (PCS) is a commonly used as measure of pain catastrophizing. The scale comprises 13 items related to magnification, rumination, and helplessness. To facilitate quick screening and to reduce participant's burden, the four-item and six-item short forms of the English version of the PCS were developed. The purpose of the present study was to evaluate the psychometric properties of a Japanese version of the short forms of PCS using a contemporary approach called Rasch analysis. A total of 216 patients with musculoskeletal disorders were recruited in this study. Participants completed study measures, which included the pain intensity, the Pain Catastrophizing Scale (PCS), and the Tampa Scale of Kinesiophobia (TSK). Furthermore, the four-item (items 3, 6, 8, and 11) and six-item (items 4, 5, 6, 10, 11, and 13) short forms of the Japanese version of PCS were measured. We used Rasch analysis to analyze the psychometric properties of the original, four-item, and six-item short forms of PCS. Rasch analysis showed that both short forms of PCS had acceptable internal consistency, unidimensionality, and no notable DIF and were functional on the category rating scale. However, four-item short form of PCS had two misfit items. Six-item short form of PCS has acceptable psychometric properties and is suitable for use in participants with musculoskeletal pain. Thus, six-item can be used as brief instruments to evaluate pain catastrophizing. Copyright © 2016 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.
Developing an African youth psychosocial assessment: an application of item response theory.

PubMed

Betancourt, Theresa S; Yang, Frances; Bolton, Paul; Normand, Sharon-Lise

2014-06-01

This study aimed to refine a dimensional scale for measuring psychosocial adjustment in African youth using item response theory (IRT). A 60-item scale derived from qualitative data was administered to 667 war-affected adolescents (55% female). Exploratory factor analysis (EFA) determined the dimensionality of items based on goodness-of-fit indices. Items with loadings less than 0.4 were dropped. Confirmatory factor analysis (CFA) was used to confirm the scale's dimensionality found under the EFA. Item discrimination and difficulty were estimated using a graded response model for each subscale using weighted least squares means and variances. Predictive validity was examined through correlations between IRT scores (θ) for each subscale and ratings of functional impairment. All models were assessed using goodness-of-fit and comparative fit indices. Fisher's Information curves examined item precision at different underlying ranges of each trait. Original scale items were optimized and reconfigured into an empirically-robust 41-item scale, the African Youth Psychosocial Assessment (AYPA). Refined subscales assess internalizing and externalizing problems, prosocial attitudes/behaviors and somatic complaints without medical cause. The AYPA is a refined dimensional assessment of emotional and behavioral problems in African youth with good psychometric properties. Validation studies in other cultures are recommended. Copyright © 2014 John Wiley & Sons, Ltd.
Developing an African youth psychosocial assessment: an application of item response theory

PubMed Central

BETANCOURT, THERESA S.; YANG, FRANCES; BOLTON, PAUL; NORMAND, SHARON-LISE

2014-01-01

This study aimed to refine a dimensional scale for measuring psychosocial adjustment in African youth using item response theory (IRT). A 60-item scale derived from qualitative data was administered to 667 war-affected adolescents (55% female). Exploratory factor analysis (EFA) determined the dimensionality of items based on goodness-of-fit indices. Items with loadings less than 0.4 were dropped. Confirmatory factor analysis (CFA) was used to confirm the scale's dimensionality found under the EFA. Item discrimination and difficulty were estimated using a graded response model for each subscale using weighted least squares means and variances. Predictive validity was examined through correlations between IRT scores (θ) for each subscale and ratings of functional impairment. All models were assessed using goodness-of-fit and comparative fit indices. Fisher's Information curves examined item precision at different underlying ranges of each trait. Original scale items were optimized and reconfigured into an empirically-robust 41-item scale, the African Youth Psychosocial Assessment (AYPA). Refined subscales assess internalizing and externalizing problems, prosocial attitudes/behaviors and somatic complaints without medical cause. The AYPA is a refined dimensional assessment of emotional and behavioral problems in African youth with good psychometric properties. Validation studies in other cultures are recommended. PMID:24478113
Measuring euthymia within the Neuroticism Scale from the NEO Personality Inventory: A Mokken analysis of the Norwegian general population study for scalability.

PubMed

Bech, P; Carrozzino, D; Austin, S F; Møller, S B; Vassend, O

2016-03-15

Whereas the Eysenck Neuroticism Scale only contains items covering negative mental health to measure dysthymia, the NEO Personality Inventory (NEO-PI) contains neuroticism items covering both negative mental health and positive mental health (or euthymia). The consequence of wording items both positively and negatively within the NEO-PI has never been psychometrically investigated. The aim of this study was to perform a validation analysis of the NEO-PI neuroticism scale. Using a Norwegian general population study we examined the structure of the negatively and positively formulated items by principal component analysis (PCA). The scalability of the identified two groups of euthymia versus dysthymia items was examined by Mokken analysis. With a response rate of 90%, 1082 individuals with a completed NEO-PI were available. The PCA identified the neuroticism scale as the most distinct where 14 items had acceptable loadings for the euthymia subscale, another 14 items for the dysthymia subscale. However, the Mokken analysis coefficient of homogeneity only found acceptable scalability for the euthymia subscale. A comparison with the Eysenck Neuroticism Scale was not performed. The NEO-PI neuroticism scale contains two subscales consisting of items worded in an opposite direction where only the positive euthymia items have an acceptable scalability. Copyright © 2016 Elsevier B.V. All rights reserved.
Revised multicultural perspective index and measures of depression, life satisfaction, shyness, and self-esteem.

PubMed

Mowrer, Robert R; Parker, Keesha N

2004-12-01

In a 2002 publication, Mowrer and McCarver reported weak but significant correlations (r =.24) between scores on the Multicultural Perspective Index and scores on Neugarten, Havighurst, and Tobin's 1961 Life Satisfaction Index-A and the Life Satisfaction Scale developed in 1985 by Diener, Emmons, Larsen, and Griffin. Using 382 undergraduate students the present study reduced the Index from 42 to 29 items based on each item's correlation with total items. An additional 104 undergraduate students then completed the modified 29-item version, Rosenberg's Self-esteem Scale, Cheek and Buss's Shyness Scale, the Self-rating Depression Scale by Zung, and the Neugarten, et al. Life Satisfaction Index-A. Scores on the modified Index were negatively correlated with those on the Depression and Shyness scales and positively correlated with scores on the Self-esteem and Life Satisfaction scales (p< .05).
[German version of the Northoff catatonia rating scale (NCRS-dv) : A validated instrument for measuring catatonic symptoms].

PubMed

Hirjak, D; Thomann, P A; Northoff, G; Kubera, K M; Wolf, R C

2017-07-01

The clinical picture of catatonia includes impressive motor phenomena, such as rigidity, dyskinesia, festination, negativism, posturing, catalepsy, stereotypies and mannerisms, along with affective (e. g. aggression, anxiety, anhedonism or emotional lability) and behavioral symptoms (e.g. mutism, autism, excitement, echolalia or echopraxia). In English speaking countries seven catatonia rating scales have been introduced, which are widely used in clinical and scientific practice. In contrast, only one validated catatonia rating scale is available in Germany so far. In this paper, we introduce the German version of the Northoff catatonia rating scale (NCRS-dv). The original English version of the NCRS consists of 40 items describing motor (13 items), affective (12 items) and behavioral (15 items) catatonic symptoms. The NCRS shows high internal reliability (Crombachs alpha = 0.87), high interrater (r = 0.80-0.96) and high intrarater (r = 0.80-0.95) reliability. Factor analysis of the NCRS revealed four domains: affective, hyperactive or excited, hypoactive or retarded and behavior with individual eigenvalues of 8.98, 3.61, 2.98 and 2.82, respectively, which explained 21.5 %, 9.3 %, 7.6 % and 7.2 % of variance, respectively. In conclusion, the NCRS-dv represents a second validated instrument which can be used by German clinicians and scientists for the assessment of catatonic symptoms.
Development of a Measure of Mood State for Children: The MINIMOOD

ERIC Educational Resources Information Center

Lynch, Mervin D.; Foley-Peres, Kathleen; Sullivan, Stefanie

2008-01-01

The purposes of the present study were to develop and validate a mood scale measure for elementary grade school children. Graduate students generated a sampling of mood state items, 30 to use in a pilot study and 60 to use in a study to develop and validate this scale. Ratings were obtained on five point scale choices on each of the items from a…
Development and validation of the Medical Student Scholar-Ideal Mentor Scale (MSS-IMS).

PubMed

Sozio, Stephen M; Chan, Kitty S; Beach, Mary Catherine

2017-08-08

Programs encouraging medical student research such as Scholarly Concentrations (SC) are increasing nationally. However, there are few validated measures of mentoring quality tailored to medical students. We sought to modify and validate a mentoring scale for use in medical student research experiences. SC faculty created a scale evaluating how medical students assess mentors in the research setting. A validated graduate student scale of mentorship, the Ideal Mentor Scale, was modified by selecting 10 of the 34 original items most relevant for medical students and adding an item on project ownership. We administered this 11-item assessment to second year medical students in the Johns Hopkins University SC Program from 2011 to 2016, and performed exploratory factor analysis with oblique rotation to determine included items and subscales. We correlate overall mentoring quality scale and subscales with four student outcomes: 'very satisfied' with mentor, 'more likely' to do future research, project accepted at a national meeting, and highest SC faculty rating of student project. Five hundred ninety-eight students responded (87% response rate). After factor analysis, we eliminated three items producing a final scale of overall mentoring quality (8 items, Cronbach's alpha = 0.92) with three subscales: advocacy, responsiveness, and assistance. The overall mentoring quality scale was significantly associated with all four student outcomes, including mentor satisfaction: OR [(95% CI), p-value] 1.66 [(1.53-1.79), p < 0.001]; likelihood of future research: OR 1.06 [(1.03-1.09), p < 0.001]; abstract submission to national meetings: OR 1.05 [(1.02-1.08), p = 0.002]; and SC faculty rating of student projects: OR 1.08 [(1.03-1.14), p = 0.004]. Each subscale also correlated with overall mentor satisfaction, and the strongest relationship of each subscale was seen with 'mentor advocacy.' Mentor quality can be reliably measured and associates with important medical student scholarly outcomes. Given the lack of tools, this scale can be used by other SC Programs to advance medical students' scholarship.
Perpetration of Severe Intimate Partner Violence: Premilitary and Second Year of Service Rates

DTIC Science & Technology

2004-04-01

Of the 18 CTS items, only the 5 items comprising the severe physical violence scale were used in the present study . These items asked whether the...Gelles R: Physical violence in American families. New Brunswick, NJ, Transaction Publishers, 1990. 5. Straus MA: Measuring intrafamily conflict and...NW WASHINGTON, DC 20372-5300 Perpetration of Severe Intimate Partner Violence : Premilitary and Second Year of Service Rates
Rasch Analysis for Psychometric Improvement of Science Attitude Rating Scales

ERIC Educational Resources Information Center

Oon, Pey-Tee; Fan, Xitao

2017-01-01

Students' attitude towards science (SAS) is often a subject of investigation in science education research. Survey of rating scale is commonly used in the study of SAS. The present study illustrates how Rasch analysis can be used to provide psychometric information of SAS rating scales. The analyses were conducted on a 20-item SAS scale used in an…
The MMPI-2-RF Personality Psychopathology Five (PSY-5-RF) scales: development and validity research.

PubMed

Harkness, Allan R; McNulty, John L; Finn, Jacob A; Reynolds, Shannon M; Shields, Susan M; Arbisi, Paul

2014-01-01

This article describes the development, internal psychometric, and external validation studies on scales designed to measure the Personality Psychopathology Five (PSY-5) from MMPI-2 Restructured Form (MMPI-2-RF) items. Diverse and comprehensive data sets, representing various clinical and nonclinical populations, were classified into development and validation research samples. Item selection, retention, and exclusion procedures are detailed. The final set of PSY-5-RF scales contain 104 items, with no item overlap between scales (same as the original MMPI-2 PSY-5 scales), and no item overlap with the Demoralization scale. Internal consistency estimates are comparable to the longer MMPI-2 PSY-5 scales. Appropriate convergent and discriminant validity findings utilizing various self-report, collateral rating, and record review data are reported and discussed. A particular emphasis is offered for the unique aspects of the PSY-5 model: psychoticism and disconstraint. The findings are connected to the broader PSY-5 literature and the recommended review of systems (Harkness, Reynolds, & Lilienfeld, this issue) presented in this series of articles.
Behavioral/Emotional Problems of Preschoolers: Caregiver/Teacher Reports From 15 Societies.

PubMed

Rescorla, Leslie A; Achenbach, Thomas M; Ivanova, Masha Y; Bilenberg, Niels; Bjarnadottir, Gudrun; Denner, Silvia; Dias, Pedro; Dobrean, Anca; Döpfner, Manfred; Frigerio, Alessandra; Gonçalves, Miguel; Guđmundsson, Halldór; Jusiene, Roma; Kristensen, Solvejg; Lecannelier, Felipe; Leung, Patrick W L; Liu, Jianghong; Löbel, Sofia P; Machado, Bárbara César; Markovic, Jasminka; Mas, Paola A; Esmaeili, Elaheh Mohammad; Montirosso, Rosario; Plück, Julia; Pronaj, Adelina Ahmeti; Rodriguez, Jorge T; Rojas, Pamela O; Schmeck, Klaus; Shahini, Mimoza; Silva, Jaime R; van der Ende, Jan; Verhulst, Frank C

2012-01-01

This study tested societal effects on caregiver/teacher ratings of behavioral/emotional problems for 10,521 preschoolers from 15 societies. Many societies had problem scale scores within a relatively narrow range, despite differences in language, culture, and other characteristics. The small age and gender effects were quite similar across societies. The rank orders of mean item ratings were similar across diverse societies. For 7,380 children from 13 societies, ratings were also obtained from a parent. In all 13 societies, mean Total Problems scores derived from parent ratings were significantly higher than mean Total Problems scores derived from caregiver/teacher ratings, although the size of the difference varied somewhat across societies. Mean cross-informant agreement for problem scale scores varied across societies. Societies were very similar with respect to which problem items, on average, received high versus low ratings from parents and caregivers/teachers. Within every society, cross-informant agreement for item ratings varied widely across children. In most respects, results were quite similar across 15 very diverse societies.
Behavioral/Emotional Problems of Preschoolers: Caregiver/Teacher Reports From 15 Societies

PubMed Central

Rescorla, Leslie A.; Achenbach, Thomas M.; Ivanova, Masha Y.; Bilenberg, Niels; Bjarnadottir, Gudrun; Denner, Silvia; Dias, Pedro; Dobrean, Anca; Döpfner, Manfred; Frigerio, Alessandra; Gonçalves, Miguel; Guđmundsson, Halldór; Jusiene, Roma; Kristensen, Solvejg; Lecannelier, Felipe; Leung, Patrick W. L.; Liu, Jianghong; Löbel, Sofia P.; Machado, Bárbara César; Markovic, Jasminka; Mas, Paola A.; Esmaeili, Elaheh Mohammad; Montirosso, Rosario; Plück, Julia; Pronaj, Adelina Ahmeti; Rodriguez, Jorge T.; Rojas, Pamela O.; Schmeck, Klaus; Shahini, Mimoza; Silva, Jaime R.; van der Ende, Jan; Verhulst, Frank C.

2017-01-01

This study tested societal effects on caregiver/teacher ratings of behavioral/emotional problems for 10,521 preschoolers from 15 societies. Many societies had problem scale scores within a relatively narrow range, despite differences in language, culture, and other characteristics. The small age and gender effects were quite similar across societies. The rank orders of mean item ratings were similar across diverse societies. For 7,380 children from 13 societies, ratings were also obtained from a parent. In all 13 societies, mean Total Problems scores derived from parent ratings were significantly higher than mean Total Problems scores derived from caregiver/teacher ratings, although the size of the difference varied somewhat across societies. Mean cross-informant agreement for problem scale scores varied across societies. Societies were very similar with respect to which problem items, on average, received high versus low ratings from parents and caregivers/teachers. Within every society, cross-informant agreement for item ratings varied widely across children. In most respects, results were quite similar across 15 very diverse societies. PMID:29416292

Comparison of scales for evaluating premenstrual symptoms in women using oral contraceptives.

PubMed

Coffee, Andrea L; Kuehl, Thomas J; Sulak, Patricia J

2008-05-01

To compare two scales used in research to evaluate daily premenstrual mood symptoms during use of a monophasic oral contraceptive. Subanalysis of data from a prospective study. University-affiliated medical center. SUBJECTS; One hundred two reproductive-aged (18-48 yrs) women taking a monophasic oral contraceptive containing ethinyl estradiol and drospirenone in the standard 21-7 fashion (21 days of hormones followed by 7 days of placebo), and who had self-identified premenstrual symptoms of headache, mood changes, or pelvic pain. Subjects completed a single-item questionnaire, the Scott & White Daily Diary of Symptoms, and a multiple-item questionnaire, the Penn State Daily Symptom Report (DSR), to assess their premenstrual symptoms. The Scott & White diary used a visual analog scale of 0-10 to assess pelvic pain, headache, and mood (a composite of anxiety, depression, and irritability). The Penn State DSR contained 17 items: 10 behavioral and seven physical components, each rated on a scale of 0-4, with one item that specifically rated mood swings. Scores from the two scales were compared by using Spearman correlation coefficients, the Kendall W for concordance, and linear regression of ranked sums for study cycles. The Scott & White mood score significantly correlated with the total of the 17 items on the Penn State DSR, as well as the 10 behavioral items, the seven physical items, and the single mood-swing item (p<0.0001); specific coefficients of concordance were 0.44, 0.23, 0.10, and 0.28, respectively, and R2 values were 0.39, 0.39, 0.30, and 0.34, respectively. The daily Scott & White mood score was positively correlated with all 17 elements of the Penn State DSR (0.25-0.57). The greatest correlation was seen with the mood-swing element. Both instruments demonstrated the same patterns during the 21-7 oral contraceptive cycle, with symptoms increasing immediately before and peaking during the 7-day hormone-free interval. A single-item daily mood score using a rating scale of 0-10 was concordant with a relatively complex 17-element symptom index and demonstrated the same pattern of change during cycles of oral contraception. The simple scoring system offers an advantage, especially in clinical studies of long duration.
Identifying patient fear-avoidance beliefs by physical therapists managing patients with low back pain.

PubMed

Calley, Darren Q; Jackson, Steven; Collins, Heather; George, Steven Z

2010-12-01

Cross-sectional. To evaluate the accuracy with which physical therapists identify fear-avoidance beliefs in patients with low back pain by comparing therapist ratings of perceived patient fear-avoidance to the Fear-Avoidance Beliefs Questionnaire (FABQ), Tampa Scale of Kinesiophobia 11-item (TSK-11), and Pain Catastrophizing Scale (PCS). To compare the concurrent validity of therapist ratings of perceived patient fear-avoidance and a 2-item questionnaire on fear of physical activity and harm, with clinical measures of fear-avoidance (FABQ, TSK-11, PCS), pain intensity as assessed with a numeric pain rating scale (NPRS), and disability as assessed with the Oswestry Disability Questionnaire (ODQ). The need to consider psychosocial factors for identifying patients at risk for disability and chronic low back pain has been well documented. Yet the ability of physical therapists to identify fear-avoidance beliefs using direct observation has not been studied. Eight physical therapists and 80 patients with low back pain from 3 physical therapy clinics participated in the study. Patients completed the FABQ, TSK-11, PCS, ODQ, NPRS, and a dichotomous 2-item fear-avoidance screening questionnaire. Following the initial evaluation, physical therapists rated perceived patient fear-avoidance on a 0-to-10 scale and recorded 2 influences on their ratings. Spearman correlation and independent t tests determined the level of association of therapist 0-to-10 ratings and 2-item screening with fear-avoidance and clinical measures. Therapist ratings of perceived patient fear-avoidance had fair to moderate interrater reliability (ICC2,1 = 0.663). Therapist ratings did not strongly correlate with FABQ or TSK-11 scores. Instead, they unexpectedly had stronger associations with ODQ and PCS scores. Both 2-item screening questions were associated with FABQ-physical activity scores, while the fear of physical activity question was also associated with FABQ-work, TSK-11, PCS, and ODQ scores. Therapists' ratings of perceived patient fear-avoidance were not associated with self-reported fear-avoidance scores, showing a potential disconnect between therapist judgments and commonly used fear-avoidance measures. Instead, therapist ratings had small but statistically significant correlations with pain catastrophizing and disability, findings that may support therapists' inability to discriminate fear-avoidance from these other factors. The 2-item screening questions based on fear of physical activity and harm showed potential to identify elevated FABQ physical activity scores. Differential diagnosis, level 2b.
[French translation and validation of the Scale to assess Unawareness of Mental Disorder (SUMD) in patients with schizophrenics].

PubMed

Paillot, C; Ingrand, P; Millet, B; Amador, X-F; Senon, J-L; Olié, J-P; Jaafari, N

2010-12-01

The Scale to assess Unawareness of Mental Disorder (SUMD) is a semi-structured interview based on a dimensional and quantitative approach of insight. Different forms of insight are assessed: global insight into mental illness, insight into symptoms and insight into symptom aetiology (i.e. attribution). The SUMD divides the recognition of mental disorders into two concepts: awareness of, and attribution for mental disorders. Awareness relates to the subject's ability to recognize that the phenomenon in question is present, whereas attribution refers to explanations as to cause or source of these signs or symptoms. Thus, the scale distinguishes between the recognition of a symptom and its explanation. For example, the scale allows the investigator to distinguish between a patient's ability to recognize visual hallucinations as such (false perceptions), from his/her ability to explain their cause (e.g. due to mental illness or not). The aim of this study was to translate the SUMD (version 3.1 revised) and test its convergent validity among 43 French adult inpatients diagnosed with schizophrenia according to DSM-IV-TR criteria. Awareness of mental disorder was assessed using the SUMD and the Hamilton Rating Scale for Depression (HAMD) insight item (item 17) respectively, as done in the original English validation study. The SUMD was translated into French then back-translated into English. The back-translation was performed by both English and French native speakers who had no prior knowledge of the scale (the back translation was reviewed by one of the SUMD's authors, Dr Amador, for accuracy). The SUMD manual (v.2/14/99) was also translated into French. Concerning the SUMD directions followed in this study, the first three SUMD items, which are called general items: G1 "Awareness of mental disorder", G2 "Awareness of the achieved effects of medication" and G3 "Awareness of the social consequences of mental disorder" were systematically rated. However, symptom items (four through 20) are not always relevant for every patient. Indeed, for each symptom-item on the scale, it must first be ascertained that the patient has exhibited the particular symptom during the period under investigation. Therefore, for every patient, the symptom checklist was completed prior to filling out the scale, in order to determine which symptom-items were relevant. In addition, symptom attribution items are rated only if the subject received a score between 1 and 3 on the awareness item. Two periods of time of insight were assessed: "current" insight involved rating the highest level of awareness obtained at the time of the interview for the psychopathology present at anytime during the past 7 days. "Past" insight was defined as the present level of awareness during the period of time preceding the current period of investigation. The French translation of the SUMD achieved good convergent validity with the insight item of the Hamilton rating scale for depression. The SUMD has proven to be a reliable and valid instrument to assess insight into schizophrenia. The more psychometrically sound rating tools we have at our disposal, many of which have been published in non French journals, the more we will be able to sharpen our assessment of insight into schizophrenia. We are facing an epistemic paradox in which quantification helps description, i.e. we need to have access to different rating tools to measure insight in order to improve our knowledge of the causes, course and treatment of poor insight into mental disorders. Copyright © 2010 L'Encéphale, Paris. Published by Elsevier Masson SAS. All rights reserved.
Item Response Theory Modeling and Categorical Regression Analyses of the Five-Factor Model Rating Form: A Study on Italian Community-Dwelling Adolescent Participants and Adult Participants.

PubMed

Fossati, Andrea; Widiger, Thomas A; Borroni, Serena; Maffei, Cesare; Somma, Antonella

2017-06-01

To extend the evidence on the reliability and construct validity of the Five-Factor Model Rating Form (FFMRF) in its self-report version, two independent samples of Italian participants, which were composed of 510 adolescent high school students and 457 community-dwelling adults, respectively, were administered the FFMRF in its Italian translation. Adolescent participants were also administered the Italian translation of the Borderline Personality Features Scale for Children-11 (BPFSC-11), whereas adult participants were administered the Italian translation of the Triarchic Psychopathy Measure (TriPM). Cronbach α values were consistent with previous findings; in both samples, average interitem r values indicated acceptable internal consistency for all FFMRF scales. A multidimensional graded item response theory model indicated that the majority of FFMRF items had adequate discrimination parameters; information indices supported the reliability of the FFMRF scales. Both categorical (i.e., item-level) and scale-level regression analyses suggested that the FFMRF scores may predict a nonnegligible amount of variance in the BPFSC-11 total score in adolescent participants, and in the TriPM scale scores in adult participants.
Cross-National Prevalence of Traditional Bullying, Traditional Victimization, Cyberbullying and Cyber-Victimization: Comparing Single-Item and Multiple-Item Approaches of Measurement

ERIC Educational Resources Information Center

Yanagida, Takuya; Gradinger, Petra; Strohmeier, Dagmar; Solomontos-Kountouri, Olga; Trip, Simona; Bora, Carmen

2016-01-01

Many large-scale cross-national studies rely on a single-item measurement when comparing prevalence rates of traditional bullying, traditional victimization, cyberbullying, and cyber-victimization between countries. However, the reliability and validity of single-item measurement approaches are highly problematic and might be biased. Data from…
The Rothschild Scale for Antidepressant Tachyphylaxis: reliability and validity.

PubMed

Rothschild, Anthony J

2008-01-01

After successful treatment of an episode of major depression, many patients complain of symptoms of apathy or decreased motivation (described by patients as "the blahs"), fatigue, dullness in cognitive function, sleep disturbance, weight gain, and sexual dysfunction; however, the characterization of this phenomenon of antidepressant tachyphylaxis has been hampered by the lack of an accepted definition and a reliable and valid assessment tool. To address this problem, the development and assessment of the Rothschild Scale for Antidepressant Tachyphylaxis (RSAT) are described. The RSAT consists of 6 self-report items assessing energy level, motivation and interest, cognitive functioning, weight gain, sleep, and sexual functioning. A seventh item, affect, is assessed by the interviewer. Each item is measured within a 5-point ordinal scale with anchor points developed to illustrate each rating. This study assesses the internal consistency, test-retest reliability, convergent and discriminant validity, sensitivity, specificity, and positive and negative predictive values of the RSAT. The RSAT demonstrated excellent internal consistency and scale reliability (Cronbach alpha = .902). The RSAT also demonstrated strong test-retest reliability (for depressed patients: r = 0.822, P < .01; for control subjects: r = 0.887, P < .01). The total RSAT score did not correlate with severity of depression as measured by the total Hamilton Depression Rating Scale score or the Hamilton Depression Rating Scale item 1 (depressed mood), supporting the discriminant validity of the RSAT for use in antidepressant tachyphylaxis. The RSAT is a reliable measure of antidepressant tachyphylaxis.
Bridging the Gap: Direct Behavior Rating-Single Item Scales

ERIC Educational Resources Information Center

Miller, Faith G.; Crovello, Nicholas; Swenson, Nicole

2017-01-01

Direct Behavior Ratings (DBRs) are behavioral assessment methods that combine the benefits of systematic direct observation and behavior rating scales. That is, DBRs involve the observation of operationally defined target behaviors during a prespecified observation period and the evaluation of those behaviors via brief ratings. In this way, DBR is…
The Motivation and Pleasure Scale-Self-Report (MAP-SR): reliability and validity of a self-report measure of negative symptoms.

PubMed

Llerena, Katiah; Park, Stephanie G; McCarthy, Julie M; Couture, Shannon M; Bennett, Melanie E; Blanchard, Jack J

2013-07-01

The Clinical Assessment Interview for Negative Symptoms (CAINS) is an empirically developed interview measure of negative symptoms. Building on prior work, this study examined the reliability and validity of a self-report measure based on the CAINS-the Motivation and Pleasure Scale-Self-Report (MAP-SR)-that assesses the motivation and pleasure domain of negative symptoms. Thirty-seven participants with schizophrenia or schizoaffective disorder completed the 18-item MAP-SR, the CAINS, and other measures of functional outcome. Item analyses revealed three items that performed poorly. The revised 15-item MAP-SR demonstrated good internal consistency and convergent validity with the clinician-rated Motivation and Pleasure scale of the CAINS, as well as good discriminant validity, with little association with psychotic symptoms or depression/anxiety. MAP-SR scores were related to social anhedonia, social closeness, and clinician-rated social functioning. The MAP-SR is a promising self-report measure of severity of negative symptoms. Copyright © 2013 Elsevier Inc. All rights reserved.
RhinAsthma patient perspective: A Rasch validation study.

PubMed

Molinengo, Giorgia; Baiardini, Ilaria; Braido, Fulvio; Loera, Barbara

2018-02-01

In daily practice, Health-Related Quality of Life (HRQoL) tools are useful for supplementing clinical data with the patient's perspective. To encourage their use by clinicians, the availability of tools that can quickly provide valid results is crucial. A new HRQoL tool has been proposed for patients with asthma and rhinitis: the RhinAsthma Patient Perspective-RAPP. The aim of this study was to evaluate the psychometric robustness of the RAPP using the Item Response Theory (IRT) approach, to evaluate the scalability of items and test whether or not patients use the items response scale correctly. 155 patients (53.5% women, mean age 39.1, range 16-76) were recruited during a multicenter study. RAPP metric properties were investigated using IRT models. Differential item functioning (DIF) was used for gender, age, and asthma control test (ACT). The RAPP adequately fitted the Rating Scale model, demonstrating the equality of the rating scale structure for all items. All statistics on items were satisfactory. The RAPP had adequate internal reliability and showed good ability to discriminate among different groups of participants. DIF analysis indicated that there were no differential item functioning issues for gender. One item showed a DIF by age and four items by ACT. The psychometric evaluation performed using IRT models demonstrated that the RAPP met all the criteria to be considered a reliable and valid method of measurement. From a clinical perspective, this will allow physicians to confidently interpret scores as good indicators of Quality of Life of patients with asthma.
Comparison of the Fullerton Advanced Balance Scale, Mini-BESTest, and Berg Balance Scale to Predict Falls in Parkinson Disease.

PubMed

Schlenstedt, Christian; Brombacher, Stephanie; Hartwigsen, Gesa; Weisser, Burkhard; Möller, Bettina; Deuschl, Günther

2016-04-01

The correct identification of patients with Parkinson disease (PD) at risk for falling is important to initiate appropriate treatment early. This study compared the Fullerton Advanced Balance (FAB) scale with the Mini-Balance Evaluation Systems Test (Mini-BESTest) and Berg Balance Scale (BBS) to identify individuals with PD at risk for falls and to analyze which of the items of the scales best predict future falls. This was a prospective study to assess predictive criterion-related validity. The study was conducted at a university hospital in an urban community. Eighty-five patients with idiopathic PD (Hoehn and Yahr stages: 1-4) participated in the study. Measures were number of falls (assessed prospectively over 6 months), FAB scale, Mini-BESTest, BBS, and Unified Parkinson's Disease Rating Scale. The FAB scale, Mini-BESTest, and BBS showed similar accuracy to predict future falls, with values for area under the curve (AUC) of the receiver operating characteristic (ROC) curve of 0.68, 0.65, and 0.69, respectively. A model combining the items "tandem stance," "rise to toes," "one-leg stance," "compensatory stepping backward," "turning," and "placing alternate foot on stool" had an AUC of 0.84 of the ROC curve. There was a dropout rate of 19/85 participants. The FAB scale, Mini-BESTest, and BBS provide moderate capacity to predict "fallers" (people with one or more falls) from "nonfallers." Only some items of the 3 scales contribute to the detection of future falls. Clinicians should particularly focus on the item "tandem stance" along with the items "one-leg stance," "rise to toes," "compensatory stepping backward," "turning 360°," and "placing foot on stool" when analyzing postural control deficits related to fall risk. Future research should analyze whether balance training including the aforementioned items is effective in reducing fall risk. © 2016 American Physical Therapy Association.
Correspondence of verbal descriptor and numeric rating scales for pain intensity: an item response theory calibration.

PubMed

Edelen, Maria Orlando; Saliba, Debra

2010-07-01

Assessing pain intensity in older adults is critical and challenging. There is debate about the most effective way to ask older adults to describe their pain severity, and clinicians vary in their preferred approaches, making comparison of pain intensity scores across settings difficult. A total of 3,676 residents from 71 community nursing homes across eight states were asked about pain presence. The 1,960 residents who reported pain within the past 5 days (53% of total, 70% female; age: M = 77.9, SD = 12.4) were included in analyses. Those who reported pain were also asked to provide a rating of pain intensity using either a verbal descriptor scale (VDS; mild, moderate, severe, and very severe and horrible), a numeric rating scale (NRS; 0 = no pain to 10 = worst pain imaginable), or both. We used item response theory (IRT) methods to identify the correspondence between the VDS and the NRS response options by estimating item parameters for these and five additional pain items. The sample reported moderate amounts of pain on average. Examination of the IRT location parameters for the pain intensity items indicated the following approximate correspondence: VDS mild approximately NRS 1-4, VDS moderate approximately NRS 5-7, VDS severe approximately NRS 8-9, and VDS very severe, horrible approximately NRS 10. This IRT calibration provides a crosswalk between the two response scales so that either can be used in practice depending on the preference of the clinician and respondent.
A clinimetric approach to assessing quality of life in epilepsy.

PubMed

Cramer, J A

1993-01-01

Clinimetrics is a concept involving the use of rating scales for clinical phenomena ranging from physical examinations to functional performance. Clinimetric or rating scales can be used for defining patient status and changes that occur during long-term observation. The scores derived from such scales can be used as guidelines for intervention, treatment, or prediction of outcome. In epilepsy, clinimetric scales have been developed for assessing seizure frequency, seizure severity, adverse effects related to antiepileptic drugs (AEDs), and quality of life after surgery for epilepsy. The VA Epilepsy Cooperative Study seizure rating scale combines frequency and severity in a weighted scoring system for simple and complex partial and generalized tonic-clonic seizures, summing all items in a total seizure score. Similarly, the rating scales for systemic toxicity and neurotoxicity use scores weighted for severity for assessing specific adverse effects typically related to AEDs. A composite score, obtained by adding the scores for seizures, systemic toxicity, and neurotoxicity, represents the overall status of the patient at a given time. The Chalfont Seizure Severity Scale also applies scores relative to the impact of a given item on the patient, without factoring in seizure frequency. The Liverpool Seizure Severity Scale is a patient questionnaire covering perceived seizure severity and the impact of ictal and postictal events. The UCLA Epilepsy Surgery Inventory (ESI-55) assesses quality of life for patients who have undergone surgery for epilepsy using generic health status instruments with additional epilepsy-specific items.(ABSTRACT TRUNCATED AT 250 WORDS)
Direct Behavior Rating Instrumentation: Evaluating the Impact of Scale Formats

ERIC Educational Resources Information Center

Miller, Faith G.; Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Schardt, Alyssa A.

2017-01-01

The purpose of this study was to investigate the impact of two different Direct Behavior Rating--Single Item Scale (DBR-SIS) formats on rating accuracy. A total of 119 undergraduate students participated in one of two study conditions, each utilizing a different DBR-SIS scale format: one that included percentage of time anchors on the DBR-SIS…
A protocol for the Hamilton Rating Scale for Depression: Item scoring rules, Rater training, and outcome accuracy with data on its application in a clinical trial.

PubMed

Rohan, Kelly J; Rough, Jennifer N; Evans, Maggie; Ho, Sheau-Yan; Meyerhoff, Jonah; Roberts, Lorinda M; Vacek, Pamela M

2016-08-01

We present a fully articulated protocol for the Hamilton Rating Scale for Depression (HAM-D), including item scoring rules, rater training procedures, and a data management algorithm to increase accuracy of scores prior to outcome analyses. The latter involves identifying potentially inaccurate scores as interviews with discrepancies between two independent raters on the basis of either scores >=5-point difference) or meeting threshold for depression recurrence status, a long-term treatment outcome with public health significance. Discrepancies are resolved by assigning two new raters, identifying items with disagreement per an algorithm, and reaching consensus on the most accurate scores for those items. These methods were applied in a clinical trial where the primary outcome was the Structured Interview Guide for the Hamilton Rating Scale for Depression-Seasonal Affective Disorder version (SIGH-SAD), which includes the 21-item HAM-D and 8 items assessing atypical symptoms. 177 seasonally depressed adult patients were enrolled and interviewed at 10 time points across treatment and the 2-year followup interval for a total of 1589 completed interviews with 1535 (96.6%) archived. Inter-rater reliability ranged from ICCs of .923-.967. Only 86 (5.6%) interviews met criteria for a between-rater discrepancy. HAM-D items "Depressed Mood", "Work and Activities", "Middle Insomnia", and "Hypochondriasis" and Atypical items "Fatigability" and "Hypersomnia" contributed most to discrepancies. Generalizability beyond well-trained, experienced raters in a clinical trial is unknown. Researchers might want to consider adopting this protocol in part or full. Clinicians might want to tailor it to their needs. Copyright © 2016 Elsevier B.V. All rights reserved.
Evaluation of the Edinburgh Post Natal Depression Scale using Rasch analysis

PubMed Central

Pallant, Julie F; Miller, Renée L; Tennant, Alan

2006-01-01

Background The Edinburgh Postnatal Depression Scale (EPDS) is a 10 item self-rating post-natal depression scale which has seen widespread use in epidemiological and clinical studies. Concern has been raised over the validity of the EPDS as a single summed scale, with suggestions that it measures two separate aspects, one of depressive feelings, the other of anxiety. Methods As part of a larger cross-sectional study conducted in Melbourne, Australia, a community sample (324 women, ranging in age from 18 to 44 years: mean = 32 yrs, SD = 4.6), was obtained by inviting primiparous women to participate voluntarily in this study. Data from the EPDS were fitted to the Rasch measurement model and tested for appropriate category ordering, for item bias through Differential Item Functioning (DIF) analysis, and for unidimensionality through tests of the assumption of local independence. Results Rasch analysis of the data from the ten item scale initially demonstrated a lack of fit to the model with a significant Item-Trait Interaction total chi-square (chi Square = 82.8, df = 40; p < .001). Removal of two items (items 7 and 8) resulted in a non-significant Item-Trait Interaction total chi-square with a residual mean value for items of -0.467 with a standard deviation of 0.850, showing fit to the model. No DIF existed in the final 8-item scale (EPDS-8) and all items showed fit to model expectations. Principal Components Analysis of the residuals supported the local independence assumption, and unidimensionality of the revised EPDS-8 scale. Revised cut points were identified for EPDS-8 to maintain the case identification of the original scale. Conclusion The results of this study suggest that EPDS, in its original 10 item form, is not a viable scale for the unidimensional measurement of depression. Rasch analysis suggests that a revised eight item version (EPDS-8) would provide a more psychometrically robust scale. The revised cut points of 7/8 and 9/10 for the EPDS-8 show high levels of agreement with the original case identification for the EPDS-10. PMID:16768803
Rasch analysis of the London Handicap Scale in stroke patients: a cross-sectional study.

PubMed

Park, Eun-Young; Choi, Yoo-Im

2014-07-31

Although activity and participation are the target domains in stroke rehabilitation interventions, there is insufficient evidence available regarding the validity of participation measurement. The purpose of this study was to investigate the psychometric properties of the London Handicap Scale in community-dwelling stroke patients, using Rasch analysis. Participants were 170 community-dwelling stroke survivors. The data were analyzed using Winsteps (version 3.62) with the Rasch model to determine the unidimensionality of item fit, the distribution of item difficulty, and the reliability and suitability of the rating process for the London Handicap Scale. Data of 16 participants did not fit the Rasch model and there were no misfitting items. The person separation value was 2.42, and the reliability was .85; furthermore, the rating process for the London Handicap Scale was found to be suitable for use with stroke patients. This was the first trial to investigate the psychometric properties of the London Handicap Scale using Rasch analysis; the results supported the suitability of this scale for use with stroke patients.
Development and Validation of the Physics Anxiety Rating Scale

ERIC Educational Resources Information Center

Sahin, Mehmet; Caliskan, Serap; Dilek, Ufuk

2015-01-01

This study reports the development and validation process for an instrument to measure university students' anxiety in physics courses. The development of the Physics Anxiety Rating Scale (PARS) included the following steps: Generation of scale items, content validation, construct validation, and reliability calculation. The results of construct…
Validity of personality measurement in adults with anxiety disorders: psychometric properties of the Spanish NEO-FFI-R using Rasch analyses

PubMed Central

Inchausti, Felix; Mole, Joe; Fonseca-Pedrero, Eduardo; Ortuño-Sierra, Javier

2015-01-01

The aim of this study was to analyse the psychometric properties of the Spanish NEO Five Factor Inventory–Revised (NEO-FFI-R) using Rasch analyses, in order to test its rating scale functioning, the reliability of scores, internal structure, and differential item functioning (DIF) by gender in a psychiatric sample. The NEO-FFI-R responses of 433 Spanish adults (154 males) with an anxiety disorder as primary diagnosis were analysed using the Rasch model for rating scales. Two intermediate categories of response (‘neutral’ and ‘agree’) malfunctioned in the Neuroticism and Conscientiousness scales. In addition, model reliabilities were lower than expected in Agreeableness and Neuroticism, and the item fit values indicated each scale had items that did not achieve moderate to high discrimination on its dimension, particularly in the Agreeableness scale. Concerning unidimensionality, the five NEO-FFI-R scales showed large first components of unexplained variance. Finally, DIF by gender was detected in many items. The results suggest that the scores of the Spanish NEO-FFI-R are unreliable in psychiatric samples and cannot be generalized between males and females, especially in the Openness, Conscientiousness, and Agreeableness scales. Future directions for testing and refinement should be developed before the NEO-FFI-R can be used reliably in clinical samples. PMID:25954224
Update on the Child's Challenging Behaviour Scale following evaluation using Rasch analysis.

PubMed

Bourke-Taylor, H M; Pallant, J F; Law, M

2014-03-01

The Child's Challenging Behaviour Scale (CCBS) was designed to measure a mother's rating of her child's challenging behaviours. The CCBS was initially developed for mothers of school-aged children with developmental disability and has previously been shown to have good psychometric properties using classical test theory techniques. The aim of this study was to use Rasch analysis to fully evaluate all aspects of the scale, including response format, item fit, dimensionality and targeting. The sample consisted of 152 mothers of a school-aged child (aged 5-18 years) with a disability. Mothers were recruited via websites and mail-out newsletters through not-for-profit organizations that supported families with disabilities. Respondents completed a survey which included the 11 items of the CCBS. Rasch analysis was conducted on these responses using the RUMM2030 package. Rasch analysis of the CCBS revealed serious threshold disordering for nine of the 11 items, suggesting problems with the 5-point response format used for the scale. The neutral midpoint of the response format was subsequently removed to create a 4-point scale. High levels of local dependency were detected among two pairs of items, resulting in the removal of two items (item 7 and item 1). The final nine-item version of the scale (CCBS Version 2) was unidimensional, well targeted, showed good fit to the Rasch model, and strong internal consistency. To achieve fit to the Rasch model it was necessary to make two modifications to the CCBS scale. The resulting nine-item scale with a 4-point response format showed excellent psychometric properties, supporting its internal validity. © 2013 John Wiley & Sons Ltd.
Validity and reliability of short forms of parental-caregiver perception and family impact scale in a Telugu speaking population of India.

PubMed

Kumar, Santhosh; Kroon, Jeroen; Lalloo, Ratilal; Johnson, Newell W

2016-03-01

Parental-Caregiver Perception Questionnaire (P-CPQ) and Family Impact Scale (FIS) are commonly used measures to evaluate the parent's perception of the impact of children's oral health on quality of life and family respectively. Recently, shorter forms of P-CPQ and FIS have been developed. No study has sought to validate these short forms in other languages and cultures. This study aimed to evaluate the validity and reliability of FIS, 8 and 16-item P-CPQ in a Telugu speaking population of India. For this cross-sectional study, a multi-stage random sampling technique was used to recruit 11-13 year-old schoolchildren of Medak district, Telangana, India and their parents (n = 1342). Parents were approached with questionnaires through their children who underwent clinical examinations for dental caries, fluorosis and malocclusion. The translated versions underwent pilot testing (n = 40), test-retest reliability was also assessed (n = 161). The overall summary scale and subscales of the short forms of P-CPQ and FIS failed to discriminate between the categories of dental caries severity. Also, malocclusion status was not related to the domain or overall scores of both the short forms of P-CPQ. There were significant differences in subscale and overall scores of 16 and 8-item P-CPQ and FIS between the fluorosis categories. Both 16 and 8-item P-CPQ summary scales were significantly related to parent's global rating of oral health (16-item, r = 0.30, p < 0.01; 8-item, r = 0.28, p < 0.01) and overall wellbeing (16-item, r = 0.22, p < 0.01; 8-item, r = 0.22, p < 0.01), thereby exhibiting good construct validity. However, the correlation of emotional and social wellbeing scales of short forms of P-CPQ and FIS with global ratings was of low strength. Cronbach's alphas for FIS, 16-items and 8-items P-CPQ scales were 0.78, 0.83 and 0.71 respectively, while the Intra-Class Correlation coefficients were 0.752, 0.812 and 0.816 respectively. Cronbach's alphas for most of the subscales of short forms of P-CPQ were less than 0.7. The overall scales of 16 and 8-items P-CPQ scales demonstrated good construct validity while the construct validity of FIS was questionable. Discriminant validity of all the three instruments was good only in relation to fluorosis. Overall scales of all three short forms exhibited acceptable internal consistency and reliability on repeated administrations.

Communication about patient pain in primary care: development of the Physician-Patient Communication about Pain scale (PCAP).

PubMed

Haskard-Zolnierek, Kelly B

2012-01-01

This paper describes the development of the 47-item Physician-Patient Communication about Pain (PCAP) scale for use with audiotaped medical visit interactions. Patient pain was assessed with the Medical Outcomes Study SF-36 Bodily Pain Scale. Four raters assessed 181 audiotaped patient interactions with 68 physicians. Descriptive statistics of PCAP items were computed. Principal components analyses with 20 scale items were used to reduce the scale to composite variables for analyses. Validity was assessed through (1) comparing PCAP composite scores for patients with high versus low pain and (2) correlating PCAP composites with a separate communication rating scale. Principal components analyses yielded four physician and five patient communication composites (mean alpha=.77). Some evidence for concurrent validity was provided (5 of 18 correlations with communication validation rating scale were significant). Paired-sample t tests showed significant differences for 4 patient PCAP composites, showing the PCAP scale discriminates between high and low pain patients' communication. The PCAP scale shows partial evidence of reliability and two forms of validity. More research with this scale (developing more reliable and valid composites) is needed to extend these preliminary findings before this scale is applicable for use in practice. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Development and Validation of a Rating Scale for Wind Jazz Improvisation Performance

ERIC Educational Resources Information Center

Smith, Derek T.

2009-01-01

The purpose of this study was to construct and validate a rating scale for collegiate wind jazz improvisation performance. The 14-item Wind Jazz Improvisation Evaluation Scale (WJIES) was constructed and refined through a facet-rational approach to scale development. Five wind jazz students and one professional jazz educator were asked to record…
Detecting Parental Deception Using a Behavior Rating Scale during Assessment of Attention-Deficit/Hyperactivity Disorder: An Experimental Study

ERIC Educational Resources Information Center

Norfolk, Philip A.; Floyd, Randy G.

2016-01-01

It is often assumed that parents completing behavior rating scales during the assessment of attention-deficit/hyperactivity disorder (ADHD) can deliberately manipulate the outcomes of the assessment. To detect these actions, items designed to detect over-reporting or under-reporting of results are sometimes embedded in such rating scales. This…
Creating Abbreviated Rating Scales to Monitor Classroom Inattention-Overactivity, Aggression, and Peer Conflict: Reliability, Validity, and Treatment Sensitivity

ERIC Educational Resources Information Center

Volpe, Robert J.; Gadow, Kenneth D.

2010-01-01

Rating scales developed to measure child emotional and behavioral problems typically are so long as to make their use in progress monitoring impractical in typical school settings. This study examined two methods of selecting items from existing rating scales to create shorter instruments for use in assessing response to intervention. The…
Measuring organisational readiness for patient engagement (MORE): an international online Delphi consensus study.

PubMed

Oostendorp, Linda J M; Durand, Marie-Anne; Lloyd, Amy; Elwyn, Glyn

2015-02-14

Widespread implementation of patient engagement by organisations and clinical teams is not a reality yet. The aim of this study is to develop a measure of organisational readiness for patient engagement designed to monitor and facilitate a healthcare organisation's willingness and ability to effectively implement patient engagement in healthcare. The development of the MORE (Measuring Organisational Readiness for patient Engagement) scale was guided by Weiner's theory of organisational readiness for change. Weiner postulates that an organisation's readiness is determined by both the willingness and ability to implement the change (i.e. in this context: patient engagement). A first version of the scale was developed based on a literature search and evaluation of pre-existing tools. We invited multi-disciplinary stakeholders to participate in a two-round online Delphi survey. Respondents were asked to rate the importance of each proposed item, and to comment on the proposed domains and items. Second round participants received feedback from the first round and were asked to re-rate the importance of the revised, new and unchanged items, and to provide comments. The first version of the scale contained 51 items divided into three domains: (1) Respondents' characteristics; (2) the organisation's willingness to implement patient engagement; and (3) the organisation's ability to implement patient engagement. 131 respondents from 16 countries (health care managers, policy makers, clinicians, patients and patient representatives, researchers, and other stakeholders) completed the first survey, and 72 of them also completed the second survey. During the Delphi process, 34 items were reworded, 8 new items were added, 5 items were removed, and 18 were combined. The scale's instructions were revised. The final version of MORE totalled 38 items; 5 on stakeholders, 13 on an organisation's willingness to implement, and 20 on an organisation's ability to implement patient engagement in healthcare. The Delphi technique was successfully used to refine the scale's instructions, domains and items, using input from a broad range of international stakeholders, hoping that MORE can be applied in a variety of healthcare contexts worldwide. Further assessment is needed to determine the psychometric properties of the scale.
Evaluation of the Bess TRS-CA Using the Rasch Rating Scale Model

ERIC Educational Resources Information Center

DiStefano, Christine; Morgan, Grant B.

2010-01-01

This study examined the Behavioral and Emotional Screening System Teacher Rating System for Children and Adolescents (BESS TRS-CA; Kamphaus & Reynolds, 2007) screener using Rasch Rating Scale model (RSM) methodology to provide additional information about psychometric properties of items. Data from the Behavioral Assessment System for Children…
Psychometric Properties of the Children's Depression Inventory: An Item Response Theory Analysis across Age in a Nonclinical, Longitudinal, Adolescent Sample

ERIC Educational Resources Information Center

Lee, Young-Sun; Krishnan, Anita; Park, Yoon Soo

2012-01-01

The purpose of this study was to investigate psychometric properties of the Children's Depression Inventory within a nonclinical and longitudinal sample (8th and 12th grades). Using the Rasch rating scale, most items represented one dimension. There was adequate separation among items and no overlap between ranges of item difficulties with latent…
Estimating Ordinal Reliability for Likert-Type and Ordinal Item Response Data: A Conceptual, Empirical, and Practical Guide

ERIC Educational Resources Information Center

Gadermann, Anne M.; Guhn, Martin; Zumbo, Bruno D.

2012-01-01

This paper provides a conceptual, empirical, and practical guide for estimating ordinal reliability coefficients for ordinal item response data (also referred to as Likert, Likert-type, ordered categorical, or rating scale item responses). Conventionally, reliability coefficients, such as Cronbach's alpha, are calculated using a Pearson…
How IRT Can Solve Problems of Ipsative Data in Forced-Choice Questionnaires

ERIC Educational Resources Information Center

Brown, Anna; Maydeu-Olivares, Alberto

2013-01-01

In multidimensional forced-choice (MFC) questionnaires, items measuring different attributes are presented in blocks, and participants have to rank order the items within each block (fully or partially). Such comparative formats can reduce the impact of numerous response biases often affecting single-stimulus items (aka rating or Likert scales).…
Impact of Rating Scale Categories on Reliability and Fit Statistics of the Malay Spiritual Well-Being Scale using Rasch Analysis.

PubMed

Daher, Aqil Mohammad; Ahmad, Syed Hassan; Winn, Than; Selamat, Mohd Ikhsan

2015-01-01

Few studies have employed the item response theory in examining reliability. We conducted this study to examine the effect of Rating Scale Categories (RSCs) on the reliability and fit statistics of the Malay Spiritual Well-Being Scale, employing the Rasch model. The Malay Spiritual Well-Being Scale (SWBS) with the original six; three and four newly structured RSCs was distributed randomly among three different samples of 50 participants each. The mean age of respondents in the three samples ranged between 36 and 39 years old. The majority was female in all samples, and Islam was the most prevalent religion among the respondents. The predominating race was Malay, followed by Chinese and Indian. The original six RSCs indicated better targeting of 0.99 and smallest model error of 0.24. The Infit Mnsq (mean square) and Zstd (Z standard) of the six RSCs were "1.1"and "-0.1"respectively. The six RSCs achieved the highest person and item reliabilities of 0.86 and 0.85 respectively. These reliabilities yielded the highest person (2.46) and item (2.38) separation indices compared to other the RSCs. The person and item reliability and, to a lesser extent, the fit statistics, were better with the six RSCs compared to the four and three RSCs.
ABILHAND-Kids: a measure of manual ability in children with cerebral palsy.

PubMed

Arnould, Carlyne; Penta, Massimo; Renders, Anne; Thonnard, Jean-Louis

2004-09-28

To develop a clinical tool for measuring manual ability (ABILHAND-Kids) in children with cerebral palsy (CP) using the Rasch measurement model. The authors developed a 74-item questionnaire based on existing scales and experts' advice. The questionnaire was submitted to 113 children with CP (59% boys; mean age, 10 years) without major intellectual deficits (IQ > 60) and to their parents, and resubmitted to both groups after 1 month. The children's and parents' responses were analyzed separately with the WINSTEPS Rasch software to select items presenting an ordered rating scale, sharing the same discrimination, and fitting a unidimensional scale. The final ABILHAND-Kids scale consisted of 21 mostly bimanual items rated by the parents. The parents reported a finer perception of their children's ability than the children themselves, leading to a wider range of measurement, a higher reliability (R = 0.94), and a good reproducibility over time (R = 0.91). The item difficulty hierarchy was consistent between the parents and the experts. The ABILHAND-kids measures are significantly related to school education, type of CP, and gross motor function. ABILHAND-Kids is a functional scale specifically developed to measure manual ability in children with CP providing guidelines for goal setting in treatment planning. Its range and measurement precision are appropriate for clinical practice.
[Development and evaluation of the reliability and validity of an empowerment scale for health promotion volunteers].

PubMed

Koyama, Utako; Murayama, Nobuko

2011-08-01

This qualitative and quantitative research was conducted to develop an empowerment scale for health promotion volunteers (hereinafter referred to as the ESFHPV), key persons responsible for creating healthy communities. A focus group interview was conducted with four groups of health promotion volunteers from two cities in S Public Health Center of N Prefecture. A qualitative analysis was employed and a 32-item draft scale was created. The reliability and validity of this scale were then evaluated using quantitative methods. A questionnaire survey was conducted in 2009 for all 660 health promotion volunteers across the 2 cities. Of 401 respondents (response rate, 60.8%), 356 (53.9%) provided valid responses and were thus included in the analysis. 1) Internal consistency was confirmed by item-total correlation analysis (I-T analysis), assessment of Cronbach's coefficient alpha for all except one item and good-poor analysis (G-P analysis). Four items were excluded from the 32-item draft scale because of correlation coefficients more than 0.7, leaving 28 items for analysis. 2) Based on the results obtained from the factor analysis performed on the 28 provisional empowerment questions, 28 items were chosen for inclusion in the ESFHPV. These items consisted of four sub-scales, namely 'activity for healthy community' (10 items), 'intention for solving health problems of the community' (10 items), 'democratic organization activity' (four items) and 'growth as individual health promotion volunteers' (four items). 3) The Cronbach's coefficient alpha for the ESFHPV and its four sub-scales were 0.93, 0.88, 0.89, 0.84 and 0.79 respectively. The coefficients of I-T analysis were between 0.33 and 0.69. 4) The health promotion volunteers who attended other community activities demonstrated significantly high scores for the ESFHPV and the four sub-scales. Persons who were above 60 years, had a longer duration of activity as a health promotion volunteer and were housewives showed significantly high scores on the first sub-scale, 'growth as individual health promotion volunteers' To measure the empowerment levels of health promotion volunteers, a 28-item scale was developed and its reliability and validity were confirmed. Health promotion volunteers as well as the public health nurses who assist them can use this scale to assess the empowerment levels of other health promotion volunteers.
Stress and depression scales in aphasia: relation between the aphasia depression rating scale, stroke aphasia depression questionnaire-10, and the perceived stress scale.

PubMed

Laures-Gore, Jacqueline S; Farina, Matthew; Moore, Elliot; Russell, Scott

2017-03-01

Assessment and diagnosis of post-stroke depression (PSD) among patients with aphasia presents unique challenges. A gold standard assessment of PSD among this population has yet to be identified. The first aim was to investigate the association between two depression scales developed for assessing depressive symptoms among patients with aphasia. The second aim was to evaluate the relation between these scales and a measure of perceived stress. Twenty-five (16 male; 9 female) individuals with history of left hemisphere cerebrovascular accident (CVA) were assessed for depression and perceived stress using the Stroke Aphasic Depression Questionnaire-10 (SADQ-10), the Aphasia Depression Rating Scale (ADRS), and the Perceived Stress Scale (PSS). SADQ-10 and ADRS ratings were strongly correlated with each other (r = 0.708, p < 0.001). SADQ-10 ratings were strongly correlated with PSS ratings (r = 0.620, p = 0.003), while ADRS ratings were moderately correlated (r = 0.492, p = 0.027). Item analysis of each scale identified items which increased both inter-scale correlation and intra-scale consistency when excluded. The SADQ-10 and ADRS appear to be acceptable measures of depressive symptoms in aphasia patients. Measurements of perceived stress may also be an important factor in assessment of depressive symptoms.
Psychometric evaluation of the PainCAS Interference with Daily Activities, Psychological/Emotional Distress, and Pain scales.

PubMed

McCaffrey, Stacey A; Black, Ryan A; Butler, Stephen F

2018-03-01

The PainCAS is a web-based clinical tool for assessing and tracking pain and opioid risk in chronic pain patients. Despite evidence for its utility within the clinical setting, the PainCAS scales have never been subject to psychometric evaluation. The current study is the first to evaluate the psychometric properties of the PainCAS Interference with Daily Activities, Psychological/Emotional Distress, and Pain scales. Patients (N = 4797) from treatment centers and hospitals in 16 different states completed the PainCAS as part of routine clinical assessment. A subsample (n = 73) from two hospital-based treatment centers also completed comparator measures. Rasch Rating Scale Models were employed to evaluate the Interference with Daily Activities and Psychological/Emotional Distress scales, and empirical evaluation included assessment of dimensionality, discrimination, item fit, reliability, information, and person-to-item targeting. Additionally, convergent and discriminant validity were evaluated through classical test theory approaches. Convergent validity of the Pain scales was evaluated through correlations with corresponding comparator items. One Interference with Daily Activities item was removed due to poor functioning and discrimination. The retained items from the Interference with Daily Activities and Psychological/Emotional Distress scales conformed to unidimensional Rasch measurement models, yielding satisfactory item fit, reliability, precision, and coverage. Further, results provided support for the convergent and discriminant validity of these two scales. Convergent validity between the PainCAS Pain and BPI Pain items was also strong. Taken together, results provide strong psychometric support for these PainCAS Pain scales. Strengths and limitations of the current study are discussed.
The Therapeutic Environment Screening Survey for Nursing Homes (TESS-NH): an observational instrument for assessing the physical environment of institutional settings for persons with dementia.

PubMed

Sloane, Philip D; Mitchell, C Madeline; Weisman, Gerald; Zimmerman, Sheryl; Foley, Kristie M Long; Lynn, Mary; Calkins, Margaret; Lawton, M Powell; Teresi, Jeanne; Grant, Leslie; Lindeman, David; Montgomery, Rhonda

2002-03-01

To develop an observational instrument that describes the ability of physical environments of institutional settings to address therapeutic goals for persons with dementia. A National Institute on Aging workgroup identified and subsequently revised items that evaluated exit control, maintenance, cleanliness, safety, orientation/cueing, privacy, unit autonomy, outdoor access, lighting, noise, visual/tactile stimulation, space/seating, and familiarity/homelikeness. The final instrument contains 84 discrete items and one global rating. A summary scale, the Special Care Unit Environmental Quality Scale (SCUEQS), consists of 18 items. Lighting items were validated using portable light meters. Concurrent criterion validation compared SCUEQS scores with the Professional Environmental Assessment Protocol (PEAP). Interrater kappa statistics for 74% of items were above.60. For another 10% of items, kappas could not be calculated due to empty cells, but interrater agreement was above 80%. The SCUEQS demonstrated an interrater reliability of.93, a test--retest reliability of.88, and an internal consistency of.81--.83. Light meter ratings correlated significantly with the Therapeutic Environment Screening Survey for Nursing Homes (TESS-NH) lighting items (r =.29--.38, p =.01--.04), and the SCUEQS correlated significantly with global PEAP ratings (r =.52, p <.01). The TESS-NH efficiently assesses discrete elements of the physical environment and has strong reliability and validity. The SCUEQS provides a quantitative measure of environmental quality in institutional settings.
The Modified Abbreviated Math Anxiety Scale: A Valid and Reliable Instrument for Use with Children.

PubMed

Carey, Emma; Hill, Francesca; Devine, Amy; Szűcs, Dénes

2017-01-01

Mathematics anxiety (MA) can be observed in children from primary school age into the teenage years and adulthood, but many MA rating scales are only suitable for use with adults or older adolescents. We have adapted one such rating scale, the Abbreviated Math Anxiety Scale (AMAS), to be used with British children aged 8-13. In this study, we assess the scale's reliability, factor structure, and divergent validity. The modified AMAS (mAMAS) was administered to a very large ( n = 1746) cohort of British children and adolescents. This large sample size meant that as well as conducting confirmatory factor analysis on the scale itself, we were also able to split the sample to conduct exploratory and confirmatory factor analysis of items from the mAMAS alongside items from child test anxiety and general anxiety rating scales. Factor analysis of the mAMAS confirmed that it has the same underlying factor structure as the original AMAS, with subscales measuring anxiety about Learning and Evaluation in math. Furthermore, both exploratory and confirmatory factor analysis of the mAMAS alongside scales measuring test anxiety and general anxiety showed that mAMAS items cluster onto one factor (perceived to represent MA). The mAMAS provides a valid and reliable scale for measuring MA in children and adolescents, from a younger age than is possible with the original AMAS. Results from this study also suggest that MA is truly a unique construct, separate from both test anxiety and general anxiety, even in childhood.
Development and content validation of performance assessments for endoscopic third ventriculostomy.

PubMed

Breimer, Gerben E; Haji, Faizal A; Hoving, Eelco W; Drake, James M

2015-08-01

This study aims to develop and establish the content validity of multiple expert rating instruments to assess performance in endoscopic third ventriculostomy (ETV), collectively called the Neuro-Endoscopic Ventriculostomy Assessment Tool (NEVAT). The important aspects of ETV were identified through a review of current literature, ETV videos, and discussion with neurosurgeons, fellows, and residents. Three assessment measures were subsequently developed: a procedure-specific checklist (CL), a CL of surgical errors, and a global rating scale (GRS). Neurosurgeons from various countries, all identified as experts in ETV, were then invited to participate in a modified Delphi survey to establish the content validity of these instruments. In each Delphi round, experts rated their agreement including each procedural step, error, and GRS item in the respective instruments on a 5-point Likert scale. Seventeen experts agreed to participate in the study and completed all Delphi rounds. After item generation, a total of 27 procedural CL items, 26 error CL items, and 9 GRS items were posed to Delphi panelists for rating. An additional 17 procedural CL items, 12 error CL items, and 1 GRS item were added by panelists. After three rounds, strong consensus (>80% agreement) was achieved on 35 procedural CL items, 29 error CL items, and 10 GRS items. Moderate consensus (50-80% agreement) was achieved on an additional 7 procedural CL items and 1 error CL item. The final procedural and error checklist contained 42 and 30 items, respectively (divided into setup, exposure, navigation, ventriculostomy, and closure). The final GRS contained 10 items. We have established the content validity of three ETV assessment measures by iterative consensus of an international expert panel. Each measure provides unique assessment information and thus can be used individually or in combination, depending on the characteristics of the learner and the purpose of the assessment. These instruments must now be evaluated in both the simulated and operative settings, to determine their construct validity and reliability. Ultimately, the measures contained in the NEVAT may prove suitable for formative assessment during ETV training and potentially as summative assessment measures during certification.
Rasch analysis of the participation scale (P-scale): usefulness of the P-scale to a rehabilitation services network.

PubMed

Souza, Mariana Angélica Peixoto; Coster, Wendy Jane; Mancini, Marisa Cotta; Dutra, Fabiana Caetano Martins Silva; Kramer, Jessica; Sampaio, Rosana Ferreira

2017-12-08

A person's participation is acknowledged as an important outcome of the rehabilitation process. The Participation Scale (P-Scale) is an instrument that was designed to assess the participation of individuals with a health condition or disability. The scale was developed in an effort to better describe the participation of people living in middle-income and low-income countries. The aim of this study was to use Rasch analysis to examine whether the Participation Scale is suitable to assess the perceived ability to take part in participation situations by patients with diverse levels of function. The sample was comprised by 302 patients from a public rehabilitation services network. Participants had orthopaedic or neurological health conditions, were at least 18 years old, and completed the Participation Scale. Rasch analysis was conducted using the Winsteps software. The mean age of all participants was 45.5 years (standard deviation = 14.4), 52% were male, 86% had orthopaedic conditions, and 52% had chronic symptoms. Rasch analysis was performed using a dichotomous rating scale, and only one item showed misfit. Dimensionality analysis supported the existence of only one Rasch dimension. The person separation index was 1.51, and the item separation index was 6.38. Items N2 and N14 showed Differential Item Functioning between men and women. Items N6 and N12 showed Differential Item Functioning between acute and chronic conditions. The item difficulty range was -1.78 to 2.09 logits, while the sample ability range was -2.41 to 4.61 logits. The P-Scale was found to be useful as a screening tool for participation problems reported by patients in a rehabilitation context, despite some issues that should be addressed to further improve the scale.
Developing a scale to measure "attachment to the local community" in late middle aged individuals.

PubMed

Sakai, Taichi; Omori, Junko; Takahashi, Kazuko; Mitsumori, Yasuko; Kobayashi, Maasa; Ono, Wakanako; Miyazaki, Toshie; Anzai, Hitomi; Saito, Mika

2016-01-01

Objectives This study was conducted to develop a scale for measuring "attachment to the local community" for its use in health services. The scale is also intended to nurture new social relationships in late middle-aged individuals.Methods Thirty items were initially planned to be included in the scale to measure "attachment to the local community", according to a previous study that identified the concept. The study subjects were late middle-aged residents of City B in Prefecture A, located in Tokyo suburbs. From the basic resident register data, 1,000 individuals (local residents in the 50-69 year age group) were selected by a multi-stage random sampling technique, on the basis of their residential area, age, and sex (while maintaining the male to female ratio). An unsigned self-administered questionnaire was distributed to the subjects, and the responses were collected by postal mail. The collected data was analyzed using psychometric study of scale.Results Valid responses were obtained from 583 subjects, and the response rate was 58.3%. In an item analysis, none of the items were rejected. In a subsequent factor analysis, 7 items were eliminated. These items included 2 items with a factor loading of <0.40, 3 items loading on multiple factors and showing a factor loading of ≥0.40, and 2 items with a low factor correlation (0.04-0.16). These items included factors that related to only these 2 items. Consequently, 23 items in the following 4-factor structure were selected as the scale items: "Source of vitality to live life," "Intention to cherish ties with people," "Place where one can be oneself," and "Pride of being a resident." Cronbach's coefficient α for the entire scale of "attachment to the local community" was 0.95, demonstrating internal consistency. We then examined the correlation with an existing scale to measure social support; the results revealed a statistically significant correlation and confirmed criterion-related validity (P<0.001). In addition, the fit indices in a covariance structure analysis showed adequate values.Conclusions The developed scale was considered reliable and appropriate for measuring "attachment to the local community."
The Role of Water Consumption on Consumption of the Ration, Cold Weather

DTIC Science & Technology

1989-02-22

like slightly) on the scale. The items with the highest ratings were mostly in the sweet dessert category, including the fig bar (8.1), the blueberry...higher percentages of the sweet dessert items. Table 8 Mean 4cceptance of RCW Items Item: Hedonic Rating* Previous Ratinga Oatmeal (Apple & Cinnamon) 6.7...4.3 5.4 Ease of Heating 3.9 --- Taste of Food 5.1 6.2 Appearance of Food 4.6 6.3 Amount of Food per Daily Pack 3.6 4.6 Variety per Daily Pack 4.3 5.3

THE BRIEF PSYCHIATRIC RATING SCALE IN POSITIVE AND NEGATIVE SUBTYPES OF SCHIZOPHRENIA

PubMed Central

Kulhara, P.; Mattoo, S.K.; Avasthi, A.; Malhotra, A.

1987-01-01

SUMMARY Usefulness of the Brief Psychiatric Rating Scale (BPRS) in distinguishing positive and negative subtypes of schizophrenia is presented. Ninety five schizophrenic patients were assessed on BPRS. Significant differences emerged between positive and negative subtypes of schizophrenia on items like emotional withdrawal, guilt feelings, tension, hallucinatory behaviour, motor retardation, blunted affect and excitement. Discriminant function equation generated by these items had a high rate of prediction of group membership either to positive or negative schizophrenia group. Principal components analysis of BPRS scores yielded factors which favour categorization of patients in positive, negative subtypes. The study provides support for classification of schizophrenia into these subtypes. PMID:21927241
Methods for estimating comparable prevalence rates of food insecurity experienced by adults in 147 countries and areas

NASA Astrophysics Data System (ADS)

Nord, Mark; Cafiero, Carlo; Viviani, Sara

2016-11-01

Statistical methods based on item response theory are applied to experiential food insecurity survey data from 147 countries, areas, and territories to assess data quality and develop methods to estimate national prevalence rates of moderate and severe food insecurity at equal levels of severity across countries. Data were collected from nationally representative samples of 1,000 adults in each country. A Rasch-model-based scale was estimated for each country, and data were assessed for consistency with model assumptions. A global reference scale was calculated based on item parameters from all countries. Each country's scale was adjusted to the global standard, allowing for up to 3 of the 8 scale items to be considered unique in that country if their deviance from the global standard exceeded a set tolerance. With very few exceptions, data from all countries were sufficiently consistent with model assumptions to constitute reasonably reliable measures of food insecurity and were adjustable to the global standard with fair confidence. National prevalence rates of moderate-or-severe food insecurity assessed over a 12-month recall period ranged from 3 percent to 92 percent. The correlations of national prevalence rates with national income, health, and well-being indicators provide external validation of the food security measure.
Comparison of Rating Scales in the Development of Patient-Reported Outcome Measures for Children with Eye Disorders.

PubMed

Hatt, Sarah R; Leske, David A; Wernimont, Suzanne M; Birch, Eileen E; Holmes, Jonathan M

2017-03-01

A rating scale is a critical component of patient-reported outcome instrument design, but the optimal rating scale format for pediatric use has not been investigated. We compared rating scale performance when administering potential questionnaire items to children with eye disorders and their parents. Three commonly used rating scales were evaluated: frequency (never, sometimes, often, always), severity (not at all, a little, some, a lot), and difficulty (not difficult, a little difficult, difficult, very difficult). Ten patient-derived items were formatted for each rating scale, and rating scale testing order was randomized. Both child and parent were asked to comment on any problems with, or a preference for, a particular scale. Any confusion about options or inability to answer was recorded. Twenty-one children, aged 5-17 years, with strabismus, amblyopia, or refractive error were recruited, each with one of their parents. Of the first 10 children, 4 (40%) had problems using the difficulty scale, compared with 1 (10%) using frequency, and none using severity. The difficulty scale was modified, replacing the word "difficult" with "hard." Eleven additional children (plus parents) then completed all 3 questionnaires. No children had problems using any scale. Four (36%) parents had problems using the difficulty ("hard") scale and 1 (9%) with frequency. Regarding preference, 6 (55%) of 11 children and 5 (50%) of 10 parents preferred using the frequency scale. Children and parents found the frequency scale and question format to be the most easily understood. Children and parents also expressed preference for the frequency scale, compared with the difficulty and severity scales. We recommend frequency rating scales for patient-reported outcome measures in pediatric populations.
Aging gauge

DOEpatents

Betts, Robert E.; Crawford, John F.

1989-04-04

An aging gauge comprising a container having a fixed or a variable sized t opening with a cap which can be opened to control the sublimation rate of a thermally sublimational material contained within the container. In use, the aging gauge is stored with an item to determine total heat the item is subjected to and also the maximum temperature to which the item has been exposed. The aging gauge container contains a thermally sublimational material such as naphthalene or similar material which has a low sublimation rate over the temperature range from about 70.degree. F. to about 160.degree. F. The aging products determined by analyses of a like item aged along with the aging gauge for which the sublimation amount is determined is employed to establish a calibration curve for future aging evaluation. The aging gauge is provided with a means for determining the maximum temperature exposure (i.e., a thermally indicating material which gives an irreversible color change, Thermocolor pigment). Because of the relationship of doubling reaction rates for increases of 10.degree. C., equivalency of item used in accelerated aging evaluation can be obtained by referring to a calibration curve depicting storage temperature on the abscissa scale and multiplier on the ordinate scale.
Aging gauge

DOEpatents

Betts, Robert E.; Crawford, John F.

1989-01-01

An aging gauge comprising a container having a fixed or a variable sized t opening with a cap which can be opened to control the sublimation rate of a thermally sublimational material contained within the container. In use, the aging gauge is stored with an item to determine total heat the item is subjected to and also the maximum temperature to which the item has been exposed. The aging gauge container contains a thermally sublimational material such as naphthalene or similar material which has a low sublimation rate over the temperature range from about 70.degree. F. to about 160.degree. F. The aging products determined by analyses of a like item aged along with the aging gauge for which the sublimation amount is determined is employed to establish a calibration curve for future aging evaluation. The aging gauge is provided with a means for determining the maximum temperature exposure (i.e., a thermally indicating material which gives an irreversible color change, Thermocolor pigment). Because of the relationship of doubling reaction rates for increases of 10.degree. C., equivalency of item used in accelerated aging evaluation can be obtained by referring to a calibration curve depicting storage temperature on the abscissa scale and multiplier on the ordinate scale.
Item-Level Psychometrics of the Glasgow Outcome Scale: Extended Structured Interviews.

PubMed

Hong, Ickpyo; Li, Chih-Ying; Velozo, Craig A

2016-04-01

The Glasgow Outcome Scale-Extended (GOSE) structured interview captures critical components of activities and participation, including home, shopping, work, leisure, and family/friend relationships. Eighty-nine community dwelling adults with mild-moderate traumatic brain injury (TBI) were recruited (average = 2.7 year post injury). Nine items of the 19 items were used for the psychometrics analysis purpose. Factor analysis and item-level psychometrics were investigated using the Rasch partial-credit model. Although the principal components analysis of residuals suggests that a single measurement factor dominates the measure, the instrument did not meet the factor analysis criteria. Five items met the rating scale criteria. Eight items fit the Rasch model. The instrument demonstrated low person reliability (0.63), low person strata (2.07), and a slight ceiling effect. The GOSE demonstrated limitations in precisely measuring activities/participation for individuals after TBI. Future studies should examine the impact of the low precision of the GOSE on effect size. © The Author(s) 2016.
The Standardization of the Clock Drawing Test (CDT) for People with Stroke Using Rasch Analysis

PubMed Central

Yoo, Doo Han; Hong, Deok Gi; Lee, Jae Shin

2014-01-01

[Purpose] The aim of this study was to standardize the clock drawing test (CDT) for people with stroke using Rasch analysis. [Subjects and Methods] Seventeen items of the CDT identified through a literature review were performed by 159 stroke patients. The data was analyzed with Winstep version 3.57 using the Rasch model to examine the unidimensionality of the items’ fit, the distribution of the items’ difficulty, and the reliability and appropriateness of the rating scale. [Result] Ten out of the 159 participations (6.2%) were considered misfit subjects, and one item of the CDT was determined to be a misfit item based on Rasch analysis. The rating scales were judged as suitable because the observed average showed an array of vertical orders and MNSQ values < 2. The separate index and reliability of the subject (1.98, 0.80) and item (6.45, 0.97) showed relatively high values. [Conclusion] This study is the first to examine the CDT scale in stroke patients by Rasch analysis. The CDT is expected to be useful for screening stroke patients with cognitive problems. PMID:24409026
Test Review: Michael H. Epstein and Lori Synhorst "Preschool Behavioral and Emotional Rating Scale" Austin, TX--PRO-ED, 2009

ERIC Educational Resources Information Center

Drevon, Daniel D.

2011-01-01

This article presents a review of the "Preschool Behavioral and Emotional Rating Scale" (PreBERS), a 42-item family member--or school personnel--completed rating scale designed to measure the behavioral and emotional strengths of preschool children ages 3-0 to 5-11. According to the manual, results can be used to identify preschoolers with limited…
The Influence of Alternative Scale Formats on the Generalizability of Data Obtained from Direct Behavior Rating Single-Item Scales (DBR-SIS)

ERIC Educational Resources Information Center

Briesch, Amy M.; Kilgus, Stephen P.; Chafouleas, Sandra M.; Riley-Tillman, T. Chris; Christ, Theodore J.

2013-01-01

The current study served to extend previous research on scaling construction of Direct Behavior Rating (DBR) in order to explore the potential flexibility of DBR to fit various intervention contexts. One hundred ninety-eight undergraduate students viewed the same classroom footage but rated student behavior using one of eight randomly assigned…
Assessing depression outcome in patients with moderate dementia: sensitivity of the HoNOS65+ scale.

PubMed

Canuto, Alessandra; Rudhard-Thomazic, Valérie; Herrmann, François R; Delaloye, Christophe; Giannakopoulos, Panteleimon; Weber, Kerstin

2009-08-15

To date, there is no widely accepted clinical scale to monitor the evolution of depressive symptoms in demented patients. We assessed the sensitivity to treatment of a validated French version of the Health of the Nation Outcome Scale (HoNOS) 65+ compared to five routinely used scales. Thirty elderly inpatients with ICD-10 diagnosis of dementia and depression were evaluated at admission and discharge using paired t-test. Using the Brief Psychiatric Rating Scale (BPRS) "depressive mood" item as gold standard, a receiver operating characteristic curve (ROC) analysis assessed the validity of HoNOS65+F "depressive symptoms" item score changes. Unlike Geriatric Depression Scale, Mini Mental State Examination and Activities of Daily Living scores, BPRS scores decreased and Global Assessment Functioning Scale score increased significantly from admission to discharge. Amongst HoNOS65+F items, "behavioural disturbance", "depressive symptoms", "activities of daily life" and "drug management" items showed highly significant changes between the first and last day of hospitalization. The ROC analysis revealed that changes in the HoNOS65+F "depressive symptoms" item correctly classified 93% of the cases with good sensitivity (0.95) and specificity (0.88) values. These data suggest that the HoNOS65+F "depressive symptoms" item may provide a valid assessment of the evolution of depressive symptoms in demented patients.
Validation of an Empathy Scale in Pharmacy and Nursing Students

PubMed Central

Chen, Aleda M. H.; Yehle, Karen S.; Plake, Kimberly S.

2013-01-01

Objective. To validate an empathy scale to measure empathy in pharmacy and nursing students. Methods. A 15-item instrument comprised of the cognitive and affective empathy domains, was created. Each item was rated using a 7-point Likert scale, ranging from strongly disagree to strongly agree. Concurrent validity was demonstrated with the Jefferson Scale of Empathy – Health Professional Students (JSE-HPS). Results. Reliability analysis of data from 216 students (pharmacy, N=158; nursing, N=58) showed that scores on the empathy scale were positively associated with JSE-HPS scores (p<0.001). Factor analysis confirmed that 14 of the 15 items were significantly associated with their respective domain, but the overall instrument had limited goodness of fit. Conclusions. Results of this study demonstrate the reliability and validity of a new scale for evaluating student empathy. Further testing of the scale at other universities is needed to establish validity. PMID:23788805
[The application of diminished criminal responsibility rating scale to mental retardation offenders].

PubMed

Guan, Wei; Cai, Wei-Xiong; Huang, Fu-Yin; Wu, Jia-Sheng

2009-10-01

To explore the application of Diminished Criminal Responsibility Rating Scale (DCRRS) to mental retardation offenders. The DCRRS was used to 121 cases of mental retardation offenders who were divided into three groups according to the degree of their diminished criminal responsibility. There were significant differences in rating score among the three groups (mild group 22.12+/-4.69, moderate group 25.50+/-5.48, major group 27.59+/-5.69), and 17 items had good correlation with the total score of the scale with the correlation coefficient from 0.289 to 0.665. Six factors were extracted by the factor analysis, and 69.392% variation could be explained. The DCRRS has rational items, its total score could show the difference among the three degree diminished criminal responsibility of mental retardation offenders.
Rasch analysis of the carers quality of life questionnaire for parkinsonism.

PubMed

Pillas, Marios; Selai, Caroline; Schrag, Anette

2017-03-01

To assess the psychometric properties of the Carers Quality of Life Questionnaire for Parkinsonism using a Rasch modeling approach and determine the optimal cut-off score. We performed a Rasch analysis of the survey answers of 430 carers of patients with atypical parkinsonism. All of the scale items demonstrated acceptable goodness of fit to the Rasch model. The scale was unidimensional and no notable differential item functioning was detected in the items regarding age and disease type. Rating categories were functioning adequately in all scale items. The scale had high reliability (.95) and construct validity and a high degree of precision, distinguishing between 5 distinct groups of carers with different levels of quality of life. A cut-off score of 62 was found to have the optimal screening accuracy based on Hospital Anxiety and Depression Scale subscores. The results suggest that the Carers Quality of Life Questionnaire for Parkinsonism is a useful scale to assess carers' quality of life and allows analyses requiring interval scaling of variables. © 2016 International Parkinson and Movement Disorder Society. © 2016 International Parkinson and Movement Disorder Society.
The initial development of the WebMedQual scale: domain assessment of the construct of quality of health web sites.

PubMed

Provost, Mélanie; Koompalum, Dayin; Dong, Diane; Martin, Bradley C

2006-01-01

To develop a comprehensive instrument assessing quality of health-related web sites. Phase I consisted of a literature review to identify constructs thought to indicate web site quality and to identify items. During content analysis, duplicate items were eliminated and items that were not clear, meaningful, or measurable were reworded or removed. Some items were generated by the authors. Phase II: a panel consisting of six healthcare and MIS reviewers was convened to assess each item for its relevance and importance to the construct and to assess item clarity and measurement feasibility. Three hundred and eighty-four items were generated from 26 sources. The initial content analysis reduced the scale to 104 items. Four of the six expert reviewers responded; high concordance on the relevance, importance and measurement feasibility of each item was observed: 3 out of 4, or all raters agreed on 76-85% of items. Based on the panel ratings, 9 items were removed, 3 added, and 10 revised. The WebMedQual consists of 8 categories, 8 sub-categories, 95 items and 3 supplemental items to assess web site quality. The constructs are: content (19 items), authority of source (18 items), design (19 items), accessibility and availability (6 items), links (4 items), user support (9 items), confidentiality and privacy (17 items), e-commerce (6 items). The "WebMedQual" represents a first step toward a comprehensive and standard quality assessment of health web sites. This scale will allow relatively easy assessment of quality with possible numeric scoring.
A Scale for Rating Fire-Prevention Contactors

Treesearch

M.L. Doolittle

1979-01-01

A scale is constructed to help fire-prevention program administrators determine if an individual contactor is effective at influencing people. The 24 items in the scale indicate the qualities that an effective contactor should have.
Development and reliability of a structured interview guide for the Montgomery Asberg Depression Rating Scale (SIGMA).

PubMed

Williams, Janet B W; Kobak, Kenneth A

2008-01-01

The Montgomery-Asberg Depression Rating Scale (MADRS) is often used in clinical trials to select patients and to assess treatment efficacy. The scale was originally published without suggested questions for clinicians to use in gathering the information necessary to rate the items. Structured and semi-structured interview guides have been found to improve reliability with other scales. To describe the development and test-retest reliability of a structured interview guide for the MADRS (SIGMA). A total of 162 test-retest interviews were conducted by 81 rater pairs. Each patient was interviewed twice, once by each rater conducting an independent interview. The intraclass correlation for total score between raters using the SIGMA was r=0.93, P<0.0001. All ten items had good to excellent interrater reliability. Use of the SIGMA can result in high reliability of MADRS scores in evaluating patients with depression.
Reliability and validity of the work and social adjustment scale in phobic disorders.

PubMed

Mataix-Cols, David; Cowley, Amy J; Hankins, Matthew; Schneider, Andreas; Bachofen, Martin; Kenwright, Mark; Gega, Lina; Cameron, Rachel; Marks, Isaac M

2005-01-01

The Work and Social Adjustment Scale (WSAS) is a simple widely used 5-item measure of disability whose psychometric properties need more analysis in phobic disorders. The reliability, factor structure, validity, and sensitivity to change of the WSAS were studied in 205 phobic patients (73 agoraphobia, 62 social phobia, and 70 specific phobia) who participated in various open and randomized trials of self-exposure therapy. Internal consistency of the WSAS was excellent in all phobics pooled and in agoraphobics and social phobics separately. Principal components analysis extracted a single general factor of disability. Specific phobics gave less consistent ratings across WSAS items, suggesting that some items were less relevant to their problem. Internal consistency was marginally higher for self-ratings than clinician ratings of the WSAS. Self-ratings and clinician ratings correlated highly though patients tended to rate themselves as more disabled than clinicians did. WSAS total scores reflected differences in phobic severity and improvement with treatment. The WSAS is a valid, reliable, and change-sensitive measure of work/social and other adjustment in phobic disorders, especially in agoraphobia and social phobia.
Development of life story experience (LSE) scales for migrant dentists in Australia: a sequential qualitative-quantitative study.

PubMed

Balasubramanian, M; Spencer, A J; Short, S D; Watkins, K; Chrisopoulos, S; Brennan, D S

2016-09-01

The integration of qualitative and quantitative approaches introduces new avenues to bridge strengths, and address weaknesses of both methods. To develop measure(s) for migrant dentist experiences in Australia through a mixed methods approach. The sequential qualitative-quantitative design involved first the harvesting of data items from qualitative study, followed by a national survey of migrant dentists in Australia. Statements representing unique experiences in migrant dentists' life stories were deployed the survey questionnaire, using a five-point Likert scale. Factor analysis was used to examine component factors. Eighty-two statements from 51 participants were harvested from the qualitative analysis. A total of 1,022 of 1,977 migrant dentists (response rate 54.5%) returned completed questionnaires. Factor analysis supported an initial eight-factor solution; further scale development and reliability analysis led to five scales with a final list of 38 life story experience (LSE) items. Three scales were based on home country events: health system and general lifestyle concerns (LSE1; 10 items), society and culture (LSE4; 4 items) and career development (LSE5; 4 items). Two scales included migrant experiences in Australia: appreciation towards Australian way of life (LSE2; 13 items) and settlement concerns (LSE3; 7 items). The five life story experience scales provided necessary conceptual clarity and empirical grounding to explore migrant dentist experiences in Australia. Being based on original migrant dentist narrations, these scales have the potential to offer in-depth insights for policy makers and support future research on dentist migration. Copyright© 2016 Dennis Barber Ltd
Students' Beliefs about Mobile Devices vs. Desktop Computers in South Korea and the United States

ERIC Educational Resources Information Center

Sung, Eunmo; Mayer, Richard E.

2012-01-01

College students in the United States and in South Korea completed a 28-item multidimensional scaling (MDS) questionnaire in which they rated the similarity of 28 pairs of multimedia learning materials on a 10-point scale (e.g., narrated animation on a mobile device Vs. movie clip on a desktop computer) and a 56-item semantic differential…
Translation, adaptation and validation of the American short form Patient Activation Measure (PAM13) in a Danish version.

PubMed

Maindal, Helle Terkildsen; Sokolowski, Ineta; Vedsted, Peter

2009-06-29

The Patient Activation Measure (PAM) is a measure that assesses patient knowledge, skill, and confidence for self-management. This study validates the Danish translation of the 13-item Patient Activation Measure (PAM13) in a Danish population with dysglycaemia. 358 people with screen-detected dysglycaemia participating in a primary care health education study responded to PAM13. The PAM13 was translated into Danish by a standardised forward-backward translation. Data quality was assessed by mean, median, item response, missing values, floor and ceiling effects, internal consistency (Cronbach's alpha and average inter-item correlation) and item-rest correlations. Scale properties were assessed by Rasch Rating Scale models. The item response was high with a small number of missing values (0.8-4.2%). Floor effect was small (range 0.6-3.6%), but the ceiling effect was above 15% for all items (range 18.6-62.7%). The alpha-coefficient was 0.89 and the average inter-item correlation 0.38. The Danish version formed a unidimensional, probabilistic Guttman-like scale explaining 43.2% of the variance. We did however, find a different item sequence compared to the original scale. A Danish version of PAM13 with acceptable validity and reliability is now available. Further development should focus on single items, response categories in relation to ceiling effects and further validation of reproducibility and responsiveness.

Rasch Based Analysis of Oral Proficiency Test Data.

ERIC Educational Resources Information Center

Nakamura, Yuji

2001-01-01

This paper examines the rating scale data of oral proficiency tests analyzed by a Rasch Analysis focusing on an item map and factor analysis. In discussing the item map, the difficulty order of six items and students' answering patterns are analyzed using descriptive statistics and measures of central tendency of test scores. The data ranks the…
Exploring the Manifestations of Anxiety in Children with Autism Spectrum Disorders

ERIC Educational Resources Information Center

Hallett, Victoria; Lecavalier, Luc; Sukhodolsky, Denis G.; Cipriano, Noreen; Aman, Michael G.; McCracken, James T.; McDougle, Christopher J.; Tierney, Elaine; King, Bryan H.; Hollander, Eric; Sikich, Linmarie; Bregman, Joel; Anagnostou, Evdokia; Donnelly, Craig; Katsovich, Lily; Dukes, Kimberly; Vitiello, Benedetto; Gadow, Kenneth; Scahill, Lawrence

2013-01-01

This study explores the manifestation and measurement of anxiety symptoms in 415 children with ASDs on a 20-item, parent-rated, DSM-IV referenced anxiety scale. In both high and low-functioning children (IQ above vs. below 70), commonly endorsed items assessed restlessness, tension and sleep difficulties. Items requiring verbal expression of worry…
Family-centred service: differences in what parents of children with cerebral palsy rate important.

PubMed

Terwiel, M; Alsem, M W; Siebes, R C; Bieleman, K; Verhoef, M; Ketelaar, M

2017-09-01

A family-centred approach to services of children with disabilities is widely accepted as the foundational approach to service delivery in paediatric health care. The 56 items of the Measure of Processes of Care questionnaire (MPOC-56) all reflect elements of family-centred service. In this study, we investigated which elements of family-centred service are rated important by parents of children with cerebral palsy by adding a question on importance to each item of the MPOC-56 (MPOC-56-I). In total, 175 parents of children with cerebral palsy completed the MPOC-56-I. For each MPOC item, parents were asked to rate the importance on a 5-point scale ranging from 0 (not important at all) up to and including 4 (very important). We used Spearman's rank correlation coefficient to further explore the variation in parents' importance ratings. Parents' importance ratings of the MPOC-56 items varied. The percentage of parents rating an item important (importance rating 3 or 4) varied between 43.8% and 96.8%. The percentage of parents rating an item unimportant (rating 0 or 1) varied between 0.0% and 20.3%, and the percentage of parents rating an item neutral (rating 2) varied between 3.0% and 36.0%. Most diverse importance ratings were found for five items concerning the provision of general information. Three correlations between these items and child and parent characteristics were found. Six items were rated important by almost all (≥95%) parents. These items concern elements of specific information about the child, co-ordinated and comprehensive care for child and family and enabling and partnership. Parents rate the importance of family-centred services for their situation in various ways. These findings endorse that family-centred services should recognize the uniqueness of families and should be tailored to what parents find important. © 2017 John Wiley & Sons Ltd.
Psychometric properties of responses by clinicians and older adults to a 6-item Hebrew version of the Hamilton Depression Rating Scale (HAM-D6)

PubMed Central

2013-01-01

Background The Hamilton Depression Rating Scale (HAM-D) is commonly used as a screening instrument, as a continuous measure of change in depressive symptoms over time, and as a means to compare the relative efficacy of treatments. Among several abridged versions, the 6-item HAM-D6 is used most widely in large degree because of its good psychometric properties. The current study compares both self-report and clinician-rated versions of the Hebrew version of this scale. Methods A total of 153 Israelis 75 years of age on average participated in this study. The HAM-D6 was examined using confirmatory factor analytic (CFA) models separately for both patient and clinician responses. Results Reponses to the HAM-D6 suggest that this instrument measures a unidimensional construct with each of the scales’ six items contributing significantly to the measurement. Comparisons between self-report and clinician versions indicate that responses do not significantly differ for 4 of the 6 items. Moreover, 100% sensitivity (and 91% specificity) was found between patient HAM-D6 responses and clinician diagnoses of depression. Conclusion These results indicate that the Hebrew HAM-D6 can be used to measure and screen for depressive symptoms among elderly patients. PMID:23281688
Response pattern of depressive symptoms among college students: What lies behind items of the Beck Depression Inventory-II?

PubMed

de Sá Junior, Antonio Reis; de Andrade, Arthur Guerra; Andrade, Laura Helena; Gorenstein, Clarice; Wang, Yuan-Pang

2018-07-01

This study examines the response pattern of depressive symptoms in a nationwide student sample, through item analyses of a rating scale by both classical test theory (CTT) and item response theory (IRT). The 21-item Beck Depression Inventory-II (BDI-II) was administered to 12,711 college students. First, the psychometric properties of the scale were described. Thereafter, the endorsement probability of depressive symptom in each scale item was analyzed through CTT and IRT. Graphical plots depicted the endorsement probability of scale items and intensity of depression. Three items of different difficulty level were compared through CTT and IRT approach. Four in five students reported the presence of depressive symptoms. The BDI-II items presented good reliability and were distributed along the symptomatic continuum of depression. Similarly, in both CTT and IRT approaches, the item 'changes in sleep' was easily endorsed, 'loss of interest' moderately and 'suicidal thoughts' hardly. Graphical representation of BDI-II of both methods showed much equivalence in terms of item discrimination and item difficulty. The item characteristic curve of the IRT method provided informative evaluation of item performance. The inventory was applied only in college students. Depressive symptoms were frequent psychopathological manifestations among college students. The performance of the BDI-II items indicated convergent results from both methods of analysis. While the CTT was easy to understand and to apply, the IRT was more complex to understand and to implement. Comprehensive assessment of the functioning of each BDI-II item might be helpful in efficient detection of depressive conditions in college students. Copyright © 2018 Elsevier B.V. All rights reserved.
Development and validation of the simulation-based learning evaluation scale.

PubMed

Hung, Chang-Chiao; Liu, Hsiu-Chen; Lin, Chun-Chih; Lee, Bih-O

2016-05-01

The instruments that evaluate a student's perception of receiving simulated training are English versions and have not been tested for reliability or validity. The aim of this study was to develop and validate a Chinese version Simulation-Based Learning Evaluation Scale (SBLES). Four stages were conducted to develop and validate the SBLES. First, specific desired competencies were identified according to the National League for Nursing and Taiwan Nursing Accreditation Council core competencies. Next, the initial item pool was comprised of 50 items related to simulation that were drawn from the literature of core competencies. Content validity was established by use of an expert panel. Finally, exploratory factor analysis and confirmatory factor analysis were conducted for construct validity, and Cronbach's coefficient alpha determined the scale's internal consistency reliability. Two hundred and fifty students who had experienced simulation-based learning were invited to participate in this study. Two hundred and twenty-five students completed and returned questionnaires (response rate=90%). Six items were deleted from the initial item pool and one was added after an expert panel review. Exploratory factor analysis with varimax rotation revealed 37 items remaining in five factors which accounted for 67% of the variance. The construct validity of SBLES was substantiated in a confirmatory factor analysis that revealed a good fit of the hypothesized factor structure. The findings tally with the criterion of convergent and discriminant validity. The range of internal consistency for five subscales was .90 to .93. Items were rated on a 5-point scale from 1 (strongly disagree) to 5 (strongly agree). The results of this study indicate that the SBLES is valid and reliable. The authors recommend that the scale could be applied in the nursing school to evaluate the effectiveness of simulation-based learning curricula. Copyright © 2016 Elsevier Ltd. All rights reserved.
International Comparisons of the Dysregulation Profile Based on Reports by Parents, Adolescents, and Teachers.

PubMed

Rescorla, Leslie A; Blumenfeld, Mary C; Ivanova, Masha Y; Achenbach, Thomas M; International Aseba Consortium

2018-06-14

Our objective was to examine international similarities and differences in the Dysregulation Profile (DP) of the Child Behavior Checklist (CBCL), Teacher's Report Form (TRF), and Youth Self-Report (YSR) via comparisons of data from many societies. Primary samples were those studied by Rescorla et al. (2012): CBCL: N = 69,866, 42 societies; YSR: N = 38,070, 34 societies; TRF: N = 37,244, 27 societies. Omnicultural Q correlations of items composing the DP (from the Anxious/Depressed, Attention Problems, and Aggressive Behavior syndromes) indicated considerable consistency across diverse societies with respect to which of the DP items tended to receive low, medium, or high ratings, whether ratings were provided by parents (M Q = .70), adolescents (M Q = .72), or teachers (M Q = .68). Omnicultural mean item ratings indicated that, for all 3 forms, the most common items on the DP reflect a mix of problems from all 3 constituent scales. Cross-informant analyses for the CBCL-YSR and CBCL-TRF supported these results. Aggregated DP scores, derived by summing ratings on all DP items, varied significantly by society. Age and gender differences were minor for all 3 forms, but boys scored higher than girls on the TRF. Many societies differing in ethnicity, religion, political/economic system, and geographical region manifested very similar DP scores. The most commonly reported DP problems reflected the mixed symptom picture of the DP, with dysregulation in mood, attention, and aggression. Overall, societies were more similar than different on DP scale scores and item ratings.
Sharing medicine: the candidacy of medicines and other household items for sharing, Dominican Republic.

PubMed

Dohn, Michael N; Pilkington, Hugo

2014-01-01

People share medicines and problems can result from this behavior. Successful interventions to change sharing behavior will require understanding people's motives and purposes for sharing medicines. Better information about how medicines fit into the gifting and reciprocity system could be useful in designing interventions to modify medicine sharing behavior. However, it is uncertain how people situate medicines among other items that might be shared. This investigation is a descriptive study of how people sort medicines and other shareable items. This study in the Dominican Republic examined how a convenience sample (31 people) sorted medicines and rated their shareability in relation to other common household items. We used non-metric multidimensional scaling to produce association maps in which the distances between items offer a visual representation of the collective opinion of the participants regarding the relationships among the items. In addition, from a pile sort constrained by four categories of whether sharing or loaning the item was acceptable (on a scale from not shareable to very shareable), we assessed the degree to which the participants rated the medicines as shareable compared to other items. Participants consistently grouped medicines together in all pile sort activities; yet, medicines were mixed with other items when rated by their candidacy to be shared. Compared to the other items, participants had more variability of opinion as to whether medicines should be shared. People think of medicines as a distinct group, suggesting that interventions might be designed to apply to medicines as a group. People's differing opinions as to whether it was appropriate to share medicines imply a degree of uncertainty or ambiguity that health promotion interventions might exploit to alter attitudes and behaviors. These findings have implications for the design of health promotion interventions to impact medicine sharing behavior.
Social Skills Intervention Planning for Preschoolers: Using the SSiS-Rating Scales to Identify Target Behaviors Valued by Parents and Teachers

ERIC Educational Resources Information Center

Frey, Jennifer R.; Elliott, Stephen N.; Kaiser, Ann P.

2014-01-01

Teachers' and parents' importance ratings of social behaviors for 95 preschoolers were examined using the "Social Skills Improvement System-Rating Scales" (Gresham & Elliott, 2008). Multivariate analyses were used to examine parents' and teachers' importance ratings at the item and subscale levels. Overall,…
Another Look at the PART-O Using the Traumatic Brain Injury Model Systems National Database: Scoring to Optimize Psychometrics.

PubMed

Malec, James F; Whiteneck, Gale G; Bogner, Jennifer A

2016-02-01

To integrate previous approaches to scoring the Participation Assessment with Recombined Tools-Objective (PART-O) in a unidimensional scale. Retrospective analysis of PART-O data from the Traumatic Brain Injury Model Systems. Community. Data from individuals (N=469) selected randomly from participants who completed 1-year follow-up in the Traumatic Brain Injury Model Systems were used in Rasch model development. The model was subsequently tested on data from additional random samples of similar size at 1-, 2-, 5-, 10-, and >15-year follow-ups. Not applicable. PART-O. After combining items for productivity and social interaction, the initial analysis at 1-year follow-up indicated relatively good fit to the Rasch model (person reliability=.80) but also suggested item misfit and that the 0-to-5 scale used for most items did not consistently show clear separation between rating levels. Reducing item rating scales to 3 levels (except combined and dichotomous items) resolved these issues and demonstrated good item level discrimination, fit, and person reliability (.81), with no evidence of multidimensionality. These results replicated in analyses at each additional follow-up period. Modifications to item scoring for the PART-O resulted in a unidimensional parametric equivalent measure that addresses previous concerns about competing item relations, and it fit the Rasch model consistently across follow-up periods. The person-item map shows a progression toward greater community participation from solitary and dyadic activities, such as leaving the house and having a friend through social and productivity activities, to group activities with others who share interests or beliefs. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Improving the evaluation of therapeutic interventions in multiple sclerosis: the role of new psychometric methods.

PubMed

Hobart, J; Cano, S

2009-02-01

In this monograph we examine the added value of new psychometric methods (Rasch measurement and Item Response Theory) over traditional psychometric approaches by comparing and contrasting their psychometric evaluations of existing sets of rating scale data. We have concentrated on Rasch measurement rather than Item Response Theory because we believe that it is the more advantageous method for health measurement from a conceptual, theoretical and practical perspective. Our intention is to provide an authoritative document that describes the principles of Rasch measurement and the practice of Rasch analysis in a clear, detailed, non-technical form that is accurate and accessible to clinicians and researchers in health measurement. A comparison was undertaken of traditional and new psychometric methods in five large sets of rating scale data: (1) evaluation of the Rivermead Mobility Index (RMI) in data from 666 participants in the Cannabis in Multiple Sclerosis (CAMS) study; (2) evaluation of the Multiple Sclerosis Impact Scale (MSIS-29) in data from 1725 people with multiple sclerosis; (3) evaluation of test-retest reliability of MSIS-29 in data from 150 people with multiple sclerosis; (4) examination of the use of Rasch analysis to equate scales purporting to measure the same health construct in 585 people with multiple sclerosis; and (5) comparison of relative responsiveness of the Barthel Index and Functional Independence Measure in data from 1400 people undergoing neurorehabilitation. Both Rasch measurement and Item Response Theory are conceptually and theoretically superior to traditional psychometric methods. Findings from each of the five studies show that Rasch analysis is empirically superior to traditional psychometric methods for evaluating rating scales, developing rating scales, analysing rating scale data, understanding and measuring stability and change, and understanding the health constructs we seek to quantify. There is considerable added value in using Rasch analysis rather than traditional psychometric methods in health measurement. Future research directions include the need to reproduce our findings in a range of clinical populations, detailed head-to-head comparisons of Rasch analysis and Item Response Theory, and the application of Rasch analysis to clinical practice.
The development and initial psychometric evaluation of a measure assessing adherence to prescribed exercise: the Exercise Adherence Rating Scale (EARS).

PubMed

Newman-Beinart, Naomi A; Norton, Sam; Dowling, Dominic; Gavriloff, Dimitri; Vari, Chiara; Weinman, John A; Godfrey, Emma L

2017-06-01

There is no gold standard for measuring adherence to prescribed home exercise. Self-report diaries are commonly used however lack of standardisation, inaccurate recall and self-presentation bias limit their validity. A valid and reliable tool to assess exercise adherence behaviour is required. Consequently, this article reports the development and psychometric evaluation of the Exercise Adherence Rating Scale (EARS). Development of a questionnaire. Secondary care in physiotherapy departments of three hospitals. A focus group consisting of 8 patients with chronic low back pain (CLBP) and 2 physiotherapists was conducted to generate qualitative data. Following on from this, a convenience sample of 224 people with CLBP completed the initial 16-item EARS for purposes of subsequent validity and reliability analyses. Construct validity was explored using exploratory factor analysis and item response theory. Test-retest reliability was assessed 3 weeks later in a sub-sample of patients. An item pool consisting of 6 items was found suitable for factor analysis. Examination of the scale structure of these 6 items revealed a one factor solution explaining a total of 71% of the variance in adherence to exercise. The six items formed a unidimensional scale that showed good measurement properties, including acceptable internal consistency and high test-retest reliability. The EARS enables the measurement of adherence to prescribed home exercise. This may facilitate the evaluation of interventions promoting self-management for both the prevention and treatment of chronic conditions. Copyright © 2017 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Psychometrics of the preschool behavioral and emotional rating scale with children from early childhood special education settings.

PubMed

Lambert, Matthew C; Cress, Cynthia J; Epstein, Michael H

2015-01-01

In a previous study with a nationally representative sample, researchers found that the items of the Preschool Behavioral and Emotional Rating Scale can best be described by a four-factor structure model (Emotional Regulation, School Readiness, Social Confidence, and Family Involvement). The findings of this investigation replicate and extend these previous results with a national sample of children (N = 1,075) with disabilities enrolled in early childhood special education programs. Data were analyzed using classical tests theory, Rasch modeling, and confirmatory factor analysis. Results confirmed that for the most part, individual items were internally consistent within a four-factor model and showed consistent item difficulty, discrimination, and fit relative to their respective subscale scores. © 2015 Michigan Association for Infant Mental Health.
Reliability of self-rated tinnitus distress and association with psychological symptom patterns.

PubMed

Hiller, W; Goebel, G; Rief, W

1994-05-01

Psychological complaints were investigated in two samples of 60 and 138 in-patients suffering from chronic tinnitus. We administered the Tinnitus Questionnaire (TQ), a 52-item self-rating scale which differentiates between dimensions of emotional and cognitive distress, intrusiveness, auditory perceptual difficulties, sleep disturbances and somatic complaints. The test-retest reliability was .94 for the TQ global score and between .86 and .93 for subscales. Three independent analyses were conducted to estimate the split-half reliability (internal consistency) which was only slightly lower than the test-retest values for scales with a relatively small number of items. Reliability was sufficient also on the level of single items. Low correlation between the TQ and the Hopkins Symptom Checklist (SCL-90-R) indicate a distinct quality of tinnitus-related and general psychological disturbances.
Improved collaborative filtering recommendation algorithm of similarity measure

NASA Astrophysics Data System (ADS)

Zhang, Baofu; Yuan, Baoping

2017-05-01

The Collaborative filtering recommendation algorithm is one of the most widely used recommendation algorithm in personalized recommender systems. The key is to find the nearest neighbor set of the active user by using similarity measure. However, the methods of traditional similarity measure mainly focus on the similarity of user common rating items, but ignore the relationship between the user common rating items and all items the user rates. And because rating matrix is very sparse, traditional collaborative filtering recommendation algorithm is not high efficiency. In order to obtain better accuracy, based on the consideration of common preference between users, the difference of rating scale and score of common items, this paper presents an improved similarity measure method, and based on this method, a collaborative filtering recommendation algorithm based on similarity improvement is proposed. Experimental results show that the algorithm can effectively improve the quality of recommendation, thus alleviate the impact of data sparseness.
The JFK Coma Recovery Scale--Revised.

PubMed

Kalmar, Kathleen; Giacino, Joseph T

2005-01-01

The JFK Coma Recovery Scale (CRS) was developed to help characterise and monitor patients functioning at Rancho Levels I-IV and has been used widely in both clinical and research settings within the US and Europe. The CRS was recently revised to address a number of concerns emanating from our own clinical experience with the scale, feedback from users and researchers as well as the results of Rasch analyses. Additionally, the CRS did not include all of the behavioural criteria necessary to diagnose the minimally conscious state (MCS), thereby limiting diagnostic utility. The revised JFK Coma Recovery Scale (CRS-R) includes addition of new items, merging of items found to be statistically similar, deletion or modification of items showing poor fit with the scale's underlying construct, renaming of items, more stringent scoring criteria, and quantification of elicited behaviours to improve accuracy of rating. Psychometric properties of the CRS-R appear to meet standards for measurement and evaluation tools for use in clinical and research settings, and diagnostic application suggests that the scale is capable of discriminating patients in the minimally conscious state from those in the vegetative state.
Cross-cultural adaptation and psychometric evaluations of the Turkish version of Parkinson Fatigue Scale.

PubMed

Ozturk, Erhan Arif; Kocer, Bilge Gonenli; Umay, Ebru; Cakci, Aytul

2018-06-07

The objectives of the present study were to translate and cross-culturally adapt the English version of the Parkinson Fatigue Scale into Turkish, to evaluate its psychometric properties, and to compare them with that of other language versions. A total of 144 patients with idiopathic Parkinson disease were included in the study. The Turkish version of Parkinson Fatigue Scale was evaluated for data quality, scaling assumptions, acceptability, reliability, and validity. The questionnaire response rate was 100% for both test and retest. The percentage of missing data was zero for items, and the percentage of computable scores was full. Floor and ceiling effects were absent. The Parkinson Fatigue Scale provides an acceptable internal consistency (Cronbach's alpha was 0.974 for 1st test and 0.964 for a retest, and corrected item-to-total correlations were ranged from 0.715 to 0.906) and test-retest reliability (Cohen's kappa coefficients were ranged from 0.632 to 0.786 for individuals items, and intraclass correlation coefficient was 0.887 for the overall Parkinson Fatigue Scale Score). An exploratory factor analysis of the items revealed a single factor explaining 71.7% of variance. The goodness-of-fit statistics for the one-factorial confirmatory factor analysis were Tucker Lewis index = 0.961, comparative fit index = 0.971 and root mean square error of approximation = 0.077 for a single factor. The average Parkinson Fatigue Scale Score was correlated significantly with sociodemographic data, clinical characteristics and scores of rating scales. The Turkish version of the Parkinson Fatigue Scale seems to be culturally well adapted and have good psychometric properties. The scale can be used in further studies to assess the fatigue in patients with Parkinson's disease.
Standardized reporting guidelines for emergency department syncope risk-stratification research.

PubMed

Sun, Benjamin C; Thiruganasambandamoorthy, Venkatesh; Cruz, Jeffrey Dela

2012-06-01

There is increasing research interest in the risk stratification of emergency department (ED) syncope patients. A major barrier to comparing and synthesizing existing research is wide variation in the conduct and reporting of studies. The authors wanted to create standardized reporting guidelines for ED syncope risk-stratification research using an expert consensus process. In that pursuit, a panel of syncope researchers was convened and a literature review was performed to identify candidate reporting guideline elements. Candidate elements were grouped into four sections: eligibility criteria, outcomes, electrocardiogram (ECG) findings, and predictors. A two-round, modified Delphi consensus process was conducted using an Internet-based survey application. In the first round, candidate elements were rated on a five-point Likert scale. In the second round, panelists rerated items after receiving information about group ratings from the first round. Items that were rated by >80% of the panelists at the two highest levels of the Likert scale were included in the final guidelines. There were 24 panelists from eight countries who represented five clinical specialties. The panel identified an initial set of 183 candidate elements. After two survey rounds, the final reporting guidelines included 92 items that achieved >80% consensus. These included 10 items for study eligibility, 23 items for outcomes, nine items for ECG abnormalities, and 50 items for candidate predictors. Adherence to these guidelines should facilitate comparison of future research in this area. © 2012 by the Society for Academic Emergency Medicine.
Measuring leader perceptions of school readiness for reforms: use of an iterative model combining classical and Rasch methods.

PubMed

Chatterji, Madhabi

2002-01-01

This study examines validity of data generated by the School Readiness for Reforms: Leader Questionnaire (SRR-LQ) using an iterative procedure that combines classical and Rasch rating scale analysis. Following content-validation and pilot-testing, principal axis factor extraction and promax rotation of factors yielded a five factor structure consistent with the content-validated subscales of the original instrument. Factors were identified based on inspection of pattern and structure coefficients. The rotated factor pattern, inter-factor correlations, convergent validity coefficients, and Cronbach's alpha reliability estimates supported the hypothesized construct properties. To further examine unidimensionality and efficacy of the rating scale structures, item-level data from each factor-defined subscale were subjected to analysis with the Rasch rating scale model. Data-to-model fit statistics and separation reliability for items and persons met acceptable criteria. Rating scale results suggested consistency of expected and observed step difficulties in rating categories, and correspondence of step calibrations with increases in the underlying variables. The combined approach yielded more comprehensive diagnostic information on the quality of the five SRR-LQ subscales; further research is continuing.
Bifactor and Item Response Theory Analyses of Interviewer Report Scales of Cognitive Impairment in Schizophrenia

PubMed Central

Reise, Steven P.; Ventura, Joseph; Keefe, Richard S. E.; Baade, Lyle E.; Gold, James M.; Green, Michael F.; Kern, Robert S.; Mesholam-Gately, Raquelle; Nuechterlein, Keith H.; Seidman, Larry J.; Bilder, Robert

2011-01-01

We conducted psychometric analyses of two interview-based measures of cognitive deficits: the 21-item Clinical Global Impression of Cognition in Schizophrenia (CGI-CogS; Ventura et al., 2008), and the 20-item Schizophrenia Cognition Rating Scale (SCoRS; Keefe et al., 2006), which were administered on two occasions to a sample of people with schizophrenia. Traditional psychometrics, bifactor analysis, and item response theory (IRT) methods were used to explore item functioning, dimensionality, and to compare instruments. Despite containing similar item content, responses to the CGI-CogS demonstrated superior psychometric properties (e.g., higher item-intercorrelations, better spread of ratings across response categories), relative to the SCoRS. We argue that these differences arise mainly from the differential use of prompts and how the items are phrased and scored. Bifactor analysis demonstrated that although both measures capture a broad range of cognitive functioning (e.g., working memory, social cognition), the common variance on each is overwhelmingly explained by a single general factor. IRT analyses of the combined pool of 41 items showed that measurement precision is peaked in the mild to moderate range of cognitive impairment. Finally, simulated adaptive testing revealed that only about 10 to 12 items are necessary to achieve latent trait level estimates with reasonably small standard errors for most individuals. This suggests that these interview-based measures of cognitive deficits could be shortened without loss of measurement precision. PMID:21381848

Psychometric properties and cross-cultural equivalence of the Arabic Social Capital Scale: instrument development study.

PubMed

Looman, Wendy Sue; Farrag, Shewikar

2009-01-01

Social capital, defined as an investment in relationships that facilitates the exchange of resources, has been identified as a possible protective factor for child health in the context of risk factors such as poverty. Reliable and valid measures of social capital are needed for research and practice, particularly in non-English-speaking populations in developing countries. To evaluate the psychometric properties and cross-cultural equivalence of the Arabic translation of the Social Capital Scale (SCS). Descriptive, cross-sectional study for psychometric testing of a translated tool. Two metropolitan health clinics in Alexandria, Egypt. A convenience sample of 117 Egyptian parents of children with chronic conditions. To be eligible to participate, respondents had to be a parent of child with a chronic health condition between the ages of 1 and 18 years. The sample included primarily biological parents between the ages of 20 and 56 years. The 20-item Arabic SCS was administered as part of a written survey that included additional measures on demographic information and parent ratings of the child's overall health. Six items were ultimately removed based on item analysis, and exploratory factor analysis was conducted on the resulting 14-item scale. As a measure of construct validity, hypothesis testing was conducted using an independent samples t-test to determine whether a significant difference exists between mean total social capital scores for two groups of respondents based on the parental rating of the child's overall health. Item and factor analysis yielded preliminary support for a revised, 14-item Arabic SCS with four internally consistent factors. The standardized item alpha reliability coefficient for the total 14-item scale was .75. Respondents who reported that their child was in good health had significantly higher social capital scores than those who rated their child's health as poor. The 14-item Arabic SCS was found to be reliable and valid in this sample, with four internally consistent factors. While the tool may not be appropriate for comparing social capital between cultural groups, it will enable clinicians and researchers to address an important gap in knowledge characterized by a paucity of research on childhood chronic illness in low- and middle-income countries such as Egypt.
The Self-Perception and Relationships Tool (S-PRT): A novel approach to the measurement of subjective health-related quality of life

PubMed Central

Atkinson, Mark J; Wishart, Paul M; Wasil, Bushra I; Robinson, John W

2004-01-01

Background The Self-Perception and Relationships Tool (S-PRT) is intended to be a clinically responsive and holistic assessment of patients' experience of illness and subjective Health Related Quality of Life (HRQL). Methods A diversity of patients were involved in two phases of this study. Patient samples included individuals involved with renal, cardiology, psychiatric, cancer, chronic pelvic pain, and sleep services. In Phase I, five patient focus groups generated 128 perceptual rating scales. These scales described important characteristics of illness-related experience within six life domains (i.e., Physical, Mental-Emotional, Interpersonal Receptiveness, Interpersonal Contribution, Transpersonal Receptiveness and Transpersonal Orientation). Item reduction was accomplished using Importance Q-sort and Importance Checklist methodologies with 150 patients across the participating services. In Phase II, a refined item pool (88 items) was administered along with measures of health status (SF-36) and spiritual beliefs (Spiritual Involvements and Beliefs Scale – SIBS) to 160 patients, of these 136 patients returned complete response sets. Results Factor analysis of S-PRT results produced a surprisingly clean five-factor solution (Eigen values> 2.0 explaining 73.5% of the pooled variance). Items with weaker or split loadings were removed leaving 36 items to form the final S-PRT rating scales; Intrapersonal Well-being (physical, mental & emotional items), Interpersonal Receptivity, Interpersonal Contribution, Transpersonal Receptivity and Transpersonal Orientation (Eigen values> 5.4 explaining 83.5% of the pooled variance). The internal consistency (Cronbach's Alpha) of these scales was very high (0.82–0.97). Good convergent correlations (0.40 to 0.67) were observed between the S-PRT scales and the Mental Health scales of the SF-36. Correlations between the S-PRT Intrapersonal Well-being scale and three of SF-36 Physical Health scales were moderate (0.30 to 0.46). The criterion-related validity of the S-PRT spiritual scales was supported by moderate convergence (0.40–0.49) with three SIBS scales. Conclusion Evidence supports the validity of the S-PRT as a generally applicable measure of perceived health status and HRQL. The test-retest reliability was found to be adequate for most scales, and there is some preliminary evidence that the S-PRT is responsive to patient-reported changes in determinants of their HRQL. Clinical uses and directions for future research are discussed. PMID:15257754
Efficacy of vilazodone on anxiety symptoms in patients with major depressive disorder

PubMed Central

Chen, Dalei; Edwards, John; Ruth, Adam

2014-01-01

Anxiety symptoms are prevalent in patients with major depressive disorder. A post-hoc analysis of two phase III trials was conducted to evaluate the efficacy of vilazodone on depression-related anxiety. Using the 17-item Hamilton Depression Rating Scale (HAMD17) Anxiety/Somatization subscale, patients were classified as anxious or nonanxious. Improvements in depressive symptoms were based on least squares mean changes in HAMD17 and Montgomery–Asberg Depression Rating Scale total scores. Anxiety symptoms in the anxious subgroup were evaluated using Hamilton Anxiety Rating Scale (HAMA) total and subscale (Psychic Anxiety, Somatic Anxiety) scores, HAMD17 Anxiety/Somatization subscale and item (Psychic Anxiety, Somatic Anxiety) scores, and the Montgomery–Asberg Depression Rating Scale Inner Tension item score. Most of the pooled study population [82.0% (708/863)] was classified with anxious depression. After 8 weeks of treatment, least squares mean differences between vilazodone and placebo for changes in HAMA total and HAMD17 Anxiety/Somatization subscale scores were −1.82 (95% confidence interval −2.81 to −0.83; P<0.001) and −0.75 (95% confidence interval −1.17 to −0.32; P<0.001), respectively. Statistically significant improvements with vilazodone were also found on all other anxiety-related measures, except the HAMA Somatic Anxiety subscale. Vilazodone may be effective in treating patients with major depressive disorder who exhibit somatic and/or psychic symptoms of anxiety. PMID:24978955
Using Rasch Analysis to Inform Rating Scale Development

ERIC Educational Resources Information Center

Van Zile-Tamsen, Carol

2017-01-01

The use of surveys, questionnaires, and rating scales to measure important outcomes in higher education is pervasive, but reliability and validity information is often based on problematic Classical Test Theory approaches. Rasch Analysis, based on Item Response Theory, provides a better alternative for examining the psychometric quality of rating…
Preliminary Validation of the Motor Skills Rating Scale

ERIC Educational Resources Information Center

Cameron, Claire E.; Chen, Wei-Bing; Blodgett, Julia; Cottone, Elizabeth A.; Mashburn, Andrew J.; Brock, Laura L.; Grissmer, David

2012-01-01

This study examined psychometric properties of the Motor Skills Rating Scale (MSRS), a questionnaire designed for classroom teachers of children in early elementary school. Items were developed with the guidance of two occupational therapists, and factor structure was examined with an exploratory factor analysis (EFA). The resulting model showed…
Examining the Psychometric Properties of the Infant-Toddler Environment Rating Scale-Revised Edition in a High-Stakes Context

ERIC Educational Resources Information Center

Bisceglia, Rossana; Perlman, Michal; Schaack, Diana; Jenkins, Jennifer

2009-01-01

The psychometric properties of the Infant-Toddler Environment Rating Scale-Revised Edition (ITERS-R) were examined using 153 classrooms from child-care centers where resources were tied to center performance. An exploratory factor analysis revealed that the scale measures one global aspect of quality. To decrease redundancy, subsets of items were…
International epidemiology of child and adolescent psychopathology ii: integration and applications of dimensional findings from 44 societies.

PubMed

Rescorla, Leslie; Ivanova, Masha Y; Achenbach, Thomas M; Begovac, Ivan; Chahed, Myriam; Drugli, May Britt; Emerich, Deisy Ribas; Fung, Daniel S S; Haider, Mariam; Hansson, Kjell; Hewitt, Nohelia; Jaimes, Stefanny; Larsson, Bo; Maggiolini, Alfio; Marković, Jasminka; Mitrović, Dragan; Moreira, Paulo; Oliveira, João Tiago; Olsson, Martin; Ooi, Yoon Phaik; Petot, Djaouida; Pisa, Cecilia; Pomalima, Rolando; da Rocha, Marina Monzani; Rudan, Vlasta; Sekulić, Slobodan; Shahini, Mimoza; de Mattos Silvares, Edwiges Ferreira; Szirovicza, Lajos; Valverde, José; Vera, Luis Anderssen; Villa, Maria Clara; Viola, Laura; Woo, Bernardine S C; Zhang, Eugene Yuqing

2012-12-01

To build on Achenbach, Rescorla, and Ivanova (2012) by (a) reporting new international findings for parent, teacher, and self-ratings on the Child Behavior Checklist, Youth Self-Report, and Teacher's Report Form; (b) testing the fit of syndrome models to new data from 17 societies, including previously underrepresented regions; (c) testing effects of society, gender, and age in 44 societies by integrating new and previous data; (d) testing cross-society correlations between mean item ratings; (e) describing the construction of multisociety norms; (f) illustrating clinical applications. Confirmatory factor analyses (CFAs) of parent, teacher, and self-ratings, performed separately for each society; tests of societal, gender, and age effects on dimensional syndrome scales, DSM-oriented scales, Internalizing, Externalizing, and Total Problems scales; tests of agreement between low, medium, and high ratings of problem items across societies. CFAs supported the tested syndrome models in all societies according to the primary fit index (Root Mean Square Error of Approximation [RMSEA]), but less consistently according to other indices; effect sizes were small-to-medium for societal differences in scale scores, but very small for gender, age, and interactions with society; items received similarly low, medium, or high ratings in different societies; problem scores from 44 societies fit three sets of multisociety norms. Statistically derived syndrome models fit parent, teacher, and self-ratings when tested individually in all 44 societies according to RMSEAs (but less consistently according to other indices). Small to medium differences in scale scores among societies supported the use of low-, medium-, and high-scoring norms in clinical assessment of individual children. Copyright © 2012 American Academy of Child and Adolescent Psychiatry. Published by Elsevier Inc. All rights reserved.
Cross Validated Temperament Scale Validities Computed Using Profile Similarity Metrics

DTIC Science & Technology

2017-04-27

true at both the item and the scale level. 6 Moreover, the correlation between conventional scores and distance scores for these types of scales...have a perfect negative correlation , r = -1.00. From this perspective, conventional and distance scores are completely redundant. Therefore, we argue... correlation between each respondent’s rating profile and the scale key: shape-scores = rx,k. 2. Rating elevation difference, which is computed as the
The Swedish Version of the Ritvo Autism and Asperger Diagnostic Scale: Revised (RAADS-R). A Validation Study of a Rating Scale for Adults

ERIC Educational Resources Information Center

Andersen, Lisa M. J.; Naswall, Katharina; Manouilenko, Irina; Nylander, Lena; Edgar, Johan; Ritvo, Riva Ariella; Ritvo, Edward; Bejerot, Susanne

2011-01-01

There is a paucity of diagnostic instruments for adults with autism spectrum disorder (ASD). This study evaluates the psychometric properties of the Swedish version of the Ritvo Autism and Asperger Diagnostic Scale-Revised (RAADS-R), an 80-item self-rating scale designed to assist clinicians diagnosing ASD in adults. It was administered to 75…
Effect of ketamine dose on self-rated dissociation in patients with treatment refractory anxiety disorders.

PubMed

Castle, Cameron; Gray, Andrew; Neehoff, Shona; Glue, Paul

2017-10-01

Patients receiving ketamine for refractory depression and anxiety report dissociative symptoms in the first 60 min post-dose. The most commonly used instrument to assess this is the Clinician-Administered Dissociative States Scale (CADSS), developed based on the assessment of patients with dissociative symptoms. Its psychometric properties for ketamine-induced dissociation have not been reported. We evaluated these from a study using 0.25-1 mg/kg ketamine and midazolam (as an active control) in 18 patients with treatment-resistant anxiety. Dissociation ratings were increased by ketamine in a dose-dependent manner. In contrast, midazolam showed no effect on ratings of dissociation. For individual CADSS items, the magnitude of change and the ketamine dose at which changes were observed were not homogenous. The Cronbach alpha for the total scale was high (0.937), with acceptable item-rest correlations for almost all individual items. Purposefully removing items to maximise alpha did not lead to meaningful improvements. Acceptable internal consistency was still observed after removing items which lacked evidence of responsiveness at lower doses. The high Cronbach alpha values identified in this study suggests that the CADSS is an internally consistent instrument for evaluating ketamine-induced dissociation in clinical trials in anxiety, although it does not capture symptoms such as thought disorder.
Assessing Children's Homework Performance: Development of Multi-Dimensional, Multi-Informant Rating Scales.

PubMed

Power, Thomas J; Dombrowski, Stefan C; Watkins, Marley W; Mautone, Jennifer A; Eagle, John W

2007-06-01

Efforts to develop interventions to improve homework performance have been impeded by limitations in the measurement of homework performance. This study was conducted to develop rating scales for assessing homework performance among students in elementary and middle school. Items on the scales were intended to assess student strengths as well as deficits in homework performance. The sample included 163 students attending two school districts in the Northeast. Parents completed the 36-item Homework Performance Questionnaire - Parent Scale (HPQ-PS). Teachers completed the 22-item teacher scale (HPQ-TS) for each student for whom the HPQ-PS had been completed. A common factor analysis with principal axis extraction and promax rotation was used to analyze the findings. The results of the factor analysis of the HPQ-PS revealed three salient and meaningful factors: student task orientation/efficiency, student competence, and teacher support. The factor analysis of the HPQ-TS uncovered two salient and substantive factors: student responsibility and student competence. The findings of this study suggest that the HPQ is a promising set of measures for assessing student homework functioning and contextual factors that may influence performance. Directions for future research are presented.
Assessing Children’s Homework Performance: Development of Multi-Dimensional, Multi-Informant Rating Scales

PubMed Central

Power, Thomas J.; Dombrowski, Stefan C.; Watkins, Marley W.; Mautone, Jennifer A.; Eagle, John W.

2007-01-01

Efforts to develop interventions to improve homework performance have been impeded by limitations in the measurement of homework performance. This study was conducted to develop rating scales for assessing homework performance among students in elementary and middle school. Items on the scales were intended to assess student strengths as well as deficits in homework performance. The sample included 163 students attending two school districts in the Northeast. Parents completed the 36-item Homework Performance Questionnaire – Parent Scale (HPQ-PS). Teachers completed the 22-item teacher scale (HPQ-TS) for each student for whom the HPQ-PS had been completed. A common factor analysis with principal axis extraction and promax rotation was used to analyze the findings. The results of the factor analysis of the HPQ-PS revealed three salient and meaningful factors: student task orientation/efficiency, student competence, and teacher support. The factor analysis of the HPQ-TS uncovered two salient and substantive factors: student responsibility and student competence. The findings of this study suggest that the HPQ is a promising set of measures for assessing student homework functioning and contextual factors that may influence performance. Directions for future research are presented. PMID:18516211
Rating disease progression of Friedreich’s ataxia by the International Cooperative Ataxia Rating Scale: analysis of a 603-patient database

PubMed Central

Coppard, Nicholas; Cooper, Jonathon M.; Delatycki, Martin B.; Dürr, Alexandra; Di Prospero, Nicholas A.; Giunti, Paola; Lynch, David R.; Schulz, J. B.; Rummey, Christian; Meier, Thomas

2013-01-01

The aim of this cross-sectional study was to analyse disease progression in Friedreich’s ataxia as measured by the International Cooperative Ataxia Rating Scale. Single ratings from 603 patients with Friedreich’s ataxia were analysed as a function of disease duration, age of onset and GAA repeat lengths. The relative contribution of items and subscales to the total score was studied as a function of disease progression. In addition, the scaling properties were assessed using standard statistical measures. Average total scale progression per year depends on the age of disease onset, the time since diagnosis and the GAA repeat length. The age of onset inversely correlates with increased GAA repeat length. For patients with an age of onset ≤14 years associated with a longer repeat length, the average yearly rate of decline was 2.5 ± 0.18 points in the total International Cooperative Ataxia Rating Scale for the first 20 years of disease duration, whereas patients with a later onset progress more slowly (1.8 ± 0.27 points/year). Ceiling effects in posture, gait and lower limb scale items lead to a reduced sensitivity of the scale in the severely affected population with a total score of >60 points. Psychometric scaling analysis shows generally favourable properties for the total scale, but the subscale grouping could be improved. This cross-sectional study provides a detailed characterization of the International Cooperative Ataxia Rating Scale. The analysis further provides rates of change separated for patients with early and late disease onset, which is driven by the GAA repeat length. Differences in the subscale dynamics merit consideration in the design of future clinical trials applying this scale as a neurological assessment instrument in Friedreich’s ataxia. PMID:23365101
Children's Depression Inventory (CDI) and the Children's Depression Rating Scale-Revised (CDRS-R): reliability of the Hebrew version.

PubMed

Zalsman, Gil; Misgav, Sagit; Sommerfeld, Eliane; Kohn, Yoav; Brunstein-Klomek, Anat; Diller, Robyne; Sher, Leo; Schwartz, Joseph; Shoval, Gal; Ben-Dor, David H; Wolovik, Luisa; Oquendo, Maria A

2005-01-01

The Children's Depression Inventory (CDI) and Children's Depression Rating Scale-Revised (CDRS-R) are two widely used instruments, which measure depression in children and adolescents. This pilot study assessed the reliability of the Hebrew versions of these two instruments. Both CDRS-R and CDI were translated from English into Hebrew and then back translated. Seventeen healthy Israeli bilingual children volunteers were interviewed with both scales with a one day intermission between the interviews. Non-parametric correlations were used to compare scores in the two versions for each item. Results showed high agreement between the two versions for almost all items of the CDI and moderate to high for the CDRS-R. When CDRS-R summary scores for each item were compared, the agreement was high for this instrument as well. It is concluded that both CDI and CDRS-R Hebrew versions are reliable and can be used for studies of depression in the Israeli pediatric population.
"Up Means Good": The Effect of Screen Position on Evaluative Ratings in Web Surveys.

PubMed

Tourangeau, Roger; Couper, Mick P; Conrad, Frederick G

2013-01-01

This paper presents results from six experiments that examine the effect of the position of an item on the screen on the evaluative ratings it receives. The experiments are based on the idea that respondents expect "good" things-those they view positively-to be higher up on the screen than "bad" things. The experiments use items on different topics (Congress and HMOs, a variety of foods, and six physician specialties) and different methods for varying their vertical position on the screen. A meta-analysis of all six experiments demonstrates a small but reliable effect of the item's screen position on mean ratings of the item; the ratings are significantly more positive when the item appears in a higher position on the screen than when it appears farther down. These results are consistent with the hypothesis that respondents follow the "Up means good" heuristic, using the vertical position of the item as a cue in evaluating it. Respondents seem to rely on heuristics both in interpreting response scales and in forming judgments.
Item response theory analysis of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised in the Pooled Resource Open-Access ALS Clinical Trials Database.

PubMed

Bacci, Elizabeth D; Staniewska, Dorota; Coyne, Karin S; Boyer, Stacey; White, Leigh Ann; Zach, Neta; Cedarbaum, Jesse M

2016-01-01

Our objective was to examine dimensionality and item-level performance of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised (ALSFRS-R) across time using classical and modern test theory approaches. Confirmatory factor analysis (CFA) and Item Response Theory (IRT) analyses were conducted using data from patients with amyotrophic lateral sclerosis (ALS) Pooled Resources Open-Access ALS Clinical Trials (PRO-ACT) database with complete ALSFRS-R data (n = 888) at three time-points (Time 0, Time 1 (6-months), Time 2 (1-year)). Results demonstrated that in this population of 888 patients, mean age was 54.6 years, 64.4% were male, and 93.7% were Caucasian. The CFA supported a 4* individual-domain structure (bulbar, gross motor, fine motor, and respiratory domains). IRT analysis within each domain revealed misfitting items and overlapping item response category thresholds at all time-points, particularly in the gross motor and respiratory domain items. Results indicate that many of the items of the ALSFRS-R may sub-optimally distinguish among varying levels of disability assessed by each domain, particularly in patients with less severe disability. Measure performance improved across time as patient disability severity increased. In conclusion, modifications to select ALSFRS-R items may improve the instrument's specificity to disability level and sensitivity to treatment effects.
[Study of functional rating scale for amyotrophic lateral sclerosis: revised ALSFRS(ALSFRS-R) Japanese version].

PubMed

Ohashi, Y; Tashiro, K; Itoyama, Y; Nakano, I; Sobue, G; Nakamura, S; Sumino, S; Yanagisawa, N

2001-04-01

Amyotrophic lateral sclerosis(ALS) is progressive, degenerative, fatal disease of the motor neuron. No efficacious therapy is available to slow the progressive loss of function, but several new approaches including neurotrophic factors, antioxidants and glutamate antagonists, are currently being evaluated as potential therapies. Mortality, and/or time to tracheostomy, muscle strength and pulmonary function are used as primary endpoints in clinical trials for treatment of ALS. The effect of new therapies on the quality of patients' lives are also important, so we sought to develop a rating scale to measure it. The revised ALS Functional Rating Scale(ALSFRS-R), which has addition of items to ALSFRS to enhance the ability to assess respiratory symptoms, is an assessment determining the degree of impairment in ALS patients' abilities to function independently in activities of daily living. It consists of 12 items to evaluate bulbar function, motor function and respiratory function and each item is scored from 0(unable) to 4(normal). We translated the English score into Japanese one with minor modification considering the inter cultural difference. And we examined reliability of the translated scale. As a measure of reliability, the intraclass correlation coefficient(ICC) was evaluated for total score and the Kappa coefficient proposed by Cohen and Kraemer was calculated for each item. Moreover, we examined sensitivity to clinical change over time and carried out the factor analysis to analyze the factorial structure. The subjects were 27 ALS patients and each was scored twice for reliability or three times for sensitivity by 2 to 5 neurologists and if possible, nurses. The ICC for total score was 0.97(95% C. I.; 0.94-0.98). Extension of the Kappa coefficients were 0.48 to 1.00 for inter-rater reliability and the averaged Kappa coefficients were 0.63 to 1.00 for intra rater reliability, respectively. Concerning the factorial structure, the contribution of the first factor(the first principal component) were 53.5% principal factor solution. The factor loadings of items were 0.52-0.91 except "salivation" and this factor almost equal to the simple sum of all items was interpreted as the general degree of deterioration. The promax votation revealed the riginally supposed factor structure with 3 factors(groups of items): neuromuscuclar function, respiratory function and bulbar function. The rating scale correlated with Global clinical impression of change(GCIC) scored by neurologists and declined with time, indicating its sensitivity to change. On the bases of these results, ALSFRS-R(Japanese version) is considered to be highly reliable enough for clinical use.
Identifying Core Competencies of Infection Control Nurse Specialists in Hong Kong.

PubMed

Chan, Wai Fong; Bond, Trevor G; Adamson, Bob; Chow, Meyrick

2016-01-01

To confirm a core competency scale for Hong Kong infection control nurses at the advanced nursing practice level from the core competency items proposed in a previous phase of this study. This would serve as the foundation of competency assurance in Hong Kong hospitals. A cross-sectional survey design was used. All public and private hospitals in Hong Kong. All infection control nurses in hospitals of Hong Kong. The 83-item proposed core competency list established in an earlier study was transformed into a questionnaire and sent to 112 infection control nurses in 48 hospitals in Hong Kong. They were asked to rate the importance of each infection prevention and control item using Likert-style response categories. Data were analyzed using the Rasch model. The response rate of 81.25% was achieved. Seven items were removed from the proposed core competency list, leaving a scale of 76 items that fit the measurement requirements of the unidimensional Rasch model. Essential core competency items of advanced practice for infection control nurses in Hong Kong were identified based on the measurement criteria of the Rasch model. Several items of the scale that reflect local Hong Kong contextual characteristics are distinguished from the overseas standards. This local-specific competency list could serve as the foundation for education and for certification of infection control nurse specialists in Hong Kong. Rasch measurement is an appropriate analytical tool for identifying core competencies of advanced practice nurses in other specialties and in other locations in a manner that incorporates practitioner judgment and expertise.
The Palin Parent Rating Scales: Parents' Perspectives of Childhood Stuttering and Its Impact.

PubMed

Millard, Sharon K; Davis, Stephen

2016-10-01

The goal of this study is to explore the psychometric properties of the Parent Rating Scales-V1 (S. K. Millard, S. Edwards, & F. M. Cook, 2009), an assessment tool for parents of children who stutter, and to refine the measure accordingly. We included 259 scales completed prior to therapy. An exploratory factor analysis determined the test constructs and identified the items that had greatest loadings on those factors. Items that did not load on the factors were removed, and normative scores calculated. The resulting 19-item questionnaire measures three factors: (a) the impact of stuttering on the child; (b) the severity of stuttering and its impact on the parents; and (c) the parents' knowledge about stuttering and confidence in managing it. Reliability was demonstrated, norms established, and an automated online version constructed. The Palin Parent Rating Scale is a valid and reliable tool, providing a method of exploring parents' perceptions of stuttering, the impact it has on the child and themselves, and the parents' knowledge of and confidence in managing the stuttering. This is an important addition to the existing range of assessments that may be used to evaluate stuttering in children up to age 14;6 (years;months) and allows the wider targets of parent-led therapy programs to be evaluated.
The Consumer Assessment of Healthcare Providers and Systems (CAHPS) cultural competence (CC) item set.

PubMed

Weech-Maldonado, Robert; Carle, Adam; Weidmer, Beverly; Hurtado, Margarita; Ngo-Metzger, Quyen; Hays, Ron D

2012-09-01

There is a need for reliable and valid measures of cultural competence (CC) from the patient's perspective. This paper evaluates the reliability and validity of the Consumer Assessments of Healthcare Providers and Systems (CAHPS) CC item set. Using 2008 survey data, we assessed the internal consistency of the CAHPS CC scales using the Cronbach α's and examined the validity of the measures using exploratory and confirmatory factor analysis, multitrait scaling analysis, and regression analysis. A random stratified sample (based on race/ethnicity and language) of 991 enrollees, younger than 65 years, from 2 Medicaid managed care plans in California and New York. CAHPS CC item set after excluding screener items and ratings. Confirmatory factor analysis (Comparative Fit Index=0.98, Tucker Lewis Index=0.98, and Root Mean Square Error or Approximation=0.06) provided support for a 7-factor structure: Doctor Communication--Positive Behaviors, Doctor Communication--Negative Behaviors, Doctor Communication--Health Promotion, Doctor Communication--Alternative Medicine, Shared Decision-Making, Equitable Treatment, and Trust. Item-total correlations (corrected for item overlap) for the 7 scales exceeded 0.40. Exploratory factor analysis showed support for 1 additional factor: Access to Interpreter Services. Internal consistency reliability estimates ranged from 0.58 (Alternative Medicine) to 0.92 (Positive Behaviors) and was 0.70 or higher for 4 of the 8 composites. All composites were positively and significantly associated with the overall doctor rating. The CAHPS CC 26-item set demonstrates adequate measurement properties and can be used as a supplemental item set to the CAHPS Clinician and Group Surveys in assessing culturally competent care from the patient's perspective.

Which kind of psychometrics is adequate for patient satisfaction questionnaires?

PubMed

Konerding, Uwe

2016-01-01

The construction and psychometric analysis of patient satisfaction questionnaires are discussed. The discussion is based upon the classification of multi-item questionnaires into scales or indices. Scales consist of items that describe the effects of the latent psychological variable to be measured, and indices consist of items that describe the causes of this variable. Whether patient satisfaction questionnaires should be constructed and analyzed as scales or as indices depends upon the purpose for which these questionnaires are required. If the final aim is improving care with regard to patients' preferences, then these questionnaires should be constructed and analyzed as indices. This implies two requirements: 1) items for patient satisfaction questionnaires should be selected in such a way that the universe of possible causes of patient satisfaction is covered optimally and 2) Cronbach's alpha, principal component analysis, exploratory factor analysis, confirmatory factor analysis, and analyses with models from item response theory, such as the Rasch Model, should not be applied for psychometric analyses. Instead, multivariate regression analyses with a direct rating of patient satisfaction as the dependent variable and the individual questionnaire items as independent variables should be performed. The coefficients produced by such an analysis can be applied for selecting the best items and for weighting the selected items when a sum score is determined. The lower boundaries of the validity of the unweighted and the weighted sum scores can be estimated by their correlations with the direct satisfaction rating. While the first requirement is fulfilled in the majority of the previous patient satisfaction questionnaires, the second one deviates from previous practice. Hence, if patient satisfaction is actually measured with the final aim of improving care with regard to patients' preferences, then future practice should be changed so that the second requirement is also fulfilled.
A general theoretical framework for interpreting patient-reported outcomes estimated from ordinally scaled item responses.

PubMed

Massof, Robert W

2014-10-01

A simple theoretical framework explains patient responses to items in rating scale questionnaires. Fixed latent variables position each patient and each item on the same linear scale. Item responses are governed by a set of fixed category thresholds, one for each ordinal response category. A patient's item responses are magnitude estimates of the difference between the patient variable and the patient's estimate of the item variable, relative to his/her personally defined response category thresholds. Differences between patients in their personal estimates of the item variable and in their personal choices of category thresholds are represented by random variables added to the corresponding fixed variables. Effects of intervention correspond to changes in the patient variable, the patient's response bias, and/or latent item variables for a subset of items. Intervention effects on patients' item responses were simulated by assuming the random variables are normally distributed with a constant scalar covariance matrix. Rasch analysis was used to estimate latent variables from the simulated responses. The simulations demonstrate that changes in the patient variable and changes in response bias produce indistinguishable effects on item responses and manifest as changes only in the estimated patient variable. Changes in a subset of item variables manifest as intervention-specific differential item functioning and as changes in the estimated person variable that equals the average of changes in the item variables. Simulations demonstrate that intervention-specific differential item functioning produces inefficiencies and inaccuracies in computer adaptive testing. © The Author(s) 2013 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
Development and psychometric evaluation of a clinical global impression for schizoaffective disorder scale.

PubMed

Allen, Michael H; Daniel, David G; Revicki, Dennis A; Canuso, Carla M; Turkoz, Ibrahim; Fu, Dong-Jing; Alphs, Larry; Ishak, K Jack; Bartko, John J; Lindenmayer, Jean-Pierre

2012-01-01

The Clinical Global Impression for Schizoaffective Disorder scale is a new rating scale adapted from the Clinical Global Impression scale for use in patients with schizoaffective disorder. The psychometric characteristics of the Clinical Global Impression for Schizoaffective Disorder are described. Content validity was assessed using an investigator questionnaire. Inter-rater reliability was determined with 12 sets of videotaped interviews rated independently by two trained individuals. Test-retest reliability was assessed using 30 randomly selected raters from clinical trials who evaluated the same videos on separate occasions two weeks apart. Convergent and divergent validity and effect size were evaluated by comparing scores between the Clinical Global Impression for Schizoaffective Disorder and the Positive and Negative Syndrome Scale, 21-item Hamilton Rating Scale for Depression, and Young Mania Rating Scale scales using pooled patient data from two clinical trials. Clinical Global Impression for Schizoaffective Disorder scores were then linked to corresponding Positive and Negative Syndrome Scale scores. Content validity was strong. Inter-rater agreement was good to excellent for most scales and subscales (intra-class correlation coefficient ≥ 0.50). Test-retest showed good reproducibility, with intraclass correlation coefficients ranging from 0.444 to 0.898. Spearman correlations between Clinical Global Impression for Schizoaffective Disorder domains and corresponding symptom scales were 0.60 or greater, and effect sizes for Clinical Global Impression for Schizoaffective Disorder overall and domain scores were similar to Positive and Negative Syndrome Scale Young Mania Rating Scale, and 21-item Hamilton Rating Scale for Depression scores. Raters anticipated that the scale might be less effective in distinguishing negative from depressive symptoms, and, in fact, the results here may reflect that clinical reality. Multiple lines of evidence support the reliability and validity of the Clinical Global Impression for Schizoaffective Disorder for studies in schizoaffective disorder.
Developing the Person-Environment Apathy Rating for persons with dementia.

PubMed

Jao, Ying-Ling; Algase, Donna L; Specht, Janet K; Williams, Kristine

2016-08-01

To develop the Person-Environment Apathy Rating (PEAR) scale that measures environmental stimulation and apathy in persons with dementia and to evaluate its psychometrics. The PEAR scale consists of the PEAR-Environment subscale and PEAR-Apathy subscales. The items were developed via literature review, field testing, expert review, and pilot testing. The construct validity and reliability were examined through video observation. The parent study enrolled 185 institutionalized residents with dementia. For this study, 96 videos were selected from 24 participants. The PEAR-Environment subscale was validated using the Ambiance Scale and the Crowding Index. The PEAR-Apathy subscale was validated using the Neuropsychiatric Inventory (NPI)-Apathy, Passivity in Dementia Scale (PDS), and NPI-Depression. The PEAR-Environment subscale and PEAR-Apathy subscales each consists of six items rated on a 1-4 scale. For validity, the Crowding Index slightly, yet significantly, correlated with the PEAR-Environment subscale total score and three of the individual scores. Ambiance Scale scores, both engaging and soothing, did not correlate with the PEAR-Environment subscale. The PEAR-Apathy highly correlated with the PDS and NPI-Apathy and moderately correlated with the NPI-Depression, suggesting good convergent validity and moderate discriminant validity. For reliability, both environment and apathy subscales demonstrated excellent internal consistency. Although facial expression and eye contact showed moderate inter-rater reliability, all other items showed good to excellent inter-rater and intra-rater reliability. This study has successfully developed the PEAR scale and established its psychometrics based on the compatible scales available. The PEAR scale is the first scale that concurrently assesses apathy and environmental stimulation, and is recommended for use in persons with dementia.
Cross-cultural adaptation and psychometric properties of the Korean Scale for Internet Addiction (K-Scale) in Japanese high school students.

PubMed

Mak, Kwok-Kei; Nam, JeeEun Karin; Kim, Dongil; Aum, Narae; Choi, Jung-Seok; Cheng, Cecilia; Ko, Huei-Chen; Watanabe, Hiroko

2017-03-01

The Korean Scale for Internet Addiction (K-Scale) was developed in Korea for assessing addictive internet behaviors. This study aims to adopt K-Scale and examine its psychometric properties in Japanese adolescents. In 2014, 589 (36.0% boys) high school students (Grade 10-12) from Japan completed a survey, including items of Japanese versions of K-Scale and Smartphone Scale for Smartphone Addiction (S-Scale). Model fit indices of the original four-factor structure, three-factor structure obtained from exploratory factor analysis, and improved two-factor structure of K-Scale were computed using confirmatory factor analysis, with internal reliability of included items reported. The convergent validity of K-Scale was tested against self-rated internet addiction, and S-Scale using multiple regression models. The results showed that a second-order two-factor 13-item structure was the most parsimonious model (NFI=0.919, NNFI=0.935, CFI=0.949, and RMSEA=0.05) with good internal reliability (Cronbach's alpha=0.87). The two factors revealed were "Disturbance of Adaptation and Life Orientation" and "Withdrawal and Tolerance". Moreover, the correlation between internet user classifications defined by K-Scale and self-rating was significant. K-Scale total score was significantly and positively associated with S-Scale total (adjusted R 2 =0.440) and subscale scores (adjusted R 2 =0.439). In conclusion, K-Scale is a valid and reliable assessment scale of internet addiction for Japanese high school students after modifications. Copyright © 2017. Published by Elsevier B.V.
Pattern analysis of total item score and item response of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative sample of US adults

PubMed Central

Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Yutaka, Ono; Furukawa, Toshiaki A.

2017-01-01

Background Several recent studies have shown that total scores on depressive symptom measures in a general population approximate an exponential pattern except for the lower end of the distribution. Furthermore, we confirmed that the exponential pattern is present for the individual item responses on the Center for Epidemiologic Studies Depression Scale (CES-D). To confirm the reproducibility of such findings, we investigated the total score distribution and item responses of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative study. Methods Data were drawn from the National Survey of Midlife Development in the United States (MIDUS), which comprises four subsamples: (1) a national random digit dialing (RDD) sample, (2) oversamples from five metropolitan areas, (3) siblings of individuals from the RDD sample, and (4) a national RDD sample of twin pairs. K6 items are scored using a 5-point scale: “none of the time,” “a little of the time,” “some of the time,” “most of the time,” and “all of the time.” The pattern of total score distribution and item responses were analyzed using graphical analysis and exponential regression model. Results The total score distributions of the four subsamples exhibited an exponential pattern with similar rate parameters. The item responses of the K6 approximated a linear pattern from “a little of the time” to “all of the time” on log-normal scales, while “none of the time” response was not related to this exponential pattern. Discussion The total score distribution and item responses of the K6 showed exponential patterns, consistent with other depressive symptom scales. PMID:28289560
Examining Parents' Ratings of Middle-School Students' Academic Self-Regulation Using Principal Axis Factoring Analysis

ERIC Educational Resources Information Center

Chen, Peggy P.; Cleary, Timothy J.; Lui, Angela M.

2015-01-01

This study examined the reliability and validity of a parent rating scale, the "Self-Regulation Strategy Inventory: Parent Rating Scale" ("SRSI-PRS"), using a sample of 451 parents of sixth- and seventh-grade middle-school students. Principal axis factoring (PAF) analysis revealed a 3-factor structure for the 23-item SRSI-PRS:…
Psychometric properties of Conversion Disorder Scale- Revised (CDS) for children.

PubMed

Ijaz, Tazvin; Nasir, Attikah; Sarfraz, Naema; Ijaz, Shirmeen

2017-05-01

To revise conversion disorder scale and to establish the psychometric properties of the revised scale. This case-control study was conducted from February to June, 2014, at the Government College University, Lahore, Pakistan, and comprised schoolchildren and children with conversion disorder. In order to generate items for revised version of conversion disorder scale, seven practising mental health professionals were consulted. A list of 42 items was finalised for expert ratings. After empirical validation, a scale of 40 items was administered on the participants and factor analysis was conducted. Of the240 participants, 120(50%) were schoolchildren (controls group) and 120(50%)were children with conversion disorder (clinical group).The results of factor analysis revealed five factors (swallowing and speech symptoms, motor symptoms, sensory symptoms, weakness and fatigue, and mixed symptoms) and retention of all 40 items of revised version of conversion disorder scale. Concurrent validity of the revised scale was found to be 0.81 which was significantly high. Similarly, discriminant validity of the scale was also high as both clinical and control groups had significant difference (p<0.001) in scores. Cronbach's alpha of scale was a=0.91 while item total correlation ranged from 0.50 to 0.80. The sensitivity and specificity analysis indicated that the revised conversion disorder scale was 76% sensitive to predicting conversion disorder while specificity showed that the scale was 73% accurate in specifying participants of the control group. The revised version of conversion disorder scale was a reliable and valid tool to be used for screening of children with conversion disorder.
Manual for the Extrapyramidal Symptom Rating Scale (ESRS).

PubMed

Chouinard, Guy; Margolese, Howard C

2005-07-15

The Extrapyramidal Symptom Rating Scale (ESRS) was developed to assess four types of drug-induced movement disorders (DIMD): Parkinsonism, akathisia, dystonia, and tardive dyskinesia (TD). Comprehensive ESRS definitions and basic instructions are given. Factor analysis provided six ESRS factors: 1) hypokinetic Parkinsonism; 2) orofacial dyskinesia; 3) trunk/limb dyskinesia; 4) akathisia; 5) tremor; and 6) tardive dystonia. Two pivotal studies found high inter-rater reliability correlations in both antipsychotic-induced movement disorders and idiopathic Parkinson disease. For inter-rater reliability and certification of raters, >or=80% of item ratings of the complete scale should be +/-1 point of expert ratings and >or=70% of ratings on individual items of each ESRS subscale should be +/-1 point of expert ratings. During a cross-scale comparison, AIMS and ESRS were found to have a 96% (359/374) agreement between TD-defined cases by DSM-IV TD criteria. Two recent international studies using the ESRS included over 3000 patients worldwide and showed an incidence of TD ranging from 10.2% (2000) to 12% (1998). ESRS specificity was investigated through two different approaches, path analyses and ANCOVA PANSS factors changes, which found that ESRS measurement of drug-induced EPS is valid and discriminative from psychiatric symptoms.
Item response theory analysis of the Lichtenberg Financial Decision Screening Scale.

PubMed

Teresi, Jeanne A; Ocepek-Welikson, Katja; Lichtenberg, Peter A

2017-01-01

The focus of these analyses was to examine the psychometric properties of the Lichtenberg Financial Decision Screening Scale (LFDSS). The purpose of the screen was to evaluate the decisional abilities and vulnerability to exploitation of older adults. Adults aged 60 and over were interviewed by social, legal, financial, or health services professionals who underwent in-person training on the administration and scoring of the scale. Professionals provided a rating of the decision-making abilities of the older adult. The analytic sample included 213 individuals with an average age of 76.9 (SD = 10.1). The majority (57%) were female. Data were analyzed using item response theory (IRT) methodology. The results supported the unidimensionality of the item set. Several IRT models were tested. Ten ordinal and binary items evidenced a slightly higher reliability estimate (0.85) than other versions and better coverage in terms of the range of reliable measurement across the continuum of financial incapacity.
Assessment of anxiety symptoms in school children: a cross-sex and ethnic examination.

PubMed

Holly, Lindsay E; Little, Michelle; Pina, Armando A; Caterino, Linda C

2015-02-01

We evaluated the cross-sex and -ethnic (Hispanic/Latino, non-Hispanic White) measurement invariance of anxiety symptoms based on the Spence Children's Anxiety Scale (SCAS) as well as SCAS anxiety symptoms' correspondence with scores on the 5-item Screen for Child Anxiety Related Emotional Disorders (SCARED) and teacher ratings of child anxiety. Based on data corresponding to 702 children (M age = 9.65, SD = 0.70; 51.9 % girls; 55 % Hispanic/Latino), findings showed some sex and ethnic variations in SCAS measured anxiety at the item and scale levels. Moreover, SCAS correspondence to the 5-item SCARED was found across ethnicity and sex. SCAS correspondence to teacher ratings was found for non-Hispanic White boys and non-Hispanic White girls, marginally in Hispanic/Latino boys, and poorly in Hispanic/Latino girls.
Mixed Messages: Ambiguous Penalty Information in Modified Restaurant Menu Items

PubMed Central

Lawless, Harry T.; Patel, Anjali A.; Lopez, Nanette V.

2016-01-01

Restaurant menu items from six national or regional brands were modified to reduce fat, saturated fat, sodium and total calories. Twenty-four items were tested with a current recipe, and two modifications (small and moderate reductions) for 72 total products. Approximately 100 consumers tested each product for acceptability as well as for desired levels of tastes/flavor, amounts of key ingredients and texture/consistency using just-about-right (JAR) scales. Penalty analysis was conducted to assess the effects of non-JAR ratings on acceptability scores. Situations arose where JAR ratings and penalty analyses could yield different recommendations, including large groups with low penalties and small groups with high penalties. Opposing groups with moderate to high penalties on opposite sides of the same JAR scale were also seen. Strategies for dealing with these observances are discussed. PMID:27833254
Development and validation of a professionalism assessment scale for medical students

PubMed Central

Klemenc-Ketis, Zalika; Vrecko, Helena

2014-01-01

Objectives To develop and validate a scale for the assess-ment of professionalism in medical students based on students' perceptions of and attitudes towards professional-ism in medicine. Methods This was a mixed methods study with under-graduate medical students. Two focus groups were carried out with 12 students, followed by a transcript analysis (grounded theory method with open coding). Then, a 3-round Delphi with 20 family medicine experts was carried out. A psychometric assessment of the scale was performed with a group of 449 students. The items of the Professional-ism Assessment Scale could be answered on a five-point Likert scale. Results After the focus groups, the first version of the PAS consisted of 56 items and after the Delphi study, 30 items remained. The final sample for quantitative study consisted of 122 students (27.2% response rate). There were 95 (77.9%) female students in the sample. The mean age of the sample was 22.1 ± 2.1 years. After the principal component analysis, we removed 8 items and produced the final version of the PAS (22 items). The Cronbach's alpha of the scale was 0.88. Factor analysis revealed three factors: empathy and humanism, professional relationships and development and responsibility. Conclusions The new Professionalism Assessment Scale proved to be valid and reliable. It can be used for the assessment of professionalism in undergraduate medical students. PMID:25382090
MM-MDS: a multidimensional scaling database with similarity ratings for 240 object categories from the Massive Memory picture database.

PubMed

Hout, Michael C; Goldinger, Stephen D; Brady, Kyle J

2014-01-01

Cognitive theories in visual attention and perception, categorization, and memory often critically rely on concepts of similarity among objects, and empirically require measures of "sameness" among their stimuli. For instance, a researcher may require similarity estimates among multiple exemplars of a target category in visual search, or targets and lures in recognition memory. Quantifying similarity, however, is challenging when everyday items are the desired stimulus set, particularly when researchers require several different pictures from the same category. In this article, we document a new multidimensional scaling database with similarity ratings for 240 categories, each containing color photographs of 16-17 exemplar objects. We collected similarity ratings using the spatial arrangement method. Reports include: the multidimensional scaling solutions for each category, up to five dimensions, stress and fit measures, coordinate locations for each stimulus, and two new classifications. For each picture, we categorized the item's prototypicality, indexed by its proximity to other items in the space. We also classified pairs of images along a continuum of similarity, by assessing the overall arrangement of each MDS space. These similarity ratings will be useful to any researcher that wishes to control the similarity of experimental stimuli according to an objective quantification of "sameness."
Development and validation of a novel patient-reported treatment satisfaction measure for hyperfunctional facial lines: facial line satisfaction questionnaire.

PubMed

Pompilus, Farrah; Burgess, Somali; Hudgens, Stacie; Banderas, Benjamin; Daniels, Selena

2015-12-01

Facial lines or wrinkles are among the most visible signs of aging, and minimally invasive cosmetic procedures are becoming increasingly popular. The aim of this study was to develop and validate the Facial Line Satisfaction Questionnaire (FLSQ) for use in adults with upper facial lines (UFL). A literature review, concept elicitation interviews (n = 33), and cognitive debriefing interviews (n = 23) of adults with UFL were conducted to develop the FLSQ. The FLSQ comprises Baseline and Follow-up versions and was field-tested with 150 subjects in a US observational study designed to assess its psychometric performance. Analyses included acceptability (item and scale distribution [i.e. missingness, floor, and ceiling effects]), reliability, and validity (including concurrent validity). In total, 69 concepts were elicited during patient interviews. Following cognitive debriefing interviews, the FLSQ-Baseline version included 11 items and the Follow-up version included 13 items. Response rates for the FLSQ were 100% and 73% at baseline and follow-up, respectively; no items had excessive missing data. Questionnaire scale scores were normally distributed. Most domain scores demonstrated good internal consistency reliability (Cronbach's α ≥ 0.70). Most items within their respective domains exhibited good convergent (item-scale correlations > 0.40) and discriminant (items had higher correlation with their hypothesized scales than other scales) validity. Concurrent validity correlation coefficients of the FLSQ domain scores with the associated concurrent measures were acceptable (range: r = 0.40-0.70). Six FLSQ items demonstrated reliability and validity as stand-alone items outside their domains. The FLSQ is a valid questionnaire for assessing treatment expectations, satisfaction, impact, and preference in adults with UFL. © 2015 The Authors. Journal of Cosmetic Dermatology Published by Wiley Periodicals, Inc.
Health- and vision-related quality of life in intellectually disabled children.

PubMed

Cui, Yu; Stapleton, Fiona; Suttle, Catherine; Bundy, Anita

2010-01-01

To investigate the psychometric properties of instruments for the assessment of self-reported functional vision performance and health-related quality of life in children with intellectual disabilities (IDs). Two instruments [Autoquestionnaire Enfant Image (AUQUEI), LV Prasad-Functional Vision Questionnaire (LVP-FVQ)] designed for the assessment of functional vision and health-related quality of life were adapted and administered to 168 school children with ID, aged 8 to 18 years. Rasch analysis was used to determine the appropriateness of the rating scales of these instruments and to identify any redundant items. Redundant items were excluded based on descriptive statistics and Rasch analysis, leaving 17 of 23 items in the revised AUQUEI and 16 of 22 in the LVP-FVQ. The AUQUEI items showed disordered thresholds on the rating scale. A modified step calibration (collapsed from four categories to three categories) resulted in ordered response thresholds for all items. The adjusted instrument produced an overall fit to the model (mean item infit = 1.06, SD = 0.32; mean item outfit = 1.11, SD = 0.35), indicating good construct validity. After Rasch analysis, the AUQUEI showed good content validity (person separation = 2.18; item reliability = 0.99; Cronbach alpha = 0.89). Increased similarity of person and item means and SDs on the logit scale after modification would indicate that the instrument was more applicable to the target population in its modified form. In contrast, the LVP-FVQ had a low person separation (1.35), suggesting that a more appropriate instrument is needed for assessment of vision-related quality of life in children with ID. The psychometric properties of two instruments were explored using Rasch analysis. By rescaling and reduction of items, the instruments were modified for use in a population of children with at least mild to moderate ID. However, an alternative instrument is needed for the assessment of vision-related quality of life in intellectually disabled children with normal vision or mild visual abnormalities.
A new scale for disaster nursing core competencies: Development and psychometric testing.

PubMed

Al Thobaity, Abdulellah; Williams, Brett; Plummer, Virginia

2016-02-01

All nurses must have core competencies in preparing for, responding to and recovering from a disaster. In the Kingdom of Saudi Arabia (KSA), as in many other countries, disaster nursing core competencies are not fully understood and lack reliable, validated tools. Thus, it is imperative to develop a scale for exploring disaster nursing core competencies, roles and barriers in the KSA. This study's objective is to develop a valid, reliable scale that identifies and explores core competencies of disaster nursing, nurses' roles in disaster management and barriers to developing disaster nursing in the KSA. This study developed a new scale testing its validity and reliability. A principal component analysis (PCA) was used to develop and test psychometric properties of the new scale. The PCA used a purposive sample of nurses from emergency departments in two hospitals in the KSA. Participants rated 93 paper-based, self-report questionnaire items from 1 to 10 on a Likert scale. PCA using Varimax rotation was conducted to explore factors emerging from responses. The study's participants were 132 nurses (66% response rate). PCA of the 93 questionnaire items revealed 49 redundant items (which were deleted) and 3 factors with eigenvalues of >1. The remaining 44 items accounted for 77.3% of the total variance. The overall Cronbach's alpha was 0.96 for all factors: 0.98 for Factor 1, 0.92 for Factor 2 and 0.86 for Factor 3. This study provided a validated, reliable scale for exploring nurses' core competencies, nurses' roles and barriers to developing disaster nursing in the KSA. The new scale has many implications, such as for improving education, planning and curricula. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
Social Desirability Ratings From Males and Females: A Sexual Item Pool

ERIC Educational Resources Information Center

Galbraith, Gary G.; And Others

1974-01-01

Examines the relation between social desirability judgements (social de sirability scale values) of males and females in the area of sexual behavior. The findings raise some questions about the use of obvious-direct items with pathological import in sex behavior questionnaires. (Author/PC)
Self-Rated Mental Health: Screening for Depression and Posttraumatic Stress Disorder Among Women Exposed to Perinatal Intimate Partner Violence.

PubMed

Kastello, Jennifer C; Jacobsen, Kathryn H; Gaffney, Kathleen F; Kodadek, Marie P; Bullock, Linda C; Sharps, Phyllis W

2015-11-01

The purpose of the current study was to evaluate the validity of a single-item, self-rated mental health (SRMH) measure in the identification of women at risk for depression and posttraumatic stress disorder (PTSD). Baseline data of 239 low-income women participating in an intimate partner violence (IPV) intervention study were analyzed. PTSD was measured with the Davidson Trauma Scale. Risk for depression was determined using the Edinburgh Postnatal Depression Scale. SRMH was assessed with a single item asking participants to rate their mental health at the time of the baseline interview. Single-item measures can be an efficient way to increase the proportion of patients screened for mental health disorders. Although SRMH is not a strong indicator of PTSD, it may be useful in identifying pregnant women who are at increased risk for depression and need further comprehensive assessment in the clinical setting. Future research examining the use of SRMH among high-risk populations is needed. Copyright 2015, SLACK Incorporated.
The Evaluation of Child Care Centers and the "Infant/Toddler Environment Rating Scale": An Environmental Critique.

ERIC Educational Resources Information Center

Moore, Gary T.

This paper questions the physical environmental adequacy of the Infant/Toddler Environment Rating Scale (ITERS) developed by Thelma Harms, Debby Cryer, and Richard Clifford at the University of North Carolina, Chapel Hill. ITERS is a 35-item scale designed to assess the quality of center-based infant and toddler care, and one of a family of child…

An Investigation of the Validity and Reliability of the Adapted Mathematics Anxiety Rating Scale-Short Version (MARS-SV) among Turkish Students

ERIC Educational Resources Information Center

Baloglu, Mustafa

2010-01-01

This study adapted the Mathematics Anxiety Rating Scale-Short Version (MARS-SV) into Turkish and investigated the validity and reliability of the adapted instrument. Twenty-five bilingual experts agreed on the language validity, and 49 Turkish language experts agreed on the conformity and understandability of the scale's items. Thirty-two subject…
Psychometric Properties of the Teacher-Reported Motor Skills Rating Scale

ERIC Educational Resources Information Center

Kim, Helyn; Murrah, William M.; Cameron, Claire E.; Brock, Laura L.; Cottone, Elizabeth A.; Grissmer, David

2015-01-01

Children's early motor competence is associated with social development and academic achievement. However, few studies have examined teacher reports of children's motor skills. This study evaluated the psychometric properties of the Motor Skills Rating Scale (MSRS), a 19-item measure of children's teacher-reported motor skills in the classroom.…
Rasch analyses of the Activities-specific Balance Confidence Scale with individuals 50 years and older with lower limb amputations

PubMed Central

Sakakibara, Brodie M.; Miller, William C.; Backman, Catherine L.

2012-01-01

Objective To explore shortened response formats for use with the Activities-specific Balance Confidence scale and then: 1) evaluate the unidimensionality of the scale; 2) evaluate the item difficulty; 3) evaluate the scale for redundancy and content gaps; and 4) evaluate the item standard error of measurement (SEM) and internal consistency reliability among aging individuals (≥50 years) with a lower-limb amputation living in the community. Design Secondary analysis of cross-sectional survey and chart review data. Setting Out-patient amputee clinics, Ontario, Canada. Participants Four hundred forty eight community living adults, at least 50 years old (mean = 68 years), who have used a prosthesis for at least 6 months for a major unilateral lower limb amputation. Three hundred twenty five (72.5%) were men. Intervention N/a Main Outcome Measure(s) Activities-specific Balance Confidence Scale. Results A 5-option response format outperformed 4- and 6-option formats. Factor analyses confirmed a unidimensional scale. The distance between response options is not the same for all items on the scale, evident by the Partial Credit Model (PCM) having a better fit to the data than the Rating Scale Model. Two items, however, did not fit the PCM within statistical reason. Revising the wording of the two items may resolve the misfit, and improve the construct validity and lower the SEM. Overall, the difficulty of the scale’s items is appropriate for use with aging individuals with lower-limb amputation, and is most reliable (Cronbach ∝ = 0.94) for use with individuals with moderately low balance confidence levels. Conclusions The ABC-scale with a simplified 5-option response format is a valid and reliable measure of balance confidence for use with individuals aging with a lower limb amputation. PMID:21704978
Identifying shortcomings in the measurement of service quality.

PubMed

Fogarty, G; Catts, R; Forlin, C

2000-01-01

SERVPEFR, the performance component of the Service Quality Scale (SERVQUAL), has been shown to measure five underlying dimensions corresponding to Tangibles, Reliability, Responsiveness, Assurance, and Empathy (Parasuraman, Zeithaml, & Berry, 1988). This paper describes three separate studies employing SERVPERF in an Australian context. In the first of these studies (N = 113), a shortened 15-item version of the SERVPERF scale (SERVPERF-R) was found to be suitable for use in an Australian small business setting. A five-factor structure was identifiable but the factors were highly correlated, suggesting that they were not clearly distinct. The tendency for marked negative skewness observed by other researchers was also noted here. A follow-up study involving three other small businesses (N = 212) used Rasch analysis to test assumptions about the spread of items on the underlying continuum. These analyses indicated that there is an even, though narrow, spread of items across the continuum. The Rasch analysis suggested that the items in both SERVPERF and SERVPERF-R are too easy to rate highly and that more "difficult" items need to be added to the scale. The third study (N = 122) was conducted using a version of SERVPERF-R that included seven new items intended to extend the range of the scale. The new items, however, did not achieve this desirable outcome. The implications for service quality assessment are discussed.
Unified Parkinson's Disease Rating Scale-Motor Exam: inter-rater reliability of advanced practice nurse and neurologist assessments.

PubMed

Palmer, Janice L; Coats, Mary A; Roe, Catherine M; Hanko, Shelly M; Xiong, Chengjie; Morris, John C

2010-06-01

This paper is a report of a study to establish the inter-rater reliability of advanced practice nurse and neurologist neurological assessments which included ratings with the Unified Parkinson's Disease Rating Scale-Motor Exam. Around the world, advanced practice nurses are performing tasks once completed only by physicians. To promote consumer and provider confidence, it is important to establish that nurse and physician ratings using assessment tools are similar. In addition in research settings, when different raters are used, establishment of inter-rater reliability for study assessments is needed. Advanced practice nurses and neurologists independently recorded findings on neurological examinations of 46 participants in a study conducted between August 2007 and January 2008. An intraclass correlation coefficient was calculated to estimate overall agreement between the nurse and neurologist ratings. Agreement for individual items measured on a dichotomous scale was assessed by calculating Cohen's kappa. There was substantial agreement between advanced practice nurses and neurologists on the mean Unified Parkinson's Disease Rating Scale-Motor Exam ratings (intraclass correlation coefficient = 0.65) and the U.S. National Alzheimer's Coordinating Center Uniform Data Set neurological examination ratings of unremarkable findings (kappa = 0.74) and of gait disorder (kappa = 0.73). Moderate agreement (kappa = 0.53) was reached for the rating of whether all Unified Parkinson's Disease Rating Scale-Motor Exam items were normal. These findings are consistent with studies of the inter-rater agreement of the Unified Parkinson's Disease Rating Scale-Motor Exam and support the conduct of neurological assessments by advanced practice nurses.
Uncertainty in BRCA1 cancer susceptibility testing.

PubMed

Baty, Bonnie J; Dudley, William N; Musters, Adrian; Kinney, Anita Y

2006-11-15

This study investigated uncertainty in individuals undergoing genetic counseling/testing for breast/ovarian cancer susceptibility. Sixty-three individuals from a single kindred with a known BRCA1 mutation rated uncertainty about 12 items on a five-point Likert scale before and 1 month after genetic counseling/testing. Factor analysis identified a five-item total uncertainty scale that was sensitive to changes before and after testing. The items in the scale were related to uncertainty about obtaining health care, positive changes after testing, and coping well with results. The majority of participants (76%) rated reducing uncertainty as an important reason for genetic testing. The importance of reducing uncertainty was stable across time and unrelated to anxiety or demographics. Yet, at baseline, total uncertainty was low and decreased after genetic counseling/testing (P = 0.004). Analysis of individual items showed that after genetic counseling/testing, there was less uncertainty about the participant detecting cancer early (P = 0.005) and coping well with their result (P < 0.001). Our findings support the importance to clients of genetic counseling/testing as a means of reducing uncertainty. Testing may help clients to reduce the uncertainty about items they can control, and it may be important to differentiate the sources of uncertainty that are more or less controllable. Genetic counselors can help clients by providing anticipatory guidance about the role of uncertainty in genetic testing. (c) 2006 Wiley-Liss, Inc.
An intervention to improve the reliability of manuscript reviews for the Journal of the American Academy of Child and Adolescent Psychiatry.

PubMed

Strayhorn, J; McDermott, J F; Tanguay, P

1993-06-01

The effects of methods used to improve the interrater reliability of reviewers' ratings of manuscripts submitted to the Journal of the American Academy of Child and Adolescent Psychiatry were studied. Reviewers' ratings of consecutive manuscripts submitted over approximately 1 year were first analyzed; 296 pairs of ratings were studied. Intraclass correlations and confidence intervals for the correlations were computed for the two main ratings by which reviewers quantified the quality of the article: a 1-10 overall quality rating and a recommendation for acceptance or rejection with four possibilities along that continuum. Modifications were then introduced, including a multi-item rating scale and two training manuals to accompany it. Over the next year, 272 more articles were rated, and reliabilities were computed for the new scale and for the scales previously used. The intraclass correlation of the most reliable rating before the intervention was 0.27; the reliability of the new rating procedure was 0.43. The difference between these two was significant. The reliability for the new rating scale was in the fair to good range, and it became even better when the ratings of the two reviewers were averaged and the reliability stepped up by the Spearman-Brown formula. The new rating scale had excellent internal consistency and correlated highly with other quality ratings. The data confirm that the reliability of ratings of scientific articles may be improved by increasing the number of rating scale points, eliciting ratings of separate, concrete items rather than a global judgment, using training manuals, and averaging the scores of multiple reviewers.
Handling missing values in the MDS-UPDRS.

PubMed

Goetz, Christopher G; Luo, Sheng; Wang, Lu; Tilley, Barbara C; LaPelle, Nancy R; Stebbins, Glenn T

2015-10-01

This study was undertaken to define the number of missing values permissible to render valid total scores for each Movement Disorder Society Unified Parkinson's Disease Rating Scale (MDS-UPDRS) part. To handle missing values, imputation strategies serve as guidelines to reject an incomplete rating or create a surrogate score. We tested a rigorous, scale-specific, data-based approach to handling missing values for the MDS-UPDRS. From two large MDS-UPDRS datasets, we sequentially deleted item scores, either consistently (same items) or randomly (different items) across all subjects. Lin's Concordance Correlation Coefficient (CCC) compared scores calculated without missing values with prorated scores based on sequentially increasing missing values. The maximal number of missing values retaining a CCC greater than 0.95 determined the threshold for rendering a valid prorated score. A second confirmatory sample was selected from the MDS-UPDRS international translation program. To provide valid part scores applicable across all Hoehn and Yahr (H&Y) stages when the same items are consistently missing, one missing item from Part I, one from Part II, three from Part III, but none from Part IV can be allowed. To provide valid part scores applicable across all H&Y stages when random item entries are missing, one missing item from Part I, two from Part II, seven from Part III, but none from Part IV can be allowed. All cutoff values were confirmed in the validation sample. These analyses are useful for constructing valid surrogate part scores for MDS-UPDRS when missing items fall within the identified threshold and give scientific justification for rejecting partially completed ratings that fall below the threshold. © 2015 International Parkinson and Movement Disorder Society.
Use of Patient and Observer Scar Assessment Scale for evaluation of facial scars treated with self-drying silicone gel.

PubMed

Bianchi, Francesca A; Roccia, Fabio; Fiorini, Paola; Berrone, Sid

2010-05-01

In this prospective study, we used the Patient and Observer Scar Assessment Scale (POSAS) to evaluate the outcome of the healing process of posttraumatic and surgical facial scars that were treated with self-drying silicone gel, by both the patient and the observer. In our division, the application of base cream and massage represents the standard management of facial scars after suture removal. In the current study, 15 patients (7 men and 8 women) with facial scars were treated with self-drying silicone gel that was applied without massage, and 15 patients (8 men and 7 women) were treated with base cream and massage. Both groups underwent a clinical evaluation of facial scars by POSAS at the time of suture removal (T0) and after 2 months of treatment (T1). The patient rated scar pain, itch, color, stiffness, thickness, and surface (Patient Scale), and the observer rated scar vascularity, pigmentation, thickness, relief, pliability, and surface area (Observer Scale [OS]). The Patient Scale reported the greatest improvement in the items color, stiffness, and thickness. Itch was the only item that worsened in the group self-drying silicone gel. The OS primarily reported an improvement in the items vascularization, pigmentation, and pliability. The only item in the OS that underwent no change from T0 to T1 was surface area. The POSAS revealed satisfactory healing of posttraumatic and surgical facial scars that were treated with self-drying silicone gel.
Reporting of suicide in the Australian media.

PubMed

Pirkis, Jane; Francis, Catherine; Blood, Richard Warwick; Burgess, Philip; Morley, Belinda; Stewart, Andrew; Putnis, Peter

2002-04-01

The media monitoring project aimed to establish a baseline picture of the extent, nature and quality of reporting of suicide by the Australian media, with a view to informing future strategies intended to optimize reporting of suicide. Newspaper, television and radio items on suicide were retrieved over 12 months. Identifying and descriptive information were extracted for each item. Approximately 10% of items were rated for quality, using a rating scale based on criteria from Achieving the Balance, a kit designed to promote awareness among media professionals of issues relating to suicide. The scale ranged from 0 (poor quality) to 100 (good quality). Reporting of suicide was extensive (with 4813 items retrieved). The nature of reporting was variable. Items tended to be about completed suicide (rather than attempted suicide or suicidal ideation), and most commonly involved content related to an individual's experiences, policy/programme initiatives and/or suicide statistics, although there were differences across media types. Items showed variability across dimensions of quality. The majority of suicide items did not have examples of inappropriate language, were not inappropriately located, did not use the word 'suicide' in the headline, and did not use explicit photographs/diagrams or footage. However, around half of the suicide items provided a detailed discussion of the method of self-harm and portrayed suicide as merely a social phenomenon. Where items concerned the suicide of a celebrity, reference was commonly made to that person's celebrity status. Most items failed to provide information on help services. The median total quality score was 57.1%. The reporting of suicide is extensive across all media types, and varies in nature and quality. In general, good items outnumber poorer items. However, there are still opportunities for improving media reporting of suicide.
Program director opinions of core competencies in hand surgery training: analysis of differences between plastic and orthopedic surgery accredited programs.

PubMed

Sears, Erika Davis; Larson, Bradley P; Chung, Kevin C

2013-03-01

The authors' aim was to conduct a national survey of hand surgery fellowship program directors to determine differences of opinions of essential components of hand surgery training between program directors from plastic and orthopedic surgery programs. The authors performed a Web-based survey of 74 program directors from all Accreditation Council for Graduate Medical Education-accredited hand surgery fellowship programs to determine components that are essential for hand surgery training. The survey included assessment of nine general areas of practice, 97 knowledge topics, and 172 procedures. Twenty-seven scales of related survey items were created to determine differences between specialty groups based on clinical themes. An 84 percent response rate was achieved, including 49 orthopedic and 12 plastic surgery program directors. There were significant differences in mean responses between the specialty groups in 11 of 27 scales. Only one scale, forearm fractures, contained items with a significantly stronger preference for essential rating among orthopedic surgeons. The other 10 scales contained items with a significantly higher preference for essential rating among plastic surgeons, most of which related to soft-tissue injury and reconstruction. The burn scale had the greatest discrepancy in opinion of essential ratings between the groups, followed by pedicled and free tissue transfer, and amputation and fingertip injuries. Despite being united under the subspecialty of hand surgery, program directors tend to emphasize clinical areas that are stressed in their respective primary disciplines. These differences promote the advantage of programs that provide exposure to both plastic surgery-trained and orthopedic surgery-trained hand surgeons.
Psychometric assessment of the Behavior and Attitudes Questionnaire for Healthy Habits: measuring parents' views on food and physical activity.

PubMed

Henry, Beverly W; Smith, Thomas J; Ahmad, Saadia

2014-05-01

To assess parents' perspectives of their home environments to establish the validity of scores from the Behavior and Attitudes Questionnaire for Healthy Habits (BAQ-HH). In the present descriptive study, we surveyed a cross-sectional sample of parents of pre-school children. Questionnaire items developed in an iterative process with community-based programming addressed parents' knowledge/awareness, attitudes/concerns and behaviours about healthy foods and physical activity habits with 6-point rating scales. Exploratory and confirmatory factor analyses were used to psychometrically evaluate scores from the scales. English and Spanish versions of the BAQ-HH were administered at parent-teacher conferences for pre-school children at ten Head Start centres across a five-county agency in autumn 2010. From 672 families with pre-school children, 532 parents provided responses to the BAQ-HH (79 % response rate). The majority was female (83 %), Hispanic (66 %) or white (16 %), and ages ranged from 20 to 39 years (85 %). Exploratory and confirmatory analyses revealed a knowledge scale (seven items), an attitude scale (four items) and three behaviour subscales (three items each). Correlations were identified between parents' perceptions of home activities and reports of children's habits. Differences were identified by gender and ethnicity groupings. As a first step in psychometric testing, the dimensionality of each of the three scales (Knowledge, Attitudes and Behaviours) was identified and scale scores were related to other indicators of child behaviours and parents' demographic characteristics. This questionnaire offers a method to measure parents' views to inform planning and monitoring of obesity-prevention education programmes.
Adjacent-Categories Mokken Models for Rater-Mediated Assessments

PubMed Central

Wind, Stefanie A.

2016-01-01

Molenaar extended Mokken’s original probabilistic-nonparametric scaling models for use with polytomous data. These polytomous extensions of Mokken’s original scaling procedure have facilitated the use of Mokken scale analysis as an approach to exploring fundamental measurement properties across a variety of domains in which polytomous ratings are used, including rater-mediated educational assessments. Because their underlying item step response functions (i.e., category response functions) are defined using cumulative probabilities, polytomous Mokken models can be classified as cumulative models based on the classifications of polytomous item response theory models proposed by several scholars. In order to permit a closer conceptual alignment with educational performance assessments, this study presents an adjacent-categories variation on the polytomous monotone homogeneity and double monotonicity models. Data from a large-scale rater-mediated writing assessment are used to illustrate the adjacent-categories approach, and results are compared with the original formulations. Major findings suggest that the adjacent-categories models provide additional diagnostic information related to individual raters’ use of rating scale categories that is not observed under the original formulation. Implications are discussed in terms of methods for evaluating rating quality. PMID:29795916
[Development of a New Scale for Gauging Smartphone Dependence].

PubMed

Toda, Masahiro; Nishio, Nobuhiro; Takeshita, Tatsuya

2015-01-01

We designed a scale to gauge smartphone dependence and assessed its reliability and validity. A prototype self-rating smartphone-dependence scale was tested on 133 medical students who use smartphones more frequently than other devices to access web pages. Each response was scored on a Likert scale (0, 1, 2, 3), with higher scores indicating greater dependence. To select items for the final scale, exploratory factor analysis was conducted. On the basis of factor analysis results, we designed the Wakayama Smartphone-Dependence Scale (WSDS) comprising 21 items with 3 subscales: immersion in Internet communication; using a smartphone for extended periods of time and neglecting social obligations and other tasks; using a smartphone while doing something else and neglect of etiquette. Our analysis confirmed the validity of the different elements of the WSDS: the reliability coefficient (Cronbach's alpha) values of all subscales and total WSDS were from 0.79 to 0.83 and 0.88, respectively. These findings suggest that the WSDS is a useful tool for rating smartphone dependence.
Validation of the Peer Social Maturity Scale for Assessing Children's Social Skills

ERIC Educational Resources Information Center

Fink, Elian; de Rosnay, Marc; Peterson, Candida; Slaughter, Virginia

2013-01-01

We evaluated the utility of a brief, seven-item, teacher-rated Peer Social Maturity Scale (PSMAT). In Study 1, teachers of 138 Australian children (ranging from 5 to 8?years and 5?months old) in kindergarten and Grades 1 and 2 rated their pupils' social maturity using the PSMAT and their classroom social skills via the Social Skills Rating System…
Accurate and scalable social recommendation using mixed-membership stochastic block models.

PubMed

Godoy-Lorite, Antonia; Guimerà, Roger; Moore, Cristopher; Sales-Pardo, Marta

2016-12-13

With increasing amounts of information available, modeling and predicting user preferences-for books or articles, for example-are becoming more important. We present a collaborative filtering model, with an associated scalable algorithm, that makes accurate predictions of users' ratings. Like previous approaches, we assume that there are groups of users and of items and that the rating a user gives an item is determined by their respective group memberships. However, we allow each user and each item to belong simultaneously to mixtures of different groups and, unlike many popular approaches such as matrix factorization, we do not assume that users in each group prefer a single group of items. In particular, we do not assume that ratings depend linearly on a measure of similarity, but allow probability distributions of ratings to depend freely on the user's and item's groups. The resulting overlapping groups and predicted ratings can be inferred with an expectation-maximization algorithm whose running time scales linearly with the number of observed ratings. Our approach enables us to predict user preferences in large datasets and is considerably more accurate than the current algorithms for such large datasets.
Older adults' drug benefit beliefs: construct definition and measure development.

PubMed

Cline, Richard R; Gupta, Kiran; Singh, Reshmi L

2008-03-01

The Medicare Prescription Drug, Improvement and Modernization Act of 2003 provides coverage of outpatient prescription drugs for Medicare beneficiaries. Although much has been learned since the program's implementation, a context within which this information can be understood is lacking. The purpose of this study was to develop a reliable and valid multi-item instrument measuring beliefs about Medicare prescription drug benefits. Survey items were generated using focus group transcripts, other surveys on the Medicare Part "D" program, and past studies of choice and satisfaction in drug insurance programs. Using data from the survey pilot test, item and reliability analyses were used to reduce and refine an initial pool of items. Data then were collected from a cross-sectional, mail survey of older adults living in Minnesota. Data were analyzed using exploratory factor analysis. Summated rating scales then were constructed and assessed further using reliability analyses. Construct validity of summated scales was examined by comparing scale scores across response categories of survey items that collected information on general political attitudes, perceptions of the Medicare Part "D" program, health status, and health care utilization and demographics. The adjusted response rate for the main survey was 55.98% (744/1329). Iterative factor analysis produced 2 interpretable scales. The first, termed "access/equity" (13 items, Cronbach's alpha=0.89) measures beliefs that a Medicare drug benefit should both provide affordable prescription drugs for beneficiaries and do this in a manner that is equitable for all participants. The second, termed "comprehensibility" (6 items, Cronbach's alpha=0.80) assesses beliefs that regulations governing a Medicare drug benefit should be easily understood. Discriminant validity tests suggest that these measures behave in a manner consistent with related research in these areas. Measures of 2 facets of older adults' drug benefit beliefs were developed using a multiple step procedure. Future research could focus on developing a better understanding of other facets of these beliefs and sound methods of measurement.
Specialty Training's Organizational Readiness for curriculum Change (STORC): development of a questionnaire in a Delphi study.

PubMed

Bank, Lindsay; Jippes, Mariëlle; van Luijk, Scheltus; den Rooyen, Corry; Scherpbier, Albert; Scheele, Fedde

2015-08-05

In postgraduate medical education (PGME), programs have been restructured according to competency-based frameworks. The scale and implications of these adjustments justify a comprehensive implementation plan. Organizational Readiness for Change (ORC) is seen as a critical precursor for a successful implementation of change initiatives. Though, ORC in health care settings is mostly assessed in small scale settings and in relation to new policies and practices rather than educational change. Therefore our aim with this work was to develop an instrument to asses Specialty Training's Organizational Readiness for curriculum Change (STORC). A Delphi procedure was conducted to examine the applicability of a preliminary questionnaire in PGME, which was based on existing instruments designed for business and health care organizations. The 41 panellists (19 trainees and 22 supervisors from 6 specialties) from four different countries who were confronted with an apparent curriculum change, or would be in the near future, were asked to rate the relevance of a 89-item web-based questionnaire with regard to changes in specialty training on a 5-point Likert scale. Furthermore, they were invited to make qualitative comments on the items. In two rounds the 89-item preliminary questionnaire was reduced to 44 items. Items were either removed, kept, adapted or added based on individual item scores and qualitative comments. In the absence of a gold standard, this Delphi procedure was considered complete when the overall questionnaire rating exceeded 4.0 (scale 0-5). The overall item score reached 4.1 in the second round, meeting our criteria for completion of this Delphi procedure. This Delphi study describes the initial validating step in the development of an instrument to asses Specialty Training's Organisational Readiness for curriculum Change (STORC). Since ORC is measured on various subscales and presented as such, its strength lies in analysing these subscales. The latter makes it possible for educational leaders to identify and anticipate on hurdles in the implementation process and subsequently optimize efforts for successful curriculum change.
Measuring Psychobiosocial States in Sport: Initial Validation of a Trait Measure

PubMed Central

Bertollo, Maurizio; Ruiz, Montse C.; Bortoli, Laura

2016-01-01

We examined the item characteristics, the factor structure, and the concurrent validity of a trait measure of psychobiosocial states. In Study 1, Italian athletes (N = 342, 228 men, 114 women, Mage = 23.93, SD = 6.64) rated the intensity, the frequency, and the perceived impact dimensions of a psychobiosocial states scale, trait version (PBS-ST), which is composed of 20 items (10 functional and 10 dysfunctional) referring to how they usually felt before an important competition. In Study 2, the scale was cross validated in an independent sample (N = 251, 181 men, 70 women, Mage = 24.35, SD = 7.25). The concurrent validity of the PBS-ST scale scores were also examined in comparison with two sport-specific emotion-related measures and a general measure of affect. Exploratory structural equation modeling and confirmatory factor analysis of the data of Study 1 showed that a 2-factor, 15-item solution of the PBS-ST scale (8 functional items and 7 dysfunctional items) reached satisfactory fit indices for the three dimensions (i.e., intensity, frequency, and perceived impact). Results of Study 2 provided evidence of substantial measurement and structural invariance of all dimensions across samples. The low association of the PBS-ST scale with other measures suggests that the scale taps unique constructs. Findings of the two studies offer initial validity evidence for a sport-specific tool to measure psychobiosocial states. PMID:27907111
The Brief Psychiatric Rating Scale (version 4.0) factorial structure and its sensitivity in the treatment of outpatients with unipolar depression.

PubMed

Zanello, Adriano; Berthoud, Laurent; Ventura, Joseph; Merlo, Marco C G

2013-12-15

The 24-item Brief Psychiatric Rating Scale (BPRS, version 4.0) enables the rater to measure psychopathology severity. Still, little is known about the BPRS's reliability and validity outside of the psychosis spectrum. The aim of this study was to examine the factorial structure and sensitivity to change of the BPRS in patients with unipolar depression. Two hundred and forty outpatients with unipolar depression were administered the 24-item BPRS. Assessments were conducted at intake and at post-treatment in a Crisis Intervention Centre. An exploratory factor analysis of the 24-item BPRS produced a six-factor solution labelled "Mood disturbance", "Reality distortion", "Activation", "Apathy", "Disorganization", and "Somatization". The reduction of the total BPRS score and dimensional scores, except for "Activation", indicates that the 24-item BPRS is sensitive to change as shown in patients that appeared to have benefited from crisis treatment. The findings suggest that the 24-item BPRS could be a useful instrument to measure symptom severity and change in symptom status in outpatients presenting with unipolar depression. © 2013 Elsevier Ireland Ltd. All rights reserved.

Factor structure and clinical correlates of the 61-item Wender Utah Rating Scale (WURS).

PubMed

Calamia, Matthew; Hill, Benjamin D; Musso, Mandi W; Pella, Russell D; Gouvier, Wm Drew

2018-02-09

The objective of this study was to assess the factor structure and clinical correlates of a 61-item version of the Wender Utah Rating Scale (WURS), a self-report retrospective measure of childhood problems, experiences, and behavior used in ADHD assessment. Given the currently mostly widely used form of the WURS was derived via a criterion-keyed approach, the study aimed to use latent variable modeling of the 61-item WURS to potentially identify more and more homogeneous set of items reflecting current conceptualizations of ADHD symptoms. Exploratory structural equation modeling was used to generate factor scores which were then correlated with neuropsychological measures of intelligence and executive attention as well as a broad measure of personality and emotional functioning. Support for a modified five-factor model was found: ADHD, disruptive mood and behavior, negative affectivity, social confidence, and academic problems. The ADHD factor differed somewhat from the traditional 25-item WURS short form largely through weaker associations with several measures of personality and psychopathology. This study identified a factor more aligned with DSM-5 conceptualization of ADHD as well as measures of other types of childhood characteristics and symptoms which may prove useful for both research and clinical practice.
An Independent Investigation into the Psychometric Properties of the Adult Scale of Hostility and Aggression (A-SHARP)

ERIC Educational Resources Information Center

Rojahn, Johannes; Rick-Betancourt, Brittney; Barnard-Brak, Lucy; Moore, Linda

2017-01-01

Background: The Adult Scale of Hostility and Aggression (A-SHARP) rating scale assesses the frequency/severity (problem scale) and the reactive-proactive motivation (provocation scale) of aggressive behaviors in adults with intellectual disabilities (ID). Items are assigned to five subscales (Verbal Aggression, Physical Aggression, Hostile Affect,…
Taking the Test Taker's Perspective: Response Process and Test Motivation in Multidimensional Forced-Choice Versus Rating Scale Instruments.

PubMed

Sass, Rachelle; Frick, Susanne; Reips, Ulf-Dietrich; Wetzel, Eunike

2018-03-01

The multidimensional forced-choice (MFC) format has been proposed as an alternative to the rating scale (RS) response format. However, it is unclear how changing the response format may affect the response process and test motivation of participants. In Study 1, we investigated the MFC response process using the think-aloud technique. In Study 2, we compared test motivation between the RS format and different versions of the MFC format (presenting 2, 3, 4, and 5 items simultaneously). The response process to MFC item blocks was similar to the RS response process but involved an additional step of weighing the items within a block against each other. The RS and MFC response format groups did not differ in their test motivation. Thus, from the test taker's perspective, the MFC format is somewhat more demanding to respond to, but this does not appear to decrease test motivation.
Development of a questionnaire for assessing the childbirth experience (QACE).

PubMed

Carquillat, Pierre; Vendittelli, Françoise; Perneger, Thomas; Guittier, Marie-Julia

2017-08-30

Due to its potential impact on women's psychological health, assessing perceptions of their childbirth experience is important. The aim of this study was to develop a multidimensional self-reporting questionnaire to evaluate the childbirth experience. Factors influencing the childbirth experience were identified from a literature review and the results of a previous qualitative study. A total of 25 items were combined from existing instruments or were created de novo. A draft version was pilot tested for face validity with 30 women and submitted for evaluation of its construct validity to 477 primiparous women at one-month post-partum. The recruitment took place in two obstetric clinics from Swiss and French university hospitals. To evaluate the content validity, we compared item responses to general childbirth experience assessments on a numeric, 0 to 10 rating scale. We dichotomized two group assessment scores: "0 to 7" and "8 to 10". We performed an exploratory factor analysis to identify underlying dimensions. In total, 291 women completed the questionnaire (response rate = 61%). The responses to 22 items were statistically significant between the 0 to 7 and 8 to 10 groups for the general childbirth experience assessments. An exploratory factor analysis yielded four sub-scales, which were labelled "relationship with staff" (4 items), "emotional status" (3 items), "first moments with the new born," (3 items) and "feelings at one month postpartum" (3 items). All 4 scales had satisfactory internal consistency levels (alpha coefficients from 0.70 to 0.85). The full 25-item version can be used to analyse each item by itself, and the short 4-dimension version can be scored to summarize the general assessment of the childbirth experience. The Questionnaire for Assessing the Childbirth Experience (QACE) could be useful as a screening instrument to identify women with negative childbirth experiences. It can be used as both a research instrument in its short version and a questionnaire for use in clinical practice in its full version.
Informed choice: understanding knowledge in the context of screening uptake.

PubMed

Michie, Susan; Dormandy, Elizabeth; Marteau, Theresa M

2003-07-01

This study evaluates a scale measuring knowledge about a screening test and investigates the association between knowledge, uptake and attitudes towards screening. One thousand four hundred ninety-nine pregnant women completed the knowledge scale of the multidimensional measure of informed choice (MMIC). Three hundred forty-five of these women and 152 professionals providing antenatal care also rated the importance of the knowledge items. Item characteristic curves show that, with one exception, the knowledge items reflect a spread of difficulty and are able to discriminate between people. All items were seen as essential or helpful by both women and health professionals, with two items seen as particularly important and one as unimportant. There were some differences between health professionals, women with low risk results and women with high risk results. Knowledge was not associated with uptake, attitude, or the extent to which uptake was consistent with women's attitudes towards undergoing the test.
Impressions of functional food consumers.

PubMed

Saher, Marieke; Arvola, Anne; Lindeman, Marjaana; Lähteenmäki, Liisa

2004-02-01

Functional foods provide a new way of expressing healthiness in food choices. The objective of this study was to apply an indirect measure to explore what kind of impressions people form of users of functional foods. Respondents (n=350) received one of eight versions of a shopping list and rated the buyer of the foods on 66 bipolar attributes on 7-point scales. The shopping lists had either healthy or neutral background items, conventional or functional target items and the buyer was described either as a 40-year-old woman or man. The attribute ratings revealed three factors: disciplined, innovative and gentle. Buyers with healthy background items were perceived as more disciplined than those having neutral items on the list, users of functional foods were rated as more disciplined than users of conventional target items only when the background list consisted of neutral items. Buyers of functional foods were regarded as more innovative and less gentle, but gender affected the ratings on gentle dimension. The impressions of functional food users clearly differ from those formed of users of conventional foods with a healthy image. The shopping list method performed well as an indirect method, but further studies are required to test its feasibility in measuring other food-related impressions.
Assessing the Quality of Problems in Problem-Based Learning

ERIC Educational Resources Information Center

Sockalingam, Nachamma; Rotgans, Jerome; Schmidt, Henk

2012-01-01

This study evaluated the construct validity and reliability of a newly devised 32-item problem quality rating scale intended to measure the quality of problems in problem-based learning. The rating scale measured the following five characteristics of problems: the extent to which the problem (1) leads to learning objectives, (2) is familiar, (3)…
Estimating Skin Cancer Risk: Evaluating Mobile Computer-Adaptive Testing.

PubMed

Djaja, Ngadiman; Janda, Monika; Olsen, Catherine M; Whiteman, David C; Chien, Tsair-Wei

2016-01-22

Response burden is a major detriment to questionnaire completion rates. Computer adaptive testing may offer advantages over non-adaptive testing, including reduction of numbers of items required for precise measurement. Our aim was to compare the efficiency of non-adaptive (NAT) and computer adaptive testing (CAT) facilitated by Partial Credit Model (PCM)-derived calibration to estimate skin cancer risk. We used a random sample from a population-based Australian cohort study of skin cancer risk (N=43,794). All 30 items of the skin cancer risk scale were calibrated with the Rasch PCM. A total of 1000 cases generated following a normal distribution (mean [SD] 0 [1]) were simulated using three Rasch models with three fixed-item (dichotomous, rating scale, and partial credit) scenarios, respectively. We calculated the comparative efficiency and precision of CAT and NAT (shortening of questionnaire length and the count difference number ratio less than 5% using independent t tests). We found that use of CAT led to smaller person standard error of the estimated measure than NAT, with substantially higher efficiency but no loss of precision, reducing response burden by 48%, 66%, and 66% for dichotomous, Rating Scale Model, and PCM models, respectively. CAT-based administrations of the skin cancer risk scale could substantially reduce participant burden without compromising measurement precision. A mobile computer adaptive test was developed to help people efficiently assess their skin cancer risk.
“Up Means Good”

PubMed Central

Tourangeau, Roger

2013-01-01

This paper presents results from six experiments that examine the effect of the position of an item on the screen on the evaluative ratings it receives. The experiments are based on the idea that respondents expect “good” things—those they view positively—to be higher up on the screen than “bad” things. The experiments use items on different topics (Congress and HMOs, a variety of foods, and six physician specialties) and different methods for varying their vertical position on the screen. A meta-analysis of all six experiments demonstrates a small but reliable effect of the item’s screen position on mean ratings of the item; the ratings are significantly more positive when the item appears in a higher position on the screen than when it appears farther down. These results are consistent with the hypothesis that respondents follow the “Up means good” heuristic, using the vertical position of the item as a cue in evaluating it. Respondents seem to rely on heuristics both in interpreting response scales and in forming judgments. PMID:24634546
Multidisciplinary Delphi Development of a Scale to Evaluate Team Function in Obstetric Emergencies: The PETRA Scale.

PubMed

Balki, Mrinalini; Hoppe, David; Monks, David; Cooke, Mary Ellen; Sharples, Lynn; Windrim, Rory

2017-06-01

The objective of this study was to develop a new interdisciplinary teamwork scale, the Perinatal Emergency: Team Response Assessment (PETRA), for the management of obstetric crises, through consensus agreement of obstetric caregivers. This prospective study was performed using expert consensus, based on a Delphi method. The study investigators developed a new PETRA tool, specifically related to obstetric crisis management, based on the existing literature and discussions among themselves. The scale was distributed to a selected panel of experts in the field for the Delphi process. After each round of Delphi, every component of the scale was analyzed quantitatively by the percentage of agreement ratings and each comment reviewed by the blinded investigators. The assessment scale was then modified, with components of less than 80% agreement removed from the scale. The process was repeated on three occasions to reach a consensus and final PETRA scale. Fourteen of 24 invited experts participated in the Delphi process. The original PETRA scale included six categories and 48 items, one global scale item, and a 3-point rubric for rating. The overall percentage agreement by experts in the first, second, and third rounds was 95.0%, 93.2%, and 98.5%, respectively. The final scale after the third round of Delphi consisted of the following seven categories: shared mental model, communication, situational awareness, leadership, followership, workload management, and positive/effective behaviours and attitudes. There were 34 individual items within these categories, each with a 5-point rating rubric (1 = unacceptable to 5 = perfect). Using a structured Delphi method, we established the face and content validity of this assessment scale that focuses on important aspects of interdisciplinary teamwork in the management of obstetric crises. Copyright © 2017 The Society of Obstetricians and Gynaecologists of Canada/La Société des obstétriciens et gynécologues du Canada. Published by Elsevier Inc. All rights reserved.
Effects of levomilnacipran ER on fatigue symptoms associated with major depressive disorder

PubMed Central

Fava, Maurizio; Gommoll, Carl; Chen, Changzheng; Greenberg, William M.; Ruth, Adam

2016-01-01

The aim of this study was to evaluate the effects of levomilnacipran extended-release (ER) on depression-related fatigue in adults with major depressive disorder. Post-hoc analyses of five phase III trials were carried out, with evaluation of fatigue symptoms based on score changes in four items: Montgomery–Åsberg Depression Rating Scale (MADRS) item 7 (lassitude), and 17-item Hamilton Depression Rating Scale (HAMD17) items 7 (work/activities), 8 (retardation), and 13 (somatic symptoms). Symptom remission was analyzed on the basis of score shifts from baseline to end of treatment: MADRS item 7 and HAMD17 item 7 (from ≥2 to ≤1); HAMD17 items 8 and 13 (from ≥1 to 0). The mean change in MADRS total score was analyzed in patients with low and high fatigue (MADRS item 7 baseline score <4 and ≥4, respectively). Patients receiving levomilnacipran ER had significantly greater mean improvements and symptom remission (no/minimal residual fatigue) on all fatigue-related items: lassitude (35 vs. 28%), work/activities (43 vs. 35%), retardation (46 vs. 39%), somatic symptoms (26 vs. 18%; all Ps<0.01 versus placebo). The mean change in MADRS total score was significantly greater with levomilnacipran ER versus placebo in both low (least squares mean difference=−2.8, P=0.0018) and high (least squares mean difference=−3.1, P<0.0001) fatigue subgroups. Levomilnacipran ER treatment was effective in reducing depression-related fatigue in adult patients with major depressive disorder and was associated with remission of fatigue symptoms. PMID:26584326
Refinement and initial validation of a multidimensional composite scale for use in assessing acute postoperative pain in cats.

PubMed

Brondani, Juliana Tabarelli; Luna, Stelio Pacca Loureiro; Padovani, Carlos Roberto

2011-02-01

To refine and test construct validity and reliability of a composite pain scale for use in assessing acute postoperative pain in cats undergoing ovariohysterectomy. 40 cats that underwent ovariohysterectomy in a previous study. In a previous randomized, double-blind, placebo-controlled study, a composite pain scale was developed to assess postoperative pain in cats that received a placebo or an analgesic (tramadol, vedaprofen, or tramadol-vedaprofen combination). In the present study, the scale was refined via item analysis (distribution frequency and occurrence), a nonparametric ANOVA, and item-to-total score correlation. Construct validity was assessed via factor analysis and known-groups discrimination, and reliability was measured by assessing internal consistency. Respiratory rate and respiratory pattern were rejected after item analysis. Factor analysis resulted in 5 dimensions (F1 [psychomotor change], posture, comfort, activity, mental status, and miscellaneous behaviors; F2 [protection of wound area], reaction to palpation of the surgical wound and palpation of the abdomen and flank; F3 [physiologic variables], systolic arterial blood pressure and appetite; F4 [vocal expression of pain], vocalization; and F5 [heart rate]). Internal consistency was excellent for the overall scale and for F1, F2, and F3; very good for F4; and unacceptable for F5. Except for heart rate, the identified factors and scale total score could be used to detect differences between the analgesic and placebo groups and differences among the analgesic treatments. Results provided initial evidence of construct validity and reliability of a multidimensional composite tool for use in assessing acute postoperative pain in cats undergoing ovariohysterectomy.
Psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale: A Rasch rating scale analysis and confirmatory factor analysis.

PubMed

Pilatti, Angelina; Lozano, Oscar M; Cyders, Melissa A

2015-12-01

The present study was aimed at determining the psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale in a sample of college students. Participants were 318 college students (36.2% men; mean age = 20.9 years, SD = 6.4 years). The psychometric properties of this Spanish version were analyzed using the Rasch model, and the factor structure was examined using confirmatory factor analysis. The verification of the global fit of the data showed adequate indexes for persons and items. The reliability estimates were high for both items and persons. Differential item functioning across gender was found for 23 items, which likely reflects known differences in impulsivity levels between men and women. The factor structure of the Spanish version of the UPPS-P replicates previous work with the original UPPS-P Scale. Overall, results suggest that test scores from the Spanish version of the UPPS-P show adequate psychometric properties to accurately assess the multidimensional model of impulsivity, which represents the most exhaustive measure of this construct. (c) 2015 APA, all rights reserved).
Development of a Questionnaire Assessing School Physical Activity Environment

ERIC Educational Resources Information Center

Robertson-Wilson, Jennifer; Levesque, Lucie; Holden, Ronald R.

2007-01-01

This study was designed to develop the Questionnaire Assessing School Physical Activity Environment (Q--SPACE) based on student perceptions. Twenty-eight items rated on 4-point Likert scales were administered to 244 middle school students in 9 schools. Exploratory factor analysis was used to evaluate the underlying structure of the items and 2…
Symptom Frequency Characteristics of the Hamilton Depression Rating Scale of Major Depressive Disorder in Epilepsy.

PubMed

Wiglusz, Mariusz S; Landowski, Jerzy; Michalak, Lidia; Cubała, Wiesław J

2015-09-01

Depressive disorders are common among patients with epilepsy (PWE). The aim of this study was to explore symptom frequencies of 17-item Hamilton Depression Rating Scale (HDRS-17) and recognize the clinical characteristics of Major Depressive Disorder in PWE. A sample of 40 adults outpatients with epilepsy and depression was diagnosed using SCID-I for DSM-IV-TR and HDRS-17. The total HDRS-17 score was analysed followed by the exploratory analysis based on the hierarchical model. The frequencies of HDRS-17 items varied widely in this study. Insomnia related items and general somatic symptoms items as well as insomnia and somatic factors exhibited constant and higher frequency. Feeling guilty, suicide, psychomotor retardation and depressed mood showed relatively lower frequencies. Other symptoms had variable frequencies across the study population. Depressive disorders are common among PWE. In the study group insomnia and somatic symptoms displayed highest values which could represent atypical clinical features of mood disorders in PWE. There is a need for more studies with a use of standardized approach to the problem.
A teaching videotape for the assessment of essential tremor.

PubMed

Louis, E D; Barnes, L; Wendt, K J; Ford, B; Sangiorgio, M; Tabbal, S; Lewis, L; Kaufmann, P; Moskowitz, C; Comella, C L; Goetz, C C; Lang, A E

2001-01-01

Teaching videotapes, developed to aid in the evaluation of several movement disorders, have not been used in essential tremor research. As part of the Washington Heights-Inwood Genetic Study of Essential Tremor (WHIGET), we developed a reliable and valid tremor rating scale. Because this rating scale is currently being used by investigators at other centers, we developed a teaching videotape to aid in the consistent application of this scale. To develop a teaching videotape for a revised version of the WHIGET Tremor Rating Scale and to assess the interrater agreement among raters who used this videotape to rate tremor. The revised WHIGET Tremor Rating Scale was used to rate action tremor from 0 to 4 during six tests: arm extension, pouring, drinking, using a spoon, finger-to-nose, and drawing spirals. A 22-minute teaching videotape was developed that includes a 29-item educational section and a self-assessment section consisting of 20 examples of tremor ratings chosen by the two WHIGET study neurologists. Eight raters, including senior movement disorder specialists, movement disorder fellows, general neurologists, and a movement disorder nurse practitioner, independently viewed the videotape and rated tremor during the self-assessment section. Interobserver reliability was assessed with weighted kappa statistics (kappa(w)). Eight raters each rated 20 items (160 ratings total). Total kappa(w) was 0.97 (nearly perfect agreement). Interrater reliability was as follows: kappa(w) = 0.99 (movement disorder specialists), kappa(w) = 0.98 (movement disorder fellows), and kappa(w) = 0.97 (general neurologists); all kappa(w) were nearly perfect. This teaching videotape may be used to improve the uniform application of the revised WHIGET Tremor Rating Scale by raters with various levels of experience in movement disorders.
Unified Parkinson’s Disease Rating Scale-Motor Exam: Inter-rater reliability of advanced practice nurse and neurologist assessments

PubMed Central

Palmer, Janice L.; Coats, Mary A.; Roe, Catherine M.; Hanko, Shelly M.; Xiong, Chengjie; Morris, John C.

2010-01-01

Aim This paper is a report of a study to establish the inter-rater reliability of advanced practice nurse and neurologist neurological assessments which included ratings with the Unified Parkinson’s Disease Rating Scale-Motor Exam. Background Around the world, advanced practice nurses are performing tasks once completed by only physicians. To promote consumer and provider confidence, it is important to establish that nurse and physician ratings using assessment tools are similar. In addition in research settings, when different raters are used, establishment of inter-rater reliability for study assessments is needed. Method Advanced practice nurses and neurologists independently recorded findings on neurological examinations of 46 participants in a study conducted between August 2007 and January 2008. An intraclass correlation coefficient was calculated to estimate overall agreement between the nurse and neurologist ratings. Agreement for individual items measured on a dichotomous scale was assessed by calculating Cohen’s kappa. Results There was substantial agreement between advanced practice nurses and neurologists on the mean Unified Parkinson’s Disease Rating Scale-Motor Exam ratings (intraclass correlation coefficient = 0.65) and the U.S. National Alzheimer’s Coordinating Center Uniform Data Set neurological examination ratings of unremarkable findings (kappa = 0.74) and of gait disorder (kappa = 0.73). Moderate agreement (kappa = 0.53) was reached for the rating of whether all Unified Parkinson’s Disease Rating Scale-Motor Exam items were normal. Conclusion These findings are consistent with studies of the inter-rater agreement of the Unified Parkinson’s Disease Rating Scale-Motor Exam and support the conduct of neurological assessments by advanced practice nurses. PMID:20546368
The construct validity of the Major Depression Inventory: A Rasch analysis of a self-rating scale in primary care.

PubMed

Nielsen, Marie Germund; Ørnbøl, Eva; Vestergaard, Mogens; Bech, Per; Christensen, Kaj Sparle

2017-06-01

We aimed to assess the measurement properties of the ten-item Major Depression Inventory when used on clinical suspicion in general practice by performing a Rasch analysis. General practitioners asked consecutive persons to respond to the web-based Major Depression Inventory on clinical suspicion of depression. We included 22 practices and 245 persons. Rasch analysis was performed using RUMM2030 software. The Rasch model fit suggests that all items contribute to a single underlying trait (defined as internal construct validity). Mokken analysis was used to test dimensionality and scalability. Our Rasch analysis showed misfit concerning the sleep and appetite items (items 9 and 10). The response categories were disordered for eight items. After modifying the original six-point to a four-point scoring system for all items, we achieved ordered response categories for all ten items. The person separation reliability was acceptable (0.82) for the initial model. Dimensionality testing did not support combining the ten items to create a total score. The scale appeared to be well targeted to this clinical sample. No significant differential item functioning was observed for gender, age, work status and education. The Rasch and Mokken analyses revealed two dimensions, but the Major Depression Inventory showed fit to one scale if items 9 and 10 were excluded. Our study indicated scalability problems in the current version of the Major Depression Inventory. The conducted analysis revealed better statistical fit when items 9 and 10 were excluded. Copyright © 2017 Elsevier Inc. All rights reserved.
The Reward-Based Eating Drive Scale: A Self-Report Index of Reward-Based Eating

PubMed Central

Mason, Ashley E.; Laraia, Barbara A.; Hartman, William; Ready, Karen; Acree, Michael; Adam, Tanja C.; St. Jeor, Sachiko; Kessler, David

2014-01-01

Why are some individuals more vulnerable to persistent weight gain and obesity than are others? Some obese individuals report factors that drive overeating, including lack of control, lack of satiation, and preoccupation with food, which may stem from reward-related neural circuitry. These are normative and common symptoms and not the sole focus of any existing measures. Many eating scales capture these common behaviors, but are confounded with aspects of dysregulated eating such as binge eating or emotional overeating. Across five studies, we developed items that capture this reward-based eating drive (RED). Study 1 developed the items in lean to obese individuals (n = 327) and examined changes in weight over eight years. In Study 2, the scale was further developed and expert raters evaluated the set of items. Study 3 tested psychometric properties of the final 9 items in 400 participants. Study 4 examined psychometric properties and race invariance (n = 80 women). Study 5 examined psychometric properties and age/gender invariance (n = 381). Results showed that RED scores correlated with BMI and predicted earlier onset of obesity, greater weight fluctuations, and greater overall weight gain over eight years. Expert ratings of RED scale items indicated that the items reflected characteristics of reward-based eating. The RED scale evidenced high internal consistency and invariance across demographic factors. The RED scale, designed to tap vulnerability to reward-based eating behavior, appears to be a useful brief tool for identifying those at higher risk of weight gain over time. Given the heterogeneity of obesity, unique brief profiling of the reward-based aspect of obesity using a self-report instrument such as the RED scale may be critical for customizing effective treatments in the general population. PMID:24979216
The Australian Racism, Acceptance, and Cultural-Ethnocentrism Scale (RACES): item response theory findings.

PubMed

Grigg, Kaine; Manderson, Lenore

2016-03-17

Racism and associated discrimination are pervasive and persistent challenges with multiple cumulative deleterious effects contributing to inequities in various health outcomes. Globally, research over the past decade has shown consistent associations between racism and negative health concerns. Such research confirms that race endures as one of the strongest predictors of poor health. Due to the lack of validated Australian measures of racist attitudes, RACES (Racism, Acceptance, and Cultural-Ethnocentrism Scale) was developed. Here, we examine RACES' psychometric properties, including the latent structure, utilising Item Response Theory (IRT). Unidimensional and Multidimensional Rating Scale Model (RSM) Rasch analyses were utilised with 296 Victorian primary school students and 182 adolescents and 220 adults from the Australian community. RACES was demonstrated to be a robust 24-item three-dimensional scale of Accepting Attitudes (12 items), Racist Attitudes (8 items), and Ethnocentric Attitudes (4 items). RSM Rasch analyses provide strong support for the instrument as a robust measure of racist attitudes in the Australian context, and for the overall factorial and construct validity of RACES across primary school children, adolescents, and adults. RACES provides a reliable and valid measure that can be utilised across the lifespan to evaluate attitudes towards all racial, ethnic, cultural, and religious groups. A core function of RACES is to assess the effectiveness of interventions to reduce community levels of racism and in turn inequities in health outcomes within Australia.

Measuring health-related problem solving among African Americans with multiple chronic conditions: application of Rasch analysis.

PubMed

Fitzpatrick, Stephanie L; Hill-Briggs, Felicia

2015-10-01

Identification of patients with poor chronic disease self-management skills can facilitate treatment planning, determine effectiveness of interventions, and reduce disease complications. This paper describes the use of a Rasch model, the Rating Scale Model, to examine psychometric properties of the 50-item Health Problem-Solving Scale (HPSS) among 320 African American patients with high risk for cardiovascular disease. Items on the positive/effective HPSS subscales targeted patients at low, moderate, and high levels of positive/effective problem solving, whereas items on the negative/ineffective problem solving subscales mostly targeted those at moderate or high levels of ineffective problem solving. Validity was examined by correlating factor scores on the measure with clinical and behavioral measures. Items on the HPSS show promise in the ability to assess health-related problem solving among high risk patients. However, further revisions of the scale are needed to increase its usability and validity with large, diverse patient populations in the future.
A Life Events Scale for Armed Forces personnel

PubMed Central

Chaudhury, Suprakash; Srivastava, Kalpana; Raju, M.S.V. Kama; Salujha, S.K.

2006-01-01

Background: Armed Forces personnel are routinely exposed to a number of unique stressful life events. None of the available scales are relevant to service personnel. Aim: To construct a scale to measure life events in service personnel. Methods: In the first stage of the study open-ended questions along with items generated by the expert group by consensus method were administered to 50 soldiers. During the second stage a scale comprising 59 items and open-ended questions was administered to 165 service personnel. The final scale of 52 items was administered to 200 service personnel in group setting. Weightage was assigned on a 0 to 100 range. For normative study the Armed Forces Medical College Life Events Scale (AFMC LES) was administered to 1200 Army, 100 Air Force and 100 Navy personnel. Results: Service personnel experience an average of 4 life events in past one year and 13 events in a life-time. On an average service personnel experience 115 life change unit scores in past one year and 577 life change unit scores in life-time on the AFMC LES. The scale has concurrent validity when compared with the Presumptive Stressful Life Events Scale (PSLES). There is internal consistency in the scale with the routine items being rated very low. There is a pattern of uniformity with the civilian counterparts along with differences in the items specific to service personnel. Conclusions: The AFMC LES includes the unique stresses of service personnel that are not included in any life events scale available in India or in the west and should be used to assess stressful life events in service personnel. PMID:20844647
The Effect of Response Format on the Psychometric Properties of the Narcissistic Personality Inventory: Consequences for Item Meaning and Factor Structure.

PubMed

Ackerman, Robert A; Donnellan, M Brent; Roberts, Brent W; Fraley, R Chris

2016-04-01

The Narcissistic Personality Inventory (NPI) is currently the most widely used measure of narcissism in social/personality psychology. It is also relatively unique because it uses a forced-choice response format. We investigate the consequences of changing the NPI's response format for item meaning and factor structure. Participants were randomly assigned to one of three conditions: 40 forced-choice items (n = 2,754), 80 single-stimulus dichotomous items (i.e., separate true/false responses for each item; n = 2,275), or 80 single-stimulus rating scale items (i.e., 5-point Likert-type response scales for each item; n = 2,156). Analyses suggested that the "narcissistic" and "nonnarcissistic" response options from the Entitlement and Superiority subscales refer to independent personality dimensions rather than high and low levels of the same attribute. In addition, factor analyses revealed that although the Leadership dimension was evident across formats, dimensions with entitlement and superiority were not as robust. Implications for continued use of the NPI are discussed. © The Author(s) 2015.
Examining the psychometric properties of the Hindi version of Family Accommodation Scale-Self-Report (FAS-SR).

PubMed

Mahapatra, Ananya; Gupta, Rishi; Patnaik, Kuppili Pooja; Pattanaik, Raman Deep; Khandelwal, Sudhir Kumar

2017-10-01

Family accommodation (FA) is the phenomenon whereby caregivers assist or facilitate rituals or behaviours related to obsessive compulsive disorder (OCD). There is a need for a self-rated instrument to assess this construct in resource-strained clinical settings of India. To explore the factor structure of Hindi version of Family Accommodation Scale-Self Rated version (FAS-SR) and compare its validity with the gold standard Family Accommodation Scale-Interviewer Rated (FAS-IR) scale. The Hindi version of FAS-SR scale and FAS-IR scale was applied on 105 caregivers of patients with OCD. The initial factor analysis yielded three-factor models with an eigenvalue of >1 and the total variance explained by these factors was 72.017%. The internal consistency of the 19-item scale was 0.93 indicating good inter-item correlation. There was a significant positive correlation between FAS-IR scale total score and all the factors of the FAS-SR Scale. The average measure ICC was 0.889 with a 95% confidence interval from 0.783 to 0.981 (F (62,84)=37.547, p<001) indicating high degree of reliability between the Hindi version of FAS-SR and the FAS-IR scale. FAS-SR is a practical alternative to FAS-IR and has the potential to be used widely in an Indian setting. Copyright © 2017 Elsevier B.V. All rights reserved.
Psychometric evaluation of a short measure of social capital at work.

PubMed

Kouvonen, Anne; Kivimäki, Mika; Vahtera, Jussi; Oksanen, Tuula; Elovainio, Marko; Cox, Tom; Virtanen, Marianna; Pentti, Jaana; Cox, Sara J; Wilkinson, Richard G

2006-10-13

Prior studies on social capital and health have assessed social capital in residential neighbourhoods and communities, but the question whether the concept should also be applicable in workplaces has been raised. The present study reports on the psychometric properties of an 8-item measure of social capital at work. Data were derived from the Finnish Public Sector Study (N = 48,592) collected in 2000-2002. Based on face validity, an expert unfamiliar with the data selected 8 questionnaire items from the available items for a scale of social capital. Reliability analysis included tests of internal consistency, item-total correlations, and within-unit (interrater) agreement by rwg index. The associations with theoretically related and unrelated constructs were examined to assess convergent and divergent validity (construct validity). Criterion-related validity was explored with respect to self-rated health using multilevel logistic regression models. The effects of individual level and work unit level social capital were modelled on self-rated health. The internal consistency of the scale was good (Cronbach's alpha = 0.88). The rwg index was 0.88, which indicates a significant within-unit agreement. The scale was associated with, but not redundant to, conceptually close constructs such as procedural justice, job control, and effort-reward imbalance. Its associations with conceptually more distant concepts, such as trait anxiety and magnitude of change in work, were weaker. In multilevel models, significantly elevated age adjusted odds ratios (ORs) of poor self-rated health (OR = 2.42, 95% confidence interval (CI): 2.24-2.61 for the women and OR = 2.99, 95% CI: 2.56-3.50 for the men) were observed for the employees in the lowest vs. highest quartile of individual level social capital. In addition, low social capital at the work unit level was associated with a higher likelihood of poor self-rated health. Psychometric techniques show our 8-item measure of social capital to be a valid tool reflecting the construct and displaying the postulated links with other variables.
Development of a brief measure of intimate partner violence experiences: the Composite Abuse Scale (Revised)—Short Form (CASR-SF)

PubMed Central

Ford-Gilboe, Marilyn; Wathen, C Nadine; Varcoe, Colleen; MacMillan, Harriet L; Scott-Storey, Kelly; Mantler, Tara; Hegarty, Kelsey; Perrin, Nancy

2016-01-01

Objectives Approaches to measuring intimate partner violence (IPV) in populations often privilege physical violence, with poor assessment of other experiences. This has led to underestimating the scope and impact of IPV. The aim of this study was to develop a brief, reliable and valid self-report measure of IPV that adequately captures its complexity. Design Mixed-methods instrument development and psychometric testing to evolve a brief version of the Composite Abuse Scale (CAS) using secondary data analysis and expert feedback. Setting Data from 5 Canadian IPV studies; feedback from international IPV experts. Participants 31 international IPV experts including academic researchers, service providers and policy actors rated CAS items via an online survey. Pooled data from 6278 adult Canadian women were used for scale development. Primary/secondary outcome measures Scale reliability and validity; robustness of subscales assessing different IPV experiences. Results A 15-item version of the CAS has been developed (Composite Abuse Scale (Revised)—Short Form, CASR-SF), including 12 items developed from the original CAS and 3 items suggested through expert consultation and the evolving literature. Items cover 3 abuse domains: physical, sexual and psychological, with questions asked to assess lifetime, recent and current exposure, and abuse frequency. Factor loadings for the final 3-factor solution ranged from 0.81 to 0.91 for the 6 psychological abuse items, 0.63 to 0.92 for the 4 physical abuse items, and 0.85 and 0.93 for the 2 sexual abuse items. Moderate correlations were observed between the CASR-SF and measures of depression, post-traumatic stress disorder and coercive control. Internal consistency of the CASR-SF was 0.942. These reliability and validity estimates were comparable to those obtained for the original 30-item CAS. Conclusions The CASR-SF is brief self-report measure of IPV experiences among women that has demonstrated initial reliability and validity and is suitable for use in population studies or other studies. Additional validation of the 15-item scale with diverse samples is required. PMID:27927659
Reliability and Validity of the Turkish Version of the Gastrointestinal Symptom Rating Scale.

PubMed

Turan, Nuray; Aşt, Türkinaz Atabek; Kaya, Nurten

The purpose of this methodological study is to investigate the validity and reliability of the Turkish version of the Gastrointestinal Symptom Rating Scale (GSRS). The scale was adapted to the Turkish language via backward translation. Content validity was examined by referring to experts. Reliability was examined via test-retest reliability and internal consistency, and validity was examined with divergent and convergent validity. The Epworth Sleepiness Scale (ESS) and the Marlowe-Crowne Social Desirability Scale (MCSDS) were used for divergent validity. As for convergent validity, the Constipation Severity Instrument (CSI) and the Patient Assessment of Constipation Quality of Life Scale (PAC-QOLQ) were utilized. The relationship between the GSRS and the health-related quality of life (36-item short-form health survey [SF-36]) was also analyzed. The study population consisted of patients in orthopedic clinic who volunteered to participate. Test-retest reliability was examined with the participation of 30 patients; internal consistency and validity were examined with 150 patients. Test-retest reliability correlation coefficients of the GSRS varied from 0.39 to 0.87 for all items. For internal consistency, the GSRS's item total correlation was found to be 0.17-0.67, and Cronbach α was 0.82 for all items. There was a positive linear significant correlation between the GSRS, CSI, and PAC-QOLQ. There was no significant correlation between the GSRS, MCSDS, and ESS. Higher GSRS scores inversely correlated with general quality of life (SF-36). The Turkish version of the GSRS has been found to be a reliable and valid instrument for assessing patients' gastrointestinal symptoms. Therefore, this instrument can be confidently used with Turkish individuals.
Value-Eroding Teacher Behaviors Scale: A Validity and Reliability Study

ERIC Educational Resources Information Center

Arseven, Zeynep; Kiliç, Abdurrahman; Sahin, Seyma

2016-01-01

In the present study, it is aimed to develop a valid and reliable scale for determining value-eroding behaviors of teachers, hence their values of judgment. The items of the "Value-eroding Teacher Behaviors Scale" were designed in the form of 5-point likert type rating scale. The exploratory factor analysis (EFA) was conducted to…
Development and Psychometric Evaluation of a Clinical Global Impression for Schizoaffective Disorder Scale

PubMed Central

Daniel, David G; Revicki, Dennis A; Canuso, Carla M; Turkoz, Ibrahim; Fu, Dong-Jing; Alphs, Larry; Ishak, K. Jack; Bartko, John J; Lindenmayer, Jean-Pierre

2012-01-01

Objective: The Clinical Global Impression for Schizoaffective Disorder scale is a new rating scale adapted from the Clinical Global Impression scale for use in patients with schizoaffective disorder. The psychometric characteristics of the Clinical Global Impression for Schizoaffective Disorder are described. Design: Content validity was assessed using an investigator questionnaire. Inter-rater reliability was determined with 12 sets of videotaped interviews rated independently by two trained individuals. Test-retest reliability was assessed using 30 randomly selected raters from clinical trials who evaluated the same videos on separate occasions two weeks apart. Convergent and divergent validity and effect size were evaluated by comparing scores between the Clinical Global Impression for Schizoaffective Disorder and the Positive and Negative Syndrome Scale, 21-item Hamilton Rating Scale for Depression, and Young Mania Rating Scale scales using pooled patient data from two clinical trials. Clinical Global Impression for Schizoaffective Disorder scores were then linked to corresponding Positive and Negative Syndrome Scale scores. Results: Content validity was strong. Inter-rater agreement was good to excellent for most scales and subscales (intra-class correlation coefficient ≥0.50). Test-retest showed good reproducibility, with intraclass correlation coefficients ranging from 0.444 to 0.898. Spearman correlations between Clinical Global Impression for Schizoaffective Disorder domains and corresponding symptom scales were 0.60 or greater, and effect sizes for Clinical Global Impression for Schizoaffective Disorder overall and domain scores were similar to Positive and Negative Syndrome Scale Young Mania Rating Scale, and 21-item Hamilton Rating Scale for Depression scores. Raters anticipated that the scale might be less effective in distinguishing negative from depressive symptoms, and, in fact, the results here may reflect that clinical reality. Conclusion: Multiple lines of evidence support the reliability and validity of the Clinical Global Impression for Schizoaffective Disorder for studies in schizoaffective disorder. PMID:22347687
The Adaptation of the Mathematics Anxiety Rating Scale-Elementary Form into Turkish, Language Validity, and Preliminary Psychometric Investigation

ERIC Educational Resources Information Center

Baloglu, Mustafa; Balgalmis, Esra

2010-01-01

The purpose of the present study was to adapt the Mathematics Anxiety Rating Scale- Elementary Form (MARS-E, Suinn, 1988) into Turkish by first doing the translation of its items and then the preliminary psychometric investigation of the Turkish form. The study included four different samples: 30 bilingual language experts, 50 Turkish language…
Psychometric Properties of a Proposed Short Form of the BASC Teacher Rating Scale--Preschool

ERIC Educational Resources Information Center

Yanosky, Daniel J.; Schwanenflugel, Paula J.; Kamphaus, Randy W.

2013-01-01

A 25 item short form of the Behavioral Assessment System for Children (BASC) Teacher Rating Scale--Preschool (TRS-P) was developed by the BASC authors to serve as an emotional/behavioral indicator for an academic intervention study targeting preschool-aged students. The BASC screener is thought to fulfill a need for an abbreviated behavior rating…
A Brief "DSM-IV"-Referenced Teacher Rating Scale for Monitoring Behavioral Improvement in ADHD and Co-Occurring Symptoms

ERIC Educational Resources Information Center

Sprafkin, Joyce; Mattison, Richard E.; Gadow, Kenneth D.; Schneider, Jayne; Lavigne, John V.

2011-01-01

Objective: To examine the psychometric properties of the 30-item teacher's version of the Child and Adolescent Symptom Inventory Progress Monitor (CASI-PM-T), a "DSM-IV"-referenced rating scale for monitoring change in ADHD and co-occurring symptoms in youths receiving behavioral or pharmacological interventions. Method: Three separate studies…
Development of a new assessment scale for measuring interaction during staff-assisted transfer of residents in dementia special care units.

PubMed

Thunborg, Charlotta; von Heideken Wågert, Petra; Götell, Eva; Ivarsson, Ann-Britt; Söderlund, Anne

2015-02-10

Mobility problems and cognitive deficits related to transferring or moving persons suffering from dementia are associated with dependency. Physical assistance provided by staff is an important component of residents' maintenance of mobility in dementia care facilities. Unfortunately, hands-on assistance during transfers is also a source of confusion in persons with dementia, as well as a source of strain in the caregiver. The bidirectional effect of actions in a dementia care dyad involved in transfer is complicated to evaluate. This study aimed to develop an assessment scale for measuring actions related to transferring persons with dementia by dementia care dyads. This study was performed in four phases and guided by the framework of the biopsychosocial model and the approach presented by Social Cognitive Theory. These frameworks provided a starting point for understanding reciprocal effects in dyadic interaction. The four phases were 1) a literature review identifying existing assessment scales; 2) analyses of video-recorded transfer of persons with dementia for further generation of items, 3) computing the item content validity index of the 93 proposed items by 15 experts; and 4) expert opinion on the response scale and feasibility testing of the new assessment scale by video observation of the transfer situations. The development process resulted in a 17-item scale with a seven-point response scale. The scale consists of two sections. One section is related to transfer-related actions (e.g., capability of communication, motor skills performance, and cognitive functioning) of the person with dementia. The other section addresses the caregivers' facilitative actions (e.g., preparedness of transfer aids, interactional skills, and means of communication and interaction). The literature review and video recordings provided ideas for the item pool. Expert opinion decreased the number of items by relevance ratings and qualitative feedback. No further development of items was performed after feasibility testing of the scale. To enable assessment of transfer-related actions in dementia care dyads, our new scale shows potential for bridging the gap in this area. Results from this study could provide health care professionals working in dementia care facilities with a useful tool for assessing transfer-related actions.
Screening for postnatal depression in Chinese-speaking women using the Hong Kong translated version of the Edinburgh Postnatal Depression Scale.

PubMed

Chen, Helen; Bautista, Dianne; Ch'ng, Ying Chia; Li, Wenyun; Chan, Edwin; Rush, A John

2013-06-01

The Edinburgh Postnatal Depression Scale (EPDS) may not be a uniformly valid postnatal depression (PND) screen across populations. We evaluated the performance of a Chinese translation of 10-item (HK-EPDS) and six-item (HK-EPDS-6) versions in post-partum women in Singapore. Chinese-speaking post-partum obstetric clinic patients were recruited for this study. They completed the HK-EPDS, from which we derived the six-item HK-EPDS-6. All women were clinically assessed for PND based on Diagnostic and Statistical Manual, Fourth Edition-Text Revision criteria. Receiver-operator curve (ROC) analyses and likelihood ratio computations informed scale cutoff choices. Clinical fitness was judged by thresholds for internal consistency [α ≥ 0.70] and for diagnostic performance by true-positive rate (>85%), false-positive rate (≤10%), positive likelihood ratio (>1), negative likelihood ratio (<0.2), area under the ROC curve (AUC, ≥90%) and effect size (≥0.80). Based on clinical interview, prevalence of PND was 6.2% in 487 post-partum women. HK-EPDS internal consistency was 0.84. At 13 or more cutoff, the true-positive rate was 86.7%, false-positive rate 3.3%, positive likelihood ratio 26.4, negative likelihood ratio 0.14, AUC 94.4% and effect size 0.81. For the HK-EPDS-6, internal consistency was 0.76. At 8 or more cutoff, we found a true-positive rate of 86.7%, false-positive rate 6.6%, positive likelihood ratio 13.2, negative likelihood ration 0.14, AUC 92.9% and effect size 0.98. The HK-EPDS (cutoff ≥13) and HK-EPDS6 (cutoff ≥8) are fit for PND screening for general population post-partum women. The brief six-item version appears to be clinically suitable for quick screening in Chinese speaking women. Copyright © 2013 Wiley Publishing Asia Pty Ltd.
The functional assessment measure (FAM) in closed traumatic brain injury outpatients: a Rasch-based psychometric study.

PubMed

Tesio, L; Cantagallo, A

1998-01-01

The Functional Assessment Measure (FAM) has been proposed as a measure of disability in post-acute Traumatic Brain Injury (TBI) outpatients. It is comprised of the 18 items of The Functional Independence Measure (FIMSM), scored in terms of dependence, and of 12 newly designed items, scored in terms of dependence (7 items) or performance (5 items). The FIMSM covers the domains of self-care, sphincter management, mobility, locomotion, communication and social cognition. The 12 new items explore the domains of community integration, emotional status, orientation, attention, reading/writing skills, swallowing and speech intelligibility. By addressing a set of problems quite specific for TBI outpatients the FAM was intended to raise the ceiling of the FIMSM and to allow a more precise estimate of their disability. These claims, however, were never supported in previous studies. We administered the FAM to 60 TBI outpatient, 2-88 months (median 16) from trauma. Rasch analysis (rating scale model) was adopted to test the psychometric properties of the scale. The FAM was reliable (Rasch item and person reliability 0.91 and 0.93, respectively). Two of the 12 FAM-specific items were severely misfitting with the general construct, and were deleted. Within the 28-item refined FAM scale, 4 new items and 2 FIMSM items still retained signs of misfit. The FAM was on average too easy. The most difficult item (a new one, Employability) did not attain the average ability of the subjects. Also, it was only slightly more difficult than than the most difficult FIMSM item (Memory). The FAM does not seem to improve the FIMSM as a far as TBI outpatients are to be assessed.
Household Food Security in Isfahan Based on Current Population Survey Adapted Questionnaire

PubMed Central

Rafiei, Morteza; Rastegari, Hosein Ali; Ghiasi, Mojdeh; Shahsanaie, Vahid

2013-01-01

Background: Food security is a state in which all people at every time have physical and economic access to adequate food to obviate nutritional needs and live a healthy and active life. Therefore, this study was performed to quantitatively evaluate the household food security in Esfahan using the localized version of US Household Food Security Survey Module (US HFSSM). Methods: This descriptive cross-sectional study was performed in year 2006 on 3000 households of Esfahan. The study instrument used in this work is 18-item US food security module, which is developed into a localized 15-item questionnaire. This study is performed in two stages of families with no children (under 18 years old) and families with children over 18 years old. Results: The results showed that item severity coefficient, ratio of responses given by households and item infit and outfit coefficient in adult's and children's questionnaire respectively. According to obtained data, scale score of +3 in adults group is described as determination limit of slight food insecurity and +6 is stated as the limit for severe food insecurity. For children's group, scale score of +2 is defined to be the limit of slight food insecurity and +5 is the determination limit of severe food insecurity. Conclusions: The main hypothesis of this survey analysis is based on the raw scale score of USFSSM The item of “lack of enough money for buying food” (item 2) and the item of “lack of balanced meal” (3rd item) have the lowest severity coefficient. Then, the ascending rate of item severity continues in first item, 4th item and keeps increasing into 10th item. PMID:24498498
Accumulation of Content Validation Evidence for the Critical Thinking Self-Assessment Scale.

PubMed

Nair, Girija Gopinathan; Hellsten, Laurie-Ann M; Stamler, Lynnette Leeseberg

2017-04-01

Critical thinking skills (CTS) are essential for nurses; assessing students' acquisition of these skills is a mandate of nursing curricula. This study aimed to develop a self-assessment instrument of critical thinking skills (Critical Thinking Self-Assessment Scale [CTSAS]) for students' self-monitoring. An initial pool of 196 items across 6 core cognitive skills and 16 subskills were generated using the American Philosophical Association definition of CTS. Experts' content review of the items and their ratings provided evidence of content relevance using the item-level content validity index (I-CVI) and Aiken's content validity coefficient (VIk). 115 items were retained (range of I-CVI values = .70 to .94 and range of VIk values = .69-.95; significant at p< .05). The CTSAS is the first CTS instrument designed specifically for self-assessment purposes.
Development of a nursing care problems coping scale for male caregivers for people with dementia living at home.

PubMed

Nishio, Midori; Ono, Mitsu

2015-01-01

The number of male caregivers has increased, but male caregivers face several problems that reduce their quality of life and psychological condition. This study focused on the coping problems of men who care for people with dementia at home. It aimed to develop a coping scale for male caregivers so that they can continue caring for people with dementia at home and improve their own quality of life. The study also aimed to verify the reliability and validity of the scale. The subjects were 759 men who care for people with dementia at home. The Care Problems Coping Scale consists of 21 questions based on elements of questions extracted from a pilot study. Additionally, subjects completed three self-administered questionnaires: the Japanese version of the Zarit Caregiver Burden Scale, the Depressive Symptoms and the Self-esteem Emotional Scale, and Rosenberg Self-Esteem Scale. There were 274 valid responses (36.1% response rate). Regarding the answer distribution, each average value of the 21 items ranged from 1.56 to 2.68. The median answer distribution of the 21 items was 39 (SD = 6.6). Five items had a ceiling effect, and two items had a floor effect. The scale stability was about 50%, and Cronbach's α was 0.49. There were significant correlations between the Care Problems Coping Scale and total scores of the Japanese version of the Zarit Caregiver Burden Scale, the Depressive Symptoms and Self-esteem Emotional Scale, and the Rosenberg Self-Esteem Scale. The answers provided on the Care Problems Coping Scale questionnaire indicated that male caregivers experience care problems. In terms of validity, there were significant correlations between the external questionnaires and 19 of the 21 items in this scale. This scale can therefore be used to measure problems with coping for male caregivers who care for people with dementia at home.
Dimensional approach to symptom factors of major depressive disorder in Koreans, using the Brief Psychiatric Rating Scale: the Clinical Research Center for Depression of South Korea study.

PubMed

Park, Seon-Cheol; Jang, Eun Young; Kim, Daeho; Jun, Tae-Youn; Lee, Min-Soo; Kim, Jae-Min; Kim, Jung-Bum; Jo, Sun-Jin; Park, Yong Chon

2015-01-01

Although major depressive disorder (MDD) has a variety of symptoms beyond the affective dimensions, the factor structure and contents of comprehensive psychiatric symptoms of this disorder have rarely been explored using the 18-item Brief Psychiatric Rating Scale (BPRS). We aimed to identify the factor structure of the 18-item BPRS in Korean MDD patients. A total of 258 MDD patients were recruited from a multicenter sample of the Clinical Research Center for Depression of South Korea study. Psychometric scales were used to assess overall psychiatric symptoms (BPRS), depression (Hamilton Depression Rating Scale), anxiety (Hamilton Anxiety Rating Scale), global severity (Clinical Global Impression of Severity Scale), suicidal ideation (Scale for Suicide Ideation), functioning (Social and Occupational Functioning Assessment Scale), and quality of life (World Health Organization Quality of Life Assessment-abbreviated version). Common factor analysis with oblique rotation was used to yield factor structure. A four-factor structure was designed and interpreted by the symptom dimensions to reflect mood disturbance, positive symptoms/apathy, bipolarity, and thought distortion/mannerism. These individual factors were also significantly correlated with clinical variables. The findings of this study support the view that the BPRS may be a promising measuring tool for the initial assessment of MDD patients. In addition, the four-factor structure of the BPRS may be useful in understanding the mood and psychotic characteristics of these patients. Copyright © 2014. Published by Elsevier Taiwan.
Bayesian Estimation of Circumplex Models Subject to Prior Theory Constraints and Scale-Usage Bias

ERIC Educational Resources Information Center

Lenk, Peter; Wedel, Michel; Bockenholt, Ulf

2006-01-01

This paper presents a hierarchical Bayes circumplex model for ordinal ratings data. The circumplex model was proposed to represent the circular ordering of items in psychological testing by imposing inequalities on the correlations of the items. We provide a specification of the circumplex, propose identifying constraints and conjugate priors for…

Survey of the Importance of Professional Behaviors among Medical Students, Residents, and Attending Physicians

ERIC Educational Resources Information Center

Morreale, Mary K.; Balon, Richard; Arfken, Cynthia L.

2011-01-01

Objective: The authors compared the importance of items related to professional behavior among medical students rotating through their psychiatry clerkship, psychiatry residents, and attending psychiatrists. Method: The authors sent an electronic survey with 43 items (rated on the scale 1: Not at All Important; to 5: Very Important) to medical…
Screening for somatization and hypochondriasis in primary care and neurological in-patients: a seven-item scale for hypochondriasis and somatization.

PubMed

Fink, P; Ewald, H; Jensen, J; Sørensen, L; Engberg, M; Holm, M; Munk-Jørgensen, P

1999-03-01

The aim of this study was to investigate the internal and external validity of the Whiteley Index as a screening instrument for somatization illness. A 14-item version of the Whiteley Index for hypochondriacal traits was given to 99 of 191 consecutive primary care patients, aged 18-65 years, and to 100 consecutive patients, aged 18-60 years, admitted for the first time to a neurological ward. The primary care sample was, in addition, interviewed by means of the SCAN (Schedules for Clinical Assessment in Neuropsychiatry) psychiatric interview. The GPs and the neurologists were asked to rate various characteristics of the patients that might indicate somatization. The internal validity of the Whiteley Index was tested by means of latent structure analysis. On this basis, a reduced seven-item scale (Whiteley-7 scale) and two subscales (i.e., an Illness Conviction and Illness Worrying scale, each with three items) were constructed. All three had a high internal validity fitting into the very restricted Rasch statistical model (p>0.05) and an acceptable transferability between most of the subpopulations investigated. In the primary care population, the Whiteley-7 and the Illness Conviction scales at cut-point 0/1 showed 1.00 and 0.87 sensitivity and 0.65 and 0.87 specificity, respectively, using as "gold standard" the fulfillment of criteria for at least one ICD-10 somatoform disorder, and 0.71 and 0.63 sensitivity and 0.62 and 0.87 specificity, respectively, as gold standard for the fulfillment of criteria for at least one DSM-IV somatoform disorder, excluding the NOS diagnostic group. The Illness Worrying subscale showed less impressive performance in this respect. The agreement between the Whiteley-7 scale including the two subscales and neurologists' rating and the GPs' rating and the somatization subscale on the SCL-90 was modest or worse. It may be concluded that the Whiteley-7 scale and the Illness Conviction subscale had acceptable psychometric profiles, and both seem to be promising screening tools for not only hypochondriasis but also for somatoform disorders in general.
Development of the Ghent Multidimensional Somatic Complaints Scale

ERIC Educational Resources Information Center

Beirens, Koen; Fontaine, Johnny R. J.

2010-01-01

The present study aimed at developing a new scale that operationalizes a hierarchical model of somatic complaints. First, 63 items representing a wide range of symptoms and sensations were compiled from somatic complaints scales and emotion literature. These complaints were rated by Belgian students (n = 307) and Belgian adults (n = 603).…
The Job Responsibilities Scale: Invariance in a Longitudinal Prospective Study.

ERIC Educational Resources Information Center

Ludlow, Larry H.; Lunz, Mary E.

1998-01-01

The degree of invariance of the Job Responsibilities Scale for medical technologists was studied for 1993 and 1995, conducting factor analyses of data from each year (1063 and 665 individuals, respectively). Nearly identical factor patterns were found, and Rasch rating scale analyses found nearly identical pairs of item estimates. Implications are…
Item Response Modeling of Forced-Choice Questionnaires

ERIC Educational Resources Information Center

Brown, Anna; Maydeu-Olivares, Alberto

2011-01-01

Multidimensional forced-choice formats can significantly reduce the impact of numerous response biases typically associated with rating scales. However, if scored with classical methodology, these questionnaires produce ipsative data, which lead to distorted scale relationships and make comparisons between individuals problematic. This research…
Development and validation of the German version of the Orofacial Esthetic Scale.

PubMed

Reissmann, Daniel R; Benecke, Andreas W; Aarabi, Ghazal; Sierwald, Ira

2015-07-01

This study aimed to develop the German version of the Orofacial Esthetic Scale (OES-G) and to assess its psychometric properties. The OES is an eight-item instrument with seven items directly addressing esthetic impacts of the orofacial region and an eighth item for a global assessment. It applies an 11-point ordinal rating scale, with summary scores ranging from 0 (worst) to 70 (best). The original OES items were translated into German using a forward-backward method. A de novo development of German items (n = 21 patients) and a cross-cultural adaptation after pilot testing (n = 15 patients) established content validity. Internal consistency and construct validity (structural, convergent, known-groups) of the OES-G were assessed in a sample of 165 prosthodontic patients. The OES was applied in 42 patients on two occasions, with a temporal distance of 2-4 weeks apart to determine test-retest reliability. Internal consistency of the OES-G was considered as satisfactory (Cronbach's alpha 0.94; average inter-item correlation 0.64). Intraclass correlation coefficient of 0.95 (95 % confidence interval 0.92-0.98) indicated excellent test-retest reliability. Correlation matrix and exploratory factor analysis provided support for unidimensionality of the measured construct. The OES-G summary score was correlated with the patients' global assessment of their esthetics (r = 0.87) and external ratings of the expert group (r = 0.55) and discriminated patients with treatment need (39.4 points) from patients without (58.4 points; p < 0.001) and with a large effect size. The OES-G has good psychometric properties and is a valuable instrument for the assessment of self-perceived orofacial esthetics.
Testing parent dyad interchangeability in the parent proxy-report of PedsQL™ 4.0: a differential item functioning analysis.

PubMed

Doostfatemeh, Marziyeh; Ayatollahi, Seyyed Mohammad Taghi; Jafari, Peyman

2015-08-01

In child-parent agreement studies in the field of paediatric health-related quality of life (HRQoL), little attention has been paid to the effect of gender in parental proxy rating of children's HRQoL. This study aims to test the potential interchangeability of parent dyads in reporting children's HRQoL on both item and scale levels of the PedsQL™ 4.0 instrument, using the approach of differential item functioning (DIF). The PedsQL™ 4.0 Generic Core Scales were completed by 576 father-and-mother dyads. A polytomous item response theory model, graded response model, was used to detect DIF across fathers and mothers. Assessment at item level showed that fathers and mothers perceived the meaning of items of the PedsQL™ 4.0 consistently. Regarding the scale level, a moderate to high level of agreement was observed between mothers' and fathers' reports on all similar subscales. Although the significant mean score differences in total, physical and emotional functioning indicated that fathers gave higher scores to their children, the small effect size implied that this difference may not be practically meaningful. Our findings revealed that discrepancy in parent dyads in rating children's HRQoL is a "real" difference and not an artefact due to measurement non-invariance. Fathers were seen to have slightly different insights into their children, especially for emotional functioning, but overall the results were not all that different. This suggests that paternal proxy-reports can be included in studies along with maternal proxy-reports, and the two may be combined when looking at parent-child agreement. Parent-child agreement studies in Iran are not affected by parents' gender, and therefore, researchers may rely on the assumption of the interchangeability of fathers and mothers in these studies.
An Item Response Analysis of the Motor and Behavioral Subscales of the Unified Huntington's Disease Rating Scale in Huntington Disease Gene Expansion Carriers

PubMed Central

Vaccarino, Anthony L.; Anderson, Karen; Borowsky, Beth; Duff, Kevin; Giuliano, Joseph; Guttman, Mark; Ho, Aileen K.; Orth, Michael; Paulsen, Jane S.; Sills, Terrence; van Kammen, Daniel P.; Evans, Kenneth R.

2011-01-01

Although the Unified Huntington's Disease Rating Scale (UHDRS) is widely used in the assessment of Huntington disease (HD), the ability of individual items to discriminate individual differences in motor or behavioral manifestations has not been extensively studied in HD gene expansion carriers without a motor-defined clinical diagnosis (i.e., prodromal-HD or prHD). To elucidate the relationship between scores on individual motor and behavioral UHDRS items and total score for each subscale, a non-parametric item response analysis was performed on retrospective data from two multicentre, longitudinal studies. Motor and Behavioral assessments were supplied for 737 prHD individuals with data from 2114 visits (PREDICT-HD) and 686 HD individuals with data from 1482 visits (REGISTRY). Option characteristic curves were generated for UHDRS subscale items in relation to their subscale score. In prHD, overall severity of motor signs was low and participants had scores of 2 or above on very few items. In HD, motor items that assessed ocular pursuit, saccade initiation, finger tapping, tandem walking, and to a lesser extent saccade velocity, dysarthia, tongue protrusion, pronation/supination, Luria, bradykinesia, choreas, gait and balance on the retropulsion test were found to discriminate individual differences across a broad range of motor severity. In prHD, depressed mood, anxiety, and irritable behavior demonstrated good discriminative properties. In HD, depressed mood demonstrated a good relationship with the overall behavioral score. These data suggest that at least some UHDRS items appear to have utility across a broad range of severity, although many items demonstrate problematic features. PMID:21370269
An item response analysis of the motor and behavioral subscales of the unified Huntington's disease rating scale in huntington disease gene expansion carriers.

PubMed

Vaccarino, Anthony L; Anderson, Karen; Borowsky, Beth; Duff, Kevin; Giuliano, Joseph; Guttman, Mark; Ho, Aileen K; Orth, Michael; Paulsen, Jane S; Sills, Terrence; van Kammen, Daniel P; Evans, Kenneth R

2011-04-01

Although the Unified Huntington's Disease Rating Scale (UHDRS) is widely used in the assessment of Huntington disease (HD), the ability of individual items to discriminate individual differences in motor or behavioral manifestations has not been extensively studied in HD gene expansion carriers without a motor-defined clinical diagnosis (ie, prodromal-HD or prHD). To elucidate the relationship between scores on individual motor and behavioral UHDRS items and total score for each subscale, a nonparametric item response analysis was performed on retrospective data from 2 multicenter longitudinal studies. Motor and behavioral assessments were supplied for 737 prHD individuals with data from 2114 visits (PREDICT-HD) and 686 HD individuals with data from 1482 visits (REGISTRY). Option characteristic curves were generated for UHDRS subscale items in relation to their subscale score. In prHD, overall severity of motor signs was low, and participants had scores of 2 or above on very few items. In HD, motor items that assessed ocular pursuit, saccade initiation, finger tapping, tandem walking, and to a lesser extent, saccade velocity, dysarthria, tongue protrusion, pronation/supination, Luria, bradykinesia, choreas, gait, and balance on the retropulsion test were found to discriminate individual differences across a broad range of motor severity. In prHD, depressed mood, anxiety, and irritable behavior demonstrated good discriminative properties. In HD, depressed mood demonstrated a good relationship with the overall behavioral score. These data suggest that at least some UHDRS items appear to have utility across a broad range of severity, although many items demonstrate problematic features. Copyright © 2011 Movement Disorder Society.
Development and evaluation of CAHPS survey items assessing how well healthcare providers address health literacy.

PubMed

Weidmer, Beverly A; Brach, Cindy; Hays, Ron D

2012-09-01

The complexity of health information often exceeds patients' skills to understand and use it. To develop survey items assessing how well healthcare providers communicate health information. Domains and items for the Consumer Assessment of Healthcare Providers and Systems (CAHPS) Item Set for Addressing Health Literacy were identified through an environmental scan and input from stakeholders. The draft item set was translated into Spanish and pretested in both English and Spanish. The revised item set was field tested with a randomly selected sample of adult patients from 2 sites using mail and telephonic data collection. Item-scale correlations, confirmatory factor analysis, and internal consistency reliability estimates were estimated to assess how well the survey items performed and identify composite measures. Finally, we regressed the CAHPS global rating of the provider item on the CAHPS core communication composite and the new health literacy composites. A total of 601 completed surveys were obtained (52% response rate). Two composite measures were identified: (1) Communication to Improve Health Literacy (16 items); and (2) How Well Providers Communicate About Medicines (6 items). These 2 composites were significantly uniquely associated with the global rating of the provider (communication to improve health literacy: P<0.001, b=0.28; and communication about medicines composite: P=0.02, b=0.04). The 2 composites and the CAHPS core communication composite accounted for 51% of the variance in the global rating of the provider. A 5-item subset of the Communication to Improve Health Literacy composite accounted for 90% of the variance of the original 16-item composite. This study provides support for reliability and validity of the CAHPS Item Set for Addressing Health Literacy. These items can serve to assess whether healthcare providers have communicated effectively with their patients and as a tool for quality improvement.
Evaluation of complementary-alternative medicine (CAM) questionnaire development for Indonesian clinical psychologists: A pilot study.

PubMed

Liem, Andrian; Newcombe, Peter A; Pohlman, Annie

2017-08-01

This study aimed to evaluate questionnaire development to measure the knowledge of Complementary-Alternative Medicine (CAM), attitudes towards CAM, CAM experiences, and CAM educational needs of clinical psychologists in Indonesia. A 26-item questionnaire was developed through an extensive literature search. Data was obtained from provisional psychologists from the Master of Professional Clinical Psychology programs at two established public universities in urban areas of Indonesia. To validate the questionnaire, panel reviews by executive members of the Indonesian Clinical Psychology Association (ICPA), experts in health psychology, and experts in public health and CAM provided their professional judgements. The self-reporting questionnaire consisted of four scales including: knowledge of CAM (6 items), attitudes towards CAM (10 items), CAM experiences (4 items), and CAM educational needs (6 items). All scales, except CAM Experiences, were assessed on a 7-point Likert scale. Sixty provisional psychologists were eligible to complete the questionnaire with a response rate of 73% (N=44). The results showed that the CAM questionnaire was reliable (Cronbach's coefficient alpha range=0.62-0.96; item-total correlation range=0.14-0.92) and demonstrated content validity. Following further psychometric evaluation, the CAM questionnaire may provide the evidence-based information to inform the education and practice of Indonesian clinical psychologists. Copyright © 2017 Elsevier Ltd. All rights reserved.
Validation of the shortened Perceived Medical Condition Self-Management Scale in patients with chronic disease.

PubMed

Wild, Marcus G; Ostini, Remo; Harrington, Magdalena; Cavanaugh, Kerri L; Wallston, Kenneth A

2018-05-21

Self-efficacy, or perceived competence, has been identified as an important factor in self-management behaviors and health outcomes in patients with chronic disease. Measures of self-management self-efficacy are currently available for multiple forms of chronic disease. One established measure is the 8-item Perceived Medical Condition Self-Management Scale (PMCSMS). This study investigated the use of the PMCSMS in samples of patients with a chronic disease to develop an abbreviated version of the scale that could be more readily used in clinical contexts or in large population health cohort studies. The PMCSMS was administered as either a generic scale or as a disease-specific scale. The results of analyses using item response theory and classical test theory methods indicated that using 4 items of the scale resulted in similar internal consistency (α = .70-0.90) and temporal stability (test-retest r = .75 after 2 to 4 weeks) to the 8-item PMCSMS (r = .81 after 2 to 4 weeks). The 4 items selected had the greatest discriminability among participants (α parameters = 2.49-3.47). Scores from both versions also demonstrated similar correlations with related constructs such as health literacy (r = .13-0.29 vs. 0.14-0.27), self-rated health (r = .17-0.48 vs. 0.26-0.50), social support (r = .21-0.32 vs. 0.25-0.34), and medication adherence (r = .20-0.24 vs. 0.20-0.25). The results of this study indicate that 4-item PMCSMS scores are equally valid but more efficient, and have the potential to be beneficial for both research and clinical applications. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Development of the Career Anchors Scale among Occupational Health Nurses in Japan.

PubMed

Kubo, Yoshiko; Hatono, Yoko; Kubo, Tomohide; Shimamoto, Satoko; Nakatani, Junko; Burgel, Barbara J

2016-11-29

This study aimed to develop the Career Anchors Scale among Occupational Health Nurses (CASOHN) and evaluate its reliability and validity. Scale items were developed through a qualitative inductive analysis of interview data, and items were revised following an examination of content validity by experts and occupational health nurses (OHNs), resulting in a provisional scale of 41 items. A total of 745 OHNs (response rate 45.2%) affiliated with the Japan Society for Occupational Health participated in the self-administered questionnaire survey. Two items were deleted based on item-total correlations. Factor analysis was then conducted on the remaining 39 items to examine construct validity. An exploratory factor analysis with a main factor method and promax rotation resulted in the extraction of six factors. The variance contribution ratios of the six factors were 37.45, 7.01, 5.86, 4.95, 4.16, and 3.19%. The cumulative contribution ratio was 62.62%. The factors were named as follows: Demonstrating expertise and considering position in work (Factor 1); Management skills for effective work (Factor 2); Supporting health improvement in groups and organizations (Factor 3); Providing employee-focused support (Factor 4); Collaborating with occupational health team members and personnel (Factor 5); and Compatibility of work and private life (Factor 6). The confidence coefficient determined by the split-half method was 0.85. Cronbach's alpha coefficient for the overall scale was 0.95, whereas those of the six subscales were 0.88, 0.90, 0.91, 0.80, 0.85, and 0.79, respectively. CASOHN was found to be valid and reliable for measuring career anchors among OHNs in Japan.
Development and validation of the Dimensional Anhedonia Rating Scale (DARS) in a community sample and individuals with major depression.

PubMed

Rizvi, Sakina J; Quilty, Lena C; Sproule, Beth A; Cyriac, Anna; Michael Bagby, R; Kennedy, Sidney H

2015-09-30

Anhedonia, a core symptom of Major Depressive Disorder (MDD), is predictive of antidepressant non-response. In contrast to the definition of anhedonia as a "loss of pleasure", neuropsychological studies provide evidence for multiple facets of hedonic function. The aim of the current study was to develop and validate the Dimensional Anhedonia Rating Scale (DARS), a dynamic scale that measures desire, motivation, effort and consummatory pleasure across hedonic domains. Following item selection procedures and reliability testing using data from community participants (N=229) (Study 1), the 17-item scale was validated in an online study with community participants (N=150) (Study 2). The DARS was also validated in unipolar or bipolar depressed patients (n=52) and controls (n=50) (Study 3). Principal components analysis of the 17-item DARS revealed a 4-component structure mapping onto the domains of anhedonia: hobbies, food/drink, social activities, and sensory experience. Reliability of the DARS subscales was high across studies (Cronbach's α=0.75-0.92). The DARS also demonstrated good convergent and divergent validity. Hierarchical regression analysis revealed the DARS showed additional utility over the Snaith-Hamilton Pleasure Scale (SHAPS) in predicting reward function and distinguishing MDD subgroups. These studies provide support for the reliability and validity of the DARS. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Effect of desvenlafaxine 50 mg and 100 mg on energy and lassitude in patients with major depressive disorder: A pooled analysis.

PubMed

Lam, Raymond W; Wajsbrot, Dalia B; Meier, Ellen; Pappadopulos, Elizabeth; Mackell, Joan A; Boucher, Matthieu

2017-09-01

Nine randomized, double-blind, placebo-controlled studies of major depressive disorder were pooled to evaluate the effects of desvenlafaxine 50- and 100-mg/d on energy and lassitude in adults with major depressive disorder ( n=4279). Changes from baseline to endpoint in 17-item Hamilton Rating Scale for Depression (HAM-D 17 ) Work and Activities, Retardation, and Somatic Symptoms General items, HAM-D 17 psychomotor retardation factor, and Montgomery-Åsberg Depression Rating Scale Lassitude item were analyzed with a mixed model for repeated measures analysis of variance. Associations between residual energy measures and functional impairment, based on the Sheehan Disability Scale, were modeled using stepwise multiple linear regression. Improvement from baseline was significantly greater for both desvenlafaxine doses versus placebo on all energy symptom outcomes at week 8 (all p⩽0.005). Both early improvement in HAM-D 17 psychomotor retardation at week 2 and residual energy symptoms at week 8 were associated with Sheehan Disability Scale total score at week 8 (all p⩽0.001). Among Sheehan Disability Scale remitters and responders, the HAM-D 17 psychomotor retardation score at week 8 was significantly lower with desvenlafaxine (both doses) than placebo. Desvenlafaxine 50 and 100 mg/d significantly improved energy and lassitude symptoms in patients with major depressive disorder. Both early improvement in energy and fewer residual energy symptoms were associated with functional improvement.
Evaluation and revision of checklists for screening facilities and municipal governmental programs for gastric cancer and colorectal cancer screening in Japan.

PubMed

Higashi, Takahiro; Machii, Ryoko; Aoki, Ayako; Hamashima, Chisato; Saito, Hiroshi

2010-11-01

To evaluate the appropriateness of current checklists created by a governmental committee to assess screening programs run by municipal governments and service provider facilities for gastric and colorectal cancer, and to accumulate expert opinions to provide insights aimed at the next revision. We convened an expert panel that consisted of physicians nominated by regional offices of the Japanese Society for Gastrointestinal Cancer Screening and radiology technicians nominated by the technician chapter of the society. The panel rated the appropriateness of each checklist item on a scale of 1-9 (1, extremely inappropriate; 9, extremely appropriate) twice, between which they had a face-to-face discussion meeting. During the process they were allowed to propose modifications and additions to the items. In the first round of rating, the panelists rated all 57 and 56 checklists items for gastric and colorectal cancer, respectively, as appropriate based on an acceptance rule determined a priori. During the process of the face-to-face discussion, however, the panel proposed modifications to 23 (40%) and 22 (39%) items, respectively, and the addition of 27 new items each. After integrating overlapping items and rating again for appropriateness, 66 and 64 items, respectively, were accepted as the revised checklist set. The expert panel considered current checklists for colorectal and gastric cancer-screening programs and facilities to be suitable. Their proposals for a new set of checklist items will help further improve the checklists.
Inter-rater reliability of the German version of the Nurses' Global Assessment of Suicide Risk scale.

PubMed

Kozel, Bernd; Grieser, Manuela; Abderhalden, Christoph; Cutcliffe, John R

2016-10-01

In comparison to the general population, the suicide rates of psychiatric inpatient populations in Germany and Switzerland are very high. An important preventive contribution to the lowering of the suicide rates in mental health care is to ensure that the risk of suicide of psychiatric inpatients is assessed as accurately as possible. While risk-assessment instruments can serve an important function in determining such risk, very few have been translated to German. Therefore, in the present study, we reported on the German version of Nurses' Global Assessment of Suicide Risk (NGASR) scale. After translating the original instrument into German and pretesting the German version, we tested the inter-rater reliability of the instrument. Twelve video case studies were evaluated by 13 raters with the NGASR scale in a 'laboratory' trial. In each case, the observer's agreement was calculated for the single items, the overall scale, the risk levels, and the sum scores. The statistical data analysis was conducted with kappa and AC1 statistics for dichotomous (items, scale) scales. A high-to-very high observers' agreement (AC1: 0.62-1.00, kappa: 0.00-1.00) was determined for 16 items of the German version of the NGASR scale. We conclude that the German version of the NGASR scale is a reliable instrument for evaluating risk factors for suicide. A reliable application in the clinical practise appears to be enhanced by training in the use of the instrument and the right implementation instructions. © 2016 Australian College of Mental Health Nurses Inc.
Perpetration of Severe Intimate Partner Violence: Premilitary and Second Year of Service Rates

DTIC Science & Technology

2009-01-01

San Diego, CA 92186-5122. tCenter for the Study of Family Violence and Sexual Assault. Northern Illinois University. DeKalb. IL 60115-2854. The views...tended to focus only on male-to-female violence within married couples, the present study included any perpetration of SIPV perpetrated hy either male...a 5-point scale fO. 1, 2-5. 6-10. or >IO). Of the 18 CTS items, only the 5 items on the severe physical violence scale were used in the present study
Parent-teacher agreement on children's problems in 21 societies.

PubMed

Rescorla, Leslie A; Bochicchio, Lauren; Achenbach, Thomas M; Ivanova, Masha Y; Almqvist, Fredrik; Begovac, Ivan; Bilenberg, Niels; Bird, Hector; Dobrean, Anca; Erol, Nese; Fombonne, Eric; Fonseca, Antonio; Frigerio, Alessandra; Fung, Daniel S S; Lambert, Michael C; Leung, Patrick W L; Liu, Xianchen; Marković, Ivica; Markovic, Jasminka; Minaei, Asghar; Ooi, Yoon Phaik; Roussos, Alexandra; Rudan, Vlasta; Simsek, Zeynep; van der Ende, Jan; Weintraub, Sheila; Wolanczyk, Tomasz; Woo, Bernardine; Weiss, Bahr; Weisz, John; Zukauskiene, Rita; Verhulst, Frank C

2014-01-01

Parent-teacher cross-informant agreement, although usually modest, may provide important clinical information. Using data for 27,962 children from 21 societies, we asked the following: (a) Do parents report more problems than teachers, and does this vary by society, age, gender, or type of problem? (b) Does parent-teacher agreement vary across different problem scales or across societies? (c) How well do parents and teachers in different societies agree on problem item ratings? (d) How much do parent-teacher dyads in different societies vary in within-dyad agreement on problem items? (e) How well do parents and teachers in 21 societies agree on whether the child's problem level exceeds a deviance threshold? We used five methods to test agreement for Child Behavior Checklist (CBCL) and Teacher's Report Form (TRF) ratings. CBCL scores were higher than TRF scores on most scales, but the informant differences varied in magnitude across the societies studied. Cross-informant correlations for problem scale scores varied moderately across societies studied and were significantly higher for Externalizing than Internalizing problems. Parents and teachers tended to rate the same items as low, medium, or high, but within-dyad item agreement varied widely in every society studied. In all societies studied, both parental noncorroboration of teacher-reported deviance and teacher noncorroboration of parent-reported deviance were common. Our findings underscore the importance of obtaining information from parents and teachers when evaluating and treating children, highlight the need to use multiple methods of quantifying cross-informant agreement, and provide comprehensive baselines for patterns of parent-teacher agreement across 21 societies.
Symptoms of anxiety in depression: assessment of item performance of the Hamilton Anxiety Rating Scale in patients with depression.

PubMed

Vaccarino, Anthony L; Evans, Kenneth R; Sills, Terrence L; Kalali, Amir H

2008-01-01

Although diagnostically dissociable, anxiety is strongly co-morbid with depression. To examine further the clinical symptoms of anxiety in major depressive disorder (MDD), a non-parametric item response analysis on "blinded" data from four pharmaceutical company clinical trials was performed on the Hamilton Anxiety Rating Scale (HAMA) across levels of depressive severity. The severity of depressive symptoms was assessed using the 17-item Hamilton Depression Rating Scale (HAMD). HAMA and HAMD measures were supplied for each patient on each of two post-screen visits (n=1,668 observations). Option characteristic curves were generated for all 14 HAMA items to determine the probability of scoring a particular option on the HAMA in relation to the total HAMD score. Additional analyses were conducted using Pearson's product-moment correlations. Results showed that anxiety-related symptomatology generally increased as a function of overall depressive severity, though there were clear differences between individual anxiety symptoms in their relationship with depressive severity. In particular, anxious mood, tension, insomnia, difficulties in concentration and memory, and depressed mood were found to discriminate over the full range of HAMD scores, increasing continuously with increases in depressive severity. By contrast, many somatic-related symptoms, including muscular, sensory, cardiovascular, respiratory, gastro-intestinal, and genito-urinary were manifested primarily at higher levels of depression and did not discriminate well at lower HAMD scores. These results demonstrate anxiety as a core feature of depression, and the relationship between anxiety-related symptoms and depression should be considered in the assessment of depression and evaluation of treatment strategies and outcome.

Use of non-parametric item response theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS).

PubMed

Khan, Anzalee; Lewis, Charles; Lindenmayer, Jean-Pierre

2011-11-16

Nonparametric item response theory (IRT) was used to examine (a) the performance of the 30 Positive and Negative Syndrome Scale (PANSS) items and their options ((levels of severity), (b) the effectiveness of various subscales to discriminate among differences in symptom severity, and (c) the development of an abbreviated PANSS (Mini-PANSS) based on IRT and a method to link scores to the original PANSS. Baseline PANSS scores from 7,187 patients with Schizophrenia or Schizoaffective disorder who were enrolled between 1995 and 2005 in psychopharmacology trials were obtained. Option characteristic curves (OCCs) and Item Characteristic Curves (ICCs) were constructed to examine the probability of rating each of seven options within each of 30 PANSS items as a function of subscale severity, and summed-score linking was applied to items selected for the Mini-PANSS. The majority of items forming the Positive and Negative subscales (i.e. 19 items) performed very well and discriminate better along symptom severity compared to the General Psychopathology subscale. Six of the seven Positive Symptom items, six of the seven Negative Symptom items, and seven out of the 16 General Psychopathology items were retained for inclusion in the Mini-PANSS. Summed score linking and linear interpolation was able to produce a translation table for comparing total subscale scores of the Mini-PANSS to total subscale scores on the original PANSS. Results show scores on the subscales of the Mini-PANSS can be linked to scores on the original PANSS subscales, with very little bias. The study demonstrated the utility of non-parametric IRT in examining the item properties of the PANSS and to allow selection of items for an abbreviated PANSS scale. The comparisons between the 30-item PANSS and the Mini-PANSS revealed that the shorter version is comparable to the 30-item PANSS, but when applying IRT, the Mini-PANSS is also a good indicator of illness severity.
Use of NON-PARAMETRIC Item Response Theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS)

PubMed Central

2011-01-01

Background Nonparametric item response theory (IRT) was used to examine (a) the performance of the 30 Positive and Negative Syndrome Scale (PANSS) items and their options ((levels of severity), (b) the effectiveness of various subscales to discriminate among differences in symptom severity, and (c) the development of an abbreviated PANSS (Mini-PANSS) based on IRT and a method to link scores to the original PANSS. Methods Baseline PANSS scores from 7,187 patients with Schizophrenia or Schizoaffective disorder who were enrolled between 1995 and 2005 in psychopharmacology trials were obtained. Option characteristic curves (OCCs) and Item Characteristic Curves (ICCs) were constructed to examine the probability of rating each of seven options within each of 30 PANSS items as a function of subscale severity, and summed-score linking was applied to items selected for the Mini-PANSS. Results The majority of items forming the Positive and Negative subscales (i.e. 19 items) performed very well and discriminate better along symptom severity compared to the General Psychopathology subscale. Six of the seven Positive Symptom items, six of the seven Negative Symptom items, and seven out of the 16 General Psychopathology items were retained for inclusion in the Mini-PANSS. Summed score linking and linear interpolation was able to produce a translation table for comparing total subscale scores of the Mini-PANSS to total subscale scores on the original PANSS. Results show scores on the subscales of the Mini-PANSS can be linked to scores on the original PANSS subscales, with very little bias. Conclusions The study demonstrated the utility of non-parametric IRT in examining the item properties of the PANSS and to allow selection of items for an abbreviated PANSS scale. The comparisons between the 30-item PANSS and the Mini-PANSS revealed that the shorter version is comparable to the 30-item PANSS, but when applying IRT, the Mini-PANSS is also a good indicator of illness severity. PMID:22087503
Revision and validation of a scale to assess pregnancy stress.

PubMed

Chen, Chung-Hey

2015-03-01

Pregnancy is a potentially stressful event. Prenatal stress alters maternal endocrine and immune systems, has been implicated in the etiology of prenatal complications or postnatal psychiatric disorders, and may adversely affect fetal health. The 30-item Pregnancy Stress Rating Scale (PSRS), initially developed in 1983 by Chen and colleagues, is the only measure to date designed specifically to evaluate prenatal stress. The purpose of this study was to reconsider and revise the 30-item PSRS and validate the new PSRS. A cross-sectional design was used. Adding new items of pregnancy stress generated from clinical experience and expert recommendations resulted in a 40-item revised PSRS that was more reflective of current social conditions. Three hundred pregnant women, recruited from the antenatal clinic of a medical center in southern Taiwan, completed the revised PSRS to assess its internal consistency, test-retest reliability, construct validity, and convergent and discriminate validity. The final 36-item PSRS (PSRS36) was derived by deleting four items with relatively low item-total correlation coefficients or factor loadings. The resultant 36-item scale showed good internal consistency (α = .92) and 2-week test-retest reliability (r = .82). Factor analysis confirmed construct validity and suggested five prenatal stress dimensions, which explained 52.17% of the total variance. Convergent and discriminate validities were indicated by significant correlations among the PSRS36, Perceived Stress Scale, and Interpersonal Support Evaluation List. The PSRS36 is a psychometrically sound and practical tool for nurses and other healthcare providers to assess prenatal stress and to examine intervention protocols in Taiwanese prenatal women. More research is recommended to determine whether the PSRS36 may be used in other racial-ethnic groups.
ABILOCO-Kids: a Rasch-built 10-item questionnaire for assessing locomotion ability in children with cerebral palsy.

PubMed

Caty, Gilles D; Gilles, Caty D; Arnould, Carlyne; Thonnard, Jean-Louis; Lejeune, Thierry M

2008-11-01

To develop a questionnaire (ABILOCO-Kids) based on the Rasch measurement model that assesses locomotion ability in children with cerebral palsy. Prospective study and questionnaire development. A total of 113 children with cerebral palsy (10 (standard deviation 2.5) years old). A 41-item questionnaire was developed based on existing scales and on the clinical experience of professionals in the field of rehabilitation. This questionnaire was tested separately on the 113 children with cerebral palsy and their parents. Their responses were analysed using the Rasch model (RUMM-2020) to select items that had an ordered rating scale and that fit a unidimensional model. The final ABILOCO-Kids scale consisted of 10 locomotion activities, of which difficulty was rated by the parents. The parents gave a more precise assessment of their children's ability than the children themselves, leading to a wider range of measurement that was well-targeted on the sample population and that had good reliability (r=0.97) and reproducibility (intraclass correlation coefficient=0.96). Item calibration did not vary with age, sex or clinical presentation (hemiplegia, diplegia, quadriplegia). The concurrent validity of the ABILOCO-Kids questionnaire was also shown by its correlation with the Gross Motor Function Classification System. The ABILOCO-Kids questionnaire has good psychometric qualities for measuring a wide range of locomotion abilities in children with cerebral palsy.
Judgmental Bias in the Rating of Attitude Statements

ERIC Educational Resources Information Center

Bruvold, William H.

1975-01-01

Judges holding divergent attitudes rated two sets of statements regarding uses of water reclaimed from sewage. Results showed a close linear relationship between item scale values obtained from positive and negative attitudinal groups, and a somewhat reduced rating range for judges holding unfavorable personal attitudes toward reuse. (Author/RC)
Rasch analysis of the patient-rated wrist evaluation questionnaire.

PubMed

Esakki, Saravanan; MacDermid, Joy C; Vincent, Joshua I; Packham, Tara L; Walton, David; Grewal, Ruby

2018-01-01

The Patient-Rated Wrist Evaluation (PRWE) was developed as a wrist joint specific measure of pain and disability and evidence of sound validity has been accumulated through classical psychometric methods. Rasch analysis (RA) has been endorsed as a newer method for analyzing the clinical measurement properties of self-report outcome measures. The purpose of this study was to evaluate the PRWE using Rasch modeling. We employed the Rasch model to assess overall fit, response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 382 patients with distal radius fracture was recruited from the hand and upper limb clinic at large academic healthcare organization, London, Ontario, Canada, 6-month post-injury scores of the PRWE was used. RA was conducted on the 3 subscales (pain, specific activities, and usual activities) of the PRWE separately. The pain subscale adequately fit the Rasch model when item 4 "Pain - When it is at its worst" was deleted to eliminate non-uniform DIF by age group, and item 5 "How often do you have pain" was rescored by collapsing into 8 intervals to eliminate disordered thresholds. Uniform DIF for "Use my affected hand to push up from the chair" (by work status) and "Use bathroom tissue with my affected hand" (by injured hand) was addressed by splitting the items for analysis. After background rescoring of 2 items in pain subscale, 2 items in specific activities and 3 items in usual activities, all three subscales of the PRWE were well targeted and had high reliability (PSI = 0.86). These changes provided a unidimensional, interval-level scaled measure. Like a previous analysis of the Patient-Rated Wrist and Hand Evaluation, this study found the PRWE could be fit to the Rasch model with rescoring of multiple items. However, the modifications required to achieve fit were not the same across studies, our fit statistics also suggested one of the pain items should be deleted. This study adds to the pool of evidence supporting the PRWE, but cannot confidently provide a Rasch-based scoring algorithm.
Evaluating Counseling Practicum Students

ERIC Educational Resources Information Center

Borman, Christopher A.; Ramirez, Carlos

1975-01-01

The purpose of this study was to determine significant differences in the self-ratings of counseling practicum students, their supervisor, and practicum assistants using the Counselor Evaluation Rating Scale (CERS). Findings indicated significant differences for 9 of the 27 items on the instrument. (Author)
Incremental validity of the Minnesota Multiphasic Personality Inventory-2 and symptom checklist-90-revised with mental health inpatients.

PubMed

Simonds, Elise C; Handel, Richard W; Archer, Robert P

2008-03-01

This study evaluated the incremental validity of scores from the Minnesota Multiphasic Personality Inventory-2 (MMPI-2) and the Symptom Checklist-90-Revised (SCL-90-R) in a sample of mental health inpatients originally published by Archer, Griffin, and Aiduk (1995). The incremental validity of scores from the SCL-90-R primary symptom dimensions and MMPI-2 Clinical, Content, and Restructured Clinical scales was assessed in a sample of 544 mental health inpatients using conceptually related items from the Brief Psychiatric Rating Scale (BPRS) as criteria. A series of hierarchical multiple regressions indicated that scores from the SCL-90-R primary symptom dimensions exhibited limited incremental validity (Mdn DeltaR(2) = .01, range = 0-.01), whereas scores from MMPI-2 scales contributed additional information in the prediction of ratings on all but one BPRS item (Mdn DeltaR( 2) = .08, range = .04-.12).
Disruptive behaviors in the classroom: initial standardization data on a new teacher rating scale.

PubMed

Burns, G L; Owen, S M

1990-10-01

This study presents initial standardization data on the Sutter-Eyberg Student Behavior Inventory (SESBI), a teacher-completed measure of disruptive classroom behaviors. SESBIs were completed on 1116 children in kingergarten through fifth grade in a rural eastern Washington school district. Various analyses (Cronbach's alpha, corrected item-total correlations, average interitem correlations, principal components analyses) indicated that the SESBI provides a homogeneous measure of disruptive behaviors. Support was also found for three factors within the scale (e.g., overt aggression, oppositional behavior, and attentional difficulties). While the child's age did not have a significant effect on the SESBI, the child's gender did have a significant effect on scale scores as well as on most of the items, with males being rated more problematic than females. The SESBI was also able to discriminate between children in treatment for behavioral problems or learning disabilities and children not in treatment.
[Late-onset depression and a new psychometric scale for its clinical evaluation].

PubMed

Ivanets, N N; Kinkul'kina, M A; Avdeeva, T I

2012-01-01

The most of existed psychometric scales for depression have some shortcomings hampering their use in old patients. The authors worked out the original scale for clinical evaluation of symptoms of late-onset depression. The list of symptoms was made up basing on literature data. The most significant symptoms that characterized the structure and severity of depression in old patients were singled out. According to results of factor analyses they were combined in the groups forming the corresponding items of the scale. In addition, some symptoms with particular clinical significance for late-onset depression (suicidal thoughts, senesto-hypochondriac symptoms, insight) were singled out. The scale comprises 13 items with scores from -6 to +6. It can be implemented for symptom screening, clinical diagnosis and rating, including dynamics of depression in elderly patients.
Dalhousie dyspnea scales: construct and content validity of pictorial scales for measuring dyspnea.

PubMed

McGrath, Patrick J; Pianosi, Paul T; Unruh, Anita M; Buckley, Chloe P

2005-08-30

Because there are no child-friendly, validated, self-report measures of dyspnea or breathlessness, we developed, and provided initial validation, of three, 7-item, pictorial scales depicting three sub-constructs of dyspnea: throat closing, chest tightness, and effort. We developed the three scales (Throat closing, Chest tightness, and Effort) using focus groups with 25 children. Subsequently, seventy-nine children (29 children with asthma, 30 children with cystic fibrosis. and 20 children who were healthy) aged 6 to 18 years rated each picture in each series, using a 0-10 scale. In addition, each child placed each picture in each series on a 100-cm long Visual Analogue Scale, with the anchors "not at all" and "a lot". Children aged eight years or older rated the scales in the correct order 75% to 98% correctly, but children less than 8 years of age performed unreliably. The mean distance between each consecutive item in each pictorial scale was equal. Preliminary results revealed that children aged 8 to 18 years understood and used these three scales measuring throat closing, chest tightness, and effort appropriately. The scales appear to accurately measure the construct of breathlessness, at least at an interval level. Additional research applying these scales to clinical situations is warranted.
Adherence Rating Scale for Cognitive Processing Therapy - Cognitive Only: Analysis of Psychometric Properties.

PubMed

Dittmann, Clara; Müller-Engelmann, Meike; Resick, Patricia A; Gutermann, Jana; Stangier, Ulrich; Priebe, Kathlen; Fydrich, Thomas; Ludäscher, Petra; Herzog, Julia; Steil, Regina

2017-11-01

The assessment of therapeutic adherence is essential for accurately interpreting treatment outcomes in psychotherapy research. However, such assessments are often neglected. To fill this gap, we aimed to develop and test a scale that assessed therapeutic adherence to Cognitive Processing Therapy - Cognitive Only (CPT), which was adapted for a treatment study targeting patients with post-traumatic stress disorder and co-occurring borderline personality symptoms. Two independent, trained raters assessed 30 randomly selected treatment sessions involving seven therapists and eight patients who were treated in a multicentre randomized controlled trial. The inter-rater reliability for all items and the total score yielded good to excellent results (intraclass correlation coefficient [ICC] = 0.70 to 1.00). Cronbach's α was .56 for the adherence scale. Regarding content validity, three experts confirmed the relevance and appropriateness of each item. The adherence rating scale for the adapted version of CPT is a reliable instrument that can be helpful for interpreting treatment effects, analysing possible relationships between therapeutic adherence and treatment outcomes and teaching therapeutic skills.
Measuring emotion socialization in families affected by pediatric cancer: Refinement and reduction of the Parents' Beliefs about Children's Emotions questionnaire.

PubMed

Beitra, Danette; El-Behadli, Ana F; Faith, Melissa A

2018-01-01

The aim of this study is to conduct a multimethod psychometric reduction in the Parents' Beliefs about Children's Emotions (PBCE) questionnaire using an item response theory framework with a pediatric oncology sample. Participants were 216 pediatric oncology caregivers who completed the PBCE. The PBCE contains 105 items (11 subscales) rated on a 6-point Likert-type scale. We evaluated the PBCE subscale performance by applying a partial credit model in WINSTEPS. Sixty-six statistically weak items were removed, creating a 44-item PBCE questionnaire with 10 subscales and 3 response options per item. The refined scale displayed good psychometric properties and correlated .910 with the original PBCE. Additional analyses examined dimensionality, item-level (e.g. difficulty), and person-level (e.g. ethnicity) characteristics. The refined PBCE questionnaire provides better test information, improves instrument reliability, and reduces burden on families, providers, and researchers. With this improved measure, providers can more easily identify families who may benefit from psychosocial interventions targeting emotion socialization. The results of the multistep approach presented should be considered preliminary, given the limited sample size.
Uncovering Predictors of Disagreement: Ensuring the Quality of Expert Ratings

ERIC Educational Resources Information Center

Hoth, Jessica; Schwarz, Björn; Kaiser, Gabriele; Busse, Andreas; König, Johannes; Blömeke, Sigrid

2016-01-01

Rating scales are a popular item format used in many types of assessments. Yet, defining which rating is correct often represents a challenge. Using expert ratings as benchmarks is one approach to ensuring the quality of a rating instrument. In this paper, such expert ratings are analyzed in detail taking a video-based test instrument of teachers'…
Cross-informant agreement between parent-reported and adolescent self-reported problems in 25 societies.

PubMed

Rescorla, Leslie A; Ginzburg, Sofia; Achenbach, Thomas M; Ivanova, Masha Y; Almqvist, Fredrik; Begovac, Ivan; Bilenberg, Niels; Bird, Hector; Chahed, Myriam; Dobrean, Anca; Döpfner, Manfred; Erol, Nese; Hannesdottir, Helga; Kanbayashi, Yasuko; Lambert, Michael C; Leung, Patrick W L; Minaei, Asghar; Novik, Torunn S; Oh, Kyung-Ja; Petot, Djaouida; Petot, Jean-Michel; Pomalima, Rolando; Rudan, Vlasta; Sawyer, Michael; Simsek, Zeynep; Steinhausen, Hans-Christoph; Valverde, José; Ende, Jan van der; Weintraub, Sheila; Metzke, Christa Winkler; Wolanczyk, Tomasz; Zhang, Eugene Yuqing; Zukauskiene, Rita; Verhulst, Frank C

2013-01-01

We used population sample data from 25 societies to answer the following questions: (a) How consistently across societies do adolescents report more problems than their parents report about them? (b) Do levels of parent-adolescent agreement vary among societies for different kinds of problems? (c) How well do parents and adolescents in different societies agree on problem item ratings? (d) How much do parent-adolescent dyads within each society vary in agreement on item ratings? (e) How well do parent-adolescent dyads within each society agree on the adolescent's deviance status? We used five methods to test cross-informant agreement for ratings obtained from 27,861 adolescents ages 11 to 18 and their parents. Youth Self-Report (YSR) mean scores were significantly higher than Child Behavior Checklist (CBCL) mean scores for all problem scales in almost all societies, but the magnitude of the YSR-CBCL discrepancy varied across societies. Cross-informant correlations for problem scale scores varied more across societies than across types of problems. Across societies, parents and adolescents tended to rate the same items as low, medium, or high, but within-dyad parent-adolescent item agreement varied widely in every society. In all societies, both parental noncorroboration of self-reported deviance and adolescent noncorroboration of parent-reported deviance were common. Results indicated many multicultural consistencies but also some important differences in parent-adolescent cross-informant agreement. Our findings provide valuable normative baselines against which to compare multicultural findings for clinical samples.
Accurate and scalable social recommendation using mixed-membership stochastic block models

PubMed Central

Godoy-Lorite, Antonia; Moore, Cristopher

2016-01-01

With increasing amounts of information available, modeling and predicting user preferences—for books or articles, for example—are becoming more important. We present a collaborative filtering model, with an associated scalable algorithm, that makes accurate predictions of users’ ratings. Like previous approaches, we assume that there are groups of users and of items and that the rating a user gives an item is determined by their respective group memberships. However, we allow each user and each item to belong simultaneously to mixtures of different groups and, unlike many popular approaches such as matrix factorization, we do not assume that users in each group prefer a single group of items. In particular, we do not assume that ratings depend linearly on a measure of similarity, but allow probability distributions of ratings to depend freely on the user’s and item’s groups. The resulting overlapping groups and predicted ratings can be inferred with an expectation-maximization algorithm whose running time scales linearly with the number of observed ratings. Our approach enables us to predict user preferences in large datasets and is considerably more accurate than the current algorithms for such large datasets. PMID:27911773
Transformational, transactional among physician and laissez-faire leadership among physician executives.

PubMed

Xirasagar, Sudha

2008-01-01

The purpose of this paper is to examine the empirical validity of transformational, transactional and laissez-faire leadership and their sub-scales among physician managers. A nation-wide, anonymous mail survey was carried out in the United States, requesting community health center executive directors to provide ratings of their medical director's leadership behaviors (34 items) and effectiveness (nine items), using the Multifactor Leadership Questionnaire 5X-Short, on a five-point Likert scale. The survey response rate was 40.9 percent, for a total 269 responses. Exploratory factor analysis was done, using principal factor extraction, followed by promax rotation). The data yielded a three-factor structure, generally aligned with Bass and Avolio's constructs of transformational, transactional and laissez-faire leadership. Data do not support the factorial independence of their subscales (idealized influence, inspirational motivation, individualized consideration, and intellectual stimulation under transformational leadership; contingent reward, management-by-exception active, and management-by-exception passive under transactional leadership). Two contingent reward items loaded on transformational leadership, and all items of management-by-exception passive loaded on laissez-faire. A key limitation is that supervisors were surveyed for ratings of the medical directors' leadership style. Although past research in other fields has shown that supervisor ratings are strongly correlated with subordinate ratings, further research is needed to validate the findings by surveying physician and other clinical subordinates. Such research will also help to develop appropriate content of leadership training for clinical leaders. This study represents an important step towards establishing the empirical evidence for the full range of leadership constructs among physician leaders.
SA30. Self-Assessment of Amotivation and Insight into Patients With Schizophrenia

PubMed Central

Papsuev, Oleg; Movina, Larisa; Minyaycheva, Maria; Luther, Lauren

2017-01-01

Abstract Background: Schizophrenia is a disabling disorder characterized by negative and cognitive symptoms. The negative symptom domain of low motivation has recently been found to be an important determinant of functioning. Currently, motivation is frequently assessed with either self-rated or clinician-rated motivation measures. However, little is known about the overlap between self-rated and clinician-rated motivation and whether these two assessment types are differentially related to clinical variables. Therefore, this study investigated (1) the association between self-rated and clinician-rated motivation, (2) the clinical correlates of both motivation assessment types, and (3) the correlates of the discrepancy between the motivation assessments types. Methods: Fifty patients with schizophrenia spectrum disorders were assessed by trained clinicians using the Positive and Negative Syndrome Scale (PANSS), the Calgary Depression Scale (CDSS), and both the clinician-rated (C) and self-rated (S) versions of the Apathy Evaluation Scale (AES). Neurocognition was assessed with the Brief Assessment of Cognition in Schizophrenia (BACS). Social cognition was assessed with the Hinting Task, the Relationships Across Domains measure, and the Ekman-60 emotion recognition task. Results: The AES-C and AES-S were positively correlated (r = .43; P < .05). Further, moderate, positive correlations were established between the AES-C and most of the PANSS amotivation subscale items (N2 (r = .51), N4 (r = .45)). However, a significant correlation between the AES-C and the G16 item of the PANSS amotivation subscale was not observed. The AES-S was not significantly correlated with any of the PANSS amotivation items. The AES-C did not correlate with the PANSS depression item or the CDSS total score, while moderate correlations with the AES-S were observed with both (r = .38 and r = .45, respectively). The AES-C/AES-S discrepancy score was positively correlated with the PANSS insight item (r = 0.39) and the presence of a paranoid schizophrenia diagnosis (r = .32). No significant correlations were observed between the discrepancy score and the BACS, social cognition measures, or additional demographic variables. Conclusion: While the clinician-rated AES is regarded as a sensitive instrument for the assessment of apathetic/amotivation schizophrenia symptoms, our results suggest that scores from the self-rated AES need to be interpreted carefully. Our findings also indicate that patients with schizophrenia might be less aware of primary negative (i.e., amotivation) symptoms, and when asked to self-rate negative symptoms, they rate secondary negative symptoms caused by depression. Results also suggest that reduced insight might be driving part of the discrepancy between self-rated and clinician-rated motivation. Findings should be considered when choosing motivation measures.
Generalized IRT Models for Extreme Response Style

ERIC Educational Resources Information Center

Jin, Kuan-Yu; Wang, Wen-Chung

2014-01-01

Extreme response style (ERS) is a systematic tendency for a person to endorse extreme options (e.g., strongly disagree, strongly agree) on Likert-type or rating-scale items. In this study, we develop a new class of item response theory (IRT) models to account for ERS so that the target latent trait is free from the response style and the tendency…
How I Feel About Some Other Kids.

ERIC Educational Resources Information Center

Purdue Univ., Lafayette, IN. Educational Research Center.

This rating scale was developed to yield a measure of peer acceptance and socialization for students in grades 1-6. Each child is asked to consider his classmates in terms of three sets of questions, each set having 20 items. The child responds to the question by circling yes or no or sometimes on the answer sheet. Items are organized around three…

The second version of the L. V. Prasad-functional vision questionnaire.

PubMed

Gothwal, Vijaya K; Sumalini, Rebecca; Bharani, Seelam; Reddy, Shailaja P; Bagga, Deepak K

2012-11-01

The L. V. Prasad-Functional Vision Questionnaire (LVP-FVQ) was developed using Rasch analysis to assess self-reported difficulties in performing daily tasks in school children with visual impairment (VI) in India. However, the LVP-FVQ has psychometric problems of inadequate measurement precision and lack of detailed assessment of dimensionality. Furthermore, items pertaining to use of technology are lacking. The aim of this study was to present the development and validation of the second version of LVP-FVQ (LVP-FVQ II). Development of LVP-FVQ II involved extracting items from other similar questionnaires (albeit developed for Western populations) and focus group discussions of children with VI and their parents that resulted in a 32-item pilot questionnaire. Overall, six items from the LVP-FVQ were retained. The questionnaire underwent pilot testing in 25 such children, following which a 27-item LVP-FVQ II emerged, and this was administered to 150 children with VI. Response to each item was rated on a three-category scale. Rasch analysis was used to validate the LVP-FVQ II. Rating scale was used by participants as was intended to. Four mobility-related items required deletion, as these did not contribute toward measurement of a single construct, indicating a secondary dimension. Deletion of the four items resulted in the 23-item unidimensional LVP-FVQ II, with good measurement precision, effective targeting of item difficulty to participant ability, and lack of notable differential item functioning. The LVP-FVQ II has high reliability, indicating that it is effectively able to discriminate between visual disability of school children in India, and is valid across age, gender, duration of VI, and location of residence. Given the superior measurement properties and the interval-level scores, the LVP-FVQ II appears to offer advantages over LVP-FVQ in assessment of difficulties in performing daily tasks in this population. It can be adapted for use in other developing countries.
Rating Scale Analysis and Psychometric Properties of the Caregiver Self-Efficacy Scale for Transfers

ERIC Educational Resources Information Center

Cipriani, Daniel J.; Hensen, Francine E.; McPeck, Danielle L.; Kubec, Gina L. D.; Thomas, Julie J.

2012-01-01

Parents and caregivers faced with the challenges of transferring children with disability are at risk of musculoskeletal injuries and/or emotional stress. The Caregiver Self-Efficacy Scale for Transfers (CSEST) is a 14-item questionnaire that measures self-efficacy for transferring under common conditions. The CSEST yields reliable data and valid…
Measuring Functional Creativity: Non-Expert Raters and the Creative Solution Diagnosis Scale

ERIC Educational Resources Information Center

Cropley, David H.; Kaufman, James C.

2012-01-01

The Creative Solution Diagnosis Scale (CSDS) is a 30-item scale based on a core of four criteria: Relevance & Effectiveness, Novelty, Elegance, and Genesis. The CSDS offers potential for the consensual assessment of functional product creativity. This article describes an empirical study in which non-expert judges rated a series of mousetrap…
Disparity between General Symptom Relief and Remission Criteria in the Positive and Negative Syndrome Scale (PANSS): A Post-treatment Bifactor Item Response Theory Model.

PubMed

Anderson, Ariana E; Reise, Steven P; Marder, Stephen R; Mansolf, Maxwell; Han, Carol; Bilder, Robert M

2017-12-01

Objective: Total scale scores derived by summing ratings from the 30-item PANSS are commonly used in clinical trial research to measure overall symptom severity, and percentage reductions in the total scores are sometimes used to document the efficacy of treatment. Acknowledging that some patients may have substantial changes in PANSS total scores but still be sufficiently symptomatic to warrant diagnosis, ratings on a subset of 8 items, referred to here as the "Remission set," are sometimes used to determine if patients' symptoms no longer satisfy diagnostic criteria. An unanswered question remains: is the goal of treatment better conceptualized as reduction in overall symptom severity, or reduction in symptoms below the threshold for diagnosis? We evaluated the psychometric properties of PANSS total scores, to assess whether having low symptom severity post-treatment is equivalent to attaining Remission. Design: We applied a bifactor item response theory (IRT) model to post-treatment PANSS ratings of 3,647 subjects diagnosed with schizophrenia assessed at the termination of 11 clinical trials. The bifactor model specified one general dimension to reflect overall symptom severity, and five domain-specific dimensions. We assessed how PANSS item discrimination and information parameters varied across the range of overall symptom severity (θ), with a special focus on low levels of symptoms (i.e., θ<-1), which we refer to as "Relief" from symptoms. A score of θ=-1 corresponds to an expected PANSS item score of 1.83, a rating between "Absent" and "Minimal" for a PANSS symptom. Results: The application of the bifactor IRT model revealed: (1) 88% of total score variation was attributable to variation in general symptom severity, and only 8% reflected secondary domain factors. This implies that a general factor may provide a good indicator of symptom severity, and that interpretation is not overly complicated by multidimensionality; (2) Post-treatment, 534 individuals (about 15% of the whole sample) scored in the "Relief" range of general symptom severity, but more than twice that number (n = 1351) satisfied Remission criteria (37%). 2 in 3 Remitted patients had scores that were not in a low symptom range (corresponding to Absent or Minimal item scores); (3) PANSS items vary greatly in their ability to measure the general symptom severity dimension; while many items are highly discriminating and relatively "pure" indicators of general symptom severity (delusions, conceptual disorganization), others are better indicators of specific dimensions (blunted affect, depression). The utility of a given PANSS item for assessing a patient depended on the illness level of the patient. Conclusion: Satisfying conventional Remission criteria was not strongly associated with low levels of symptoms. The items providing the most information for patients in the symptom Relief range were Delusions, Preoccupation, Suspiciousness Persecution, Unusual Thought Content, Conceptual Disorganization, Stereotyped Thinking, Active Social Avoidance, and Lack of Judgment and Insight. Lower scores on these items (item scores ≤2) were strongly associated with having a low latent trait θ or experiencing overall symptom relief. The inter-rater agreement between Remission and Relief subjects suggested that these criteria identified different subsets of patients. Alternative subsets of items may offer better indicators of general symptom severity and provide better discrimination (and lower standard errors) for scaling individuals and judging symptom relief, where the "best" subset of items ultimately depends on the illness range and treatment phase being evaluated.
Attention checklist: a rating scale for mildly mentally handicapped adolescents.

PubMed

Das, J P; Melnyk, L

1989-06-01

A check list for attentional deficits without reference to hyperactive behavior observed in the classroom was constructed, and teachers' ratings were factor analyzed. The check-list rating was compared to a widely used rating scale for attention deficit-hyperactive disorder (AD-HD), the Abbreviated Conners Rating Scale. Both scales were given to 15 teachers to rate 100 mildly mentally handicapped adolescent students. Analysis showed that 33% of the mentally handicapped students were rated above 1.5 on the Conners Scale, which is the cut-off for hyperactivity. This is much higher than the prevalence of hyperactivity in regular classrooms. The two sets of ratings correlated strongly (.84). Check-list items were grouped under one factor explaining 70.7% of variance and so are recommended for use in discriminating attentional deficit in mentally handicapped as well as in regular class students. The high correlation with ratings on the Conners Scale suggests that AD-HD is a unitary syndrome with attention being most problematic for children labeled hyperactive.
Psychometric properties of the Finnish version of the Women's Health Questionnaire.

PubMed

Katainen, Riina E; Engblom, Janne R; Vahlberg, Tero J; Polo-Kantola, Päivi

2017-08-01

The Women's Health Questionnaire (WHQ) is a validated and commonly used instrument for measuring climacteric-related symptoms. A revised version was previously developed. However, validation in a Finnish population is lacking. As it is important to use qualified instruments, we performed a validation study of the WHQ in a Finnish population. In all, 3,421 women, aged 41 to 54 years, formed the study population. In the original 36-item WHQ, the items were rated on a 1 to 4 scale and on a binary scale (0-1). The scaling of the revised 23-item WHQ was 0 to 100. We evaluated the psychometric properties (internal consistency, correlations between the symptom domains, factor structure, and sampling adequacy) in all three versions. For the 1 to 4 scale and on the revised version of the WHQ, the internal consistency was acceptable (the Cronbach's α coefficients >0.70) for most of the domains. On the binary scale, the majority of the coefficient values were below the acceptable level. The original symptom domains, especially those on the revised version, were recognizable from the factors in the exploratory factor analysis, but there were some limitations. The Kaiser-Meyer-Olkin values were high. The WHQ is a valid instrument for measuring climacteric-related symptoms in Finnish middle-aged women. The psychometric properties of the revised 23-item WHQ were as good or even better than those of the original 36-item WHQ. Thus, we encourage use of the revised version.
Do early changes in the HAM-D-17 anxiety/somatization factor items affect the treatment outcome among depressed outpatients? Comparison of two controlled trials of St John's wort (Hypericum perforatum) versus a SSRI.

PubMed

Bitran, Stella; Farabaugh, Amy H; Ameral, Victoria E; LaRocca, Rachel A; Clain, Alisabet J; Fava, Maurizio; Mischoulon, David

2011-07-01

To assess whether early changes in Hamilton Depression Rating Scale-17 anxiety/somatization items predict remission in two controlled studies of Hypericum perforatum (St John's wort) versus selective serotonin reuptake inhibitors for major depressive disorder. The Hypericum Depression Trial Study Group (National Institute of Mental Health) randomized 340 patients to Hypericum, sertraline, or placebo for 8 weeks, whereas the Massachusetts General Hospital study randomized 135 patients to Hypericum, fluoxetine, or placebo for 12 weeks. The investigators examined whether remission was associated with early changes in anxiety/somatization symptoms. In the National Institute of Mental Health study, significant associations were observed between remission and early improvement in the anxiety (psychic) item (sertraline arm), somatic (gastrointestinal item; Hypericum arm), and somatic (general) symptoms (placebo arm). None of the three treatment arms of the Massachusetts General Hospital study showed significant associations between anxiety/somatization symptoms and remission. When both study samples were pooled, we found associations for anxiety (psychic; selective serotonin reuptake inhibitors arm), somatic (gastrointestinal), and hypochondriasis (Hypericum arm), and anxiety (psychic) and somatic (general) symptoms (placebo arm). In the entire sample, remission was associated with the improvement in the anxiety (psychic), somatic (gastrointestinal), and somatic (general) items. The number and the type of anxiety/somatization items associated with remission varied depending on the intervention. Early scrutiny of the Hamilton Depression Rating Scale-17 anxiety/somatization items may help to predict remission of major depressive disorder.
Validity, sensitivity and specificity of the mentation, behavior and mood subscale of the UPDRS.

PubMed

Holroyd, Suzanne; Currie, Lillian J; Wooten, G Frederick

2008-06-01

The unified Parkinson's disease rating scale (UPDRS) is the most widely used tool to rate the severity and the stage of Parkinson's disease (PD). However, the mentation, behavior and mood (MBM) subscale of the UPDRS has received little investigation regarding its validity and sensitivity. Three items of this subscale were compared to criterion tests to examine validity, sensitivity and specificity. Ninety-seven patients with idiopathic PD were assessed on the UPDRS. Scores on three items of the MBM subscale, intellectual impairment, thought disorder and depression, were compared to criterion tests, the telephone interview for cognition status (TICS), psychiatric assessment for psychosis and the geriatric depression scale (GDS). Non-parametric tests of association were performed to examine concurrent validity of the MBM items. The sensitivities, specificities and optimal cutoff scores for each MBM item were estimated by receiver operating characteristic (ROC) curve analysis. The MBM items demonstrated low to moderate correlation with the criterion tests, and the sensitivity and specificity were not strong. Even using a score of 7.0 on the items of the MBM demonstrated a sensitivity/specificity of only 0.19/0.48 for intellectual impairment, 0.60/0.72 for thought disorder and 0.61/0.87 for depression. Using a more appropriate cutoff of 2.0 revealed sensitivities of 0.01, 0.38 and 0.13 respectively. The MBM subscale items of intellectual impairment, thought disorder and depression are not appropriate for screening or diagnostic purposes. Tools such as the TICS and the GDS should be considered instead.
Systematic review of empowerment measures in health promotion.

PubMed

Cyril, Sheila; Smith, Ben J; Renzaho, Andre M N

2016-12-01

Empowerment, a multi-level construct comprising individual, community and organizational domains, is a fundamental value and goal in health promotion. While a range of scales have been developed for the measurement of empowerment, the qualities of these have not been rigorously assessed. The aim of this study was to evaluate the measurement properties of quantitative empowerment scales and their applicability in health promotion programs. A systematic review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines was done to evaluate empowerment scales across three dimensions: item development, reliability and validity. This was followed by assessment of measurement properties using a ratings scale with criteria addressing an a priori explicit theoretical framework, assessment of content validity, internal consistency and factor analysis to test structural validity. Of the 20 studies included in this review, only 8 (40%) used literature reviews, expert panels and empirical studies to develop scale items and 9 (45%) of studies fulfilled ≥5 criteria on the ratings scale. Two studies (10%) measured community empowerment and one study measured organizational empowerment, the rest (85%) measured individual empowerment. This review highlights important gaps in the measurement of community and organizational domains of empowerment using quantitative scales. A priority for future empowerment research is to investigate and explore approaches such as mixed methods to enable adequate measurement of empowerment across all three domains. This would help health promotion practitioners to effectively measure empowerment as a driver of change and an outcome in health promotion programs. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Influence of the wording of evaluation items on outcome-based evaluation results for large-group teaching in anatomy, biochemistry and legal medicine.

PubMed

Anders, Sven; Pyka, Katharina; Mueller, Tjark; von Streinbuechel, Nicole; Raupach, Tobias

2016-11-01

Student learning outcome is an important dimension of teaching quality in undergraduate medical education. Measuring an increase in knowledge during teaching requires repetitive objective testing which is usually not feasible. As an alternative, student learning outcome can be calculated from student self-ratings. Comparative self-assessment (CSA) gain reflects the performance difference before and after teaching, adjusted for initial knowledge. It has been shown to be a valid proxy measure of actual learning outcome derived from objective tests. However, student self-ratings are prone to a number of confounding factors. In the context of outcome-based evaluation, the wording of self-rating items is crucial to the validity of evaluation results. This randomized trial assessed whether including qualifiers in these statements impacts on student ratings and CSA gain. First-year medical students self-rated their initial (then-test) and final (post-test) knowledge for lectures in anatomy, biochemistry and legal medicine, respectively, and 659 questionnaires were retrieved. Six-point scales were used for self-ratings with 1 being the most positive option. Qualifier use did not affect then-test ratings but was associated with slightly less favorable post-test ratings. Consecutively, mean CSA gain was smaller for items containing qualifiers than for items lacking qualifiers (50.6±15.0% vs. 56.3±14.6%, p=0.079). The effect was more pronounced (Cohen's d=0.82) for items related to anatomy. In order to increase fairness of outcome-based evaluation and increase the comparability of CSA gain data across subjects, medical educators should agree on a consistent approach (qualifiers for all items or no qualifiers at all) when drafting self-rating statements for outcome-based evaluation. Copyright © 2016 Elsevier GmbH. All rights reserved.
Content Validity and Psychometric Characteristics of the "Knowledge about Older Patients Quiz" for Nurses Using Item Response Theory.

PubMed

Dikken, Jeroen; Hoogerduijn, Jita G; Kruitwagen, Cas; Schuurmans, Marieke J

2016-11-01

To assess the content validity and psychometric characteristics of the Knowledge about Older Patients Quiz (KOP-Q), which measures nurses' knowledge regarding older hospitalized adults and their certainty regarding this knowledge. Cross-sectional. Content validity: general hospitals. Psychometric characteristics: nursing school and general hospitals in the Netherlands. Content validity: 12 nurse specialists in geriatrics. Psychometric characteristics: 107 first-year and 78 final-year bachelor of nursing students, 148 registered nurses, and 20 nurse specialists in geriatrics. Content validity: The nurse specialists rated each item of the initial KOP-Q (52 items) on relevance. Ratings were used to calculate Item-Content Validity Index and average Scale-Content Validity Index (S-CVI/ave) scores. Items with insufficient content validity were removed. Psychometric characteristics: Ratings of students, nurses, and nurse specialists were used to test for different item functioning (DIF) and unidimensionality before item characteristics (discrimination and difficulty) were examined using Item Response Theory. Finally, norm references were calculated and nomological validity was assessed. Content validity: Forty-three items remained after assessing content validity (S-CVI/ave = 0.90). Psychometric characteristics: Of the 43 items, two demonstrating ceiling effects and 11 distorting ability estimates (DIF) were subsequently excluded. Item characteristics were assessed for the remaining 30 items, all of which demonstrated good discrimination and difficulty parameters. Knowledge was positively correlated with certainty about this knowledge. The final 30-item KOP-Q is a valid, psychometrically sound, comprehensive instrument that can be used to assess the knowledge of nursing students, hospital nurses, and nurse specialists in geriatrics regarding older hospitalized adults. It can identify knowledge and certainty deficits for research purposes or serve as a tool in educational or quality improvement programs. © 2016, Copyright the Authors Journal compilation © 2016, The American Geriatrics Society.
The development of an outcome measure for liaison mental health services.

PubMed

Guthrie, Else; Harrison, Mathew; Brown, Richard; Sandhu, Rajdeep; Trigwell, Peter; Abraham, Seri; Nawaz, Shazada; Kelsall, Peter; Thomasson, Rachel

2018-06-01

Aims and methodTo develop and pilot a clinician-rated outcome scale to evaluate symptomatic outcomes in liaison psychiatry services. Three hundred and sixty patient contacts with 207 separate individuals were rated using six subscales (mood, psychosis, cognition, substance misuse, mind-body problems and behavioural disturbance) plus two additional items (side-effects of medication and capacity to consent for medical treatment). Each item was rated on a five-point scale from 0 to 5 (nil, mild, moderate, severe and very severe). The liaison outcome measure was acceptable and easy to use. All subscales showed acceptable interrater reliability, with the exception of the mind-body subscale. Overall, the measure appears to show stability and sensitivity to change.Clinical implicationsThe measure provides a useful and robust way to determine symptomatic change in a liaison mental health setting, although the mind-body subscale requires modification.Declaration of interestNone.
Development and Validation of MMPI-2-RF Scales for Indexing Triarchic Psychopathy Constructs.

PubMed

Sellbom, Martin; Drislane, Laura E; Johnson, Alexandria K; Goodwin, Brandee E; Phillips, Tasha R; Patrick, Christopher J

2016-10-01

The triarchic model characterizes psychopathy in terms of three distinct dispositional constructs of boldness, meanness, and disinhibition. The model can be operationalized through scales designed specifically to index these domains or by using items from other inventories that provide coverage of related constructs. The present study sought to develop and validate scales for assessing the triarchic model domains using items from the Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF). A consensus rating approach was used to identify items relevant to each triarchic domain, and following psychometric refinement, the resulting MMPI-2-RF-based triarchic scales were evaluated for convergent and discriminant validity in relation to multiple psychopathy-relevant criterion variables in offender and nonoffender samples. Expected convergent and discriminant associations were evident very clearly for the Boldness and Disinhibition scales and somewhat less clearly for the Meanness scale. Moreover, hierarchical regression analyses indicated that all MMPI-2-RF triarchic scales incremented standard MMPI-2-RF scale scores in predicting extant triarchic model scale scores. The widespread use of MMPI-2-RF in clinical and forensic settings provides avenues for both clinical and research applications in contexts where traditional psychopathy measures are less likely to be administered. © The Author(s) 2015.
Validation of the brief version of the Recovery Self-Assessment (RSA-B) using Rasch measurement theory.

PubMed

Barbic, Skye P; Kidd, Sean A; Davidson, Larry; McKenzie, Kwame; O'Connell, Maria J

2015-12-01

In psychiatry, the recovery paradigm is increasingly identified as the overarching framework for service provision. Currently, the Recovery Self-Assessment (RSA), a 36-item rating scale, is commonly used to assess the uptake of a recovery orientation in clinical services. However, the consumer version of the RSA has been found challenging to complete because of length and the reading level required. In response to this feedback, a brief 12-item version of the RSA was developed (RSA-B). This article describes the development of the modified instrument and the application of traditional psychometric analysis and Rasch Measurement Theory to test the psychometrics properties of the RSA-B. Data from a multisite study of adults with serious mental illnesses (n = 1256) who were followed by assertive community treatment teams were examined for reliability, clinical meaning, targeting, response categories, model fit, reliability, dependency, and raw interval-level measurement. Analyses were performed using the Rasch Unidimensional Measurement Model (RUMM 2030). Adequate fit to the Rasch model was observed (χ2 = 112.46, df = 90, p = .06) and internal consistency was good (r = .86). However, Rasch analysis revealed limitations of the 12-item version, with items covering only 39% of the targeted theoretical continuum, 2 misfitting items, and strong evidence for the 5 option response categories not working as intended. This study revealed areas for improvement in the shortened version of the 12-item RSA-B. A revisit of the conceptual model and original 36-item rating scale is encouraged to select items that will help practitioners and researchers measure the full range of recovery orientation. (c) 2015 APA, all rights reserved).
The Impact of Target, Wording, and Duration on Rating Accuracy for Direct Behavior Rating

ERIC Educational Resources Information Center

Chafouleas, Sandra M.; Jaffery, Rose; Riley-Tillman, T. Chris; Christ, Theodore J.; Sen, Rohini

2013-01-01

The purpose of this study was to extend evaluation of rater accuracy using "Direct Behavior Rating--Single-Item Scales" (DBR-SIS). Extension of prior research was accomplished through use of criterion ratings derived from both systematic direct observation and expert DBR-SIS scores, and also through control of the durations over which…
Dutch Translation and Psychometric Testing of the 9-Item Shared Decision Making Questionnaire (SDM-Q-9) and Shared Decision Making Questionnaire-Physician Version (SDM-Q-Doc) in Primary and Secondary Care

PubMed Central

Rodenburg-Vandenbussche, Sumayah; Pieterse, Arwen H.; Kroonenberg, Pieter M.; Scholl, Isabelle; van der Weijden, Trudy; Luyten, Gre P. M.; Kruitwagen, Roy F. P. M.; den Ouden, Henk; Carlier, Ingrid V. E.; van Vliet, Irene M.; Zitman, Frans G.; Stiggelbout, Anne M.

2015-01-01

Purpose The SDM-Q-9 and SDM-Q-Doc measure patient and physician perception of the extent of shared decision making (SDM) during a physician-patient consultation. So far, no self-report instrument for SDM was available in Dutch, and validation of the scales in other languages has been limited. The aim of this study was to translate both scales into Dutch and assess their psychometric characteristics. Methods Participants were patients and their treating physicians (general practitioners and medical specialists). Patients (N = 182) rated their consultation using the SDM-Q-9, 43 physicians rated their consultations using the SDM-Q-Doc (N = 201). Acceptability, reliability (internal consistency), and the factorial structure of the instruments were determined. For convergent validity the CPSpost was used. Results Reliabilities of both scales were high (alpha SDM-Q-9 0.88; SDM-Q-Doc 0.87). The SDM-Q-9 and SDM-Q-Doc total scores correlated as expected with the CPSpost (SDM-Q-9: r = 0.29; SDM-Q-Doc: r = 0.48) and were significantly different between the CPSpost categories, with lowest mean scores when the physician made the decision alone. Principal Component Analyses showed a two-component model for each scale. A confirmatory factor analysis yielded a mediocre, but acceptable, one-factor model, if Item 1 was excluded; for both scales the best indices of fit were obtained for a one-factor solution, if both Items 1 and 9 were excluded. Conclusion The Dutch SDM-Q-9 and SDM-Q-Doc demonstrate good acceptance and reliability; they correlated as expected with the CPSpost and are suitable for use in Dutch primary and specialised care. Although the best model fit was found when excluding Items 1 and 9, we believe these items address important aspects of SDM. Therefore, also based on the coherence with theory and comparability with other studies, we suggest keeping all nine items of the scale. Further research on the SDM-concept in patients and physicians, in different clinical settings and different countries, is necessary to gain a better understanding of the SDM-construct and its measurement. PMID:26151946
Psychometric properties of the Press Ganey® Outpatient Medical Practice Survey.

PubMed

Presson, Angela P; Zhang, Chong; Abtahi, Amir M; Kean, Jacob; Hung, Man; Tyser, Andrew R

2017-02-10

The Press Ganey® Medical Practice Survey ("Press Ganey® survey") is a patient-reported questionnaire commonly used to measure patient satisfaction with outpatient health care in the United States. Our objective was to evaluate the reliability and validity of the Press Ganey® survey in a single institution setting. We analyzed surveys from 34,503 unique respondents seen by 624 providers from 47 specialties and 94 clinics at the University of Utah in 2013. The University of Utah is a health care system that provides primary through tertiary care for over 200 medical specialties. Surveys were administered online. The Press Ganey® survey consisted of 24 items organized into 6 scales: Access (4 items), Moving Through the Visit (2), Nurse Assistant (2), Care Provider (10), Personal Issues (4) and Overall Assessment (2). Missingness, ceiling and floor rates were summarized. Cronbach's alpha was used to evaluate internal consistency reliability. Confirmatory factor analysis was used to assess convergent and discriminant validities. Missingness was 0.01% for the total score and ranged from 0.8 to 11.4% across items. The ceiling rate was high at 29.3% for the total score, and ranged from 55.4 to 84.1% across items. Floor rates were 0.01% for the total score, and ranged from 0.1 to 2.1% across items. Internal consistency reliability ranged from 0.79 to 0.96, and item-scale correlations ranged from 0.49 to 0.9. Confirmatory factor analysis supported convergent and discriminant validities. The Press Ganey® survey demonstrated suitable psychometric properties for most metrics. However, the high ceiling rate can have a notable impact on quarterly percentile scores within our institution. Multi-institutional studies of the Press Ganey® survey are needed to inform administrative decision making and institution reimbursement decisions based on this survey.
A Facet-Factorial Approach towards the Development and Validation of a Jazz Rhythm Section Performance Rating Scale

ERIC Educational Resources Information Center

Wesolowski, Brian C.

2017-01-01

The purpose of this study was to develop a valid and reliable rating scale to assess jazz rhythm sections in the context of jazz big band performance. The research questions that guided this study included: (a) what central factors contribute to the assessment of a jazz rhythm section? (b) what items should be used to describe and assess a jazz…
Self-assessment of competencies in dental education in Germany - a multicentred survey.

PubMed

Bitter, K; Rüttermann, S; Lippmann, M; Hahn, P; Giesler, M

2016-11-01

The aim was to assess the competencies of undergraduate dental students in Germany in the domains team competence, communicative competence, learning competence and scholarship. The survey was conducted at 11 dental schools that are equally distributed all over Germany. Competencies were assessed with the Freiburg Questionnaire to Assess Competencies in Medicine (FCM). A short version of the FCM was used in this study. This short form included the four domains: team competence (three items), communicative competence (eight items), learning competence (five items) and scholarship (four items). Students had to rate each item twice: first with regard to the respondent's current level of competence and second with regard to the level of competence that respondents think is required by their job. All items were rated on a five-point Likert scale (1 'very much' and 5 'not at all'). Responsible lecturers from all selected dental schools received another questionnaire to answer the questions whether the FCM domain corresponding learning objectives were taught at the respective dental school. A total of 317 undergraduate students from 11 dental schools in their last clinical year participated. The response rate varied between 48% and 92%. Cronbach's α for the FCM scales addressing the current level of competencies ranged from 0.70 to 0.89 and for the scales measuring the presumed level of competencies demanded by their job ranged from 0.72 to 0.82. The mean values of the scales for the assessment of the presumed level of competencies demanded by the job were significantly lower compared to the mean values of the scales for the current level of competencies (P < 0.001 in all analyses). We found large differences between the two levels - in terms of 'standardised response means' (SRM) - in the domains team competence (SRM 1.34), learning competence (SRM 1.27) and communicative competence (SRM 1.18). Overall, the learning objectives that correspond to the assessed domains of competencies were taught to 19.6% completely, to 55.4% partially and to 25% not at all at the participating dental schools. The results of the present survey revealed that the participating students perceived deficiencies in all domains of competencies. These results indicate that the assessed domains are still barely integrated into dental medicine curricula in Germany and that further research in this field is needed. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Development and psychometric testing of a barriers to HIV testing scale among individuals with HIV infection in Sweden; The Barriers to HIV testing scale-Karolinska version.

PubMed

Wiklander, Maria; Brännström, Johanna; Svedhem, Veronica; Eriksson, Lars E

2015-11-19

Barriers to HIV testing experienced by individuals at risk for HIV can result in treatment delay and further transmission of the disease. Instruments to systematically measure barriers are scarce, but could contribute to improved strategies for HIV testing. Aims of this study were to develop and test a barriers to HIV testing scale in a Swedish context. An 18-item scale was developed, based on an existing scale with addition of six new items related to fear of the disease or negative consequences of being diagnosed as HIV-infected. Items were phrased as statements about potential barriers with a three-point response format representing not important, somewhat important, and very important. The scale was evaluated regarding missing values, floor and ceiling effects, exploratory factor analysis, and internal consistencies. The questionnaire was completed by 292 adults recently diagnosed with HIV infection, of whom 7 were excluded (≥9 items missing) and 285 were included (≥12 items completed) in the analyses. The participants were 18-70 years old (mean 40.5, SD 11.5), 39 % were females and 77 % born outside Sweden. Routes of transmission were heterosexual transmission 63 %, male to male sex 20 %, intravenous drug use 5 %, blood product/transfusion 2 %, and unknown 9 %. All scale items had <3 % missing values. The data was feasible for factor analysis (KMO = 0.92) and a four-factor solution was chosen, based on level of explained common variance (58.64 %) and interpretability of factor structure. The factors were interpreted as; personal consequences, structural barriers, social and economic security, and confidentiality. Ratings on the minimum level (suggested barrier not important) were common, resulting in substantial floor effects on the scales. The scales were internally consistent (Cronbach's α 0.78-0.91). This study gives preliminary evidence of the scale being feasible, reliable and valid to identify different types of barriers to HIV testing.

Use of Direct Behavior Ratings to Collect Functional Assessment Data

ERIC Educational Resources Information Center

Kilgus, Stephen P.; Kazmerski, Jennifer S.; Taylor, Crystal N.; von der Embse, Nathaniel P.

2017-01-01

The purpose of this investigation was to evaluate the utility of Direct Behavior Rating Single Item Scale (DBR-SIS) methodology in collecting functional behavior assessment data. Specific questions of interest pertained to the evaluation of the accuracy of brief DBR-SIS ratings of behavioral consequences and determination of the type of training…
A Psychometric Comparison of the Clinical Assessment Interview for Negative Symptoms and the Brief Negative Symptom Scale.

PubMed

Strauss, Gregory P; Gold, James M

2016-11-01

In 2005, the National Institute of Mental Health held a consensus development conference on negative symptoms of schizophrenia. Among the important conclusions of this meeting were that there are at least 5 commonly accepted domains of negative symptoms (blunted affect, alogia, avolition, anhedonia, asociality) and that new rating scales were needed to adequately assess these constructs. Two next-generation negative symptom scales resulted from this meeting: the Brief Negative Symptom Scale (BNSS) and Clinical Assessment Interview for Negative Symptoms (CAINS). Both measures are becoming widely used and studies have demonstrated good psychometric properties for each scale. The current study provides the first direct psychometric comparison of these scales. Participants included 65 outpatients diagnosed with schizophrenia or schizoaffective disorder who completed clinical interviews, questionnaires, and neuropsychological testing. Separate raters completed the BNSS and CAINS within the same week. Results indicated that both measures had good internal consistency, convergent validity, and discriminant validity. High correspondence was observed between CAINS and BNSS blunted affect and alogia items. Moderate convergence occurred for avolition and asociality items, and low convergence was seen among anhedonia items. Findings suggest that both scales have good psychometric properties, but that there are important distinctions among the items related to motivation and pleasure. © The Author 2016. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.
A short and valid measure of work-family enrichment.

PubMed

Kacmar, K Michele; Crawford, Wayne S; Carlson, Dawn S; Ferguson, Merideth; Whitten, Dwayne

2014-01-01

The stream of research concerning work-family enrichment has generated a significant body of research because it plays an important role in occupational health (Masuda, McNall, Allen, & Nicklin, 2012). work-family enrichment has been defined as "the extent to which experiences in one role improve the quality of life in the other role" (Greenhaus & Powell, 2006, p. 73). Within work-family enrichment, there are two directions: work to family and family to work. Carlson, Kacmar, Wayne, and Grzywacz (2006) developed an 18-item scale to measure this construct. Although the scale has been shown to be both reliable and valid, it also requires work-family researchers to include a proportionally large number of items to capture this construct in a study. The goal of the current study was to isolate a subset of the items in this measure that produces results similar to the full version thereby providing a more streamlined scale for researchers. Using a five-sample study that follows the scale reduction procedures offered by Stanton, Sinar, Balzer, and Smith (2002), we provide evidence that scales containing only three items for each direction of enrichment produce results equivalent to the full scale with respect to reliability and discriminant, convergent, and predictive validity. Reducing the original scale by two thirds, without losing explanatory power, allows scholars to measure enrichment in the work and family domains more efficiently, which should help minimize survey time, lower refusal rates, and generate less missing data. PsycINFO Database Record (c) 2014 APA, all rights reserved.
Development of the Career Anchors Scale among Occupational Health Nurses in Japan

PubMed Central

Kubo, Yoshiko; Hatono, Yoko; Kubo, Tomohide; Shimamoto, Satoko; Nakatani, Junko; Burgel, Barbara J.

2016-01-01

Objectives: This study aimed to develop the Career Anchors Scale among Occupational Health Nurses (CASOHN) and evaluate its reliability and validity. Methods: Scale items were developed through a qualitative inductive analysis of interview data, and items were revised following an examination of content validity by experts and occupational health nurses (OHNs), resulting in a provisional scale of 41 items. A total of 745 OHNs (response rate 45.2%) affiliated with the Japan Society for Occupational Health participated in the self-administered questionnaire survey. Results: Two items were deleted based on item-total correlations. Factor analysis was then conducted on the remaining 39 items to examine construct validity. An exploratory factor analysis with a main factor method and promax rotation resulted in the extraction of six factors. The variance contribution ratios of the six factors were 37.45, 7.01, 5.86, 4.95, 4.16, and 3.19%. The cumulative contribution ratio was 62.62%. The factors were named as follows: Demonstrating expertise and considering position in work (Factor 1); Management skills for effective work (Factor 2); Supporting health improvement in groups and organizations (Factor 3); Providing employee-focused support (Factor 4); Collaborating with occupational health team members and personnel (Factor 5); and Compatibility of work and private life (Factor 6). The confidence coefficient determined by the split-half method was 0.85. Cronbach's alpha coefficient for the overall scale was 0.95, whereas those of the six subscales were 0.88, 0.90, 0.91, 0.80, 0.85, and 0.79, respectively. Conclusions: CASOHN was found to be valid and reliable for measuring career anchors among OHNs in Japan. PMID:27725484
[Checklist Development for Women-Doctor-Friendly Working Conditions in a Hospital Setting].

PubMed

Horie, Saki; Takeuchi, Masumi; Yamaoka, Kazue; Nohara, Michiko; Hasunuma, Naoko; Okinaga, Hiroko; Nomura, Kyoko

2015-01-01

This study aims to develop a scale of "women-doctor-friendly working conditions in a hospital setting". A task team consisting of relevant people including a medical doctor and a hospital personnel identified 36 items related to women-doctor-friendly working conditions. From December in 2012 to January in 2013, we sent a self-administered questionnaire to 807 full-time employees including faculty members and medical doctors who worked for a university-affiliated hospital. We asked them to score the extent to which they think it is necessary for women doctors to balance between work and gender role responsibilities on the basis of the Likert scale. We carried out a factor analysis and computed Cronbach's alpha to develop a scale and investigated its construct validity and reliability. Of the 807 employees, 291 returned the questionnaires (response rate, 36.1%). The item-total correlation (between an individual item score and the total score) coefficient was in the range from 0.44 to 0.68. In factor analysis, we deleted six items, and five factors were extracted on the basis of the least likelihood method with the oblique Promax rotation. The factors were termed "gender equality action in an organization", "the compliance of care leave in both sexes and parental leave in men", "balance between life events and work", "childcare support at the workplace", and "flexible employment status". The Cronbach's alpha values of all the factors and the total items were 0.82-0.89 and 0.93, respectively, suggesting that the scale we developed has high reliability. The result indicated that the scale of women-doctor-friendly working conditions consisting of five factors with 30 items is highly validated and reliable.
Validation of a general measure of treatment satisfaction, the Treatment Satisfaction Questionnaire for Medication (TSQM), using a national panel study of chronic disease

PubMed Central

Atkinson, Mark J; Sinha, Anusha; Hass, Steven L; Colman, Shoshana S; Kumar, Ritesh N; Brod, Meryl; Rowland, Clayton R

2004-01-01

Background The objective of this study was to develop and psychometrically evaluate a general measure of patients' satisfaction with medication, the Treatment Satisfaction Questionnaire for Medication (TSQM). Methods The content and format of 55 initial questions were based on a formal conceptual framework, an extensive literature review, and the input from three patient focus groups. Patient interviews were used to select the most relevant questions for further evaluation (n = 31). The psychometric performance of items and resulting TSQM scales were examined using eight diverse patient groups (arthritis, asthma, major depression, type I diabetes, high cholesterol, hypertension, migraine, and psoriasis) recruited from a national longitudinal panel study of chronic illness (n = 567). Participants were then randomized to complete the test items using one of two alternate scaling methods (Visual Analogue vs. Likert-type). Results A factor analysis (principal component extraction with varimax rotation) of specific items revealed three factors (Eigenvalues > 1.7) explaining 75.6% of the total variance; namely Side effects (4 items, 28.4%, Cronbach's Alpha = .87), Effectiveness (3 items, 24.1%, Cronbach's Alpha = .85), and Convenience (3 items, 23.1%, Cronbach's Alpha = .87). A second factor analysis of more generally worded items yielded a Global Satisfaction scale (3 items, Eigenvalue = 2.3, 79.1%, Cronbach's Alpha = .85). The final four scales possessed good psychometric properties, with the Likert-type scaling method performing better than the VAS approach. Significant differences were found on the TSQM by the route of medication administration (oral, injectable, topical, inhalable), level of illness severity, and length of time on medication. Regression analyses using the TSQM scales accounted for 40–60% of variation in patients' ratings of their likelihood to persist with their current medication. Conclusion The TSQM is a psychometrically sound and valid measure of the major dimensions of patients' satisfaction with medication. Preliminary evidence suggests that the TSQM may also be a good predictor of patients' medication adherence across different types of medication and patient populations. PMID:14987333
Disparity between General Symptom Relief and Remission Criteria in the Positive and Negative Syndrome Scale (PANSS)

PubMed Central

Reise, Steven P.; Marder, Stephen R.; Mansolf, Maxwell; Han, Carol; Bilder, Robert M.

2017-01-01

Objective: Total scale scores derived by summing ratings from the 30-item PANSS are commonly used in clinical trial research to measure overall symptom severity, and percentage reductions in the total scores are sometimes used to document the efficacy of treatment. Acknowledging that some patients may have substantial changes in PANSS total scores but still be sufficiently symptomatic to warrant diagnosis, ratings on a subset of 8 items, referred to here as the “Remission set,” are sometimes used to determine if patients’ symptoms no longer satisfy diagnostic criteria. An unanswered question remains: is the goal of treatment better conceptualized as reduction in overall symptom severity, or reduction in symptoms below the threshold for diagnosis? We evaluated the psychometric properties of PANSS total scores, to assess whether having low symptom severity post-treatment is equivalent to attaining Remission. Design: We applied a bifactor item response theory (IRT) model to post-treatment PANSS ratings of 3,647 subjects diagnosed with schizophrenia assessed at the termination of 11 clinical trials. The bifactor model specified one general dimension to reflect overall symptom severity, and five domain-specific dimensions. We assessed how PANSS item discrimination and information parameters varied across the range of overall symptom severity (θ), with a special focus on low levels of symptoms (i.e., θ<-1), which we refer to as “Relief” from symptoms. A score of θ=-1 corresponds to an expected PANSS item score of 1.83, a rating between “Absent” and “Minimal” for a PANSS symptom. Results: The application of the bifactor IRT model revealed: (1) 88% of total score variation was attributable to variation in general symptom severity, and only 8% reflected secondary domain factors. This implies that a general factor may provide a good indicator of symptom severity, and that interpretation is not overly complicated by multidimensionality; (2) Post-treatment, 534 individuals (about 15% of the whole sample) scored in the “Relief” range of general symptom severity, but more than twice that number (n = 1351) satisfied Remission criteria (37%). 2 in 3 Remitted patients had scores that were not in a low symptom range (corresponding to Absent or Minimal item scores); (3) PANSS items vary greatly in their ability to measure the general symptom severity dimension; while many items are highly discriminating and relatively “pure” indicators of general symptom severity (delusions, conceptual disorganization), others are better indicators of specific dimensions (blunted affect, depression). The utility of a given PANSS item for assessing a patient depended on the illness level of the patient. Conclusion: Satisfying conventional Remission criteria was not strongly associated with low levels of symptoms. The items providing the most information for patients in the symptom Relief range were Delusions, Preoccupation, Suspiciousness Persecution, Unusual Thought Content, Conceptual Disorganization, Stereotyped Thinking, Active Social Avoidance, and Lack of Judgment and Insight. Lower scores on these items (item scores ≤2) were strongly associated with having a low latent trait θ or experiencing overall symptom relief. The inter-rater agreement between Remission and Relief subjects suggested that these criteria identified different subsets of patients. Alternative subsets of items may offer better indicators of general symptom severity and provide better discrimination (and lower standard errors) for scaling individuals and judging symptom relief, where the “best” subset of items ultimately depends on the illness range and treatment phase being evaluated. PMID:29410936
The Direct and Indirect Effects of Paliperidone Extended-release on Depressive Symptoms in Schizoaffective Disorder: A Path Analysis.

PubMed

Turkoz, Ibrahim; Fu, Dong-Jing; Bossie, Cynthia A; Alphs, Larry

2015-01-01

This analysis evaluates improvement in symptoms of depression in patients with schizoaffective disorder administered oral paliperidone extended-release by accounting for the magnitude of direct and indirect (changes in negative and positive symptoms and worsening of extrapyramidal symptoms) treatment effects on depressive symptoms. Data for this post hoc analysis were drawn from two six-week, randomized, placebo-controlled studies of paliperidone extended-release versus placebo in adult subjects with schizoaffective disorder (N=614; NCT00412373, NCT00397033). Subjects with baseline 17-item Hamilton Rating Scale for Depression scores of 16 or greater were included. Structural equation models (path analyses) were used to separate total effects into direct and indirect effects on depressive symptoms. Change from baseline in 17-item Hamilton Rating Scale for Depression score at the Week 6 end point was the dependent variable; changes in Positive and Negative Syndrome Scale positive and negative factors and Simpson-Angus Scale (to evaluate extrapyramidal symptoms) scores were independent variables. At baseline, 332 of 614 (54.1%) subjects had a 17-item Hamilton Rating Scale for Depression score of 16 or greater. Path analysis determined that up to 26.4 percent of the paliperidone extended-release versus placebo effect on depressive symptoms may be attributed to a direct treatment effect, and 45.8 percent and 28.4 percent were mediated indirectly through improvements on positive and negative symptoms, respectively. No effects were identified as mediated through extrapyramidal symptoms changes (-0.7%). RESULTS of this analysis suggest that paliperidone's effect on depressive symptoms in subjects with schizoaffective disorder participating in two six-week, randomized, placebo-controlled studies is mediated through indirect effects (e.g., positive and negative symptom changes) and a direct treatment effect.
The VCOP Scale: A Measure of Overprotection in Parents of Physically Vulnerable Children.

ERIC Educational Resources Information Center

Wright, Logan; And Others

1993-01-01

Developed Vulnerable Child/Overprotecting Parent Scale to measure overprotecting versus optimal developmental stimulation tendencies for parents of physically vulnerable children. Items were administered to parents whose parenting techniques had been rated as either highly overprotective or as optimal by group of physicians and other…
Multiscale Measurement of Extreme Response Style

ERIC Educational Resources Information Center

Bolt, Daniel M.; Newton, Joseph R.

2011-01-01

This article extends a methodological approach considered by Bolt and Johnson for the measurement and control of extreme response style (ERS) to the analysis of rating data from multiple scales. Specifically, it is shown how the simultaneous analysis of item responses across scales allows for more accurate identification of ERS, and more effective…
Schaefer Behavior Inventory. Teacher's Manual.

ERIC Educational Resources Information Center

Schaefer, Earl S.; And Others

This 15-item teacher rating scale measures three behavior traits: task orientation (how a child attends to and stays with classroom activities), extraversion (how readily a child interacts with other people), and hostility (how a child responds to some of the adjustments and conflict problems encountered in group activities). The scale is based…
Towards the use of a census tract poverty indicator variable in cancer surveillance.

PubMed

Boscoe, Francis P

2010-01-01

Incidence rates for many cancer sites are strongly correlated with area measures of socioeconomic conditions such as poverty rate. Analyzing such measures at the county scale produces misleading results by masking enormous within-county variations. The census tract is a more suitable scale for assessing the relationship between cancer and socioeconomics. The North American Association of Central Cancer Registries (NAACCR) developed a census tract-level poverty indicator variable which was included as an optional item in its 2010 Call for Data. This variable does not allow the identification of individual census tracts as long as the county of diagnosis is not known. It is expected that this data item will be made available to researchers in future releases of the CINA Deluxe file.
IRT Item Parameter Scaling for Developing New Item Pools

ERIC Educational Resources Information Center

Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua

2017-01-01

Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…
Manual for Transference Work Scale; a micro-analytical tool for therapy process analyses.

PubMed

Ulberg, Randi; Amlo, Svein; Høglend, Per

2014-11-18

The present paper is a manual for the Transference Work Scale (TWS). The inter-rater agreement on the 26 TWS items was good to excellent and previously published. TWS is a therapy process rating scale focusing on Transference Work (TW) (i.e. analysis of the patient-therapist relationship). TW is considered a core active ingredient in dynamic psychotherapy. Adequate process scales are needed to identify and analyze in-session effects of therapist techniques in psychodynamic psychotherapy and empirically establish their links to outcome. TWS was constructed to identify and categorize relational (transference) interventions, and explore the in-session impact of analysis of the patient-therapist relationship (transference work). TWS has sub scales that rate timing, content, and valence of the transference interventions, as well as response from the patient. Descriptions and elaborations of the items in TWS are provided. Clinical examples of transference work from the First Experimental Study of Transference Interpretations (FEST) are included and followed by examples of how to rate transcripts from therapy sessions with TWS. The present manual describes in detail the rating procedure when using Transference Work Scale. Ratings are illustrated with clinical examples from FEST. TWS might be a potentially useful tool to explore the interaction of timing, category, and valence of transference work in predicting in-session patient response as well as treatment outcome. TWS might prove especially suitable for intensive case studies combining quantitative and narrative data. First Experimental Study of Transference-interpretations (FEST307/95). ClinicalTrials.gov Identifier: NCT00423462. URL: http://clinicaltrials.gov/ct2/show/NCT00423462?term=FEST&rank=2.
Correlates of a Single-Item Indicator Versus a Multi-Item Scale of Outness About Same-Sex Attraction

PubMed Central

Noor, Syed W.; Galos, Dylan L.; Simon Rosser, B. R.

2017-01-01

In this study, we investigated if a single-item indicator measured the degree to which people were open about their same-sex attraction (“out”) as accurately as a multi-item scale. For the multi-item scale, we used the Outness Inventory, which includes three subscales: family, world, and religion. We examined correlations between the single- and multi-item measures; between the single-item indicator and the subscales of the multi-item scale; and between the measures and internalized homonegativity, social attitudes towards homosexuality, and depressive symptoms. In addition, we calculated Tjur’s R2 as a measure of predictive power of the single-item indicator, multi-item scale, and subscales of the multi-item scale in predicting two health-related outcomes: depressive symptoms and condomless anal sex with multiple partners. There was a strong correlation between the single- and multi-item measures (r = 0.73). Furthermore, there were strong correlations between the single-item indicator and each subscale of the multi-item scale: family (r = 0.70), world (r = 0.77), and religion (r = 0.50). In addition, the correlations between the single-item indicator and internalized homonegativity (r = −0.63), social attitudes towards homosexuality (r = −0.38), and depression (r = −0.14) were higher than those between the multi-item scale and internalized homonegativity (r = −0.55), social attitudes towards homosexuality (r = −0.21), and depression (r = −0.13). Contrary to the premise that multi-item measures are superior to single-item measures, our collective findings indicate that the single-item indicator of outness performs better than the multi-item scale of outness. PMID:26292840
Development of a scale of executive functioning for the RBANS.

PubMed

Spencer, Robert J; Kitchen Andren, Katherine A; Tolle, Kathryn A

2018-01-01

The Repeatable Battery for the Assessment of Neuropsychological Status (RBANS) is a cognitive battery that contains scales of several cognitive abilities, but no scale in the instrument is exclusively dedicated to executive functioning. Although the subtests allow for observation of executive-type errors, each error is of fairly low base rate, and healthy and clinical normative data are lacking on the frequency of these types of errors, making their significance difficult to interpret in isolation. The aim of this project was to create an RBANS executive errors scale (RBANS EE) with items comprised of qualitatively dysexecutive errors committed throughout the test. Participants included Veterans referred for outpatient neuropsychological testing. Items were initially selected based on theoretical literature and were retained based on item-total correlations. The RBANS EE (a percentage calculated by dividing the number of dysexecutive errors by the total number of responses) was moderately related to each of seven established measures of executive functioning and was strongly predictive of dichotomous classification of executive impairment. Thus, the scale had solid concurrent validity, justifying its use as a supplementary scale. The RBANS EE requires no additional administration time and can provide a quantified measure of otherwise unmeasured aspects of executive functioning.
Development of a quality of life instrument for children with advanced cancer: the pediatric advanced care quality of life scale (PAC-QoL).

PubMed

Cataudella, Danielle; Morley, Tara Elise; Nesin, April; Fernandez, Conrad V; Johnston, Donna Lynn; Sung, Lillian; Zelcer, Shayna

2014-10-01

There is currently no published, validated measures available that comprehensively capture quality of life (QoL) symptoms for children with poor-prognosis malignancies. The pediatric advanced care-quality of life scale (PAC-QoL) has been developed to address this gap. The current paper describes the first two phases in the development of this measure. The first two phases included: (1) construct and item generation, and (2) preliminary content validation. Domains of QoL relevant to this population were identified from the literature and items generated to capture each; items were then adapted to create versions sensitive to age/developmental differences. Two types of experts reviewed the draft PAC-QoL and rated items for relevance, understandability, and sensitivity of wording: bereaved parents (n = 8) and health care professionals (HCP; n = 7). Content validity was calculated using the index of content validity (CVI [Lynn. Nurs Res 1986;35:382-385]). One hundred and forty-one candidate items congruent with the domains identified as relevant to children with advanced malignancies were generated, and four report versions with a 5-choice response scale created. Parent mean scores for importance, understandability, and sensitivity of wording ranged from 4.29 (SD = 0.52) to 4.66 (SD = 0.50). The CVI ranged from 95% to 100%. These steps resulted in reductions of the PAC-QoL to 57-65 items, as well as a modification of the response scale to a 4-choice option with new anchors. The next phase of this study will be to conduct cognitive probing with the intended population to further modify and reduce candidate items prior to psychometric evaluation. © 2014 Wiley Periodicals, Inc.
Measuring cancer caregiver health literacy: Validation of the Health Literacy of Caregivers Scale-Cancer (HLCS-C) in an Australian population.

PubMed

Yuen, Eva; Knight, Tess; Dodson, Sarity; Chirgwin, Jacqueline; Busija, Lucy; Ricciardelli, Lina A; Burney, Susan; Parente, Phillip; Livingston, Patricia M

2018-05-01

Caregivers have been largely neglected in health literacy measurement. We assess the construct validity, and internal consistency of the Health Literacy of Caregivers Scale-Cancer (HLCS-C), and present a revised, psychometrically robust scale. Using data from 297 cancer caregivers (12.4% response rate) recruited from Melbourne, Australia between January-July 2014, confirmatory factor analysis (CFA) was conducted to evaluate the HLCS-C's proposed factor structure. Items were evaluated for: item difficulty, unidimensionality and overall item fit within their domain. Item-threshold-ordering was examined though one-parameter Item Response Theory models. Internal consistency was assessed using Raykov's reliability coefficient. CFA results identified 42 poorly performing/redundant items which were subsequently removed. A 10-factor model was fitted to 46 acceptable items with no correlated residuals or factor cross-loadings accepted. Adequate fit was revealed (χ 2 WLSMV = 1463.807[df = 944], p < .001, RMSEA = 0.043, CFI = 0.980, TLI = 0.978, WRMR = 1.00). Ten domains were identified: Proactivity and determination to seek information; Adequate information about cancer and cancer management; Supported by healthcare providers (HCP) to understand information; Social support; Cancer-related communication with the care recipient (CR); Understanding CR needs and preferences; Self-care; Understanding the healthcare system; Capacity to process health information; and Active engagement with HCP. Internal consistency was adequate across domains (0.78-0.92). The revised HLCS-C demonstrated good structural, convergent, and discriminant validity, and high internal consistency. The scale may be useful for the development and evaluation of caregiver interventions. © 2017 John Wiley & Sons Ltd.
Development and validation of a pediatric sports activity rating scale: the Hospital for Special Surgery Pediatric Functional Activity Brief Scale (HSS Pedi-FABS).

PubMed

Fabricant, Peter D; Robles, Alex; Downey-Zayas, Timothy; Do, Huong T; Marx, Robert G; Widmann, Roger F; Green, Daniel W

2013-10-01

Having simple and reliable validated outcome measures is vital to conducting high-quality outcomes research in the field of orthopaedic surgery. Activity level is a key prognostic variable for patients with sports injuries. There is a paucity of such activity scales for children and adolescents who are otherwise healthy and athletically active. In addition to frequency and intensity of athletic activity, level of play and coach/trainer supervision are important variables unique to children and adolescents that are not captured in available adult scoring systems. To create and validate a concise and comprehensive activity rating scale for athletically active children and adolescents 10 to 18 years of age. Cohort study (diagnosis); Level of evidence, 2. Item generation was performed with a panel of orthopaedic surgeons and adolescent athletes. Item reduction, pilot testing and scale refinement resulted in a final 8-item instrument, the Hospital for Special Surgery Pediatric Functional Activity Brief Scale (HSS Pedi-FABS). Existing methods were used to determine reliability and validation. The Flesch-Kincaid score was calculated at a 6.6th-grade reading level (approximately 13 years old); therefore, although all subjects provided their own answers, parents were allowed to assist children younger than 13 years with reading the questionnaire. Scale reliability was excellent (test-retest reliability, intraclass correlation coefficient = 0.91; internal consistency, Cronbach alpha = .914), and there were no floor or ceiling effects. There was also robust construct validity: Convergent validity testing revealed positive correlations between the HSS Pedi-FABS and level of competition in athletic activity, number of reported hours of athletic activity per week, and existing comparable adult and pediatric scales. Discriminant validity was shown with age, body mass index, and type of sport as measured by the Daniel scale. The 8-item HSS Pedi-FABS can be used to reliably and accurately evaluate activity level as a prognostic variable for clinical research studies. It is a simple, reliable, and valid metric to assess activity in children and adolescents 10 to 18 years of age. This instrument will lead to better evaluation of posttreatment outcomes and patient-reported activity for child and adolescent athletes.
Initial validation of a scale to measure purposelessness, understimulation, and boredom in cancer patients: toward a redefinition of depression in advanced disease.

PubMed

Passik, Steven D; Inman, Alice; Kirsh, Kenneth; Theobald, Dale; Dickerson, Pamela

2003-03-01

The problem of boredom in people with cancer has received little research attention, and yet clinical experience suggests that it has the potential to profoundly affect quality of life in those patients. We were interested in developing a Purposelessness, Understimulation, and Boredom (PUB) Scale to identify this problem and to begin to differentiate it from depression. Cancer patients and professionals were interviewed using a semi-structured format to elicit their perceptions of the incidence, causes, scope, and consequences of boredom. From their responses, 45 questions were developed, edited for clarity, and piloted. A total of 100 cancer patients were recruited to participate in the study. Preliminary validation of the PUB using a cross-sectional survey of the measure was conducted. Other instruments used for purposes of convergent and divergent validity included the Functional Assessment of Cancer Therapy Scale-Anemia, Zung Self-Rating Depression Scale, Boredom Proneness Scale, Leisure Boredom Scale, Cancer Behavior Inventory, Systems of Belief Inventory, and the Eastern Cooperative Oncology Group Performance Status Scale. The average age of the sample was 62.37 years (SD = 13.43) and was comprised of 60 women (60.00%) and 40 men (40.00%). The results of a factor analysis on the 45 initial items (selected on the basis of professional and patient interviews) created a two-factor scale. The eight items from the strongest factor (items 1, 2, 3, 4, 5, 6, 9, 10) seemed to best tap the construct that could be deemed as overt boredom whereas the six items of the second factor (items 36, 38, 39, 42, 44, 45) seemed to tap the construct of boredom related to meaning and spirituality. Total scale internal consistency, when all 14 items were included in the analysis, yielded a coefficient alpha of 0.84 and good test-retest reliability at 2 weeks (r = .80, p < .001). The novel 14-item PUB Scale was significantly correlated to other measures of boredom; the Boredom Proneness Scale (r = -.588, p < .001) and the Leisure Boredom Scale (r = .576, p < .001). The PUB Scale was found to be a statistically viable tool with the ability to detect boredom and differentiate it from depression. In many respects this work is in concert with much of the current research and clinical effort going on in psycho-oncology that defines components of distress that in sum, redefines depression in advanced cancer.

Test Review: Constantino, J. N., & Gruber, C. P. (2012). "Social Responsiveness Scale-Second Edition" ("SRS-2"). Torrance, CA: Western Psychological Services

ERIC Educational Resources Information Center

Bruni, Teryn P.

2014-01-01

This article reviews the Social Responsiveness Scale-Second Edition (SRS-2), a 65-item rating scale measuring deficits in social behavior associated with Autism Spectrum Disorder (ASD), as outlined by the "Diagnostic and Statistical Manual of Mental Disorders" (4th ed., text rev.; "DSM-IV-TR"; American Psychiatric Association,…
Psychometric evaluation of a short measure of social capital at work

PubMed Central

Kouvonen, Anne; Kivimäki, Mika; Vahtera, Jussi; Oksanen, Tuula; Elovainio, Marko; Cox, Tom; Virtanen, Marianna; Pentti, Jaana; Cox, Sara J; Wilkinson, Richard G

2006-01-01

Background Prior studies on social capital and health have assessed social capital in residential neighbourhoods and communities, but the question whether the concept should also be applicable in workplaces has been raised. The present study reports on the psychometric properties of an 8-item measure of social capital at work. Methods Data were derived from the Finnish Public Sector Study (N = 48,592) collected in 2000–2002. Based on face validity, an expert unfamiliar with the data selected 8 questionnaire items from the available items for a scale of social capital. Reliability analysis included tests of internal consistency, item-total correlations, and within-unit (interrater) agreement by rwg index. The associations with theoretically related and unrelated constructs were examined to assess convergent and divergent validity (construct validity). Criterion-related validity was explored with respect to self-rated health using multilevel logistic regression models. The effects of individual level and work unit level social capital were modelled on self-rated health. Results The internal consistency of the scale was good (Cronbach's alpha = 0.88). The rwg index was 0.88, which indicates a significant within-unit agreement. The scale was associated with, but not redundant to, conceptually close constructs such as procedural justice, job control, and effort-reward imbalance. Its associations with conceptually more distant concepts, such as trait anxiety and magnitude of change in work, were weaker. In multilevel models, significantly elevated age adjusted odds ratios (ORs) of poor self-rated health (OR = 2.42, 95% confidence interval (CI): 2.24–2.61 for the women and OR = 2.99, 95% CI: 2.56–3.50 for the men) were observed for the employees in the lowest vs. highest quartile of individual level social capital. In addition, low social capital at the work unit level was associated with a higher likelihood of poor self-rated health. Conclusion Psychometric techniques show our 8-item measure of social capital to be a valid tool reflecting the construct and displaying the postulated links with other variables. PMID:17038200
Concise Associated Symptoms Tracking scale: a brief self-report and clinician rating of symptoms associated with suicidality.

PubMed

Trivedi, Madhukar H; Wisniewski, Stephen R; Morris, David W; Fava, Maurizio; Kurian, Benji T; Gollan, Jackie K; Nierenberg, Andrew A; Warden, Diane; Gaynes, Bradley N; Luther, James F; Rush, A John

2011-06-01

US Food and Drug Administration (FDA) warnings recommend monitoring negative symptoms associated with the initiation of antidepressant medications as these symptoms may interfere with full recovery and pose safety concerns. There is currently no brief, reliable rating instrument for assessing treatment-emergent, negative symptoms. We evaluated the psychometric properties of 2 versions of the newly developed 17-item Concise Associated Symptom Tracking (CAST) scale, the CAST Clinician Rating (CAST-C) and CAST Self-Rated (CAST-SR), which are brief instruments designed to measure the 5 relevant associated symptom domains (irritability, anxiety, mania, insomnia, and panic). The study enrolled 265 outpatients with major depressive disorder (MDD), from July 2007 through February 2008, into an 8-week, open-label trial with a selective serotonin reuptake inhibitor. Diagnosis of MDD was determined by the Psychiatric Diagnostic Screening questionnaire and an MDD checklist based on DSM-IV-TR criteria. Suicidality (suicidal ideation with associated behaviors) is 1 of 9 symptoms of MDD (depressed mood, loss of interest, appetite or weight change, sleep disturbance, reduced concentration or indecisiveness, fatigue or decreased energy, psychomotor agitation or retardation, feelings of worthlessness or excessive guilt). Psychometric evaluations were conducted on both versions of the CAST. Cronbach α was .80 (CAST-C) and .81 (CAST-SR). Factor analysis identified 5 factors for each scale: (1) irritability, (2) anxiety, (3) mania, (4) insomnia, and (5) panic. When the item that cross-loaded on 2 factors was eliminated, the 16-item solution had a better goodness of fit (CAST-C: 0.90 vs 0.87; CAST-SR: 0.88 vs 0.84). Cronbach α for the 16-item versions was .77 (CAST-C) and .78 (CAST-SR). The 5 associated CAST symptom domains correlated well with other standard measures of these domains. The 16-item CAST-C and CAST-SR demonstrated excellent psychometric properties. These are potentially useful measures for monitoring treatment-emergent negative symptoms associated with antidepressants, as recommended by the FDA. Clinicaltrials.gov Identifier: NCT00532103. © Copyright 2011 Physicians Postgraduate Press, Inc.
Acute and long-term treatment of late-life major depressive disorder: duloxetine versus placebo.

PubMed

Robinson, Michael; Oakes, Tina Myers; Raskin, Joel; Liu, Peng; Shoemaker, Scarlett; Nelson, J Craig

2014-01-01

To compare the efficacy of duloxetine with placebo on depression in elderly patients with major depressive disorder. Multicenter, 24-week (12-week short-term and 12-week continuation), randomized, placebo-controlled, double-blind trial. United States, France, Mexico, Puerto Rico. Age 65 years or more with major depressive disorder diagnosis (one or more previous episode); Mini-Mental State Examination score ≥20; Montgomery-Asberg Depression Rating Scale total score ≥20. Duloxetine 60 or 120 mg/day or placebo; placebo rescue possible. Primary-Maier subscale of the 17-item Hamilton Depression Rating Scale (HAMD-17) at week 12. Secondary-Geriatric Depression Scale, HAMD-17 total score, cognitive measures, Brief Pain Inventory (BPI), Numeric Rating Scales (NRS) for pain, Clinical Global Impression-Severity scale, Patient Global Impression of Improvement in acute phase and acute plus continuation phase of treatment. Compared with placebo, duloxetine did not show significantly greater improvement from baseline on Maier subscale at 12 weeks, but did show significantly greater improvement at weeks 4, 8, 16, and 20. Similar patterns for Geriatric Depression Scale and Clinical Global Impression-Severity scale emerged, with significance also seen at week 24. There was a significant treatment effect for all BPI items and 4 of 6 NRS pain measures in the acute phase, most BPI items and half of the NRS measures in the continuation phase. More duloxetine-treated patients completed the study (63% versus 55%). A significantly higher percentage of duloxetine-treated patients versus placebo discontinued due to adverse event (15.3% versus 5.8%). Although the antidepressant efficacy of duloxetine was not confirmed by the primary outcome, several secondary measures at multiple time points suggested efficacy. Duloxetine had significant and meaningful beneficial effects on pain. Copyright © 2014 American Association for Geriatric Psychiatry. Published by Elsevier Inc. All rights reserved.
A preliminary study to measure and develop job satisfaction scale for medical teachers.

PubMed

Bhatnagar, Kavita; Srivastava, Kalpana; Singh, Amarjit; Jadav, S L

2011-07-01

Job satisfaction of medical teachers has an impact on quality of medical education and patient care. In this background, the study was planned to develop scale and measure job satisfaction status of medical teachers. To generate items pertaining to the scale of job satisfaction, closed-ended and open-ended questionnaires were administered to medical professionals. The job satisfaction questionnaire was developed and rated on Likert type of rating scale. Both quantitative and qualitative methods were used to ascertain job satisfaction among 245 health science faculty of an autonomous educational institution. Factor loading was calculated and final items with strong factor loading were selected. Data were statistically evaluated. Average job satisfaction score was 53.97 on a scale of 1-100. The Cronbach's alpha reliability coefficient was 0.918 for entire set of items. There was statistically significant difference in job satisfaction level across different age groups (P 0.0358) showing a U-shaped pattern and fresh entrants versus reemployed faculty (P 0.0188), former showing lower satisfaction. Opportunity for self-development was biggest satisfier, followed by work, opportunity for promotion, and job security. Factors contributing toward job dissatisfaction were poor utilization of skills, poor promotional prospects, inadequate pay and allowances, work conditions, and work atmosphere. Tertiary care teaching hospitals in autonomous educational institutions need to build infrastructure and create opportunities for their medical professional. Job satisfaction of young entrants needs to be raised further by improving their work environment. This will pave the way for effective delivery of health care.
Instrument validation process: a case study using the Paediatric Pain Knowledge and Attitudes Questionnaire.

PubMed

Peirce, Deborah; Brown, Janie; Corkish, Victoria; Lane, Marguerite; Wilson, Sally

2016-06-01

To compare two methods of calculating interrater agreement while determining content validity of the Paediatric Pain Knowledge and Attitudes Questionnaire for use with Australian nurses. Paediatric pain assessment and management documentation was found to be suboptimal revealing a need to assess paediatric nurses' knowledge and attitude to pain. The Paediatric Pain Knowledge and Attitudes Questionnaire was selected as it had been reported as valid and reliable in the United Kingdom with student nurses. The questionnaire required content validity determination prior to use in the Australian context. A two phase process of expert review. Ten paediatric nurses completed a relevancy rating of all 68 questionnaire items. In phase two, five pain experts reviewed the items of the questionnaire that scored an unacceptable item level content validity. Item and scale level content validity indices and intraclass correlation coefficients were calculated. In phase one, 31 items received an item level content validity index <0·78 and the scale level content validity index average was 0·80 which were below levels required for acceptable validity. The intraclass correlation coefficient was 0·47. In phase two, 10 items were amended and four items deleted. The revised questionnaire provided a scale level content validity index average >0·90 and an intraclass correlation coefficient of 0·94 demonstrating excellent agreement between raters therefore acceptable content validity. Equivalent outcomes were achieved using the content validity index and the intraclass correlation coefficient. To assess content validity the content validity index has the advantage of providing an item level score and is a simple calculation. The intraclass correlation coefficient requires statistical knowledge, or support, and has the advantage of accounting for the possibility of chance agreement. © 2016 John Wiley & Sons Ltd.
The integral inventory for depression, a new, self-rated clinimetric instrument for the emotional and painful dimensions in major depressive disorder.

PubMed

Dueñas, Héctor; Lara, Carmen; Walton, Richard J; Granger, Renee E; Dossenbach, Martin; Raskin, Joel

2011-09-01

To assess the reliability and validity of the Integral Inventory for Depression (IID) scale using post hoc analyses of data from a multi-country study (ClinicalTrials.gov: NCT00561509) of patients with major depressive disorder (MDD). Patients (N = 1629) completed the IID (comprising two separate dimensions for emotional and physically painful symptoms; maximum score of 65) and a reference scale (16-item Quick Inventory of Depressive Symptomatology Self-Report) at baseline and at follow-up (8 and 24 weeks). Physicians rated MDD symptoms using the Clinical Global Impressions of Severity scale at each visit. Inter-item correlation, internal consistency, external validity, factor structure, and exploratory analysis of an optimal severity cut-off point were assessed. The IID displayed two distinct dimensions (i.e. painful and emotional) with little item redundancy and good internal consistency (Cronbach's α > 0.83 at each visit). The IID displayed good external validity (Pearson's correlations coefficients >0.60 at each visit) and statistically significant agreement (McNemar's test; P < 0.001 at follow-up) with the reference scale. Results suggest that a cut-off score of ≤24 had adequate precision (>80%) to identify patients with and without moderate MDD. Results suggest that the IID may be a reliable and valid tool for assessing emotional and painful symptoms of MDD.
Representational constraints on children's suggestibility.

PubMed

Ceci, Stephen J; Papierno, Paul B; Kulkofsky, Sarah

2007-06-01

In a multistage experiment, twelve 4- and 9-year-old children participated in a triad rating task. Their ratings were mapped with multidimensional scaling, from which euclidean distances were computed to operationalize semantic distance between items in target pairs. These children and age-mates then participated in an experiment that employed these target pairs in a story, which was followed by a misinformation manipulation. Analyses linked individual and developmental differences in suggestibility to children's representations of the target items. Semantic proximity was a strong predictor of differences in suggestibility: The closer a suggested distractor was to the original item's representation, the greater was the distractor's suggestive influence. The triad participants' semantic proximity subsequently served as the basis for correctly predicting memory performance in the larger group. Semantic proximity enabled a priori counterintuitive predictions of reverse age-related trends to be confirmed whenever the distance between representations of items in a target pair was greater for younger than for older children.
A psychometric evaluation of the Hospital Anxiety and Depression Scale for the medically hospitalized elderly.

PubMed

Helvik, Anne-Sofie; Engedal, Knut; Skancke, Randi H; Selbæk, Geir

2011-10-01

Few psychometric studies of the Hospital Anxiety and Depression Scale (HADS) scale have been performed with clinical samples of elderly individuals. The participants were 484 elderly (65-101 years, 241 men) patients in an acute medical unit. The HADS, the Montgomery-Aasberg Depression Rating Scale (MADRS) and questionnaires assessing quality of life, functional impairment, and cognitive function were used. The psychometric evaluation of the HADS included the following analyses: 1) the internal construct validity by means of principal component analysis followed by an oblique rotation and corrected item-total correlation; 2) the internal consistency reliability by means of the alpha coefficient (Cronbach's) and 3) concurrent validity by means of Spearman's rho. We found a two-factor solution explaining 45% of the variance. Six of seven items loaded adequately (≥0.40) on the HADS-A subscale (item 7 did not) and five of seven items loaded adequately on the HADS-D subscale (items 8 and 10 did not). Cronbach's alpha for the HADS-A and HADS-D subscale was 0.78 and 0.71, respectively. The correlation between HADS-D and the MADRS, a measure of the concurrent validity, was 0.51. The HADS appears to differentiate well between depression and anxiety. The internal consistency of the HADS in a sample of elderly persons was as satisfactory as it is in samples with younger persons. In contrast to younger samples, item 8 ("I feel as if I have slowed down") did not load adequately on the HADS-D subscale. This may be attributed to the way elderly people experience and describe their symptoms.
Development of the Brief Bipolar Disorder Symptom Scale for patients with bipolar disorder.

PubMed

Dennehy, Ellen B; Suppes, Trisha; Crismon, M Lynn; Toprac, Marcia; Carmody, Thomas J; Rush, A John

2004-06-30

The Brief Bipolar Disorder Symptom Scale (BDSS) is a 10-item measure of symptom severity that was derived from the 24-item Brief Psychiatric Rating Scale (BPRS24). It was developed for clinical use in settings where systematic evaluation is desired within the constraints of a brief visit. The psychometric properties of the BDSS were evaluated in 409 adult outpatients recruited from 19 clinics within the public mental health system of Texas, as part of the Texas Medication Algorithm Project (TMAP). The selection process for individual items is discussed in detail, and was based on multiple analyses, including principal components analysis with varimax rotation. Selection of the final items considered the statistical strength and factor loading of items within each of those factors as well as the need for comprehensive coverage of critical symptoms of bipolar disorder. The BDSS demonstrated good psychometric properties in this preliminary investigation. It demonstrated a strong association with the BPRS24 and performed similarly to the BPRS24 in its relationship to other symptom measures. The BDSS demonstrated superior sensitivity to symptom change, and an excellent level of agreement for classification of patients as either responders or non-responders with the BPRS24. Copyright 2004 Elsevier Ireland Ltd.
Development and psychometric properties rating scale of “clinical competency evaluation in mental health nurses”: Exploratory factor analysis

PubMed Central

Moskoei, Sara; Mohtashami, Jamileh; Ghalenoeei, Mahdie; Nasiri, Maliheh; Tafreshi, Mansoreh Zaghari

2017-01-01

Introduction Evaluation of clinical competency in nurses has a distinct importance in healthcare due to its significant impact on improving the quality of patient care and creation of opportunities for professional promotion. This is a psychometric study for development of the “Clinical Competency of Mental Health Nursing”(CCMHN) rating scale. Methods In this methodological research that was conducted in 2015, in Tehran, Iran, the main items were developed after literature review and the validity and reliability of the tool were identified. The face, content (content validity ratio and content validity index) and construct validities were calculated. For face and content validity, experts’ comments were used. Exploratory factor analysis was used to determine the construct validity. The reliability of scale was determined by the internal consistency and inter-rater correlation. The collected data were analyzed by SPSS version 16, using descriptive statistical analysis. Results A scale with 45 items in two parts including Emotional/Moral and Specific Care competencies was developed. Content validity ratio and content validity index were 0.88, 0.97 respectively. Exploratory factor analysis indicated two factors: The first factor with 23.93 eigenvalue and second factor with eigenvalue 2.58. Cronbach’s alpha coefficient for determination of internal consistency was 0.98 and the ICC for confirmation inter-rater correlation was 0.98. Conclusion A scale with 45 items and two areas was developed with appropriate validity and reliability. This scale can be used to assess the clinical competency in nursing students and mental health nurses. PMID:28607650
Questionnaire Construction Manual

DTIC Science & Technology

1989-06-01

or the XYZ helmet? ABC helmet - XYZ helmet 5. The M16 is a better rifle than the M14. True False 6. What is your marital status? -$ Single Married...tinuous scale can provide the respondent with guidance as to the directionality of the rating, and offer the respondent greater discrimination as to...a discrimination as the respondent is capable of giving, and the fineness of scoring can be as great as desired. c. Rating scale items usually take
[Development of the role scale for municipal supervising public health nurses].

PubMed

Hatono, Yoko; Suzuki, Hiroko; Masaki, Naoko

2013-05-01

As public health nurses are becoming increasingly decentralized in municipalities, recommendations for allocating supervising public health nurses are being made. This study aimed to develop a scale for measuring the implementation of role of municipal supervising public health nurses and to test its reliability and validity. Scale items were developed using results of a qualitative inductive analysis of interview data, and the items were then revised following an examination of content validity by experts, resulting in a provisional scale of 17 items. A self-administered, written questionnaire was then completed by supervising public health nurses or public health nurses holding the most senior positions in all municipalities nationwide, with the exception of three prefectures in the Tohoku region (total 1,621 locations). In total, 1,036 responses were received, and 931 were used for analysis (valid response rate = 57.4%). Of these, 406 were completed by supervising public health nurses. After deleting one item as a result of item analysis and conducting principal component analysis, factor analysis was conducted using the major factor method and Promax rotation. One item with high loading on multiple factors was deleted, resulting in a scale comprising 15 items and 3 factors. The cumulative contribution ratio was 56.10%. The three factors were labeled "Promotion of health activities across the whole locality," "Coordination as a PHN role leader," and "Development of the skills of public health nurses". The reliability coefficient of the RMSP (Role Scale for Municipal Supervising Public Health Nurses) as a whole was 0.84 using the split-half method (Spearman-Brown formula) and 0.91 using Cronbach's alpha, confirming internal consistency. In terms of validity, an examination was conducted of the correlation of two RMSP scale scores (strength of awareness of role as a supervising public health nurse and confidence as a supervising public health nurse) and scores on existing scales assessing management abilities, and a significant correlation (P < 0.01) was obtained. Additionally, a comparison of the RMSP scores of decentralized local public health nurses according to rank and years of service in areas where there were no supervising public health nurses with the RMSP scores of supervising public health nurses showed that the scores of supervising public health nurses were higher. The developed scale was found to be reliable and valid for measuring the implementation of supervising public health nurses' role.
Validity and reliability of a pilot scale for assessment of multiple system atrophy symptoms.

PubMed

Matsushima, Masaaki; Yabe, Ichiro; Takahashi, Ikuko; Hirotani, Makoto; Kano, Takahiro; Horiuchi, Kazuhiro; Houzen, Hideki; Sasaki, Hidenao

2017-01-01

Multiple system atrophy (MSA) is a rare progressive neurodegenerative disorder for which brief yet sensitive scale is required in order for use in clinical trials and general screening. We previously compared several scales for the assessment of MSA symptoms and devised an eight-item pilot scale with large standardized response mean [handwriting, finger taps, transfers, standing with feet together, turning trunk, turning 360°, gait, body sway]. The aim of the present study is to investigate the validity and reliability of a simple pilot scale for assessment of multiple system atrophy symptoms. Thirty-two patients with MSA (15 male/17 female; 20 cerebellar subtype [MSA-C]/12 parkinsonian subtype [MSA-P]) were prospectively registered between January 1, 2014 and February 28, 2015. Patients were evaluated by two independent raters using the Unified MSA Rating Scale (UMSARS), Scale for Assessment and Rating of Ataxia (SARA), and the pilot scale. Correlations between UMSARS, SARA, pilot scale scores, intraclass correlation coefficients (ICCs), and Cronbach's alpha coefficients were calculated. Pilot scale scores significantly correlated with scores for UMSARS Parts I, II, and IV as well as with SARA scores. Intra-rater and inter-rater ICCs and Cronbach's alpha coefficients remained high (> 0.94) for all measures. The results of the present study indicate the validity and reliability of the eight-item pilot scale, particularly for the assessment of symptoms in patients with early state multiple system atrophy.
Identifying image preferences based on demographic attributes

NASA Astrophysics Data System (ADS)

Fedorovskaya, Elena A.; Lawrence, Daniel R.

2014-02-01

The intent of this study is to determine what sorts of images are considered more interesting by which demographic groups. Specifically, we attempt to identify images whose interestingness ratings are influenced by the demographic attribute of the viewer's gender. To that end, we use the data from an experiment where 18 participants (9 women and 9 men) rated several hundred images based on "visual interest" or preferences in viewing images. The images were selected to represent the consumer "photo-space" - typical categories of subject matter found in consumer photo collections. They were annotated using perceptual and semantic descriptors. In analyzing the image interestingness ratings, we apply a multivariate procedure known as forced classification, a feature of dual scaling, a discrete analogue of principal components analysis (similar to correspondence analysis). This particular analysis of ratings (i.e., ordered-choice or Likert) data enables the investigator to emphasize the effect of a specific item or collection of items. We focus on the influence of the demographic item of gender on the analysis, so that the solutions are essentially confined to subspaces spanned by the emphasized item. Using this technique, we can know definitively which images' ratings have been influenced by the demographic item of choice. Subsequently, images can be evaluated and linked, on one hand, to their perceptual and semantic descriptors, and, on the other hand, to the preferences associated with viewers' demographic attributes.
Quality and Importance of Health Policy, Reform, and Public Health Topics: A Study in Physician Assistant Education.

PubMed

Angerer-Fuenzalida, Frances M

2018-06-01

As key players in a changing US health care system, physician assistants (PAs) must be prepared to act with a clear understanding of health policy as reform changes are enacted. The purpose of this study was to assess the perceptions of graduating PA students about the importance of health policy, reform, and public health and their perception of their preparedness in these areas. The research question was: Do PA students identify these topic areas as important, and, for each topic area, do they feel adequately prepared with sufficient knowledge for clinical practice? Participants in the study included 352 PA students from 14 PA programs randomly selected from 4 geographic regions of the continental United States. A 20-item instrument, the Health Policy Perception Tool, was developed and validated for data collection. Physician assistant students rated content items high on the importance scale and displayed a wide range of ratings on their perceived preparedness in each content area. Health policy/reform items demonstrated the highest disparity, with students indicating that they were least prepared in content areas relating to the Affordable Care Act, such as patient-centered medical home and accountable care organizations. They also rated health system structure/function items as moderately important, but indicated that they were ill prepared on this topic. Public health topics were rated highly on both scales. Physician assistant programs appear to be addressing public health issues well; however, PA education leaders must address the low levels of preparedness in the other areas of health care, specifically those related to health structure/function and health reform.
The Inventory of Depressive Symptomatology, Clinician Rating (IDS-C) and Self-Report (IDS-SR), and the Quick Inventory of Depressive Symptomatology, Clinician Rating (QIDS-C) and Self-Report (QIDS-SR) in public sector patients with mood disorders: a psychometric evaluation.

PubMed

Trivedi, M H; Rush, A J; Ibrahim, H M; Carmody, T J; Biggs, M M; Suppes, T; Crismon, M L; Shores-Wilson, K; Toprac, M G; Dennehy, E B; Witte, B; Kashner, T M

2004-01-01

The present study provides additional data on the psychometric properties of the 30-item Inventory of Depressive Symptomatology (IDS) and of the recently developed Quick Inventory of Depressive Symptomatology (QIDS), a brief 16-item symptom severity rating scale that was derived from the longer form. Both the IDS and QIDS are available in matched clinician-rated (IDS-C30; QIDS-C16) and self-report (IDS-SR30; QIDS-SR16) formats. The patient samples included 544 out-patients with major depressive disorder (MDD) and 402 out-patients with bipolar disorder (BD) drawn from 19 regionally and ethnicically diverse clinics as part of the Texas Medication Algorithm Project (TMAP). Psychometric analyses including sensitivity to change with treatment were conducted. Internal consistencies (Cronbach's alpha) ranged from 0.81 to 0.94 for all four scales (QIDS-C16, QIDS-SR16, IDS-C30 and IDS-SR30) in both MDD and BD patients. Sad mood, involvement, energy, concentration and self-outlook had the highest item-total correlations among patients with MDD and BD across all four scales. QIDS-SR16 and IDS-SR30 total scores were highly correlated among patients with MDD at exit (c = 0.83). QIDS-C16 and IDS-C30 total scores were also highly correlated among patients with MDD (c = 0.82) and patients with BD (c = 0.81). The IDS-SR30, IDS-C30, QIDS-SR16, and QIDS-C16 were equivalently sensitive to symptom change, indicating high concurrent validity for all four scales. High concurrent validity was also documented based on the SF-12 Mental Health Summary score for the population divided in quintiles based on their IDS or QIDS score. The QIDS-SR16 and QIDS-C16, as well as the longer 30-item versions, have highly acceptable psychometric properties and are treatment sensitive measures of symptom severity in depression.
The knowledge, efficacy, and practices instrument for oral health providers: a validity study with dental students.

PubMed

Behar-Horenstein, Linda S; Garvan, Cyndi W; Moore, Thomas E; Catalanotto, Frank A

2013-08-01

Valid and reliable instruments to measure and assess cultural competence for oral health care providers are scarce in the literature, and most published scales have been contested due to a lack of item analysis and internal estimates of reliability. The purposes of this study were, first, to develop a standardized instrument to measure dental students' knowledge of diversity, skills in culturally competent patient-centered communication, and use of culture-centered practices in patient care and, second, to provide preliminary validity support for this instrument. The initial instrument used in this study was a thirty-six-item Likert-scale survey entitled the Knowledge, Efficacy, and Practices Instrument for Oral Health Providers (KEPI-OHP). This instrument is an adaption of an initially thirty-three-item version of the Multicultural Awareness, Knowledge, and Skills Scale-Counselor Edition (MAKSS-CE), a scale that assesses factors related to social justice, cultural differences among clients, and cross-cultural client management. After the authors conducted cognitive and expert interviews, focus groups, pilot testing, and item analysis, their initial instrument was reduced to twenty-eight items. The KEPI-OHP was then distributed to 916 dental students (response rate=48.6 percent) across the United States to measure its reliability and assess its validity. Both exploratory and confirmatory factor analyses were conducted to test the scale's validity. The modification of the survey into a sensible instrument with a relatively clear factor structure using factor analysis resulted in twenty items. A scree test suggested three expressive factors, which were retained for rotation. Bentler's comparative fit and Bentler and Bonnett's non-normed indices were 0.95 and 0.92, respectively. A three-factor solution, including efficacy of assessment, knowledge of diversity, and culture-centered practice subscales, comprised of twenty-items was identified. The KEPI-OHP was found to have reasonable internal consistency reliability to warrant its use for baseline and repeated measures in assessing changes in dental students' growth in cultural competence across four-year dental curricula.
Measuring disease progression in early Parkinson disease: the National Institutes of Health Exploratory Trials in Parkinson Disease (NET-PD) experience.

PubMed

Parashos, Sotirios A; Luo, Sheng; Biglan, Kevin M; Bodis-Wollner, Ivan; He, Bo; Liang, Grace S; Ross, G Webster; Tilley, Barbara C; Shulman, Lisa M

2014-06-01

Optimizing assessments of rate of progression in Parkinson disease (PD) is important in designing clinical trials, especially of potential disease-modifying agents. To examine the value of measures of impairment, disability, and quality of life in assessing progression in early PD. Inception cohort analysis of data from 413 patients with early, untreated PD who were enrolled in 2 multicenter, randomized, double-blind clinical trials. Participants were randomly assigned to 1 of 5 treatments (67 received creatine, 66 received minocycline, 71 received coenzyme Q10, 71 received GPI-1485, and 138 received placebo). We assessed the association between the rates of change in measures of impairment, disability, and quality of life and time to initiation of symptomatic treatment. Time between baseline assessment and need for the initiation of symptomatic pharmaceutical treatment for PD was the primary indicator of disease progression. After adjusting for baseline confounding variables with regard to the Unified Parkinson's Disease Rating Scale (UPDRS) Part II score, the UPDRS Part III score, the modified Rankin Scale score, level of education, and treatment group, we assessed the rate of change for the following measurements: the UPDRS Part II score; the UPDRS Part III score; the Schwab and England Independence Scale score (which measures activities of daily living); the Total Functional Capacity scale; the 39-item Parkinson's Disease Questionnaire, summary index, and activities of daily living subscale; and version 2 of the 12-item Short Form Health Survey Physical Summary and Mental Summary. Variables reaching the statistical threshold in univariate analysis were entered into a multivariable Cox proportional hazards model using time to symptomatic treatment as the dependent variable. More rapid change (ie, worsening) in the UPDRS Part II score (hazard ratio, 1.15 [95% CI, 1.08-1.22] for 1 scale unit change per 6 months), the UPDRS Part III score (hazard ratio, 1.09 [95% CI, 1.06-1.13] for 1 scale unit change per 6 months), and the Schwab and England Independence Scale score (hazard ratio, 1.29 [95% CI, 1.12-1.48] for 5 percentage point change per 6 months) was associated with earlier need for symptomatic therapy. AND RELEVANCE In early PD, the UPDRS Part II score and Part III score and the Schwab and England Independence Scale score can be used to measure disease progression, whereas the 39-item Parkinson's Disease Questionnaire and summary index, Total Functional Capacity scale, and the 12-item Short Form Health Survey Physical Summary and Mental Summary are not sensitive to change. clinicaltrials.gov Identifiers: NCT00063193 and NCT00076492.
The Revised Body Awareness Rating Questionnaire: Development Into a Unidimensional Scale Using Rasch Analysis.

PubMed

Dragesund, Tove; Strand, Liv Inger; Grotle, Margreth

2018-02-01

The Body Awareness Rating Questionnaire (BARQ) is a self-report questionnaire aimed at capturing how people with long-lasting musculoskeletal pain reflect on their own body awareness. Methods based on classical test theory were applied to the development of the instrument and resulted in 4 subscales. However, the scales were not correlated, and construct validity might be questioned. The primary purpose of this study was to explore the possibility of developing a unidimensional scale from items initially collected for the BARQ using Rasch analysis. A secondary purpose was to investigate the test-retest reliability of a revised version of the BARQ. This was a methodological study. Rasch and reliability analyses were performed for 3 samples of participants with long-lasting musculoskeletal pain. The first Rasch analysis was carried out on 66 items generated for the original BARQ and scored by 300 participants. The items supported by the first analysis were scored by a new group of 127 participants and analyzed in a second Rasch analysis. For the test-retest reliability analysis, 48 participants scored the revised BARQ items twice within 1 week. The 2-step Rasch analysis resulted in a unidimensional 12-item revised version of the BARQ with a 4-point response scale (scores from 0 to 36). It showed a good fit to the Rasch model, with acceptable internal consistency, satisfactory fit residuals, and no disordered thresholds. Test-retest reliability was high, with an intraclass correlation coefficient of .83 (95% CI = .71-.89) and a smallest detectable change of 6.3 points. The small sample size in the second Rasch analysis was a study limitation. The revised BARQ is a unidimensional and feasible measurement of body awareness, recommended for use in the context of body-mind physical therapy approaches for musculoskeletal conditions. © 2017 American Physical Therapy Association

An item-response theory approach to safety climate measurement: The Liberty Mutual Safety Climate Short Scales.

PubMed

Huang, Yueng-Hsiang; Lee, Jin; Chen, Zhuo; Perry, MacKenna; Cheung, Janelle H; Wang, Mo

2017-06-01

Zohar and Luria's (2005) safety climate (SC) scale, measuring organization- and group- level SC each with 16 items, is widely used in research and practice. To improve the utility of the SC scale, we shortened the original full-length SC scales. Item response theory (IRT) analysis was conducted using a sample of 29,179 frontline workers from various industries. Based on graded response models, we shortened the original scales in two ways: (1) selecting items with above-average discriminating ability (i.e. offering more than 6.25% of the original total scale information), resulting in 8-item organization-level and 11-item group-level SC scales; and (2) selecting the most informative items that together retain at least 30% of original scale information, resulting in 4-item organization-level and 4-item group-level SC scales. All four shortened scales had acceptable reliability (≥0.89) and high correlations (≥0.95) with the original scale scores. The shortened scales will be valuable for academic research and practical survey implementation in improving occupational safety. Copyright © 2017 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Development of a Physical Environmental Observational Tool for Dining Environments in Long-Term Care Settings.

PubMed

Chaudhury, Habib; Keller, Heather; Pfisterer, Kaylen; Hung, Lillian

2017-11-10

This paper presents the first standardized physical environmental assessment tool titled Dining Environment Audit Protocol (DEAP) specifically designed for dining spaces in care homes and reports the results of its psychometric properties. Items rated include: adequacy of lighting, glare, personal control, clutter, staff supervision support, restraint use, and seating arrangement option for social interaction. Two scales summarize the prior items and rate the overall homelikeness and functionality of the space. Ten dining rooms in three long-term care homes were selected for assessment. Data were collected over 11 days across 5 weeks. Two trained assessors completed DEAP independently on the same day. Interrater-reliability was completed for lighting, glare, space, homelike aspects, seating arrangements and the two summary scales, homelikeness and functionality of the space. For categorical measures, measure responses were dichotomized at logical points and Cohen's Kappa and concordance on ratings were determined. The two overall rating scales on homelikeness and functionality of space were found to be reliable intraclass correlation coefficient (ICC) (~0.7). The mean rating for homelikeness for Assessor 1 was 3.5 (SD 1.35) and for functionality of the room was 5.3. (SD 0.82; median 5.5). The findings indicate that the tool's interrater-reliability scores are promising. The high concordance on the overall scores for homelikeness and functionality is indicative of the strength of the individual items in generating a reliable global assessment score on these two important aspects of the dining space. © The Author 2017. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Measuring critical thinking in pre-registration midwifery students: A multi-method approach.

PubMed

Carter, Amanda G; Creedy, Debra K; Sidebotham, Mary

2018-02-01

Test the concurrent validity of three newly developed tools (student self-rating, preceptor rating, and reflective writing) that aim to measure critical thinking in midwifery practice. A descriptive matched cohort design was used. Australian research intensive university offering a three year Bachelor of Midwifery programme. Fifty-five undergraduate midwifery students. Students assessed their ability to apply critical thinking in midwifery practice using a 25-item tool and a 5-item subscale in Motivated Strategies for Learning Questionnaire. Clinical preceptors completed a 24-item tool assessing the students' application of critical thinking in practice. Reflective writing by students was assessed by midwifery academics using a 15-item tool. Internal reliability, and concurrent validity were assessed. Correlations, t-tests, multiple regression and confidence levels were calculated for the three scales and associations with student characteristics. The three scales achieved good internal reliability with a Cronbach's alpha coefficient between 0.93 and 0.97. Matched total scores for the three critical thinking scales were moderately correlated; student/preceptor (r=0.36, p<0.01); student/reflective writing (r=0.38, p<0.01); preceptor/reflective writing (r=0.30, p<0.05). All critical thinking mean scores were higher for students with a previous degree, but only significant for reflective writing (t (53)=-2.35, p=0.023). Preceptor ratings were predictive of GPA (beta=0.50, p<0.001, CI=0.10 to 0.30). Students' self-rating scores were predictive of year level (beta=0.32, p<0.05, CI=0.00 to 0.03). The student, preceptor, and reflective writing tools were found to be reliable and valid measures of critical thinking. The three tools can be used individually or in combination to provide students with various sources of feedback to improve their practice. The tools allow formative measurement of critical thinking over time. Further testing of the tools with larger, diverse samples is recommended. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
Psychometric properties of the pain stages of change questionnaire as evaluated by rasch analysis in patients with chronic musculoskeletal pain

PubMed Central

2014-01-01

Background Our objective was to evaluate the measurement properties of the Pain Stages of Change Questionnaire (PSOCQ) and its four subscales Precontemplation, Contemplation, Action and Maintenance. Methods A total of 231 patients, median age 42 years, with chronic musculoskeletal pain responded to the 30 items in PSOCQ. Thresholds for item scores, and unidimensionality and invariance of the PSOCQ and its four subscales were evaluated by Rasch analysis, partial credit model. Results The items had disordered threshold and needed to be rescored. The 30 items in the PSOCQ did not fit the Rasch model Chi- square item trait statistics. All subscales fitted the Rasch models. The associations to pain (11 point numeric rating scale), emotional distress (Hopkins symptom check list v 25) and self-efficacy (Arthritis Self-Efficacy Scale) were highest for the Precontemplation subscale. Conclusion The present analysis revealed that all four subscales in PSOCQ fitted the Rasch model. No common construct for all subscales were identified, but the Action and Maintenance subscales were closely related. PMID:24646065
Low Quality of Basic Caregiving Environments in Child Care: Actual Reality or Artifact of Scoring?

ERIC Educational Resources Information Center

Norris, Deborah J.; Guss, Shannon

2016-01-01

Quality Rating Improvement Systems (QRIS) frequently include the Infant-Toddler Environment Rating Scale-Revised (ITERS-R) as part of rating and improving child care quality. However, studies utilizing the ITERS-R consistently report low quality, especially for basic caregiving items. This research examined whether the low scores reflected the…
Caught between Stages: Relational Aggression Emerging as a Developmental Advance in At-Risk Preschoolers

ERIC Educational Resources Information Center

Carpenter, Erika M.; Nangle, Douglas W.

2006-01-01

Eighty-two Head Start preschoolers were assessed with a peer rating measure of sociometric status, the Social Skills Rating System for Teachers (Gresham & Elliott, 1990), an Overt Aggression scale culled from items from the Aggressive Behavior subscale of the CBCL-TRF (Achenbach, 1997), and teacher ratings of relational aggression (Crick,…
Mental Illness Stigma Expressed by Police to Police.

PubMed

Stuart, Heather

2017-01-01

This paper describes mental health related stigma expressed by police to police using a newly developed 11-item Police Officer Stigma Scale and reports on the preliminary psychometric properties (factor structure and internal reliability) of this scale. The scale used an indirect measurement approach adapted from the Perceived Devaluation and Discrimination Scale. Five themes appropriate to police culture were adapted and six additional items were added. Responses were rated on a 5-point agreement scale with an additional don't know option. Data were collected from officers attending a mandatory workshop (90.5% response). Exploratory factor analysis showed the scale to be unidimensional and internally reliable (Cronbach's alpha was 0.82). The most endorsed items pertained to avoiding disclosure to a supervisor/manager or to a colleague (85% agreement), that most officers would expect discrimination at work (62%), and that most officers would not want a supervisor or manager who had a mental illness (62%). Findings highlight that (a) Police-to-police mental illness stigma may be a particularly strong feature of police cultures; (b) police should be a focus for targeted anti-stigma interventions; and (c) though further psychometric testing is needed, the Police Office Stigma Scale may provide important insights into the nature and functioning of police-to-police stigma in police cultures in future research.
Ordinal-To-Interval Scale Conversion Tables and National Items for the New Zealand Version of the WHOQOL-BREF

PubMed Central

Billington, D. Rex; Hsu, Patricia Hsien-Chuan; Feng, Xuan Joanna; Medvedev, Oleg N.; Kersten, Paula; Landon, Jason; Siegert, Richard J.

2016-01-01

The World Health Organisation Quality of Life (WHOQOL) questionnaires are widely used around the world and can claim strong cross-cultural validity due to their development in collaboration with international field centres. To enhance conceptual equivalence of quality of life across cultures, optional national items are often developed for use alongside the core instrument. The present study outlines the development of national items for the New Zealand WHOQOL-BREF. Focus groups with members of the community as well as health experts discussed what constitutes quality of life in their opinion. Based on themes extracted of aspects not contained in the existing WHOQOL instrument, 46 candidate items were generated and subsequently rated for their importance by a random sample of 585 individuals from the general population. Applying importance criteria reduced these items to 24, which were then sent to another large random sample (n = 808) to be rated alongside the existing WHOQOL-BREF. A final set of five items met the criteria for national items. Confirmatory factor analysis identified four national items as belonging to the psychological domain of quality of life, and one item to the social domain. Rasch analysis validated these results and generated ordinal-to-interval conversion algorithms to allow use of parametric statistics for domain scores with and without national items. PMID:27812203
New Interview and Observation Measures of the Broader Autism Phenotype: Description of Strategy and Reliability Findings for the Interview Measures.

PubMed

Parr, Jeremy R; De Jonge, Maretha V; Wallace, Simon; Pickles, Andrew; Rutter, Michael L; Le Couteur, Ann S; van Engeland, Herman; Wittemeyer, Kerstin; McConachie, Helen; Roge, Bernadette; Mantoulan, Carine; Pedersen, Lennart; Isager, Torben; Poustka, Fritz; Bolte, Sven; Bolton, Patrick; Weisblatt, Emma; Green, Jonathan; Papanikolaou, Katerina; Baird, Gillian; Bailey, Anthony J

2015-10-01

Clinical genetic studies confirm the broader autism phenotype (BAP) in some relatives of individuals with autism, but there are few standardized assessment measures. We developed three BAP measures (informant interview, self-report interview, and impression of interviewee observational scale) and describe the development strategy and findings from the interviews. International Molecular Genetic Study of Autism Consortium data were collected from families containing at least two individuals with autism. Comparison of the informant and self-report interviews was restricted to samples in which the interviews were undertaken by different researchers from that site (251 UK informants, 119 from the Netherlands). Researchers produced vignettes that were rated blind by others. Retest reliability was assessed in 45 participants. Agreement between live scoring and vignette ratings was very high. Retest stability for the interviews was high. Factor analysis indicated a first factor comprising social-communication items and rigidity (but not other repetitive domain items), and a second factor comprised mainly of reading and spelling impairments. Whole scale Cronbach's alphas were high for both interviews. The correlation between interviews for factor 1 was moderate (adult items 0.50; childhood items 0.43); Kappa values for between-interview agreement on individual items were mainly low. The correlations between individual items and total score were moderate. The inclusion of several factor 2 items lowered the overall Cronbach's alpha for the total set. Both interview measures showed good reliability and substantial stability over time, but the findings were better for factor 1 than factor 2. We recommend factor 1 scores be used for characterising the BAP. © 2015 The Authors Autism Research published by Wiley Periodicals, Inc. on behalf of International Society for Autism Research.
Fluoxetine increases suicide ideation less than placebo during treatment of adults with minor depressive disorder.

PubMed

Garlow, Steven J; Kinkead, Becky; Thase, Michael E; Judd, Lewis L; Rush, A John; Yonkers, Kimberly A; Kupfer, David J; Frank, Ellen; Schettler, Pamela J; Rapaport, Mark Hyman

2013-09-01

Some reports suggest an increase in suicide ideations and behaviors in patients treated with antidepressants. This is an analysis of the impact of fluoxetine on suicide ideations in outpatients with minor depressive disorder. Research subjects were adult outpatients with minor depressive disorder (N = 162), who received fluoxetine or placebo in a prospective, 12-week, double-blind randomized trial. The research participants were evaluated weekly with standard rating scales that included four suicide-related items: item 3 of the Hamilton Rating Scale for Depression (HRSD), item 18 of Inventory of Depressive Symptomatology (IDS-C), and items 15 and 59 of the Hopkins Symptom Checklist (SCL-90). Clinically significant intensification of suicide ideation was defined as an increase of ≥2 points on any of these items. Overall 60/162 subjects (37%) had an increase of ≥1 point during treatment and 17/162 (10.5%) of ≥2 points on at least one suicide item, with 12/81 (14.8%) placebo and 5/81 (6.2%) fluoxetine-treated subjects having a ≥2 point gain. Of the study participants with baseline suicide ideation, 9/22 (40.9%) placebo and 3/24 (12.5%) fluoxetine treated had ≥2 point increase (p = 0.04). Survival analysis revealed that subjects on placebo were significantly more likely (p = 0.050) to experience a ≥2 point increase on one or more item, a difference that emerged early and continued throughout the 12-week trial. Compared to placebo, fluoxetine was not associated with a clinically significant increase in suicide ideation among adults with minor depressive disorder during 12 weeks of treatment. Copyright © 2013 Elsevier Ltd. All rights reserved.
Fluoxetine Increases Suicide Ideation Less than Placebo During Treatment of Adults with Minor Depressive Disorder

PubMed Central

Garlow, Steven J.; Kinkead, Becky; Thase, Michael E.; Judd, Lewis L.; Rush, A. John; Yonkers, Kimberly A.; Kupfer, David J.; Frank, Ellen; Schettler, Pamela J.; Rapaport, Mark Hyman

2013-01-01

Objective Some reports suggest an increase in suicide ideations and behaviors in patients treated with antidepressants. This is an analysis of the impact of fluoxetine on suicide ideations in outpatients with Minor Depressive Disorder. Methods Research subjects were adult outpatients with Minor Depressive Disorder (N=162), who received fluoxetine or placebo in a prospective, 12-week, double blind randomized trial. The research participants were evaluated weekly with standard rating scales that included 4 suicide-related items; item 3 of the Hamilton Rating Scale for Depression (HRSD), item 18 of Inventory of Depressive Symptomatology (IDS-C), and items 15 and 59 of the Hopkins Symptom Checklist (SCL-90). Clinically significant intensification of suicide ideation was defined as an increase of ≥2 on any of these items. Results Overall 60/162 subjects (37%) had an increase of ≥1 point during treatment and 17/162 (10.5%) of ≥2 points on at least one suicide item, with 12/81 (14.8%) placebo and 5/81 (6.2%) fluoxetine treated subjects having a ≥2 point gain. Of the study participants with baseline suicide ideation, 9/22 (40.9%) placebo and 3/24 (12.5%) fluoxetine treated had ≥2 point increase (p=0.04). Survival analysis revealed that subjects on placebo were significantly more likely (p=0.050) to experience a ≥2 point increase on one or more item, a difference that emerged early and continued throughout the 12-week trial. Conclusions Compared to placebo, fluoxetine was not associated with a clinically significant increase in suicide ideation among adults with Minor Depressive Disorder during 12 weeks of treatment. PMID:23786912
Development of a measure of model fidelity for mental health Crisis Resolution Teams.

PubMed

Lloyd-Evans, Brynmor; Bond, Gary R; Ruud, Torleif; Ivanecka, Ada; Gray, Richard; Osborn, David; Nolan, Fiona; Henderson, Claire; Mason, Oliver; Goater, Nicky; Kelly, Kathleen; Ambler, Gareth; Morant, Nicola; Onyett, Steve; Lamb, Danielle; Fahmy, Sarah; Brown, Ellie; Paterson, Beth; Sweeney, Angela; Hindle, David; Fullarton, Kate; Frerichs, Johanna; Johnson, Sonia

2016-12-01

Crisis Resolution Teams (CRTs) provide short-term intensive home treatment to people experiencing mental health crisis. Trial evidence suggests CRTs can be effective at reducing hospital admissions and increasing satisfaction with acute care. When scaled up to national level however, CRT implementation and outcomes have been variable. We aimed to develop and test a fidelity scale to assess adherence to a model of best practice for CRTs, based on best available evidence. A concept mapping process was used to develop a CRT fidelity scale. Participants (n = 68) from a range of stakeholder groups prioritised and grouped statements (n = 72) about important components of the CRT model, generated from a literature review, national survey and qualitative interviews. These data were analysed using Ariadne software and the resultant cluster solution informed item selection for a CRT fidelity scale. Operational criteria and scoring anchor points were developed for each item. The CORE CRT fidelity scale was then piloted in 75 CRTs in the UK to assess the range of scores achieved and feasibility for use in a 1-day fidelity review process. Trained reviewers (n = 16) rated CRT service fidelity in a vignette exercise to test the scale's inter-rater reliability. There were high levels of agreement within and between stakeholder groups regarding the most important components of the CRT model. A 39-item measure of CRT model fidelity was developed. Piloting indicated that the scale was feasible for use to assess CRT model fidelity and had good face validity. The wide range of item scores and total scores across CRT services in the pilot demonstrate the measure can distinguish lower and higher fidelity services. Moderately good inter-rater reliability was found, with an estimated correlation between individual ratings of 0.65 (95% CI: 0.54 to 0.76). The CORE CRT Fidelity Scale has been developed through a rigorous and systematic process. Promising initial testing indicates its value in assessing adherence to a model of CRT best practice and to support service improvement monitoring and planning. Further research is required to establish its psychometric properties and international applicability.
Mokken scaling of the Myocardial Infarction Dimensional Assessment Scale (MIDAS).

PubMed

Thompson, David R; Watson, Roger

2011-02-01

The purpose of this study was to examine the hierarchical and cumulative nature of the 35 items of the Myocardial Infarction Dimensional Assessment Scale (MIDAS), a disease-specific health-related quality of life measure. Data from 668 participants who completed the MIDAS were analysed using the Mokken Scaling Procedure, which is a computer program that searches polychotomous data for hierarchical and cumulative scales on the basis of a range of diagnostic criteria. Fourteen MIDAS items were retained in a Mokken scale and these items included physical activity, insecurity, emotional reaction and dependency items but excluded items related to diet, medication or side-effects. Item difficulty, in item response theory terms, ran from physical activity items (low difficulty) to insecurity, suggesting that the most severe quality of life effect of myocardial infarction is loneliness and isolation. Items from the MIDAS form a strong and reliable Mokken scale, which provides new insight into the relationship between items in the MIDAS and the measurement of quality of life after myocardial infarction. © 2010 Blackwell Publishing Ltd.
Development and initial validation of primary care provider mental illness management and team-based care self-efficacy scales.

PubMed

Loeb, Danielle F; Crane, Lori A; Leister, Erin; Bayliss, Elizabeth A; Ludman, Evette; Binswanger, Ingrid A; Kline, Danielle M; Smith, Meredith; deGruy, Frank V; Nease, Donald E; Dickinson, L Miriam

Develop and validate self-efficacy scales for primary care provider (PCP) mental illness management and team-based care participation. We developed three self-efficacy scales: team-based care (TBC), mental illness management (MIM), and chronic medical illness (CMI). We developed the scales using Bandura's Social Cognitive Theory as a guide. The survey instrument included items from previously validated scales on team-based care and mental illness management. We administered a mail survey to 900 randomly selected Colorado physicians. We conducted exploratory principal factor analysis with oblique rotation. We constructed self-efficacy scales and calculated standardized Cronbach's alpha coefficients to test internal consistency. We calculated correlation coefficients between the MIM and TBC scales and previously validated measures related to each scale to evaluate convergent validity. We tested correlations between the TBC and the measures expected to correlate with the MIM scale and vice versa to evaluate discriminant validity. PCPs (n=402, response rate=49%) from diverse practice settings completed surveys. Items grouped into factors as expected. Cronbach's alphas were 0.94, 0.88, and 0.83 for TBC, MIM, and CMI scales respectively. In convergent validity testing, the TBC scale was correlated as predicted with scales assessing communications strategies, attitudes toward teams, and other teamwork indicators (r=0.25 to 0.40, all statistically significant). Likewise, the MIM scale was significantly correlated with several items about knowledge and experience managing mental illness (r=0.24 to 41, all statistically significant). As expected in discriminant validity testing, the TBC scale had only very weak correlations with the mental illness knowledge and experience managing mental illness items (r=0.03 to 0.12). Likewise, the MIM scale was only weakly correlated with measures of team-based care (r=0.09 to.17). This validation study of MIM and TBC self-efficacy scales showed high internal validity and good construct validity. Copyright © 2016 Elsevier Inc. All rights reserved.
A Measure of Barriers Toward Medical Disclosure Among Health Professionals in the United Arab Emirates.

PubMed

Zaghloul, Ashraf Ahmad; Elsergany, Moetaz; Mosallam, Rasha

2018-03-01

There has been a growing awareness that patients are subject to injuries that can be prevented as a direct consequence of health care. Error disclosure is an effective technique to restore the lost trust with the health care system. The current study aimed to develop a valid and reliable scale to determine the factors facilitating the disclosure of health professionals in health organizations. This study had a cross-sectional design that consisted of 722 responses (response rate of 68.3%) from 1 private and 1 public hospital in Sharjah, United Arab Emirates. The data collection tool included 23 items rated on a Likert scale ranging from 5, strongly agree, to 1, strongly disagree.The internal consistency was established through calculating the split-half reliability for part 1 (12 items), which had a Cronbach coefficient of 0.65, and part 2 (11 items), which had a Cronbach coefficient of 0.62. Scale validity was assessed with the Kaiser-Meyer-Olkin measure of sampling adequacy, which had a value of 0.62, and the Bartlett test of sphericity (approximated χ = 13012.2, P = 0.0001) supported the factorability of the correlation matrix. The varimax rotation revealed 5 components that explained 77.8% of the total variance. The varimax rotation revealed 21 items loaded on the following 5 factors: fear of disclosure and provider image consequences (factor 1), apology (factor 2), organizational culture toward patient safety (factor 3), professional ethics and transparency (factor 4), as well as patient and provider education (factor 5). The disclosure of medical mistakes requires preliminary considerations to effectively and compassionately disclose these events to patients. The validity and reliability of the results support the use of this scale at hospitals as part of the health care providers' disclosure processes.
The Social, Emotional and Behavioural Difficulties of Primary School Children with Poor Attendance Records

ERIC Educational Resources Information Center

Carroll, H. C. M.

2013-01-01

Two complementary studies of poor and better attenders are presented. To measure emotional and behavioural difficulties (EBD) different teacher-completed rating scales were employed, and to determine social difficulties, the studies used sociometry and some items from the scales. One study had a longitudinal design. It revealed that, after…
Construction and Validation of an Observational Scale of Neighborhood Characteristics

ERIC Educational Resources Information Center

McDonell, James R.; Waters, Tracy J.

2011-01-01

This paper reports the development and validation of the Neighborhood Observation Scale, a 41 item measure of neighborhood physical appearance, social appearance, safety, and amenities. Three independent ratings were collected on each of 244 neighborhoods in 132 census block groups in five South Carolina counties, for a total of 732 observations.…
A Brief Measure of Children's Behavior Problems: The Behavior Rating Index for Children.

ERIC Educational Resources Information Center

Stiffman, Arlene R.; And Others

1984-01-01

Describes the development of the Behavior Rating Index for Children (BRIC), a 13-item summated category partition scale that provides a prothetic measure of children's behavior problems. Evaluation of the BRIC with 600 referred and nonreferred children suggested adequate reliability and validity. (JAC)
Characterizing somatization, hypochondriasis, and hysteria in the borderline personality disorder.

PubMed

Snyder, S; Pitts, W M

1986-03-01

Somatization, hypochondriasis, and hysteria have often been considered as associated features of the borderline personality disorder. This study was designed to characterize these three syndromes in the borderline patient. Inpatients with DSM-III borderline personality disorder were compared with controls with dysthymic disorder. Scales and items from standardized rating instruments which measured the three syndromes were scored and compared between groups. Although the hysteria-obvious and hypochondriasis scales of the MMPI and the Hamilton Depression Scale item measuring hypochondriasis were elevated in the borderline group, there were no significant differences between groups. Scores of dysthymic patients significantly exceeded those of borderline patients on four of five MMPI codetypes measuring the three syndromes. Findings are discussed in light of previous psychodynamic, empirical, and research literature.
The Validity and Reliability of the Violence Risk Scale-Sexual Offender Version: Assessing Sex Offender Risk and Evaluating Therapeutic Change

ERIC Educational Resources Information Center

Olver, Mark E.; Wong, Stephen C. P.; Nicholaichuk, Terry; Gordon, Audrey

2007-01-01

The Violence Risk Scale-Sexual Offender version (VRS-SO) is a rating scale designed to assess risk and predict sexual recidivism, to measure and link treatment changes to sexual recidivism, and to inform the delivery of sexual offender treatment. The VRS-SO comprises 7 static and 17 dynamic items empirically or conceptually linked to sexual…

Rasch analysis of the Patient Rated Elbow Evaluation questionnaire.

PubMed

Vincent, Joshua I; MacDermid, Joy C; King, Graham J W; Grewal, Ruby

2015-06-20

The Patient Rated Elbow Evaluation (PREE) was developed as an elbow joint specific measure of pain and disability and validated with classical psychometric methods. More recently, Rasch analysis has contributed new methods for analyzing the clinical measurement properties of self-report outcome measures. The objective of the study was to determine aspects of validity of the PREE using the Rasch model to assess the overall fit of the PREE data, the response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 236 patients (Age range 21-79 years; M: F- 97:139) with elbow disorders were recruited from the Roth│McFarlane Hand and Upper Limb Centre, London, Ontario, Canada. The baseline scores of the PREE were used. Rasch analysis was conducted using RUMM 2030 software on the 3 sub scales of the PREE separately. The 3 sub scales showed misfit initially with disordered thresholds on17 out of 20 items), uniform DIF was observed for two items ("Carrying a 10lbs object" from specific activities subscale for age group; and "household work" from the usual activities subscale for gender); multidimensionality and local dependency. The Pain subscale satisfied Rasch expectations when item 2 "Pain - At rest" was split for age group, while the usual activities subscale readily stood up to Rasch requirements when the item 2 "household work" was split for gender. The specific activities subscale demonstrated fit to the Rasch model when sub test analysis accounted for local dependency. All three subscales of the PREE were well targeted and had high reliability (PSI >0.80). The three subscales of the PREE appear to be robust when tested against the Rasch model when subject to a few alterations. The value of changing the 0-10 format is questionable given its widespread use; further Rasch-based analysis of whether these findings are stable in other samples is warranted.
Item response theory analysis applied to the Spanish version of the Personal Outcomes Scale.

PubMed

Guàrdia-Olmos, J; Carbó-Carreté, M; Peró-Cebollero, M; Giné, C

2017-11-01

The study of measurements of quality of life (QoL) is one of the great challenges of modern psychology and psychometric approaches. This issue has greater importance when examining QoL in populations that were historically treated on the basis of their deficiency, and recently, the focus has shifted to what each person values and desires in their life, as in cases of people with intellectual disability (ID). Many studies of QoL scales applied in this area have attempted to improve the validity and reliability of their components by incorporating various sources of information to achieve consistency in the data obtained. The adaptation of the Personal Outcomes Scale (POS) in Spanish has shown excellent psychometric attributes, and its administration has three sources of information: self-assessment, practitioner and family. The study of possible congruence or incongruence of observed distributions of each item between sources is therefore essential to ensure a correct interpretation of the measure. The aim of this paper was to analyse the observed distribution of items and dimensions from the three Spanish POS information sources cited earlier, using the item response theory. We studied a sample of 529 people with ID and their respective practitioners and family member, and in each case, we analysed items and factors using Samejima's model of polytomic ordinal scales. The results indicated an important number of items with differential effects regarding sources, and in some cases, they indicated significant differences in the distribution of items, factors and sources of information. As a result of this analysis, we must affirm that the administration of the POS, considering three sources of information, was adequate overall, but a correct interpretation of the results requires that it obtain much more information to consider, as well as some specific items in specific dimensions. The overall ratings, if these comments are considered, could result in bias. © 2017 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Correlates of body mass index in women with fibromyalgia.

PubMed

Timmerman, Gayle M; Calfa, Nicolina A; Stuifbergen, Alexa K

2013-01-01

Excess weight in women with fibromyalgia syndrome (FMS) may further contribute to joint pain and fatigue. However, there is little research addressing weight issues in this population. This study examined the relationship of body mass index (BMI) to quality of life. Quality of life was measured by the 36-Item Short Form Health Survey, severity of FMS, nutritional intake, Barriers to Health Promoting Behaviors for Disabled Persons Scale (BS), and self-efficacy for health-promoting behaviors (Self-Rated Abilities for Health Practices Scale) in women with FMS. Baseline data were collected on 179 women diagnosed with FMS. Controlling for age, BMI was significantly (p < .05) correlated with 36-Item Short Form Health Survey subscales of physical functioning, bodily pain and vitality, severity of FMS using the Tender Point Index, calories, protein, fat, saturated fat, BS, and Self-Rated Abilities for Health Practices Scale subscale for exercise. The findings support a growing body of evidence that excess weight is negatively related to quality of life and pain in women with FMS.
Sources of self-efficacy belief: development and validation of two scales.

PubMed

Liu, Ou Lydia; Wilson, Mark

2010-01-01

Self-efficacy belief has been an instrumental affective factor in predicting student behavior and achievement in academic settings. Although there is abundant literature on efficacy belief per se, the sources of efficacy belief have not been fully researched. Very few instruments exist to quantify the sources of efficacy-beliefs. To fill this void, we developed two scales for the two main sources of self-efficacy belief: past performance and social persuasion. Pilot test data were collected from 255 middle school students. A self-efficacy measure was also administered to the students as a criterion measure. The Rasch rating scale model was used to analyze the data. Information on item fit, item design, content validity, external validity, internal consistency, and person separation reliability was examined. The two scales displayed satisfactory psychometric properties. Applications and limitations of these two scales are also discussed.
Assessing risk markers in intimate partner femicide and severe violence: a new assessment instrument.

PubMed

Echeburúa, Enrique; Fernández-Montalvo, Javier; de Corral, Paz; López-Goñi, José J

2009-06-01

The aim of this study is to develop a scale to predict intimate partner femicide and severe violence. The sample consists of 1,081 batterer men who were reported to the police station. First, the most significant differences between the severe violence group (n = 269) and the less severe violence group (n = 812) in sociodemographic variables are determined. Both aggressors and victims of the severe violence group have a higher rate of immigration. Second, the proposed 20-item scale is derived from a larger 58-item scale, where only the most discriminative items between severe and nonsevere intimate partner violence are taken into account. Psychometric properties of reliability and validity are rather good. Cutoff scores have been proposed according to sensitivity and specificity. This easy-to-use tool appears to be suitable to the requirements of criminal justice professionals and is intended for use in safety planning. Implications of these results for further research are discussed.
Developing a Psychometric Instrument to Measure Physical Education Teachers' Job Demands and Resources.

PubMed

Zhang, Tan; Chen, Ang

2017-01-01

Based on the job demands-resources model, the study developed and validated an instrument that measures physical education teachers' job demands-resources perception. Expert review established content validity with the average item rating of 3.6/5.0. Construct validity and reliability were determined with a teacher sample ( n = 397). Exploratory factor analysis established a five-dimension construct structure matching the theoretical construct deliberated in the literature. The composite reliability scores for the five dimensions range from .68 to .83. Validity coefficients (intraclass correlational coefficients) are .69 for job resources items and .82 for job demands items. Inter-scale correlational coefficients range from -.32 to .47. Confirmatory factor analysis confirmed the construct validity with high dimensional factor loadings (ranging from .47 to .84 for job resources scale and from .50 to .85 for job demands scale) and adequate model fit indexes (root mean square error of approximation = .06). The instrument provides a tool to measure physical education teachers' perception of their working environment.
Developing a Psychometric Instrument to Measure Physical Education Teachers’ Job Demands and Resources

PubMed Central

Zhang, Tan; Chen, Ang

2017-01-01

Based on the job demands–resources model, the study developed and validated an instrument that measures physical education teachers’ job demands–resources perception. Expert review established content validity with the average item rating of 3.6/5.0. Construct validity and reliability were determined with a teacher sample (n = 397). Exploratory factor analysis established a five-dimension construct structure matching the theoretical construct deliberated in the literature. The composite reliability scores for the five dimensions range from .68 to .83. Validity coefficients (intraclass correlational coefficients) are .69 for job resources items and .82 for job demands items. Inter-scale correlational coefficients range from −.32 to .47. Confirmatory factor analysis confirmed the construct validity with high dimensional factor loadings (ranging from .47 to .84 for job resources scale and from .50 to .85 for job demands scale) and adequate model fit indexes (root mean square error of approximation = .06). The instrument provides a tool to measure physical education teachers’ perception of their working environment. PMID:29200808
Marital happiness and sleep disturbances in a multi-ethnic sample of middle-aged women.

PubMed

Troxel, Wendy M; Buysse, Daniel J; Hall, Martica; Matthews, Karen A

2009-01-01

Previous research suggests that divorced individuals, particularly women, have higher rates of sleep disturbances as compared to married individuals. Among the married, however, little is known about the association between relationship quality and sleep. The present study examined the association between marital happiness and self-reported sleep disturbances in a sample of midlife women drawn from the Study of Women's Health Across the Nation (SWAN), a multi-site, multi-ethnic, community-based study (N = 2,148). Marital happiness was measured using a single item from the Dyadic Adjustment Scale, and sleep disturbance was assessed using 4 items from the Women's Health Initiative Insomnia Rating Scale (WHIIRS). After controlling for relevant covariates, maritally happy women reported fewer sleep disturbances, with the association evident among Caucasian women and to a lesser extent among African American women.
Development of the AGREE II, part 2: assessment of validity of items and tools to support application

PubMed Central

Brouwers, Melissa C.; Kho, Michelle E.; Browman, George P.; Burgers, Jako S.; Cluzeau, Françoise; Feder, Gene; Fervers, Béatrice; Graham, Ian D.; Hanna, Steven E.; Makarski, Julie

2010-01-01

Background We established a program of research to improve the development, reporting and evaluation of practice guidelines. We assessed the construct validity of the items and user’s manual in the β version of the AGREE II. Methods We designed guideline excerpts reflecting high-and low-quality guideline content for 21 of the 23 items in the tool. We designed two study packages so that one low-quality and one high-quality version of each item were randomly assigned to each package. We randomly assigned 30 participants to one of the two packages. Participants reviewed and rated the guideline content according to the instructions of the user’s manual and completed a survey assessing the manual. Results In all cases, content designed to be of high quality was rated higher than low-quality content; in 18 of 21 cases, the differences were significant (p < 0.05). The manual was rated by participants as appropriate, easy to use, and helpful in differentiating guidelines of varying quality, with all scores above the mid-point of the seven-point scale. Considerable feedback was offered on how the items and manual of the β-AGREE II could be improved. Interpretation The validity of the items was established and the user’s manual was rated as highly useful by users. We used these results and those of our study presented in part 1 to modify the items and user’s manual. We recommend AGREE II (available at www.agreetrust.org) as the revised standard for guideline development, reporting and evaluation. PMID:20513779
Two types of squalor: findings from a factor analysis of the Environmental Cleanliness and Clutter Scale (ECCS).

PubMed

Snowdon, John; Halliday, Graeme; Hunt, Glenn E

2013-07-01

Most people who collect and hoard, and then have difficulty discarding items, do not live in squalor, even though accumulation of hoarded items can make cleaning very difficult. Commonly, people living in squalor accumulate garbage, but relatively few fulfill proposed criteria for "hoarding disorder." We examined the overlap between hoarding and squalor among people referred because of unacceptable living conditions. Ongoing collection of data by a Squalor Project team, including ratings on the Environmental Cleanliness and Clutter Scale (ECCS), allowed (1) description of characteristics of cases and (2) examination of ratings of uncleanliness, and of the effect of accumulation of items or material on access within dwellings. Principal component analysis was used to examine latent variables underlying the ECCS. The mean age of the referred occupants (108 male, 95 female) was 61.9 years. The mean ECCS score in 186 rated cases was 18.5. Factor analysis of ECCS data showed a two-factor solution as the most plausible. Factor 1, comprising seven squalor items, accounted for 33.7% of the variance. Factor 2 comprised reduced accessibility and accumulation of items of little value (variance 17.6%). Accumulation of garbage loaded equally on the two factors. High levels of squalor and/or accumulation were recorded in 105 (56%) of the 186 dwellings. One-third scored high on accumulation/hoarding, while 38% scored high on squalor; 15% scored high on both squalor and accumulation. A quarter of those scoring high on squalor scored low on hoarding/accumulation. The ECCS is useful when describing whether referred cases show high levels of squalor, hoarding, or both.
Polytomous Latent Scales for the Investigation of the Ordering of Items

ERIC Educational Resources Information Center

Ligtvoet, Rudy; van der Ark, L. Andries; Bergsma, Wicher P.; Sijtsma, Klaas

2011-01-01

We propose three latent scales within the framework of nonparametric item response theory for polytomously scored items. Latent scales are models that imply an invariant item ordering, meaning that the order of the items is the same for each measurement value on the latent scale. This ordering property may be important in, for example,…
Comparability of Mayo-Portland Adaptability Inventory ratings by staff, significant others and people with acquired brain injury.

PubMed

Malec, James F

2004-06-01

To determine the internal consistency, reliability and comparability of the Mayo-Portland Adaptability Inventory (MPAI-4) and sub-scales completed by people with acquired brain injury (ABI), family and significant others (SO) and rehabilitation staff. 134 people with ABI consecutively seen for outpatient rehabilitation evaluation. MPAI-4 protocols based on independent ratings by the people with ABI undergoing evaluation, SO and rehabilitation staff were submitted to Rasch Facets analysis to determine the internal consistency of the overall measure and sub-scales (Ability, Adjustment and Participation indices) for each rater group and for a composite measure based on all rater groups. Rater agreement for individual items was also examined. Rasch indicators of internal consistency were entirely within acceptable limits for 3-rater composite full scale and sub-scale measures; these indicators were generally within acceptable limits for measures based on a single rater group. Item agreement was generally acceptable; disagreements suggested various sources of bias for specific rater groups. The MPAI-4 possesses satisfactory internal consistency regardless of rating source. A composite measure based on ratings made independently by people with ABI, SO and staff may serve as a 'gold standard' for research purposes. In the clinical setting, assessment of varying perspectives and biases may not only best represent outcome as evaluated by all parties involved but be essential to developing effective rehabilitation plans.
Psychometric properties of a short version of the HIV stigma scale, adapted for children with HIV infection.

PubMed

Wiklander, Maria; Rydström, Lise-Lott; Ygge, Britt-Marie; Navér, Lars; Wettergren, Lena; Eriksson, Lars E

2013-11-14

HIV is a stigmatizing medical condition. The concept of HIV stigma is multifaceted, with personalized stigma (perceived stigmatizing consequences of others knowing of their HIV status), disclosure concerns, negative self-image, and concerns with public attitudes described as core aspects of stigma for individuals with HIV infection. There is limited research on HIV stigma in children. The aim of this study was to test a short version of the 40-item HIV Stigma Scale (HSS-40), adapted for 8-18 years old children with HIV infection living in Sweden. A Swedish version of the HSS-40 was adapted for children by an expert panel and evaluated by think aloud interviews. A preliminary short version with twelve items covering the four dimensions of stigma in the HSS-40 was tested. The psychometric evaluation included inspection of missing values, principal component analysis (PCA), internal consistency, and correlations with measures of health-related quality of life (HRQoL). Fifty-eight children, representing 71% of all children with HIV infection in Sweden meeting the inclusion criteria, completed the 12-item questionnaire. Four items concerning participants' experiences of others' reactions to their HIV had unacceptable rates of missing values and were therefore excluded. The remaining items constituted an 8-item scale, the HIV Stigma Scale for Children (HSSC-8), measuring HIV-related disclosure concerns, negative self-image, and concerns with public attitudes. Evidence for internal validity was supported by a PCA, suggesting a three factor solution with all items loading on the same subscales as in the original HSS-40. The scale demonstrated acceptable internal consistency, with exception for the disclosure concerns subscale. Evidence for external validity was supported in correlational analyses with measures of HRQoL, where higher levels of stigma correlated with poorer HRQoL. The results suggest feasibility, reliability, as well as internal and external validity of the HSSC-8, an HIV stigma scale for children with HIV infection, measuring disclosure concerns, negative self-image, and concerns with public attitudes. The present study shows that different aspects of HIV stigma can be assessed among children with HIV in the age group 8-18.
The trucker strain monitor: an occupation-specific questionnaire measuring psychological job strain.

PubMed

De Croon, E M; Blonk, R W; Van der Beek, J; Frings-Dresen, M H

2001-08-01

To develop and validate a short and user-friendly questionnaire measuring psychological job strain in truck drivers. In cooperation with an occupational physician in the Dutch road transport industry we developed items on the basis of face validity and information of existing questionnaires on the subject. These items were pilot-tested, by means of interviews, in 15 truck drivers. Study I examined the factorial structure of the initial 30-item trucker strain monitor (TSM) in a sample of 153 truck drivers. Subsequently, number of items per factor was reduced on the basis of reliability analyses (Cronbach's alpha). Study II examined construct and criterion validity of the TSM in a randomly selected group of 2,000 truck drivers, of whom 1,111 participated (adjusted response = 63%). Additionally, sensitivity and specificity were assessed by examining the ability of the TSM to identify truck drivers with or without self-reported sickness absence in the past 12 months because of psychological complaints. Factor analyses of the initial 30-item TSM revealed a two-factor solution. Item reduction resulted in a six-item work-related fatigue scale and four-item sleeping problems scale with high internal consistency. Results of study II confirmed the internal consistency of the TSM scales and provided support for construct and criterion validity. The composite, work-related fatigue, and sleeping problems scale had a sensitivity of 83%, 80% and 71% respectively, in identifying truck drivers with prior sickness absence because of psychological complaints. Specificity rates were 72%, 73% and 72% respectively. Despite methodological limitations, the results suggest that the TSM is a reliable and valid indicator of psychological job strain in truck drivers. In particular, the composite and work-related fatigue scale identified drivers with prior absenteeism because of psychological complaints, quite accurately. Future longitudinal research in specific sub-groups of truck drivers including both self-reported and objective psychological health measures should evidence whether (1) the distinction between two indicators of psychological job strain is useful, and whether (2) the TSM can be used in screening out truck drivers at risk of developing psychological health problems.
Comparison of alternative versions of the job demand-control scales in 17 European cohort studies: the IPD-Work consortium.

PubMed

Fransson, Eleonor I; Nyberg, Solja T; Heikkilä, Katriina; Alfredsson, Lars; Bacquer, De Dirk; Batty, G David; Bonenfant, Sébastien; Casini, Annalisa; Clays, Els; Goldberg, Marcel; Kittel, France; Koskenvuo, Markku; Knutsson, Anders; Leineweber, Constanze; Magnusson Hanson, Linda L; Nordin, Maria; Singh-Manoux, Archana; Suominen, Sakari; Vahtera, Jussi; Westerholm, Peter; Westerlund, Hugo; Zins, Marie; Theorell, Töres; Kivimäki, Mika

2012-01-20

Job strain (i.e., high job demands combined with low job control) is a frequently used indicator of harmful work stress, but studies have often used partial versions of the complete multi-item job demands and control scales. Understanding whether the different instruments assess the same underlying concepts has crucial implications for the interpretation of findings across studies, harmonisation of multi-cohort data for pooled analyses, and design of future studies. As part of the 'IPD-Work' (Individual-participant-data meta-analysis in working populations) consortium, we compared different versions of the demands and control scales available in 17 European cohort studies. Six of the 17 studies had information on the complete scales and 11 on partial scales. Here, we analyse individual level data from 70 751 participants of the studies which had complete scales (5 demand items, 6 job control items). We found high Pearson correlation coefficients between complete scales of job demands and control relative to scales with at least three items (r > 0.90) and for partial scales with two items only (r = 0.76-0.88). In comparison with scores from the complete scales, the agreement between job strain definitions was very good when only one item was missing in either the demands or the control scale (kappa > 0.80); good for job strain assessed with three demand items and all six control items (kappa > 0.68) and moderate to good when items were missing from both scales (kappa = 0.54-0.76). The sensitivity was > 0.80 when only one item was missing from either scale, decreasing when several items were missing in one or both job strain subscales. Partial job demand and job control scales with at least half of the items of the complete scales, and job strain indices based on one complete and one partial scale, seemed to assess the same underlying concepts as the complete survey instruments.
Comparison of alternative versions of the job demand-control scales in 17 European cohort studies: the IPD-Work consortium

PubMed Central

2012-01-01

Background Job strain (i.e., high job demands combined with low job control) is a frequently used indicator of harmful work stress, but studies have often used partial versions of the complete multi-item job demands and control scales. Understanding whether the different instruments assess the same underlying concepts has crucial implications for the interpretation of findings across studies, harmonisation of multi-cohort data for pooled analyses, and design of future studies. As part of the 'IPD-Work' (Individual-participant-data meta-analysis in working populations) consortium, we compared different versions of the demands and control scales available in 17 European cohort studies. Methods Six of the 17 studies had information on the complete scales and 11 on partial scales. Here, we analyse individual level data from 70 751 participants of the studies which had complete scales (5 demand items, 6 job control items). Results We found high Pearson correlation coefficients between complete scales of job demands and control relative to scales with at least three items (r > 0.90) and for partial scales with two items only (r = 0.76-0.88). In comparison with scores from the complete scales, the agreement between job strain definitions was very good when only one item was missing in either the demands or the control scale (kappa > 0.80); good for job strain assessed with three demand items and all six control items (kappa > 0.68) and moderate to good when items were missing from both scales (kappa = 0.54-0.76). The sensitivity was > 0.80 when only one item was missing from either scale, decreasing when several items were missing in one or both job strain subscales. Conclusions Partial job demand and job control scales with at least half of the items of the complete scales, and job strain indices based on one complete and one partial scale, seemed to assess the same underlying concepts as the complete survey instruments. PMID:22264402
An evaluation of the quick inventory of depressive symptomatology and the hamilton rating scale for depression: a sequenced treatment alternatives to relieve depression trial report.

PubMed

Rush, A John; Bernstein, Ira H; Trivedi, Madhukar H; Carmody, Thomas J; Wisniewski, Stephen; Mundt, James C; Shores-Wilson, Kathy; Biggs, Melanie M; Woo, Ada; Nierenberg, Andrew A; Fava, Maurizio

2006-03-15

Nine DSM-IV-TR criterion symptom domains are evaluated to diagnose major depressive disorder (MDD). The Quick Inventory of Depressive Symptomatology (QIDS) provides an efficient assessment of these domains and is available as a clinician rating (QIDS-C16), a self-report (QIDS-SR16), and in an automated, interactive voice response (IVR) (QIDS-IVR16) telephone system. This report compares the performance of these three versions of the QIDS and the 17-item Hamilton Rating Scale for Depression (HRSD17). Data were acquired at baseline and exit from the first treatment step (citalopram) in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) trial. Outpatients with nonpsychotic MDD who completed all four ratings within +/-2 days were identified from the first 1500 STAR*D subjects. Both item response theory and classical test theory analyses were conducted. The three methods for obtaining QIDS data produced consistent findings regarding relationships between the nine symptom domains and overall depression, demonstrating interchangeability among the three methods. The HRSD17, while generally satisfactory, rarely utilized the full range of item scores, and evidence suggested multidimensional measurement properties. In nonpsychotic MDD outpatients without overt cognitive impairment, clinician assessment of depression severity using either the QIDS-C16 or HRSD17 may be successfully replaced by either the self-report or IVR version of the QIDS.
Construction and evaluation of a self rating scale for stress-induced Exhaustion Disorder, the Karolinska Exhaustion Disorder Scale

PubMed Central

Besèr, Aniella; Sorjonen, Kimmo; Wahlberg, Kristina; Peterson, Ulla; Nygren, Åke; Åsberg, Marie

2014-01-01

Prolonged stress (≥ six months) may cause a condition which has been named exhaustion disorder (ED) with ICD-10 code F43.8. ED is characterised by exhaustion, cognitive problems, poor sleep and reduced tolerance to further stress. ED can cause long term disability and depressive symptoms may develop. The aim was to construct and evaluate a self-rating scale, the Karolinska Exhaustion Disorder Scale (KEDS), for the assessment of ED symptoms. A second aim was to examine the relationship between self-rated symptoms of ED, depression, and anxiety using KEDS and the Hospital Anxiety and Depression Scale (HAD). Items were selected based on their correspondence to criteria for ED as formulated by the Swedish National Board of Health and Welfare (NBHW), with seven response alternatives in a Likert-format. Self-ratings performed by 317 clinically assessed participants were used to analyse the scale’s psychometric properties. KEDS consists of nine items with a scale range of 0–54. Receiver operating characteristics analysis demonstrated that a cut-off score of 19 was accompanied by high sensitivity and specificity (each above 95%) in the discrimination between healthy subjects and patients with ED. Reliability was satisfactory and confirmatory factor analysis revealed that ED, depression and anxiety are best regarded as different phenomena. KEDS may be a useful tool in the assessment of symptoms of Exhaustion Disorder in clinical as well as research settings. There is evidence that the symptom clusters of ED, anxiety and depression, respectively, reflect three different underlying dimensions. PMID:24236500
Classical test theory and Rasch analysis validation of the Upper Limb Functional Index in subjects with upper limb musculoskeletal disorders.

PubMed

Bravini, Elisabetta; Franchignoni, Franco; Giordano, Andrea; Sartorio, Francesco; Ferriero, Giorgio; Vercelli, Stefano; Foti, Calogero

2015-01-01

To perform a comprehensive analysis of the psychometric properties and dimensionality of the Upper Limb Functional Index (ULFI) using both classical test theory and Rasch analysis (RA). Prospective, single-group observational design. Freestanding rehabilitation center. Convenience sample of Italian-speaking subjects with upper limb musculoskeletal disorders (N=174). Not applicable. The Italian version of the ULFI. Data were analyzed using parallel analysis, exploratory factor analysis, and RA for evaluating dimensionality, functioning of rating scale categories, item fit, hierarchy of item difficulties, and reliability indices. Parallel analysis revealed 2 factors explaining 32.5% and 10.7% of the response variance. RA confirmed the failure of the unidimensionality assumption, and 6 items out of the 25 misfitted the Rasch model. When the analysis was rerun excluding the misfitting items, the scale showed acceptable fit values, loading meaningfully to a single factor. Item separation reliability and person separation reliability were .98 and .89, respectively. Cronbach alpha was .92. RA revealed weakness of the scale concerning dimensionality and internal construct validity. However, a set of 19 ULFI items defined through the statistical process demonstrated a unidimensional structure, good psychometric properties, and clinical meaningfulness. These findings represent a useful starting point for further analyses of the tool (based on modern psychometric approaches and confirmatory factor analysis) in larger samples, including different patient populations and nationalities. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Validation of 5-item and 2-item questionnaires in Chinese version of Dizziness Handicap Inventory for screening objective benign paroxysmal positional vertigo.

PubMed

Chen, Wei; Shu, Liang; Wang, Qian; Pan, Hui; Wu, Jing; Fang, Jie; Sun, Xu-Hong; Zhai, Yu; Dong, You-Rong; Liu, Jian-Ren

2016-08-01

As possible candidate screening instruments for benign paroxysmal positional vertigo (BPPV), studies to validate the Dizziness Handicap Inventory (DHI) sub-scale (5-item and 2-item) and total scores are rare in China. From May 2014 to December 2014, 108(55 with and 53 without BPPV) patients complaining of episodic vertigo in the past week from a vertigo outpatient clinic were enrolled for DHI evaluation, as well as demographic and other clinical data. Objective BPPV was subsequently determined by positional evoking maneuvers under the record of optical Frenzel glasses. Cronbach's coefficient α was used to evaluate the reliability of psychometric scales. The validity of DHI total, 5-item and 2-item questionnaires to screen for BPPV was assessed by receiver operating characteristic (ROC) curves. It revealed that the DHI 5-item questionnaire had good internal consistency (Cronbach's coefficient α = 0.72). Area under the curve of total DHI, 5-item and 2-item scores for discriminating BPPV from those without was 0.678 (95 % CI 0.578-0.778), 0.873(95 % CI 0.807-0.940) and 0.895(95 % CI 0.836-0.953), respectively. It revealed 74.5 % sensitivity and 88.7 % specificity in separating BPPV and those without, with a cutoff value of 12 in the 5-item questionnaire. The corresponding rate of sensitivity and specificity was 78.2 and 88.7 %, respectively, with a cutoff value of 6 in 2-item questionnaire. The present study indicated that both 5-item and 2-item questionnaires in the Chinese version of DHI may be more valid than DHI total score for screening objective BPPV and merit further application in clinical practice in China.

Psychometrics of the self-report safe driving behavior measure for older adults.

PubMed

Classen, Sherrilene; Wen, Pey-Shan; Velozo, Craig A; Bédard, Michel; Winter, Sandra M; Brumback, Babette; Lanford, Desiree N

2012-01-01

We investigated the psychometric properties of the 68-item Safe Driving Behavior Measure (SDBM) with 80 older drivers, 80 caregivers, and 2 evaluators from two sites. Using Rasch analysis, we examined unidimensionality and local dependence; rating scale; item- and person-level psychometrics; and item hierarchy of older drivers, caregivers, and driving evaluators who had completed the SDBM. The evidence suggested the SDBM is unidimensional, but pairs of items showed local dependency. Across the three rater groups, the data showed good person (≥3.4) and item (≥3.6) separation as well as good person (≥.93) and item reliability (≥.92). Cronbach's α was ≥.96, and few items were misfitting. Some of the items did not follow the hypothesized order of item difficulty. The SDBM classified the older drivers into six ability levels, but to fully calibrate the instrument it must be refined in terms of its items (e.g., item exclusion) and then tested among participants of lesser ability. Copyright © 2012 by the American Occupational Therapy Association, Inc.
A large-scale, long-term study of scale drift: The micro view and the macro view

NASA Astrophysics Data System (ADS)

He, W.; Li, S.; Kingsbury, G. G.

2016-11-01

The development of measurement scales for use across years and grades in educational settings provides unique challenges, as instructional approaches, instructional materials, and content standards all change periodically. This study examined the measurement stability of a set of Rasch measurement scales that have been in place for almost 40 years. In order to investigate the stability of these scales, item responses were collected from a large set of students who took operational adaptive tests using items calibrated to the measurement scales. For the four scales that were examined, item samples ranged from 2183 to 7923 items. Each item was administered to at least 500 students in each grade level, resulting in approximately 3000 responses per item. Stability was examined at the micro level analysing change in item parameter estimates that have occurred since the items were first calibrated. It was also examined at the macro level, involving groups of items and overall test scores for students. Results indicated that individual items had changes in their parameter estimates, which require further analysis and possible recalibration. At the same time, the results at the total score level indicate substantial stability in the measurement scales over the span of their use.
Validation of the CMT Pediatric Scale as an outcome measure of disability

PubMed Central

Burns, Joshua; Ouvrier, Robert; Estilow, Tim; Shy, Rosemary; Laurá, Matilde; Pallant, Julie F.; Lek, Monkol; Muntoni, Francesco; Reilly, Mary M.; Pareyson, Davide; Acsadi, Gyula; Shy, Michael E.; Finkel, Richard S.

2012-01-01

Objective Charcot-Marie-Tooth disease (CMT) is a common heritable peripheral neuropathy. There is no treatment for any form of CMT although clinical trials are increasingly occurring. Patients usually develop symptoms during the first two decades of life but there are no established outcome measures of disease severity or response to treatment. We identified a set of items that represent a range of impairment levels and conducted a series of validation studies to build a patient-centered multi-item rating scale of disability for children with CMT. Methods As part of the Inherited Neuropathies Consortium, patients aged 3–20 years with a variety of CMT types were recruited from the USA, UK, Italy and Australia. Initial development stages involved: definition of the construct, item pool generation, peer review and pilot testing. Based on data from 172 patients, a series of validation studies were conducted, including: item and factor analysis, reliability testing, Rasch modeling and sensitivity analysis. Results Seven areas for measurement were identified (strength, dexterity, sensation, gait, balance, power, endurance), and a psychometrically robust 11-item scale constructed (Charcot-Marie-Tooth disease Pediatric Scale: CMTPedS). Rasch analysis supported the viability of the CMTPedS as a unidimensional measure of disability in children with CMT. It showed good overall model fit, no evidence of misfitting items, no person misfit and it was well targeted for children with CMT. Interpretation The CMTPedS is a well-tolerated outcome measure that can be completed in 25-minutes. It is a reliable, valid and sensitive global measure of disability for children with CMT from the age of 3 years. PMID:22522479
Readability and Comprehension of the Geriatric Depression Scale and PROMIS® Physical Function Items in Older African Americans and Latinos

PubMed Central

Paz, Sylvia H.; Jones, Loretta; Calderón, José L.; Hays, Ron D.

2016-01-01

Background Depression and physical function are especially important health domains for the elderly. The Geriatric Depression Scale (GDS) and the Patient-Reported Outcomes Measurement Information System (PROMIS®) Physical Function Item Bank are two surveys commonly used to measure these domains. It is unclear if these two instruments adequately measure these aspects of health in minority elderly. Objective To estimate the readability of the GDS and PROMIS® Physical Function items and to assess their comprehensibility by a sample of African American and Latino elderly. Methods Readability was estimated using the Flesch-Kincaid (F-K) and Flesch-Reading-Ease (FRE) formulae for English versions, and a Spanish adaptation of the FRE formula for the Spanish versions. Comprehension of the GDS and PROMIS items by minority elderly was evaluated with 30 cognitive interviews. Results Readability estimates of a number of items in English and Spanish of the GDS and PROMIS physical functioning items exceed the recommended 5th grade level, or were rated as fairly difficult, difficult, or very difficult to read. Cognitive interviews revealed that many participants felt that more than the two (yes/no) GDS response options were needed to answer the questions. Wording of several PROMIS items was considered confusing and responses potentially uninterpretable because they were based on physical aids. Conclusions Problems with item wording and response options of the GDS and PROMIS Physical Function items may negatively affect reliability and validity of measurement when used with minority elderly. PMID:27599978
Development and validation of the Consumer Quality index instrument to measure the experience and priority of chronic dialysis patients.

PubMed

van der Veer, Sabine N; Jager, Kitty J; Visserman, Ella; Beekman, Robert J; Boeschoten, Els W; de Keizer, Nicolette F; Heuveling, Lara; Stronks, Karien; Arah, Onyebuchi A

2012-08-01

Patient experience is an established indicator of quality of care. Validated tools that measure both experiences and priorities are lacking for chronic dialysis care, hampering identification of negative experiences that patients actually rate important. We developed two Consumer Quality (CQ) index questionnaires, one for in-centre haemodialysis (CHD) and the other for peritoneal dialysis and home haemodialysis (PHHD) care. The instruments were validated using exploratory factor analyses, reliability analysis of identified scales and assessing the association between reliable scales and global ratings. We investigated opportunities for improvement by combining suboptimal experience with patient priority. Sixteen dialysis centres participated in our study. The pilot CQ index for CHD care consisted of 71 questions. Based on data of 592 respondents, we identified 42 core experience items in 10 scales with Cronbach's α ranging from 0.38 to 0.88; five were reliable (α ≥ 0.70). The instrument identified information on centres' fire procedures as the aspect of care exhibiting the biggest opportunity for improvement. The pilot CQ index PHHD comprised 56 questions. The response of 248 patients yielded 31 core experience items in nine scales with Cronbach's α ranging between 0.53 and 0.85; six were reliable. Information on kidney transplantation during pre-dialysis showed most room for improvement. However, for both types of care, opportunities for improvement were mostly limited. The CQ index reliably and validly captures dialysis patient experience. Overall, most care aspects showed limited room for improvement, mainly because patients participating in our study rated their experience to be optimal. To evaluate items with high priority, but with which relatively few patients have experience, more qualitative instruments should be considered.
Convergent and discriminant validity and reliability of the pediatric anxiety rating scale in youth with autism spectrum disorders.

PubMed

Storch, Eric A; Wood, Jeffrey J; Ehrenreich-May, Jill; Jones, Anna M; Park, Jennifer M; Lewin, Adam B; Murphy, Tanya K

2012-11-01

The psychometric properties of the Pediatric Anxiety Rating Scale (PARS), a clinician-administered measure for assessing severity of anxiety symptoms, were examined in 72 children and adolescents diagnosed with an autism spectrum disorder (ASD). The internal consistency of the PARS was 0.59, suggesting that the items were related but not repetitive. The PARS showed high 26-day test-retest (ICC = 0.83) and inter-rater reliability (ICC = 0.86). The PARS was strongly correlated with clinician-ratings of overall anxiety severity and parent-report anxiety measures, supporting convergent validity. Results for divergent validity were mixed. Although the PARS was not associated with the sum of the Social and Communication items on the Autism Diagnostic Observation System, it was moderately correlated with parent-reported inattention, aggression and externalizing behavior. Overall, these results suggest that the psychometric properties of the PARS are adequate for assessing anxiety symptoms in youth with ASD, although additional clarification of divergent validity is needed.
A preliminary study to measure and develop job satisfaction scale for medical teachers

PubMed Central

Bhatnagar, Kavita; Srivastava, Kalpana; Singh, Amarjit; Jadav, S.L.

2011-01-01

Background: Job satisfaction of medical teachers has an impact on quality of medical education and patient care. In this background, the study was planned to develop scale and measure job satisfaction status of medical teachers. Materials and Methods: To generate items pertaining to the scale of job satisfaction, closed-ended and open-ended questionnaires were administered to medical professionals. The job satisfaction questionnaire was developed and rated on Likert type of rating scale. Both quantitative and qualitative methods were used to ascertain job satisfaction among 245 health science faculty of an autonomous educational institution. Factor loading was calculated and final items with strong factor loading were selected. Data were statistically evaluated. Results: Average job satisfaction score was 53.97 on a scale of 1–100. The Cronbach's alpha reliability coefficient was 0.918 for entire set of items. There was statistically significant difference in job satisfaction level across different age groups (P 0.0358) showing a U-shaped pattern and fresh entrants versus reemployed faculty (P 0.0188), former showing lower satisfaction. Opportunity for self-development was biggest satisfier, followed by work, opportunity for promotion, and job security. Factors contributing toward job dissatisfaction were poor utilization of skills, poor promotional prospects, inadequate pay and allowances, work conditions, and work atmosphere. Conclusion: Tertiary care teaching hospitals in autonomous educational institutions need to build infrastructure and create opportunities for their medical professional. Job satisfaction of young entrants needs to be raised further by improving their work environment. This will pave the way for effective delivery of health care. PMID:23271862
An Anesthesia Preinduction Checklist to Improve Information Exchange, Knowledge of Critical Information, Perception of Safety, and Possibly Perception of Teamwork in Anesthesia Teams.

PubMed

Tscholl, David W; Weiss, Mona; Kolbe, Michaela; Staender, Sven; Seifert, Burkhardt; Landert, Daniel; Grande, Bastian; Spahn, Donat R; Noethiger, Christoph B

2015-10-01

An anesthesia preinduction checklist (APIC) to be performed before anesthesia induction was introduced and evaluated with respect to 5 team-level outcomes, each being a surrogate end point for patient safety: information exchange (the percentage of checklist items exchanged by a team, out of 12 total items); knowledge of critical information (the percentage of critical information items out of 5 total items such as allergies, reported as known by the members of a team); team members' perceptions of safety (the median scores given by the members of a team on a continuous rating scale); their perception of teamwork (the median scores given by the members of a team on a continuous rating scale); and clinical performance (the percentage of completed items out of 14 required tasks, e.g., suction device checked). A prospective interventional study comparing anesthesia teams using the APIC with a control group not using the APIC was performed using a multimethod design. Trained observers rated information exchange and clinical performance during on-site observations of anesthesia inductions. After the observations, each team member indicated the critical information items they knew and their perceptions of safety and teamwork. One hundred five teams using the APIC were compared with 100 teams not doing so. The medians of the team-level outcome scores in the APIC group versus the control group were as follows: information exchange: 100% vs 33% (P < 0.001), knowledge of critical information: 100% vs 90% (P < 0.001), perception of safety: 91% vs 84% (P < 0.001), perception of teamwork: 90% vs 86% (P = 0.028), and clinical performance: 93% vs 93% (P = 0.60). This study provides empirical evidence that the use of a preinduction checklist significantly improves information exchange, knowledge of critical information, and perception of safety in anesthesia teams-all parameters contributing to patient safety. There was a trend indicating improved perception of teamwork.
Wiggins Content Scales and the MMPI-2.

PubMed

Kohutek, K J

1992-03-01

The omission of the Wiggins Content Scales occurred because of the number of items deleted as well as the addition of items to the MMPI-2. The purpose of this study is to compare scorings of the items on the Wiggins Scales of the MMPI and the items that remain on these scales on the MMPI-2. The scales of Religious Fundamentalism and Authority Conflict appear to be those most seriously affected by the item change on the MMPI-2. The scales Depression and Family Conflict maintained all of their items, and the remaining nine were not found to be statistically different when the two scorings were compared.
Understanding Interest and Self-Efficacy in the Reading and Writing of Students with Persisting Specific Learning Disabilities during Middle Childhood and Early Adolescence

ERIC Educational Resources Information Center

Abbott, Robert; Mickail, Terry; Richards, Todd; Renninger, K. Ann; Hidi, Suzanne E.; Beers, Scott; Berninger, Virginia

2017-01-01

Three methodological approaches were applied to understand the role of interest and self-efficacy in reading and/or writing in students without and with persisting specific learning disabilities (SLDs) in literacy. For each approach students in grades 4 to 9 completed a survey in which they rated 10 reading items and 10 writing items on a Scale 1…
Comparison of the Efficacy and Safety of Aripiprazole Versus Bupropion Augmentation in Patients With Major Depressive Disorder Unresponsive to Selective Serotonin Reuptake Inhibitors: A Randomized, Prospective, Open-Label Study.

PubMed

Cheon, Eun-Jin; Lee, Kwang-Hun; Park, Young-Woo; Lee, Jong-Hun; Koo, Bon-Hoon; Lee, Seung-Jae; Sung, Hyung-Mo

2017-04-01

The purpose of this study was to compare the efficacy and safety of aripiprazole versus bupropion augmentation in patients with major depressive disorder (MDD) unresponsive to selective serotonin reuptake inhibitors (SSRIs). This is the first randomized, prospective, open-label, direct comparison study between aripiprazole and bupropion augmentation. Participants had at least moderately severe depressive symptoms after 4 weeks or more of SSRI treatment. A total of 103 patients were randomized to either aripiprazole (n = 56) or bupropion (n = 47) augmentation for 6 weeks. Concomitant use of psychotropic agents was prohibited. Montgomery Asberg Depression Rating Scale, 17-item Hamilton Depression Rating scale, Iowa Fatigue Scale, Drug-Induced Extrapyramidal Symptoms Scale, Psychotropic-Related Sexual Dysfunction Questionnaire scores were obtained at baseline and after 1, 2, 4, and 6 weeks of treatment. Overall, both treatments significantly improved depressive symptoms without causing serious adverse events. There were no significant differences in the Montgomery Asberg Depression Rating Scale, 17-item Hamilton Depression Rating scale, and Iowa Fatigue Scale scores, and response rates. However, significant differences in remission rates between the 2 groups were evident at week 6 (55.4% vs 34.0%, respectively; P = 0.031), favoring aripiprazole over bupropion. There were no significant differences in adverse sexual events, extrapyramidal symptoms, or akathisia between the 2 groups. The present study suggests that aripiprazole augmentation is at least comparable to bupropion augmentation in combination with SSRI in terms of efficacy and tolerability in patients with MDD. Both aripiprazole and bupropion could help reduce sexual dysfunction and fatigue in patients with MDD. Aripiprazole and bupropion may offer effective and safe augmentation strategies in patients with MDD who are unresponsive to SSRIs. Double-blinded trials are warranted to confirm the present findings.
Construction and evaluation of a self rating scale for stress-induced exhaustion disorder, the Karolinska Exhaustion Disorder Scale.

PubMed

Besèr, Aniella; Sorjonen, Kimmo; Wahlberg, Kristina; Peterson, Ulla; Nygren, Ake; Asberg, Marie

2014-02-01

Prolonged stress (≥ six months) may cause a condition which has been named exhaustion disorder (ED) with ICD-10 code F43.8. ED is characterised by exhaustion, cognitive problems, poor sleep and reduced tolerance to further stress. ED can cause long term disability and depressive symptoms may develop. The aim was to construct and evaluate a self-rating scale, the Karolinska Exhaustion Disorder Scale (KEDS), for the assessment of ED symptoms. A second aim was to examine the relationship between self-rated symptoms of ED, depression, and anxiety using KEDS and the Hospital Anxiety and Depression Scale (HAD). Items were selected based on their correspondence to criteria for ED as formulated by the Swedish National Board of Health and Welfare (NBHW), with seven response alternatives in a Likert-format. Self-ratings performed by 317 clinically assessed participants were used to analyse the scale's psychometric properties. KEDS consists of nine items with a scale range of 0-54. Receiver operating characteristics analysis demonstrated that a cut-off score of 19 was accompanied by high sensitivity and specificity (each above 95%) in the discrimination between healthy subjects and patients with ED. Reliability was satisfactory and confirmatory factor analysis revealed that ED, depression and anxiety are best regarded as different phenomena. KEDS may be a useful tool in the assessment of symptoms of Exhaustion Disorder in clinical as well as research settings. There is evidence that the symptom clusters of ED, anxiety and depression, respectively, reflect three different underlying dimensions. © 2013 The Authors. Scandinavian Journal of Psychology published by Scandinavian Psychological Associations and John Wiley & Sons Ltd.
Development of a job stressor scale for nurses caring for patients with intractable neurological diseases.

PubMed

Ando, Yukako; Kataoka, Tsuyoshi; Okamura, Hitoshi; Tanaka, Katsutoshi; Kobayashi, Toshio

2013-12-01

The purpose of this research is to verify the reliability and validity of a job stressor scale for nurses caring for patients with intractable neurological diseases. A mail survey was conducted using a self-report questionnaire. The subjects were 263 nurses and assistant nurses working in wards specializing in intractable neurological diseases. The response rate was 71.9% (valid response rate, 66.2%). With regard to reliability, internal consistency and stability were assessed. Internal consistency was examined via Cronbach's alpha. For stability, the test-retest method was performed and stability was examined via intraclass correlation coefficients. With regard to validity, factor validity, criterion-related validity, and content validity were assessed. Exploratory factor analysis was used for factor validity. For criterion-related validity, an existing scale was used as an external criterion; concurrent validity was examined via Spearman's rank correlation coefficients. As a result of analysis, there were 26 items in the scale created with an eight factor structure. Cronbach's a for the 26 items was 0.90; with the exception of two factors, alpha for all of the individual sub-factors was high at 0.7 or higher. The intraclass correlation coefficient for the 26 items was 0.89 (p < 0.001). With regard to criterion-related validity, concurrent validity was confirmed and the correlation coefficient with an external criterion was 0.73 (p < 0.001). For content validity, subjects who responded that "The questionnaire represents a stressor well or to a degree" accounted for 81% of the total responses. Reliability and validity were confirmed, so the scale created in the current research is a usable scale.
Supervisors' Performance Ratings Correlated with Selected Personal Characteristics of Attendants in a Mental Retardation Developmental Center.

ERIC Educational Resources Information Center

Frederick, Joseph; And Others

A research study investigated the relationship between personal characteristics and selected demographic data of 75 attendants in a mental retardation developmental center and the assessment by 24 administrators of the attendants' job performance. Instruments used included a 20-item Direct Care Performance Scale and the Demographic Data Scale,…
Symptom Dimensions of the Psychotic Symptom Rating Scales in Psychosis: A Multisite Study

PubMed Central

Woodward, Todd S.; Jung, Kwanghee; Hwang, Heungsun; Yin, John; Taylor, Laura; Menon, Mahesh; Peters, Emmanuelle; Kuipers, Elizabeth; Waters, Flavie; Lecomte, Tania; Sommer, Iris E.; Daalman, Kirstin; van Lutterveld, Remko; Hubl, Daniela; Kindler, Jochen; Homan, Philipp; Badcock, Johanna C.; Chhabra, Saruchi; Cella, Matteo; Keedy, Sarah; Allen, Paul; Mechelli, Andrea; Preti, Antonio; Siddi, Sara; Erickson, David

2014-01-01

The Psychotic Symptom Rating Scales (PSYRATS) is an instrument designed to quantify the severity of delusions and hallucinations and is typically used in research studies and clinical settings focusing on people with psychosis and schizophrenia. It is comprised of the auditory hallucinations (AHS) and delusions subscales (DS), but these subscales do not necessarily reflect the psychological constructs causing intercorrelation between clusters of scale items. Identification of these constructs is important in some clinical and research contexts because item clustering may be caused by underlying etiological processes of interest. Previous attempts to identify these constructs have produced conflicting results. In this study, we compiled PSYRATS data from 12 sites in 7 countries, comprising 711 participants for AHS and 520 for DS. We compared previously proposed and novel models of underlying constructs using structural equation modeling. For the AHS, a novel 4-dimensional model provided the best fit, with latent variables labeled Distress (negative content, distress, and control), Frequency (frequency, duration, and disruption), Attribution (location and origin of voices), and Loudness (loudness item only). For the DS, a 2-dimensional solution was confirmed, with latent variables labeled Distress (amount/intensity) and Frequency (preoccupation, conviction, and disruption). The within-AHS and within-DS dimension intercorrelations were higher than those between subscales, with the exception of the AHS and DS Distress dimensions, which produced a correlation that approached the range of the within-scale correlations. Recommendations are provided for integrating these underlying constructs into research and clinical applications of the PSYRATS. PMID:24936086
Factor analysis of the Hamilton Depression Rating Scale in Parkinson's disease.

PubMed

Broen, M P G; Moonen, A J H; Kuijf, M L; Dujardin, K; Marsh, L; Richard, I H; Starkstein, S E; Martinez-Martin, P; Leentjens, A F G

2015-02-01

Several studies have validated the Hamilton Depression Rating Scale (HAMD) in patients with Parkinson's disease (PD), and reported adequate reliability and construct validity. However, the factorial validity of the HAMD has not yet been investigated. The aim of our analysis was to explore the factor structure of the HAMD in a large sample of PD patients. A principal component analysis of the 17-item HAMD was performed on data of 341 PD patients, available from a previous cross sectional study on anxiety. An eigenvalue ≥1 was used to determine the number of factors. Factor loadings ≥0.4 in combination with oblique rotations were used to identify which variables made up the factors. Kaiser-Meyer-Olkin measure (KMO), Cronbach's alpha, Bartlett's test, communality, percentage of non-redundant residuals and the component correlation matrix were computed to assess factor validity. KMO verified the sample's adequacy for factor analysis and Cronbach's alpha indicated a good internal consistency of the total scale. Six factors had eigenvalues ≥1 and together explained 59.19% of the variance. The number of items per factor varied from 1 to 6. Inter-item correlations within each component were low. There was a high percentage of non-redundant residuals and low communality. This analysis demonstrates that the factorial validity of the HAMD in PD is unsatisfactory. This implies that the scale is not appropriate for studying specific symptom domains of depression based on factorial structure in a PD population. Copyright © 2014 Elsevier Ltd. All rights reserved.
Conceptualizing Interprofessional Teams as Multi-Team Systems-Implications for Assessment and Training.

PubMed

West, Courtney; Landry, Karen; Graham, Anna; Graham, Lori; Cianciolo, Anna T; Kalet, Adina; Rosen, Michael; Sherman, Deborah Witt

2015-01-01

SGEA 2015 CONFERENCE ABSTRACT (EDITED). Evaluating Interprofessional Teamwork During a Large-Scale Simulation. Courtney West, Karen Landry, Anna Graham, and Lori Graham. CONSTRUCT: This study investigated the multidimensional measurement of interprofessional (IPE) teamwork as part of large-scale simulation training. Healthcare team function has a direct impact on patient safety and quality of care. However, IPE team training has not been the norm. Recognizing the importance of developing team-based collaborative care, our College of Nursing implemented an IPE simulation activity called Disaster Day and invited other professions to participate. The exercise consists of two sessions: one in the morning and another in the afternoon. The disaster scenario is announced just prior to each session, which consists of team building, a 90-minute simulation, and debriefing. Approximately 300 Nursing, Medicine, Pharmacy, Emergency Medical Technicians, and Radiology students and over 500 standardized and volunteer patients participated in the Disaster Day event. To improve student learning outcomes, we created 3 competency-based instruments to evaluate collaborative practice in multidimensional fashion during this exercise. A 20-item IPE Team Observation Instrument designed to assess interprofessional team's attainment of Interprofessional Education Collaborative (IPEC) competencies was completed by 20 faculty and staff observing the Disaster Day simulation. One hundred sixty-six standardized patients completed a 10-item Standardized Patient IPE Team Evaluation Instrument developed from the IPEC competencies and adapted items from the 2014 Henry et al. PIVOT Questionnaire. This instrument assessed the standardized or volunteer patient's perception of the team's collaborative performance. A 29-item IPE Team's Perception of Collaborative Care Questionnaire, also created from the IPEC competencies and divided into 5 categories of Values/Ethics, Roles and Responsibilities, Communication, Teamwork, and Self-Evaluation, was completed by 188 students including 99 from Nursing, 43 from Medicine, 6 from Pharmacy, and 40 participants who belonged to more than one component, were students at another institution, or did not indicate their institution. The team instrument was designed to assess each team member's perception of how well the team and him- or herself met the competencies. Five of the items on the team perceptions questionnaire mirrored items on the standardized patient evaluation: demonstrated leadership practices that led to effective teamwork, discussed care and decisions about that care with patient, described roles and responsibilities clearly, worked well together to coordinate care, and good/effective communication. Internal consistency reliability of the IPE Team Observation Instrument was 0.80. In 18 of the 20 items, more than 50% of observers indicated the item was demonstrated. Of those, 6 of the items were observed by 50% to 75% of the observers, and the remaining 12 were observed by more than 80% of the observers. Internal consistency reliability of the IPE Team's Perception of Collaborative Care Instrument was 0.95. The mean response score-1 (strongly disagree) to 4 (strongly agree)-was calculated for each section of the instrument. The overall mean score was 3.57 (SD = .11). Internal consistency reliability of the Standardized Patient IPE Team Evaluation Instrument was 0.87. The overall mean score was 3.28 (SD = .17). The ratings for the 5 items shared by the standardized patient and team perception instruments were compared using independent sample t tests. Statistically significant differences (p < .05) were present in each case, with the students rating themselves higher on average than the standardized patients did (mean differences between 0.2 and 0.6 on a scale of 1-4). Multidimensional, competency-based instruments appear to provide a robust view of IPE teamwork; however, challenges remain. Due to the large scale of the simulation exercise, observation-based assessment did not function as well as self- and standardized patient-based assessment. To promote greater variation in observer assessments during future Disaster Day simulations, we plan to adjust the rating scale from "not observed," "observed," and "not applicable" to a 4-point scale and reexamine interrater reliability.
The development of an instrument for evaluating clinical teachers: involving stakeholders to determine content validity.

PubMed

Stalmeijer, Renée E; Dolmans, Diana H J M; Wolfhagen, Ineke H A P; Muijtjens, Arno M M; Scherpbier, Albert J J A

2008-01-01

Research indicates that the quality of supervision strongly influences the learning of medical students in clinical practice. Clinical teachers need feedback to improve their supervisory skills. The available instruments either lack a clear theoretical framework or are not suitable for providing feedback to individual teachers. We developed an evaluation instrument based on the 'cognitive apprenticeship model'. The aim was to estimate the content validity of the developed instrument. Item relevance was rated on a five-point scale (1 = highly irrelevant, 5 = highly relevant) by three groups of stakeholders in undergraduate clinical teaching: educationalists (N = 12), doctors (N = 16) and students (N = 12). Additionally, stakeholders commented on content, wording and omission of items. The items were generally rated as very relevant (Mean = 4.3, SD = 0.38, response = 95%) and any differences between the stakeholder groups were small. The results led to elimination of 4 items, rewording of 13 items and addition of 1 item. The cognitive apprenticeship model appears to offer a useful framework for the development of an evaluation instrument aimed at providing feedback to individual clinical teachers on the quality of student supervision. Further studies in larger populations will have to establish the instrument's statistical validity and generalizability.
Usefulness of a Clinician Rating Scale in Identifying Preschool Children with ADHD

ERIC Educational Resources Information Center

Gopin, Chaya; Healey, Dione; Castelli, Katia; Marks, David; Halperin, Jeffrey M.

2010-01-01

Objective: To ascertain the psychometric properties and clinical utility of the Behavioral Rating Inventory for Children (BRIC), a novel clinician inventory for preschoolers. Method: Completion of the BRIC for 214 preschoolers follows 2 evaluation sessions, generally separated by less than 2 weeks. Items are submitted to a Principal Components…
Reliability of a rating procedure to monitor industry self-regulation codes governing alcohol advertising content.

PubMed

Babor, Thomas F; Xuan, Ziming; Proctor, Dwayne

2008-03-01

The purposes of this study were to develop reliable procedures to monitor the content of alcohol advertisements broadcast on television and in other media, and to detect violations of the content guidelines of the alcohol industry's self-regulation codes. A set of rating-scale items was developed to measure the content guidelines of the 1997 version of the U.S. Beer Institute Code. Six focus groups were conducted with 60 college students to evaluate the face validity of the items and the feasibility of the procedure. A test-retest reliability study was then conducted with 74 participants, who rated five alcohol advertisements on two occasions separated by 1 week. Average correlations across all advertisements using three reliability statistics (r, rho, and kappa) were almost all statistically significant and the kappas were good for most items, which indicated high test-retest agreement. We also found high interrater reliabilities (intraclass correlations) among raters for item-level and guideline-level violations, indicating that regardless of the specific item, raters were consistent in their general evaluations of the advertisements. Naïve (untrained) raters can provide consistent (reliable) ratings of the main content guidelines proposed in the U.S. Beer Institute Code. The rating procedure may have future applications for monitoring compliance with industry self-regulation codes and for conducting research on the ways in which alcohol advertisements are perceived by young adults and other vulnerable populations.

Cross-cultural adaptation of the Posttraumatic Stress Disorder Checklist 5 (PCL-5) and Life Events Checklist 5 (LEC-5) for the Brazilian context.

PubMed

Lima, Eduardo de Paula; Vasconcelos, Alina Gomide; Berger, William; Kristensen, Christian Haag; Nascimento, Elizabeth do; Figueira, Ivan; Mendlowicz, Mauro Vitor

2016-01-01

To describe the process of cross-cultural adaptation of the Posttraumatic Stress Disorder Checklist 5 (PCL-5) and the Life Events Checklist 5 (LEC-5) for the Brazilian sociolinguistic context. The adaptation process sought to establish conceptual, semantic, and operational equivalence between the original items of the questionnaire and their translated versions, following standardized protocols. Initially, two researchers translated the original version of the scale into Brazilian Portuguese. Next, a native English speaker performed the back-translation. Quantitative and qualitative criteria were used to evaluate the intelligibility of items. Five specialists compared the original and translated versions and assessed the degree of equivalence between them in terms of semantic, idiomatic, cultural and conceptual aspects. The degree of agreement between the specialists was measured using the content validity coefficient (CVC). Finally, 28 volunteers from the target population were interviewed in order to assess their level of comprehension of the items. CVCs for items from both scales were satisfactory for all criteria. The mean comprehension scores were above the cutoff point established. Overall, the results showed that the adapted versions' items had adequate rates of equivalence in terms of concepts and semantics. The translation and adaptation processes were successful for both scales, resulting in versions that are not only equivalent to the originals, but are also intelligible for the population at large.
Construct Validity of the Spanish Versions of the Memorial Symptom Assessment Scale Short Form and Condensed Form: Rasch Analysis of Responses in Oncology Outpatients.

PubMed

Llamas-Ramos, Inés; Llamas-Ramos, Rocío; Buz, José; Cortés-Rodríguez, María; Martín-Nogueras, Ana María

2018-06-01

The Memorial Symptom Assessment Scale (MSAS) is a self-rating instrument for the assessment of symptom distress in cancer patients. The Spanish version of the MSAS has recently been validated. However, we lack evidence of the internal construct validity of the shorter versions (short form [MSAS-SF] and condensed form [CMSAS]). In addition, rigorous testing of these scales with modern psychometric methods is needed. The aim of this study was to evaluate the internal construct validity and reliability of the Spanish versions of the MSAS-SF and CMSAS in oncology outpatients using Rasch analysis. Data from a convenience sample of oncology outpatients receiving chemotherapy (n = 306; mean age 60 years; 63% women) at a university hospital were analyzed. The Rasch unidimensional measurement model was used to examine response category functioning, item hierarchy, targeting, unidimensionality, reliability, and differential item functioning by age, gender, and marital status. The response category structure of the symptom distress items was improved by collapsing two categories. The scales were adequately targeted to the study patients, showed overall Rasch model fit (mean Infit MnSq ranged from 0.98 to 1.05), met criteria for unidimensionality, and the reliability of scores was good (person reliability > 0.80), except for the CMSAS prevalence scale. Only four items showed differential item functioning. The present study demonstrated that the Spanish versions of the MSAS-SF and CMSAS have adequate psychometric properties to evaluate symptom distress in oncology outpatients. Additional studies of the CMSAS are recommended. Copyright © 2018 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
King's Parkinson's disease pain scale, the first scale for pain in PD: An international validation.

PubMed

Chaudhuri, K Ray; Rizos, A; Trenkwalder, C; Rascol, O; Pal, S; Martino, D; Carroll, C; Paviour, D; Falup-Pecurariu, C; Kessel, B; Silverdale, M; Todorova, A; Sauerbier, A; Odin, P; Antonini, A; Martinez-Martin, P

2015-10-01

Pain is a key unmet need and a major aspect of non-motor symptoms of Parkinson's disease (PD). No specific validated scales exist to identify and grade the various types of pain in PD. We report an international, cross-sectional, open, multicenter, one-point-in-time evaluation with retest study of the first PD-specific pain scale, the King's PD Pain Scale. Its seven domains include 14 items, each item scored by severity (0-3) multiplied by frequency (0-4), resulting in a subscore of 0 to 12, with a total possible score range from 0 to 168. One hundred seventy-eight PD patients with otherwise unexplained pain (age [mean ± SD], 64.38 ± 11.38 y [range, 29-85]; 62.92% male; duration of disease, 5.40 ± 4.93 y) and 83 nonspousal non-PD controls, matched by age (64.25 ± 11.10 y) and sex (61.45% males) were studied. No missing data were noted, and floor effect was observed in all domains. The difference between mean and median King's PD Pain Scale total score was less than 10% of the maximum observed value. Skewness was marginally high (1.48 for patients). Factor analysis showed four factors in the King's PD Pain Scale, explaining 57% of the variance (Kaiser-Mayer-Olkin, 0.73; sphericity test). Cronbach's alpha was 0.78, item-total correlation mean value 0.40, and item homogeneity 0.22. Correlation coefficients of the King's PD Pain Scale domains and total score with other pain measures were high. Correlation with the Scale for Outcomes in PD-Motor, Non-Motor Symptoms Scale total score, and quality of life measures was high. The King's PD Pain Scale seems to be a reliable and valid scale for grade rating of various types of pain in PD. © 2015 International Parkinson and Movement Disorder Society.
The Chinese version of the Myocardial Infarction Dimensional Assessment Scale (MIDAS): Mokken scaling

PubMed Central

2012-01-01

Background Hierarchical scales are very useful in clinical practice due to their ability to discriminate precisely between individuals, and the original English version of the Myocardial Infarction Dimensional Assessment Scale has been shown to contain a hierarchy of items. The purpose of this study was to analyse a Mandarin Chinese translation of the Myocardial Infarction Dimensional Assessment Scale for a hierarchy of items according to the criteria of Mokken scaling. Data from 180 Chinese participants who completed the Chinese translation of the Myocardial Infarction Dimensional Assessment Scale were analysed using the Mokken Scaling Procedure and the 'R' statistical programme using the diagnostics available in these programmes. Correlation between Mandarin Chinese items and a Chinese translation of the Short Form (36) Health Survey was also analysed. Findings Fifteen items from the Mandarin Chinese Myocardial Infarction Dimensional Assessment Scale were retained in a strong and reliable Mokken scale; invariant item ordering was not evident and the Mokken scaled items of the Chinese Myocardial Infarction Dimensional Assessment Scale correlated with the Short Form (36) Health Survey. Conclusions Items from the Mandarin Chinese Myocardial Infarction Dimensional Assessment Scale form a Mokken scale and this offers further insight into how the items of the Myocardial Infarction Dimensional Assessment Scale relate to the measurement of health-related quality of life people with a myocardial infarction. PMID:22221696
[Reliability of the Japanese version of the Scale for the Assessment and Rating of Ataxia (SARA)].

PubMed

Sato, Kazunori; Yabe, Ichiro; Soma, Hiroyuki; Yasui, Kenichi; Ito, Mizuki; Shimohata, Takayoshi; Onodera, Osamu; Nakashima, Kenji; Sobue, Gen; Nishizawa, Masatoyo; Sasaki, Hidenao

2009-05-01

The International Cooperative Ataxia Rating Scale (ICARS) is widely used as a scale for the assessment of the severity of cerebellar ataxia. However, this scale comprises several items; thus, making the application of this scale is not sufficiently practical to perform daily assessment of ataxic patients. A new rating scale--Scale for the Assessment and Rating of Ataxia (SARA)--was shown to provide highly reliable assessments; further, the scores on SARA correlated with the ICARS score and the Barthel index. After obtaining the permission, original SARA was translated into Japanese. To examine the reliability and internal consistency of the Japanese version of the SARA for the assessment of cerebellar ataxia in 66 patients with spinocerebellar degeneration. Intraclass coefficients (ICC) were observed to be greater than 0.8 except in the case of the inter-rater "finger chase" and "fast alternating hand movement" tests. The Japanese version of SARA is highly reliable and very useful for the assessment of cerebellar ataxia on a daily basis.
The Conscientious Responders Scale Helps Researchers Verify the Integrity of Personality Questionnaire Data.

PubMed

Marjanovic, Zdravko; Bajkov, Lisa; MacDonald, Jennifer

2018-01-01

The Conscientious Responders Scale is a five-item embeddable validity scale that differentiates between conscientious and indiscriminate responding in personality-questionnaire data (CR & IR). This investigation presents further evidence of its validity and generalizability across two experiments. Study 1 tests its sensitivity to questionnaire length, a known cause of IR, and tries to provoke IR by manipulating psychological reactance. As expected, short questionnaires produced higher Conscientious Responders Scale scores than long questionnaires, and Conscientious Responders Scale scores were unaffected by reactance manipulations. Study 2 tests concerns that the Conscientious Responders Scale's unusual item content could potentially irritate and baffle responders, ironically increasing rates of IR. We administered two nearly identical questionnaires: one with an embedded Conscientious Responders Scale and one without the Conscientious Responders Scale. Psychometric comparisons revealed no differences across questionnaires' means, variances, interitem response consistencies, and Cronbach's alphas. In sum, the Conscientious Responders Scale is highly sensitive to questionnaire length-a known correlate of IR-and can be embedded harmlessly in questionnaires without provoking IR or changing the psychometrics of other measures.
Construction and Validation of a Women's Autonomy Measurement Scale with Reference to Utilization of Maternal Health Care Services in Nepal.

PubMed

Bhandari, T R; Dangal, G; Sarma, P S; Kutty, V R

2014-01-01

Women's autonomy is one of the predictors of maternal health care service utilization. This study aimed to construct and validate a scale for measuring women's autonomy with relevance to developing countries. We conducted a study for construction and validation of a scale in Rupandehi and further validated in Kapilvastu districts of Nepal. Initially, we administered a 24-item preliminary scale and finalized a 23-item scale using psychometric tests. After defining the construct of women's autonomy, we pooled 194 items and selected 24 items to develop a preliminary scale. The scale development process followed different steps i.e. definition of construct, generation of items pool, pretesting, analysis of psychometric test and further validation. The new scale was strongly supported by Cronbach's Alpha value (0.84), test-retest Pearson correlation (0.87), average content validity ratio (0.8) and overall agreement- Kappa value of the items (0.83) whereas all values were found satisfactory. From factor analysis, we selected 23 items for the final scale which show good convergent and discriminant validity. From preliminary draft, we removed one item; the remaining 23 items were loaded in five factors. All five factors had single loading items by suppressing absolute coefficient value less than 0.45 and average coefficient was more than 0.60 of each factor. Similarly, the factors and loaded items had good convergent and discriminant validity which further showed strong measurement capacity of the scale. The new scale is a reliable tool for assessing women's autonomy in developing countries. We recommend for further use and validation of the scale for ensuring the measurement capacity.
Psychometric evaluation of the pediatric and parent-proxy Patient-Reported Outcomes Measurement Information System and the Neurology and Traumatic Brain Injury Quality of Life measurement item banks in pediatric traumatic brain injury.

PubMed

Bertisch, Hilary; Rivara, Frederick P; Kisala, Pamela A; Wang, Jin; Yeates, Keith Owen; Durbin, Dennis; Zonfrillo, Mark R; Bell, Michael J; Temkin, Nancy; Tulsky, David S

2017-07-01

The primary objective is to provide evidence of convergent and discriminant validity for the pediatric and parent-proxy versions of the Patient-Reported Outcomes Measurement Information System (PROMIS) Anxiety, Depression, Anger, Peer Relations, Mobility, Pain Interference, and Fatigue item banks, the Neurology Quality of Life measurement system (Neuro-QOL) Cognition-General Concerns and Stigma item banks, and the Traumatic Brain Injury Quality of Life (TBI-QOL) Executive Function and Headache item banks in a pediatric traumatic brain injury (TBI) sample. Participants were 134 parent-child (ages 8-18 years) days. Children all sustained TBI and the dyads completed outcome ratings 6 months after injury at one of six medical centers across the United States. Ratings included PROMIS, Neuro-QOL, and TBI-QOL item banks, as well as the Pediatric Quality of Life inventory (PedsQL), the Health Behavior Inventory (HBI), and the Strengths and Difficulties Questionnaire (SDQ) as legacy criterion measures against which these item banks were validated. The PROMIS, Neuro-QOL, and TBI-QOL item banks demonstrated good convergent validity, as evidenced by moderate to strong correlations with comparable scales on the legacy measures. PROMIS, Neuro-QOL, and TBI-QOL item banks showed weaker correlations with ratings of unrelated constructs on legacy measures, providing evidence of discriminant validity. Our results indicate that the constructs measured by the PROMIS, Neuro-QOL, and TBI-QOL item banks are valid in our pediatric TBI sample and that it is appropriate to use these standardized scores for our primary study analyses.
[Development of a cell phone addiction scale for korean adolescents].

PubMed

Koo, Hyun Young

2009-12-01

This study was done to develop a cell phone addiction scale for Korean adolescents. The process included construction of a conceptual framework, generation of initial items, verification of content validity, selection of secondary items, preliminary study, and extraction of final items. The participants were 577 adolescents in two middle schools and three high schools. Item analysis, factor analysis, criterion related validity, and internal consistency were used to analyze the data. Twenty items were selected for the final scale, and categorized into 3 factors explaining 55.45% of total variance. The factors were labeled as withdrawal/tolerance (7 items), life dysfunction (6 items), and compulsion/persistence (7 items). The scores for the scale were significantly correlated with self-control, impulsiveness, and cell phone use. Cronbach's alpha coefficient for the 20 items was .92. Scale scores identified students as cell phone addicted, heavy users, or average users. The above findings indicate that the cell phone addiction scale has good validity and reliability when used with Korean adolescents.
Rasch analysis of the Chedoke-McMaster Attitudes towards Children with Handicaps scale.

PubMed

Armstrong, Megan; Morris, Christopher; Tarrant, Mark; Abraham, Charles; Horton, Mike C

2017-02-01

Aim To assess whether the Chedoke-McMaster Attitudes towards Children with Handicaps (CATCH) 36-item total scale and subscales fit the unidimensional Rasch model. Method The CATCH was administered to 1881 children, aged 7-16 years in a cross-sectional survey. Data were used from a random sample of 416 for the initial Rasch analysis. The analysis was performed on the 36-item scale and then separately for each subscale. The analysis explored fit to the Rasch model in terms of overall scale fit, individual item fit, item response categories, and unidimensionality. Item bias for gender and school level was also assessed. Revised scales were then tested on an independent second random sample of 415 children. Results Analyses indicated that the 36-item overall scale was not unidimensional and did not fit the Rasch model. Two scales of affective attitudes and behavioural intention were retained after four items were removed from each due to misfit to the Rasch model. Additionally, the scaling was improved when the two most negative response categories were aggregated. There was no item bias by gender or school level on the revised scales. Items assessing cognitive attitudes did not fit the Rasch model and had low internal consistency as a scale. Conclusion Affective attitudes and behavioural intention CATCH sub-scales should be treated separately. Caution should be exercised when using the cognitive subscale. Implications for Rehabilitation The 36-item Chedoke-McMaster Attitudes towards Children with Handicaps (CATCH) scale as a whole did not fit the Rasch model; thus indicating a multi-dimensional scale. Researchers should use two revised eight-item subscales of affective attitudes and behavioural intentions when exploring interventions aiming to improve children's attitudes towards disabled people or factors associated with those attitudes. Researchers should use the cognitive subscale with caution, as it did not create a unidimensional and internally consistent scale. Therefore, conclusions drawn from this scale may not accurately reflect children's attitudes.
Validity and reliability of naturalistic driving scene categorization Judgments from crowdsourcing.

PubMed

Cabrall, Christopher D D; Lu, Zhenji; Kyriakidis, Miltos; Manca, Laura; Dijksterhuis, Chris; Happee, Riender; de Winter, Joost

2018-05-01

A common challenge with processing naturalistic driving data is that humans may need to categorize great volumes of recorded visual information. By means of the online platform CrowdFlower, we investigated the potential of crowdsourcing to categorize driving scene features (i.e., presence of other road users, straight road segments, etc.) at greater scale than a single person or a small team of researchers would be capable of. In total, 200 workers from 46 different countries participated in 1.5days. Validity and reliability were examined, both with and without embedding researcher generated control questions via the CrowdFlower mechanism known as Gold Test Questions (GTQs). By employing GTQs, we found significantly more valid (accurate) and reliable (consistent) identification of driving scene items from external workers. Specifically, at a small scale CrowdFlower Job of 48 three-second video segments, an accuracy (i.e., relative to the ratings of a confederate researcher) of 91% on items was found with GTQs compared to 78% without. A difference in bias was found, where without GTQs, external workers returned more false positives than with GTQs. At a larger scale CrowdFlower Job making exclusive use of GTQs, 12,862 three-second video segments were released for annotation. Infeasible (and self-defeating) to check the accuracy of each at this scale, a random subset of 1012 categorizations was validated and returned similar levels of accuracy (95%). In the small scale Job, where full video segments were repeated in triplicate, the percentage of unanimous agreement on the items was found significantly more consistent when using GTQs (90%) than without them (65%). Additionally, in the larger scale Job (where a single second of a video segment was overlapped by ratings of three sequentially neighboring segments), a mean unanimity of 94% was obtained with validated-as-correct ratings and 91% with non-validated ratings. Because the video segments overlapped in full for the small scale Job, and in part for the larger scale Job, it should be noted that such reliability reported here may not be directly comparable. Nonetheless, such results are both indicative of high levels of obtained rating reliability. Overall, our results provide compelling evidence for CrowdFlower, via use of GTQs, being able to yield more accurate and consistent crowdsourced categorizations of naturalistic driving scene contents than when used without such a control mechanism. Such annotations in such short periods of time present a potentially powerful resource in driving research and driving automation development. Copyright © 2017 Elsevier Ltd. All rights reserved.
Item response modeling: a psychometric assessment of the children's fruit, vegetable, water, and physical activity self-efficacy scales among Chinese children.

PubMed

Wang, Jing-Jing; Chen, Tzu-An; Baranowski, Tom; Lau, Patrick W C

2017-09-16

This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups using item response modeling (IRM) and differential item functioning (DIF). Four self-efficacy scales were administrated to 763 Hong Kong Chinese children (55.2% boys) aged 8-13 years. Classical test theory (CTT) was used to examine the reliability and factorial validity of scales. IRM was conducted and DIF analyses were performed to assess the characteristics of item parameter estimates on the basis of children's sex, age and body weight status. All self-efficacy scales demonstrated adequate to excellent internal consistency reliability (Cronbach's α: 0.79-0.91). One FSE misfit item and one PASE misfit item were detected. Small DIF were found for all the scale items across children's age groups. Items with medium to large DIF were detected in different sex and body weight status groups, which will require modification. A Wright map revealed that items covered the range of the distribution of participants' self-efficacy for each scale except VSE. Several self-efficacy scales' items functioned differently by children's sex and body weight status. Additional research is required to modify the four self-efficacy scales to minimize these moderating influences for application.
A systematic review of clinician-rated instruments to assess adults' levels of functioning in specialised public sector mental health services.

PubMed

Burgess, Philip M; Harris, Meredith G; Coombs, Tim; Pirkis, Jane E

2017-04-01

Functioning is one of the key domains emphasised in the routine assessment of outcomes that has been occurring in specialised public sector mental health services across Australia since 2002, via the National Outcomes and Casemix Collection. For adult consumers (aged 18-64), the 16-item Life Skills Profile (LSP-16) has been the instrument of choice to measure functioning. However, review of the National Outcomes and Casemix Collection protocol has highlighted some limitations to the current approach to measuring functioning. A systematic review was conducted to identify, against a set of pre-determined criteria, the most suitable existing clinician-rated instruments for the routine measurement of functioning for adult consumers. We used two existing reviews of functioning measures as our starting point and conducted a search of MEDLINE and PsycINFO to identify articles relating to additional clinician-rated instruments. We evaluated identified instruments using a hierarchical, criterion-based approach. The criteria were as follows: (1) is brief (<50 items) and simple to score, (2) is not made redundant by more recent instruments, (3) relevant version has been scientifically scrutinised, (4) considers functioning in a contemporary way and (5) demonstrates sound psychometric properties. We identified 20 relevant instruments, 5 of which met our criteria: the LSP-16, the Health of the Nation Outcome Scales, the Illness Management and Recovery Scale-Clinician Version, the Multnomah Community Ability Scale and the Personal and Social Performance Scale. Further work is required to determine which, if any, of these instruments satisfy further criteria relating to their appropriateness for assessing functioning within relevant service contexts, acceptability to clinicians and consumers, and feasibility in routine practice. This should involve seeking stakeholders' opinions (e.g. about the specific domains of functioning covered by each instrument and the language used in individual items) and testing completion rates in busy service settings.
Further psychometric evaluation and revision of the Mayo-Portland Adaptability Inventory in a national sample.

PubMed

Malec, James F; Kragness, Miriam; Evans, Randall W; Finlay, Karen L; Kent, Ann; Lezak, Muriel D

2003-01-01

To evaluate the internal consistency of the Mayo-Portland Adaptability Inventory (MPAI), further refine the instrument, and provide reference data based on a large, geographically diverse sample of persons with acquired brain injury (ABI). 386 persons, most with moderate to severe ABI. Outpatient, community-based, and residential rehabilitation facilities for persons with ABI located in the United States: West, Midwest, and Southeast. Rasch, item cluster, principal components, and traditional psychometric analyses for internal consistency of MPAI data and subscales. With rescoring of rating scales for 4 items, a 29-item version of the MPAI showed satisfactory internal consistency by Rasch (Person Reliability=.88; Item Reliability=.99) and traditional psychometric indicators (Cronbach's alpha=.89). Three rationally derived subscales for Ability, Activity, and Participation demonstrated psychometric properties that were equivalent to subscales derived empirically through item cluster and factor analyses. For the 3 subscales, Person Reliability ranged from.78 to.79; Item Reliability, from.98 to.99; and Cronbach's alpha, from.76 to.83. Subscales correlated moderately (Pearson r =.49-.65) with each other and strongly with the overall scale (Pearson r=.82-.86). Outcome after ABI is represented by the unitary dimension described by the MPAI. MPAI subscales further define regions of this dimension that may be useful for evaluation of clinical cases and program evaluation.
Marital Happiness and Sleep Disturbances in a Multi-Ethnic Sample of Middle-Aged Women

PubMed Central

Troxel, Wendy M.; Buysse, Daniel J.; Hall, Martica; Matthews, Karen A.

2009-01-01

Previous research suggests that divorced individuals, particularly women, have higher rates of sleep disturbances as compared to married individuals. Among the married, however, little is known about the association between relationship quality and sleep. The present study examined the association between marital happiness and self-reported sleep disturbances in a sample of midlife women drawn from the Study of Women’s Health Across the Nation (SWAN), a multi-site, multi-ethnic, community-based study (N=2,148). Marital happiness was measured using a single-item from the Dyadic Adjustment Scale and sleep disturbance was assessed using 4-items from the Women’s Health Initiative Insomnia Rating Scale (WHIIRS). After controlling for relevant covariates, maritally happy women reported fewer sleep disturbances, with the association evident among Caucasian women and to a lesser extent among African American women. PMID:19116797
Using the Patient Health Questionnaire-9 to measure depression among racially and ethnically diverse primary care patients.

PubMed

Huang, Frederick Y; Chung, Henry; Kroenke, Kurt; Delucchi, Kevin L; Spitzer, Robert L

2006-06-01

The Patient Health Questionnaire depression scale (PHQ-9) is a well-validated, Diagnostic and Statistical Manual of Mental Disorders- Fourth Edition (DSM-IV) criterion-based measure for diagnosing depression, assessing severity and monitoring treatment response. The performance of most depression scales including the PHQ-9, however, has not been rigorously evaluated in different racial/ethnic populations. Therefore, we compared the factor structure of the PHQ-9 between different racial/ethnic groups as well as the rates of endorsement and differential item functioning (DIF) of the 9 items of the PHQ-9. The presence of DIF would indicate that responses to an individual item differ significantly between groups, controlling for the level of depression. A combined dataset from 2 separate studies of 5,053 primary care patients including non-Hispanic white (n=2,520), African American (n=598), Chinese American (n=941), and Latino (n=974) patients was used for our analysis. Exploratory principal components factor analysis was used to derive the factor structure of the PHQ-9 in each of the 4 racial/ethnic groups. A generalized Mantel-Haenszel statistic was used to test for DIF. One main factor that included all PHQ-9 items was found in each racial/ethnic group with alpha coefficients ranging from 0.79 to 0.89. Although endorsement rates of individual items were generally similar among the 4 groups, evidence of DIF was found for some items. Our analyses indicate that in African American, Chinese American, Latino, and non-Hispanic white patient groups the PHQ-9 measures a common concept of depression and can be effective for the detection and monitoring of depression in these diverse populations.
Differences in psychiatric symptoms among Asian patients with depression: a multi-country cross-sectional study.

PubMed

Sulaiman, Ahmad H; Bautista, Dianne; Liu, Chia-Yih; Udomratn, Pichet; Bae, Jae Nam; Fang, Yiru; Chua, Hong C; Liu, Shen-Ing; George, Tom; Chan, Edwin; Tian-mei, Si; Hong, Jin Pyo; Srisurapanont, Manit; Rush, A John

2014-04-01

The aim of this study was to compare the symptomatic and clinical features of depression among five groups of patients with major depressive disorder (MDD) living in China, Korea, Malaysia/Singapore, Taiwan, and Thailand. Consecutive consenting adults (aged 18-65) who met DSM-IV criteria for non-psychotic MDD – based on the Mini International Neuropsychiatric Interview – and who were free of psychotropic medication were evaluated in a cross-sectional study. Depressive symptoms were evaluated using the 10-item Montgomery–Asberg Depression Rating Scale (MADRS) and the 13-item depression subscale of the Symptoms Checklist 90-Revised (SCL-90-R). In addition, the 10-item SCL-90-R Anxiety Subscale was completed. ancova were conducted, adjusting for confounders: age, completion of secondary education, marital status, work status, religion, index episode duration, and depressive severity. For the magnitude of differences, a threshold of 0.10 was taken as the minimum effect size representing clinical significance, and an effect size of 0.25 was considered moderate. Four MADRS symptoms differentiated these five groups, the most prominent being ‘lassitude’ and ‘inner tension’. Nine SCL-90-R depression items also differentiated the groups, as did eight SCL-90-R Anxiety Subscale items. The MADRS lassitude item had the largest effect size (0.131). The rest of those statistically significant differences did not exceed 0.10. MDD is more similar than different among outpatients in these diverse Asian countries. The between-country differences, while present and not due to chance, are small enough to enable the use of common clinician and self-report rating scales in studies involving Asians with MDD from various ethnic backgrounds.
A Mixed-methods Study to Assess Interrater Reliability and Nurse Perception of the Braden Scale in a Tertiary Acute Care Setting.

PubMed

Ho, Chester H; Cheung, Amanda; Southern, Danielle; Ocampo, Wrechelle; Kaufman, Jaime; Hogan, David B; Baylis, Barry; Conly, John M; Stelfox, Henry T; Ghali, William A

2016-12-01

Research regarding the reliability of the Braden Scale and nurses' perspectives on the instrument for predicting pressure ulcer (PU) risk in acute care settings is limited. A mixed-methods study was conducted in a tertiary acute care facility to examine interrater reliability (IRR) of the Braden Scale and its subscales, and a qualitative survey using semi-structured interviews was conducted among nurses caring for patients in acute care units to gain nurse perspective regarding scale usability. Data were extracted from a previous retrospective, randomized, controlled trial involving adult patients with compromised mobility receiving care in a tertiary acute care hospital in Canada. One-way, intraclass correlation coefficients (ICCs) were calculated on item and total scores, and kappa statistics were used to determine reliability of categorizing patients on their risk. Interview results were categorized by common themes. Reliability was assessed on 64 patients, where nurses and research staff independently assessed enrolled participants at baseline and after 72 hours using the Braden Scale as it appeared on an electronic medical record. IRR for the total score was high (ICC = 0.807). The friction and shear item had the lowest reliability (ICC = 0.266). Reliability of categorizing patients' level of risk had moderate agreement (κ = 0.408). Three (3) major and 12 subthemes emerged from the 14 nurse interviews; nurses were aware of the scale's purpose but were uncertain of its effectiveness, some items were difficult to rate, and questions were raised as to whether using the scale enhanced patient care. Aspects identified by nurses to enhance usability included: 1) changes to the electronic version (incorporating the scale into daily assessment documents with readily available item descriptions), 2) additional training, and 3) easily available resource material to improve reliability and usability of scale. These findings need to be considered when using the Braden Scale in clinical practice. Further study of the value of the total Braden Scale and its subscales is warranted.
The Responsive Environmental Assessment for Classroom Teaching (REACT): the dimensionality of student perceptions of the instructional environment.

PubMed

Nelson, Peter M; Demers, Joseph A; Christ, Theodore J

2014-06-01

This study details the initial development of the Responsive Environmental Assessment for Classroom Teachers (REACT). REACT was developed as a questionnaire to evaluate student perceptions of the classroom teaching environment. Researchers engaged in an iterative process to develop, field test, and analyze student responses on 100 rating-scale items. Participants included 1,465 middle school students across 48 classrooms in the Midwest. Item analysis, including exploratory and confirmatory factor analysis, was used to refine a 27-item scale with a second-order factor structure. Results support the interpretation of a single general dimension of the Classroom Teaching Environment with 6 subscale dimensions: Positive Reinforcement, Instructional Presentation, Goal Setting, Differentiated Instruction, Formative Feedback, and Instructional Enjoyment. Applications of REACT in research and practice are discussed along with implications for future research and the development of classroom environment measures. PsycINFO Database Record (c) 2014 APA, all rights reserved.
Measuring hospital care from the patients' perspective: an overview of the CAHPS Hospital Survey development process.

PubMed

Goldstein, Elizabeth; Farquhar, Marybeth; Crofton, Christine; Darby, Charles; Garfinkel, Steven

2005-12-01

To describe the developmental process for the CAHPS Hospital Survey. A pilot was conducted in three states with 19,720 hospital discharges. A rigorous, multi-step process was used to develop the CAHPS Hospital Survey. It included a public call for measures, multiple Federal Register notices soliciting public input, a review of the relevant literature, meetings with hospitals, consumers and survey vendors, cognitive interviews with consumer, a large-scale pilot test in three states and consumer testing and numerous small-scale field tests. The current version of the CAHPS Hospital Survey has survey items in seven domains, two overall ratings of the hospital and five items used for adjusting for the mix of patients across hospitals and for analytical purposes. The CAHPS Hospital Survey is a core set of questions that can be administered as a stand-alone questionnaire or combined with a broader set of hospital specific items.

Examining Player Anger in World of Warcraft

NASA Astrophysics Data System (ADS)

Barnett, Jane; Coulson, Mark; Foreman, Nigel

This questionnaire study of the sources of anger in World of Warcraft applies classical quantitative measurement scale construction to a new problem, generating a host of questionnaire items that could find use in future studies, and identifying four major categories of events that cause negative effect among players. First, 33 players provided examples of in-game scenarios that had made them angry, and their responses were culled to create a 93-item battery rated by hundreds of player respondents in terms of anger intensity and anger frequency. An iterative process of factor analysis and scale reliability assessment led to a 28-item instrument measuring four anger-provoking factors: Raids/Instances, Griefers, Perceived Time Wasting, and Anti-social Players. These anger-causing scenarios were then illustrated by concrete examples from player and researcher experiences in World of Warcraft. One striking finding is that players become angry at other players' negative behavior, regardless of whether that behavior was intended to harm.
Assessing the Universal Structure of Personality in Early Adolescence: The NEO-PI-R and NEO-PI-3 in 24 Cultures

PubMed Central

De Fruyt, Filip; De Bolle, Marleen; McCrae, Robert R.; Terracciano, Antonio; Costa, Paul T.

2010-01-01

The structure and psychometric characteristics of the NEO-PI-3, a more readable version of the NEO-PI-R, are examined and compared with NEO-PI-R characteristics using data from college student observer ratings of 5,109 adolescents aged 12 to 17 from 24 cultures. Replacement items in the PI-3 showed on average stronger item/total correlations and slightly improved facet reliabilities compared with the NEO-PI-R in both English- and non-English-speaking samples. NEO-PI-3 replacement items did not substantially affect scale means compared with the original scales. Analyses across and within cultures confirmed the intended factor structure of both versions when used to describe young adolescents. We discuss implications of these cross-cultural findings for the advancement of studies in adolescence and personality development across the lifespan. PMID:19419953
Assessing the universal structure of personality in early adolescence: The NEO-PI-R and NEO-PI-3 in 24 cultures.

PubMed

De Fruyt, Filip; De Bolle, Marleen; McCrae, Robert R; Terracciano, Antonio; Costa, Paul T

2009-09-01

The structure and psychometric characteristics of the NEO Personality Inventory-3 (NEO-PI-3), a more readable version of the Revised NEO Personality Inventory (NEO-PI-R), are examined and compared with NEO-PI-R characteristics using data from college student observer ratings of 5,109 adolescents aged 12 to 17 years from 24 cultures. Replacement items in the PI-3 showed on average stronger item-total correlations and slightly improved facet reliabilities compared with the NEO-PI-R in both English- and non-English-speaking samples. NEO-PI-3 replacement items did not substantially affect scale means compared with the original scales. Analyses across and within cultures confirmed the intended factor structure of both versions when used to describe young adolescents. The authors discuss implications of these cross-cultural findings for the advancement of studies in adolescence and personality development across the lifespan.
Applying the revised Chinese Job Content Questionnaire to assess psychosocial work conditions among Taiwan's hospital workers

PubMed Central

2011-01-01

Background For hospital accreditation and health promotion reasons, we examined whether the 22-item Job Content Questionnaire (JCQ) could be applied to evaluate job strain of individual hospital employees and to determine the number of factors extracted from JCQ. Additionally, we developed an Excel module of self-evaluation diagnostic system for consultation with experts. Methods To develop an Excel-based self-evaluation diagnostic system for consultation to experts to make job strain assessment easier and quicker than ever, Rasch rating scale model was used to analyze data from 1,644 hospital employees who enrolled in 2008 for a job strain survey. We determined whether the 22-item Job Content Questionnaire (JCQ) could evaluate job strain of individual employees in work sites. The respective item responding to specific groups' occupational hazards causing job stress was investigated by using skewness coefficient with its 95% CI through item-by-item analyses. Results Each of those 22 items on the questionnaire was examined to have five factors. The prevalence rate of Chinese hospital workers with high job strain was 16.5%. Conclusions Graphical representations of four quadrants, item-by-item bar chart plots and skewness 95% CI comparison generated in Excel can help employers and consultants of an organization focusing on a small number of key areas of concern for each worker in job strain. PMID:21682912
Applying the revised Chinese Job Content Questionnaire to assess psychosocial work conditions among Taiwan's hospital workers.

PubMed

Chien, Tsair-Wei; Lai, Wen-Pin; Wang, Hsien-Yi; Hsu, Sen-Yen; Castillo, Roberto Vasquez; Guo, How-Ran; Chen, Shih-Chung; Su, Shih-Bin

2011-06-18

For hospital accreditation and health promotion reasons, we examined whether the 22-item Job Content Questionnaire (JCQ) could be applied to evaluate job strain of individual hospital employees and to determine the number of factors extracted from JCQ. Additionally, we developed an Excel module of self-evaluation diagnostic system for consultation with experts. To develop an Excel-based self-evaluation diagnostic system for consultation to experts to make job strain assessment easier and quicker than ever, Rasch rating scale model was used to analyze data from 1,644 hospital employees who enrolled in 2008 for a job strain survey. We determined whether the 22-item Job Content Questionnaire (JCQ) could evaluate job strain of individual employees in work sites. The respective item responding to specific groups' occupational hazards causing job stress was investigated by using skewness coefficient with its 95% CI through item-by-item analyses. Each of those 22 items on the questionnaire was examined to have five factors. The prevalence rate of Chinese hospital workers with high job strain was 16.5%. Graphical representations of four quadrants, item-by-item bar chart plots and skewness 95% CI comparison generated in Excel can help employers and consultants of an organization focusing on a small number of key areas of concern for each worker in job strain.
Development and Validation of Triarchic Psychopathy Scales from the Multidimensional Personality Questionnaire

PubMed Central

Brislin, Sarah J.; Drislane, Laura E.; Smith, Shannon Toney; Edens, John F.; Patrick, Christopher J.

2015-01-01

Psychopathy is conceptualized by the triarchic model as encompassing three distinct phenotypic constructs: boldness, meanness, and disinhibition. In the current study, the Multidimensional Personality Questionnaire (MPQ), a normal-range personality measure, was evaluated for representation of these three constructs. Consensus ratings were used to identify MPQ items most related to each triarchic (Tri) construct. Scale measures were developed from items indicative of each construct, and scores for these scales were evaluated for convergent and discriminant validity in community (N = 176) and incarcerated samples (N = 240). A cross the two samples, MPQ-Tri scale scores demonstrated good internal consistencies and relationships with criterion measures of various types consistent with predictions based on the triarchic model. Findings are discussed in terms of their implications for further investigation of the triarchic model constructs in preexisting datasets that include the MPQ, in particular longitudinal and genetically informative datasets. PMID:25642934
Development and validation of the Alcohol Myopia Scale.

PubMed

Lac, Andrew; Berger, Dale E

2013-09-01

Alcohol myopia theory conceptualizes the ability of alcohol to narrow attention and how this demand on mental resources produces the impairments of self-inflation, relief, and excess. The current research was designed to develop and validate a scale based on this framework. People who were alcohol users rated items representing myopic experiences arising from drinking episodes in the past month. In Study 1 (N = 260), the preliminary 3-factor structure was supported by exploratory factor analysis. In Study 2 (N = 289), the 3-factor structure was substantiated with confirmatory factor analysis, and it was superior in fit to an empirically indefensible 1-factor structure. The final 14-item scale was evaluated with internal consistency reliability, discriminant validity, convergent validity, criterion validity, and incremental validity. The alcohol myopia scale (AMS) illuminates conceptual underpinnings of this theory and yields insights for understanding the tunnel vision that arises from intoxication.
A self-rating scale to measure tridoṣas in children

PubMed Central

Suchitra, S.P.; Nagendra, H.R.

2013-01-01

Background: Self – rating inventories to assess the Prakṛti (constitution) and personality have been developed and validated for adults. To analyze the effect of personality development programs on Prakṛti of the children, standardized scale is not available. Hence, present study was carried out to develop and standardize Caraka Child Personality inventory (CCPI). Materials and Methods: The 77- item CCPI scale was developed on the basis of translation of Sanskrit verses describing vātaja (a), pittaja (b) and kaphaja prakṛti (c) characteristics described in Ayurveda texts and by taking the opinions of 5 Ayurveda experts and psychologists. The scale was administered on children of the age group 8-12 years in New Generation National public school, Bangalore. Results: This inventory was named CCPI and showed excellent internal consistency. The Cronbach's alpha for A, B and C scales were 0.54, 0.64 and 0.64 respectively. The Split - Half reliability scores for A, B and C subscales were 0.64. 0.60 and 0.66 respectively. Factor validity coefficient Scores on each item was above 0.4. Scores on vātaja, pittaja and kaphaja scales were inversely correlated. Test-retest reliability scores for A,B and C scales were 0.87,0.88 and 0.89 respectively. The result of CCPI was compared with a parent rating scale Ayurveda Child Personality Inventory (ACPI). Subscales of CCPI correlated significantly highly (above 0.80) with subscales of ACPI which was done for the purpose of cross-validation with respect to ACPI. Conclusions: The prakṛti of the children can be measured consistently by this scale. Correlations with ACPI pointed toward concurrent validity. PMID:25284940
Self-Stigma of Mental Illness Scale – Short Form: Reliability and Validity

PubMed Central

Corrigan, Patrick W.; Michaels, Patrick J.; Vega, Eduardo; Gause, Michael; Watson, Amy C.; Rüsch, Nicolas

2012-01-01

The internalization of public stigma by persons with serious mental illnesses may lead to self-stigma, which harms self-esteem, self-efficacy, and empowerment. Previous research has evaluated a hierarchical model that distinguishes among stereotype awareness, agreement, application to self, and harm to self with the 40-item Self-Stigma of Mental Illness Scale (SSMIS). This study addressed SSMIS critiques (too long, contains offensive items that discourages test completion) by strategically omitting half of the original scale’s items. Here we report reliability and validity of the 20-item short form (SSMIS-SF) based on data from three previous studies. Retained items were rated less offensive by a sample of consumers. Results indicated adequate internal consistencies for each subscale. Repeated measures ANOVAs showed subscale means progressively diminished from awareness to harm. In support of its validity, the harm subscale was found to be inversely and significantly related to self-esteem, self-efficacy, empowerment, and hope. After controlling for level of depression, these relationships remained significant with the exception of the relation between empowerment and harm SSMIS-SF subscale. Future research with the SSMIS-SF should evaluate its sensitivity to change and its stability through test-rest reliability. PMID:22578819
Assessing social isolation in motor neurone disease: a Rasch analysis of the MND Social Withdrawal Scale.

PubMed

Gibbons, Chris J; Thornton, Everard W; Ealing, John; Shaw, Pamela J; Talbot, Kevin; Tennant, Alan; Young, Carolyn A

2013-11-15

Social withdrawal is described as the condition in which an individual experiences a desire to make social contact, but is unable to satisfy that desire. It is an important issue for patients with motor neurone disease who are likely to experience severe physical impairment. This study aims to reassess the psychometric and scaling properties of the MND Social Withdrawal Scale (MND-SWS) domains and examine the feasibility of a summary scale, by applying scale data to the Rasch model. The MND Social Withdrawal Scale was administered to 298 patients with a diagnosis of MND, alongside the Hospital Anxiety and Depression Scale. The factor structure of the MND Social Withdrawal Scale was assessed using confirmatory factor analysis. Model fit, category threshold analysis, differential item functioning (DIF), dimensionality and local dependency were evaluated. Factor analysis confirmed the suitability of the four-factor solution suggested by the original authors. Mokken scale analysis suggested the removal of item five. Rasch analysis removed a further three items; from the Community (one item) and Emotional (two items) withdrawal subscales. Following item reduction, each scale exhibited excellent fit to the Rasch model. A 14-item Summary scale was shown to fit the Rasch model after subtesting the items into three subtests corresponding to the Community, Family and Emotional subscales, indicating that items from these three subscales could be summed together to create a total measure for social withdrawal. Removal of four items from the Social Withdrawal Scale led to a four factor solution with a 14-item hierarchical Summary scale that were all unidimensional, free for DIF and well fitted to the Rasch model. The scale is reliable and allows clinicians and researchers to measure social withdrawal in MND along a unidimensional construct. © 2013. Published by Elsevier B.V. All rights reserved.
The cross-cultural adaptation of the DASH questionnaire in Thai (DASH-TH).

PubMed

Tongprasert, Siam; Rapipong, Jeeranan; Buntragulpoontawee, Montana

2014-01-01

Clinical measurement. Currently there are no self-report questionnaires in Thai to evaluate disability levels in patients suffering from upper extremity musculoskeletal disorders. To translate and cross-cultural adaptation the disabilities of the arm, shoulder and hand (DASH) questionnaire to Thai version and to evaluate content validity, construct validity and internal consistency of the questionnaire. The DASH-TH was produced by following cross-cultural adaptation guidelines stated by the Institute for Work and Health (IWH). Forty Thai patients with arm, shoulder or hand problems participated in field testing of the questionnaire. Content validity was determined by obtaining the item-objective congruence (IOC) value for each questionnaire item. Correlation between the DASH-TH score and numeric rating scale was used to assess construct validity. Internal consistency of DASH-TH was measured using Cronbach's alpha coefficient. Forty patients (14 males, 26 females) with arm, shoulder or hand problems enrolled in the present study. The average age of patients was 44.8 years. The index of item-objective congruence (IOC) of each item ranged from 0.7 to 1.0. The Cronbach's alpha coefficient of the questionnaire was 0.938. There was no correlation between DASH-TH score and numeric rating scale. The DASH-TH has high content validity and internal consistency. N/A. Copyright © 2014 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Reliability of scores between stroke patients and significant others on the Reintegration to Normal Living (RNL) Index.

PubMed

Tooth, Leigh R; McKenna, Kryss T; Smith, Melinda; O'Rourke, Peter K

2003-05-06

This study measured reliability between stroke patients' and significant others' scores on items on the Reintegration to Normal Living (RNL) Index and whether there were any scoring biases. The 11-item RNL Index was administered to 57 pairs of patients and significants six months after stroke rehabilitation. The index was scored using a 10-point visual analogue scale. Patient and significant other demographic information and data on patients' clinical, functional and cognitive status were collected. Reliability was measured using the intra-class correlation coefficient (ICC) and percent agreement. Overall poor reliability was found for the RNL Index total score (ICC=.36, 95% CI .07 to .59) and the daily functioning subscale (ICC=.24, 95% Cl -.003 to .46) and moderate reliability was found for the perception of self subscale (ICC= .55, 95% Cl .28 to .73). There was a moderate bias for patients to rate themselves as achieving better reintegration than was indicated by significant others, although no demographic or clinical factors were associated with this bias. Exact match agreement was best for the subjective items and worse for items reflecting mobility around the community and participation in a work activity. Caution is needed when interpreting patient information reported by significant others on the RNL Index. The use of a shorter scale to rate the RNL Index requires investigation.
Rater Evaluations for Psychiatric Instruments and Cultural Differences: The PANSS in China and United States

PubMed Central

Aggarwal, Neil Krishan; Zhang, Xiang Yang; Stefanovics, Elina; Chen, Da Chun; Xiu, Mei Hong; Xu, Ke; Rosenheck, Robert A.

2013-01-01

This article compares Positive and Negative Syndrome Scale (PANSS) data from Chinese and American inpatients with chronic schizophrenia to show how differences in item ratings may reflect cultural attitudes of raters. The Chinese sample (N=504) came from Beijing Huilongguan Hospital. The American sample came from 268 PANSS assessments of CATIE subjects hospitalized for 15 days or more to optimize equivalence of the samples. Controlling for age and gender, the Chinese sample scored significantly lower for total score by 25% (p<.0001), for the positive sub-scale by 35% (p<.0001), and on the general sub-scale by 32% (p<.0001), but not significantly different on the negative sub-scale score (+0.26%, p=0.76). However, the Chinese sample scored 26% higher on the item on poor rapport (p<.0001), 10.2% higher on passive social withdrawal (p=.003), and most notably 46% higher on the item on lack of judgment and insight (p<.0001). These results remain broadly consistent across gender sub-group analyses. Differences seem to be best explained by both cultural differences in patient clinical presentations as well as varying American and Chinese cultural values affecting rater judgment. PMID:22922237
Work environment impact scale: testing the psychometric properties of the Swedish version.

PubMed

Ekbladh, Elin; Fan, Chia-Wei; Sandqvist, Jan; Hemmingsson, Helena; Taylor, Renée

2014-01-01

The Work Environment Impact Scale (WEIS) is an assessment that focuses on the fit between a person and his or her work environment. It is based on Kielhofner's Model of Human Occupation and designed to gather information on how clients experience their work environment. The aim of this study was to examine the psychometric properties of the Swedish version of the WEIS assessment instrument. In total, 95 ratings on the 17-item WEIS were obtained from a sample of clients with experience of sick leave due to different medical conditions. Rasch analysis was used to analyze the data. Overall, the WEIS items together cohered to form a single construct of increasingly challenging work environmental factors. The hierarchical ordering of the items along the continuum followed a logical and expected pattern, and the participants were validly measured by the scale. The three occupational therapists serving as raters validly used the scale, but demonstrated a relatively high rater separation index, indicating differences in rater severity. The findings provide evidence that the Swedish version of the WEIS is a psychometrically sound assessment across diagnoses and occupations, which can provide valuable information about experiences of work environment challenges.
Dimensions of insight in schizophrenia: Exploratory factor analysis of items from multiple self- and interviewer-rated measures of insight.

PubMed

Konsztowicz, Susanna; Schmitz, Norbert; Lepage, Martin

2018-03-10

Insight in schizophrenia is regarded as a multidimensional construct that comprises aspects such as awareness of the disorder and recognition of the need for treatment. The proposed number of underlying dimensions of insight is variable in the literature. In an effort to identify a range of existing dimensions of insight, we conducted a factor analysis on combined items from multiple measures of insight. We recruited 165 participants with enduring schizophrenia (treated for >3years). Exploratory factor analysis was conducted on itemized scores from two interviewer-rated measures of insight: the Schedule for the Assessment of Insight-Expanded and the abbreviated Scale to assess Unawareness of Mental Disorder; and two self-report measures: the Birchwood Insight Scale and the Beck Cognitive Insight Scale. A five-factor solution was selected as the best-fitting model, with the following dimensions of insight: 1) awareness of illness and the need for treatment; 2) awareness and attribution of symptoms and consequences; 3) self-certainty; 4) self-reflectiveness for objectivity and fallibility; and 5) self-reflectiveness for errors in reasoning and openness to feedback. Insight in schizophrenia is a multidimensional construct comprised of distinct clinical and cognitive domains of awareness. Multiple measures of insight, both clinician- and self-rated, are needed to capture all of the existing dimensions of insight. Future exploration of associations between the various dimensions and their potential determinants will facilitate the development of clinically useful models of insight and effective interventions to improve outcome. Copyright © 2018 Elsevier B.V. All rights reserved.
Assessing Technical Performance and Determining the Learning Curve in Cleft Palate Surgery Using a High-Fidelity Cleft Palate Simulator.

PubMed

Podolsky, Dale J; Fisher, David M; Wong Riff, Karen W; Szasz, Peter; Looi, Thomas; Drake, James M; Forrest, Christopher R

2018-06-01

This study assessed technical performance in cleft palate repair using a newly developed assessment tool and high-fidelity cleft palate simulator through a longitudinal simulation training exercise. Three residents performed five and one resident performed nine consecutive endoscopically recorded cleft palate repairs using a cleft palate simulator. Two fellows in pediatric plastic surgery and two expert cleft surgeons also performed recorded simulated repairs. The Cleft Palate Objective Structured Assessment of Technical Skill (CLOSATS) and end-product scales were developed to assess performance. Two blinded cleft surgeons assessed the recordings and the final repairs using the CLOSATS, end-product scale, and a previously developed global rating scale. The average procedure-specific (CLOSATS), global rating, and end-product scores increased logarithmically after each successive simulation session for the residents. Reliability of the CLOSATS (average item intraclass correlation coefficient (ICC), 0.85 ± 0.093) and global ratings (average item ICC, 0.91 ± 0.02) among the raters was high. Reliability of the end-product assessments was lower (average item ICC, 0.66 ± 0.15). Standard setting linear regression using an overall cutoff score of 7 of 10 corresponded to a pass score for the CLOSATS and the global score of 44 (maximum, 60) and 23 (maximum, 30), respectively. Using logarithmic best-fit curves, 6.3 simulation sessions are required to reach the minimum standard. A high-fidelity cleft palate simulator has been developed that improves technical performance in cleft palate repair. The simulator and technical assessment scores can be used to determine performance before operating on patients.
Constructing a question bank based on script concordance approach as a novel assessment methodology in surgical education.

PubMed

Aldekhayel, Salah A; Alselaim, Nahar A; Magzoub, Mohi Eldin; Al-Qattan, Mohammad M; Al-Namlah, Abdullah M; Tamim, Hani; Al-Khayal, Abdullah; Al-Habdan, Sultan I; Zamakhshary, Mohammed F

2012-10-24

Script Concordance Test (SCT) is a new assessment tool that reliably assesses clinical reasoning skills. Previous descriptions of developing SCT-question banks were merely subjective. This study addresses two gaps in the literature: 1) conducting the first phase of a multistep validation process of SCT in Plastic Surgery, and 2) providing an objective methodology to construct a question bank based on SCT. After developing a test blueprint, 52 test items were written. Five validation questions were developed and a validation survey was established online. Seven reviewers were asked to answer this survey. They were recruited from two countries, Saudi Arabia and Canada, to improve the test's external validity. Their ratings were transformed into percentages. Analysis was performed to compare reviewers' ratings by looking at correlations, ranges, means, medians, and overall scores. Scores of reviewers' ratings were between 76% and 95% (mean 86% ± 5). We found poor correlations between reviewers (Pearson's: +0.38 to -0.22). Ratings of individual validation questions ranged between 0 and 4 (on a scale 1-5). Means and medians of these ranges were computed for each test item (mean: 0.8 to 2.4; median: 1 to 3). A subset of test items comprising 27 items was generated based on a set of inclusion and exclusion criteria. This study proposes an objective methodology for validation of SCT-question bank. Analysis of validation survey is done from all angles, i.e., reviewers, validation questions, and test items. Finally, a subset of test items is generated based on a set of criteria.
Item Response Theory Models for Wording Effects in Mixed-Format Scales

ERIC Educational Resources Information Center

Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu

2015-01-01

Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…
Development and validation of a socioculturally competent trust in physician scale for a developing country setting.

PubMed

Gopichandran, Vijayaprasad; Wouters, Edwin; Chetlapalli, Satish Kumar

2015-05-03

Trust in physicians is the unwritten covenant between the patient and the physician that the physician will do what is in the best interest of the patient. This forms the undercurrent of all healthcare relationships. Several scales exist for assessment of trust in physicians in developed healthcare settings, but to our knowledge none of these have been developed in a developing country context. To develop and validate a new trust in physician scale for a developing country setting. Dimensions of trust in physicians, which were identified in a previous qualitative study in the same setting, were used to develop a scale. This scale was administered among 616 adults selected from urban and rural areas of Tamil Nadu, south India, using a multistage sampling cross sectional survey method. The individual items were analysed using a classical test approach as well as item response theory. Cronbach's α was calculated and the item to total correlation of each item was assessed. After testing for unidimensionality and absence of local dependence, a 2 parameter logistic Semajima's graded response model was fit and item characteristics assessed. Competence, assurance of treatment, respect for the physician and loyalty to the physician were important dimensions of trust. A total of 31 items were developed using these dimensions. Of these, 22 were selected for final analysis. The Cronbach's α was 0.928. The item to total correlations were acceptable for all the 22 items. The item response analysis revealed good item characteristic curves and item information for all the items. Based on the item parameters and item information, a final 12 item scale was developed. The scale performs optimally in the low to moderate trust range. The final 12 item trust in physician scale has a good construct validity and internal consistency. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Development and validation of a socioculturally competent trust in physician scale for a developing country setting

PubMed Central

Gopichandran, Vijayaprasad; Wouters, Edwin; Chetlapalli, Satish Kumar

2015-01-01

Trust in physicians is the unwritten covenant between the patient and the physician that the physician will do what is in the best interest of the patient. This forms the undercurrent of all healthcare relationships. Several scales exist for assessment of trust in physicians in developed healthcare settings, but to our knowledge none of these have been developed in a developing country context. Objectives To develop and validate a new trust in physician scale for a developing country setting. Methods Dimensions of trust in physicians, which were identified in a previous qualitative study in the same setting, were used to develop a scale. This scale was administered among 616 adults selected from urban and rural areas of Tamil Nadu, south India, using a multistage sampling cross sectional survey method. The individual items were analysed using a classical test approach as well as item response theory. Cronbach's α was calculated and the item to total correlation of each item was assessed. After testing for unidimensionality and absence of local dependence, a 2 parameter logistic Semajima's graded response model was fit and item characteristics assessed. Results Competence, assurance of treatment, respect for the physician and loyalty to the physician were important dimensions of trust. A total of 31 items were developed using these dimensions. Of these, 22 were selected for final analysis. The Cronbach's α was 0.928. The item to total correlations were acceptable for all the 22 items. The item response analysis revealed good item characteristic curves and item information for all the items. Based on the item parameters and item information, a final 12 item scale was developed. The scale performs optimally in the low to moderate trust range. Conclusions The final 12 item trust in physician scale has a good construct validity and internal consistency. PMID:25941182

The Validity of the Teacher Burnout Scale for Use with Special Education Teachers

ERIC Educational Resources Information Center

Cook, Bradley Caro

2012-01-01

Unique stressors can cause special education teachers to experience burnout at twice the rate of their peers in general education. The purpose of this study was to determine if the Teacher Burnout Scale (TBS) is able to accurately predict burnout in special education teachers even though it does not include items that reflect the unique factors…
The Bedford Alzheimer nursing-severity scale to assess dementia severity in advanced dementia: a nonparametric item response analysis and a study of its psychometric characteristics.

PubMed

Galindo-Garre, Francisca; Hendriks, Simone A; Volicer, Ladislav; Smalbrugge, Martin; Hertogh, Cees M P M; van der Steen, Jenny T

2014-02-01

The Bedford Alzheimer Nursing-Severity Scale (BANS-S) assesses disease severity in patients with advanced Alzheimer's disease. Since Alzheimer is a progressive disease, studying the hierarchy of the items in the scale can be useful to evaluate the progression of the disease. Data from 164 Alzheimer's patients and 186 patients with other dementia were analyzed using the Mokken Scaling Methodology to determine whether respondents can be ordered in the trait dementia severity, and to study whether an ordering between the items exist. The scalability of the scale was evaluated by the H coefficient. Results showed that the BANS-S is a reliable and medium scale (0.4≤H<0.5) for the Alzheimer group. All items with the exception of the item about mobility could be ordered. When later item was eliminated from the scale, the H coefficient decreased indicating that the scalability of the scale in the original form is more accurate than in the shorter version. For the other dementia group, the BANS-S did not fit any of the Mokken Scaling models because the scale was not unidimensional. In this group, a shorter version of the scale without the sleeping cycle item and the mobility item has better reliability and scalability properties than the original scale.
Farsi version of social skills rating system-secondary student form: cultural adaptation, reliability and construct validity.

PubMed

Eslami, Ahmad Ali; Amidi Mazaheri, Maryam; Mostafavi, Firoozeh; Abbasi, Mohamad Hadi; Noroozi, Ensieh

2014-01-01

Assessment of social skills is a necessary requirement to develop and evaluate the effectiveness of cognitive and behavioral interventions. This paper reports the cultural adaptation and psychometric properties of the Farsi version of the social skills rating system-secondary students form (SSRS-SS) questionnaire (Gresham and Elliot, 1990), in a normative sample of secondary school students. A two-phase design was used that phase 1 consisted of the linguistic adaptation and in phase 2, using cross-sectional sample survey data, the construct validity and reliability of the Farsi version of the SSRS-SS were examined in a sample of 724 adolescents aged from 13 to 19 years. Content validity index was excellent, and the floor/ceiling effects were low. After deleting five of the original SSRS-SS items, the findings gave support for the item convergent and divergent validity. Factor analysis revealed four subscales. RESULTS showed good internal consistency (0.89) and temporal stability (0.91) for the total scale score. Findings demonstrated support for the use of the 27-item Farsi version in the school setting. Directions for future research regarding the applicability of the scale in other settings and populations of adolescents are discussed.
Systematic content evaluation and review of measurement properties of questionnaires for measuring self-reported fatigue among older people.

PubMed

Egerton, Thorlene; Riphagen, Ingrid I; Nygård, Arnhild J; Thingstad, Pernille; Helbostad, Jorunn L

2015-09-01

The assessment of fatigue in older people requires simple and user-friendly questionnaires that capture the phenomenon, yet are free from items indistinguishable from other disorders and experiences. This study aimed to evaluate the content, and systematically review and rate the measurement properties of self-report questionnaires for measuring fatigue, in order to identify the most suitable questionnaires for older people. This study firstly involved identification of questionnaires that purport to measure self-reported fatigue, and evaluation of the content using a rating scale developed for the purpose from contemporary understanding of the construct. Secondly, for the questionnaires that had acceptable content, we identified studies reporting measurement properties and rated the methodological quality of those studies according to the COSMIN system. Finally, we extracted and synthesised the results of the studies to give an overall rating for each questionnaire for each measurement property. The protocol was registered with PROSPERO (CRD42013005589). Of the 77 identified questionnaires, twelve were selected for review after content evaluation. Methodological quality varied, and there was a lack of information on measurement error and responsiveness. The PROMIS-Fatigue item bank and short forms perform the best. The FACIT-Fatigue scale, Parkinsons Fatigue Scale, Perform Questionnaire, and Uni-dimensional Fatigue Impact Scale also perform well and can be recommended. Minor modifications to improve performance are suggested. Further evaluation of unresolved measurement properties, particularly with samples including older people, is needed for all the recommended questionnaires.
Influence of Labeling on Ratings of Infants: A Prematurity Prejudice.

ERIC Educational Resources Information Center

Miller, Michael D.; Ottinger, Donald R.

Two full term and two preterm infants were videotaped while being administered six items from the Brazelton Scale. Infants were assigned alternately the labels "preterm" and "fullterm" and shown to a group of 256 undergraduate students. It was hypothesized that: (1) subjects who view infants labeled as preterm would rate them lower on objective…
Formative Assessment Using Direct Behavior Ratings: Evaluating Intervention Effects of Daily Behavior Report Cards

ERIC Educational Resources Information Center

Sims, Wesley A.; Riley-Tillman, Chris; Cohen, Daniel R.

2017-01-01

This study examined the treatment sensitivity of "Direct Behavior Rating-Single Item Scales" (DBR-SIS) in response to an evidence-based intervention delivered in a single-case, multiple-baseline design. DBR-SIS was used as a formative assessment in conjunction with a frequently used intervention in schools, a Daily Behavior Report Card…
Readability and Comprehension of the Geriatric Depression Scale and PROMIS® Physical Function Items in Older African Americans and Latinos.

PubMed

Paz, Sylvia H; Jones, Loretta; Calderón, José L; Hays, Ron D

2017-02-01

Depression and physical function are particularly important health domains for the elderly. The Geriatric Depression Scale (GDS) and the Patient-Reported Outcomes Measurement Information System (PROMIS ® ) physical function item bank are two surveys commonly used to measure these domains. It is unclear if these two instruments adequately measure these aspects of health in minority elderly. The aim of this study was to estimate the readability of the GDS and PROMIS ® physical function items and to assess their comprehensibility using a sample of African American and Latino elderly. Readability was estimated using the Flesch-Kincaid and Flesch Reading Ease (FRE) formulae for English versions, and a Spanish adaptation of the FRE formula for the Spanish versions. Comprehension of the GDS and PROMIS ® items by minority elderly was evaluated with 30 cognitive interviews. Readability estimates of a number of items in English and Spanish of the GDS and PROMIS ® physical functioning items exceed the U.S. recommended 5th-grade threshold for vulnerable populations, or were rated as 'fairly difficult', 'difficult', or 'very difficult' to read. Cognitive interviews revealed that many participants felt that more than the two (yes/no) GDS response options were needed to answer the questions. Wording of several PROMIS ® items was considered confusing, and interpreting responses was problematic because they were based on using physical aids. Problems with item wording and response options of the GDS and PROMIS ® physical function items may reduce reliability and validity of measurement when used with minority elderly.
Refining and validating the Social Interaction Anxiety Scale and the Social Phobia Scale.

PubMed

Carleton, R Nicholas; Collimore, Kelsey C; Asmundson, Gordon J G; McCabe, Randi E; Rowa, Karen; Antony, Martin M

2009-01-01

The Social Interaction Anxiety Scale and Social Phobia Scale are companion measures for assessing symptoms of social anxiety and social phobia. The scales have good reliability and validity across several samples, however, exploratory and confirmatory factor analyses have yielded solutions comprising substantially different item content and factor structures. These discrepancies are likely the result of analyzing items from each scale separately or simultaneously. The current investigation sets out to assess items from those scales, both simultaneously and separately, using exploratory and confirmatory factor analyses in an effort to resolve the factor structure. Participants consisted of a clinical sample (n 5353; 54% women) and an undergraduate sample (n 5317; 75% women) who completed the Social Interaction Anxiety Scale and Social Phobia Scale, along with additional fear-related measures to assess convergent and discriminant validity. A three-factor solution with a reduced set of items was found to be most stable, irrespective of whether the items from each scale are assessed together or separately. Items from the Social Interaction Anxiety Scale represented one factor, whereas items from the Social Phobia Scale represented two other factors. Initial support for scale and factor validity, along with implications and recommendations for future research, is provided. (c) 2009 Wiley-Liss, Inc.
Validation of a 4-item Negative Symptom Assessment (NSA-4): a short, practical clinical tool for the assessment of negative symptoms in schizophrenia.

PubMed

Alphs, Larry; Morlock, Robert; Coon, Cheryl; Cazorla, Pilar; Szegedi, Armin; Panagides, John

2011-06-01

The 16-item Negative Symptom Assessment (NSA-16) scale is a validated tool for evaluating negative symptoms of schizophrenia. The psychometric properties and predictive power of a four-item version (NSA-4) were compared with the NSA-16. Baseline data from 561 patients with predominant negative symptoms of schizophrenia who participated in two identically designed clinical trials were evaluated. Ordered logistic regression analysis of ratings using NSA-4 and NSA-16 were compared with ratings using several other standard tools to determine predictive validity and construct validity. Internal consistency and test--retest reliability were also analyzed. NSA-16 and NSA-4 scores were both predictive of scores on the NSA global rating (odds ratio = 0.83-0.86) and the Clinical Global Impressions--Severity scale (odds ratio = 0.91-0.93). NSA-16 and NSA-4 showed high correlation with each other (Pearson r = 0.85), similar high correlation with other measures of negative symptoms (demonstrating convergent validity), and lesser correlations with measures of other forms of psychopathology (demonstrating divergent validity). NSA-16 and NSA-4 both showed acceptable internal consistency (Cronbach α, 0.85 and 0.64, respectively) and test--retest reliability (intraclass correlation coefficient, 0.87 and 0.82). This study demonstrates that NSA-4 offers accuracy comparable to the NSA-16 in rating negative symptoms in patients with schizophrenia. Copyright © 2011 John Wiley & Sons, Ltd.
An Evaluation of the Quick Inventory of Depressive Symptomatology and the Hamilton Rating Scale for Depression: A Sequenced Treatment Alternatives to Relieve Depression Trial Report

PubMed Central

Rush, A. John; Bernstein, Ira H.; Trivedi, Madhukar H.; Carmody, Thomas J.; Wisniewski, Stephen; Mundt, James C.; Shores-Wilson, Kathy; Biggs, Melanie M.; Woo, Ada; Nierenberg, Andrew A.; Fava, Maurizio

2010-01-01

Background Nine DSM-IV-TR criterion symptom domains are evaluated to diagnose major depressive disorder (MDD). The Quick Inventory of Depressive Symptomatology (QIDS) provides an efficient assessment of these domains and is available as a clinician rating (QIDS-C16), a self-report (QIDS-SR16), and in an automated, interactive voice response (IVR) (QIDS-IVR16) telephone system. This report compares the performance of these three versions of the QIDS and the 17-item Hamilton Rating Scale for Depression (HRSD17). Methods Data were acquired at baseline and exit from the first treatment step (citalopram) in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) trial. Outpatients with nonpsychotic MDD who completed all four ratings within ±2 days were identified from the first 1500 STAR*D subjects. Both item response theory and classical test theory analyses were conducted. Results The three methods for obtaining QIDS data produced consistent findings regarding relationships between the nine symptom domains and overall depression, demonstrating interchangeability among the three methods. The HRSD17, while generally satisfactory, rarely utilized the full range of item scores, and evidence suggested multidimensional measurement properties. Conclusions In nonpsychotic MDD outpatients without overt cognitive impairment, clinician assessment of depression severity using either the QIDS-C16 or HRSD17 may be successfully replaced by either the self-report or IVR version of the QIDS. PMID:16199008
Reliability of the Client-Centeredness of Goal Setting (C-COGS) Scale in Acquired Brain Injury Rehabilitation.

PubMed

Doig, Emmah; Prescott, Sarah; Fleming, Jennifer; Cornwell, Petrea; Kuipers, Pim

2016-01-01

To examine the internal reliability and test-retest reliability of the Client-Centeredness of Goal Setting (C-COGS) scale. The C-COGS scale was administered to 42 participants with acquired brain injury after completion of multidisciplinary goal planning. Internal reliability of scale items was examined using item-partial total correlations and Cronbach's α coefficient. The scale was readministered within a 1-mo period to a subsample of 12 participants to examine test-retest reliability by calculating exact and close percentage agreement for each item. After examination of item-partial total correlations, test items were revised. The revised items demonstrated stronger internal consistency than the original items. Preliminary evaluation of test-retest reliability was fair, with an average exact percent agreement across all test items of 67%. Findings support the preliminary reliability of the C-COGS scale as a tool to evaluate and promote client-centered goal planning in brain injury rehabilitation. Copyright © 2016 by the American Occupational Therapy Association, Inc.
Parental perceptions of children's oral health: The Early Childhood Oral Health Impact Scale (ECOHIS)

PubMed Central

Pahel, Bhavna Talekar; Rozier, R Gary; Slade, Gary D

2007-01-01

Background Dental disease and treatment experience can negatively affect the oral health related quality of life (OHRQL) of preschool aged children and their caregivers. Currently no valid and reliable instrument is available to measure these negative influences in very young children. The objective of this research was to develop the Early Childhood Oral Health Impact Scale (ECOHIS) to measure the OHRQL of preschool children and their families. Methods Twenty-two health professionals evaluated a pool of 45 items that assess the impact of oral health problems on 6-14-year-old children and their families. The health professionals identified 36 items as relevant to preschool children. Thirty parents rated the importance of these 36 items to preschool children; 13 (9 child and 4 family) items were considered important. The 13-item ECOHIS was administered to 295 parents of 5-year-old children to assess construct validity and internal consistency reliability (using Cronbach's alpha). Test-retest reliability was evaluated among another sample of parents (N = 46) using the intraclass correlation coefficient (ICC). Results ECOHIS scores on the child and parent sections indicating worse quality of life were significantly associated with fair or poor parental ratings of their child's general and oral health, and the presence of dental disease in the child. Cronbach's alphas for the child and family sections were 0.91 and 0.95 respectively, and the ICC for test-retest reliability was 0.84. Conclusion The ECOHIS performed well in assessing OHRQL among children and their families. Studies in other populations are needed to further establish the instrument's technical properties. PMID:17263880
The inter-rater reliability test of the modified Morse Fall Scale among patients ≥ 55 years old in an acute care hospital in Singapore.

PubMed

Tang, Wing Sze; Chow, Yeow Leng; Koh, Serena Siew Lin

2014-02-01

A prospective, descriptive study was conducted in an acute care hospital in Singapore to determine the inter-rater reliability of the modified Morse Fall Scale by evaluating the degrees of agreement on the ratings of the individual items and overall score between the 'gold standard' assessor and the facility assessors. One hundred and forty-two subjects were recruited during the 1.5 month data collection period. The simple and weighted κ-values were all > 0.8 except for the item 'effects of medications' (κ and κw = 0.63), and the correlation coefficient (rs = 0.89) was significantly high at a significance level of < 0.001. The modified Morse Fall Scale was shown to be a reliable fall risk assessment tool having a relative high inter-rater reliability level for the overall score and individual items. This study provides evidence-based psychometric support for the clinical application of this tool. © 2013 Wiley Publishing Asia Pty Ltd.
The German VR Simulation Realism Scale--psychometric construction for virtual reality applications with virtual humans.

PubMed

Poeschl, Sandra; Doering, Nicola

2013-01-01

Virtual training applications with high levels of immersion or fidelity (for example for social phobia treatment) produce high levels of presence and therefore belong to the most successful Virtual Reality developments. Whereas display and interaction fidelity (as sub-dimensions of immersion) and their influence on presence are well researched, realism of the displayed simulation depends on the specific application and is therefore difficult to measure. We propose to measure simulation realism by using a self-report questionnaire. The German VR Simulation Realism Scale for VR training applications was developed based on a translation of scene realism items from the Witmer-Singer-Presence Questionnaire. Items for realism of virtual humans (for example for social phobia training applications) were supplemented. A sample of N = 151 students rated simulation realism of a Fear of Public Speaking application. Four factors were derived by item- and principle component analysis (Varimax rotation), representing Scene Realism, Audience Behavior, Audience Appearance and Sound Realism. The scale developed can be used as a starting point for future research and measurement of simulation realism for applications including virtual humans.
LC-PROM: Validation of a patient reported outcomes measure for liver cirrhosis patients.

PubMed

Zhang, Ying; Yang, Yuanyuan; Lv, Jing; Zhang, Yanbo

2016-05-10

The aim of the study is to develop a specific patient-reported scale of liver cirrhosis according to the Patient Reported Outcome guidelines of the Food and Drug Administration (FDA), and to examine its capacity to fill gaps in this field. A conceptual framework was developed and a preliminary item pool developed through literature review and interviews of 10 patients with liver cirrhosis. With the preliminary items, we performed a pilot survey that included a cognitive test with patients and interviews with experts; the focus was on content and language of the scale. In the item selection stage, seven statistical methods including discrete trends method, discrimination analysis, exploratory factor analysis, Cronbach's α coefficient, correlation coefficient, test-retest reliability, Item-Response Theory were applied to survey data from 200 subjects (150 liver cirrhosis patients and 50 controls). This produced the preliminary Liver Cirrhosis Patient-reported Outcome Measure (LC-PROM). In the next stage, we conducted the survey with 620 subjects (500 patients and 120 controls) to validate reliability, validity and acceptability of this scale. The 55 items and 13 dimensions addressed four domains: physical, psychological, social, and therapeutic. Cronbach's α coefficients were 0.921 for the total scale; the confirmatory factor analysis, t-tests and ANOVA supported scale validity; the model fit index as Root Mean Square Error of Approximation (RMSEA), Root Mean Square Residual (RMR), Normed Fit Index (NFI), Non-Normed Fit Index (NNFI), Comparative Fit Index (CFI) and Incremental Fit Index (IFI) met the criterion generally. The acceptance ratio and response rate indicated good feasibility. This study developed an accurate and stable patient-reported outcome scale of liver cirrhosis, which is able to evaluate clinical effects effectively, is helpful to patients in recognizing their health condition, and contributes to clinical decision making both for patients and physicians. Additionally, the LC-PROM can perform as an ultimate assessment of medical and health care effects and can inform clinical trials of new drugs for liver cirrhosis.
A family-specific use of the Measure of Processes of Care for Service Providers (MPOC-SP).

PubMed

Siebes, R C; Nijhuis, B J G; Boonstra, A M; Ketelaar, M; Wijnroks, L; Reinders-Messelink, H A; Postema, K; Vermeer, A

2008-03-01

To examine the validity and utility of the Dutch Measure of Processes of Care for Service Providers (MPOC-SP) as a family-specific measure. A validation study. Five paediatric rehabilitation settings in the Netherlands. The MPOC-SP was utilized in a general (reflecting on services provided for all clients and clients' families) and family-specific way (filled out in reference to a particular child and his or her family). Professionals providing rehabilitation and educational services to children with cerebral palsy. For construct validity, Pearson's product-moment correlation coefficients (r ) between the scales were calculated. The ability of service providers to discriminate between general and family-specific ratings was examined by exploration of absolute difference scores. One hundred and sixteen service professionals filled out 240 family-specific MPOC-SPs. In addition, a subgroup of 81 professionals filled out a general MPOC-SP. For each professional, family-specific and general scores were paired, resulting in 151 general-family-specific MPOC-SP pairs. The construct validity analyses confirmed the scale structure: 21 items (77.8%) loaded highest in the original MPOC-SP factors, and all items correlated best and significantly with their own scale score (r 0.565 to 0.897; P<0.001). Intercorrelations between the scales ranged from r = 0.159 to r = 0.522. In total, 94.4% of the mean absolute difference scores between general and family-specific scale scores were larger than the expected difference. Service providers were able to discriminate between general and family-specific MPOC-SP item ratings. The family-specific MPOC-SP is a valid measure that can be used for individual evaluation of family-centred services and can be the impetus for family-related quality improvement.
[Psychometric attributes of Scales for Outcomes in Parkinson's Disease-Cognition (SCOPA-Cog), Castilian language].

PubMed

Martínez-Martín, P; Frades-Payo, B; Rodríguez-Blázquez, C; Forjaz, M J; de Pedro-Cuesta, J

To test the psychometric attributes of the Scales for Outcomes in Parkinson's Disease-Cognition (SCOPA-Cog), in Castilian language. It is a multicenter, cross-sectional study carried out on 387 Parkinson's disease (PD) patients. They were 70% in Hoehn and Yahr stages 2 or 3; their mean age was 65,8 years and they underwent the disease for 8,1 years. Rater-based -SCOPA-Motor, modified Parkinson's Psychosis Rating Scale, Clinical Impression of Severity Index for PD (CISI-PD), Cumulative Illness Rating Scale-Geriatrics- and self-administered -SCOPA-Autonomic, SCOPA-Sleep, SCOPA-Psychosocial, Hospital Anxiety and Depression Scale, EuroQoL- assessments were applied. For SCOPA-Cog, the following psychometric attributes were analysed: acceptability, internal consistency, dimensionality, construct validity, and precision. A cut-off point for dementia and SCOPA-Cog score's predictors were explored. SCOPA-Cog was free from floor and ceiling effect. The internal consistency was satisfactory (alpha = 0,83) and the item-total correlation resulted equal or upper than 0,45. Two factors were identified (52% of variance), one of them formed by 3 out of the 4 memory-related items. The correlation with other measures was weak (rS < 0,35), except for the CISI-PD's item 'cognitive state' (rS = 0,51). SCOPA-Cog scored significantly different for Hoehn and Yahr stages and for patients grouped by age, age at onset of PD, and education. The standard error of measurement was 3,02. A cut-off point 19/20 reached 76% sensitivity and specificity for dementia. Age and age at onset of PD resulted the strongest predictors. SCOPA-Cog is a consistent, valid, and precise measure for assessment of the cognitive disorder in PD.
Development and Validation of Triarchic Construct Scales from the Psychopathic Personality Inventory

PubMed Central

Hall, Jason R.; Drislane, Laura E.; Patrick, Christopher J.; Morano, Mario; Lilienfeld, Scott O.; Poythress, Norman G.

2014-01-01

The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy—the Psychopathic Personality Inventory (PPI). A consensus rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and non-offender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy. PMID:24447280
Development and validation of Triarchic construct scales from the psychopathic personality inventory.

PubMed

Hall, Jason R; Drislane, Laura E; Patrick, Christopher J; Morano, Mario; Lilienfeld, Scott O; Poythress, Norman G

2014-06-01

The Triarchic model of psychopathy describes this complex condition in terms of distinct phenotypic components of boldness, meanness, and disinhibition. Brief self-report scales designed specifically to index these psychopathy facets have thus far demonstrated promising construct validity. The present study sought to develop and validate scales for assessing facets of the Triarchic model using items from a well-validated existing measure of psychopathy-the Psychopathic Personality Inventory (PPI). A consensus-rating approach was used to identify PPI items relevant to each Triarchic facet, and the convergent and discriminant validity of the resulting PPI-based Triarchic scales were evaluated in relation to multiple criterion variables (i.e., other psychopathy inventories, antisocial personality disorder features, personality traits, psychosocial functioning) in offender and nonoffender samples. The PPI-based Triarchic scales showed good internal consistency and related to criterion variables in ways consistent with predictions based on the Triarchic model. Findings are discussed in terms of implications for conceptualization and assessment of psychopathy.
Development of the Abbreviated Masculine Gender Role Stress Scale

PubMed Central

Swartout, Kevin M.; Parrott, Dominic J.; Cohn, Amy M.; Hagman, Brett T.; Gallagher, Kathryn E.

2014-01-01

Data gathered from six independent samples (n = 1,729) that assessed men’s masculine gender role stress in college and community males were aggregated used to determine the reliability and validity of an abbreviated version of the Masculine Gender Role Stress Scale (MGRS scale). The 15 items with the highest item-to-total scale correlations were used to create an abbreviated MGRS scale. Psychometric properties of each of the 15-items were examined with Item Response Theory (IRT) analysis, using the discrimination and threshold parameters. IRT results showed that the abbreviated scale may hold promise at capturing the same amount of information as the full 40-item scale. Relative to the 40-item scale, the total score of the abbreviated MGRS scale demonstrated comparable convergent validity using the measurement domains of masculine identity, hyper-masculinity, trait anger, anger expression, and alcohol involvement. An abbreviated MGRS scale may be recommended for use in clinical practice and research settings to reduce cost, time, and patient/participant burden. Additionally, IRT analyses identified items with higher discrimination and threshold parameters that may be used to screen for problematic gender role stress in men who may be seen in routine clinical or medical practice. PMID:25528163

Some links on this page may take you to non-federal websites. Their policies may differ from this site.